Skip to main content
eScholarship
Open Access Publications from the University of California

School of Medicine

Department of Pediatrics - Open Access Policy Deposits bannerUC San Diego

This series is automatically populated with publications deposited by UC San Diego School of Medicine Department of Pediatrics researchers in accordance with the University of California’s open access policies. For more information see Open Access Policy Deposits and the UC Publication Management System.

Genetic analysis of elevated levels of creatinine and cystatin C biomarkers reveals novel genetic loci associated with kidney function

(2025)

The rising prevalence of chronic kidney disease (CKD), affecting an estimated 37 million adults in the United States, presents a significant global health challenge. CKD is typically assessed using estimated Glomerular Filtration Rate (eGFR), which incorporates serum levels of biomarkers such as creatinine and cystatin C. However, these biomarkers do not directly measure kidney function; their elevation in CKD results from diminished glomerular filtration. Genome-wide association studies (GWAS) based on eGFR formulas using creatinine (eGFRcre) or cystatin C (eGFRcys) have identified distinct non-overlapping loci, raising questions about whether these loci govern kidney function or biomarker metabolism. In this study, we show that GWAS on creatinine and cystatin C levels in healthy individuals reveal both nonoverlapping genetic loci impacting their metabolism as well as overlapping genetic loci associated with kidney function; whereas GWAS on elevated levels of these biomarkers uncover novel loci primarily associated with kidney function in CKD patients.

Cover page of RUNX2 promotes fibrosis via an alveolar-to-pathological fibroblast transition

RUNX2 promotes fibrosis via an alveolar-to-pathological fibroblast transition

(2025)

A hallmark of pulmonary fibrosis is the aberrant activation of lung fibroblasts into pathological fibroblasts that produce excessive extracellular matrix1-3. Thus, the identification of key regulators that promote the generation of pathological fibroblasts can inform the development of effective countermeasures against disease progression. Here we use two mouse models of pulmonary fibrosis to show that LEPR+ fibroblasts that arise during alveologenesis include SCUBE2+ alveolar fibroblasts as a major constituent. These alveolar fibroblasts in turn contribute substantially to CTHRC1+POSTN+ pathological fibroblasts. Genetic ablation of POSTN+ pathological fibroblasts attenuates fibrosis. Comprehensive analyses of scRNA-seq and scATAC-seq data reveal that RUNX2 is a key regulator of the expression of fibrotic genes. Consistently, conditional deletion of Runx2 with LeprcreERT2 or Scube2creERT2 reduces the generation of pathological fibroblasts, extracellular matrix deposition and pulmonary fibrosis. Therefore, LEPR+ cells that include SCUBE2+ alveolar fibroblasts are a key source of pathological fibroblasts, and targeting Runx2 provides a potential treatment option for pulmonary fibrosis.

Cover page of Ethnic and racial differences in children and young people with respiratory and neurological post-acute sequelae of SARS-CoV-2: an electronic health record-based cohort study from the RECOVER Initiative.

Ethnic and racial differences in children and young people with respiratory and neurological post-acute sequelae of SARS-CoV-2: an electronic health record-based cohort study from the RECOVER Initiative.

(2025)

BACKGROUND: Children from racial and ethnic minority groups are at greater risk for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection, but it is unclear whether they have increased risk for post-acute sequelae of SARS-CoV-2 (PASC). Our objectives were to assess whether the risk of respiratory and neurologic PASC differs by race/ethnicity and social drivers of health. METHODS: We conducted a retrospective cohort study of individuals <21 years seeking care at 24 health systems across the U.S, using electronic health record (EHR) data. Our cohort included those with a positive SARS-CoV-2 molecular, serology or antigen test, or with a COVID-19, multisystem inflammatory disease in children, or PASC diagnosis from February 29, 2020 to August 1, 2022. We identified children/youth with at least 2 codes associated with respiratory and neurologic PASC. We measured associations between sociodemographic and clinical characteristics and respiratory and neurologic PASC using odds ratios and 95% confidence intervals estimated from multivariable logistic regression models adjusted for other sociodemographic characteristics, social vulnerability index or area deprivation index, time period of cohort entry, presence and complexity of chronic respiratory (respectively, neurologic) condition and healthcare utilization. FINDINGS: Among 771,725 children in the cohort, 203,365 (26.3%) had SARS-CoV-2 infection. Among children with documented infection, 3217 children had respiratory PASC and 2009 children/youth had neurologic PASC. In logistic regression models, children <5 years (Odds Ratio [OR] 1.78, 95% CI 1.62-1.97), and of Hispanic White descent (OR 1.19, 95% CI 1.05-1.35) had higher odds of having respiratory PASC. Children/youth living in regions with higher area deprivation indices (OR 1.25, 95% CI 1.10-1.420 for 60-79th percentile) and with chronic complex respiratory conditions (OR 3.28, 95% CI 2.91-3.70) also had higher odds of respiratory PASC. In contrast, older (OR 1.57, 95% CI 1.40-1.77 for those aged 12-17 years), non-Hispanic White individuals and those with chronic pre-existing neurologic conditions (OR 2.04, 95% CI 1.78-2.35) were more likely to have a neurologic PASC diagnosis. INTERPRETATION: Racial and ethnic differences in healthcare utilization for neurologic and respiratory PASC may reflect social drivers of health and inequities in access to care. FUNDING: National Institutes of Health.

Cover page of Ethnic and racial differences in children and young people with&nbsp;respiratory and neurological post-acute sequelae of SARS-CoV-2: an electronic health record-based cohort study from the RECOVER Initiative

Ethnic and racial differences in children and young people with respiratory and neurological post-acute sequelae of SARS-CoV-2: an electronic health record-based cohort study from the RECOVER Initiative

(2025)

Background

Children from racial and ethnic minority groups are at greater risk for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection, but it is unclear whether they have increased risk for post-acute sequelae of SARS-CoV-2 (PASC). Our objectives were to assess whether the risk of respiratory and neurologic PASC differs by race/ethnicity and social drivers of health.

Methods

We conducted a retrospective cohort study of individuals <21 years seeking care at 24 health systems across the U.S, using electronic health record (EHR) data. Our cohort included those with a positive SARS-CoV-2 molecular, serology or antigen test, or with a COVID-19, multisystem inflammatory disease in children, or PASC diagnosis from February 29, 2020 to August 1, 2022. We identified children/youth with at least 2 codes associated with respiratory and neurologic PASC. We measured associations between sociodemographic and clinical characteristics and respiratory and neurologic PASC using odds ratios and 95% confidence intervals estimated from multivariable logistic regression models adjusted for other sociodemographic characteristics, social vulnerability index or area deprivation index, time period of cohort entry, presence and complexity of chronic respiratory (respectively, neurologic) condition and healthcare utilization.

Findings

Among 771,725 children in the cohort, 203,365 (26.3%) had SARS-CoV-2 infection. Among children with documented infection, 3217 children had respiratory PASC and 2009 children/youth had neurologic PASC. In logistic regression models, children <5 years (Odds Ratio [OR] 1.78, 95% CI 1.62-1.97), and of Hispanic White descent (OR 1.19, 95% CI 1.05-1.35) had higher odds of having respiratory PASC. Children/youth living in regions with higher area deprivation indices (OR 1.25, 95% CI 1.10-1.420 for 60-79th percentile) and with chronic complex respiratory conditions (OR 3.28, 95% CI 2.91-3.70) also had higher odds of respiratory PASC. In contrast, older (OR 1.57, 95% CI 1.40-1.77 for those aged 12-17 years), non-Hispanic White individuals and those with chronic pre-existing neurologic conditions (OR 2.04, 95% CI 1.78-2.35) were more likely to have a neurologic PASC diagnosis.

Interpretation

Racial and ethnic differences in healthcare utilization for neurologic and respiratory PASC may reflect social drivers of health and inequities in access to care.

Funding

National Institutes of Health.

Cover page of Genetic Discovery and Risk Prediction for Type 1 Diabetes in Individuals Without High-Risk HLA-DR3/DR4 Haplotypes.

Genetic Discovery and Risk Prediction for Type 1 Diabetes in Individuals Without High-Risk HLA-DR3/DR4 Haplotypes.

(2025)

OBJECTIVE: More than 10% of patients with type 1 diabetes (T1D) do not have high-risk HLA-DR3 or -DR4 haplotypes with distinct clinical features, such as later onset and reduced insulin dependence. We aimed to identify genetic drivers of T1D in the absence of DR3/DR4 and improve prediction of T1D risk in these individuals. RESEARCH DESIGN AND METHODS: We performed T1D association and fine-mapping analyses in 12,316 non-DR3/DR4 samples. Next, we performed heterogeneity tests to examine differences in T1D risk variants in individuals without versus those with DR3/DR4 haplotypes. We further assessed genome-wide differences in gene regulatory element and biological pathway enrichments between the non-DR3/DR4 and DR3/DR4 cohorts. Finally, we developed a genetic risk score (GRS) to predict T1D in individuals without DR3/DR4 and compared with an existing T1D GRS. RESULTS: A total of 18 T1D risk variants in non-DR3/DR4 samples were identified. Risk variants at the MHC and multiple other loci genome wide had heterogeneity in effects on T1D dependent on DR3/DR4 status, and non-DR3/DR4 T1D had evidence for a greater polygenic burden. T1D-associated variants in non-DR3/DR4 were more enriched for regulatory elements and pathways involved in antigen presentation, innate immunity, and β-cells and depleted in T cells compared with DR3/DR4. A non-DR3/DR4 GRS outperformed an existing risk score GRS2 in discriminating non-DR3/DR4 T1D from no diabetes (area under the curve 0.867; P = 7.48 × 10-32) and type 2 diabetes (0.907; P = 4.94 × 10-44). CONCLUSIONS: In total, we identified heterogeneity in T1D genetic risk dependent on high-risk HLA-DR3/DR4 haplotype, which uncovers disease mechanisms and enables more accurate prediction of T1D across the HLA background.

Cover page of Decomposition of the pangenome matrix reveals a structure in gene distribution in the Escherichia coli species.

Decomposition of the pangenome matrix reveals a structure in gene distribution in the Escherichia coli species.

(2025)

UNLABELLED: Thousands of complete genome sequences for strains of a species that are now available enable the advancement of pangenome analytics to a new level of sophistication. We collected 2,377 publicly available complete genomes of Escherichia coli for detailed pangenome analysis. The core genome and accessory genomes consisted of 2,398 and 5,182 genes, respectively. We developed a machine learning approach to define the accessory genes characterizing the major phylogroups of E. coli plus Shigella: A, B1, B2, C, D, E, F, G, and Shigella. The analysis resulted in a detailed structure of the genetic basis of the phylogroups differential traits. This pangenome structure was largely consistent with a housekeeping-gene-based MLST distribution, sequence-based Mash distance, and the Clermont quadruplex classification. The rare genome (consisting of genes found in <6.8% of all strains) consisted of 163,619 genes, about 79% of which represented variations of 315 underlying transposon elements. This analysis generated a mathematical definition of the genetic basis for a species. IMPORTANCE: The comprehensive analysis of the pangenome of Escherichia coli presented in this study marks a significant advancement in understanding bacterial genetic diversity. By employing machine learning techniques to analyze 2,377 complete E. coli genomes, the study provides a detailed mapping of core, accessory, and rare genes. This approach reveals the genetic basis for differential traits across phylogroups, offering insights into pathogenicity, antibiotic resistance, and evolutionary adaptations. The findings enhance the potential for genome-based diagnostics and pave the way for future studies aimed at achieving a global genetic definition of bacterial phylogeny.

Cover page of Soil microbiome interventions for carbon sequestration and climate mitigation.

Soil microbiome interventions for carbon sequestration and climate mitigation.

(2025)

Mitigating climate change in soil ecosystems involves complex plant and microbial processes regulating carbon pools and flows. Here, we advocate for the use of soil microbiome interventions to help increase soil carbon stocks and curb greenhouse gas emissions from managed soils. Direct interventions include the introduction of microbial strains, consortia, phage, and soil transplants, whereas indirect interventions include managing soil conditions or additives to modulate community composition or its activities. Approaches to increase soil carbon stocks using microbially catalyzed processes include increasing carbon inputs from plants, promoting soil organic matter (SOM) formation, and reducing SOM turnover and production of diverse greenhouse gases. Marginal or degraded soils may provide the greatest opportunities for enhancing global soil carbon stocks. Among the many knowledge gaps in this field, crucial gaps include the processes influencing the transformation of plant-derived soil carbon inputs into SOM and the identity of the microbes and microbial activities impacting this transformation. As a critical step forward, we encourage broadening the current widespread screening of potentially beneficial soil microorganisms to encompass functions relevant to stimulating soil carbon stocks. Moreover, in developing these interventions, we must consider the potential ecological ramifications and uncertainties, such as incurred by the widespread introduction of homogenous inoculants and consortia, and the need for site-specificity given the extreme variation among soil habitats. Incentivization and implementation at large spatial scales could effectively harness increases in soil carbon stocks, helping to mitigate the impacts of climate change.

Cover page of Revealing systematic changes in the transcriptome during the transition from exponential growth to stationary phase.

Revealing systematic changes in the transcriptome during the transition from exponential growth to stationary phase.

(2025)

UNLABELLED: The composition of bacterial transcriptomes is determined by the transcriptional regulatory network (TRN). The TRN regulates the transition from one physiological state to another. Here, we use independent component analysis to monitor the composition of the transcriptome during the transition from the exponential growth phase to the stationary phase. With Escherichia coli K-12 MG1655 as a model strain, we trigger the transition using carbon, nitrogen, and sulfur starvation. We find that (i) the transition to the stationary phase accompanies common transcriptome changes, including increased stringent responses and reduced production of cellular building blocks and energy regardless of the limiting element; (ii) condition-specific changes are strongly associated with transcriptional regulators (e.g., Crp, NtrC, CysB, Cbl) responsible for metabolizing the limiting element; and (iii) the shortage of each limiting element differentially affects the production of amino acids and extracellular polymers. This study demonstrates how the combination of genome-scale datasets and new data analytics reveals the fundamental characteristics of a key transition in the life cycle of bacteria. IMPORTANCE: Nutrient limitations are critical environmental perturbations in bacterial physiology. Despite its importance, a detailed understanding of how bacterial transcriptomes are adjusted has been limited. By utilizing independent component analysis (ICA) to decompose transcriptome data, this study reveals key regulatory events that enable bacteria to adapt to nutrient limitations. The findings not only highlight common responses, such as the stringent response, but also condition-specific regulatory shifts associated with carbon, nitrogen, and sulfur starvation. The insights gained from this work advance our knowledge of bacterial physiology, gene regulation, and metabolic adaptation.

Cover page of Pangenome mining of the Streptomyces genus redefines species biosynthetic potential.

Pangenome mining of the Streptomyces genus redefines species biosynthetic potential.

(2025)

BACKGROUND: Streptomyces is a highly diverse genus known for the production of secondary or specialized metabolites with a wide range of applications in the medical and agricultural industries. Several thousand complete or nearly complete Streptomyces genome sequences are now available, affording the opportunity to deeply investigate the biosynthetic potential within these organisms and to advance natural product discovery initiatives. RESULTS: We perform pangenome analysis on 2371 Streptomyces genomes, including approximately 1200 complete assemblies. Employing a data-driven approach based on genome similarities, the Streptomyces genus was classified into 7 primary and 42 secondary Mash-clusters, forming the basis for comprehensive pangenome mining. A refined workflow for grouping biosynthetic gene clusters (BGCs) redefines their diversity across different Mash-clusters. This workflow also reassigns 2729 known BGC families to only 440 families, a reduction caused by inaccuracies in BGC boundary detections. When the genomic location of BGCs is included in the analysis, a conserved genomic structure, or synteny, among BGCs becomes apparent within species and Mash-clusters. This synteny suggests that vertical inheritance is a major factor in the diversification of BGCs. CONCLUSIONS: Our analysis of a genomic dataset at a scale of thousands of genomes refines predictions of BGC diversity using Mash-clusters as a basis for pangenome analysis. The observed conservation in the order of BGCs genomic locations shows that the BGCs are vertically inherited. The presented workflow and the in-depth analysis pave the way for large-scale pangenome investigations and enhance our understanding of the biosynthetic potential of the Streptomyces genus.

Deciphering metabolic differentiation during Bacillus subtilis sporulation.

(2025)

The bacterium Bacillus subtilis undergoes asymmetric cell division during sporulation, producing a mother cell and a smaller forespore connected by the SpoIIQ-SpoIIIA (or Q-A) channel. The two cells differentiate metabolically, and the forespore becomes dependent on the mother cell for essential building blocks. Here, we investigate the metabolic interactions between mother cell and forespore using genome-scale metabolic and expression models as well as experiments. Our results indicate that nucleotides are synthesized in the mother cell and transported in the form of nucleoside di- or tri-phosphates to the forespore via the Q-A channel. However, if the Q-A channel is inactivated later in sporulation, then glycolytic enzymes can form an ATP and NADH shuttle, providing the forespore with energy and reducing power. Our integrated in silico and in vivo approach sheds light into the intricate metabolic interactions underlying cell differentiation in B. subtilis, and provides a foundation for future studies of metabolic differentiation.