Article
Peer Reviewed

Evaluating human genetic support for hypothesized metabolic disease genes

UCLA Previously Published Works (2022)

We investigate the extent to which human genetic data are incorporated into studies that hypothesize novel links between genes and metabolic disease. To lower the barriers to using genetic data, we present an approach to enable researchers to evaluate human genetic support for experimentally determined hypotheses.

Article
Peer Reviewed

Targeted next-generation sequencing in anophthalmia and microphthalmia patients confirms SOX2, OTX2 and FOXE3 mutations

UC San Francisco Previously Published Works (2011)

Background

Anophthalmia/microphthalmia (A/M) is caused by mutations in several different transcription factors, but mutations in each causative gene are relatively rare, emphasizing the need for a testing approach that screens multiple genes simultaneously. We used next-generation sequencing to screen 15 A/M patients for mutations in 9 pathogenic genes to evaluate this technology for screening in A/M.

Methods

We used a pooled sequencing design, together with custom single nucleotide polymorphism (SNP) calling software. We verified predicted sequence alterations using Sanger sequencing.

Results

We verified three mutations: c.542delC in SOX2, resulting in p.Pro181Argfs*22, p.Glu105X in OTX2 and p.Cys240X in FOXE3. We found several novel sequence alterations and SNPs that were likely to be non-pathogenic: p.Glu42Lys in CRYBA4, p.Val201Met in FOXE3 and p.Asp291Asn in VSX2. Our analysis methodology gave one false positive result comprising a mutation in PAX6 (c.1268A>T, predicting p.X423LeuextX*15) that was not verified by Sanger sequencing. We also failed to detect one 20 base pair (bp) deletion and one 3 bp duplication in SOX2.

Conclusions

Our results demonstrated the power of next-generation sequencing with pooled sample groups for the rapid screening of candidate genes for A/M as we were correctly able to identify disease-causing mutations. However, next-generation sequencing was less useful for small, intragenic deletions and duplications. We did not find mutations in 10/15 patients and conclude that there is a need for further gene discovery in A/M.

1 supplemental file

Article
Peer Reviewed

The Power of Gene-Based Rare Variant Methods to Detect Disease-Associated Variation and Test Hypotheses About Complex Disease

UC San Diego Previously Published Works (2015)

Genome and exome sequencing in large cohorts enables characterization of the role of rare variation in complex diseases. Success in this endeavor, however, requires investigators to test a diverse array of genetic hypotheses which differ in the number, frequency and effect sizes of underlying causal variants. In this study, we evaluated the power of gene-based association methods to interrogate such hypotheses, and examined the implications for study design. We developed a flexible simulation approach, using 1000 Genomes data, to (a) generate sequence variation at human genes in up to 10K case-control samples, and (b) quantify the statistical power of a panel of widely used gene-based association tests under a variety of allelic architectures, locus effect sizes, and significance thresholds. For loci explaining ~1% of phenotypic variance underlying a common dichotomous trait, we find that all methods have low absolute power to achieve exome-wide significance (~5-20% power at α = 2.5 × 10⁻⁶) in 3K individuals; even in 10K samples, power is modest (~60%). The combined application of multiple methods increases sensitivity, but does so at the expense of a higher false positive rate. MiST, SKAT-O, and KBAC have the highest individual mean power across simulated datasets, but we observe wide architecture-dependent variability in the individual loci detected by each test, suggesting that inferences about disease architecture from analysis of sequencing studies can differ depending on which methods are used. Our results imply that tens of thousands of individuals, extensive functional annotation, or highly targeted hypothesis testing will be required to confidently detect or exclude rare variant signals at complex disease loci.

Article
Peer Reviewed

Leveraging type 1 diabetes human genetic and genomic data in the T1D knowledge portal.

UC San Diego Previously Published Works (2023)

To address the challenge of translating genetic discoveries for type 1 diabetes (T1D) into mechanistic insight, we have developed the T1D Knowledge Portal (T1DKP), an open-access resource for hypothesis development and target discovery in T1D.

Article
Peer Reviewed

A combined polygenic score of 21,293 rare and 22 common variants improves diabetes diagnosis based on hemoglobin A1C levels

UCLA Previously Published Works (2022)

Polygenic scores (PGSs) combine the effects of common genetic variants^1,2 to predict risk or treatment strategies for complex diseases^3-7. Adding rare variation to PGSs has largely unknown benefits and is methodically challenging. Here, we developed a method for constructing rare variant PGSs and applied it to calculate genetically modified hemoglobin A1C thresholds for type 2 diabetes (T2D) diagnosis^7-10. The resultant rare variant PGS is highly polygenic (21,293 variants across 154 genes), depends on ultra-rare variants (72.7% observed in fewer than three people) and identifies significantly more undiagnosed T2D cases than expected by chance (odds ratio = 2.71; P = 1.51 × 10^-6). A PGS combining common and rare variants is expected to identify 4.9 million misdiagnosed T2D cases in the United States, nearly 1.5-fold more than the common variant PGS alone. These results provide a method for constructing complex trait PGSs from rare variants and suggest that rare variants will augment common variants in precision medicine approaches for common disease.

Article
Peer Reviewed

The Lipid Droplet Knowledge Portal: A resource for systematic analyses of lipid droplet biology

UC Berkeley Previously Published Works (2022)

Lipid droplets (LDs) are organelles of cellular lipid storage with fundamental roles in energy metabolism and cell membrane homeostasis. There has been an explosion of research into the biology of LDs, in part due to their relevance in diseases of lipid storage, such as atherosclerosis, obesity, type 2 diabetes, and hepatic steatosis. Consequently, there is an increasing need for a resource that combines datasets from systematic analyses of LD biology. Here, we integrate high-confidence, systematically generated human, mouse, and fly data from studies on LDs in the framework of an online platform named the "Lipid Droplet Knowledge Portal" (https://lipiddroplet.org/). This scalable and interactive portal includes comprehensive datasets, across a variety of cell types, for LD biology, including transcriptional profiles of induced lipid storage, organellar proteomics, genome-wide screen phenotypes, and ties to human genetics. This resource is a powerful platform that can be utilized to identify determinants of lipid storage.

Article
Peer Reviewed

Discovering metabolic disease gene interactions by correlated effects on cellular morphology

UC San Diego Previously Published Works (2019)

Objective

Impaired expansion of peripheral fat contributes to the pathogenesis of insulin resistance and Type 2 Diabetes (T2D). We aimed to identify novel disease-gene interactions during adipocyte differentiation.

Methods

Genes in disease-associated loci for T2D, adiposity and insulin resistance were ranked according to expression in human adipocytes. The top 125 genes were ablated in human pre-adipocytes via CRISPR/CAS9 and the resulting cellular phenotypes quantified during adipocyte differentiation with high-content microscopy and automated image analysis. Morphometric measurements were extracted from all images and used to construct morphologic profiles for each gene.

Results

Over 10⁷ morphometric measurements were obtained. Clustering of the morphologic profiles across all genes revealed a group of 14 genes characterized by decreased lipid accumulation, and enriched for known lipodystrophy genes. For two lipodystrophy genes, BSCL2 and AGPAT2, sub-clusters with PLIN1 and CEBPA identified by morphological similarity were validated by independent experiments as novel protein-protein and gene regulatory interactions.

Conclusions

A morphometric approach in adipocytes can resolve multiple cellular mechanisms for metabolic disease loci; this approach enables mechanistic interrogation of the hundreds of metabolic disease loci whose function still remains unknown.

Article
Peer Reviewed

Rare Complete Knockouts in Humans: Population Distribution and Significant Role in Autism Spectrum Disorders

UC San Francisco Previously Published Works (2013)

To characterize the role of rare complete human knockouts in autism spectrum disorders (ASDs), we identify genes with homozygous or compound heterozygous loss-of-function (LoF) variants (defined as nonsense and essential splice sites) from exome sequencing of 933 cases and 869 controls. We identify a 2-fold increase in complete knockouts of autosomal genes with low rates of LoF variation (≤ 5% frequency) in cases and estimate a 3% contribution to ASD risk by these events, confirming this observation in an independent set of 563 probands and 4,605 controls. Outside the pseudoautosomal regions on the X chromosome, we similarly observe a significant 1.5-fold increase in rare hemizygous knockouts in males, contributing to another 2% of ASDs in males. Taken together, these results provide compelling evidence that rare autosomal and X chromosome complete gene knockouts are important inherited risk factors for ASD.

Article
Peer Reviewed

Distribution and Medical Impact of Loss-of-Function Variants in the Finnish Founder Population

UCLA Previously Published Works (2014)

Exome sequencing studies in complex diseases are challenged by the allelic heterogeneity, large number and modest effect sizes of associated variants on disease risk and the presence of large numbers of neutral variants, even in phenotypically relevant genes. Isolated populations with recent bottlenecks offer advantages for studying rare variants in complex diseases as they have deleterious variants that are present at higher frequencies as well as a substantial reduction in rare neutral variation. To explore the potential of the Finnish founder population for studying low-frequency (0.5-5%) variants in complex diseases, we compared exome sequence data on 3,000 Finns to the same number of non-Finnish Europeans and discovered that, despite having fewer variable sites overall, the average Finn has more low-frequency loss-of-function variants and complete gene knockouts. We then used several well-characterized Finnish population cohorts to study the phenotypic effects of 83 enriched loss-of-function variants across 60 phenotypes in 36,262 Finns. Using a deep set of quantitative traits collected on these cohorts, we show 5 associations (p<5×10⁻⁸) including splice variants in LPA that lowered plasma lipoprotein(a) levels (P = 1.5×10⁻¹¹⁷). Through accessing the national medical records of these participants, we evaluate the LPA finding via Mendelian randomization and confirm that these splice variants confer protection from cardiovascular disease (OR = 0.84, P = 3×10⁻⁴), demonstrating for the first time the correlation between very low levels of LPA in humans with potential therapeutic implications for cardiovascular diseases. More generally, this study articulates substantial advantages for studying the role of rare variation in complex phenotypes in founder populations like the Finns and by combining a unique population genetic history with data from large population cohorts and centralized research access to National Health Registers.

Article
Peer Reviewed

Rare variants in PPARG with decreased activity in adipocyte differentiation are associated with increased risk of type 2 diabetes

UC San Diego Previously Published Works (2014)

Peroxisome proliferator-activated receptor gamma (PPARG) is a master transcriptional regulator of adipocyte differentiation and a canonical target of antidiabetic thiazolidinedione medications. In rare families, loss-of-function (LOF) mutations in PPARG are known to cosegregate with lipodystrophy and insulin resistance; in the general population, the common P12A variant is associated with a decreased risk of type 2 diabetes (T2D). Whether and how rare variants in PPARG and defects in adipocyte differentiation influence risk of T2D in the general population remains undetermined. By sequencing PPARG in 19,752 T2D cases and controls drawn from multiple studies and ethnic groups, we identified 49 previously unidentified, nonsynonymous PPARG variants (MAF < 0.5%). Considered in aggregate (with or without computational prediction of functional consequence), these rare variants showed no association with T2D (OR = 1.35; P = 0.17). The function of the 49 variants was experimentally tested in a novel high-throughput human adipocyte differentiation assay, and nine were found to have reduced activity in the assay. Carrying any of these nine LOF variants was associated with a substantial increase in risk of T2D (OR = 7.22; P = 0.005). The combination of large-scale DNA sequencing and functional testing in the laboratory reveals that approximately 1 in 1,000 individuals carries a variant in PPARG that reduces function in a human adipocyte differentiation assay and is associated with a substantial risk of T2D.
