Objective
Scleroderma is a genetically complex autoimmune disease with substantial phenotypic heterogeneity. Previous genome-wide association studies have identified common genetic variants associated with disease risk, but these studies are not designed to capture rare or potential causal variants. Our goal was to identify rare as well as common genetic variants in patients with diffuse cutaneous systemic sclerosis (dcSSc) through whole-exome sequencing (WES) in order to identify potential causal variants.Methods
We generated WES data for 32 dcSSc patients with or without interstitial lung disease (ILD) and for 17 healthy "in-house" controls. Variants were annotated and filtered by quality, minor allele frequency, and deleterious effects on gene function. We applied a gene burden test to identify novel dcSSc and dcSSc-associated ILD candidate genes that were enriched with deleterious variants in cases compared to in-house controls as well as controls from the 1000 Genomes Project (n = 130).Results
We identified 70 genes that were enriched with deleterious variants in dcSSc patients. Two of them (BANK1 and TERT) were in pathways previously implicated in SSc or ILD pathogenesis or known susceptibility loci. Newly identified genes (COL4A3, COL4A4, COL5A2, COL13A1, and COL22A1) were significantly enriched in the extracellular matrix-related pathway, which is relevant to the fibrotic features of dcSSc, and in the DNA repair pathway (XRCC4).Conclusion
This study demonstrates the value of WES for the identification of novel gene variants and pathways that may contribute to scleroderma risk and/or severity. The candidate genes we discovered are potential targets for in-depth functional studies.