Blog

OBJECTIVE: To identify a causative variant(s) that may contribute to Alzheimer disease (AD) in African Americans (AA) in the ATP-binding cassette, subfamily A (ABC1), member 7 (ABCA7) gene, a known risk factor for late-onset AD.
METHODS: Custom capture sequencing was performed on ∼150 kb encompassing ABCA7 in 40 AA cases and 37 AA controls carrying the AA risk allele (rs115550680). Association testing was performed for an ABCA7 deletion identified in large AA data sets (discovery n = 1,068; replication n = 1,749) and whole exome sequencing of Caribbean Hispanic (CH) AD families.
RESULTS: A 44-base pair deletion (rs142076058) was identified in all 77 risk genotype carriers, which shows that the deletion is in high linkage disequilibrium with the risk allele. The deletion was assessed in a large data set (531 cases and 527 controls) and, after adjustments for age, sex, and APOE status, was significantly associated with disease (p = 0.0002, odds ratio [OR] = 2.13 [95% confidence interval (CI): 1.42-3.20]). An independent data set replicated the association (447 cases and 880 controls, p = 0.0117, OR = 1.65 [95% CI: 1.12-2.44]), and joint analysis increased the significance (p = 1.414 × 10(-5), OR = 1.81 [95% CI: 1.38-2.37]). The deletion is common in AA cases (15.2%) and AA controls (9.74%), but in only 0.12% of our non-Hispanic white cohort. Whole exome sequencing of multiplex, CH families identified the deletion cosegregating with disease in a large sibship. The deleted allele produces a stable, detectable RNA strand and is predicted to result in a frameshift mutation (p.Arg578Alafs) that could interfere with protein function.
CONCLUSIONS: This common ABCA7 deletion could represent an ethnic-specific pathogenic alteration in AD.

BACKGROUND: RNA molecules fold into complex three-dimensional shapes, guided by the pattern of hydrogen bonding between nucleotides. This pattern of base pairing, known as RNA secondary structure, is critical to their cellular function. Recently several diverse methods have been developed to assay RNA secondary structure on a transcriptome-wide scale using high-throughput sequencing. Each approach has its own strengths and caveats, however there is no widely available tool for visualizing and comparing the results from these varied methods.
METHODS: To address this, we have developed Structure Surfer, a database and visualization tool for inspecting RNA secondary structure in six transcriptome-wide data sets from human and mouse ( http://tesla.pcbi.upenn.edu/strucuturesurfer/ ). The data sets were generated using four different high-throughput sequencing based methods. Each one was analyzed with a scoring pipeline specific to its experimental design. Users of Structure Surfer have the ability to query individual loci as well as detect trends across multiple sites.
RESULTS: Here, we describe the included data sets and their differences. We illustrate the database’s function by examining known structural elements and we explore example use cases in which combined data is used to detect structural trends.
CONCLUSIONS: In total, Structure Surfer provides an easy-to-use database and visualization interface for allowing users to interrogate the currently available transcriptome-wide RNA secondary structure information for mammals.

Alzheimer’s disease (AD) is a complex genetic disorder with no effective treatments. More than 20 common markers have been identified, which are associated with AD. Recently, several rare variants have been identified in Amyloid Precursor Protein (APP), Triggering Receptor Expressed On Myeloid Cells 2 (TREM2) and Unc-5 Netrin Receptor C (UNC5C) that affect risk for AD. Despite the many successes, the genetic architecture of AD remains unsolved. We used Genome-wide Complex Trait Analysis to (1) estimate phenotypic variance explained by genetics; (2) calculate genetic variance explained by known AD single nucleotide polymorphisms (SNPs); and (3) identify the genomic locations of variation that explain the remaining unexplained genetic variance. In total, 53.24% of phenotypic variance is explained by genetics, but known AD SNPs only explain 30.62% of the genetic variance. Of the unexplained genetic variance, approximately 41% is explained by unknown SNPs in regions adjacent to known AD SNPs, and the remaining unexplained genetic variance outside these regions.

INTRODUCTION: African-American (AA) individuals have a higher risk for late-onset Alzheimer’s disease (LOAD) than Americans of primarily European ancestry (EA). Recently, the largest genome-wide association study in AAs to date confirmed that six of the Alzheimer’s disease (AD)-related genetic variants originally discovered in EA cohorts are also risk variants in AA; however, the risk attributable to many of the loci (e.g., APOE, ABCA7) differed substantially from previous studies in EA. There likely are risk variants of higher frequency in AAs that have not been discovered.
METHODS: We performed a comprehensive analysis of genetically determined local and global ancestry in AAs with regard to LOAD status.
RESULTS: Compared to controls, LOAD cases showed higher levels of African ancestry, both globally and at several LOAD relevant loci, which explained risk for AD beyond global differences.
DISCUSSION: Exploratory post hoc analyses highlight regions with greatest differences in ancestry as potential candidate regions for future genetic analyses.

Small non-coding RNAs (sncRNAs) are highly abundant RNAs, typically <100 nucleotides long, that act as key regulators of diverse cellular processes. Although thousands of sncRNA genes are known to exist in the human genome, no single database provides searchable, unified annotation, and expression information for full sncRNA transcripts and mature RNA products derived from these larger RNAs. Here, we present the Database of small human noncoding RNAs (DASHR). DASHR contains the most comprehensive information to date on human sncRNA genes and mature sncRNA products. DASHR provides a simple user interface for researchers to view sequence and secondary structure, compare expression levels, and evidence of specific processing across all sncRNA genes and mature sncRNA products in various human tissues. DASHR annotation and expression data covers all major classes of sncRNAs including microRNAs (miRNAs), Piwi-interacting (piRNAs), small nuclear, nucleolar, cytoplasmic (sn-, sno-, scRNAs, respectively), transfer (tRNAs), and ribosomal RNAs (rRNAs). Currently, DASHR (v1.0) integrates 187 smRNA high-throughput sequencing (smRNA-seq) datasets with over 2.5 billion reads and annotation data from multiple public sources. DASHR contains annotations for ∼ 48,000 human sncRNA genes and mature sncRNA products, 82% of which are expressed in one or more of the curated tissues. DASHR is available at http://lisanwanglab.org/DASHR.

INTRODUCTION: Few high penetrance variants that explain risk in late-onset Alzheimer’s disease (LOAD) families have been found.
METHODS: We performed genome-wide linkage and identity-by-descent (IBD) analyses on 41 non-Hispanic white families exhibiting likely dominant inheritance of LOAD, and having no mutations at known familial Alzheimer’s disease (AD) loci, and a low burden of APOE ε4 alleles.
RESULTS: Two-point parametric linkage analysis identified 14 significantly linked regions, including three novel linkage regions for LOAD (5q32, 11q12.2-11q14.1, and 14q13.3), one of which replicates a genome-wide association LOAD locus, the MS4A6A-MS4A4E gene cluster at 11q12.2. Five of the 14 regions (3q25.31, 4q34.1, 8q22.3, 11q12.2-14.1, and 19q13.41) are supported by strong multipoint results (logarithm of odds [LOD*] ≥1.5). Nonparametric multipoint analyses produced an additional significant locus at 14q32.2 (LOD* = 4.18). The 1-LOD confidence interval for this region contains one gene, C14orf177, and the microRNA Mir_320, whereas IBD analyses implicates an additional gene BCL11B, a regulator of brain-derived neurotrophic signaling, a pathway associated with pathogenesis of several neurodegenerative diseases.
DISCUSSION: Examination of these regions after whole-genome sequencing may identify highly penetrant variants for familial LOAD.

Aging is a major risk factor for many neurodegenerative disorders. A key feature of aging biology that may underlie these diseases is cellular senescence. Senescent cells accumulate in tissues with age, undergo widespread changes in gene expression, and typically demonstrate altered, pro-inflammatory profiles. Astrocyte senescence has been implicated in neurodegenerative disease, and to better understand senescence-associated changes in astrocytes, we investigated changes in their transcriptome using RNA sequencing. Senescence was induced in human fetal astrocytes by transient oxidative stress. Brain-expressed genes, including those involved in neuronal development and differentiation, were downregulated in senescent astrocytes. Remarkably, several genes indicative of astrocytic responses to injury were also downregulated, including glial fibrillary acidic protein and genes involved in the processing and presentation of antigens by major histocompatibility complex class II proteins, while pro-inflammatory genes were upregulated. Overall, our findings suggest that senescence-related changes in the function of astrocytes may impact the pathogenesis of age-related brain disorders.

A rare variant in TREM2 (p.R47H, rs75932628) was recently reported to increase the risk of Alzheimer’s disease (AD) and, subsequently, other neurodegenerative diseases, i.e. frontotemporal lobar degeneration (FTLD), amyotrophic lateral sclerosis (ALS), and Parkinson’s disease (PD). Here we comprehensively assessed TREM2 rs75932628 for association with these diseases in a total of 19,940 previously untyped subjects of European descent. These data were combined with those from 28 published data sets by meta-analysis. Furthermore, we tested whether rs75932628 shows association with amyloid beta (Aβ42) and total-tau protein levels in the cerebrospinal fluid (CSF) of 828 individuals with AD or mild cognitive impairment. Our data show that rs75932628 is highly significantly associated with the risk of AD across 24,086 AD cases and 148,993 controls of European descent (odds ratio or OR = 2.71, P = 4.67 × 10(-25)). No consistent evidence for association was found between this marker and the risk of FTLD (OR = 2.24, P = .0113 across 2673 cases/9283 controls), PD (OR = 1.36, P = .0767 across 8311 cases/79,938 controls) and ALS (OR = 1.41, P = .198 across 5544 cases/7072 controls). Furthermore, carriers of the rs75932628 risk allele showed significantly increased levels of CSF-total-tau (P = .0110) but not Aβ42 suggesting that TREM2’s role in AD may involve tau dysfunction.

Observational research shows that higher body mass index (BMI) increases Alzheimer’s disease (AD) risk, but it is unclear whether this association is causal. We applied genetic variants that predict BMI in Mendelian randomization analyses, an approach that is not biased by reverse causation or confounding, to evaluate whether higher BMI increases AD risk. We evaluated individual-level data from the AD Genetics Consortium (ADGC: 10,079 AD cases and 9613 controls), the Health and Retirement Study (HRS: 8403 participants with algorithm-predicted dementia status), and published associations from the Genetic and Environmental Risk for AD consortium (GERAD1: 3177 AD cases and 7277 controls). No evidence from individual single-nucleotide polymorphisms or polygenic scores indicated BMI increased AD risk. Mendelian randomization effect estimates per BMI point (95% confidence intervals) were as follows: ADGC, odds ratio (OR) = 0.95 (0.90-1.01); HRS, OR = 1.00 (0.75-1.32); GERAD1, OR = 0.96 (0.87-1.07). One subscore (cellular processes not otherwise specified) unexpectedly predicted lower AD risk.

IMPORTANCE: Mutations in known causal Alzheimer disease (AD) genes account for only 1% to 3% of patients and almost all are dominantly inherited. Recessive inheritance of complex phenotypes can be linked to long (>1-megabase [Mb]) runs of homozygosity (ROHs) detectable by single-nucleotide polymorphism (SNP) arrays.
OBJECTIVE: To evaluate the association between ROHs and AD in an African American population known to have a risk for AD up to 3 times higher than white individuals.
DESIGN, SETTING, AND PARTICIPANTS: Case-control study of a large African American data set previously genotyped on different genome-wide SNP arrays conducted from December 2013 to January 2015. Global and locus-based ROH measurements were analyzed using raw or imputed genotype data. We studied the raw genotypes from 2 case-control subsets grouped based on SNP array: Alzheimer’s Disease Genetics Consortium data set (871 cases and 1620 control individuals) and Chicago Health and Aging Project-Indianapolis Ibadan Dementia Study data set (279 cases and 1367 control individuals). We then examined the entire data set using imputed genotypes from 1917 cases and 3858 control individuals.
MAIN OUTCOMES AND MEASURES: The ROHs larger than 1 Mb, 2 Mb, or 3 Mb were investigated separately for global burden evaluation, consensus regions, and gene-based analyses.
RESULTS: The African American cohort had a low degree of inbreeding (F ~ 0.006). In the Alzheimer’s Disease Genetics Consortium data set, we detected a significantly higher proportion of cases with ROHs greater than 2 Mb (P = .004) or greater than 3 Mb (P = .02), as well as a significant 114-kilobase consensus region on chr4q31.3 (empirical P value 2 = .04; ROHs >2 Mb). In the Chicago Health and Aging Project-Indianapolis Ibadan Dementia Study data set, we identified a significant 202-kilobase consensus region on Chr15q24.1 (empirical P value 2 = .02; ROHs >1 Mb) and a cluster of 13 significant genes on Chr3p21.31 (empirical P value 2 = .03; ROHs >3 Mb). A total of 43 of 49 nominally significant genes common for both data sets also mapped to Chr3p21.31. Analyses of imputed SNP data from the entire data set confirmed the association of AD with global ROH measurements (12.38 ROHs >1 Mb in cases vs 12.11 in controls; 2.986 Mb average size of ROHs >2 Mb in cases vs 2.889 Mb in controls; and 22% of cases with ROHs >3 Mb vs 19% of controls) and a gene-cluster on Chr3p21.31 (empirical P value 2 = .006-.04; ROHs >3 Mb). Also, we detected a significant association between AD and CLDN17 (empirical P value 2 = .01; ROHs >1 Mb), encoding a protein from the Claudin family, members of which were previously suggested as AD biomarkers.
CONCLUSIONS AND RELEVANCE: To our knowledge, we discovered the first evidence of increased burden of ROHs among patients with AD from an outbred African American population, which could reflect either the cumulative effect of multiple ROHs to AD or the contribution of specific loci harboring recessive mutations and risk haplotypes in a subset of patients. Sequencing is required to uncover AD variants in these individuals.