Blog

INTRODUCTION: The objective of this pilot study was to establish the feasibility of recruiting older Vietnamese Americans for research addressing genetic and nongenetic risk factors for Alzheimer disease (AD).
METHODS: Twenty-six Vietnamese Americans were recruited from communities in San Diego. A Community Advisory Board provided cultural and linguistic advice. Bilingual/bicultural staff measured neuropsychological, neuropsychiatric, lifestyle, and medical/neurological functioning remotely. Saliva samples allowed DNA extraction. A consensus team reviewed clinical data to determine a diagnosis of normal control (NC), mild cognitive impairment (MCI), or dementia. Exploratory analyses addressed AD risk by measuring subjective cognitive complaints (SCC), depression, and vascular risk factors (VRFs).
RESULTS: Twenty-five participants completed the study (mean age=73.8 y). Eighty percent chose to communicate in Vietnamese. Referrals came primarily from word of mouth within Vietnamese communities. Diagnoses included 18 NC, 3 MCI, and 4 dementia. Participants reporting SCC acknowledged more depressive symptoms and had greater objective cognitive difficulty than those without SCC. Eighty-eight percent of participants reported at least 1 VRF.
DISCUSSION: This pilot study supports the feasibility of conducting community-based research in older Vietnamese Americans. Challenges included developing linguistically and culturally appropriate cognitive and neuropsychiatric assessment tools. Exploratory analyses addressing nongenetic AD risk factors suggest topics for future study.

BACKGROUND: Progressive supranuclear palsy (PSP) is a rare neurodegenerative disease characterized by the accumulation of aggregated tau proteins in astrocytes, neurons, and oligodendrocytes. Previous genome-wide association studies for PSP were based on genotype array, therefore, were inadequate for the analysis of rare variants as well as larger mutations, such as small insertions/deletions (indels) and structural variants (SVs).
METHOD: In this study, we performed whole genome sequencing (WGS) and conducted association analysis for single nucleotide variants (SNVs), indels, and SVs, in a cohort of 1,718 cases and 2,944 controls of European ancestry. Of the 1,718 PSP individuals, 1,441 were autopsy-confirmed and 277 were clinically diagnosed.
RESULTS: Our analysis of common SNVs and indels confirmed known genetic loci at MAPT, MOBP, STX6, SLCO1A2, DUSP10, and SP1, and further uncovered novel signals in APOE, FCHO1/MAP1S, KIF13A, TRIM24, TNXB, and ELOVL1. Notably, in contrast to Alzheimer’s disease (AD), we observed the APOE ε2 allele to be the risk allele in PSP. Analysis of rare SNVs and indels identified significant association in ZNF592 and further gene network analysis identified a module of neuronal genes dysregulated in PSP. Moreover, seven common SVs associated with PSP were observed in the H1/H2 haplotype region (17q21.31) and other loci, including IGH, PCMT1, CYP2A13, and SMCP. In the H1/H2 haplotype region, there is a burden of rare deletions and duplications (P = 6.73 × 10-3) in PSP.
CONCLUSIONS: Through WGS, we significantly enhanced our understanding of the genetic basis of PSP, providing new targets for exploring disease mechanisms and therapeutic interventions.

INTRODUCTION: Despite a two-fold risk, individuals of African ancestry have been underrepresented in Alzheimer’s disease (AD) genomics efforts.
METHODS: Genome-wide association studies (GWAS) of 2,903 AD cases and 6,265 controls of African ancestry. Within-dataset results were meta-analyzed, followed by functional genomics analyses.
RESULTS: A novel AD-risk locus was identified in MPDZ on chromosome (chr) 9p23 (rs141610415, MAF = 0.002, p = 3.68×10-9). Two additional novel common and nine rare loci were identified with suggestive associations (P < 9×10-7). Comparison of association and linkage disequilibrium (LD) patterns between datasets with higher and lower degrees of African ancestry showed differential association patterns at chr12q23.2 (ASCL1), suggesting that this association is modulated by regional origin of local African ancestry.
DISCUSSION: These analyses identified novel AD-associated loci in individuals of African ancestry and suggest that degree of African ancestry modulates some associations. Increased sample sets covering as much African genetic diversity as possible will be critical to identify additional loci and deconvolute local genetic ancestry effects.
HIGHLIGHTS: Genetic ancestry significantly impacts risk of Alzheimer's Disease (AD). Although individuals of African ancestry are twice as likely to develop AD, they are vastly underrepresented in AD genomics studies. The Alzheimer's Disease Genetics Consortium has previously identified 16 common and rare genetic loci associated with AD in African American individuals. The current analyses significantly expand this effort by increasing the sample size and extending ancestral diversity by including populations from continental Africa. Single variant meta-analysis identified a novel genome-wide significant AD-risk locus in individuals of African ancestry at the MPDZ gene, and 11 additional novel loci with suggestive genome-wide significance at p < 9×10-7. Comparison of African American datasets with samples of higher degree of African ancestry demonstrated differing patterns of association and linkage disequilibrium at one of these loci, suggesting that degree and/or geographic origin of African ancestry modulates the effect at this locus. These findings illustrate the importance of increasing number and ancestral diversity of African ancestry samples in AD genomics studies to fully disentangle the genetic architecture underlying AD, and yield more effective ancestry-informed genetic screening tools and therapeutic interventions.

Detecting structural variants (SVs) in whole-genome sequencing poses significant challenges. We present a protocol for variant calling, merging, genotyping, sensitivity analysis, and laboratory validation for generating a high-quality SV call set in whole-genome sequencing from the Alzheimer’s Disease Sequencing Project comprising 578 individuals from 111 families. Employing two complementary pipelines, Scalpel and Parliament, for SV/indel calling, we assessed sensitivity through sample replicates (N = 9) with in silico variant spike-ins. We developed a novel metric, D-score, to evaluate caller specificity for deletions. The accuracy of deletions was evaluated by Sanger sequencing. We generated a high-quality call set of 152,301 deletions of diverse sizes. Sanger sequencing validated 114 of 146 detected deletions (78.1%). Scalpel excelled in accuracy for deletions ≤100 bp, whereas Parliament was optimal for deletions >900 bp. Overall, 83.0% and 72.5% of calls by Scalpel and Parliament were validated, respectively, including all 11 deletions called by both Parliament and Scalpel between 101 and 900 bp. Our flexible protocol successfully generated a high-quality deletion call set and a truth set of Sanger sequencing-validated deletions with precise breakpoints spanning 1-17,000 bp.

OBJECTIVE: This study examines whether individualism weakens the effectiveness of the COVID-19 vaccine eligibility expansions in the United States in 2021, and assesses the associated social benefits or costs associated with individualism.
METHODS: We construct a county-level composite individualism index as a proxy of culture and the fraction of vaccine eligible population as a proxy of vaccination campaign (mean: 41.34%). We estimate whether the COVID-19 vaccine eligibility policy is less effective in promoting vaccine coverage, reducing in COVID-19 related hospitalization and death using a linear two-way fixed effect model in a sample of 2866 counties for the period between early December 2020 and July 1, 2021. We also test whether individualism shapes people’s attitudes towards vaccine using a linear probability model in a sample of 625,308 individuals aged 18-65 (mean age: 43.3; 49% male; 59.1% non-Hispanic white, 19.1% Hispanic, 12% African American; 5.9% Asian) from the Household Pulse Survey.
RESULTS: The effects of expanded vaccine eligibility are diminished in counties with greater individualism, as evidenced by lower effectiveness in increasing vaccination rates and reducing outpatient doctor visits primarily for COVID-related symptoms and COVID deaths. Moreover, our results show that this cultural influence on attitudes towards vaccine is more pronounced among the less educated, but unrelated to race.
CONCLUSION: Assuming an average level of vaccine eligibility policies and an average intensity of individualism across the nation, we calculate that the average social cost associated with an individualistic culture amid the pandemic is approximately $50.044 billion, equivalent to 1.32% of the total U.S. health care spending in 2019. Our paper suggests that strategies to promote public policy compliance should be tailored to accommodate cultural and social contexts.

IMPORTANCE: The chromosome 17q21.31 region, containing a 900 Kb inversion that defines H1 and H2 haplotypes, represents the strongest genetic risk locus in progressive supranuclear palsy (PSP). In addition to H1 and H2, various structural forms of 17q21.31, characterized by the copy number of α, β, and γ duplications, have been identified. However, the specific effect of each structural form on the risk of PSP has never been evaluated in a large cohort study.
OBJECTIVE: To assess the association of different structural forms of 17q.21.31, defined by the copy numbers of α, β, and γ duplications, with the risk of PSP and MAPT sub-haplotypes.
DESIGN SETTING AND PARTICIPANTS: Utilizing whole genome sequencing data of 1,684 (1,386 autopsy confirmed) individuals with PSP and 2,392 control subjects, a case-control study was conducted to investigate the association of copy numbers of α, β, and γ duplications and structural forms of 17q21.31 with the risk of PSP. All study subjects were selected from the Alzheimer’s Disease Sequencing Project (ADSP) Umbrella NG00067.v7. Data were analyzed between March 2022 and November 2023.
MAIN OUTCOMES AND MEASURES: The main outcomes were the risk (odds ratios [ORs]) for PSP with 95% CIs. Risks for PSP were evaluated by logistic regression models.
RESULTS: The copy numbers of α and β were associated with the risk of PSP only due to their correlation with H1 and H2, while the copy number of γ was independently associated with the increased risk of PSP. Each additional duplication of γ was associated with 1.10 (95% CI, 1.04-1.17; P = 0.0018) fold of increased risk of PSP when conditioning H1 and H2. For the H1 haplotype, addition γ duplications displayed a higher odds ratio for PSP: the odds ratio increases from 1.21 (95%CI 1.10-1.33, P = 5.47 × 10-5) for H1β1γ1 to 1.29 (95%CI 1.16-1.43, P = 1.35 × 10-6) for H1β1γ2, 1.45 (95%CI 1.27-1.65, P = 3.94 × 10-8) for H1β1γ3, and 1.57 (95%CI 1.10-2.26, P = 1.35 × 10-2) for H1β1γ4. Moreover, H1β1γ3 is in linkage disequilibrium with H1c (R2 = 0.31), a widely recognized MAPT sub-haplotype associated with increased risk of PSP. The proportion of MAPT sub-haplotypes associated with increased risk of PSP (i.e., H1c, H1d, H1g, H1o, and H1h) increased from 34% in H1β1γ1 to 77% in H1β1γ4.
CONCLUSIONS AND RELEVANCE: This study revealed that the copy number of γ was associated with the risk of PSP independently from H1 and H2. The H1 haplotype with more γ duplications showed a higher odds ratio for PSP and were associated with MAPT sub-haplotypes with increased risk of PSP. These findings expand our understanding of how the complex structure at 17q21.31 affect the risk of PSP.

The heterogeneity of the whole-exome sequencing (WES) data generation methods present a challenge to a joint analysis. Here we present a bioinformatics strategy for joint-calling 20,504 WES samples collected across nine studies and sequenced using ten capture kits in fourteen sequencing centers in the Alzheimer’s Disease Sequencing Project. The joint-genotype called variant-called format (VCF) file contains only positions within the union of capture kits. The VCF was then processed specifically to account for the batch effects arising from the use of different capture kits from different studies. We identified 8.2 million autosomal variants. 96.82% of the variants are high-quality, and are located in 28,579 Ensembl transcripts. 41% of the variants are intronic and 1.8% of the variants are with CADD > 30, indicating they are of high predicted pathogenicity. Here we show our new strategy can generate high-quality data from processing these diversely generated WES samples. The improved ability to combine data sequenced in different batches benefits the whole genomics research community.

The prevalence of dementia among South Asians across India is approximately 7.4% in those 60 years and older, yet little is known about genetic risk factors for dementia in this population. Most known risk loci for Alzheimer’s disease (AD) have been identified from studies conducted in European Ancestry (EA) but are unknown in South Asians. Using whole-genome sequence data from 2680 participants from the Diagnostic Assessment of Dementia for the Longitudinal Aging Study of India (LASI-DAD), we performed a gene-based analysis of 84 genes previously associated with AD in EA. We investigated associations with the Hindi Mental State Examination (HMSE) score and factor scores for general cognitive function and five cognitive domains. For each gene, we examined missense/loss-of-function (LoF) variants and brain-specific promoter/enhancer variants, separately, both with and without incorporating additional annotation weights (e.g., deleteriousness, conservation scores) using the variant-Set Test for Association using Annotation infoRmation (STAAR). In the missense/LoF analysis without annotation weights and controlling for age, sex, state/territory, and genetic ancestry, three genes had an association with at least one measure of cognitive function (FDR q<0.1). APOE was associated with four measures of cognitive function, PICALM was associated with HMSE score, and TSPOAP1 was associated with executive function. The most strongly associated variants in each gene were rs429358 (APOE ε4), rs779406084 (PICALM), and rs9913145 (TSPOAP1). rs779406084 is a rare missense mutation that is more prevalent in LASI-DAD than in EA (minor allele frequency=0.075% vs. 0.0015%); the other two are common variants. No genes in the brain-specific promoter/enhancer analysis met criteria for significance. Results with and without annotation weights were similar. Missense/LoF variants in some genes previously associated with AD in EA are associated with measures of cognitive function in South Asians from India. Analyzing genome sequence data allows identification of potential novel causal variants enriched in South Asians.

INTRODUCTION: Clinical research in Alzheimer’s disease (AD) lacks cohort diversity despite being a global health crisis. The Asian Cohort for Alzheimer’s Disease (ACAD) was formed to address underrepresentation of Asians in research, and limited understanding of how genetics and non-genetic/lifestyle factors impact this multi-ethnic population.
METHODS: The ACAD started fully recruiting in October 2021 with one central coordination site, eight recruitment sites, and two analysis sites. We developed a comprehensive study protocol for outreach and recruitment, an extensive data collection packet, and a centralized data management system, in English, Chinese, Korean, and Vietnamese.
RESULTS: ACAD has recruited 606 participants with an additional 900 expressing interest in enrollment since program inception.
DISCUSSION: ACAD’s traction indicates the feasibility of recruiting Asians for clinical research to enhance understanding of AD risk factors. ACAD will recruit > 5000 participants to identify genetic and non-genetic/lifestyle AD risk factors, establish blood biomarker levels for AD diagnosis, and facilitate clinical trial readiness.
HIGHLIGHTS: The Asian Cohort for Alzheimer’s Disease (ACAD) promotes awareness of under-investment in clinical research for Asians. We are recruiting Asian Americans and Canadians for novel insights into Alzheimer’s disease. We describe culturally appropriate recruitment strategies and data collection protocol. ACAD addresses challenges of recruitment from heterogeneous Asian subcommunities. We aim to implement a successful recruitment program that enrolls across three Asian subcommunities.

BACKGROUND: Mitochondrial DNA (mtDNA) is a double-stranded circular DNA and has multiple copies in each cell. Excess heteroplasmy, the coexistence of distinct variants in copies of mtDNA within a cell, may lead to mitochondrial impairments. Accurate determination of heteroplasmy in whole-genome sequencing (WGS) data has posed a significant challenge because mitochondria carrying heteroplasmic variants cannot be distinguished during library preparation. Moreover, sequencing errors, contamination, and nuclear mtDNA segments can reduce the accuracy of heteroplasmic variant calling.
OBJECTIVE: To efficiently and accurately call mtDNA homoplasmic and heteroplasmic variants from the large-scale WGS data generated from the Alzheimer’s Disease Sequencing Project (ADSP), and test their association with Alzheimer’s disease (AD).
METHODS: In this study, we present MitoH3-a comprehensive computational pipeline for calling mtDNA homoplasmic and heteroplasmic variants and inferring haplogroups in the ADSP WGS data. We first applied MitoH3 to 45 technical replicates from 6 subjects to define a threshold for detecting heteroplasmic variants. Then using the threshold of 5% ≤variant allele fraction≤95%, we further applied MitoH3 to call heteroplasmic variants from a total of 16,113 DNA samples with 6,742 samples from cognitively normal controls and 6,183 from AD cases.
RESULTS: This pipeline is available through the Singularity container engine. For 4,311 heteroplasmic variants identified from 16,113 samples, no significant variant count difference was observed between AD cases and controls.
CONCLUSIONS: Our streamlined pipeline, MitoH3, enables computationally efficient and accurate analysis of a large number of samples.