Evidence of polygenic regulation of the physiological presence of neurofilament light chain in human serum

doi:10.21203/rs.3.rs-422221/v1

Download PDF

Research Article

Evidence of polygenic regulation of the physiological presence of neurofilament light chain in human serum

https://doi.org/10.21203/rs.3.rs-422221/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 07 Mar, 2023

Read the published version in Frontiers in Neurology →

Version 1

posted

You are reading this latest preprint version

The measurement of neurofilament light chain (NfL) in blood is a promising biomarker of neurological injury and disease. We investigated the genetic factors that underlie serum NfL levels (sNfL) of individuals without neurological conditions. We performed a discovery genome-wide association study (GWAS) of sNfL in participants of the German BiDirect Study (N=1,899). A secondary GWAS for meta-analysis was performed in a small Austrian cohort (N=287). Results from the meta-analysis were investigated in relation with several clinical variables in BiDirect. Our discovery GWAS identified 12 genetic loci showing suggestive associations (p<1x10-5) with sNfL. After meta-analysis, 7 loci were significant at this threshold. Genotype-specific differences in sNfL were observed for the lead variants of meta-analysis loci (rs2462121, rs114956339, rs529938, rs73198093, rs34372929, rs10982883 and rs1842909) in BiDirect participants. We found mild associations in meta-analysis loci with markers of inflammation and renal function. At least 6 protein-coding genes (ACTG2, TPRKB, DMXL1, COL23A1, NAT1 and RIMS2) were implicated as potential genetic factors contributing to baseline sNfL levels. Our findings suggest that polygenic regulation of neuronal processes, inflammation, metabolism and clearance modulate the variability of NfL in the circulation. These will aid in the interpretation of sNfL measurements in a personalized manner.

GWAS

neurofilament light chain

serum biomarkers

Neurofilament light chain (NfL) is a subunit of neurofilaments (NFs), cytoskeletal components found exclusively in neurons and particularly abundant in axons. NfL is a major component of the backbone of NFs in the central and peripheral nervous systems [1]. Axonal damage and neuronal death due to neurological diseases, including those of inflammatory, neurodegenerative, traumatic and cerebrovascular nature, result in NfL release into the cerebrospinal fluid (CSF) and blood. Recent technological advances in immunoassay detection have enabled the accurate measurement of the small amounts of NfL that reach the circulation, facilitating its application as a universal peripheral biomarker of the presence and progression of neurological conditions, and of treatment responses [1-3]. Therefore, investigating the factors that influence concentrations of NfL in the periphery becomes crucial for the interpretation of results. To date, it has been demonstrated that NfL serum levels (sNfL) increase with age [4] and potential confounding factors, such as body mass index and cardiovascular risk factors, have been suggested [5,6].

Studies in population-based cohorts have shown a polygenic nature of numerous health-related serum biomarkers, including alanine transaminase (liver function), fibrinogen (clot formation) and glycated hemoglobin (type 2 diabetes mellitus), among many others. These findings can provide novel biological insights and facilitate disease diagnosis and stratification [7]. Nevertheless, to our knowledge, no genetic associations with sNfL have been investigated. We hypothesized that the identification of genetic factors that modulate sNfL in physiological conditions will help interpretation on an individual basis, consequently improving the clinical applications of sNfL as a biomarker. To test our hypothesis, we performed the first genome-wide association study (GWAS) and meta-analysis of sNfL in a total of 2,186 individuals of European descent without known neurological conditions, and correlated our findings with clinical data to identify potential sources of sNfL variability.

Study populations.

The BiDirect Study was initiated in 2009 as a prospective, observational study integrating three cohorts: 1) community-dwelling adults (control cohort), 2) patients with an acute depressive episode (depression cohort), and 3) patients who recently suffered from acute myocardial infarction (MI cohort). The study, whose principal goal is the exploration of the bidirectional relationship between depression and subclinical arteriosclerosis, recruited participants in the district of Münster, Germany, and carried out extensive phenotyping and follow-up of all cohorts in parallel. The study design and methods have been previously described in detail [8]. Here, we included 1,899 BiDirect participants (977 males, 922 females; mean age: 52.1 ± 7.9) from the control (763), depression (851) and MI (285) cohorts.

The Austrian Stroke Prevention Family Study (ASPS)-Fam cohort represents an extension of the prospective, population-based ASPS (Austrian Stroke Prevention Study) on the effects of vascular risk factors in normal aging. ASPS was established in 1991 in the city of Graz, Austria [9]. For ASPS-Fam, first-degree relatives of ASPS participants were invited to join the study. The study’s composition and inclusion criteria have been described elsewhere [10,11]. Here, we included 287 ASPS-Fam participants (115 males, 172 females; mean age: 64.3 ± 10.6).

The basic descriptive information of the BiDirect and ASPS-Fam cohorts are shown in Table 1. All participants of the BiDirect and ASPS-Fam cohorts provided written informed consent. Methods were carried out in accordance with the ethical standards laid down in the updated version of the 1964 Declaration of Helsinki. The BiDirect Study was approved by the ethics committee of the University of Münster and the Westphalian Chamber of Physicians in Münster, North-Rhine-Westphalia, Germany. The ASPS-Fam protocol was approved by the ethics committee of the Medical University of Graz, Austria.

Serum measurements of NfL.

Quantification of sNfL in BiDirect and ASPS-Fam was conducted at the University Hospital Basel, Switzerland, using the single molecule array (Simoa^®) HDX analyzer (Quanterix, Lexington, MA, USA). In BiDirect participants, measurements of sNfL were obtained from non-fasting blood samples collected at the first visit, using the Simoa^® NF-light Advantage Kit. In ASPS-Fam participants, sNfL measurement has been previously described [4]. The sNfL values obtained at initial assessment were log2-transformed and used for all analyses herein reported.

Because it is known that sNfL concentrations increase during aging [4], we tested for age-adjusted sex- and cohort-dependent sNfL differences in BiDirect using analysis of covariance (ANCOVA). We also tested for sNfL correlations, using the Pearson’s method, with markers of inflammation, renal and liver function, lipids, hormones and brain volumes derived from magnetic resonance imaging (MRI) data (106 clinical variables in total). All p<0.05 values were considered statistically significant. Here, age represented the age at participant recruitment, when baseline phenotyping (s0) took place. Clinical variables coming from up to three subsequent follow-up visits were identified as time points s2, s4 and s6.

Genotype data.

For BiDirect genotypes, genomic DNA was isolated from whole blood samples with EDTA using standard DNA extraction kits and procedures at the University of Münster. Genome-wide genotyping was performed with the Infinium PsychArray BeadChip v1 (Illumina) at Life&Brain GmbH (Bonn, Germany). Basic quality control (QC) was employed to remove samples and variants with high rates of missing data. This included removal of individuals with genotyping rate <2%, cryptic relatedness (PI-HAT ≥1/16), and genetic outliers (distance in first two multidimensional scaling components >5 standard deviations from the mean), as well as the removal of variants with call rate <2% and minor allele frequency (MAF) <1%. Genotype imputation was performed with SHAPEIT (pre-phasing) [12] and IMPUTE2 [13] using the 1000 Genomes Project, phase 3, European population reference panel (from here on, 1KG Reference Panel). Imputed variants were filtered for the INFO metric (≥0.8), MAF≥0.01 and Hardy-Weinberg equilibrium (HWE p≥1x10⁻⁶). Individuals were further removed from the sample based on missing phenotypic data (age and baseline sNfL measurement). The final BiDirect GWAS dataset consisted of 5,597,244 genetic variants and 1,899 individuals.

For ASPS-Fam genotypes, genome-wide genotyping was performed with the Genome-Wide Human SNP Array 6.0 (Affymetrix). During the initial QC, variants with MAF<0.05, HWE<5x10^-6 and low variant call rate (>2%) were excluded. Individuals with sex mismatch, cryptic relatedness, low sample call rate (>2%) and other detected failures were removed. Genotype imputation was performed using the Michigan Imputation Server [14] and the 1KG Reference Panel.

Of note, genetic variants herein comprise single nucleotide polymorphisms (SNPs), as well as small insertions/deletions (indels) present in the datasets.

Screening for genetic associations with sNfL.

We conducted a discovery GWAS in the BiDirect dataset under an additive regression model, adjusting for age, sex, cohort and the first 10 principal components. A secondary GWAS in the smaller ASPS-Fam dataset was performed independently at the Medical University of Graz, using age and sex as covariates. After harmonization of summary statistics from both studies, we performed a weighted meta-analysis of all overlapping variants with Rsq≥0.8 and MAF≥0.01 using Plink 1.9 [15]. Variants with high heterogeneity between studies (I>40 and Q<0.1) were subsequently neglected.

Definition of genomic loci for sNfL.

For the discovery GWAS and the meta-analysis, we carried out downstream analyses on the FUMA GWAS platform [16] and defined genomic loci at the suggestive threshold of significance for genome-wide studies (p<1x10^-5), obtained variant annotations and identified the level of support for each signal. Linkage disequilibrium (LD) was defined by r²≥0.6 and a window of 500 kb, according to the 1KG Reference Panel. Subsequently, LD blocks were formed with variants under the suggestive threshold as lead variants, and containing all nominally significant (p<0.05) variants in the dataset that were in LD with the corresponding lead variants. Positional (gene) mapping was performed according to a maximum distance of 1 kb for the categories protein-coding, long non-coding RNA (lncRNA), non-coding RNA (ncRNA) and processed transcripts. Expression quantitative trait loci (eQTLs) were mapped using the BRAINEAC and GTEx v8 Brain databases. Only SNP-gene pairs with false discovery rate (FDR) <0.05 were annotated.

Functional implications of suggested candidate genes.

To inform the biological meaning of our findings, we created a protein-protein interaction (PPI) network using our suggested meta-analysis candidate genes as input. The network was generated with the Gene Set analysis tool of the ReactomeFIViz app for Cytoscape v.3.7.1 [17,18]. Linker proteins and functional interaction (FI) annotations were incorporated into the network (version 2018). In addition, we performed clustering of nodes, as well as enrichment analyses of pathways and gene ontology cellular components (GO_CC) for each network cluster. Gene sets with FDR<0.05 were considered significantly enriched.

SNP Heritability (h²_SNP).

We calculated the proportion of variance in sNfL concentrations explained by our discovery GWAS in BiDirect using the GREML-LDMS (LD- and MAF-stratified GREML) method implemented in GCTA [19,20]. For all autosomal variants with MAF≥0.01 in the imputed dataset, we calculated the 200 kb segment-based LD scores, stratified variants according to LD scores of individual SNPs, computed one genetic relationship matrix for each quartile of the stratified variants, and performed a restricted maximum likelihood analysis using these four matrices. The variance explained was adjusted for the same covariates as the GWAS. SNP heritability from our meta-analysis summary statistics was calculated using LDSC software [21]⁠ with LD scores pre-computed in 1KG Reference Panel data, as suggested by the authors.

Screening for associations with clinical variables.

For the lead variant of each loci resulting from our meta-analysis, we performed genotype-specific comparisons in BiDirect participants using an ANCOVA model adjusted for age. Moreover, for all variants within meta-analysis loci, we tested for associations with the same set of clinical variables used in the correlation analyses. These association tests were performed in the same manner as for baseline sNfL. For these analyses, p<0.05 values were considered suggestive of statistical associations.

Basic characterization of sNfL in BiDirect.

Our initial characterization of sNfL in BiDirect found similar distributions of sNfL in the three cohorts (mean sNfL ± standard deviation: control 2.15 ± 0.44, depression 2.13 ± 0.43, MI 2.29 ± 0.5; Figure 1A) and a positive association with age (p<2x10^-16, beta=0.03), which was independent of the cohort (Figure 1B). Age-adjusted comparisons showed mean differences in sNfL levels between both patient cohorts (depression p=8.2x10^-5, MI p=1.4x10^-3) and the reference cohort, while no differences could be attributed to sex (p=0.56) in this dataset (Figure 1C). Moreover, baseline (s0) sNfL correlated well with all other sNfL measurements (i.e. log- and non-transformed values from follow-up visits), and with markers of inflammation, and of the functions of kidneys, liver and thyroid glands (Additional File 1:Suppl.Table.1). Although highly significant, correlations with these markers showed only mild strength.

Genetic associations with sNfL.

We identified no genetic associations with sNfL surpassing the desired genome-wide significant threshold (p<5x10^-8). However, our observations reached the significance threshold commonly accepted for suggestive associations (p<1x10^-5) in GWASs. Therefore, we adopted the latter to establish significant findings from our GWAS and meta-analysis.

With our discovery GWAS in BiDirect (N=1,899), we observed suggestive signals in 10 chromosomes (Figure 2A). Because the SNP2GENE tool integrates observations coming from GWAS summary statistics with information on LD structure coming from well-established reference panels to define lead variants and genomic loci, and can also be used to annotate an array of functional features for SNPs within the defined loci, we considered this tool to provide an appropriate means for the interpretation of our results. Twelve suggestive genomic loci for sNfL were defined through this analysis. These loci contained 13 lead variants, 14 independent signals, and implicated a total of 246 genetic variants and of 18 mapped genes, from which 7 (CNTNAP5, NAT1, NATP, MTDH, RIMS2, VWA8, and RBFOX1) are protein-coding (Table 2; Additional File 1:Suppl.Table.2). The SNP heritability estimation performed with GCTA showed that this GWAS explained about 30% of the variance in sNfL (h²_SNP= 0.299). However, the analysis also suggested that a larger sample size would be required to confidently detect the genetic component of sNfL (LRT=2.4, p=0.061).

Because the ASPS-Fam cohort has a small sample size and differences in its composition, in comparison with BiDirect, were evident, we chose not to seek validation of our findings in ASPS-Fam, but to use this cohort to carry out a meta-analysis with the aim to gain statistical power (N=2,186). After performing a weighted meta-analysis and filtering out heterogeneous variants (i.e. variants with inconsistent effects), we applied again the SNP2GENE approach to extract a relevant interpretation of our results. Even with the addition of the ASPS-Fam cohort, we did not observe genomic variants reaching genome-wide significance (Figure 2B). Nevertheless, we were able to define 7 suggestive meta-analysis loci spanning 5 chromosomes, 144 variants and 17 mapped genes, including 6 protein-coding genes (ACTG2, TPRKB, DMXL1, COL23A1, NAT1, and RIMS2), that associated with sNfL levels in individuals without neurological conditions (Table 2; Additional File 1:Suppl.Table.3; Additional File 2:Suppl.Figure.1-7). In comparison with our discovery GWAS, meta-analysis loci represented the identification of 4 robust signals (i.e. meta-analysis loci that overlapped GWAS loci; meta-analysis loci #4-7 in chromosomes 8, 9 and 11), as well as the addition of 3 new signals (i.e. meta-analysis loci not found with the discovery GWAS; meta-analysis loci #1-3 in chromosomes 2 and 5). SNP heritability performed with LDSC in our sNfL meta-analysis was estimated to be about 7% (h²_SNP= 0.0711). Nevertheless, we observed a low Chi² statistic (mean Chi²=1.006) for this analysis, which may be due to the small sample size.

Investigation of biological context.

The PPI network created with the protein-coding genes implicated by our meta-analysis loci was able to link 5/6 (exception of NAT1) genes by the incorporation of 9 linker proteins (Figure 3). Four small clusters were defined within this network, which illustrated the differential, yet interconnected functional properties between clusters. The most prominent pathways enriched in each cluster (Additional File 1:Suppl.Table.4) were related to cell signaling and organization of the extracellular matrix (lilac module: ACTG2, COL23A1, FURIN, MMP13, MMP16), senescence, inflammation and cell death (green module: AKT1, TP53, TP53RK, TPRKB), glucose and insulin metabolism (magenta module: MYH9, RAB8A, RIMS2), and immune processes (olive module: DMXL1, RICTOR). These pathways showed consistency with the associations observed between sNfL and clinical variables, including not only inflammation but also those related to thyroid and renal functions, and to blood lipids (e.g. Parathyroid hormone synthesis, secretion and action-FDR=0.0086 in lilac module-; Thyroid hormone signaling pathway-FDR=0.0052 in green module-; Plasma lipoprotein assembly, remodeling, and clearance-FDR=0.03 in lilac module). Additionally, network modules were enriched for distinct cellular compartments (Additional File 1:Suppl.Table.5), mainly: extracellular matrix and Golgi (lilac module), cytoplasm and nucleus (green module), presynaptic cytoskeleton and transport vesicles (magenta module), and the RAVE (regulator of ATPase of vacuoles and endosomes) and TORC2 (target of rapamycin complex 2) complexes (olive module).

Because none of the variants in our GWAS reached the common threshold accepted for genome-wide significance, we sought to demonstrate that these represent true associations. For all lead variants from our meta-analysis loci (rs2462121, rs114956339, rs529938, rs73198093, rs34372929, rs10982883 and rs1842909), we found significant differences in sNfL levels from BiDirect participants with different genotypes, particularly in those individuals with two copies of the effect/minor allele (AA genotype), as compared to those homozygous for the non-effect/major allele (BB genotype) (Table 3). With the exception of rs114956339 (p=0.0016), we found no interactions for sNfL measurements between the genotypes of these variants and the diagnostic group (i.e. depression, MI and control).

Finally, we found evidence suggesting associations of meta-analysis loci with several clinical variables (Additional File 1:Suppl.Table.1). When prioritizing these by the integration of our results from genetic association and sNfL correlation tests, we identified overlaps for 18 variables from the clinical phenotypes (Table 4). These included markers of inflammation (interferon-α, and interleukins 6 and 1α), renal function (cystatin, creatinine, albumin and urea), liver and muscle function (lactate dehydrogenase and lipase), thyroid function (free thyroxine and free triiodothyronine), and blood lipids (HDL cholesterol and triglycerides). Noticeably, the index of comorbidity (which included stroke, leg thrombosis, peripheral artery disease, hypertension, MI, diabetes, depression, cancer, kidney and lung diseases, chronic arthritis, and Parkinson’s disease) and grey matter volume (relative to total brain, coming from magnetic resonance imaging data) were also prioritized. Moreover, as expected, the associations with all sNfL measurements from follow-up visits remained significant (Additional File 1:Suppl.Table.1).

With the increasing interest in the clinical use of sNfL as a peripheral biomarker for the presence, progression and treatment response of neurological conditions in general, there is a need to define which biological factors contribute to physiological variations in sNfL concentrations. Previous studies have reported age, body mass index, blood volume, renal function (as measured by serum creatinine levels), hypertension and pregnancy may act as confounding factors for sNfL [3-6,22]. Here, we corroborated the association of sNfL with aging and renal function, and uncovered other physiological variables associating with sNfL in the BiDirect study. Nevertheless, because of the small-effect interactions and overlaps at the genetic level that we observed, more studies will be necessary to clarify whether these physiological associations represent true confounding factors for sNfL or an epiphenomenon.

As our primary goal was to determine genetic factors that contribute to modulate sNfL concentrations, we performed the first (to our knowledge) discovery GWAS and meta-analysis study in Europeans. Although we report here the findings from both analyses, we wish to focus further on the 7 suggestive loci resulting from our meta-analysis of the BiDirect and ASPS-Fam study populations to gain some biological insights on the implicated genomic regions. Results from our network analysis and overlapping genetic associations with a set of clinical variables show consistency. These highlighted particularly important roles for inflammation, lipids, thyroid hormones and vesicular transport. We also found in the literature, for all protein-coding mapped and/or any-tissue eQTL genes for variants in all of our meta-analysis loci, functions that are relevant for neuronal development and function. As neuronal processes may impact the release of NfL into the CSF and, consequently, its dissemination into peripheral blood, we focused on identifying potential roles of our meta-analysis loci in neuronal functions. However, as suggested by our analyses, it is possible that some variants contribute to regulate sNfL levels through effects on the body’s metabolism and renal clearance.

In our study, NAT1, RIMS2 and DEC1 (meta-analysis loci #4-6, respectively) were the more robustly suggested candidate genes. The NAT1 (N-Acetyltransferase 1) protein forms an enzymatic complex with ARD1 (N-Alpha-Acetyltransferase 10, NatA Catalytic Subunit; NAA10 gene) that is required for neuronal differentiation and dendritic arborization [23,24]. The product of RIMS2 (Regulating Synaptic Membrane Exocytosis 2) functions as a Rab effector involved in synaptic membrane exocytosis [25]. DEC1 (deleted in esophageal cancer 1, DELEC1), a lncRNA gene, is a candidate tumor suppressor [26]. which means that it may regulate the cell cycle and other fundamental cellular processes.

Moreover, meta-analysis locus #1 mapped to ACTG2 (Actin Gamma 2) and implicated TPRKB (TP53RK Binding Protein) as a brain eQTL gene. Although the ACTG2 protein primarily localizes to the cytoskeleton of enteric smooth muscle, this gene has also been found downregulated during the chemical conversion of cultured human cortical astrocytes into neurons by treatment with small molecules [27], suggesting a role for ACTG2 in neuronal development. TPRKB is a subunit of the KEOPS (Kinase, Endopeptidase and Other Proteins of small Size) complex, which is required for the threonyl carbamoyl adenosine (t6A) transfer (t)RNA modification [28]. An increasing number of reports link defects in these modifications to various neurodevelopmental disorders, suggesting a role in the development of the nervous system [29,30]. Additionally, when looking at any-tissue eQTL effects, genetic variants in meta-analysis locus #1 were found to regulate the expression of DCTN1 (Dynactin Subunit 1) and DGUOK (Deoxyguanosine Kinase). The product of DCTN1 is essential for the retrograde transport of vesicles and organelles along microtubules mediated by dynein. In neurons, it activates retrograde axonal transport and regulates microtubule stability [31,32]. On the other hand, DGUOK is a mitochondrial protein that may be involved in neuronal differentiation, as suggested by experiments in retinoic acid-induced differentiated neuronal-like cells [33].

Meta-analysis locus #2 mapped to DMXL1 (Dmx Like 1). In ngr1^-/- mice, this gene was upregulated in axotomized corticospinal motor neurons 4 weeks after pyramidotomy [34], suggesting a role in axonal repair. Meta-analysis locus #3 mapped to COL23A1 (Collagen Type XXIII Alpha 1 Chain), whose dysregulated expression has been reported in different brain regions of mice with repeated experience of agonistic interactions [35]. The work suggested the involvement of extracellular matrix remodeling (and of COL23A1) in the development of experimental psychopathologies. Although meta-analysis locus #7 did not map to protein-coding genes or showed eQTL effects on any in the brain datasets, we found variants in this locus with any-tissue eQTL effects on PTPN5 (Protein Tyrosine Phosphatase Non-Receptor Type 5). This gene regulates synaptic plasticity, and has been implicated in diverse neurological and psychiatric disorders [36-38].

We acknowledge that the relatively small sample size of our study limited our power to detect genetic associations at the genome-wide level and, therefore, to estimate SNP heritability as well. This was indeed reflected by the statistics from our heritability analyses. Nevertheless, we are positive that the future inclusion of appropriate population-based cohorts will help establish these and other genomic regions as genetic drivers of sNfL variations in individuals without neurological conditions. Further bioinformatics and functional studies should help to elucidate the biological relevance of our findings for sNfL measurements. The potential genetic and physiological factors associated with sNfL that were identified by our study warrant future investigations that will pave the way for an optimal application of sNfL as a marker of neuronal conditions.

Ethics approval and consent to participate

All participants of the BiDirect and ASPS-Fam cohorts provided written informed consent. Methods were carried out in accordance with the ethical standards laid down in the updated version of the 1964 Declaration of Helsinki. The BiDirect Study was approved by the ethics committee of the University of Münster and the Westphalian Chamber of Physicians in Münster, North-Rhine-Westphalia, Germany. The ASPS-Fam protocol was approved by the ethics committee of the Medical University of Graz, Austria.

Consent for publication

Not applicable.

Availability of data and materials

The datasets used for this study are available from the authors on reasonable request. The derived data supporting the conclusions presented in this article are included within the article and the corresponding additional files.

Competing interests

D.L. is Chief Medical Officer at GeNeuro. The other authors report no conflicts of interest relevant to the manuscript.

Funding

The BiDirect Study is funded by the German Federal Ministry of Education and Research (BMBF, grant numbers FKZ-01ER0816 and -01ER1506).

Authors' contributions

MHR: Project design, data analysis and interpretation; manuscript preparation. EH, MK and RS: GWAS in ASPS-Fam. MS and KB: Project design and critical revisions. KB: Coordination of the BiDirect Study. HW, AM, DL, PB and JK: Measurements of NfL.

Acknowledgements

The authors thank Till Andlauer for his support with genotype imputation.

Yuan, A., Rao, M.V., Veeranna, Nixon, R.A. Neurofilaments and Neurofilament Proteins in Health and Disease. Cold Spring Harb. Perspect. Biol.; 9, a018309 (2017).
Khalil, M. et al. Neurofilaments as biomarkers in neurological disorders. Nat. Rev. Neurol. 14, 577-589 (2018).
Barro, C., Chitnis, T., Weiner, H.L. Blood neurofilament light: a critical review of its application to neurologic disease. Ann. Clin. Transl. Neurol. 7, 2508-23 (2020).
Khalil, M. et al. Serum neurofilament light levels in normal aging and their association with morphologic brain changes. Nat. Commun. 11, 812 (2020).
Manouchehrinia, A. et al. Confounding effect of blood volume and body mass index on blood neurofilament light chain levels. Ann. Clin. Transl. Neurol. 7, 139-143 (2020).
Korley, F.K. et al. Serum NfL (Neurofilament Light Chain) Levels and Incident Stroke in Adults With Diabetes Mellitus. Stroke. 50, 1669-1675 (2019).
Prins, B.P. et al. Genome-wide analysis of health-related biomarkers in the UK Household Longitudinal Study reveals novel associations. Sci. Rep. 7, 11008 (2017).
Teismann, H. et al. Establishing the bidirectional relationship between depression and subclinical arteriosclerosis--rationale, design, and characteristics of the BiDirect Study. BMC Psychiatry. 14, 174 (2014).
Schmidt, R. et al. Assessment of cerebrovascular risk profiles in healthy persons: definition of research goals and the Austrian Stroke Prevention Study (ASPS). Neuroepidemiology. 13, 308-13 (1994).
Seiler, S. et al. Magnetization transfer ratio relates to cognitive impairment in normal elderly. Front. Aging Neurosci. 6, 263 (2014).
Hilal, S. et al. Enlarged perivascular spaces and cognition: A meta-analysis of 5 population-based studies. Neurology. 91, e832-e842 (2018).
Delaneau, O., Zagury, J.F., Marchini, J. Improved whole-chromosome phasing for disease and population genetic studies. Nat. Methods. 10, 5-6 (2013).
Howie, B.N., Donnelly, P., Marchini, J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet. 5, e1000529 (2009).
Das, S. et al. Next-generation genotype imputation service and methods. Nat. Genet. 48, 1284-1287 (2016).
Chang, C.C., Chow, C.C., Tellier, L.C., Vattikuti, S., Purcell, S.M., Lee, J.J. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience. 4, 7 (2015).
Watanabe, K., Taskesen, E., van Bochoven, A., Posthuma, D. Functional mapping and annotation of genetic associations with FUMA. Nat. Commun. 8, 1826 (2017).
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498-504 (2003).
Wu, G., Feng, X., Stein, L. A human functional protein interaction network and its application to cancer data analysis. Genome Biol. 11, R53 (2010).
Yang, J., Lee, S.H., Goddard, M.E., Visscher, P.M. GCTA: A tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82 (2011).
Yang, J. et al. Genetic variance estimation with imputed variants finds negligible missing heritability for human height and body mass index. Nat. Genet. 47, 1114–20 (2015).
Bulik-Sullivan, B.K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–5 (2015).
Akamine, S. et al. Renal function is associated with blood neurofilament light chain level in older adults. Sci. Rep. 10, 20350 (2020).
Sugiura, N., Adams, S.M., Corriveau, R.A. An evolutionarily conserved N-terminal acetyltransferase complex associated with neuronal development. J. Biol. Chem. 278, 40113-20 (2003).
Ohkawa, N. et al. N-acetyltransferase ARD1-NAT1 regulates neuronal dendritic development. Genes Cells. 13, 1171-83 (2008).
Wang, Y., Südhof, T.C. Genomic definition of RIM proteins: evolutionary amplification of a family of synaptic regulatory proteins. Genomics. 81, 126-37 (2003).
Nishiwaki, T., Daigo, Y., Kawasoe, T., Nakamura, Y. Isolation and mutational analysis of a novel human cDNA, DEC1 (deleted in esophageal cancer 1), derived from the tumor suppressor locus in 9q32. Genes Chromosomes Cancer. 27, 169-76 (2000).
Ma, N.X., Yin, J.C., Chen, G. Transcriptome analysis of small molecule-mediated astrocyte-to-neuron reprogramming. Front. Cell Dev. Biol. 7, 82 (2019).
Srinivasan, M. et al. The highly conserved KEOPS/EKC complex is essential for a universal tRNA modification, t6A. EMBO J. 30, 873-81 (2011).
Ramos, J., Fu, D. The emerging impact of tRNA modifications in the brain and nervous system. Biochim. Biophys. Acta Gene Regul. Mech. 1862, 412-428 (2019).
Arrondel, C. et al. Defects in t6A tRNA modification due to GON7 and YRDC mutations lead to Galloway-Mowat syndrome. Nat. Commun. 10, 3967 (2019).
Lazarus, J.E., Moughamian, A.J., Tokito, M.K., Holzbaur, E.L. Dynactin subunit p150(Glued) is a neuron-specific anti-catastrophe factor. PLoS Biol. 11, e1001611 (2013).
Ayloo, S., Lazarus, J.E., Dodda, A., Tokito, M., Ostap, E.M., Holzbaur, E.L. Dynactin functions as both a dynamic tether and brake during dynein-driven motility. Nat. Commun. 5, 4807 (2014).
Moutaoufik, M.T. et al. Rewiring of the human mitochondrial interactome during neuronal reprogramming reveals regulators of the respirasome and neurogenesis. iScience. 19, 1114-1132 (2019).
Fink, K.L., López-Giráldez, F., Kim, I.J., Strittmatter, S.M., Cafferty, W.B.J. Identification of intrinsic axon growth modulators for intact CNS neurons after injury. Cell Rep. 18, 2687-2701 (2017). Table S4.
Smagin, D.A., Galyamina, A.G., Kovalenko, I.L., Babenko, V.N., Kudryavtseva, N.N. Aberrant expression of collagen gene family in the brain regions of male mice with behavioral psychopathologies induced by chronic agonistic interactions. Biomed. Res. Int. 2019, 7276389 (2019).
Olausson, P., Venkitaramani, D.V., Moran, T.D., Salter, M.W., Taylor, J.R., Lombroso, P.J. The tyrosine phosphatase STEP constrains amygdala-dependent memory formation and neuroplasticity. Neuroscience. 225, 1-8 (2012).
Karasawa, T., Lombroso, P.J. Disruption of striatal-enriched protein tyrosine phosphatase (STEP) function in neuropsychiatric disorders. Neurosci. Res. 89, 1-9 (2014).
Jang, S.S. et al. Regulation of STEP61 and tyrosine-phosphorylation of NMDA and AMPA receptors during homeostatic synaptic plasticity. Mol. Brain. 8, 55 (2015).

Table 1. Basic description of BiDirect and ASPS-Fam cohorts.

Cohort	Log2 sNfL (mean ± SD)	Age (mean ± SD)	Males (n)	Females (n)	Total (N)
BiDirect	2.16 ± 0.45	52.1 ± 7.9	977	922	1899
BiDirect-Control	2.15 ± 0.44	53.4 ± 8.2	385	378	763
BiDirect-Depression	2.13 ± 0.43	49.9 ± 7.3	348	503	851
BiDirect-MI	2.29 ± 0.5	55.2 ± 6.7	244	41	285
ASPS-Fam	4.99 ± 0.65	64.3 ± 10.6	115	172	287
MI: myorcardial infarction, sNfL: serum neurofilament light chain, SD: standard deviation.

Table 2. Suggestive genomic loci for sNfL measures in BiDirect and the meta-analysis with ASPS-Fam. Loci were defined using FUMA GWAS (LD block r²>=0.6, window 500 kb, lead variant p<1e^-5, clumped variant p<0.05).

Locus	Index variant	Alleles	Chr	Index BP	Index P	Index effect	Start (BP)	End (BP)	# Variants	#Ind.Sig. Variants	Ind.Sig.Variants	#Lead variants	Lead variants	Genes (protein-coding)
Discovery GWAS (BiDirect; N=1,899)
1	rs76037384	A/T	2	125357776	2.17E-06	+	125357776	125403277	23	1	rs76037384	1	rs76037384	CNTNAP5
2	rs12674781	C/T	8	1377915	9.64E-06	+	1356333	1378411	25	1	rs12674781	1	rs12674781	-
3	rs184931198	C/T	8	18210838	2.90E-06	+	17954598	18218371	10	2	rs184931198, rs73198093	1	rs184931198	NAT1, NATP
4	rs142838371	A/G	8	98741426	6.88E-06	+	98656430	98741426	5	1	rs142838371	1	rs142838371	MTDH
5	rs34372929	A/AT	8	104596668	8.87E-06	+	104530581	104718242	7	1	rs34372929	1	rs34372929	RIMS2
6	rs62576696	A/G	9	118311682	2.15E-06	+	118167915	118488131	100	2	rs62576696, rs12380012	2	rs62576696, rs12380012	-
7	rs1842909	C/G	11	18918227	9.10E-06	+	18873142	18939666	25	1	rs1842909	1	rs1842909	-
8	rs146801204	C/T	12	117050196	4.07E-06	-	117039399	117060536	10	1	rs146801204	1	rs146801204	-
9	rs76207901	G/T	13	42524241	7.12E-06	+	42388330	42524241	3	1	rs76207901	1	rs76207901	VWA8
10	rs1514928	A/C	14	62678303	1.29E-06	+	62669677	62678303	3	1	rs1514928	1	rs1514928	-
11	rs8060528	C/T	16	7024428	7.69E-07	-	7011164	7038560	34	1	rs8060528	1	rs8060528	RBFOX1
12	rs74607435	C/T	19	45235700	5.21E-06	+	45235700	45235700	1	1	rs74607435	1	rs74607435	-
Meta-analysis (BiDirect + ASPS-Fam; N=2,186)
1	rs2462121	C/G	2	74137893	9.98E-06	- -	74129474	74140230	40	1	rs2462121	1	rs2462121	ACTG2, TPRKB^a
2	rs114956339	A/G	5	118578014	7.46E-06	+ +	118365512	118595407	4	1	rs114956339	1	rs114956339	DMXL1
3	rs529938	C/T	5	177961577	7.90E-06	+ +	177959285	177963534	21	1	rs529938	1	rs529938	COL23A1
4	rs73198093	C/G	8	17954598	8.48E-06	+ +	17954598	18107883	8	1	rs73198093	1	rs73198093	NAT1
5	rs34372929	A/AT	8	104596668	6.25E-06	+ +	104530581	104718242	7	1	rs34372929	1	rs34372929	RIMS2
6	rs10982883	C/T	9	118461688	6.25E-06	+ +	118450617	118488131	40	1	rs10982883	1	rs10982883	-
7	rs1842909	C/G	11	18918227	5.61E-06	+ +	18873142	18939666	24	1	rs1842909	1	rs1842909	-
sNfL: serum neurofilament light chain, Chr: chromosome, BP: base pair, Ind.Sig.Variants: individual significant variants, ^aeQTL effects of variants in the locus according to the BRAINEAC and GTEx v8 Brain datasets

Table 3. Genotype-dependent differences in sNfL for lead meta-analysis variants. Summary of ANCOVA results (p-values) for comparisons of sNfL values among BiDirect participants according to genotype, and genotype by group interactions.

Locus	Variant (rsID)	Effect allele (A)	Non-effect allele (B)	By genotype (P, age adjust)	By genotype-group (P, age adjust)	AA vs AB (P, 2-1)	AA vs BB (P, 2-0)	AB vs BB (P, 1-0)
1	rs2462121	G	C	4.26E-05	0.337	6.8E-02	1.1E-04	6.9E-03
2	rs114956339	A	G	3.25E-05	0.00157	NA	NA	3.3E-05
3	rs529938	T	C	4.44E-05	0.18	3.6E-04	4.7E-05	6.1E-01
4	rs73198093	C	G	1.06E-05	0.311	NA	NA	1.1E-05
5	rs34372929	A	AT	2.82E-05	0.945	1.5E-03	1.9E-05	1.9E-01
6	rs10982883	C	T	2.78E-05	0.815	1.6E-02	1.2E-04	1.8E-02
7	rs1842909	G	C	9.01E-06	0.594	5.1E-02	8.5E-06	3.8E-03

Table 4. Prioritized clinical variables in BiDirect showed significant correlations with sNfL and the GWAS meta-analysis loci.

BiDirect time point	Variable label	Instrument	Effective N	# Variants p<0.05	sNfL Pearson p-value	sNfL Pearson coefficient
dx	Index of comorbidity	Comorbidity	1899	2	0	0.1889
s0	Grey matter volume relative total brain	(f)MRI	1208	6	0	-0.2749
s0	HDL cholesterol i.S. mmol/l	Blood lipids	1849	26	1.40E-03	0.0741
s0	Triglyceride i.S. mmol/l]	Blood lipids	1850	3	3.60E-02	-0.0488
s0	Interleukin-6 (IL-6) i.S. pg/ml	Inflammation	1880	7	3.60E-02	-0.0483
s0	Interleukin-1α (IL-1α) i.S. pg/ml	Inflammation	1880	5	4.10E-02	-0.047
s0	Lactate dehydrogenase i.S. µkatal/l	Liver + muscle function	1850	36	2.90E-07	0.1189
s0	Lipase i.S. µkatal/l	Liver + muscle function	1841	21	1.70E-03	0.0732
s0	Cystatin i.S. mg/l	Renal function	1842	51	0	0.3109
s0	Creatinine i.S. µmol/l	Renal function	1850	32	0	0.2028
s0	Urea i.S. mmol/l]	Renal function	1845	31	0	0.197
s0	Albumin in serum (i.S.) g/l	Renal function	1849	21	3.50E-04	-0.0831
s0	Free triiodothyronine (ft3) i.S. pmol/l	Thyroid function	1792	3	7.70E-04	-0.0793
s4	Interferon-alpha (IFN-α) i.S. pg/ml	Inflammation	957	21	3.10E-02	-0.0698
s4	Lactate dehydrogenase i.S. µkatal/l	Liver + muscle function	968	26	2.60E-06	0.1504
s4	Creatinine i.S. µmol/l	Renal function	970	4	2.50E-09	0.1899
s4	Urea i.S. mmol/l]	Renal function	971	4	1.30E-09	0.1933
s4	Free thyroxin (ft4) i.S. pmol/l	Thyroid function	970	23	8.00E-03	0.0851

Competing interest reported. D.L. is Chief Medical Officer at GeNeuro. The other authors report no conflicts of interest relevant to the manuscript.

Download PDF

Journal Publication

published 07 Mar, 2023

Read the published version in Frontiers in Neurology →

Version 1

posted

You are reading this latest preprint version

Evidence of polygenic regulation of the physiological presence of neurofilament light chain in human serum

Status:

Journal Publication

Version 1

Abstract

Figures

Background

Subjects And Methods

Results

Discussion

Declarations

References

Tables

Additional Declarations

Supplementary Files

Status:

Journal Publication

Version 1