Skip to main content

Associations of plasma neurofilament light chain with cognition and neuroimaging measures in community-dwelling early old age men



Plasma neurofilament light chain (NfL) is a promising biomarker of neurodegeneration with potential clinical utility in monitoring the progression of neurodegenerative diseases. However, the cross-sectional associations of plasma NfL with measures of cognition and brain have been inconsistent in community-dwelling populations.


We examined these associations in a large community-dwelling sample of early old age men (N = 969, mean age = 67.57 years, range = 61–73 years), who are either cognitively unimpaired (CU) or with mild cognitive impairment (MCI). Specifically, we investigated five cognitive domains (executive function, episodic memory, verbal fluency, processing speed, visual-spatial ability), as well as neuroimaging measures of gray and white matter.


After adjusting for age, health status, and young adult general cognitive ability, plasma NfL level was only significantly associated with processing speed and white matter hyperintensity (WMH) volume, but not with other cognitive or neuroimaging measures. The association with processing speed was driven by individuals with MCI, as it was not detected in CU individuals.


These results suggest that in early old age men without dementia, plasma NfL does not appear to be sensitive to cross-sectional individual differences in most domains of cognition or neuroimaging measures of gray and white matter. The revealed plasma NfL associations were limited to WMH for all participants and processing speed only within the MCI cohort. Importantly, considering cognitive status in community-based samples will better inform the interpretation of the relationships of plasma NfL with cognition and brain and may help resolve mixed findings in the literature.


Neurofilament light chain (NfL) has been researched as a non-specific biomarker of neurodegeneration in a wide range of neurological and neurodegenerative diseases such as Alzheimer’s disease (AD), frontotemporal dementia, and Parkinson disease [1,2,3]. As an axoskeletal protein that maintains myelinated large-caliber axons, NfL concentrations elevate in cerebrospinal fluid (CSF) and blood (i.e., plasma, serum) following neuroaxonal damages or degeneration [3, 4]. For instance, elevated CSF and blood-based NfL concentrations have been detected in individuals with AD and non-AD neurodegenerative diseases [1,2,3,4,5], as well as those undergoing normal aging process [6, 7]. Because blood-based NfL is less invasive and more cost-effective compared to neuroimaging and CSF measures of neurodegeneration, studies have evaluated blood-based NfL as a potential biomarker of neurodegeneration for monitoring and predicting neurodegenerative disease progression [1, 2, 8]. Specifically, research has examined blood-based NfL’s associations with multiple domains of cognition and neuroimaging measures of gray and white matter to better understand whether or not blood-based NfL shows specificity to certain cognitive domains and aspects of neurodegeneration. However, findings on the cross-sectional associations of blood-based NfL concentrations with cognition and neuroimaging markers of neurodegeneration have been mixed in community-dwelling individuals without a clinical diagnosis of dementia.

Several community-based studies in dementia-free middle-aged and older adults have shown that plasma or serum NfL level is associated with performance on the Mini Mental State Exam (MMSE) [6, 9, 10], as well as one or more specific cognitive domains including memory, language, attention, and executive function [10,11,12,13]. However, other studies in community-dwelling individuals found no reliable associations between plasma NfL and these cognitive domains [14, 15]. The inconsistencies might be attributed to differences in age ranges across the study samples (e.g., 30–66 years old vs. 55–90 years old) [10, 15], as there is evidence suggesting a stronger association between NfL and cognition in those greater than 50 years old relative to those who are much younger [10]. Moreover, differences in cognitive status among community-dwelling individuals may also affect the strength of associations, such that some associations were only detected in those with MCI but not cognitively unimpaired (CU) individuals [9]. Finally, some of the prior work only utilized a single task to measure a cognitive domain, which is not as reliable as those that used multiple tasks to tap a single domain [16, 17]. Nonetheless, there is still a lack of sufficiently large-scale studies to fully understand and resolve these inconsistent findings regarding the associations of blood-based NfL with multiple cognitive domains that are assessed by more than one task.

Similarly, studies on the cross-sectional associations between blood-based NfL and neuroimaging measures of neurodegeneration have yielded mixed results. For macrostructural measures of gray matter, there is evidence indicating that higher plasma NfL level is associated with lower cortical thickness and smaller hippocampal volume in community-based dementia-free individuals [11, 12, 18]. Conversely, there are studies that either failed to detect any association between plasma NfL, cortical thickness, and hippocampal volume [13, 19]. Similarly, a study that focused on cortical thickness of AD signature regions (i.e., regions vulnerable to AD pathology and neurodegeneration), also did not detect any association [13]. White matter hyperintensities (WMH), a marker of white matter damage, and diffusion-based metrics of white matter integrity (e.g., fractional anisotropy, mean diffusivity), have also been examined in relation to blood-based NfL. While one study found a cross-sectional association between higher plasma NfL level and lower white matter integrity in a large community-dwelling sample of individuals who are either CU or with MCI [13], two other studies in a large community-dwelling cohort with both CU individuals and those with MCI did not observe any association [11, 19]. Interestingly, another study examined these associations separately in CU and MCI groups, and found a significant association between plasma NfL and white matter integrity only in the MCI group [20]. These findings suggest that differences in sample composition with respect to cognitive status may explain the discrepant cross-sectional results. Finally, similar conflicting results have been reported for WMH, such that both no and moderate cross-sectional associations with blood-based NfL were found in community-dwelling samples without dementia [7, 11,12,13, 21].

These inconsistent cross-sectional findings of blood-based NfL’s associations with cognition and brain in community-dwelling samples undermines its potential clinical utility in the diagnosis or monitoring of neurodegeneration. As such, there is a need for additional large-scale community-based investigations before blood-based NfL is implemented in clinical settings. In the present study, we systematically examined the associations of plasma NfL with both cognition and neuroimaging measures in 969 early old age community-dwelling men (mean age = 67.57 years, range = 61–73 years) who were either CU or with MCI. Specifically, we investigated the associations of plasma NfL with five cognitive domains: executive function, episodic memory, verbal fluency, processing speed, and visual-spatial ability. Each of these cognitive domains was assessed by at least two tasks. Next, we examined plasma NfL’s associations with neuroimaging measures of gray and white matter commonly included in prior work (e.g., cortical thickness, volume), as well as a less examined measure of gray matter microstructural integrity – cortical mean diffusivity (MD) –which has been shown to be a sensitive marker of microstructural neurodegeneration that precedes atrophy in cortical thickness and volume [22]. Finally, we divided the sample into CU and MCI groups to explore whether the associations of plasma NfL with cognition and neuroimaging measures differ as a function of cognitive status, which may help resolve some of the mixed findings in community-dwelling samples and provide additional context to precisely characterize the nature of these associations.



A total of 969 community-dwelling men without dementia (mean age = 67.57 years, SD = 2.52; range = 61–73) from wave 3 of the Vietnam Era Twin Study of Aging (VETSA) were included in the present study [23]. VETSA is a multisite longitudinal study of aging and risk for MCI and AD beginning in middle age [23, 24]. All participants served in the U.S. military sometime between 1965 and 1975, with 80% reporting no combat exposure. They are similar to American men in their age cohort with respect to health, education, and lifestyle characteristics [25]. Participants traveled to the University of California, San Diego (UCSD) or Boston University (BU) to participate in the VETSA. However, MRI scans were only acquired at UCSD for wave 3. Informed consent was obtained from all participants and institutional review boards at both sites approved all protocols. Table 1 is a summary of the sample characteristics.

Table 1 Characteristics of the study sample

Plasma NfL collection and processing

Plasma specimens were collected under fasting conditions. Fasting began by 9:00 PM the night before testing, and specimens were acquired the following morning between 8:00 AM and 8:30 AM. Following the procedures of our prior work [26], NfL was assayed on a single-plex plate using the ultra-sensitive Simoa technology platform HD-1 (Simoa NFL Advantage Kit; Quanterix Corporation) by the USC Alzheimer’s Therapeutic Research Institute Biomarker Core (PI: Dr. Robert Rissman) [14]. All assays were performed according to the manufacturer’s instructions. The standard exclusion criteria included hemolysis and a coefficient of variation in plasma concentrations > 0.20.

Cognitive domains

Factor scores for all five cognitive domains were derived from scores of the neuropsychological tasks. All task scores were adjusted for practice effects for returning participants [27]. Higher scores represent better cognitive performance. Below, we briefly described the tasks and conditions included in each cognitive domain.

Executive function

Measures from six well-established neuropsychological tasks were used to compute the executive function factor score: (1) Stroop [28] – interference condition, (2) Delis-Kaplan Executive Function System (D-KEFS) Trail Making Test [29] – switching condition, (3) Category Switching [29] – fluency switching condition, (4) Wechsler Memory Scale (WMS-III) Letter-Number Sequencing [30] – total number of correct trials, (5) Reading Span [31] – total number of correct words recalled, (6) Digit Span [30] – forward and backward conditions. For executive function, a confirmatory factor analysis was performed based on scores from six tasks to derive the factor score for each participant as described previously [32].

Episodic memory

The following tasks and conditions were used: (1) WMS-III Logical Memory (LM) subtest [30] with immediate and delayed recall conditions, (2) WMS-III Visual Reproduction (VR) subtest [30] with immediate and delayed recall conditions, and (3) the California Verbal Learning Test (CVLT-II) [33] with short- and long-delay free recall conditions. For the episodic memory factor score, a confirmatory factor analysis was performed using measures from three tasks to derive a factor for each participant as described previously [34].

Verbal fluency

The D-KEFS Verbal Fluency Test was used [29], which includes the phonemic fluency subtests (F, A, and S), two semantic fluency subtests (Animals and Boys’ Names), and a category switching subtest in which participants were instructed to alternate between naming fruits and items of furniture. For category switching subtest, the correct number of words generated aloud during the test was used as the score, rather than the correct number of switches. For the verbal fluency factor score, a confirmatory factor analysis was performed based on measures from two tests to derive the score for each participant as described previously [35].

Processing speed

For processing speed, there were three different tasks: (1) the D-KEFS Trails Making Test [29] with number sequencing and letter sequencing conditions, (2) Simple Reaction Time [36] with left hand and right hand conditions, (3) Stroop [28] with the word reading condition and the color naming condition. For the processing speed factor score, measures from three tests were used to derive the score for each participant using confirmatory factor analyses as described previously [37].

Visual-spatial ability

The following three tasks were used to index visual-spatial ability: (1) Card Rotations [38], (2) Hidden Fig. [39], and (3) Armed Forces Qualification Test (AFQT) [40] Box Folding subtest. For the visual-spatial ability factor score, measures from three tasks were used to derive a factor score for each participant from standardized factor scoring coefficients using exploratory factor analyses.

MRI acquisition and processing

All acquisition parameters have been described in detail previously [41,42,43]. Briefly, T1-weighted images (sagittal 3D fast spoiled gradient echo (FSPGR), TE = 3.164 msec, TR = 8.084 msec), T2-weighted images (coronal 2D FRFSE-XL, TE = 94 msec, TR = 4600 msec), proton-density (PD)-weighted images (coronal, TE = min/full, TR = 3000 msec), and diffusion-weighted images (51 diffusion directions, b value = 1,000 s/mm2, integrated with a pair of b = 0 images with opposite phase encode polarity, TR = 9,700 ms, TE = 80–84 ms) were acquired on two General Electric (GE) Discovery MR750 3.0T scanners (GE Healthcare, Waukesha, WI, USA) at UCSD with an eight-channel phased array head coil.

For measures of cortical thickness and hippocampal volume, all MRI images were preprocessed at the UCSD Center for Multimodal Imaging Genetics as described previously [42, 44]. All raw and processed MRI images were visually inspected to exclude data with severe scanner artifacts or excessive head motion from subsequent analyses. T1-weighted images were processed as described previously [41, 42, 44]. Preprocessing steps included correction of distortion due to gradient nonlinearity, image intensity normalization, and rigid registration into standard orientation with 1 mm isotropic voxel size. Average cortical thickness, regional cortical thickness (for the signature below), and hippocampal volume were derived using FreeSurfer 6.0 ( software package.

Additionally, we derived an AD thickness/volume signature based on work by McEvoy and colleagues [45, 46]. This AD thickness/volume signature is a weighted average of cortical thickness in seven cortical regions (entorhinal cortex, middle temporal gyrus, bank of superior temporal sulcus, superior temporal gyrus, isthmus cingulate, medial orbitofrontal cortex, lateral orbitofrontal cortex) plus hippocampal volume. We regressed out the effects of age and scanner for each ROI, as well as estimated intracranial volume for the hippocampus. Standardized residuals of ROIs were then weighted accordingly and summed together to form the thickness/volume AD signature score.

For global volume of WMH, a multi-channel segmentation approach was used similar to our previous work [43]. This approach leverages complementary information from three volumes with different contrasts to increase measurement sensitivity while reducing the impact of MR acquisition noise. The measure used was the proportion of total white matter that had WMH, a ratio of global WMH to total white matter. Processing steps included standard alignment of the T1 (6 degrees-of-freedom, rigid transformation to an anterior/posterior commissure aligned space), registration of T2 and PD to T1 using a mutual information method [47] and intensity non-uniformity correction using N3 [48]. The tissue segmentation was performed using the statistical classification framework of the open source Insight Segmentation and Registration Toolkit (ITK) [49] leveraging Scott’s L2E method [50] to determine robust means and covariances, and in combination with anatomical considerations via ITK’s morphological operators [51]. This produced an automated segmentation including white matter, gray matter, CSF, and WMH. WMH included three categories within our T1-T2-PD feature space per Mahalanobis distance considerations [52], reflecting which primary tissue type was most similar to the abnormalities measured: WMH-white (properties more similar to white matter tissue), WMH-gray (properties more similar to gray matter), or WMH-fluid (properties more similar to CSF). Due to the dramatic curvature and convolutions of the most anterior and posterior aspects of the brain, common partial voluming of cortical gyri results in numerous false positive WMH voxel assignments. Therefore, we did not allow automated WMH assignments in the most posterior and anterior 20 mm of the brain. Additionally, since partial voluming of CSF and white matter along the edges of ventricles results in voxels with WMH-like signal, even in healthy individuals [53], our segmentation did not allow any voxels that touched (i.e., shared a common face, edge, or vertex with) a ventricular fluid voxel to be classified as WMH. The automated segmentations were visually reviewed and manually edited when necessary to correct misclassifications. To reduce human error, edits were restricted to only changing voxels between gray and WMH-gray (e.g., if a cortical voxel was mislabeled as WMH) or between fluid and WMH-fluid (e.g., if the central voxels of an WMH cluster were mislabeled as CSF). The automatically classified WMH-white voxels were never edited.

For regional white matter tract measures of integrity, diffusion-weighted images (DWI) were processed as described previously [54, 55]. Briefly, DWIs were corrected for eddy current distortion, head motion, B0 distortions, and gradient nonlinearity distortions. The b = 0 images were registered to T1 images using mutual information and then rigidly resampled into a standard orientation relative to the atlas-registered T1, with 2 mm isotropic resolution. Conventional diffusion tensor imaging (DTI) methods were used to model the diffusion tensor as an ellipsoid where eigenvalues λ1, λ2 and λ3 define the three primary axes. We computed Fractional anisotropy (FA) and mean diffusivity (MD) for 25 white matter fiber tracts using a probabilistic atlas (12 in each hemisphere plus corpus callosum) [56] and created a composite score based on the union of all fiber tracts.

Additionally, we computed MD for regional cortical gray matter (i.e., cortical MD) as described previously [55]. Briefly, white and gray matter voxels were labeled using the cortical surfaces and subcortical segmentations generated by FreeSurfer in processed T1-weighted image resolution (1 mm isotropic). Diffusion metrics were mapped to the cortical surface as previously described [55]. To minimize the effects of partial voluming and regional variations in cortical thickness, we take a weighted average of multiple samples across the cortical ribbon based on the proportion of gray matter at each sampling location. Vertex-wise values were then averaged within each cortical parcellation. Similarly, weighted averages of diffusion metrics within subcortical structures were based on voxel-wise proportions of gray matter within each subcortical segmentation [57]. Like the AD signature, a MD signature based on cortical MD (i.e., MD signature) of the same eight ROIs was computed as described previously [58].

Young adult general cognitive ability

Participants were on average 20 years of age when they completed the Armed Forces Qualification Test (AFQT). The AFQT is a standardized, validated 100-item multiple-choice paper-and-pencil test of general cognitive ability [40]. It includes 4 components: vocabulary, arithmetic, spatial processing, and knowledge and reasoning about tools. AFQT percentile scores were probit transformed and z-scored for analysis. This test is highly correlated with other tests of general cognitive ability such as the Wechsler Adult Intelligence Scale (r = 0.84) [59].

Health status

A modification of the Charlson Comorbidity Index was used [60]. One point was assigned for the presence of each of 15 different chronic medical conditions as described previously [42]. Higher scores indicate poorer health.

Statistical analyses

All statistical analyses were performed using R version 4.1.2. Plasma NfL was adjusted for testing site and sample storage time using the umx_residualize() function from the umx R package. Residualized scores of plasma NfL were then log transformed to improve their distribution properties. All neuroimaging measures were adjusted for scanner differences. Hippocampal volume was additionally adjusted for intracranial volume (ICV). To examine the associations of plasma NfL with cognitive and brain measures, linear mixed models were performed using the lme4 package with plasma NfL as the independent variable. Age, health status, and young adult general cognitive ability (GCA) were included as covariates in all models unless otherwise noted. The inclusion of young adult GCA aimed to account for the effect of longstanding individual differences in cognitive ability on late midlife cognition and brain structures. In supplemental analyses, years of education was included as a covariate rather than young adult GCA to make the analyses more comparable to prior work, since most of the previous studies on plasma NfL do not have early adult GCA. Because there were twin pairs in our sample, we included twin pair ID as a random intercept in linear mixed models to account for correlated outcomes. P-values were calculated using Satterthwaite degrees of freedom approximation. Multiple comparison correction was applied using the Li and Ji approach for false discovery rate (FDR) control [61], which accounts for the correlated relationships among our dependent variables.


Associations of plasma NfL with covariates

Consistent with prior work, higher plasma NfL level was significantly correlated with age (r = 0.13, p < 0.0001) and poorer health (r = 0.08, p = 0.014). The correlations of plasma NfL level with young adult GCA (r = 0.06, p = 0.059) and years of education (r = 0.06, p = 0.050) were not statistically significant.

Plasma NfL and cognition

Among the five cognitive domains, plasma NfL was only associated with processing speed (β=-0.12, pFDR−corrected=0.0004), such that higher plasma NfL level was associated with slower processing speed (Table 2). Associations of plasma NfL with the other four cognitive domains were not significant either before or after multiple comparisons correction. In our supplementary analyses that covaried for years of education instead of young adult GCA, we also observed the same significant association with processing speed (β=-0.11, pFDR−corrected=0.004), but not with other cognitive domains either before or after multiple comparisons correction (ps > 0.05) (Table S1).

Table 2 Plasma NfL’s associations with cognition in the full sample

We repeated the aforementioned analyses with only cognitively unimpaired (CU) participants, but did not find any significant associations with cognitive domains after correcting for multiple comparisons (Table 3). Higher plasma NfL level was associated with poorer executive function, but this association did not survive correction for multiple comparisons (β=-0.06, puncorrected=0.024, pFDR-corrected=0.096). Models covarying for years of education instead of GCA yielded similar results (Table S2). In the MCI group, plasma NfL level was only significantly associated with processing speed (β=-0.39, pFDR-corrected=0.0004) like it was in the full sample, but not with other cognitive domains (ps > 0.05). Covarying for years of education also did not change the results (β=-0.36, pFDR-corrected=0.0004) (Table S2).

Table 3 Plasma NfL’s associations with cognition in CU and MCI groups

To determine if the association of plasma NfL with processing speed in the full sample was driven by MCI status, we tested for an interaction between MCI status and plasma NfL. Indeed, we found a significant interaction effect (β = -0.29, p < 0.0001, 95% CI [-0.44, -0.14]), indicating that the association was significantly stronger in the MCI group compared to the CU group.

Plasma NfL and neuroimaging measures

Among all the neuroimaging measures, higher plasma NfL level was only significantly associated with greater WMH in the full sample both before and after FDR correction (β = 0.005, pFDR−corrected=0.035). As shown in Table 4, mean cortical thickness, hippocampal volume, cortical MD, FA and MD of white matter tracts, and AD and MD signatures were not associated with plasma NfL level (ps > 0.05, before and after FDR correction). Similar results were observed when covarying for years of education, such that plasma NfL level was only associated with the proportion of abnormal white matter volume both before and after FDR correction (β = 0.006, pFDR−corrected=0.015) (Table S3). When we repeated the analyses in CU and MCI groups separately, no significant associations between plasma NfL level and neuroimaging measures were detected (ps > 0.05, before and after FDR correction) (Tables S4-S5).

Table 4 Plasma NfL’s associations with brain structures in the full sample


Establishing reliable associations of plasma NfL with cognition and neuroimaging measures of neurodegeneration in community-based settings is important for understanding its utility in predicting risk and monitoring progress of neurodegenerative diseases. Existing cross-sectional associations have been largely inconsistent in community-dwelling dementia-free individuals. Leveraging a large-scale aging dataset with well-established neuropsychological tests encompassing five cognitive domains, we found that plasma NfL level was only significantly associated with processing speed in early old age men with MCI, but not in CU individuals. Similarly, there was no cross-sectional association between plasma NfL level with any of the neuroimaging measures of neurodegeneration, with the exception of an association with white matter pathology in the full sample. Together, our results suggest that plasma NfL may not be sensitive to cross-sectional differences in cognition or measures of gray matter neurodegeneration, but may capture some cross-sectional differences in white matter pathology among early old age dementia-free men at ages 61–73 years.

The overall lack of cross-sectional association between plasma NfL and various domains of cognition is in support of prior conflicting findings. The Mayo Clinic Study of Aging (MCSA) had a comparable sample size (N = 995) and a similar percentage of participants who were CU or with MCI as our sample [11]. That study showed significant plasma NfL associations with memory, attention, and language, which roughly corresponds to episodic memory, executive function, and verbal fluency in our study. In contrast, we only found a modest association with executive function in the CU group before FDR-correction. The inconsistency may be due to the age difference between the two samples. The median age of the MCSA sample is about 7 years older than VETSA, and the maximum age is about 10 years older: MCSA (median age [interquartile range, IQR]: 75.6 [67.0, 80.9]); VETSA (median age [IQR]: 68.3 [65.5, 69.8]). Because older age is consistently associated with higher plasma NfL level (i.e., neurodegeneration), it is possible that the amount of neurodegeneration in the relatively younger VETSA participants is not sufficient to exert any detectable effect on cognitive function. Indeed, the median level and IQR of plasma NfL in MCSA (median [IQR] 17.0 [11.8, 23.9]) is much higher than that of VETSA (median [IQR] 13.2 [9.0, 15.1]), suggesting a greater degree of neurodegeneration. This explanation is further corroborated by another study (N > 600) that included non-Hispanic white CU participants who are similar to VETSA participants in demographics, and found no reliable association with tasks of memory or language, but with tasks of executive function [14].

It is also worth highlighting that the modest effect sizes of these cross-sectional associations between plasma NfL and cognition might explain some of the observed inconsistencies in previous findings. A large population-based study (N 4500) found that plasma NfL was associated with memory, fluency, and executive function [13]. However, not only was the age range of the study much wider and older than our sample (71.9 ± 7.3 vs. 67.6 ± 2.5), but also the effect sizes of the observed associations in the two studies were small (β< ±0.13) and not significantly different from one another (i.e., overlapping confidence intervals of beta estimates). Thus, it is important to consider effect sizes rather than significance when evaluating the robustness of these associations and comparing the results across studies. Relatedly, although our CU and MCI groups were not significantly different in mean plasma NfL levels, we found a significant plasma NfL x MCI Status interaction effect for processing speed, such that the association was significantly in the MCI group. Moreover, the effect size of the MCI group was significantly stronger than those of the CU group and the full sample (i.e., medium vs. small effect). These findings suggest the need to first identify the cognitive status of each individual and then examine each cognitive group separately for a more precise characterization of the associations in community-dwelling populations.

For neuroimaging measures of gray and white matter, the lack of any associations with plasma NfL except WMH volume, was somewhat unexpected, since NfL has primarily been used as a biomarker of neurodegeneration. Nonetheless, our findings suggest no reliable associations between plasma NfL and macro- or micro-structural measures of neurodegeneration in gray matter or in regions vulnerable to AD. These findings are consistent with prior large-scale studies in community-dwelling samples (N > 400) that failed to detect an association with hippocampal volume [11, 13], mean cortical thickness [13, 18] or AD signature [13]. We additionally tested for an association with cortical MD, an earlier index of microstructural neurodegeneration of gray matter that precedes macrostructural neurodegeneration in thickness and volume [22]. However, plasma NfL was also not associated with this early marker of cortical microstructural neurodegeneration or our MD signature that focused on AD-vulnerable regions.

Considering the fact that NfL is most abundant in myelinated large-caliber axons found in white matter, it is possible, though speculative, that plasma NfL is not as sensitive to neurodegeneration in gray matter as it is to neurodegeneration in white matter. Alternatively, white matter degeneration might appear much earlier than gray matter atrophy in an early old age population and may thus be reflected earlier in plasma NfL levels [13]. A few studies have examined the associations between plasma NfL and white matter integrity (i.e., FA, MD). Although a large community-based study (N > 900, 67.7 ± 6.4 years old) found an association with MD but not FA [13], we did not detect an association with either measure in our relatively younger sample (N 430, 67.6 ± 2.5 years old). It is possible that this conflicting result may be due to age differences, as well as sex differences, as the other study did show that the associations were somewhat stronger in women than in men. However, additional work is needed to examine this possibility. Our only significant finding for white matter was the modest association of plasma NfL with WMH volume, a macrostructural measure of white matter damage or pathology. Previous studies have mostly detected a cross-sectional association between WMH volume and plasma NfL in large community-based samples (N 335 to 742) [7, 13, 62], with two other smaller sample size studies finding no association (N 192 to 341) [11, 12]. Although a direct comparison of effect sizes across the studies is impossible due to insufficient information, the inconsistent result regarding WMH volume is likely due to sample size differences.

Finally, our findings provide unique insights into the associations of plasma NfL with cognition and neuroimaging measures, in that we controlled for the effect of young adult general cognitive ability (GCA) in our models. This variable may be considered an index of an individual’s cognitive reserve prior to age-related cognitive decline [63]. Prior work from our group has shown that young adult GCA or cognitive reserve accounts for roughly 10% of variance in each of the five cognitive domains in later life, whereas years of education only accounts for less than 1% of variance [42]. Therefore, by accounting for the effect of young adult GCA we are able to more precisely examine associations of NfL with later life cognition and neuroimaging measures that are independent of longstanding individual differences in cognitive ability. In addition, it is possible that significant associations in other studies may have been driven largely by the older participants. In contrast, our study avoids this possibility on account of our narrow age range. Notwithstanding the strengths of our study, there are three limitations. First, because our sample only had male participants, we were not able to determine whether there are sex differences in the associations of plasma NfL with cognition and brain. However, our work provides valuable information on the strengths of these associations in male participants, which paves the way for future large-scale work focusing on sex differences. Second, most of our participants are largely non-Hispanic white, which means our results may not be generalizable to other racial/ethnic groups. Third, there may be potential selection bias in our longitudinal study sample, as participants with poor health are likely to drop out over the course of the study. However, this limitation is common to all research studies of aging, including those that begin enrolling at older ages (e.g., > 70 years old), which are subject to similar selection bias.


The present study suggests that higher plasma NfL level is modestly associated with greater volume of macrostructural white matter pathology, but does not reliably associate with cognition, neuroimaging measures of gray matter (thickness, volume, MD), or DTI-based measures of white matter integrity in community-dwelling early old age men. The differential strength of associations between plasma NfL and domains of cognition in those with and without MCI, as this study found for processing speed, indicates that sample composition with respect to cognitive status may explain discrepant findings in the literature. Future studies assessing the utility of NfL for predicting and monitoring brain or cognitive decline will benefit from characterizing how these associations evolve with age or disease progression.

Data availability

The Vietnam Era Twin Study of Aging dataset is publicly available to qualified researchers, with restrictions. Information regarding data access can be found at



Alzheimer’s disease


Armed Forces Qualification Test


Cognitively unimpaired


Diffusion tensor imaging


Fractional anisotropy


General cognitive ability


Mild cognitive impairment


Mean diffusivity


Neurofilament light chain


White matter hyperintensity


  1. Ashton NJ, Hye A, Rajkumar AP, Leuzy A, Snowden S, Suarez-Calvet M, et al. An update on blood-based biomarkers for non-alzheimer neurodegenerative disorders. Nat Rev Neurol. 2020;16(5):265–84.

    Article  PubMed  Google Scholar 

  2. Ashton NJ, Janelidze S, Al Khleifat A, Leuzy A, van der Ende EL, Karikari TK, et al. A multicentre validation study of the diagnostic value of plasma neurofilament light. Nat Commun. 2021;12(1):3400.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. Khalil M, Teunissen CE, Otto M, Piehl F, Sormani MP, Gattringer T, et al. Neurofilaments as biomarkers in neurological disorders. Nat Reviews Neurol. 2018;14(10):577–89.

    Article  CAS  Google Scholar 

  4. Gaetani L, Blennow K, Calabresi P, Di Filippo M, Parnetti L, Zetterberg H. Neurofilament light chain as a biomarker in neurological disorders. J Neurol Neurosurg Psychiatry. 2019;90(8):870–81.

    Article  PubMed  Google Scholar 

  5. Zetterberg H, Skillback T, Mattsson N, Trojanowski JQ, Portelius E, Shaw LM, et al. Association of Cerebrospinal Fluid Neurofilament Light Concentration with Alzheimer Disease Progression. JAMA Neurol. 2016;73(1):60–7.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Ladang A, Kovacs S, Lengele L, Locquet M, Reginster JY, Bruyere O, et al. Neurofilament light chain concentration in an aging population. Aging Clin Exp Res. 2022;34(2):331–9.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Khalil M, Pirpamer L, Hofer E, Voortman MM, Barro C, Leppert D, et al. Serum neurofilament light levels in normal aging and their association with morphologic brain changes. Nat Commun. 2020;11(1):812.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Jung Y, Damoiseaux JS. The potential of blood neurofilament light as a marker of neurodegeneration for Alzheimer’s disease. Brain. 2023:awad267.

  9. He L, Morley JE, Aggarwal G, Nguyen AD, Vellas B, de Souto Barreto P, et al. Plasma neurofilament light chain is associated with cognitive decline in non-dementia older adults. Sci Rep. 2021;11(1):13394.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Beydoun MA, Noren Hooten N, Beydoun HA, Maldonado AI, Weiss J, Evans MK, et al. Plasma neurofilament light as a potential biomarker for cognitive decline in a longitudinal study of middle-aged urban adults. Translational Psychiatry. 2021;11(1):436.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Marks JD, Syrjanen JA, Graff-Radford J, Petersen RC, Machulda MM, Campbell MR, et al. Comparison of plasma neurofilament light and total tau as neurodegeneration markers: associations with cognitive and neuroimaging outcomes. Alzheimers Res Ther. 2021;13(1):199.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Malek-Ahmadi M, Su Y, Ghisays V, Luo J, Devadas V, Chen Y, et al. Plasma NfL is associated with the APOE ε4 allele, brain imaging measurements of neurodegeneration, and lower recall memory scores in cognitively unimpaired late-middle-aged and older adults. Alzheimers Res Ther. 2023;15(1):74.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. van Arendonk J, Wolters FJ, Neitzel J, Vinke EJ, Vernooij MW, Ghanbari M et al. Plasma neurofilament light chain in relation to 10-year change in cognition and neuroimaging markers: a population-based study. Geroscience. 2023.

  14. O’Bryant S, Petersen M, Hall J, Johnson L, Yaffe K, Braskie M, et al. Characterizing plasma NfL in a community-dwelling multi-ethnic cohort: results from the HABLE study. Alzheimers Dement. 2022;18(2):240–50.

    Article  PubMed  Google Scholar 

  15. Mattsson N, Andreasson U, Zetterberg H, Blennow K. Alzheimer’s Disease Neuroimaging I. Association of plasma neurofilament light with neurodegeneration in patients with Alzheimer Disease. JAMA Neurol. 2017;74(5):557–66.

    Article  PubMed  PubMed Central  Google Scholar 

  16. Gustavson DE, Elman JA, Sanderson-Cimino M, Franz CE, Panizzon MS, Jak AJ, et al. Extensive memory testing improves prediction of progression to MCI in late middle age. Alzheimers Dement (Amst). 2020;12(1):e12004.

    PubMed  Google Scholar 

  17. Jonaitis EM, Koscik RL, Clark LR, Ma Y, Betthauser TJ, Berman SE, et al. Measuring longitudinal cognition: individual tests versus composites. Alzheimers Dement (Amst). 2019;11:74–84.

    Article  PubMed  Google Scholar 

  18. Dittrich A, Ashton NJ, Zetterberg H, Blennow K, Simren J, Geiger F, et al. Plasma and CSF NfL are differentially associated with biomarker evidence of neurodegeneration in a community-based sample of 70-year-olds. Alzheimers Dement (Amst). 2022;14(1):e12295.

    Article  PubMed  Google Scholar 

  19. Mielke MM, Syrjanen JA, Blennow K, Zetterberg H, Vemuri P, Skoog I, et al. Plasma and CSF neurofilament light: relation to longitudinal neuroimaging and cognitive measures. Neurology. 2019;93(3):e252–60.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Andersson E, Janelidze S, Lampinen B, Nilsson M, Leuzy A, Stomrud E, et al. Blood and cerebrospinal fluid neurofilament light differentially detect neurodegeneration in early Alzheimer’s disease. Neurobiol Aging. 2020;95:143–53.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Sun Y, Tan L, Xu W, Wang ZT, Hu H, Li JQ, et al. Plasma neurofilament light and longitudinal progression of White Matter Hyperintensity in Elderly persons without Dementia. J Alzheimers Dis. 2020;75(3):729–37.

    Article  PubMed  Google Scholar 

  22. Weston PS, Simpson IJ, Ryan NS, Ourselin S, Fox NC. Diffusion imaging changes in grey matter in Alzheimer’s disease: a potential marker of early neurodegeneration. Alzheimers Res Ther. 2015;7(1):47.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Kremen WS, Franz CE, Lyons MJ. Current status of the Vietnam era Twin Study of Aging (VETSA). Twin Res Hum Genet. 2019;22(6):783–7.

    Article  PubMed  Google Scholar 

  24. Kremen WS, Franz CE, Lyons MJ. VETSA: the Vietnam era Twin Study of Aging. Twin Res Hum Genet. 2013;16(1):399–402.

    Article  PubMed  Google Scholar 

  25. Schoenborn CA, Heyman KM. Health characteristics of adults aged 55 years and over: United States, 2004–2007. Natl Health Stat Rep. 2009(16):1–31.

  26. Tang R, Panizzon MS, Elman JA, Gillespie NA, Hauger RL, Rissman RA, et al. Association of neurofilament light chain with renal function: mechanisms and clinical implications. Alzheimers Res Ther. 2022;14(1):189.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Elman JA, Jak AJ, Panizzon MS, Tu XM, Chen T, Reynolds CA, et al. Underdiagnosis of mild cognitive impairment: a consequence of ignoring practice effects. Alzheimers Dement (Amst). 2018;10:372–81.

    Article  PubMed  Google Scholar 

  28. Golden CJ. Stroop Color and Word Test. Multi-Health Systems; 2003.

  29. Delis DC, Kaplan E, Kramer JH. Delis-Kaplan executive function system (D-KEFS). San Antonio, TX: Psychological Corporation; 2001.

    Google Scholar 

  30. Wechsler D. Wechsler Memory Scale (WMS-III). San Antonio, TX: Psychological Corporation; 1997.

    Google Scholar 

  31. Daneman M, Merikle PM. Working memory and language comprehension: a meta-analysis. Psychonomic Bull. 1980;3:422–33.

    Article  Google Scholar 

  32. Gustavson DE, Panizzon MS, Franz CE, Friedman NP, Reynolds CA, Jacobson KC, et al. Genetic and environmental architecture of executive functions in midlife. Neuropsychology. 2018;32(1):18–30.

    Article  PubMed  Google Scholar 

  33. Delis DC, Kramer JH, Kaplan E, Ober BA. California Verbal Learning Test (CVLT-2). 2nd ed. San Antonio, TX: Psychological Corporation; 2000.

    Google Scholar 

  34. Kremen WS, Panizzon MS, Franz CE, Spoon KM, Vuoksimaa E, Jacobson KC, et al. Genetic complexity of episodic memory: a twin approach to studies of aging. Psychol Aging. 2014;29(2):404–17.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Gustavson DE, Panizzon MS, Elman JA, Franz CE, Beck A, Reynolds CA, et al. Genetic and environmental influences on Verbal Fluency in Middle Age: a longitudinal twin study. Behav Genet. 2018;48(5):361–73.

    Article  PubMed  PubMed Central  Google Scholar 

  36. Nesselroade J. Methods in the study of life-span human development: issues and answers. Biology Cognition Methods life-span. 2010;1:36–55.

    Google Scholar 

  37. Sanderson-Cimino M, Panizzon MS, Elman JA, Gustavson DE, Franz CE, Reynolds CA, et al. Genetic and environmental architecture of processing speed across midlife. Neuropsychology. 2019;33(6):862–71.

    Article  PubMed  PubMed Central  Google Scholar 

  38. Ekstrom RB, French JW, Harman HH, Dermen D. Manual for kit of factor-referenced cognitive tests: 1976. Princeton N.J.: Education Testing Service Princeton N.J.; 1976.

    Google Scholar 

  39. Thurstone LL. A Factorial Study of Perception. Chicago: University of Chicago Press; 1944.

    Google Scholar 

  40. Uhlaner JE, Bolanovich DJ. Development of the Armed Forces Qualification Test and predecessor army screening tests, 1946–1950. Pay Research Bureau (PRB), Report. (1952), Article AD0000191. 1952.

  41. Hatton SN, Franz CE, Elman JA, Panizzon MS, Hagler DJ Jr., Fennema-Notestine C, et al. Negative fateful life events in midlife and advanced predicted brain aging. Neurobiol Aging. 2018;67:1–9.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Kremen WS, Beck A, Elman JA, Gustavson DE, Reynolds CA, Tu XM, et al. Influence of young adult cognitive ability and additional education on later-life cognition. Proc Natl Acad Sci U S A. 2019;116(6):2021–6.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Fennema-Notestine C, McEvoy LK, Notestine R, Panizzon MS, Yau W-YW, Franz CE, et al. White matter disease in midlife is heritable, related to hypertension, and shares some genetic influence with systolic blood pressure. NeuroImage: Clin. 2016;12:737–45.

    Article  PubMed  Google Scholar 

  44. Eyler LT, Chen CH, Panizzon MS, Fennema-Notestine C, Neale MC, Jak A, et al. A comparison of heritability maps of cortical surface area and thickness and the influence of adjustment for whole brain measures: a magnetic resonance imaging twin study. Twin Res Hum Genet. 2012;15(3):304–14.

    Article  PubMed  PubMed Central  Google Scholar 

  45. McEvoy LK, Fennema-Notestine C, Roddey JC, Hagler DJ Jr., Holland D, Karow DS, et al. Alzheimer disease: quantitative structural neuroimaging for detection and prediction of clinical and structural changes in mild cognitive impairment. Radiology. 2009;251:195–205.

    Article  PubMed  PubMed Central  Google Scholar 

  46. McEvoy LK, Holland D, Hagler DJ Jr., Fennema-Notestine C, Brewer JB, Dale AM. Mild cognitive impairment: baseline and longitudinal structural MR imaging measures improve predictive prognosis. Radiology. 2011;259:834–43.

    Article  PubMed  PubMed Central  Google Scholar 

  47. Maes F, Collignon A, Vandermeulen D, Marchal G, Suetens P. Multimodality image registration by maximization of mutual information. IEEE Trans Med Imaging. 1997;16:187–98.

    Article  CAS  PubMed  Google Scholar 

  48. Sled JG, Zijdenbos AP, Evans AC. A nonparametric method for automatic correction of intensity nonuniformity in MRI data. IEEE Trans Med Imaging. 1998;17:87–97.

    Article  CAS  PubMed  Google Scholar 

  49. Avants BB, Tustison NJ, Stauffer M, Song G, Wu B, Gee JC. The insight ToolKit image registration framework. Front Neuroinform. 2014;8:44.

    Article  PubMed  PubMed Central  Google Scholar 

  50. Scott DW. Parametric statistical modeling by minimum integrated square error. Technometrics. 2001;43(3):274–85.

    Article  Google Scholar 

  51. Yoo TS, Ackerman MJ, Lorensen WE, Schroeder W, Chalana V, Aylward S et al. Engineering and algorithm design for an image processing Api: a technical report on ITK–the Insight Toolkit. Stud Health Technol Inform. 2002;85:586 – 92.

  52. McLachlan GJ. Discriminant analysis and statistical pattern recognition. Wiley; 2005.

  53. Valdés Hernández MC, Piper RJ, Bastin ME, Royle NA, Maniega SM, Aribisala BS, et al. Morphologic, distributional, volumetric, and intensity characterization of periventricular hyperintensities. AJNR Am J Neuroradiol. 2014;35(1):55–62.

    Article  PubMed  Google Scholar 

  54. Vuoksimaa E, Panizzon MS, Hagler DJ Jr., Hatton SN, Fennema-Notestine C, Rinker D, et al. Heritability of white matter microstructure in late middle age: a twin study of tract-based fractional anisotropy and absolute diffusivity indices. Hum Brain Mapp. 2017;38(4):2026–36.

    Article  PubMed  Google Scholar 

  55. Elman JA, Panizzon MS, Hagler DJ Jr., Fennema-Notestine C, Eyler LT, Gillespie NA, et al. Genetic and environmental influences on cortical mean diffusivity. NeuroImage. 2017;146:90–9.

    Article  PubMed  Google Scholar 

  56. Hagler DJ Jr., Ahmadi ME, Kuperman J, Holland D, McDonald CR, Halgren E, et al. Automated white-matter tractography using a probabilistic diffusion tensor atlas: application to temporal lobe epilepsy. Hum Brain Mapp. 2009;30(5):1535–47.

    Article  PubMed  Google Scholar 

  57. Gillespie NA, Neale MC, Hagler DJ Jr., Eyler LT, Fennema-Notestine C, Franz CE, et al. Genetic and environmental influences on mean diffusivity and volume in subcortical brain regions. Hum Brain Mapp. 2017;38(5):2589–98.

    Article  PubMed  PubMed Central  Google Scholar 

  58. Williams ME, Elman JA, McEvoy LK, Andreassen OA, Dale AM, Eglit GML, et al. 12-year prediction of mild cognitive impairment aided by Alzheimer’s brain signatures at mean age 56. Brain Commun. 2021;3(3):fcab167.

    Article  PubMed  PubMed Central  Google Scholar 

  59. Lyons MJ, York TP, Franz CE, Grant MD, Eaves LJ, Jacobson KC, et al. Genes determine stability and the environment determines change in cognitive ability during 35 years of adulthood. Psychol Sci. 2009;20(9):1146–52.

    Article  PubMed  Google Scholar 

  60. Charlson M, Szatrowski TP, Peterson J, Gold J. Validation of a combined comorbidity index. J Clin Epidemiol. 1994;47(11):1245–51.

    Article  CAS  PubMed  Google Scholar 

  61. Li J, Ji L. Adjusting multiple testing in multilocus analyses using the eigenvalues of a correlation matrix. Heredity. 2005;95(3):221–7.

    Article  CAS  PubMed  Google Scholar 

  62. Rajan KB, Aggarwal NT, McAninch EA, Weuve J, Barnes LL, Wilson RS, et al. Remote blood biomarkers of longitudinal cognitive outcomes in a Population Study. Ann Neurol. 2020;88(6):1065–76.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  63. Kremen WS, Elman JA, Panizzon MS, Eglit GML, Sanderson-Cimino M, Williams ME et al. Cognitive Reserve and related constructs: a unified Framework Across Cognitive and Brain dimensions of Aging. Front Aging Neurosci. 2022;14.

Download references


The U.S. Department of Veterans Affairs, Department of Defense; National Personnel Records Center, National Archives and Records Administration; National Opinion Research Center; National Research Council, National Academy of Sciences; and the Institute for Survey Research, Temple University provided invaluable assistance in the creation of the Vietnam Era Twin Registry (VETR). The Cooperative Studies Program of the U.S. Department of Veterans Affairs provided financial support for development and maintenance of the VETR. We would also like to acknowledge the continued cooperation and participation of the members of the VETR and their families.


This work was supported by the National Institute on Aging at the National Institutes of Health grant numbers R01s AG050595 to WSK, CEF, MJL, AG022381, AG076838 and AG064955 to WSK, P01 AG055367 and R01 AG037985 to CEF, R01s AG058252, AG073979, AG051848 to RAR, and biomarker core funds to RAR from AG057437 (USC ACTC), AG010483 (UCSD ADCS) and AG062429 (UCSD ADRC), and K01 AG063805 to JAE.

Author information

Authors and Affiliations



R.T., J.A.E., C.E.F. and W.S.K. contributed to the conception and design of the study; R.T., E.B., A.M.D., C.F., D.J.H., M.S.P., J.A.E., N.A.G., R.A.R., M.L.J., M.C.N., O.K.P., C.A.R., C.E.F. and W.S.K. contributed to the acquisition and analysis of data; R.T., M.S.P., J.A.E., C.F., C.E.F. and W.S.K. contributed to drafting the manuscript or preparing the figures. R.T., E.B., A.M.D., C.F., D.J.H., M.S.P., J.A.E., N.A.G., R.A.R., M.L.J., M.C.N., O.K.P., C.A.R., C.E.F. and W.S.K. contributed to the editing and revising of the manuscript; All authors reviewed the manuscript.

Corresponding author

Correspondence to Rongxiang Tang.

Ethics declarations

Ethics approval and consent to participate

For the VETSA dataset, full written informed consents were obtained from all participants or their authorized representatives and both studies were conducted in accordance with the ethical standards of the Declaration of Helsinki and the institutional review boards at University of California, San Diego and Boston University approved all study protocols.

Consent for publication

Not applicable.

Competing interests

Anders M. Dale, is a founder and holds equity in CorTechs Laboratories, Inc. and also serves on its Scientific Advisory Board. He is a member of the Scientific Advisory Board of Human Longevity, Inc. and receives funding through research agreements with General Electric Healthcare and Medtronic, Inc. The terms of this arrangement have been reviewed and approved by the University of California, San Diego, in accordance with its conflict of interest policies.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Tang, R., Buchholz, E., Dale, A.M. et al. Associations of plasma neurofilament light chain with cognition and neuroimaging measures in community-dwelling early old age men. Alz Res Therapy 16, 90 (2024).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: