Skip to main content

Identifying clinically useful biomarkers in neurodegenerative disease through a collaborative approach: the NeuroToolKit



Alzheimer’s disease (AD) is a complex and heterogeneous disease, which requires reliable biomarkers for diagnosis and monitoring disease activity. Preanalytical protocol and technical variability associated with biomarker immunoassays makes comparability of biomarker data across multiple cohorts difficult. This study aimed to compare cerebrospinal fluid (CSF) biomarker results across independent cohorts, including participants spanning the AD continuum.


Measured on the NeuroToolKit (NTK) prototype panel of immunoassays, 12 CSF biomarkers were evaluated from three cohorts (ALFA+, Wisconsin, and Abby/Blaze). A correction factor was applied to biomarkers found to be affected by preanalytical procedures (amyloid-β1–42, amyloid-β1–40, and alpha-synuclein), and results between cohorts for each disease stage were compared. The relationship between CSF biomarker concentration and cognitive scores was evaluated.


Biomarker distributions were comparable across cohorts following correction. Correlations of biomarker values were consistent across cohorts, regardless of disease stage. Disease stage differentiation was highest for neurofilament light (NfL), phosphorylated tau, and total tau, regardless of the cohort. Correlation between biomarker concentration and cognitive scores was comparable across cohorts, and strongest for NfL, chitinase-3-like protein-1 (YKL40), and glial fibrillary acidic protein.


The precision of the NTK enables merging of biomarker datasets, after correction for preanalytical confounders. Assessment of multiple cohorts is crucial to increase power in future studies into AD pathogenesis.


High-quality, reliable, well-validated biomarkers, reflective of biological processes are required to understand the complexity of neurodegenerative diseases, such as Alzheimer’s disease (AD) [1]. In AD, using biomarkers would allow for increasing diagnostic accuracy, guiding patient stratification, monitoring the effect of treatment on underlying pathologies, and providing surrogate measures of disease activity to monitor and evaluate outcomes [2]. Comparability of cerebrospinal fluid (CSF) biomarkers between studies has been limited, partially due to methodology differences across cohorts. This issue has been somewhat circumvented by fully automated assays used in research and standardization of preanalytical procedures [3, 4]. Importantly, some CSF biomarker immunoassays (amyloid-β1–42 [Aβ42], phosphorylated 181P tau [pTau], and total tau [tTau]) are well validated for future wide-spread use in the clinical setting [5,6,7]. However, the challenge remains to have standardized clinical endpoints, statistical approaches, and immunoassay platforms that would enable unified biomarker utility and interpretation of results across independent multicenter studies.

The NeuroToolKit (NTK; Roche Diagnostics International Ltd) is a panel of 12 automated CSF immunoassays for biomarkers linked to neurodegeneration [8, 9]. This panel is designed to accelerate biomarker development in AD and other neurological disorders by generating robust, comparable, high-quality biomarker data across multiple research and clinical cohorts.

Using CSF biomarker data collected from participating sites, we aimed to address three prioritized research questions: (i) Comparative: Can a correction factor for biomarkers affected by different preanalytical procedures be applied that allows for comparison across multiple cohorts? (ii) Diagnostic: How much do the biomarker concentrations vary between cognitively unimpaired (CU) individuals and patients with mild cognitive impairment (MCI) or AD-dementia? (iii) Clinical: How well do biomarker concentrations correlate with clinical measures of cognition?


This analysis utilizes data from three cohorts participating in the NTK project, which were selected to provide data spanning the entire AD continuum. The ALFA+ study (NCT02485730) aimed to characterize preclinical AD in CU individuals, most with a family history of AD (n=398) [8]. The Wisconsin cohort (n=651) comprised several longitudinal studies that utilized the same preanalytic protocol and included CU individuals, participants with MCI, or AD-dementia, enriched for parental history of AD [10]. The Abby/Blaze cohort (n=164) comprised participants in the ABBY (NCT01343966) and the BLAZE (NCT01397578) studies for patients with mild to moderate AD-dementia [11, 12]. Full eligibility criteria for each of the respective cohorts are described in the Supplementary Methods. All cohorts in the present analysis excluded participants who had comorbidities that would affect cognition. Some medications that affected cognition, such as sleep aids, were permitted in the Wisconsin cohort.

For the purposes of this analysis, the correction reference group for each cohort was defined as participants who were CU, APOE-ε4 allele non-carriers, and aged <65 years. As the Abby/Blaze cohort only included participants with AD-dementia, a correction reference group could not be defined.


CSF biomarkers included chitinase-3-like protein-1 (YKL40), soluble triggering receptor expressed on myeloid cells 2 (sTREM2), glial fibrillary acidic protein (GFAP), interleukin (IL)-6, neurofilament light (NfL), neurogranin, S100, alpha-synuclein (α-Syn), amyloid-β1–40 (Aβ40), Aβ42, pTau, and tTau. CSF biomarker samples obtained at baseline/enrollment were included. All biomarkers were measured using the NTK panel of immunoassays, which currently includes the commercially available Elecsys β-amyloid (1–42) CSF, Elecsys total Tau CSF, and Elecsys phospho-Tau (181P) CSF immunoassays, and robust prototype assays for the nine remaining biomarkers. Biomarkers Aβ42, Aβ40, pTau, tTau, s100, and IL-6 were measured using the cobas e 601 analyzer, and the remaining biomarkers were measured using the cobas e 411 analyzer (both Roche Diagnostics International Ltd).

Preanalytical factor correction

The preanalytical procedures employed by each cohort are detailed in the Supplementary Materials. Sample collection within the Wisconsin cohort was initiated ahead of standardized preanalytical protocol dissemination [9]; therefore, the correction factors are calculated in the respective correction reference groups (participants who were CU, APOE-ε4 allele non-carriers, and aged <65 years) of the Wisconsin and ALFA+ cohorts assuming the ALFA+ cohort being the “standard cohort.” The correction factor was calculated using the formula:

$$\textrm{Correction}\ \textrm{factor}=\textrm{median}\left(\textrm{ALFA}+\textrm{cohort}\right)/\textrm{median}\left(\textrm{Wisconsin}\ \textrm{cohort}\right)$$

Application of the correction factor was deemed successful in accounting for preanalytical variations by assessment of biomarker distribution overlap before and after correction, i.e., if following correction the biomarker distributions had good overlap, the correction was a success. The correction was applied to CSF biomarkers: α-Syn, Aβ40, and Aβ42 (Table S1), which are known to be significantly affected by preanalytical protocols, specifically related to the ability of these biomarkers to stick to the tubes used during testing [4, 13]. Conversely, the remaining biomarkers, such as pTau and tTau, appear to be unaffected by the tubes employed [4]. Natural variations between the cohorts were unaffected. Variations between cohorts may also result from inherent cohort differences or from cultural bias, e.g., in cognitive assessments.

CSF amyloid-β cut-off value derivation

Amyloid-β pathology was determined by CSF Aβ42/Aβ40 ratio for this analysis; the results are provided in the Supplementary Materials. To derive the cut-off values for the CSF Aβ42/Aβ40 ratio, Gaussian mixture modeling was independently applied to the ALFA+ and Wisconsin cohorts. The optimal number of Gaussians was set as two, after testing models with two, three, and four Gaussians (Supplementary Materials). Derived cut-off values were defined as x±2*s with differently defined parameters for x and s: (i) x=μ, s=σ (mu, sigma; Gaussian parameters of the amyloid-β negative [A−] population); (ii) x=mean, s=SD of samples assigned to A− population; and (iii) x=median, s=rSD of samples assigned to A− population (Tables S2–S5). The resulting cut-off values for the Aβ42/Aβ40 ratio were defined as 0.071 for the ALFA+ cohort [8] and 0.060 (0.075 after correction) for the Wisconsin cohort. Only patients with AD-dementia were included in the ABBY and BLAZE studies; therefore, cut-off values were not defined as this cohort was not divided by amyloid-β status. For comparison, the cut-off values determined for the ALFA+ cohort were applied to the Wisconsin cohort after correction.

Cognitive assessments

All participants completed the MMSE [14] and Clinical Dementia Rating scale Sum of Boxes (CDR-SB) [15] cognitive assessments during the respective studies. The time between CSF biomarker collection at baseline/enrollment and cognitive assessment varied for each participant and in some cases was up to 1 year. For the longitudinal studies, the cognitive assessment closest to the first lumbar puncture was used in this analysis, including those cognitive assessments performed before the lumbar puncture. The ALFA+ cohort only included participants with CDR-SB=0, per the study exclusion criteria [16]. Calculations of a modified Preclinical Alzheimer Cognitive Composite (PACC) were based on methods proposed by Donohue et al. [17], Papp et al. [18], and Jonaitis et al. [19]. Variables included in the composite in the ALFA+ cohort were Semantic Fluency (animal naming), Free and Cued Selective Reminding Test with Total Immediate Recall, and Wechsler Adult Intelligence Scale-Revised Coding subtest. In the Wisconsin cohort, Semantic Fluency (animal naming), Rey Auditory Verbal Learning Test Trials 1–5 Sum, and Wechsler Adult Intelligence Scale-Revised Coding subtest were included. PACC was not used for the Abby/Blaze cohort.

Statistical analyses

To compare biomarker concentrations across cohorts, the median concentration and interquartile range of all NTK CSF biomarkers before and after correction were calculated for all cohorts, therefore enabling the inclusion of outlying samples. The robust-to-outliers standard deviation (rSD) was estimated based on percentile values (rSD=[value of 84.13% percentile − value of 15.87% percentile]/2). The distributions of the CSF biomarker concentrations within the same disease stage across the cohorts were statistically compared before and after correction. To compare baseline/enrollment values for each CSF biomarker for both CU individuals and patients with AD-dementia separately, correlation values were computed using Spearman’s rho.

Fold change was calculated using the canonical fold change calculation in CSF biomarker concentrations in CU A− individuals from either the ALFA+ or Wisconsin cohorts with (i) CU amyloid-β-positive (A+) individuals (ALFA+), (ii) CU A+ individuals (Wisconsin), (iii) patients with MCI A+ (Wisconsin), (iv) patients with AD-dementia (Wisconsin), and (v) patients with AD-dementia (Abby/Blaze). Receiver operating characteristic (ROC) analyses, presented with an area under the curve (AUC) and 95% confidence intervals, comparing CSF biomarker concentrations in CU A− individuals with (i) CU A+ individuals (ALFA+), (ii) CU A+ individuals (Wisconsin), (iii) patients with MCI A+ (Wisconsin), and (iv) and patients with AD-dementia (Wisconsin) were performed.

Spearman’s rho correlation between the concentration of all the biomarkers and cognitive performance, reflected in MMSE and/or PACC scores, in the different disease stages was computed. To assess cognitive scores from each cohort on a similar scale relative to that cohort’s control participants, standardization of each individual raw score into z-scores was performed using the means and rSDs obtained from the A− samples of each control group as a reference. All three obtained z-scores were averaged. The obtained PACC values were re-standardized using the mean and rSD from the A− samples of the control group. Missing data in any of the raw scores led to a missing PACC value.


No differences were found between cohorts in age, sex, years of education, MMSE score, or APOE-ε4 carriership status within the different disease stages (Table 1). Prior to correction for preanalytical protocol differences, concentrations of α-Syn, Aβ40, and Aβ42 were significantly higher in CU A− individuals in the ALFA+ cohort compared with the Wisconsin cohort. Biomarker distributions for the ALFA+ and Wisconsin cohorts (Fig. S1) illustrate that biomarker concentrations were comparable between cohorts in the reference groups following correction. CSF biomarkers GFAP, IL-6, S100, pTau, and pTau/Aβ42 were significantly lower in the ALFA+ cohort compared with the Wisconsin cohort in CU A− individuals (Table 2). These differences may arise from intrinsic cohort differences or from measurement bias; therefore, the raw values of these biomarkers cannot be directly compared between cohorts, but the trends within cohorts can be compared with each other. For the remaining biomarkers, there were no significant differences between cohorts. The details of all the CSF biomarker concentrations, including before and after correction for those affected, across all cohorts and disease stages are reported in Table 2. Biomarker concentration distributions before and after correction represented as boxplots are shown in Figs. 1 and 2. Classification of amyloid-β status, including the use of a single cut-off value for both the ALFA+ and Wisconsin cohorts, had no impact on the comparability of biomarker concentrations across cohorts (Table S6, Table S7).

Table 1 Characterization of cohorts (amyloid status as defined by Aβ42/Aβ40 ratio)
Table 2 Biomarker data for each cohort, including uncorrected and corrected data (amyloid status as defined by Aβ42/Aβ40 ratio)
Fig. 1
figure 1

Box plots of uncorrected biomarkers by disease stage and amyloid-β status as defined by Aβ42/Aβ40 ratio. Aβ40, amyloid-β1–40; Aβ42, amyloid-β1–42; AD, Alzheimer’s disease; CU, cognitively unimpaired; GFAP, glial fibrillary acidic protein; MCI, mild cognitive impairment; NfL, neurofilament light; pTau, phosphorylated tau; tTau, total tau; YKL40, chitinase-3-like protein-1

Fig. 2
figure 2

Box plots of biomarkers by disease stage and amyloid-β status as defined by Aβ42/Aβ40 ratio; A uncorrected values and B corrected values. Aβ40, amyloid-β1–40; Aβ42, amyloid-β1–42; AD, Alzheimer’s disease; CU, cognitively unimpaired; MCI, mild cognitive impairment; pTau, phosphorylated tau

Of the biomarker values that did not undergo correction, YKL40, GFAP, NfL, neurogranin, pTau, and tTau were all increased in patients with AD-dementia or MCI who were A+ compared with CU individuals and patients with MCI who were A−. Both α-Syn and the pTau/Aβ42 ratio values were increased in patients with AD-dementia or MCI who were A+. Values for Aβ42 and the Aβ42/Aβ40 ratio were decreased in patients with AD-dementia or MCI. These results applied to both the corrected and uncorrected values. Correlations of CSF biomarkers were comparable across cohorts within the same disease stage (Supplementary Results, Fig. S2).

Diagnostic variations

The fold change in CSF biomarker concentration compared with CU A− individuals was comparable across cohorts (Tables S8–S10). Across all cohorts, the fold change showed a similar pattern regardless of whether it was calculated using values from the CU A− individuals group from the same cohort or from a different cohort. NfL, pTau, and tTau concentrations were higher in patients with AD-dementia, followed by patients with MCI, compared with CU individuals. Figure 3 shows the fold change displayed as a forest plot of the biomarker concentration compared with the CU A− individual group (derived from either the Wisconsin cohort or the ALFA+ cohort).

Fig. 3
figure 3

Fold change vs CU, Aβ42/Aβ40 amyloid-β negative with age <65 years. Aβ40, amyloid-β1–40; Aβ42, amyloid-β1–42; AD, Alzheimer’s disease; CU, cognitively unimpaired; GFAP, glial fibrillary acidic protein; IL, interleukin; MCI, mild cognitive impairment; NfL, neurofilament light; pTau, phosphorylated tau; sTREM2, soluble triggering receptor expressed on myeloid cells 2; tTau, total tau; YKL40, chitinase-3-like protein-1

ROC analyses (Fig. S3) and AUC data (Fig. 4 and Table S11) confirmed the results shown by fold change. Among the CSF biomarkers evaluated, NfL (followed by YKL40, GFAP, Aβ42, pTau, and tTau) had the greatest diagnostic value for discriminating CU A− individuals from patients with MCI or AD-dementia. The results were not affected by amyloid-β status as Aβ42 showed the greatest ability of all the biomarkers measured to discriminate CU A− individuals from CU A+ individuals in both the ALFA+ and Wisconsin cohorts.

Fig. 4
figure 4

Forest plot of ROC analyses AUC with 95% confidence intervals (amyloid status as defined by Aβ42/Aβ40 ratio). Aβ40, amyloid-β1–40; Aβ42, amyloid-β1–42; AD, Alzheimer’s disease; AUC, area under the curve; CI, confidence interval; CU, cognitively unimpaired; GFAP, glial fibrillary acidic protein; IL, interleukin; MCI, mild cognitive impairment; NfL, neurofilament light; pTau, phosphorylated tau; ROC, receiver operating characteristics; sTREM2, soluble triggering receptor expressed on myeloid cells 2; tTau, total tau; YKL40, chitinase-3-like protein-1

Correlation with clinical measures of cognition

The correlation between CSF biomarker concentration and cognitive scores (MMSE and/or PACC) appears comparable across cohorts. The correlation of the biomarker concentration and the cognitive scores in the different disease stages and cohorts is shown as a forest plot in Fig. 5. For CU A+ individuals, the strongest correlation among different biomarker concentrations and PACC score was found in YKL40, GFAP, and NfL. For patients with AD-dementia (Wisconsin and Abby/Blaze cohorts), the strongest correlation between biomarker concentrations and MMSE was found in NfL.

Fig. 5
figure 5

Baseline correlation between NTK biomarkers and cognitive scores (amyloid status as defined by Aβ42/Aβ40 ratio). Aβ40, amyloid-β1–40; Aβ42, amyloid-β1–42; AD, Alzheimer’s disease; CI, confidence interval; CU, cognitively unimpaired; GFAP, glial fibrillary acidic protein; IL, interleukin; MCI, mild cognitive impairment; NfL, neurofilament light; NTK, NeuroToolKit; PACC, Preclinical Alzheimer Cognitive Composite; pTau, phosphorylated tau; sTREM2, soluble triggering receptor expressed on myeloid cells 2; tTau, total tau; YKL40, chitinase-3-like protein-1


The NTK project involves the evaluation of a large panel of CSF biomarkers of AD pathology and glial activity that may address the goals outlined in the introduction for diagnosis, prognosis, predicting treatment response, and surrogate outcomes in AD. Following recently published studies describing the utility of the NTK immunoassays in clinical settings [8, 9, 20], this analysis demonstrates that the application of a correction factor for biomarkers affected by preanalytical variability improves the comparability of NTK data across three independent cohorts spanning the AD continuum. After establishing the adequacy of the correction factor, we examined the cohorts for insights regarding biomarker relationships across the disease stages. In response to the three prioritized research questions discussed in the introduction, this analysis provided the following answers.

Comparative: The correction factor for preanalytical protocol variations employed here enabled the comparison of data across cohorts, rendering significant differences in CSF biomarker concentrations irrelevant. The correction factor was not developed specifically for the present cohorts and is generalizable, which enables the introduction of further cohorts into the comparative framework. Comparability of CSF biomarker data acquired with the NTK was found to be robust to methods of amyloid-β status classification, suggesting the possibility of introducing future cohorts with or without having calculated their own cut-off values. Further investigation with additional cohorts is needed. Correlations between all CSF biomarkers were consistent for CU individuals in both the ALFA+ and Wisconsin cohorts and for patients with AD-dementia in both the Wisconsin and Abby/Blaze cohorts. These results indicate that the data produced by the NTK were comparable across cohorts, meaning results and subsequent analyses may be combined.

Diagnostic: CSF biomarker differences for CU A− individuals compared with patients with AD-dementia, were comparable across cohorts, regardless of the CU A− individual cohort origin. The fold change for CU A− individuals compared with patients with MCI and patients with AD-dementia was largest for NfL, pTau, and tTau. The diagnostic utility of the aforementioned biomarkers was confirmed with ROC and AUC analyses, suggesting these biomarkers, and Aβ42, are the most appropriate for disease stage differentiation.

Clinical: Correlations of CSF biomarker concentrations with cognitive score were comparable across cohorts. Associations between biomarker concentrations and clinical cognitive scores have been the focus of several studies to validate measures of cognition but only few studies include a panel of CSF biomarkers [21,22,23,24,25]. Correlations between CSF biomarker concentrations and cognitive scores are indicative of the prognostic utility of the biomarker in question [21]. Here, we found that clinical cognitive scores had the strongest correlation with NfL, in the CU A+ and AD-dementia disease stages. For CU A− individuals, NfL correlated with the PACC score in the Wisconsin cohort, but less so in the ALFA+ cohort. Our findings on the moderate, inverse correlation of NfL with clinical measures of cognition are consistent with the literature [26] indicating markers of neurodegeneration track with cognition. No strong correlations of biomarkers for patients with MCI A−/A+ were observed.

Consistently across the three prioritized research questions examining correlations with biomarker concentrations, NfL is the most promising CSF biomarker for AD along with the well-established Aβ42, pTau, and tTau. No statistically significant differences in NfL concentration between cohorts were observed within disease stage. Additionally, NfL differentiated disease stage and correlated with measures of cognition. The observation of NfL differentiation between disease stages is consistent with recent literature, which includes the correlation of NfL with imaging markers of neurodegeneration [27]. However, NfL is not a specific biomarker for AD, but rather for several neuroinflammatory diseases [28,29,30]. Analyzing the biomarker results of NfL within the context of other biomarkers of neuroinflammation, glial activity, and known AD pathology is key to the utility of NfL within the AD landscape. As such, the NTK panel provides clinical utility from both the biomarkers at an individual level and as a whole. In the current study, NfL did not strongly correlate with the other biomarkers included in the NTK; these results are in contrast to the literature, which describes NfL correlations with (plasma) GFAP [31], pTau, tTau, and neurogranin [26].

In the AD field, vast amounts of biomarker data are generated; however, comparability between biomarker datasets is not routinely employed due to the heterogeneity of the disease and the technical variability associated with preanalytical protocols and the immunoassay platforms used. Several pre-competitive efforts to generate comparable datasets, harmonize and maximize the interpretability of the biomarker results collected, such as the NTK project and the Global Biomarker Standardization Consortium (GBSC; a collaborative effort in the acceleration of biomarker standardization [32]), are ongoing. Measures have also been taken to standardize methods of CSF collection and measurement to ensure reproducible and consistent results across multiple cohorts and immunoassays [3, 4]. The aim is to clarify the clinical utility of biomarkers to inform clinical trial design and for diagnostic development.

To our knowledge, the NTK project is the first large-scale project that aims to generate robust and comparable biomarker data across multiple independent cohorts in AD. While cross-cohort examinations of datasets have been employed for single biomarker validation studies [33], a project with the number of biomarkers described in this analysis is uncommon. Other large-scale studies investigating the utility of biomarkers at various disease stages include the Alzheimer’s Disease Neuroimaging Initiative (ADNI) [34] and the Australian Imaging, Biomarkers and Lifestyle (AIBL) study of aging [35]. However, these studies do not include multiple cohorts and are geographically restricted, and the biomarkers are measured using multiple immunoassay platforms. The NTK project seeks to maximize the comparability of biomarker measurements between cohorts by generating and analyzing biomarker data using a common platform and immunoassay.

A key strength of this study is the large dataset included, spanning several countries and native languages. Importantly, the cloud-based DRE employed in the NTK project allows for an increase in data sharing with interpretation unified across several cohorts. The researchers involved in the NTK project retain control over their data and through collaborative efforts they can work with other consenting cohorts to enrich their datasets and provide further insights. The standardized statistical analysis on single cohorts, as well as the correction procedure, comparisons, and graphical representations across different cohorts described here, can also be applied to other cohorts via the NTK app (currently under development). Following the approval and willingness of collaboration of all data owners, and with the accrual of added data and cohorts, the feasibility of the NTK app will be thoroughly explored.


Possible limitations of the present study include the decision not to use amyloid-β positron emission tomography data as a common comparator across cohorts to determine amyloid-β status, and the use of only single-timepoint data for individual assessments (e.g., only one MMSE assessment was included per individual). Future studies should include the clinical follow-up data for the participants. In the longitudinal cohorts included in this study, CSF sample collection and cognitive assessments were not necessarily completed at the same time. Cognitive assessments and CSF sample collection were performed within a year of each other, or the sample was excluded from this analysis. Although AD is a slowly evolving disease, this may have led to variations in the correlations for these measures. However, for individuals with MCI and CU individuals, pTau, tTau, Aβ42, and neurogranin have been shown to be stable over a 2-year period [36, 37]. In addition, the Abby/Blaze cohort of individuals with AD-dementia, cognitive assessment, and CSF sample collection both occurred at baseline. Compared with the CU (n=944) and AD-dementia (n=213) patient populations, the MCI population (n=56) is relatively small. Expanding the results within this important patient population may lead to further insights into AD disease pathogenesis and the clinical utility of the NTK panel of immunoassays.

The correction for preanalytical protocols was based on several assumptions: (i) the ALFA+ cohort samples were all collected according to the standardized protocol, but the Wisconsin cohort samples were not; (ii) sample handling procedures only affect α-Syn, Aβ42, and Aβ40, which may be disproved with future research; (iii) sample handling procedures have a similar effect on samples with high concentrations and those with lower concentrations. These assumptions may explain for the variations between cohorts seen in the biomarker values that were not corrected; however, it is beyond the scope of this study to determine the cause of such variations. In addition, correction for preanalytical protocols was employed and not harmonization of all biomarker results. While harmonization of the biomarker results would have allowed for head-to-head analysis, as no bridging samples (measured on the same instrument at the same time from all cohorts) required to perform a harmonization were collected correction was employed. The rationale for the correction factor was that existing cohorts were used, meaning some of the samples had been collected prior to the initiation of the present analysis. As such, reference materials and methods were not used. The correction factor is therefore employed as a solution to enable the inclusion of these data. If the analysis were to be performed as part of a prospective study, the correction factor may not be required if standardized procedures and reference materials were employed.

In summary, the robust prototype NTK panel of immunoassays provides biomarker data that can be used to support the utility of biomarkers in clinical trials and in the diagnostic clinical setting. Our study supports the feasibility of cross-cohort collation of data provided by the NTK immunoassays to enable further insight gathering into the underlying pathogenesis of AD. Our next step is to use a DRE to implement the standardized statistical analysis plan and increase the interpretation of results across studies. In addition, the NTK project will expand to include supplementary CSF immunoassays beyond the 12 in this study, as well as plasma biomarker immunoassays.

Availability of data and materials

ALFA+ cohort: All requests for raw and analyzed data and materials will be promptly reviewed by the senior authors to verify whether the request is subject to any intellectual property or confidentiality obligations. Bulk anonymized data can be shared by request from any qualified investigator for the sole purpose of replicating procedures and results presented in the article, providing data transfer is in agreement with European Union legislation and decisions by the institutional review board of each participating center.

Wisconsin cohort: Deidentified data from the Wisconsin cohort are available by request through our website. All requests will be reviewed by study leadership, and if approved, data will be transferred subject to an institutional data use agreement.

Abby/Blaze cohort: Qualified researchers may request access to individual anonymised patient-level data through the clinical study data request platform ( Further details on Roche’s criteria for eligible studies are available here: For further details on Roche’s Global Policy on the Sharing of Clinical Information and how to request access to related clinical study documents, see here:









Alzheimer’s disease


Alzheimer’s Disease Neuroimaging Initiative


Australian Imaging, Biomarkers and Lifestyle


Area under the curve


Clinical Dementia Rating scale Sum of Boxes


Cerebrospinal fluid


Cognitively unimpaired


Digital Research Environment


Global Biomarker Standardization Consortium


Glial fibrillary acidic protein




In vitro diagnostic


Mild cognitive impairment


Mini-Mental State Examination


Neurofilament light




Preclinical Alzheimer Cognitive Composite


Phosphorylated tau


Receiver operating characteristic


Robust-to-outliers standard deviation


Soluble triggering receptor expressed on myeloid cells 2


Total tau


Chitinase-3-like protein-1


  1. Obrocki P, et al. Perspectives in fluid biomarkers in neurodegeneration from the 2019 biomarkers in neurodegenerative diseases course-a joint PhD student course at University College London and University of Gothenburg. Alzheimers Res Ther. 2020;12(1):20.

    Article  Google Scholar 

  2. Blennow K, Zetterberg H, Fagan AM. Fluid biomarkers in Alzheimer disease. Cold Spring Harb Perspect Med. 2012;2(9):a006221.

    Article  Google Scholar 

  3. Hansson O, et al. The Alzheimer's Association international guidelines for handling of cerebrospinal fluid for routine clinical measurements of amyloid beta and tau. Alzheimers Dement. 2021;17(9):1575.

  4. Hansson O, et al. Pre-analytical protocol for measuring Alzheimer’s disease biomarkers in fresh CSF. Alzheimers Dement (Amst). 2020;12(1):e12137.

    Google Scholar 

  5. Doecke JD, et al. Elecsys CSF biomarker immunoassays demonstrate concordance with amyloid-PET imaging. Alzheimers Res Ther. 2020;12(1):36.

    Article  CAS  Google Scholar 

  6. Lifke V, et al. Elecsys((R)) Total-tau and phospho-tau (181P) CSF assays: analytical performance of the novel, fully automated immunoassays for quantification of tau proteins in human cerebrospinal fluid. Clin Biochem. 2019;72:30–8.

    Article  CAS  Google Scholar 

  7. Tariciotti L, et al. Clinical experience with cerebrospinal fluid Abeta42, total and phosphorylated tau in the evaluation of 1,016 individuals for suspected dementia. J Alzheimers Dis. 2018;65(4):1417–25.

    Article  CAS  Google Scholar 

  8. Mila-Aloma M, et al. Amyloid beta, tau, synaptic, neurodegeneration, and glial biomarkers in the preclinical stage of the Alzheimer’s continuum. Alzheimers Dement. 2020;16(10):1358–71.

    Article  Google Scholar 

  9. Van Hulle C, et al. An examination of a novel multipanel of CSF biomarkers in the Alzheimer’s disease clinical and pathological continuum. Alzheimers Dement. 2021;17(3):431–45.

    Article  Google Scholar 

  10. Johnson SC, et al. The Wisconsin Registry for Alzheimer’s Prevention: a review of findings and current directions. Alzheimers Dement (Amst). 2018;10:130–42.

    Article  Google Scholar 

  11. Cummings JL, et al. ABBY: a phase 2 randomized trial of crenezumab in mild to moderate Alzheimer disease. Neurology. 2018;90(21):e1889–97.

    Article  CAS  Google Scholar 

  12. Salloway S, et al. Amyloid positron emission tomography and cerebrospinal fluid results from a crenezumab anti-amyloid-beta antibody double-blind, placebo-controlled, randomized phase II study in mild-to-moderate Alzheimer’s disease (BLAZE). Alzheimers Res Ther. 2018;10(1):96.

    Article  Google Scholar 

  13. Abdi IY, et al. Preanalytical stability of CSF total and oligomeric alpha-synuclein. Front Aging Neurosci. 2021;13:638718.

    Article  CAS  Google Scholar 

  14. Folstein MF, Folstein SE, McHugh PR. “Mini-mental state”. A practical method for grading the cognitive state of patients for the clinician. J Psychiatr Res. 1975;12(3):189–98.

    Article  CAS  Google Scholar 

  15. O'Bryant SE, et al. Staging dementia using Clinical Dementia Rating Scale Sum of Boxes scores: a Texas Alzheimer’s research consortium study. Arch Neurol. 2008;65(8):1091–5.

    Article  Google Scholar 

  16. Morris JC. The Clinical Dementia Rating (CDR): current version and scoring rules. Neurology. 1993;43(11):2412–4.

    Article  CAS  Google Scholar 

  17. Donohue MC, et al. The preclinical Alzheimer cognitive composite: measuring amyloid-related decline. JAMA Neurol. 2014;71(8):961–70.

    Article  Google Scholar 

  18. Papp KV, et al. Cued memory decline in biomarker-defined preclinical Alzheimer disease. Neurology. 2017;88(15):1431–8.

    Article  Google Scholar 

  19. Jonaitis EM, et al. Measuring longitudinal cognition: individual tests versus composites. Alzheimers Dement (Amst). 2019;11:74–84.

    Article  Google Scholar 

  20. Vilor-Tejedor N, et al. Perivascular spaces are associated with tau pathophysiology and synaptic dysfunction in early Alzheimer’s continuum. Alzheimers Res Ther. 2021;13(1):135.

    Article  CAS  Google Scholar 

  21. Snider BJ, et al. Cerebrospinal fluid biomarkers and rate of cognitive decline in very mild dementia of the Alzheimer type. Arch Neurol. 2009;66(5):638–45.

    Article  Google Scholar 

  22. Palmqvist S, et al. Comparison of brief cognitive tests and CSF biomarkers in predicting Alzheimer’s disease in mild cognitive impairment: six-year follow-up study. PLoS One. 2012;7(6):e38639.

    Article  CAS  Google Scholar 

  23. Llano DA, et al. Derivation of a new ADAS-cog composite using tree-based multivariate analysis: prediction of conversion from mild cognitive impairment to Alzheimer disease. Alzheimer Dis Assoc Disord. 2011;25(1):73–84.

    Article  Google Scholar 

  24. Mattsson N, et al. CSF biomarkers and incipient Alzheimer disease in patients with mild cognitive impairment. JAMA. 2009;302(4):385–93.

    Article  CAS  Google Scholar 

  25. Ewers M, et al. Prediction of conversion from mild cognitive impairment to Alzheimer’s disease dementia based upon biomarkers and neuropsychological test performance. Neurobiol Aging. 2012;33(7):1203–14.

    Article  CAS  Google Scholar 

  26. Jin M, Cao L, Dai YP. Role of neurofilament light chain as a potential biomarker for Alzheimer’s disease: a correlative meta-analysis. Front Aging Neurosci. 2019;11:254.

    Article  CAS  Google Scholar 

  27. Sanchez-Benavides G, et al. Amyloid-beta positive individuals with subjective cognitive decline present increased CSF neurofilament light levels that relate to lower hippocampal volume. Neurobiol Aging. 2021;104:24–31.

    Article  CAS  Google Scholar 

  28. Disanto G, et al. Serum neurofilament light chain levels are increased in patients with a clinically isolated syndrome. J Neurol Neurosurg Psychiatry. 2016;87(2):126–9.

    Google Scholar 

  29. Mollenhauer B, et al. Validation of serum neurofilament light chain as a biomarker of Parkinson’s disease progression. Mov Disord. 2020;35(11):1999–2008.

    Article  CAS  Google Scholar 

  30. Byrne LM, et al. Neurofilament light protein in blood as a potential biomarker of neurodegeneration in Huntington’s disease: a retrospective cohort analysis. Lancet Neurol. 2017;16(8):601–9.

    Article  CAS  Google Scholar 

  31. Zhu N, et al. Plasma glial fibrillary acidic protein and neurofilament light chain for the diagnostic and prognostic evaluation of frontotemporal dementia. Transl Neurodegener. 2021;10(1):50.

    Article  CAS  Google Scholar 

  32. Carrillo MC, et al. Global standardization measurement of cerebral spinal fluid for Alzheimer’s disease: an update from the Alzheimer’s Association Global Biomarkers Consortium. Alzheimers Dement. 2013;9(2):137–40.

    Article  Google Scholar 

  33. Karikari TK, et al. Blood phosphorylated tau 181 as a biomarker for Alzheimer’s disease: a diagnostic performance and prediction modelling study using data from four prospective cohorts. Lancet Neurol. 2020;19(5):422–33.

    Article  CAS  Google Scholar 

  34. Mueller SG, et al. The Alzheimer’s disease neuroimaging initiative. Neuroimaging Clin N Am. 2005;15(4):869–77 xi-xii.

    Article  Google Scholar 

  35. Ellis KA, et al. The Australian Imaging, Biomarkers and Lifestyle (AIBL) study of aging: methodology and baseline characteristics of 1112 individuals recruited for a longitudinal study of Alzheimer’s disease. Int Psychogeriatr. 2009;21(4):672–87.

    Article  Google Scholar 

  36. Zetterberg H, et al. Intra-individual stability of CSF biomarkers for Alzheimer’s disease over two years. J Alzheimers Dis. 2007;12(3):255–60.

    Article  CAS  Google Scholar 

  37. Ostrowitzki S, et al. A phase III randomized trial of gantenerumab in prodromal Alzheimer’s disease. Alzheimers Res Ther. 2017;9(1):95.

    Article  Google Scholar 

Download references


The authors would like to thank Marta Milà-Alomà for her significant contributions to this work. The authors would also like to express their most sincere gratitude to the study participants as well as the many study team members, without whom this research would not have been possible. Third-party medical writing assistance, under the direction of the authors, was provided by Ashlie Butler, PhD (Bioscript Group, Macclesfield, UK), and was funded by Roche Diagnostics International Ltd (Rotkreuz, Switzerland).

ELECSYS, COBAS, and COBAS E are registered trademarks of Roche.


The Elecsys Aβ42 CSF immunoassay in use is not a commercially available IVD assay. It is an assay that is currently under development and for investigational use only. The measuring range of the assay is 200 (lower technical limit)–1700 pg/mL (upper technical limit). The performance of the assay beyond the upper technical limit has not been formally established. Therefore, the use of values above the upper technical limit, which are provided based on an extrapolation of the calibration curve, is restricted to exploratory research purposes and is excluded for clinical decision-making or for the derivation of medical decision points.


The ALFA+ study receives funding from “la Caixa” Foundation (ID 100010434), under agreement LCF/PR/GN17/50300004 and the Alzheimer’s Association and an international anonymous charity foundation through the TriBEKa Imaging Platform project (TriBEKa-17-519007). For the Wisconsin studies, CSF assay kits were provided by Roche Diagnostics GmbH. Data were collected under the following grants from the NIH National Institute of Aging: R01 AG027161, R01 AG037639, R01AG054059 UF1AG051216, P50 AG033514, and P30 AG062715. Support was also provided by the NIH National Center for Advancing Translational Sciences (NCATS), grant UL1TR000427 (Clinical and Translational Sciences). MSC receives funding from the European Research Council (ERC) under the European Union’s Horizon 2020 Research and Innovation Programme (grant agreement No. 948677), the Instituto de Salud Carlos III (PI19/00155), and a fellowship from “la Caixa” Foundation (ID 100010434) and from the European Union’s Horizon 2020 Research and Innovation Programme under the Marie Skłodowska-Curie (grant agreement No. 847648; LCF/BQ/PR21/11840004). JDG is supported by the Spanish Ministry of Science and Innovation (RYC-2013-13054). JDG has also received research support from the EU/EFPIA Innovative Medicines Initiative Joint Undertaking AMYPAD (grant agreement 115952), EIT Digital (Grant 2021), and from Ministerio de Ciencia y Universidades (grant agreement RTI2018-102261). HZ is a Wallenberg Scholar supported by grants from the Swedish Research Council (#2018-02532); the European Research Council (#681712); Swedish State Support for Clinical Research (#ALFGBG-720931); the Alzheimer Drug Discovery Foundation (ADDF), USA (#201809-2016862); the AD Strategic Fund and the Alzheimer’s Association (#ADSF-21-831376-C, #ADSF-21-831381-C and #ADSF-21-831377-C); the Olav Thon Foundation; the Erling-Persson Family Foundation; Stiftelsen för Gamla Tjänarinnor; Hjärnfonden, Sweden (#FO2019-0228); the European Union’s Horizon 2020 Research and Innovation Programme under the Marie Skłodowska-Curie grant agreement No 860197 (MIRIADE); and the UK Dementia Research Institute at UCL. KB is supported by the Swedish Research Council (#2017-00915); the Alzheimer Drug Discovery Foundation (ADDF), USA (#RDAPB-201809-2016615); the Swedish Alzheimer Foundation (#AF-742881); Hjärnfonden, Sweden (#FO2017-0243); the Swedish state under the agreement between the Swedish government and the County Councils, the ALF-agreement (#ALFGBG-715986); the European Union Joint Program for Neurodegenerative Disorders (JPND2019-466-236); the National Institute of Health (NIH), USA (grant #1R01AG068398-01); and the Alzheimer’s Association 2021 Zenith Award (ZEN-21-848495).

Author information

Authors and Affiliations



SCJ, MSC, CM, JDG, and EJ contributed to the data collection and analysis and critically reviewed and approved the manuscript for publication. IS, MC, TB, CR, and GK contributed to the study conception and design and critically reviewed and approved the manuscript for publication. AM, HZ, and KB contributed to the data analysis and critically reviewed and approved the manuscript for publication. The authors read and approved the final manuscript.

Corresponding author

Correspondence to Marc Suárez-Calvet.

Ethics declarations

Ethics approval and consent to participate

This study was conducted in accordance with the Declaration of Helsinki of 1975 (revised 2013), the International Council on Harmonisation of Technical Requirements for Registration of Pharmaceuticals for Human Use Good Clinical Practice guidelines, and/or current national guidelines. Participants provided written informed consent, and the study was approved by the respective institutional review boards.

Consent for publication

Not applicable.

Competing interests

Sterling C. Johnson has served at scientific advisory boards for Roche Diagnostics and receives research funding from Cerveau Technologies. Marc Suárez-Calvet has served at scientific advisory boards and/or as a consultant for Roche Diagnostics International Ltd and has given lectures in symposia sponsored by Roche Diagnostics SLU and Roche Farma SA. Juan Domingo Gispert receives research funding from Roche Diagnostics and GE Healthcare and has given lectures in symposia sponsored by Biogen and Philips. Henrik Zetterberg has served at scientific advisory boards and/or as a consultant for AbbVie, Alector, Eisai, Denali, Roche Diagnostics, Wave, Samumed, Siemens Healthineers, Pinteon Therapeutics, Nervgen, AZTherapies, CogRx, and Red Abbey Labs; has given lectures in symposia sponsored by Cellectricon, Fujirebio, Alzecure, and Biogen; and is a co-founder of Brain Biomarker Solutions in Gothenburg AB (BBS), which is a part of the GU Ventures Incubator Program. Kaj Blennow has served as a consultant and at scientific advisory boards and/or data monitoring committees for Abcam, Axon, Biogen, JOMDD/Shimadzu, Julius Clinical, Lilly, MagQu, Novartis, Prothena, Roche Diagnostics, and Siemens Healthineers and is a co-founder of Brain Biomarker Solutions in Gothenburg AB (BBS), which is a part of the GU Ventures Incubator Program. Margherita Carboni is a full-time employee of Roche Diagnostics International Ltd and an owner of shares in Roche. Tobias Bittner is a full-time employee of F. Hoffmann-La Roche AG and owner of shares in Roche. Christina Rabe is a full-time employee of Genentech and owner of shares in Roche. Agata Michna is a full-time employee of Roche Diagnostics GmbH and an owner of shares in Roche. Ivonne Suridjan is a full-time employee of Roche Diagnostics International Ltd. Gwendlyn Kollmorgen is a full-time employee of Roche Diagnostics GmbH. Carolina Minguillón and Erin Jonaitis have nothing to disclose.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

Supplementary methods, tables, and figures.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Johnson, S.C., Suárez-Calvet, M., Suridjan, I. et al. Identifying clinically useful biomarkers in neurodegenerative disease through a collaborative approach: the NeuroToolKit. Alz Res Therapy 15, 25 (2023).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: