Skip to main content

A language-based sum score for the course and therapeutic intervention in primary progressive aphasia



With upcoming therapeutic interventions for patients with primary progressive aphasia (PPA), instruments for the follow-up of patients are needed to describe disease progression and to evaluate potential therapeutic effects. So far, volumetric brain changes have been proposed as clinical endpoints in the literature, but cognitive scores are still lacking. This study followed disease progression predominantly in language-based performance within 1 year and defined a PPA sum score which can be used in therapeutic interventions.


We assessed 28 patients with nonfluent variant PPA, 17 with semantic variant PPA, 13 with logopenic variant PPA, and 28 healthy controls in detail for 1 year. The most informative neuropsychological assessments were combined to a sum score, and associations between brain atrophy were investigated followed by a sample size calculation for clinical trials.


Significant absolute changes up to 20% in cognitive tests were found after 1 year. Semantic and phonemic word fluency, Boston Naming Test, Digit Span, Token Test, AAT Written language, and Cookie Test were identified as the best markers for disease progression. These tasks provide the basis of a new PPA sum score. Assuming a therapeutic effect of 50% reduction in cognitive decline for sample size calculations, a number of 56 cases is needed to find a significant treatment effect. Correlations between cognitive decline and atrophy showed a correlation up to r = 0.7 between the sum score and frontal structures, namely the superior and inferior frontal gyrus, as well as with left-sided subcortical structures.


Our findings support the high performance of the proposed sum score in the follow-up of PPA and recommend it as an outcome measure in intervention studies.


Primary progressive aphasia (PPA) comprises a group of neurodegenerative disorders in which language problems are the principal cause of impaired daily living activities, whereas other neurobehavioral or cognitive deficits are rare during the initial stages of the illness [1]. PPA can be classified into three clinical subtypes [2]. The nonfluent/agrammatic variant (nfvPPA) presents with agrammatism in speech production and/or apraxia of speech with additional impaired comprehension of syntactically complex sentences, while object knowledge and single-word comprehension are spared [2]. Patients with the semantic variant (svPPA) have progressive deficits in comprehending single words as a widespread semantic memory deficit, often combined with impaired object knowledge and surface dyslexia or dysgraphia [2]. The logopenic variant (lvPPA) is specified by difficulties finding words and impaired sentence repetition [2, 3]. Typically, nfvPPA and svPPA are syndromes with underlying frontotemporal lobar degeneration (FTLD) pathology, i.e., nfvPPA is frequently linked to FTLD-tau, whereas svPPA is associated with FTLD-TDP [4]. In contrast, lvPPA often has an underlying Alzheimer’s disease pathology [5].

Several studies provide an initial insight into the progression of PPA [6,7,8,9]. Knopman et al. [8] measured whole brain and ventricular volume changes within 1 year in FTLD patients, including 17 nfvPPA, 16 svPPA, and 9 lvPPA, amongst others. A slightly higher whole brain atrophy rate was found in lvPPA than in nfvPPA and svPPA. Rogalski et al. [9] examined the longitudinal course of PPA over a 2-year period in 10 nfvPPA, 8 svPPA, and 8 lvPPA patients. They concluded that analyzing a focal cortical language network is a more sensitive clinical outcome measure than whole brain or ventricular volume measures. However, none of these studies provided a clinical tool that can be used for comprehensive language assessment and for sample size calculation of a clinical trial. This is, however, prerequisite for any upcoming trials, e.g., using tau protein immunization strategies [10].

The aim of this study was to follow disease progression predominantly using language-based performance in detail and to define a practicable and effective PPA sum score which can be used in planning clinical trials. Additionally, brain atrophy was investigated in predefined regions as a first exploratory validation step of this sum score.



We report 1-year follow-up data of 58 PPA patients, including 28 nfvPPA, 17 svPPA, and 13 lvPPA, and 28 neurologically healthy controls. All patients met the clinical criteria suggested by Gorno-Tempini et al. [2], whereupon the additional imaging supportive criteria applied to 48% of nfvPPA, 93% of svPPA, and 66.7% of lvPPA patients. None of the 58 patients showed a known mutation in MAPT, GRN, PSEN1, or C9orf72. On-site monitoring was conducted for all participants. Patients were recruited from 10 academic centers across Germany: Bonn, Erlangen, Göttingen, Hamburg-Eppendorf, Homburg/Saar, Leipzig, Munich, Rostock, Ulm, and Würzburg. The study was approved by the local Ethics Committees (proposal number at the central study center at University of Ulm, 39/11, 8 March 2011), and written informed consent was obtained from each patient, participant, caregiver, or legal representative.

Neuropsychological assessment

Patients underwent an extensive neuropsychological assessment covering a broad range of cognitive domains. The language investigations included naming ability (Boston Naming Test, 15-item short version from the Consortium to Establish a Registry for Alzheimer's Disease (CERAD)-plus battery [11]), word and sentence comprehension (Token Test [12]), reading and writing abilities (written language-subtest of the German Aachener Aphasie Test (AAT) [13]), semantic knowledge and word repetition (Repeat and Point Test [14]), phonemic and semantic word fluency (“s-words” and category “animals” [15]), and spontaneous speech production (“Cookie Theft picture” [16]). The rating for the latter was defined by mentioning 20 predefined items of the picture.

Episodic memory, visuo-spatial abilities, information processing speed, and cognitive flexibility were measured within the CERAD-plus battery. Executive functions such as short-term and working memory capacity (Digit and Block Span [17]), figural fluency [18], interference resolution (standard Stroop Test, adapted from the European Huntington’s Disease Network), and cognitive estimation abilities [19] were included in the assessment protocol.

Clinical rating scales

The well-established Clinical Dementia Rating Scale (CDR) [20] and the FTLD-specific rating scale (FTLD-CDR) [21], the latter including two additive domains for behavioral changes and language dysfunction, provide a global score with the aim of staging the severity of disease. Scoring was performed with the so-called “sum of boxes score”, a summation of all domains.

Neurochemical markers

Cerebrospinal fluid (CSF) was taken by lumbar puncture at baseline examination and analyzed for tau, phospho-tau (ptau) and Abeta1–42 using commercially available enzyme-linked immunosorbent assays (ELISAs) [22].

Imaging data acquisition

Patients underwent a 3-T magnetic resonance imaging (MRI) scan at baseline examination and follow-up. We analyzed 35 defined brain structures following a meta analysis [23] which had been calculated by an atlas-based volumetric (ABV) analysis beforehand. For a detailed description of this procedure, see Steinacker et al. [24].

Data analysis

Statistical analyses were performed with IBM SPSS Statistics for Windows, Version 21.0, and SAS 9.4. The level of significance was set to p = 0.05. Kruskal-Wallis tests, χ2 tests and post-hoc tests examined differences in demographic variables, biomarker levels, and sum score decline between the diagnostic groups. Cognitive change and atrophy rate between baseline and follow-up were analyzed by paired t tests, followed by a Bonferroni correction for all volumetric structures and for each domain-specific cluster (e.g., executive functions, memory, language). Spearman and partial correlations tested for associations between the sum score and demographic variables, clinical rating scale, and atrophy rate; a Bonferroni correction was applied if necessary. Sample size calculations were based on the observed mean decline in the sum score. The power was set to 80%, and the alpha error level to 5% for the use of a unpaired t test. A multiple imputation procedure (SAS 9.4 Proc MI with fully conditional specification methods, FCS and regression method, REG with ten consecutive imputation calculations and an adjustment for variable age) corrected for missing values for single neuropsychological tests in the data matrix, ruling out systematic bias beforehand (e.g., influence of the study site on missing values). Three cases were excluded from this procedure because of a massive deterioration in cognitive performance. A correction of variance values was added [25].

Development of a sum score

The PPA sum score has been derived from those tests that showed a strong decline within 1 year, covered a broad range of relevant cognitive domains (e.g., naming, fluency, word and sentence comprehension, reading and writing ability, spontaneous speech production, verbal short-term and working memory capacity), and had a tolerable execution time.

We defined maximum attainable test scores for those lacking an endpoint, which were additionally used to enable the presentation of absolute percentage changes in the single assessments. In accordance with the mean observed value in neurologically healthy controls (averaging the scores for men and women as well as for education [17, 26]), the maximum test score was set to 24 points for semantic fluency (category “animals”) and to 12 points for phonemic fluency (“s-words”), respectively. The Digit Span forward and backward were both set to a maximum of 6 points. As higher values represent better performance except for the Token Test, the latter was reversed (zero errors result in 50 points, whereas a maximum of 50 errors result in zero points). For the “unbalanced” version of the sum score, we simply calculated a summation of the eight test scores attained, resulting in a maximum score of 223 points. As the different attainable maximum test scores have a range from 6 to 90 points and a simple summation of the single scores therefore gives an unbalanced weight to single tests, we decided to adjust every test score to a maximum of 50 points via a simple multiplicative transformation. The overall balanced sum score, with a maximum attainable score of 400 points, is calculated as follows:

Sum score = (verbal fluency animals × 2.083) + (Boston Naming × 3.333) + (verbal fluency s-words × 4.167) + (Digit Span forward × 8.333) + (Digit Span backward × 8.333) + (AAT written language × 0.556) + Token Test reversed + (Cookie Theft × 2.5).



Table 1 gives an overview of all demographic variables, clinical rating scales, and biomarker values. Significant differences between diagnostic groups concerning age of initial symptoms (p =0.022) and disease duration (p =0.040) were found. Post-hoc tests revealed earlier symptom onset (p =0.011) and longer disease duration (p =0.008) in svPPA compared to nfvPPA.

Table 1 Demographic variables, clinical scores, and cerebrospinal fluid biomarker values at baseline examination, separated by diagnostic group

Cognitive test results

Figure 1 shows the absolute percentage change rates in cognitive assessment within 1 year for those tests selected for the sum score. Detailed results of the paired t tests for the whole assessment battery are listed in Additional file 1: Table S1.

Fig. 1
figure 1

Main cognitive test results within 1 year. Scores at baseline (v1) and follow-up after 1 year (v2) for the eight cognitive assessments that have been included in the sum score. Boxes show the 25–75% percentile range with median, and whiskers indicate minimum and maximum. For an easier comparison, we present absolute percentage scores that have been calculated by setting maximum sores as described in the Methods section. *p < 0.05, significant changes within 1 year, calculated via paired t tests. lvPPA logopenic variant primary progressive aphasia, nfvPPA nonfluent variant primary progressive aphasia, PPA primary progressive aphasia, svPPA semantic variant primary progressive aphasia

nfvPPA showed greater decline rates throughout the different tests than svPPA or lvPPA. Particularly worth mentioning are performance changes in the Cookie Theft (–22.2%, p < 0.001), the written language test (–18.4%, p = 0.001), the repeat condition of the Repeat and Point Test (–15.2%, p = 0.003), the Digit Span forward (–20%, p < 0.001), and the semantic fluency (–12.3%, p = 0002). svPPA also revealed a significant decline in semantic fluency (–11.6%, p = 0.006) and in the Boston Naming Test (–12.4%, p = 0.004). The phonemic fluency showed a high decline rate but failed to reach significance (–21.8%, p > 0.05). For lvPPA, the highest declines were found in phonemic fluency (–16.7%), digit span backward (–13.6%), wordlist saving (–15.3%), and the repeat condition of the Repeat and Point Test (–15.56%), but none of them reached significance. Healthy controls did not show any significant performance change. Regarding the FTLD-CDR, only nfvPPA and svPPA revealed a significant progression at 1 year. nfvPPA showed an increase in the FTLD-CDR of 1.9 points (p = 0.001) and in the CDR of 1.3 points (p = 0.01). svPPA presented with an increase of 3.1 points in the FTLD-CDR (p = 0.012) and in the CDR of 2.4 points (p = 0.019).

Atrophy progression

nfvPPA showed a significant atrophy rate in the left-sided superior frontal (–4.70%, p = 0.001), the left-sided superior temporal (–4.49%, p < 0.001), and the right superior frontal gyrus (–4.49%, p < 0.001). svPPA revealed a higher and more bilateral atrophy rate, e.g., significant changes were found in the insulae (left –6.57%, p = 0.001; right –5.91%, p < 0.001), the left hippocampus (–9.04%, p = 0.001), the left amygdala (–8.37%, p < 0.001), and in the left superior frontal gyrus (–4.42%, p = 0.001). In lvPPA, significant atrophy progression in the left middle temporal (–4.15%, p = 0.001) and the left angular gyrus (–3.10%, p < 0.001) was detected. For detailed results of all paired t tests, see Additional file 2: Table S2.

The FTLDc-PPA sum score

Tests covering the different language domains were chosen to be included in the PPA sum score. An important relevant factor, besides a significant decline, was seen in the time required to perform the tests. We aimed for a performance time between 30 and 40 min for the whole sum score. As already mentioned in the Methods section, we included tests covering word and sentence comprehension, naming, reading and writing abilities, verbal working memory capacity, semantic and phonemic retrieval, as well as spontaneous speech competence. The final sum score is presented either as raw data (unbalanced version), meaning a simple summation of test results (best 223 points), and as a weighted presentation of test results (balanced version) in which all speech domains have a similar representation (best 400 points, see Methods section).

Figure 2 shows the results observed for the balanced score version, depending on the PPA diagnosis. There was a highly significant decline in the sum scores within 1 year for all PPA subgroups with one exception in the svPPA group for the unbalanced score. The decline rate did not differ between the three subgroups and neurologically healthy controls showed no significant change. A significant correlation between education and both sum score variants was found, with an association between better performance and higher education level (r = 0.33, p = 0.007 for the unbalanced score and r = 0.43, p < 0.001 for the balanced version). All other demographic variables, including age at baseline, disease duration, and age at onset of disease, proved to be uncorrelated. Internal consistency of both sum score variants was satisfying (Cronbachs αbalanced score = 0.810, Cronbachs αunbalanced score = 0.802).

Fig. 2
figure 2

Balanced FTLDc PPA score results within 1 year. Balanced FTLDc-PPA sum scores at baseline (v1) and at follow-up of 1 year (v2). Boxes show the 25–75% percentile range with median, and whiskers indicate minimum and maximum. *Significant changes within 1 year, calculated via paired t tests are indicated by brackets (healthy controls (HC): p = 0.780; nonfluent variant primary progressive aphasia (nfvPPA): p = 0.002; semantic variant primary progressive aphasia (svPPA): p = 0.016; logopenic variant primary progressive aphasia (lvPPA): p = 0.001)

Correlations between sum score and clinical rating scale decline (controlling for education) showed no significant association. However, correlating both values at baseline status, again controlling for education, showed noteworthy relationships (all p < 0.001) with r unbalanced score  = –0.63 and r balanced score  = –0.64 for the CDR, and r unbalanced score  = –0.62 and r balanced score  = –0.63 for the FTLD-CDR.

Correlation analysis of FTLDc-PPA sum score and brain atrophy

Correlating the decrease in cognitive and linguistic performance and brain atrophy (both variables measured as relative percentage change with education as a control variable) showed a significant relationship between the left cerebral cortex and the sum score (Fig. 3). We set aside correlations for the subgroups due to small sample sizes. The balanced sum score showed significant correlations with the left frontal lobe (r = 0.647, p < 0.001), moreover with the left superior frontal gyrus (r = 0.61, p = 0.001) and with the left putamen (r = 0.593, p = 0.001). The unbalanced version of the sum score showed nearly the same findings for the left frontal lobe (r = 0.610, p = 0.001) and the left putamen (r = 0.592, p = 0.001). After exclusion of one extreme case from the calculation as an outlier, a nearly significant correlation (after Bonferroni correction) between the balanced sum score change and the left frontal lobe decrease (r = 0.576, p = 0.002) was revealed.

Fig. 3
figure 3

Correlation between balanced sum score change and volumetric change in left frontal lobe and left putamen. Both variables are measured as relative percentage change. N = 28. lvPPA logopenic variant primary progressive aphasia, nfvPPA nonfluent variant primary progressive aphasia, svPPA semantic variant primary progressive aphasia

Sample size calculations for therapeutic trials

Assuming a minimum treatment effect of about 30%, a sample size calculation was performed. Table 2 shows the number of cases which are necessary to detect a possible treatment effect in a placebo-controlled trial, graded for different magnitudes of a treatment effect. For example, assuming a treatment effect of 50% reduction for cognitive decline measured by the balanced sum score, a cohort of 29 nfvPPA patients per group (placebo and verum) is needed to prove a significant treatment effect. Numbers for the unbalanced sum score are higher, e.g., 37 nfvPPA patients per group assuming the same treatment effect. The presented data are based on the imputation calculation; however, a comparison of raw data without the imputation method showed nearly the same findings (Additional file 3: Table S3). For a comparison, we set up sample size calculations for the FTLD-CDR as well. Again, assuming a treatment effect of 50% reduction in the measured increase in the FTLD-CDR score, a cohort of nfvPPA twice the size with 58 patients per group (placebo and verum) is needed to prove a treatment effect. Considering lvPPA patients even raises the number to 180 patients per group, while svPPA patients show comparable results with the balanced sum score, i.e., 76 patients per group for a 50% reduction (Additional file 4: Table S4).

Table 2 Sample size calculation based on the observed mean decline of the sum score within 1 year


The overall aim of this analysis was to provide a clinical score that can be used as an outcome measure in clinical trials. With the aim of covering a range of important aspects of language to provide one score for all PPA subtypes, we identified eight cognitive tests, merging them into one overall sum score. The two variants of this score, one comprising a simple summation of all raw scores obtained, the second a balanced version precluding unbalanced weights for single scores, showed overall high progressive decline between –9% and –13% at 1 year, which is in line with the results of Hsieh et al. [7], who reported an approximate 10% reduction in a cognitive screening tool (Addenbrooke’s Cognitive Examination-Revised) in PPA patients. We did not find any meaningful differences in sum score decreases between the PPA subgroups, preventing a differentiation between the diagnostic groups. At baseline, the sum scores and clinical rating scale FTLD-CDR showed relevant correlations with r ≈ –0.6, suggesting a good overall representation of cognitive status. A correlation between sum score and volumetric changes showed slightly higher relations for the balanced score version, including both the left frontal lobe (r = 0.647) and the left putamen (r = 0.593). Although it has to be noted here that the quite high correlations seem to be reinforced by a strong progressive decline in single patients, we also see evidence that the cognitive decline measured by the sum score reflects atrophy progression.

Regarding the decrease rates of the cognitive assessments, we detected considerable variation between the PPA subgroups, with nfvPPA descriptively showing greater progression than svPPA and lvPPA. Up to 20% absolute percentage reductions for single assessments were revealed within 1 year.

Based on our cohort, we set up sample size calculations with the new scores to review applicability for possible prospective clinical trials. Assuming a treatment effect of 50% reduction in cognitive decline, 28 cases per placebo and verum group would be needed, using the balanced sum score, calculated for PPA as an entire group. Using the unbalanced sum score generally increases the cases needed, for example in the above-mentioned example to 45 cases per group. A comparison between the PPA groups shows similar numbers in nfvPPA and lvPPA, whereas in svPPA higher case numbers are needed. Higher variance values in the latter group are likely to account for this difference. Sample size estimations based on the FTLD-CDR revealed higher case numbers. For example, again assuming a treatment effect of 50% reduction in cognitive decline as mentioned above, 93 patients per group are needed. Compared with previous samples size calculations which were based on volumetric brain changes as an outcome measure [8, 9], comparable numbers were found. Using a cognitive assessment as an outcome measure shows a clear advantage, as it is easy to implement and time saving in tabulation. Comparing the two versions of the sum score, the balanced score seems to be preferable since it shows higher correlations with atrophy progression, clinical rating scales, and fewer numbers in the sample size calculation. Furthermore, it provides equal emphasis for the single assessments.

Although this is the largest follow-up study on PPA patients to our knowledge, a limitation of this study is the still small and unbalanced sample size. Our cohort showed considerable variation in disease progression at baseline and a more uniform sample would clearly be desirable. Additionally, it should be mentioned that the sum score includes an assessment which had been developed for the German language (AAT written language) and is based on normative data of German speaking samples. A simple transfer of the sum score characteristics into different languages must be taken with caution.


Our results show overall high decline in language-specific neuropsychological tests within 1 year for all PPA subtypes and progressive atrophy in frontal and temporal regions. Based on our cohort, we combined the most informative neuropsychological assessments covering different aspects of language to a new sum score, which showed high correlations with the frontal atrophy rate. Subsequent sample size calculations showed feasible numbers; therefore, we believe that we now hold a practical tool for investigating PPA patients, especially with a focus on nfvPPA patients at follow-up, which can be used for clinical trials.



Aachener Aphasie Test


Clinical Dementia Rating Scale


the Consortium to Establish a Registry for Alzheimer’s Disease


Cerebrospinal fluid


Frontotemporal lobar degeneration


Logopenic variant primary progressive aphasia


Nonfluent variant primary progressive aphasia


Primary progressive aphasia


Phospho tau


Semantic variant primary progressive aphasia


  1. Mesulam M-M, Wieneke C, Hurley R, Rademaker A, Thompson CK, Weintraub S, Rogalski EJ. Words and objects at the tip of the left temporal lobe in primary progressive aphasia. Brain. 2013;136:601–18.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Gorno-Tempini ML, Hillis AE, Weintraub S, Kertesz A, Mendez M, Cappa SF, Ogar JM, Rohrer JD, Black S, Boeve BF, et al. Classification of primary progressive aphasia and its variants. Neurology. 2011;76:1006–14.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Leyton CE, Hodges JR, McLean CA, Kril JJ, Piguet O, Ballard KJ. Is the logopenic-variant of primary progressive aphasia a unitary disorder? Cortex. 2015;67:122–33.

    Article  PubMed  Google Scholar 

  4. Irwin DJ, Cairns NJ, Grossman M, McMillan CT, Lee EB, Van Deerlin VM, Lee VM, Trojanowski JQ. Frontotemporal lobar degeneration: defining phenotypic diversity through personalized medicine. Acta Neuropathol. 2015;129:469–91.

    Article  PubMed  Google Scholar 

  5. Bonner MF, Ash S, Grossman M. The new classification of primary progressive aphasia into semantic, logopenic, or nonfluent/agrammatic variants. Curr Neurol Neurosci Rep. 2010;10:484–90.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Matias-Guiu JA, Cabrera-Martin MN, Moreno-Ramos T, Garcia-Ramos R, Porta-Etessam J, Carreras JL, Matias-Guiu J. Clinical course of primary progressive aphasia: clinical and FDG-PET patterns. J Neurol. 2015;262:570–7.

    Article  PubMed  Google Scholar 

  7. Hsieh S, Hodges JR, Leyton CE, Mioshi E. Longitudinal changes in primary progressive aphasias: differences in cognitive and dementia staging measures. Dement Geriatr Cogn Disord. 2012;34:135–41.

    Article  CAS  PubMed  Google Scholar 

  8. Knopman DS, Jack Jr CR, Kramer JH, Boeve BF, Caselli RJ, Graff-Radford NR, Mendez MF, Miller BL, Mercaldo ND. Brain and ventricular volumetric changes in frontotemporal lobar degeneration over 1 year. Neurology. 2009;72:1843–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Rogalski E, Cobia D, Martersteck A, Rademaker A, Wieneke C, Weintraub S, Mesulam MM. Asymmetry of cortical decline in subtypes of primary progressive aphasia. Neurology. 2014;83:1184–91.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Novak P, Schmidt R, Kontsekova E, Zilka N, Kovacech B, Skrabana R, Vince-Kazmerova Z, Katina S, Fialova L, Prcina M, et al. Safety and immunogenicity of the tau vaccine AADvac1 in patients with Alzheimer's disease: a randomised, double-blind, placebo-controlled, phase 1 trial. Lancet Neurol. 2016;16:123–34.

  11. Morris JC, Heyman A, Mohs RC, Hughes JP, van Belle G, Fillenbaum G, Mellits ED, Clark C. The Consortium to Establish a Registry for Alzheimer's Disease (CERAD). Part I. Clinical and neuropsychological assessment of Alzheimer's disease. Neurology. 1989;39:1159–65.

    Article  CAS  PubMed  Google Scholar 

  12. Orgass B. Token Test. Deutsche Bearbeitung des Token Test von E. De Renzi und L.A. Vignolo. Weinheim: Beltz Test; 1982.

    Google Scholar 

  13. Huber W, Poeck K, Weniger D, Willmes K. Der Aachener Aphasie Test (AAT). Götingen: Hogrefe; 1983.

    Google Scholar 

  14. Hodges JR, Martinos M, Woollams AM, Patterson K, Adlam AL. Repeat and Point: differentiating semantic dementia from progressive non-fluent aphasia. Cortex. 2008;44:1265–70.

    Article  PubMed  Google Scholar 

  15. Aschenbrenner S, Tucha O, Lange KW. Regensburger Wortflüssigkeits-Test (RWT). Göttingen: Hogrefe; 2000.

    Google Scholar 

  16. Goodglass H, Kaplan E, Barresi B. Boston Diagnostic Aphasia Examination-Third Edition (BDAE-3). Philadelphia: Williams & Wilkins; 2001.

    Google Scholar 

  17. Wechsler D. Wechsler Gedächtnistest—Revidierte Fassung (WMS-R); Deutsche Adaptation der revidierten Fassung der Wechsler Memory Scale. Bern: Verlag Hans Huber; 2000.

    Google Scholar 

  18. Haid T, Martl C, Schubert F, Wenzl M, Kofler M, Saltuari L. Der ‘‘HAMASCH 5 Punkt Test’’- erste Normierungsergebnisse. Zeitschrift für Neuropsychologie. 2000;13:233.

  19. Brand M, Kalbe E, Kessler J. Test zum kognitiven Schätzen (TKS). Manual. Göttingen: Beltz Test; 2002.

    Google Scholar 

  20. Morris JC. The Clinical Dementia Rating (CDR): current version and scoring rules. Neurology. 1993;43:2412–4.

    Article  CAS  PubMed  Google Scholar 

  21. Knopman DS, Kramer JH, Boeve BF, Caselli RJ, Graff-Radford NR, Mendez MF, Miller BL, Mercaldo N. Development of methodology for conducting clinical trials in frontotemporal lobar degeneration. Brain. 2008;131:2957–68.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Steinacker P, Feneberg E, Weishaupt J, Brettschneider J, Tumani H, Andersen PM, von Arnim CA, Bohm S, Kassubek J, Kubisch C, et al. Neurofilaments in the diagnosis of motoneuron diseases: a prospective study on 455 patients. J Neurol Neurosurg Psychiatry. 2016;87:12–20.

    PubMed  Google Scholar 

  23. Bisenius S, Neumann J, Schroeter ML. Validating new diagnostic imaging criteria for primary progressive aphasia via anatomical likelihood estimation meta-analyses. Eur J Neurol. 2016;23:704–12.

    Article  CAS  PubMed  Google Scholar 

  24. Steinacker P, Semler E, Anderl-Straub S, Diehl-Schmid J, Schroeter ML, Uttner I, Foerstl H, Landwehrmeyer B, von Arnim CA, Kassubek J, et al. Neurofilament as a blood marker for diagnosis and monitoring of primary progressive aphasias. Neurology. 2017;88:961–9.

    Article  CAS  PubMed  Google Scholar 

  25. Rubin DB. Multiple imputation for nonresponse in surveys Hoboken. New Jersey: Wiley; 2004.

    Google Scholar 

  26. Cavallo M, Adenzato M, MacPherson SE, Karwig G, Enrici I, Abrahams S. Evidence of social understanding impairment in patients with amyotrophic lateral sclerosis. PLoS One. 2011;6:e25948.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references


We kindly thank all participants and their relatives for their participation as well as all coworkers of the FTLDc study group: Sandrine Bisenius (Leipzig), Emily Feneberg (Ulm), Jennifer Faber (Bonn), Anke Hammer (Erlangen), Sibylle Haefner (Göttingen), Elisabeth Kasper (Rostock), Delia Kurzwelly (Bonn), Johannes Levin (LMU München), Finn Lornsen (Göttingen), Maxine Luley (Homburg/Saar), Manuel Maler (Erlangen), Hans-Peter Müller (Ulm), Timo Oberstein (Erlangen), Hannah Pellkofer (Göttingen), Catharina Prix (LMU München), Isabelle Riederer (TU München), Carola Roßmeier (TU München), Robert Schomburg (Homburg/Saar), Sandra Roeske (Bonn), Sonja Schönecker (LMU München), Philipp Spitzer (Erlangen), Annika Spottke (Bonn), Katharina Stuke (Leipzig), Stefan Teipel (Rostock), Christine von Arnim (Ulm), Petra Wilken (Göttingen), and Jens Wiltfang (Göttingen).


The study was funded by the German Federal Ministry of Education and Research (FTLDc O1GI1007A), the JPND network SOPHIA, BiomarkAPD and PreFrontAls, the EU (Fairpark II), the foundation of the state of Baden-Württemberg (D.3830) and BIU (D.5009).

Availability of data and materials

All data generated and/or analyzed during this study are included in this published article and its supplementary information files.

Author information

Authors and Affiliations




Study conception and design of the study was performed by MO, ES, SAS, IU, JDS, AD, KFA, KFL, HJ, JKA, JKO, ML, JP, AJ, MLS, BL, and ACL. MO and ES drafted the manuscript; all other authors made substantial contributions to the interpretation of data. MO, ES, SAS, IU, JDS, AD, KFA, KFL, HJ, JKA, JKO, ML, JP, AJ, MLS, BL, and ACL contributed to acquisition of data. HJH performed the volumetric analyses. RM, BE, and ES conducted the statistics. All authors revised the manuscript critically for important intellectual content. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Markus Otto.

Ethics declarations

Ethics approval and consent to participate

The study was approved by the local Ethics Committee at the central study center at University of Ulm (proposal number 39/11, 8 March 2011) and by the Ethics Committees at each individual recruiting site and has therefore been performed in accordance with the ethical standards laid down in the 1964 Declaration of Helsinki. Written informed consent was obtained from each patient, participant, caregiver, or legal representative prior to inclusion in the study.

Consent for publication

All authors have approved the manuscript for submission and gave consent for publication.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1: Table S1.

Summary of cognitive test scores and clinical rating scales within 1 year (visit 1 and visit 2) for all PPA subtypes and healthy controls. (PDF 129 kb)

Additional file 2: Table S2.

Summary of volumetric results in MRI within 1 year (visit 1 and visit 2) for all PPA subtypes and healthy controls. (PDF 154 kb)

Additional file 3:

Table S3. Sample size calculation based on the observed mean decline in the balanced and unbalanced version of the sum score within 1 year (visit 1 and visit 2) without imputation procedure. (PDF 21 kb)

Additional file 4:

Table S4. Sample size calculation based on the observed mean decline in FTLD-CDR scores within 1 year (visit 1 and visit 2). (PDF 13 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Semler, E., Anderl-Straub, S., Uttner, I. et al. A language-based sum score for the course and therapeutic intervention in primary progressive aphasia. Alz Res Therapy 10, 41 (2018).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: