Skip to main content

Retained capacity for perceptual learning of degraded speech in primary progressive aphasia and Alzheimer’s disease



Processing of degraded speech is a promising model for understanding communication under challenging listening conditions, core auditory deficits and residual capacity for perceptual learning and cerebral plasticity in major dementias.


We compared the processing of sine-wave-degraded speech in 26 patients with primary progressive aphasia (non-fluent, semantic, and logopenic variants), 10 patients with typical Alzheimer’s disease and 17 healthy control subjects. Participants were required to identify sine-wave words that were more predictable (three-digit numbers) or less predictable (place names). The change in identification performance within each session indexed perceptual learning. Neuroanatomical associations of degraded speech processing were assessed using voxel-based morphometry.


Patients with non-fluent and logopenic progressive aphasia and typical Alzheimer’s disease showed impaired identification of sine-wave numbers, whereas all syndromic groups showed impaired identification of sine-wave place names. A significant overall identification advantage for numbers over place names was shown by patients with typical Alzheimer’s disease, patients with semantic progressive aphasia and healthy control participants. All syndromic groups showed spontaneous perceptual learning effects for sine-wave numbers. For the combined patient cohort, grey matter correlates were identified across a distributed left hemisphere network extending beyond classical speech-processing cortices.


These findings demonstrate resilience of auditory perceptual learning capacity across dementia syndromes, despite variably impaired perceptual decoding of degraded speech and reduced predictive integration of semantic knowledge. This work has implications for the neurobiology of dynamic sensory processing and plasticity in neurodegenerative diseases and for development of novel biomarkers and therapeutic interventions.


Deficits of speech perception have been documented in the three major ‘language-led’ dementias (non-fluent variant primary progressive aphasia [nfvPPA], semantic variant PPA [svPPA] and logopenic variant primary progressive aphasia [lvPPA]) [1,2,3,4,5] as well as in typical Alzheimer’s disease (tAD) [6,7,8,9]. However, the factors that affect speech perception have been less well studied than language output in these diseases. Normal speech perception entails high-fidelity encoding of incoming acoustic data, parsing of speech from extraneous noises and integration with prior expectations [10]. Healthy listeners rapidly and automatically adapt to speech degradation under challenging listening conditions, based on prior auditory experience [11]. This reflects perceptual learning: improved accuracy of perceptual processing following sustained exposure to the auditory stimulus, modulated by ‘top-down’ predictive mechanisms [12, 13].

Perception of degraded speech is impaired in patients with nfvPPA, svPPA and tAD [14, 15], whereas implicit auditory sequence learning and disambiguation of degraded speech are retained in nfvPPA [10, 16], albeit with reduced flexibility in using contextual information. Processing degraded sensory stimuli taxes the functional integrity of cortical circuits [17, 18], and disorders such as PPA and tAD constitute test cases for exposing such effects because they strike auditory processing networks early and relatively selectively. In contrast, perceptual learning reflects functional brain reorganisation or plasticity; it might therefore help compensate for effects of neurodegeneration [19]. Adaptation to degraded speech engages areas (such as sensorimotor cortex) beyond classical language and auditory networks [20, 21] and may be enhanced by cholinergic stimulation [15]. Taken together, these considerations suggest that understanding and perceptual learning of degraded speech might constitute a powerful and sensitive probe of neural network integrity and residual plasticity in neurodegenerative syndromes.

In the present study, we assessed identification of degraded speech and associated auditory perceptual learning in patients with PPA and tAD relative to healthy older individuals. We used sine-wave speech as a model paradigm. Sine-wave speech is a radical perceptual alteration that reduces speech signals to a series of ‘whistles’ that correspond to formant contours, retaining the long-range temporal scaffold of speech but stripped of all spectral detail (examples are available in the additional files). Sine-wave transformation renders speech initially unintelligible, yet induces spontaneous perceptual learning in healthy listeners primed to its linguistic origin [11, 22]. This effect relies on ‘top-down’ perceptual integration of apparently dissimilar acoustic events into a coherent speech-like signal. It is therefore likely a priori to depend on cognitive operations that are instantiated across the language network. To explore the effect of semantic predictability on the ‘top-down’ disambiguation of degraded speech [10], we applied sine-wave manipulations to a verbal category with uniformly high predictability (numbers) and a verbal category in which predictability could be varied on the basis of familiarity (geographical place names). These semantic categories are relatively preserved in svPPA [2, 23], allowing predictive processing mechanisms to be distinguished from semantic disintegration. Neuroanatomical associations of sine-wave speech processing in the patient cohort were assessed using voxel-based morphometry (VBM).

On the basis of available evidence [3,4,5, 10, 15, 24,25,26,27], we hypothesised that PPA syndromes and tAD would be associated with differential impairment of sine-wave speech identification and perceptual learning relative to controls. We predicted that nfvPPA and lvPPA would be associated with more severe perceptual decoding deficits than other groups, but that all groups would show retained adaptation to degraded speech and increased reliance on prior predictability, particularly in svPPA. Drawing on neuroanatomical evidence in the healthy brain and PPA [10, 13, 20, 28,29,30], we further hypothesised that sine-wave speech perception deficits would correlate with grey matter loss in posterior superior temporal and inferior parietal cortices, whereas modulation by prior predictability and perceptual learning effects would correlate with anterior sensorimotor, prefrontal and anterior temporal grey matter.



Nine patients with nfvPPA, ten patients with svPPA, seven patients with lvPPA and ten patients with tAD were recruited via a specialist cognitive clinic. Seventeen healthy older individuals with no history of neurological or psychiatric illness also participated in the study. All patients fulfilled current consensus diagnostic criteria either for the relevant PPA syndrome [1] or for Alzheimer’s disease [31]. Syndromic diagnoses were corroborated by a general neuropsychological assessment (Table 1) and brain magnetic resonance imaging (MRI) findings. No patients had radiological evidence of significant co-morbid cerebrovascular disease. Cerebrospinal fluid profiles of tau and beta-amyloid were available for five of the seven patients with lvPPA and were consistent with Alzheimer’s pathology in each case, based on local reference ranges (total tau/beta-amyloid1–42 ratio > 1). No participant had a history of clinically relevant hearing loss; each participant’s peripheral hearing function was assessed using a previously described pure-tone audiometry protocol [14].

Table 1 Demographic, clinical and general neuropsychological data for the participant groups

All participants gave informed consent for their involvement in the study. Ethical approval was granted by the University College London and National Hospital for Neurology and Neurosurgery Research Ethics Committees, in accordance with Declaration of Helsinki guidelines.

Experimental stimuli and procedures

Two lists of spoken words were recorded, corresponding to two experimental conditions: 40 three-digit numbers (e.g., ‘nine hundred and sixty-five’) and 40 geographical place names (e.g., ‘Germany’). To reduce any crossover of perceptual learning effects based on intonational idiosyncrasies of particular speakers [32, 33], numbers were recorded by a young male speaker and place names by a young female speaker, both using a Standard English (southern England) accent. Place names comprised 20 cities and 20 countries, selected such that half of the items were located relatively ‘near’ (i.e., English cities, European countries) and half were more remote (i.e., relatively ‘far’ away: American cities, non-European countries). Inclusion of these ‘near’ and ‘far’ subcategories was intended to modulate the prior predictability of their sine-wave versions because geographical proximity has been shown to determine the relative familiarity of place names [23].

Sine-wave replicas of the natural speech recordings were generated using a procedure reported previously [15] (see Fig. 1). In creating the final stimulus lists, number stimuli were split into 2 blocks of 20 trials: the second block comprised 10 numbers that had been presented in the first block plus 10 numbers presented de novo. This design allowed us to assess the generalisability of any perceptual learning effects beyond the ‘trained’ stimulus set. Examples of stimuli wave files are provided in Additional files 1, 2, 3, and 4.

Fig. 1
figure 1

Broadband time-frequency spectrograms of sine-wave and natural speech. Examples of natural speech stimuli are shown in the top panels, and corresponding sine-wave replicas are shown in the bottom panels. The y-axes code frequency (kilohertz), and the x-axes code time (milliseconds). The centre frequencies of the three sine-wave contours track the centre frequencies of the formants in each of the natural stimuli. Depicted are examples of the two types of speech stimuli used in the experiment: three-digit numbers (‘nine hundred and sixty-five’; left panels) and geographical place names (‘Germany’; right panels). Sound recordings of these stimuli are available in Additional files 1, 2, 3, and 4

All stimuli were administered in a quiet room via headphones (ATH-M50x; Audio-Technica®, Leeds, UK) at a comfortable listening level (at least 70 dB) for each participant. The number condition was always presented before the place name condition (because we anticipated that any confounding, non-specific ‘training’ effects from prior sessional exposure would be more relevant to sine-wave number than place name identification [34]. The order of trials was fully randomised within each condition. Before commencing the test, participants were first familiarised with examples of natural spoken numbers and place names and advised that, during the test, similar words would be presented in distorted ‘whistled’ form and that these might initially be difficult to understand but might become easier over the course of the session. Participants were instructed that their task on each trial was to try to repeat and/or write down the distorted word as accurately as possible. During the test, no feedback about performance was given, and no time limits on responses were imposed.

Following the sine-wave conditions, two control conditions were administered to provide a measure of patients’ ability to process the verbal content of the number and geographical stimuli in natural speech. Ability to perceive speech under natural listening conditions was assessed by presenting a list of 10 undistorted three-digit numbers and a list of 16 undistorted place names. On each trial, the participant was required to repeat or transcribe the stimulus, and performance was scored similarly to the respective sine-wave conditions. In addition, geographical semantic knowledge of the 40 places presented during the sine-wave experiment was assessed using a two-alternative forced-choice procedure. For each place name (presented in undistorted form), participants were asked to indicate whether it was a city or a country; for cities, they were then asked to indicate whether it was English or American, and for countries, to indicate whether it was in Europe or elsewhere. An aggregate geographical knowledge score (total possible score of 80) was calculated for each participant. Numbers were scored per digit correct, meaning that each trial had a maximum score of 3, whereas place names were scored as either correct or incorrect.

Analysis of clinical and behavioural data

Clinical and behavioural data were analysed using Stata version 14.0 software (StataCorp, College Station, TX, USA). Each patient group was compared with the healthy control participants using two-tailed, two-sample t tests for continuous variables and chi-square tests for categorical variables.

Analysis of variance (ANOVA) models were used to analyse participant group performance on all sine-wave and experimental control tests. Participants’ background geographical knowledge was analysed using total score on the geographical semantic control task as a dependent variable and participant group as an independent variable, covarying for general cognitive performance (Mini Mental State Examination [MMSE] score) to ensure that any group effects were not attributable simply to disease severity. The total score on this geographical knowledge index was then used as a covariate where indicated in subsequent analyses. In addition, one-tailed one-sample t tests were used within each diagnostic group separately to assess whether the performance difference between ‘near’ and ‘far’ trials was significantly greater than zero. Performance on the control tasks assessing natural speech perception was analysed for place names and numbers separately, incorporating syndromic diagnosis as an independent variable.

Participants’ overall perceptual decoding accuracy (identification) performance in each of the sine-wave speech conditions was analysed by incorporating syndromic diagnosis as an independent variable, with natural speech control task performance and (for place names) geographical semantic control task performance as covariates. The overall group effect of speech predictability (prior place name familiarity) was assessed using the difference between ‘near’ and ‘far’ location scores as the ANOVA dependent variable; post hoc t tests were used to compare this discrepancy index between participant groups. One-tailed one sample t tests were used within each diagnostic group separately to assess whether the difference between ‘near’ and ‘far’ trials was significantly greater than zero. In addition, to assess any condition-specific effects on sine-wave speech processing (and to capture potential variability of such effects between individuals), we calculated a ‘condition discrepancy index’ for each participant, defined as follows: ([score on sine-wave number condition/total score possible] minus [score on sine-wave geographical condition/total score possible]). This discrepancy index was analysed for any overall group effect as the ANOVA dependent variable with syndromic diagnosis as the independent variable; post hoc t tests were used to compare participant groups directly. One-sample t tests were used within each diagnostic group separately to assess whether the condition discrepancy index was significantly different from zero.

To assess change in participants’ performance with increasing exposure to sine-wave speech (auditory perceptual learning), we divided the number condition session into four blocks of ten trials and calculated a ‘perceptual learning index’ for each participant, defined as follows: ([block 4 score {trials 31–40}] minus [block 1 score {trials 1–10}]). Performance differences between initial and final stimulus presentation blocks have previously been shown to capture overall implicit learning of speech-like stimuli [16]. An analogous index was calculated for the place name condition. These learning index data were compared between participant groups using syndromic diagnosis as the ANOVA independent variable with natural speech task performance as a covariate. One-tailed one-sample t tests were used in each participant group separately to assess whether this perceptual learning index was significantly different from zero. Pearson’s correlations were used to assess any association of perceptual learning indices with MMSE score or Wechsler Abbreviated Scale of Intelligence (WASI) matrices score (proxies for global cognitive function), separately for sine-wave number and place name conditions in the combined patient cohort. For the sine-wave numbers condition, we also created a familiarity discrepancy index, defined as follows: (score on ‘trained’ [repeat] numbers minus score on ‘untrained’ [de novo] numbers). Participant groups were compared on this index using syndromic diagnosis as the ANOVA independent variable with natural speech task performance as a covariate. A threshold of p < 0.05 was accepted as the criterion for statistical significance in all tests.

Brain image acquisition and analysis

Volumetric brain MRI scans were acquired for 33 patients in a 3-T MAGNETOM Prisma scanner (Siemens Healthcare, Erlangen, Germany) using a 64-channel head-and-neck receiver array coil and a T1-weighted sagittal 3D magnetization-prepared rapid gradient-echo sequence (echo time = 2.93 ms, inversion time = 850 ms, repetition time = 2000 ms), with matrix size 256 × 256 × 208 and voxel dimensions 1.1 × 1.1 × 1.1 mm and overall scan acquisition duration 306 seconds.

For the VBM analysis, patients’ brain images were pre-processed and normalised to Montreal Neurological Institute (MNI) space with isotropic voxel size 1.5 mm using SPM12 software ( and the Diffeomorphic Anatomical Registration Through Exponentiated Lie Algebra (DARTEL) toolbox with default parameters in MATLAB R2014b (MathWorks, Natick, MA, USA), in accordance with a procedure we have described previously [14, 30]. Disease-associated atrophy profiles were generated for each patient group separately, again using a previously described protocol [30].

In parallel analyses over the combined patient cohort, separate linear regression models were implemented to assess associations between voxel-wise grey matter volume and (1) total score for sine-wave numbers, (2) total score for sine-wave place names, (3) sine-wave condition discrepancy indices (as defined above) and (4) sine-wave perceptual learning indices (as defined above). Each model incorporated symptom duration (as an index for disease stage), syndromic diagnosis, age and total intracranial volume as nuisance covariates. Statistical parametric maps were generated using an initial threshold p < 0.001 and assessed at peak statistical significance level p < 0.05, after family-wise error (FWE) correction for multiple voxel-wise comparisons within a pre-specified anatomical ROI. This region incorporated cortical areas in the dominant hemisphere that have been implicated in previous studies of degraded speech processing and auditory perceptual learning in the healthy brain (see Additional file 5), comprising the temporoparietal junction (including posterior superior temporal gyrus and sulcus, planum temporale, inferior parietal lobe), anterior temporal lobe and inferior frontal gyrus [28, 29, 35, 36], and an orofacial sensorimotor region encompassing the inferior two-thirds of the pre-central and post-central gyri [20].


Background demographic, neuropsychological and clinical data for all participant groups are presented in Table 1; group performance profiles on the experimental tasks are presented in Table 2.

Table 2 Performance of participant groups on experimental tasks

General participant group characteristics

Participant groups did not differ in age, handedness, gender, education or peripheral hearing (all p > 0.05). Patient groups differed on MMSE score (p = 0.034; less severe in svPPA and nfvPPA) and symptom duration (p = 0.026; shorter in nfvPPA and lvPPA).

Experimental behavioural data

Performance on the natural speech control conditions differed significantly between participant groups for numbers [F(4,48) = 8.81, p < 0.001] and place names [F(4,47) = 5.52, p = 0.001]; post hoc comparisons between groups revealed that both the lvPPA and nfvPPA groups performed worse than healthy control participants and other patient groups (all p < 0.05) for both conditions. Performance on the geographical semantic control task was significantly affected by diagnosis [F(4,47) = 3.88, p = 0.008]. The svPPA group performed worse than all other participant groups (all p < 0.05); however, there were no other significant group performance differences on this task (all p > 0.05). In addition, the svPPA group (t = 3.28, p = 0.005) and the lvPPA group (t = 3.06, p = 0.011) showed a performance advantage for knowledge of ‘near’ over ‘far’ places; no other groups showed a significant performance discrepancy between trial types on this control task (all p > 0.05).

Overall accuracy of perceptual decoding (identification) of sine-wave speech differed significantly between participant groups, for both the sine-wave number [F(4,47) = 3.91, p = 0.008] and place name [F(4,46) = 3.88, p = 0.009] conditions. For the sine-wave number condition, post hoc group comparisons revealed that the nfvPPA, lvPPA and tAD groups (but not the svPPA group) performed worse than healthy control participants (all p < 0.05) (Fig. 2). Comparing patient groups, both the lvPPA group (t = − 2.39, p = 0.021) and the nfvPPA group (t = − 2.53, p = 0.015) performed worse than the svPPA group. For the sine-wave place name condition, post hoc group comparisons revealed that all patient groups performed worse than healthy control participants (all p < 0.05), whereas there were no significant performance differences between patient groups. All participant groups showed a significant performance advantage for ‘near’ over ‘far’ place names (all p < 0.05), but the effect of place name type also differed significantly between participant groups [F(4,46) = 6.21, p = 0.004]; the advantage for ‘near’ over ‘far’ place names was significantly higher in each patient group relative to healthy control participants (all p < 0.05), but there were no differences between patient groups.

Fig. 2
figure 2

Participant group performance over testing sessions. Data plots of participant group performance over the course of the testing sessions for processing sine-wave replicas of spoken numbers and geographical place names. Values represent mean group scores with SDs for each corresponding trial block (see text for details) in each condition. Controls Healthy control group, lvPPA Patient group with logopenic variant primary progressive aphasia, nfvPPA Patient group with non-fluent variant primary progressive aphasia, svPPA Patient group with semantic variant primary progressive aphasia, tAD Patient group with clinically typical Alzheimer’s disease

Comparing overall performance in the sine-wave number versus place name conditions in each participant group separately, the healthy control group (t = 3.70, p = 0.001), tAD group (t = 3.73, p = 0.002) and svPPA group (t = 9.93, p < 0.001) showed a significant performance advantage for identifying sine-wave numbers; there was no significant performance discrepancy between number and place name conditions in the lvPPA or nfvPPA groups (both p > 0.05). The performance condition discrepancy was significantly greater in the svPPA group than in each of the other groups (all p < 0.05). The svPPA group also showed the most individually consistent performance advantage for identifying sine-wave numbers: 80% of svPPA patients showed a performance discrepancy favouring numbers outside the healthy control range versus 20% of patients with tAD, whereas four patients with nfvPPA and three patients with lvPPA showed the reverse discrepancy, favouring sine-wave place name identification (see Fig. 3).

Fig. 3
figure 3

Individual performance. Scatterplots of individual performance on identification of sine-wave numbers relative to sine-wave place names (left), perceptual learning of sine-wave numbers (middle) and sine-wave place names (right). Sine-wave numbers vs place names (left panel) data were generated using the formula y = (sine-wave numbers score/total score possible) − (sine-wave places score/total score possible). Higher scores therefore indicate an advantage for identifying sine-wave numbers relative to sine-wave place names and vice versa. Perceptual learning data were generated by taking performance in block 1 away from block 4 for the sine-wave number (middle) and sine-wave place name (right) conditions. Red horizontal lines indicate the upper and lower bounds of the healthy control group range. Controls Healthy control group, lvPPA Patient group with logopenic variant primary progressive aphasia, nfvPPA Patient group with non-fluent variant primary progressive aphasia, svPPA Patient group with semantic variant primary progressive aphasia, tAD Patient group with typical Alzheimer’s disease

A significant within-group perceptual learning effect (indexed as block 4 score minus block 1 score) was evident for the sine-wave number condition in all participant groups (p < 0.05). This perceptual learning index was positively correlated with block 1 score across participants (p = 0.027). There was no difference between repeat versus new items over the combined participant cohort [F(4,47) = 2.29, p = 0.073] or the combined patient cohort [F(3,31) = 2.80, p = 0.056]. For the sine-wave place name condition, the healthy control, lvPPA and nfvPPA groups showed a significant perceptual learning effect (all p < 0.05), whereas the svPPA and tAD groups showed no such effect. However, there was no significant overall group effect on perceptual learning for the sine-wave number condition [F(4,47) = 1.79, p = 0.147] or place name condition [F(4,46) = 1.80, p = 0.144]. Inspection of individual performance data (Fig. 3) showed that for sine-wave numbers, only two patients (one tAD, one nfvPPA) had a perceptual learning rate below the lower bound seen in the healthy control group, whereas for sine-wave place names, two patients with tAD and one patient with svPPA scored lower than the control range. Perceptual learning index was not significantly correlated with MMSE score or WASI matrices score for either the number condition (MMSE, r = 0.001, p = 0.995; WASI matrices, r = − 0.059, p = 0.731) or place name condition (MMSE, r = 0.029, p = 0.863; WASI matrices, r = 0.109, p = 0.528) across the patient cohort.

Neuroanatomical data

Patient groups showed the anticipated syndromic profiles of disease-related grey matter atrophy; statistical parametric maps are presented in Fig. 4. Statistical parametric maps of grey matter regions associated with performance on the sine-wave speech-processing tasks are shown in Fig. 5; local grey matter maxima associated with performance on each variable of interest are summarised in Table 3. All contrasts describing associations with behavioural data are reported after FWE correction for multiple voxel-wise comparisons within the pre-specified neuroanatomical ROI.

Fig. 4
figure 4

Statistical parametric maps showing disease-related structural neuroanatomical differences between each patient group and controls. Maps are thresholded at p < 0.001 uncorrected over the whole brain and displayed on representative sections of a group (combined patient cohort) mean T1-weighted brain magnetic resonance image; the plane of each section is indicated using Montreal Neurological Institute (MNI) coordinates, and the left cerebral hemisphere is displayed on the left in coronal sections and on top in axial sections (colour bars code voxel-wise t scores for the relevant atrophy map). nfvPPA Non-fluent variant primary progressive aphasia, svPPA Semantic variant primary progressive aphasia, lvPPA Logopenic variant primary progressive aphasia, tAD Typical Alzheimer’s disease

Fig. 5
figure 5

Voxel-based morphometry results. Statistical parametric maps of regional grey matter volume positively associated with performance on sine-wave speech processing tasks for the combined patient cohort. The top panels show grey matter correlates of sine-wave number and place name identification accuracy; the middle panels, correlates of significant performance discrepancy between the sine-wave conditions (performance advantage for sine-wave numbers or place names); the bottom panels, correlates of the perceptual learning effect over the sine-wave number session (see text for details). Maps are rendered on sections of the group mean T1-weighted brain magnetic resonance image, thresholded at p < 0.001 uncorrected for multiple voxel-wise comparisons over the whole brain for display purposes (areas shown were significant at p < 0.05FWE for multiple comparisons within a pre-specified neuroanatomical ROI; see Table 3 and Additional file 5). The left hemisphere is presented on the left for coronal sections and on top in the axial section; Montreal Neurological Institute (MNI) coordinates for the plane of each section are indicated. The colour bar codes voxel-wise t scores for each map

Table 3 Structural neuroanatomical associations of sine-wave speech comprehension in the patient cohort

Across the combined patient cohort, identification of sine-wave numbers was significantly positively associated with grey matter volume in the left angular gyrus (p = 0.045FWE), whereas identification of sine-wave place names was positively associated with grey matter volume in the left planum temporale (p = 0.013FWE) and left angular gyrus (p = 0.035FWE). A performance advantage for sine-wave numbers over place names was significantly positively associated with grey matter volume in the left inferior frontal gyrus (p = 0.037FWE), whereas the inverse condition discrepancy effect (performance advantage for sine-wave place names over numbers) was significantly positively associated with grey matter volume in the left temporal pole (p = 0.018FWE). The perceptual learning index for sine-wave numbers was significantly positively associated with grey matter volume in the left inferolateral post-central gyrus (two peaks at p = 0.011FWE, p = 0.021FWE). There were no significant grey matter associations of perceptual learning for sine-wave place names over the combined patient cohort.


In the present study, we have demonstrated deficits of degraded speech perception in major syndromes of PPA and tAD relative to healthy control participants. Syndromic groups were stratified by their overall accuracy in decoding sine-wave speech, with patients with nfvPPA and lvPPA exhibiting the most severe and consistent impairments. The findings in tAD extend previous evidence in this disease [15]. Syndromic profiles were modulated by the prior predictability of verbal content: The svPPA and tAD groups showed a significant advantage for perceiving sine-wave speech with highly predictable content (numbers) compared with less predictable content (place names), whereas within the geographical condition, all groups showed a performance advantage for more familiar (‘near’) over less familiar (‘far’) place names, and this advantage was exaggerated in patient groups compared with the healthy control participants. All syndromic groups exhibited some capacity for auditory perceptual learning following sustained exposure to sine-wave speech: this occurred spontaneously, and there was evidence that the effect generalised to ‘new’ as well as ‘trained’ speech tokens. However, patients with svPPA and tAD showed a perceptual learning effect only for relatively predictable verbal content. These effects were evident after adjusting for performance on natural speech perception and geographical knowledge tasks and therefore not attributable to more generic deficits of phonological or semantic processing in the patient groups. Taken together, the present results imply a degree of residual cerebral plasticity in these syndromes, manifesting as resilient auditory perceptual learning of degraded speech.

Our finding that decoding of degraded speech is impaired in the nfvPPA, lvPPA and tAD groups corroborates recent evidence for core auditory processing deficits in these syndromes [5,6,7,8, 10, 14, 27, 30]. The less uniform decoding deficit identified in the svPPA group in the present study was also anticipated on the basis of previous work; in the healthy brain, decoding of degraded speech engages ‘top-down’ (including semantic) mechanisms that disambiguate the speech stream based on prior predictability [10, 13, 37]. Efficient access to stored semantic ‘priors’ (including ready access to lower-frequency priors) when interpreting degraded speech is likely to become increasingly limiting as verbal content becomes less predictable [26]; less predictable verbal content would place increased demands on semantic processing resources. This would account both for the striking performance advantage for (highly predictable) sine-wave numbers over (less predictable) place names shown in this study by patients with svPPA and for the ‘echo’ of this condition discrepancy effect in the performance advantage for more familiar over less familiar place names shown by all participant groups. This was not simply the consequence of a dwindling semantic lexicon; it was observed after taking geographical semantic competence into account, in keeping with a more specific limitation on the recruitment of semantic mechanisms during predictive processing of speech signals. Furthermore, the tAD group, but not the lvPPA group, showed a significant condition discrepancy effect, suggesting that these Alzheimer variant syndromes may be characterised by separable pathophysiological mechanisms [2].

Participant groups did not differ in perceptual learning of sine-wave numbers. Indeed, the magnitude of the perceptual learning effect across patient groups (including those with marked overall deficits of degraded speech perception) was comparable to that of healthy older control participants (Table 2, Figs. 2 and 3). Moreover, most individual patients in each syndromic group showed perceptual learning effects within the healthy control range. There was no association between general cognitive performance (indexed by MMSE and WASI matrices scores) and perceptual learning index for numbers or place names. Together, these findings argue for dissociable physiological mechanisms mediating the accuracy of degraded speech decoding and adaptation based on sustained, unsupervised exposure to degraded speech, and they suggest that perceptual learning of degraded speech may be relatively resilient to the effects of background cognitive decline. The syndromic profiles in the present study further suggest that perceptual learning is modulated by prior verbal predictability (the svPPA and tAD groups showed the effect for strongly predictable but not less predictable verbal stimuli), in line with current models of degraded speech learning based on minimising prediction errors [13]. Although data in PPA are limited, our findings are consistent with previous work in nfvPPA, indicating that separable mechanisms underpin decoding of sensory detail in degraded speech stimuli, ‘top-down’ predictions about such stimuli and implicit learning based on auditory experience [10, 16]. There are precedents for a dissociation of sensory accuracy and sensory learning or plasticity in other disorders (e.g., developmental dyslexia and amblyopia [38]), with potential substrates at cognitive, neurophysiological and neuroanatomical levels [39]. Speech may be a particularly potent stimulus to expose the component mechanisms of a processing hierarchy. Whereas perceptual decision making on speech and other complex auditory stimuli typically rests on integration of multiple spectrotemporal features, perceptual learning may depend on the extraction of more specific lower-level properties from ‘bottom-up’ sensory data, honed by ‘top-down’ predictions based on prior auditory experience and used in turn to update those predictions [13, 39]. This reciprocal interaction between sensory traffic and predictions could be instantiated on different neuroanatomical scales, ranging from local cortical circuits to large-scale distributed brain networks that could be differentially disrupted by neurodegenerative proteinopathies [10, 17, 40].

The neuroanatomical correlates of degraded speech decoding accuracy and perceptual learning identified in our patient cohort support the behavioural evidence that these processes are at least partly dissociable. Identification of both sine-wave numbers and place names was associated with grey matter volume in left angular gyrus, a region affected by the neurodegenerative pathologies studied here [41, 42] and previously implicated in processing speech under challenging listening conditions in functional neuroimaging and virtual lesion studies in the healthy brain [17, 43,44,45,46]. The present evidence in a patient cohort with variably impaired perception of degraded speech corroborates this previous work and further suggests that integrity of angular gyrus plays a critical role in determining whether degraded speech is disambiguated successfully. However, this region acts as the hub of a distributed processing network. Its functional connectivity and interactions with other modes of the network may be modulated by a number of factors, including output task, semantic context and perceived intelligibility [47]. In line with this, we identified additional neuroanatomical correlates that may mediate the effect of altered verbal predictability of degraded speech (summarised in Table 3). Grey matter in the left planum temporale was correlated with identification of sine-wave place names but not numbers at the prescribed threshold. This region is engaged in parsing the auditory scene under conditions of high computational load [13, 28, 36].

Candidate loci for ‘top-down’ control of perceptual analysis under different conditions of verbal predictability were identified in the left inferior frontal gyrus (for more highly predictable, sine-wave numbers) and left temporal pole (for less predictable sine-wave place names). Our findings do not resolve the mechanisms by which these regions communicate during sine-wave speech decoding, which are likely to differ between dementia syndromes. Disambiguation of degraded speech based on relatively constrained predictive algorithms (such as word verification or number identification) is likely to engage top-down control mechanisms in the inferior frontal cortex, a region that is heavily involved in nfvPPA [10], whereas decoding of ‘novel’ linguistic environments with less predictable verbal content (such as degraded place names) may demand active computation of the ‘best fit’ between incoming speech signal statistics and stored verbal concepts, accessed via the anterior temporal lobe semantic network that is blighted in svPPA [14, 26, 30, 48]. Considered together, our findings suggest that the overall accuracy of degraded speech decoding in these syndromes is likely to depend on a distributed peri-Sylvian network closely overlapping classical language cortices. Both nfvPPA and lvPPA (and less consistently svPPA) have been shown to be associated with atrophy or dysfunction of the dorsal language network, with involvement of anterior and posterior regions extending beyond the zone of maximal atrophy in particular syndromes [2, 5]. Impaired accuracy of sine-wave speech identification might plausibly result from a ‘double-hit’ to inferior frontal and temporoparietal regions previously implicated in decoding speech and other complex auditory signals [4, 5, 30].

A neuroanatomical substrate for perceptual learning of degraded speech (sine-wave numbers) was identified in the inferolateral post-central gyrus. This sector of sensorimotor cortex does not form part of the canonical language network and was not associated with sine-wave speech identification accuracy in this study, consistent with dissociable neural mechanisms for perceptual decision making and perceptual learning of degraded speech. However, this sensorimotor region hosts cortical representations of lips, mouth and tongue that are engaged during speech perception, particularly under difficult listening conditions in which subvocal rehearsal may help to resolve ambiguous speech sounds [13, 20] or in the context of disease processes primarily affecting the auditory cortex [21]. Sensorimotor cortices are relatively spared in PPA syndromes and tAD [1, 49]; we propose that a critical determinant of perceptual learning capacity (cerebral plasticity) is the degree of atrophy (or relative preservation) of these areas. Our findings suggest that some degree of residual neural plasticity is maintained beyond vulnerable language and auditory networks across canonical dementia syndromes. A stronger claim would be that some form of compensatory functional enhancement drives perceptual learning in the face of neurodegenerative pathology. We cannot evaluate this claim on the basis of the present evidence, though we note that no patient group in our study showed increased perceptual learning capacity relative to healthy older control participants. To understand the nature of the observed effects fully will require functional neuroimaging approaches that can address network connectivity and activity changes directly. Functional neuroimaging paradigms based on degraded speech stimuli have been developed in the healthy brain [13, 50], but they have yet to be applied to patients with dementia. The present findings corroborate our previous psychoacoustic work in tAD [15] and the recent demonstration that patients with nfvPPA benefit from retraining strategies for speech production and fluency, with lasting and generalisable improvement of communication function [51]. Our findings raise the further intriguing possibility that the efficacy of such communication interventions may be enhanced by engaging perceptual learning capacity (i.e., residual cerebral plasticity). If we are to exploit this potential, quantitative behavioural and neuroanatomical markers of perceptual learning will be required. Sine-wave speech (and related degraded speech manipulations) may offer a convenient and well-established route to development of relevant plasticity biomarkers.

This study suggests a number of directions for future work. Sine-wave speech served in the present study as a model paradigm for understanding speech under challenging listening conditions. We found that capacity for perceptual learning of this radically degraded speech-like signal is retained across diverse dementia syndromes, despite variably impaired understanding of the signal. Measures of individual responses (Fig. 3) suggest that these stimuli may represent novel markers for assessing and tracking communication function in particular patients and could have therapeutic potential. Dynamic markers of this kind might stratify dementia syndromes but also transcend conventional syndromic boundaries, constituting ‘stress tests’ of speech processing and auditory scene analysis in earlier-stage PPA and tAD and also presenting a target for intervention. This need not await the advent of disease-modifying therapies; combining currently available symptomatic pharmacotherapies (such as cholinesterase inhibitors) with speech retraining might be one rational approach [15, 16]. However, more information is required about the diagnostic sensitivity, specificity and relevance of perceptual measures on degraded speech over the course of disease, based on replication of these findings in larger patient cohorts, correlation with indices of daily life communication functions, and extension to other speech manipulations. In addition, detailed understanding of the pathophysiological mechanisms of degraded speech processing and perceptual learning in neurodegenerative syndromes will require functional neuroimaging and connectivity-based techniques that can capture activity profiles and time-varying interactions between brain regions.


This work has broad neurobiological and clinical implications. Neurobiologically, the findings suggest that neurodegenerative proteinopathies dissect dissociable mechanisms for auditory pattern decoding and adaptation and expose the critical brain substrates for these processes. Clinically, this work forecasts a fresh emphasis on dynamic physiological capacity and functional plasticity in dementia that should motivate novel biomarker development and neurorehabilitation strategies.



Analysis of variance


Diffeomorphic Anatomical Registration Through Exponentiated Lie Algebra


Family-wise error


Logopenic variant primary progressive aphasia


Mini Mental State Examination


Montreal Neurological Institute


Magnetic resonance imaging


Non-fluent variant primary progressive aphasia


Primary progressive aphasia


Semantic variant primary progressive aphasia


Typical Alzheimer’s disease


Voxel-based morphometry


  1. Gorno-Tempini ML, Hillis AE, Weintraub S, Kertesz A, Mendez M, Cappa SF, et al. Classification of primary progressive aphasia and its variants. Neurology. 2011;76:1006–14.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Marshall CR, Hardy CJD, Volkmer A, Russell LL, Bond RL, Fletcher PD, et al. Primary progressive aphasia: a clinical approach. J Neurol. 2018;265:1474–90.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Hailstone JC, Ridgway GR, Bartlett JW, Goll JC, Crutch SJ, Warren JD. Accent processing in dementia. Neuropsychologia. 2012;50:2233–44.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Rohrer JD, Sauter D, Scott S, Rossor MN, Warren JD. Receptive prosody in nonfluent primary progressive aphasias. Cortex. 2012;48:308–16.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Grube M, Bruffaerts R, Schaeverbeke J, Neyens V, De Weer AS, Seghers A, et al. Core auditory processing deficits in primary progressive aphasia. Brain. 2016;139(Pt 6):1817–29.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Goll JC, Kim LG, Ridgway GR, Hailstone JC, Lehmann M, Buckley AH, et al. Impairments of auditory scene analysis in Alzheimer’s disease. Brain. 2012;135:190–200.

    Article  PubMed  Google Scholar 

  7. Golden HL, Nicholas JM, Yong KXX, Downey LE, Schott JM, Mummery CJ, et al. Auditory spatial processing in Alzheimer’s disease. Brain. 2015;138:189–202.

    Article  PubMed  Google Scholar 

  8. Golden HL, Agustus JL, Goll JC, Downey LE, Mummery CJ, Schott JM, et al. Functional neuroanatomy of auditory scene analysis in Alzheimer’s disease. Neuroimage Clin. 2015;7:699–708.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Gates GA, Anderson ML, McCurry SM, Feeney MP, Larson EB. Central auditory dysfunction as a harbinger of Alzheimer dementia. Arch Otolaryngol Head Neck Surg. 2011;137:390–5.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Cope TE, Sohoglu E, Sedley W, Patterson K, Jones PS, Wiggins J, et al. Evidence for causal top-down frontal contributions to predictive processes in speech perception. Nat Commun. 2017;8:2154.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  11. Remez RE, Rubin PE, Pisoni DB, Carell TD. Speech perception without traditional speech cues. Science. 1981;212:947–50.

    Article  PubMed  CAS  Google Scholar 

  12. Gibson E. Perceptual learning. Annu Rev Psychol. 1963;14:29–56.

    Article  PubMed  CAS  Google Scholar 

  13. Sohoglu E, Davis MH. Perceptual learning of degraded speech by minimizing prediction error. Proc Natl Acad Sci U S A. 2016;113:E1747–56.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  14. Hardy CJD, Agustus JL, Marshall CR, Clark CN, Russell LL, Bond RL, et al. Behavioural and neuroanatomical correlates of auditory speech analysis in primary progressive aphasias. Alzheimers Res Ther. 2017;9:53.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Hardy CJD, Hwang YT, Bond RL, Marshall CR, Ridha BH, Crutch SJ, et al. Donepezil enhances understanding of degraded speech in Alzheimer’s disease. Ann Clin Transl Neurol. 2017;4:835–40.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  16. Cope TE, Wilson B, Robson H, Drinkall R, Dean L, Grube M, et al. Artificial grammar learning in vascular and progressive non-fluent aphasias. Neuropsychologia. 2017;104:201–13.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Cooke SF, Bear MF, Bain A, Hebb D, Wiesel T, Hubel D, et al. How the mechanisms of long-term synaptic potentiation and depression serve experience-dependent plasticity in primary visual cortex. Philos Trans R Soc Lond Ser B Biol Sci. 2013;369:20130284.

    Article  CAS  Google Scholar 

  18. Ross B, Jamali S, Tremblay KL. Plasticity in neuromagnetic cortical responses suggests enhanced auditory object representation. BMC Neurosci. 2013;14:151.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Li W. Perceptual learning: use-dependent cortical plasticity. Annu Rev Vis Sci. 2016;2:109–30.

    Article  PubMed  Google Scholar 

  20. Schomers MR, Pulvermüller F. Is the sensorimotor cortex relevant for speech perception and understanding? An integrative review. Front Hum Neurosci. 2016;10:435.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Glick H, Sharma A. Cross-modal plasticity in developmental and age-related hearing loss: clinical implications. Hear Res. 2017;343:191–201.

    Article  PubMed  Google Scholar 

  22. Liebenthal E, Binder J, Possing E, Kaufman J, Piorkowski R, Remez R. Auditory and phonetic processing of sinewave speech: behavioral and neural correlates [abstract]. NeuroImage. 2001;13(6 Suppl):S559.

    Article  Google Scholar 

  23. Crutch SJ, Warrington EK. Spatial coding of semantic information: knowledge of country and city names depends on their geographical proximity. Brain. 2003;126:1821–9.

    Article  PubMed  Google Scholar 

  24. Goll JC, Crutch SJ, Loo JHY, Rohrer JD, Frost C, Bamiou D-E, et al. Non-verbal sound processing in the primary progressive aphasias. Brain. 2010;133:272–85.

    Article  PubMed  Google Scholar 

  25. Goll JC, Kim LG, Hailstone JC, Lehmann M, Buckley A, Crutch SJ, et al. Auditory object cognition in dementia. Neuropsychologia. 2011;49:2755–65.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Lambon Ralph MA, Jefferies E, Patterson K, Rogers TT. The neural and computational bases of semantic cognition. Nat Rev Neurosci. 2017;18:42–55.

    Article  CAS  Google Scholar 

  27. Hardy CJD, Marshall CR, Golden HL, Clark CN, Mummery CJ, Griffiths TD, et al. Hearing and dementia. J Neurol. 2016;263:2339–54.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Griffiths TD, Warren JD. The planum temporale as a computational hub. Trends Neurosci. 2002;25:348–53.

    Article  PubMed  CAS  Google Scholar 

  29. Rauschecker JP, Scott SK. Maps and streams in the auditory cortex: nonhuman primates illuminate human speech processing. Nat Neurosci. 2009;12:718–24.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  30. Hardy CJD, Agustus JL, Marshall CR, Clark CN, Russell LL, Brotherhood EV, et al. Functional neuroanatomy of speech signal decoding in primary progressive aphasias. Neurobiol Aging. 2017;56:190–201.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Dubois B, Feldman HH, Jacova C, Cummings JL, DeKosky ST, Barberger-Gateau P, et al. Revising the definition of Alzheimer’s disease: a new lexicon. Lancet Neurol. 2010;9:1118–27.

    Article  PubMed  Google Scholar 

  32. Remez RE, Fellowes JM, Nagel DS. On the perception of similarity among talkers. J Acoust Soc Am. 2007;122:3688–96.

    Article  PubMed  Google Scholar 

  33. Souza P, Gehani N, Wright R, McCloy D. The advantage of knowing the talker. J Am Acad Audiol. 2013;24:689–700.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Oba SI, Fu QJ, Galvin JJ 3rd. Digit training in noise can improve cochlear implant users’ speech understanding in noise. Ear Hear. 2011;32:573–81.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Hickok G, Poeppel D. The cortical organization of speech processing. Nat Rev Neurosci. 2007;8:393–402.

    Article  PubMed  CAS  Google Scholar 

  36. Scott SK, Blank CC, Rosen S, Wise RJS. Identification of a pathway for intelligible speech in the left temporal lobe. Brain. 2000;123:2400–6.

    Article  PubMed  PubMed Central  Google Scholar 

  37. Davis MH, Johnsrude IS. Hearing speech sounds: top-down influences on the interface between audition and speech perception. Hear Res. 2007;229:132–47.

    Article  PubMed  Google Scholar 

  38. Gori S, Facoetti A. Perceptual learning as a possible new approach for remediation and prevention of developmental dyslexia. Vis Res. 2014;99:78–87.

    Article  PubMed  Google Scholar 

  39. Ahissar M, Nahum M, Nelken I, Hochstein S. Reverse hierarchies and sensory learning. Philos Trans R Soc B Biol Sci. 2009;364:285–99.

    Article  Google Scholar 

  40. Warren JD, Rohrer JD, Schott JM, Fox NC, Hardy J, Rossor MN. Molecular nexopathies: a new paradigm of neurodegenerative disease. Trends Neurosci. 2013;36:561–9.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  41. Bejanin A, Schonhaut DR, La Joie R, Kramer JH, Baker SL, Sosa N, et al. Tau pathology and neurodegeneration contribute to cognitive impairment in Alzheimer’s disease. Brain. 2017;140:3286–300.

    Article  PubMed  Google Scholar 

  42. Giannini LAA, Irwin DJ, McMillan CT, Ash S, Rascovsky K, Wolk DA, et al. Clinical marker for Alzheimer disease pathology in logopenic primary progressive aphasia. Neurology. 2017;88:2276–84.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  43. Shahin AJ, Bishop CW, Miller LM. Neural mechanisms for illusory filling-in of degraded speech. NeuroImage. 2009;44:1133–43.

    Article  PubMed  Google Scholar 

  44. Obleser J, Kotz SA. Expectancy constraints in degraded speech modulate the language comprehension network. Cereb Cortex. 2010;20:633–40.

    Article  PubMed  Google Scholar 

  45. Golestani N, Hervais-Adelman A, Obleser J, Scott SK. Semantic versus perceptual interactions in neural processing of speech-in-noise. NeuroImage. 2013;79:52–61.

    Article  PubMed  Google Scholar 

  46. Hartwigsen G, Golombek T, Obleser J. Repetitive transcranial magnetic stimulation over left angular gyrus modulates the predictability gain in degraded speech comprehension. Cortex. 2015;68:100–10.

    Article  PubMed  Google Scholar 

  47. Smith JF, Braun AR, Alexander GE, Chen K, Horwitz B. Separating lexical-semantic access from other mnemonic processes in picture-name verification. Front Psychol. 2013;4:706.

    Article  PubMed  PubMed Central  Google Scholar 

  48. Clark CN, Nicholas JM, Agustus JL, Hardy C, Russell L, Brotherhood E, et al. Auditory conflict and congruence in frontotemporal dementia. Neuropsychologia. 2017;104:144–56.

    Article  PubMed  PubMed Central  Google Scholar 

  49. Rohrer JD, Ridgway GR, Modat M, Ourselin S, Mead S, Fox NC, et al. Distinct profiles of brain atrophy in frontotemporal lobar degeneration caused by progranulin and tau mutations. NeuroImage. 2010;53:1070–6.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  50. Dehaene-Lambertz G, Pallier C, Serniclaes W, Sprenger-Charolles L, Jobert A, Dehaene S. Neural correlates of switching from auditory to speech perception. NeuroImage. 2005;24:21–33.

    Article  PubMed  Google Scholar 

  51. Henry ML, Hubbard HI, Grasso SM, Mandelli ML, Wilson SM, Sathishkumar MT, et al. Retraining speech production and fluency in non-fluent/agrammatic primary progressive aphasia. Brain. 2018;141:1799–814.

    Article  PubMed  Google Scholar 

  52. Goodglass & Weintraub. Kaplan E, Goodglass H, Weintraub S. Boston Naming Test. Philadelphia: Lea & Febiger. 1983.

  53. Dunn LM & Whetton C. British Picture Vocabulary Scale. Windsor, England: NFER-Nelson. 1997.

  54. Wechsler D. Wechsler memory scale: Revised. San Antonio, TX: The Psychological Corporation. 1987.

  55. Jackson M, Warrington EK. Arithmetic skills in patients with unilateral cerebral lesions. Cortex. 1986;22:611–20

  56. Mckenna P, Warrington EK. Testing for nominal dysphasia. J Neurol Neurosurgery, Psychiatry. 1987;43:781–8.

  57. Folstein M, Folstein S, McHugh P. “Mini-mental state”: a practical method for grading the cognitive state of patients for the clinician. J Psychiatr Res. 1975;12:189–98.

  58. Kay J, Lesser R, Coltheart M. Psycholinguistic Assessments of Language Processing in Aphasia. Hove, UK: Lawrence Erlbaum Associates; 1996.

  59. Warrington EK. Recognition memory test. Windsor: NFER-Nelson; 1984.

  60. Warrington E, Mckenna P, Orpwood L. Single Word Comprehension: A Concrete and Abstract Word Synonym Test. Neuropsychol Rehabil. 1998;8:143–54.

  61. Tombaugh TN. Trail Making Test A and B: Normative data stratified by age and education. Arch Clin Neuropsychol. 2004;19:203–14.

  62. Warrington EK, James M. The Visual Object and Space Perception Battery. Bury St Edmunds, UK: Thames Valley Test Company; 1991.

  63. Wechsler D. Wechsler Adult Intelligence Scale-Revised. New York: Psychological Corporation; 1981.

Download references


The authors are grateful to all participants for their involvement. We thank Andrew Clark for technical support during speech recordings and Professor Elizabeth Warrington for helpful discussion.


The Dementia Research Centre is supported by Alzheimer’s Research UK, Brain Research Trust, and The Wolfson Foundation. This work was supported by the Alzheimer’s Society (AS-PG-16-007), the National Institute for Health Research University College London Hospitals Biomedical Research Centre, the UCL Leonard Wolfson Experimental Neurology Centre (PR/ylr/18575) and the Economic and Social Research Council (ES/K006711/1). Individual authors were supported by the Medical Research Council (PhD Studentship to CJDH and RLB; MRC Clinician Scientist Fellowship to JDR), the Wolfson Foundation (Clinical Research Fellowship to CRM), Alzheimer’s Research UK (ART-SRF2010-3 to SJC) and the Wellcome Trust (091673/Z/10/Z to JDW).

Availability of data and materials

The datasets used and/or analysed during the present study are available from the corresponding author on reasonable request.

Author information

Authors and Affiliations



CJDH and JDW were responsible for the conception and design of the study. CJDH, JDW, CRM, RLB, LLR, KD, CA, DLT, SJR, JLA, SJC, JDR and DEB were responsible for the analysis and interpretation of data. CJDH and JDW were responsible for drafting the manuscript and revising it critically for important intellectual content. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Jason D. Warren.

Ethics declarations

Ethics approval and consent to participate

All participants gave informed consent for their involvement in the study. Ethical approval was granted by the University College London and National Hospital for Neurology and Neurosurgery Research Ethics Committees in accordance with Declaration of Helsinki guidelines.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

Clear place name. Natural speech version of SJR saying “Germany”. (WAV 99 kb)

Additional file 2:

Sine-wave place name. Sine wave speech version of SJR saying “Germany”. (WAV 77 kb)

Additional file 3:

Clear number. Natural speech version of CJDH saying “Nine hundred and sixty five”. (WAV 201 kb)

Additional file 4:

Sine-wave number. Sine-wave speech version of CJDH saying “Nine hundred and sixty five”. (WAV 163 kb)

Additional file 5:

ROIs. Representative brain MRI sections showing the neuroanatomical region (delineated in red) used to correct for multiple voxel-wise comparisons, based on prior anatomical hypotheses (see text). This region comprised the inferior frontal gyrus (triangularis + opercularis), anterior temporal lobe, temporal pole, posterior superior temporal gyrus, planum temporale, angular gyrus, supramarginal gyrus and inferior portions of the pre-central and post-central gyri. (PNG 138 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hardy, C.J.D., Marshall, C.R., Bond, R.L. et al. Retained capacity for perceptual learning of degraded speech in primary progressive aphasia and Alzheimer’s disease. Alz Res Therapy 10, 70 (2018).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: