Retained capacity for perceptual learning of degraded speech in primary progressive aphasia and Alzheimer’s disease

Hardy, Chris J. D.; Marshall, Charles R.; Bond, Rebecca L.; Russell, Lucy L.; Dick, Katrina; Ariti, Cono; Thomas, David L.; Ross, Sonya J.; Agustus, Jennifer L.; Crutch, Sebastian J.; Rohrer, Jonathan D.; Bamiou, Doris-Eva; Warren, Jason D.

doi:10.1186/s13195-018-0399-2

Research
Open access
Published: 25 July 2018

Retained capacity for perceptual learning of degraded speech in primary progressive aphasia and Alzheimer’s disease

Chris J. D. Hardy¹,
Charles R. Marshall¹,
Rebecca L. Bond¹,
Lucy L. Russell¹,
Katrina Dick¹,
Cono Ariti^1,2,
David L. Thomas^3,4,
Sonya J. Ross¹,
Jennifer L. Agustus¹,
Sebastian J. Crutch¹,
Jonathan D. Rohrer¹,
Doris-Eva Bamiou⁵ &
…
Jason D. Warren¹

Alzheimer's Research & Therapy volume 10, Article number: 70 (2018) Cite this article

3369 Accesses
18 Citations
18 Altmetric
Metrics details

Abstract

Background

Processing of degraded speech is a promising model for understanding communication under challenging listening conditions, core auditory deficits and residual capacity for perceptual learning and cerebral plasticity in major dementias.

Methods

We compared the processing of sine-wave-degraded speech in 26 patients with primary progressive aphasia (non-fluent, semantic, and logopenic variants), 10 patients with typical Alzheimer’s disease and 17 healthy control subjects. Participants were required to identify sine-wave words that were more predictable (three-digit numbers) or less predictable (place names). The change in identification performance within each session indexed perceptual learning. Neuroanatomical associations of degraded speech processing were assessed using voxel-based morphometry.

Results

Patients with non-fluent and logopenic progressive aphasia and typical Alzheimer’s disease showed impaired identification of sine-wave numbers, whereas all syndromic groups showed impaired identification of sine-wave place names. A significant overall identification advantage for numbers over place names was shown by patients with typical Alzheimer’s disease, patients with semantic progressive aphasia and healthy control participants. All syndromic groups showed spontaneous perceptual learning effects for sine-wave numbers. For the combined patient cohort, grey matter correlates were identified across a distributed left hemisphere network extending beyond classical speech-processing cortices.

Conclusions

These findings demonstrate resilience of auditory perceptual learning capacity across dementia syndromes, despite variably impaired perceptual decoding of degraded speech and reduced predictive integration of semantic knowledge. This work has implications for the neurobiology of dynamic sensory processing and plasticity in neurodegenerative diseases and for development of novel biomarkers and therapeutic interventions.

Background

Deficits of speech perception have been documented in the three major ‘language-led’ dementias (non-fluent variant primary progressive aphasia [nfvPPA], semantic variant PPA [svPPA] and logopenic variant primary progressive aphasia [lvPPA]) [1,2,3,4,5] as well as in typical Alzheimer’s disease (tAD) [6,7,8,9]. However, the factors that affect speech perception have been less well studied than language output in these diseases. Normal speech perception entails high-fidelity encoding of incoming acoustic data, parsing of speech from extraneous noises and integration with prior expectations [10]. Healthy listeners rapidly and automatically adapt to speech degradation under challenging listening conditions, based on prior auditory experience [11]. This reflects perceptual learning: improved accuracy of perceptual processing following sustained exposure to the auditory stimulus, modulated by ‘top-down’ predictive mechanisms [12, 13].

Perception of degraded speech is impaired in patients with nfvPPA, svPPA and tAD [14, 15], whereas implicit auditory sequence learning and disambiguation of degraded speech are retained in nfvPPA [10, 16], albeit with reduced flexibility in using contextual information. Processing degraded sensory stimuli taxes the functional integrity of cortical circuits [17, 18], and disorders such as PPA and tAD constitute test cases for exposing such effects because they strike auditory processing networks early and relatively selectively. In contrast, perceptual learning reflects functional brain reorganisation or plasticity; it might therefore help compensate for effects of neurodegeneration [19]. Adaptation to degraded speech engages areas (such as sensorimotor cortex) beyond classical language and auditory networks [20, 21] and may be enhanced by cholinergic stimulation [15]. Taken together, these considerations suggest that understanding and perceptual learning of degraded speech might constitute a powerful and sensitive probe of neural network integrity and residual plasticity in neurodegenerative syndromes.

In the present study, we assessed identification of degraded speech and associated auditory perceptual learning in patients with PPA and tAD relative to healthy older individuals. We used sine-wave speech as a model paradigm. Sine-wave speech is a radical perceptual alteration that reduces speech signals to a series of ‘whistles’ that correspond to formant contours, retaining the long-range temporal scaffold of speech but stripped of all spectral detail (examples are available in the additional files). Sine-wave transformation renders speech initially unintelligible, yet induces spontaneous perceptual learning in healthy listeners primed to its linguistic origin [11, 22]. This effect relies on ‘top-down’ perceptual integration of apparently dissimilar acoustic events into a coherent speech-like signal. It is therefore likely a priori to depend on cognitive operations that are instantiated across the language network. To explore the effect of semantic predictability on the ‘top-down’ disambiguation of degraded speech [10], we applied sine-wave manipulations to a verbal category with uniformly high predictability (numbers) and a verbal category in which predictability could be varied on the basis of familiarity (geographical place names). These semantic categories are relatively preserved in svPPA [2, 23], allowing predictive processing mechanisms to be distinguished from semantic disintegration. Neuroanatomical associations of sine-wave speech processing in the patient cohort were assessed using voxel-based morphometry (VBM).

On the basis of available evidence [3,4,5, 10, 15, 24,25,26,27], we hypothesised that PPA syndromes and tAD would be associated with differential impairment of sine-wave speech identification and perceptual learning relative to controls. We predicted that nfvPPA and lvPPA would be associated with more severe perceptual decoding deficits than other groups, but that all groups would show retained adaptation to degraded speech and increased reliance on prior predictability, particularly in svPPA. Drawing on neuroanatomical evidence in the healthy brain and PPA [10, 13, 20, 28,29,30], we further hypothesised that sine-wave speech perception deficits would correlate with grey matter loss in posterior superior temporal and inferior parietal cortices, whereas modulation by prior predictability and perceptual learning effects would correlate with anterior sensorimotor, prefrontal and anterior temporal grey matter.

Methods

Participants

Nine patients with nfvPPA, ten patients with svPPA, seven patients with lvPPA and ten patients with tAD were recruited via a specialist cognitive clinic. Seventeen healthy older individuals with no history of neurological or psychiatric illness also participated in the study. All patients fulfilled current consensus diagnostic criteria either for the relevant PPA syndrome [1] or for Alzheimer’s disease [31]. Syndromic diagnoses were corroborated by a general neuropsychological assessment (Table 1) and brain magnetic resonance imaging (MRI) findings. No patients had radiological evidence of significant co-morbid cerebrovascular disease. Cerebrospinal fluid profiles of tau and beta-amyloid were available for five of the seven patients with lvPPA and were consistent with Alzheimer’s pathology in each case, based on local reference ranges (total tau/beta-amyloid_1–42 ratio > 1). No participant had a history of clinically relevant hearing loss; each participant’s peripheral hearing function was assessed using a previously described pure-tone audiometry protocol [14].

Table 1 Demographic, clinical and general neuropsychological data for the participant groups

Full size table

All participants gave informed consent for their involvement in the study. Ethical approval was granted by the University College London and National Hospital for Neurology and Neurosurgery Research Ethics Committees, in accordance with Declaration of Helsinki guidelines.

Experimental stimuli and procedures

Two lists of spoken words were recorded, corresponding to two experimental conditions: 40 three-digit numbers (e.g., ‘nine hundred and sixty-five’) and 40 geographical place names (e.g., ‘Germany’). To reduce any crossover of perceptual learning effects based on intonational idiosyncrasies of particular speakers [32, 33], numbers were recorded by a young male speaker and place names by a young female speaker, both using a Standard English (southern England) accent. Place names comprised 20 cities and 20 countries, selected such that half of the items were located relatively ‘near’ (i.e., English cities, European countries) and half were more remote (i.e., relatively ‘far’ away: American cities, non-European countries). Inclusion of these ‘near’ and ‘far’ subcategories was intended to modulate the prior predictability of their sine-wave versions because geographical proximity has been shown to determine the relative familiarity of place names [23].

Sine-wave replicas of the natural speech recordings were generated using a procedure reported previously [15] (see Fig. 1). In creating the final stimulus lists, number stimuli were split into 2 blocks of 20 trials: the second block comprised 10 numbers that had been presented in the first block plus 10 numbers presented de novo. This design allowed us to assess the generalisability of any perceptual learning effects beyond the ‘trained’ stimulus set. Examples of stimuli wave files are provided in Additional files 1, 2, 3, and 4.

All stimuli were administered in a quiet room via headphones (ATH-M50x; Audio-Technica®, Leeds, UK) at a comfortable listening level (at least 70 dB) for each participant. The number condition was always presented before the place name condition (because we anticipated that any confounding, non-specific ‘training’ effects from prior sessional exposure would be more relevant to sine-wave number than place name identification [34]. The order of trials was fully randomised within each condition. Before commencing the test, participants were first familiarised with examples of natural spoken numbers and place names and advised that, during the test, similar words would be presented in distorted ‘whistled’ form and that these might initially be difficult to understand but might become easier over the course of the session. Participants were instructed that their task on each trial was to try to repeat and/or write down the distorted word as accurately as possible. During the test, no feedback about performance was given, and no time limits on responses were imposed.

Following the sine-wave conditions, two control conditions were administered to provide a measure of patients’ ability to process the verbal content of the number and geographical stimuli in natural speech. Ability to perceive speech under natural listening conditions was assessed by presenting a list of 10 undistorted three-digit numbers and a list of 16 undistorted place names. On each trial, the participant was required to repeat or transcribe the stimulus, and performance was scored similarly to the respective sine-wave conditions. In addition, geographical semantic knowledge of the 40 places presented during the sine-wave experiment was assessed using a two-alternative forced-choice procedure. For each place name (presented in undistorted form), participants were asked to indicate whether it was a city or a country; for cities, they were then asked to indicate whether it was English or American, and for countries, to indicate whether it was in Europe or elsewhere. An aggregate geographical knowledge score (total possible score of 80) was calculated for each participant. Numbers were scored per digit correct, meaning that each trial had a maximum score of 3, whereas place names were scored as either correct or incorrect.

Analysis of clinical and behavioural data

Clinical and behavioural data were analysed using Stata version 14.0 software (StataCorp, College Station, TX, USA). Each patient group was compared with the healthy control participants using two-tailed, two-sample t tests for continuous variables and chi-square tests for categorical variables.

Analysis of variance (ANOVA) models were used to analyse participant group performance on all sine-wave and experimental control tests. Participants’ background geographical knowledge was analysed using total score on the geographical semantic control task as a dependent variable and participant group as an independent variable, covarying for general cognitive performance (Mini Mental State Examination [MMSE] score) to ensure that any group effects were not attributable simply to disease severity. The total score on this geographical knowledge index was then used as a covariate where indicated in subsequent analyses. In addition, one-tailed one-sample t tests were used within each diagnostic group separately to assess whether the performance difference between ‘near’ and ‘far’ trials was significantly greater than zero. Performance on the control tasks assessing natural speech perception was analysed for place names and numbers separately, incorporating syndromic diagnosis as an independent variable.

Participants’ overall perceptual decoding accuracy (identification) performance in each of the sine-wave speech conditions was analysed by incorporating syndromic diagnosis as an independent variable, with natural speech control task performance and (for place names) geographical semantic control task performance as covariates. The overall group effect of speech predictability (prior place name familiarity) was assessed using the difference between ‘near’ and ‘far’ location scores as the ANOVA dependent variable; post hoc t tests were used to compare this discrepancy index between participant groups. One-tailed one sample t tests were used within each diagnostic group separately to assess whether the difference between ‘near’ and ‘far’ trials was significantly greater than zero. In addition, to assess any condition-specific effects on sine-wave speech processing (and to capture potential variability of such effects between individuals), we calculated a ‘condition discrepancy index’ for each participant, defined as follows: ([score on sine-wave number condition/total score possible] minus [score on sine-wave geographical condition/total score possible]). This discrepancy index was analysed for any overall group effect as the ANOVA dependent variable with syndromic diagnosis as the independent variable; post hoc t tests were used to compare participant groups directly. One-sample t tests were used within each diagnostic group separately to assess whether the condition discrepancy index was significantly different from zero.

To assess change in participants’ performance with increasing exposure to sine-wave speech (auditory perceptual learning), we divided the number condition session into four blocks of ten trials and calculated a ‘perceptual learning index’ for each participant, defined as follows: ([block 4 score {trials 31–40}] minus [block 1 score {trials 1–10}]). Performance differences between initial and final stimulus presentation blocks have previously been shown to capture overall implicit learning of speech-like stimuli [16]. An analogous index was calculated for the place name condition. These learning index data were compared between participant groups using syndromic diagnosis as the ANOVA independent variable with natural speech task performance as a covariate. One-tailed one-sample t tests were used in each participant group separately to assess whether this perceptual learning index was significantly different from zero. Pearson’s correlations were used to assess any association of perceptual learning indices with MMSE score or Wechsler Abbreviated Scale of Intelligence (WASI) matrices score (proxies for global cognitive function), separately for sine-wave number and place name conditions in the combined patient cohort. For the sine-wave numbers condition, we also created a familiarity discrepancy index, defined as follows: (score on ‘trained’ [repeat] numbers minus score on ‘untrained’ [de novo] numbers). Participant groups were compared on this index using syndromic diagnosis as the ANOVA independent variable with natural speech task performance as a covariate. A threshold of p < 0.05 was accepted as the criterion for statistical significance in all tests.

Brain image acquisition and analysis

Volumetric brain MRI scans were acquired for 33 patients in a 3-T MAGNETOM Prisma scanner (Siemens Healthcare, Erlangen, Germany) using a 64-channel head-and-neck receiver array coil and a T1-weighted sagittal 3D magnetization-prepared rapid gradient-echo sequence (echo time = 2.93 ms, inversion time = 850 ms, repetition time = 2000 ms), with matrix size 256 × 256 × 208 and voxel dimensions 1.1 × 1.1 × 1.1 mm and overall scan acquisition duration 306 seconds.

For the VBM analysis, patients’ brain images were pre-processed and normalised to Montreal Neurological Institute (MNI) space with isotropic voxel size 1.5 mm using SPM12 software (http://www.fil.ion.ucl.ac.uk/spm/software/spm12/) and the Diffeomorphic Anatomical Registration Through Exponentiated Lie Algebra (DARTEL) toolbox with default parameters in MATLAB R2014b (MathWorks, Natick, MA, USA), in accordance with a procedure we have described previously [14, 30]. Disease-associated atrophy profiles were generated for each patient group separately, again using a previously described protocol [30].

In parallel analyses over the combined patient cohort, separate linear regression models were implemented to assess associations between voxel-wise grey matter volume and (1) total score for sine-wave numbers, (2) total score for sine-wave place names, (3) sine-wave condition discrepancy indices (as defined above) and (4) sine-wave perceptual learning indices (as defined above). Each model incorporated symptom duration (as an index for disease stage), syndromic diagnosis, age and total intracranial volume as nuisance covariates. Statistical parametric maps were generated using an initial threshold p < 0.001 and assessed at peak statistical significance level p < 0.05, after family-wise error (FWE) correction for multiple voxel-wise comparisons within a pre-specified anatomical ROI. This region incorporated cortical areas in the dominant hemisphere that have been implicated in previous studies of degraded speech processing and auditory perceptual learning in the healthy brain (see Additional file 5), comprising the temporoparietal junction (including posterior superior temporal gyrus and sulcus, planum temporale, inferior parietal lobe), anterior temporal lobe and inferior frontal gyrus [28, 29, 35, 36], and an orofacial sensorimotor region encompassing the inferior two-thirds of the pre-central and post-central gyri [20].

Results

Background demographic, neuropsychological and clinical data for all participant groups are presented in Table 1; group performance profiles on the experimental tasks are presented in Table 2.

Table 2 Performance of participant groups on experimental tasks

Full size table

General participant group characteristics

Participant groups did not differ in age, handedness, gender, education or peripheral hearing (all p > 0.05). Patient groups differed on MMSE score (p = 0.034; less severe in svPPA and nfvPPA) and symptom duration (p = 0.026; shorter in nfvPPA and lvPPA).

Experimental behavioural data

Performance on the natural speech control conditions differed significantly between participant groups for numbers [F(4,48) = 8.81, p < 0.001] and place names [F(4,47) = 5.52, p = 0.001]; post hoc comparisons between groups revealed that both the lvPPA and nfvPPA groups performed worse than healthy control participants and other patient groups (all p < 0.05) for both conditions. Performance on the geographical semantic control task was significantly affected by diagnosis [F(4,47) = 3.88, p = 0.008]. The svPPA group performed worse than all other participant groups (all p < 0.05); however, there were no other significant group performance differences on this task (all p > 0.05). In addition, the svPPA group (t = 3.28, p = 0.005) and the lvPPA group (t = 3.06, p = 0.011) showed a performance advantage for knowledge of ‘near’ over ‘far’ places; no other groups showed a significant performance discrepancy between trial types on this control task (all p > 0.05).

Overall accuracy of perceptual decoding (identification) of sine-wave speech differed significantly between participant groups, for both the sine-wave number [F(4,47) = 3.91, p = 0.008] and place name [F(4,46) = 3.88, p = 0.009] conditions. For the sine-wave number condition, post hoc group comparisons revealed that the nfvPPA, lvPPA and tAD groups (but not the svPPA group) performed worse than healthy control participants (all p < 0.05) (Fig. 2). Comparing patient groups, both the lvPPA group (t = − 2.39, p = 0.021) and the nfvPPA group (t = − 2.53, p = 0.015) performed worse than the svPPA group. For the sine-wave place name condition, post hoc group comparisons revealed that all patient groups performed worse than healthy control participants (all p < 0.05), whereas there were no significant performance differences between patient groups. All participant groups showed a significant performance advantage for ‘near’ over ‘far’ place names (all p < 0.05), but the effect of place name type also differed significantly between participant groups [F(4,46) = 6.21, p = 0.004]; the advantage for ‘near’ over ‘far’ place names was significantly higher in each patient group relative to healthy control participants (all p < 0.05), but there were no differences between patient groups.

Comparing overall performance in the sine-wave number versus place name conditions in each participant group separately, the healthy control group (t = 3.70, p = 0.001), tAD group (t = 3.73, p = 0.002) and svPPA group (t = 9.93, p < 0.001) showed a significant performance advantage for identifying sine-wave numbers; there was no significant performance discrepancy between number and place name conditions in the lvPPA or nfvPPA groups (both p > 0.05). The performance condition discrepancy was significantly greater in the svPPA group than in each of the other groups (all p < 0.05). The svPPA group also showed the most individually consistent performance advantage for identifying sine-wave numbers: 80% of svPPA patients showed a performance discrepancy favouring numbers outside the healthy control range versus 20% of patients with tAD, whereas four patients with nfvPPA and three patients with lvPPA showed the reverse discrepancy, favouring sine-wave place name identification (see Fig. 3).

A significant within-group perceptual learning effect (indexed as block 4 score minus block 1 score) was evident for the sine-wave number condition in all participant groups (p < 0.05). This perceptual learning index was positively correlated with block 1 score across participants (p = 0.027). There was no difference between repeat versus new items over the combined participant cohort [F(4,47) = 2.29, p = 0.073] or the combined patient cohort [F(3,31) = 2.80, p = 0.056]. For the sine-wave place name condition, the healthy control, lvPPA and nfvPPA groups showed a significant perceptual learning effect (all p < 0.05), whereas the svPPA and tAD groups showed no such effect. However, there was no significant overall group effect on perceptual learning for the sine-wave number condition [F(4,47) = 1.79, p = 0.147] or place name condition [F(4,46) = 1.80, p = 0.144]. Inspection of individual performance data (Fig. 3) showed that for sine-wave numbers, only two patients (one tAD, one nfvPPA) had a perceptual learning rate below the lower bound seen in the healthy control group, whereas for sine-wave place names, two patients with tAD and one patient with svPPA scored lower than the control range. Perceptual learning index was not significantly correlated with MMSE score or WASI matrices score for either the number condition (MMSE, r = 0.001, p = 0.995; WASI matrices, r = − 0.059, p = 0.731) or place name condition (MMSE, r = 0.029, p = 0.863; WASI matrices, r = 0.109, p = 0.528) across the patient cohort.

Neuroanatomical data

Patient groups showed the anticipated syndromic profiles of disease-related grey matter atrophy; statistical parametric maps are presented in Fig. 4. Statistical parametric maps of grey matter regions associated with performance on the sine-wave speech-processing tasks are shown in Fig. 5; local grey matter maxima associated with performance on each variable of interest are summarised in Table 3. All contrasts describing associations with behavioural data are reported after FWE correction for multiple voxel-wise comparisons within the pre-specified neuroanatomical ROI.

Table 3 Structural neuroanatomical associations of sine-wave speech comprehension in the patient cohort

Full size table

Across the combined patient cohort, identification of sine-wave numbers was significantly positively associated with grey matter volume in the left angular gyrus (p = 0.045_FWE), whereas identification of sine-wave place names was positively associated with grey matter volume in the left planum temporale (p = 0.013_FWE) and left angular gyrus (p = 0.035_FWE). A performance advantage for sine-wave numbers over place names was significantly positively associated with grey matter volume in the left inferior frontal gyrus (p = 0.037_FWE), whereas the inverse condition discrepancy effect (performance advantage for sine-wave place names over numbers) was significantly positively associated with grey matter volume in the left temporal pole (p = 0.018_FWE). The perceptual learning index for sine-wave numbers was significantly positively associated with grey matter volume in the left inferolateral post-central gyrus (two peaks at p = 0.011_FWE, p = 0.021_FWE). There were no significant grey matter associations of perceptual learning for sine-wave place names over the combined patient cohort.

Discussion

In the present study, we have demonstrated deficits of degraded speech perception in major syndromes of PPA and tAD relative to healthy control participants. Syndromic groups were stratified by their overall accuracy in decoding sine-wave speech, with patients with nfvPPA and lvPPA exhibiting the most severe and consistent impairments. The findings in tAD extend previous evidence in this disease [15]. Syndromic profiles were modulated by the prior predictability of verbal content: The svPPA and tAD groups showed a significant advantage for perceiving sine-wave speech with highly predictable content (numbers) compared with less predictable content (place names), whereas within the geographical condition, all groups showed a performance advantage for more familiar (‘near’) over less familiar (‘far’) place names, and this advantage was exaggerated in patient groups compared with the healthy control participants. All syndromic groups exhibited some capacity for auditory perceptual learning following sustained exposure to sine-wave speech: this occurred spontaneously, and there was evidence that the effect generalised to ‘new’ as well as ‘trained’ speech tokens. However, patients with svPPA and tAD showed a perceptual learning effect only for relatively predictable verbal content. These effects were evident after adjusting for performance on natural speech perception and geographical knowledge tasks and therefore not attributable to more generic deficits of phonological or semantic processing in the patient groups. Taken together, the present results imply a degree of residual cerebral plasticity in these syndromes, manifesting as resilient auditory perceptual learning of degraded speech.

Our finding that decoding of degraded speech is impaired in the nfvPPA, lvPPA and tAD groups corroborates recent evidence for core auditory processing deficits in these syndromes [5,6,7,8, 10, 14, 27, 30]. The less uniform decoding deficit identified in the svPPA group in the present study was also anticipated on the basis of previous work; in the healthy brain, decoding of degraded speech engages ‘top-down’ (including semantic) mechanisms that disambiguate the speech stream based on prior predictability [10, 13, 37]. Efficient access to stored semantic ‘priors’ (including ready access to lower-frequency priors) when interpreting degraded speech is likely to become increasingly limiting as verbal content becomes less predictable [26]; less predictable verbal content would place increased demands on semantic processing resources. This would account both for the striking performance advantage for (highly predictable) sine-wave numbers over (less predictable) place names shown in this study by patients with svPPA and for the ‘echo’ of this condition discrepancy effect in the performance advantage for more familiar over less familiar place names shown by all participant groups. This was not simply the consequence of a dwindling semantic lexicon; it was observed after taking geographical semantic competence into account, in keeping with a more specific limitation on the recruitment of semantic mechanisms during predictive processing of speech signals. Furthermore, the tAD group, but not the lvPPA group, showed a significant condition discrepancy effect, suggesting that these Alzheimer variant syndromes may be characterised by separable pathophysiological mechanisms [2].

Participant groups did not differ in perceptual learning of sine-wave numbers. Indeed, the magnitude of the perceptual learning effect across patient groups (including those with marked overall deficits of degraded speech perception) was comparable to that of healthy older control participants (Table 2, Figs. 2 and 3). Moreover, most individual patients in each syndromic group showed perceptual learning effects within the healthy control range. There was no association between general cognitive performance (indexed by MMSE and WASI matrices scores) and perceptual learning index for numbers or place names. Together, these findings argue for dissociable physiological mechanisms mediating the accuracy of degraded speech decoding and adaptation based on sustained, unsupervised exposure to degraded speech, and they suggest that perceptual learning of degraded speech may be relatively resilient to the effects of background cognitive decline. The syndromic profiles in the present study further suggest that perceptual learning is modulated by prior verbal predictability (the svPPA and tAD groups showed the effect for strongly predictable but not less predictable verbal stimuli), in line with current models of degraded speech learning based on minimising prediction errors [13]. Although data in PPA are limited, our findings are consistent with previous work in nfvPPA, indicating that separable mechanisms underpin decoding of sensory detail in degraded speech stimuli, ‘top-down’ predictions about such stimuli and implicit learning based on auditory experience [10, 16]. There are precedents for a dissociation of sensory accuracy and sensory learning or plasticity in other disorders (e.g., developmental dyslexia and amblyopia [38]), with potential substrates at cognitive, neurophysiological and neuroanatomical levels [39]. Speech may be a particularly potent stimulus to expose the component mechanisms of a processing hierarchy. Whereas perceptual decision making on speech and other complex auditory stimuli typically rests on integration of multiple spectrotemporal features, perceptual learning may depend on the extraction of more specific lower-level properties from ‘bottom-up’ sensory data, honed by ‘top-down’ predictions based on prior auditory experience and used in turn to update those predictions [13, 39]. This reciprocal interaction between sensory traffic and predictions could be instantiated on different neuroanatomical scales, ranging from local cortical circuits to large-scale distributed brain networks that could be differentially disrupted by neurodegenerative proteinopathies [10, 17, 40].

The neuroanatomical correlates of degraded speech decoding accuracy and perceptual learning identified in our patient cohort support the behavioural evidence that these processes are at least partly dissociable. Identification of both sine-wave numbers and place names was associated with grey matter volume in left angular gyrus, a region affected by the neurodegenerative pathologies studied here [41, 42] and previously implicated in processing speech under challenging listening conditions in functional neuroimaging and virtual lesion studies in the healthy brain [17, 43,44,45,46]. The present evidence in a patient cohort with variably impaired perception of degraded speech corroborates this previous work and further suggests that integrity of angular gyrus plays a critical role in determining whether degraded speech is disambiguated successfully. However, this region acts as the hub of a distributed processing network. Its functional connectivity and interactions with other modes of the network may be modulated by a number of factors, including output task, semantic context and perceived intelligibility [47]. In line with this, we identified additional neuroanatomical correlates that may mediate the effect of altered verbal predictability of degraded speech (summarised in Table 3). Grey matter in the left planum temporale was correlated with identification of sine-wave place names but not numbers at the prescribed threshold. This region is engaged in parsing the auditory scene under conditions of high computational load [13, 28, 36].

Candidate loci for ‘top-down’ control of perceptual analysis under different conditions of verbal predictability were identified in the left inferior frontal gyrus (for more highly predictable, sine-wave numbers) and left temporal pole (for less predictable sine-wave place names). Our findings do not resolve the mechanisms by which these regions communicate during sine-wave speech decoding, which are likely to differ between dementia syndromes. Disambiguation of degraded speech based on relatively constrained predictive algorithms (such as word verification or number identification) is likely to engage top-down control mechanisms in the inferior frontal cortex, a region that is heavily involved in nfvPPA [10], whereas decoding of ‘novel’ linguistic environments with less predictable verbal content (such as degraded place names) may demand active computation of the ‘best fit’ between incoming speech signal statistics and stored verbal concepts, accessed via the anterior temporal lobe semantic network that is blighted in svPPA [14, 26, 30, 48]. Considered together, our findings suggest that the overall accuracy of degraded speech decoding in these syndromes is likely to depend on a distributed peri-Sylvian network closely overlapping classical language cortices. Both nfvPPA and lvPPA (and less consistently svPPA) have been shown to be associated with atrophy or dysfunction of the dorsal language network, with involvement of anterior and posterior regions extending beyond the zone of maximal atrophy in particular syndromes [2, 5]. Impaired accuracy of sine-wave speech identification might plausibly result from a ‘double-hit’ to inferior frontal and temporoparietal regions previously implicated in decoding speech and other complex auditory signals [4, 5, 30].

A neuroanatomical substrate for perceptual learning of degraded speech (sine-wave numbers) was identified in the inferolateral post-central gyrus. This sector of sensorimotor cortex does not form part of the canonical language network and was not associated with sine-wave speech identification accuracy in this study, consistent with dissociable neural mechanisms for perceptual decision making and perceptual learning of degraded speech. However, this sensorimotor region hosts cortical representations of lips, mouth and tongue that are engaged during speech perception, particularly under difficult listening conditions in which subvocal rehearsal may help to resolve ambiguous speech sounds [13, 20] or in the context of disease processes primarily affecting the auditory cortex [21]. Sensorimotor cortices are relatively spared in PPA syndromes and tAD [1, 49]; we propose that a critical determinant of perceptual learning capacity (cerebral plasticity) is the degree of atrophy (or relative preservation) of these areas. Our findings suggest that some degree of residual neural plasticity is maintained beyond vulnerable language and auditory networks across canonical dementia syndromes. A stronger claim would be that some form of compensatory functional enhancement drives perceptual learning in the face of neurodegenerative pathology. We cannot evaluate this claim on the basis of the present evidence, though we note that no patient group in our study showed increased perceptual learning capacity relative to healthy older control participants. To understand the nature of the observed effects fully will require functional neuroimaging approaches that can address network connectivity and activity changes directly. Functional neuroimaging paradigms based on degraded speech stimuli have been developed in the healthy brain [13, 50], but they have yet to be applied to patients with dementia. The present findings corroborate our previous psychoacoustic work in tAD [15] and the recent demonstration that patients with nfvPPA benefit from retraining strategies for speech production and fluency, with lasting and generalisable improvement of communication function [51]. Our findings raise the further intriguing possibility that the efficacy of such communication interventions may be enhanced by engaging perceptual learning capacity (i.e., residual cerebral plasticity). If we are to exploit this potential, quantitative behavioural and neuroanatomical markers of perceptual learning will be required. Sine-wave speech (and related degraded speech manipulations) may offer a convenient and well-established route to development of relevant plasticity biomarkers.

This study suggests a number of directions for future work. Sine-wave speech served in the present study as a model paradigm for understanding speech under challenging listening conditions. We found that capacity for perceptual learning of this radically degraded speech-like signal is retained across diverse dementia syndromes, despite variably impaired understanding of the signal. Measures of individual responses (Fig. 3) suggest that these stimuli may represent novel markers for assessing and tracking communication function in particular patients and could have therapeutic potential. Dynamic markers of this kind might stratify dementia syndromes but also transcend conventional syndromic boundaries, constituting ‘stress tests’ of speech processing and auditory scene analysis in earlier-stage PPA and tAD and also presenting a target for intervention. This need not await the advent of disease-modifying therapies; combining currently available symptomatic pharmacotherapies (such as cholinesterase inhibitors) with speech retraining might be one rational approach [15, 16]. However, more information is required about the diagnostic sensitivity, specificity and relevance of perceptual measures on degraded speech over the course of disease, based on replication of these findings in larger patient cohorts, correlation with indices of daily life communication functions, and extension to other speech manipulations. In addition, detailed understanding of the pathophysiological mechanisms of degraded speech processing and perceptual learning in neurodegenerative syndromes will require functional neuroimaging and connectivity-based techniques that can capture activity profiles and time-varying interactions between brain regions.

Conclusions

This work has broad neurobiological and clinical implications. Neurobiologically, the findings suggest that neurodegenerative proteinopathies dissect dissociable mechanisms for auditory pattern decoding and adaptation and expose the critical brain substrates for these processes. Clinically, this work forecasts a fresh emphasis on dynamic physiological capacity and functional plasticity in dementia that should motivate novel biomarker development and neurorehabilitation strategies.

Abbreviations

ANOVA:: Analysis of variance
DARTEL:: Diffeomorphic Anatomical Registration Through Exponentiated Lie Algebra
FWE:: Family-wise error
lvPPA:: Logopenic variant primary progressive aphasia
MMSE:: Mini Mental State Examination
MNI:: Montreal Neurological Institute
MRI:: Magnetic resonance imaging
nfvPPA:: Non-fluent variant primary progressive aphasia
PPA:: Primary progressive aphasia
svPPA:: Semantic variant primary progressive aphasia
tAD:: Typical Alzheimer’s disease
VBM:: Voxel-based morphometry

References

Gorno-Tempini ML, Hillis AE, Weintraub S, Kertesz A, Mendez M, Cappa SF, et al. Classification of primary progressive aphasia and its variants. Neurology. 2011;76:1006–14. https://doi.org/10.1212/WNL.0b013e31821103e6.
Article PubMed PubMed Central Google Scholar
Marshall CR, Hardy CJD, Volkmer A, Russell LL, Bond RL, Fletcher PD, et al. Primary progressive aphasia: a clinical approach. J Neurol. 2018;265:1474–90. https://doi.org/10.1007/s00415-018-8762-6.
Article PubMed PubMed Central Google Scholar
Hailstone JC, Ridgway GR, Bartlett JW, Goll JC, Crutch SJ, Warren JD. Accent processing in dementia. Neuropsychologia. 2012;50:2233–44. https://doi.org/10.1016/j.neuropsychologia.2012.05.027.
Article PubMed PubMed Central Google Scholar
Rohrer JD, Sauter D, Scott S, Rossor MN, Warren JD. Receptive prosody in nonfluent primary progressive aphasias. Cortex. 2012;48:308–16. https://doi.org/10.1016/j.cortex.2010.09.004.
Article PubMed PubMed Central Google Scholar
Grube M, Bruffaerts R, Schaeverbeke J, Neyens V, De Weer AS, Seghers A, et al. Core auditory processing deficits in primary progressive aphasia. Brain. 2016;139(Pt 6):1817–29. https://doi.org/10.1093/brain/aww067.
Article PubMed PubMed Central Google Scholar
Goll JC, Kim LG, Ridgway GR, Hailstone JC, Lehmann M, Buckley AH, et al. Impairments of auditory scene analysis in Alzheimer’s disease. Brain. 2012;135:190–200. https://doi.org/10.1093/brain/awr260.
Article PubMed Google Scholar
Golden HL, Nicholas JM, Yong KXX, Downey LE, Schott JM, Mummery CJ, et al. Auditory spatial processing in Alzheimer’s disease. Brain. 2015;138:189–202. https://doi.org/10.1093/brain/awu337.
Article PubMed Google Scholar
Golden HL, Agustus JL, Goll JC, Downey LE, Mummery CJ, Schott JM, et al. Functional neuroanatomy of auditory scene analysis in Alzheimer’s disease. Neuroimage Clin. 2015;7:699–708. https://doi.org/10.1016/j.nicl.2015.02.019.
Article PubMed PubMed Central Google Scholar
Gates GA, Anderson ML, McCurry SM, Feeney MP, Larson EB. Central auditory dysfunction as a harbinger of Alzheimer dementia. Arch Otolaryngol Head Neck Surg. 2011;137:390–5. https://doi.org/10.1001/archoto.2011.28.
Article PubMed PubMed Central Google Scholar
Cope TE, Sohoglu E, Sedley W, Patterson K, Jones PS, Wiggins J, et al. Evidence for causal top-down frontal contributions to predictive processes in speech perception. Nat Commun. 2017;8:2154. https://doi.org/10.1038/s41467-017-01958-7.
Article PubMed PubMed Central CAS Google Scholar
Remez RE, Rubin PE, Pisoni DB, Carell TD. Speech perception without traditional speech cues. Science. 1981;212:947–50. https://doi.org/10.1126/science.7233191.
Article PubMed CAS Google Scholar
Gibson E. Perceptual learning. Annu Rev Psychol. 1963;14:29–56. https://doi.org/10.1146/annurev.ps.14.020163.000333.
Article PubMed CAS Google Scholar
Sohoglu E, Davis MH. Perceptual learning of degraded speech by minimizing prediction error. Proc Natl Acad Sci U S A. 2016;113:E1747–56. https://doi.org/10.1073/pnas.1523266113.
Article PubMed PubMed Central CAS Google Scholar
Hardy CJD, Agustus JL, Marshall CR, Clark CN, Russell LL, Bond RL, et al. Behavioural and neuroanatomical correlates of auditory speech analysis in primary progressive aphasias. Alzheimers Res Ther. 2017;9:53. https://doi.org/10.1186/s13195-017-0278-2.
Article PubMed PubMed Central Google Scholar
Hardy CJD, Hwang YT, Bond RL, Marshall CR, Ridha BH, Crutch SJ, et al. Donepezil enhances understanding of degraded speech in Alzheimer’s disease. Ann Clin Transl Neurol. 2017;4:835–40.
Article PubMed PubMed Central CAS Google Scholar
Cope TE, Wilson B, Robson H, Drinkall R, Dean L, Grube M, et al. Artificial grammar learning in vascular and progressive non-fluent aphasias. Neuropsychologia. 2017;104:201–13. https://doi.org/10.1016/j.neuropsychologia.2017.08.022.
Article PubMed PubMed Central Google Scholar
Cooke SF, Bear MF, Bain A, Hebb D, Wiesel T, Hubel D, et al. How the mechanisms of long-term synaptic potentiation and depression serve experience-dependent plasticity in primary visual cortex. Philos Trans R Soc Lond Ser B Biol Sci. 2013;369:20130284. https://doi.org/10.1098/rstb.2013.0284.
Article CAS Google Scholar
Ross B, Jamali S, Tremblay KL. Plasticity in neuromagnetic cortical responses suggests enhanced auditory object representation. BMC Neurosci. 2013;14:151. https://doi.org/10.1186/1471-2202-14-151.
Article PubMed PubMed Central Google Scholar
Li W. Perceptual learning: use-dependent cortical plasticity. Annu Rev Vis Sci. 2016;2:109–30. https://doi.org/10.1146/annurev-vision-111815-114351.
Article PubMed Google Scholar
Schomers MR, Pulvermüller F. Is the sensorimotor cortex relevant for speech perception and understanding? An integrative review. Front Hum Neurosci. 2016;10:435. https://doi.org/10.3389/fnhum.2016.00435.
Article PubMed PubMed Central Google Scholar
Glick H, Sharma A. Cross-modal plasticity in developmental and age-related hearing loss: clinical implications. Hear Res. 2017;343:191–201. https://doi.org/10.1016/j.heares.2016.08.012.
Article PubMed Google Scholar
Liebenthal E, Binder J, Possing E, Kaufman J, Piorkowski R, Remez R. Auditory and phonetic processing of sinewave speech: behavioral and neural correlates [abstract]. NeuroImage. 2001;13(6 Suppl):S559. https://doi.org/10.1016/S1053-8119(01)91902-0.
Article Google Scholar
Crutch SJ, Warrington EK. Spatial coding of semantic information: knowledge of country and city names depends on their geographical proximity. Brain. 2003;126:1821–9. https://doi.org/10.1093/brain/awg187.
Article PubMed Google Scholar
Goll JC, Crutch SJ, Loo JHY, Rohrer JD, Frost C, Bamiou D-E, et al. Non-verbal sound processing in the primary progressive aphasias. Brain. 2010;133:272–85. https://doi.org/10.1093/brain/awp235.
Article PubMed Google Scholar
Goll JC, Kim LG, Hailstone JC, Lehmann M, Buckley A, Crutch SJ, et al. Auditory object cognition in dementia. Neuropsychologia. 2011;49:2755–65. https://doi.org/10.1016/j.neuropsychologia.2011.06.004.
Article PubMed PubMed Central Google Scholar
Lambon Ralph MA, Jefferies E, Patterson K, Rogers TT. The neural and computational bases of semantic cognition. Nat Rev Neurosci. 2017;18:42–55. https://doi.org/10.1038/nrn.2016.150.
Article CAS Google Scholar
Hardy CJD, Marshall CR, Golden HL, Clark CN, Mummery CJ, Griffiths TD, et al. Hearing and dementia. J Neurol. 2016;263:2339–54. https://doi.org/10.1007/s00415-016-8208-y.
Article PubMed PubMed Central Google Scholar
Griffiths TD, Warren JD. The planum temporale as a computational hub. Trends Neurosci. 2002;25:348–53. https://doi.org/10.1016/S0166-2236(02)02191-4.
Article PubMed CAS Google Scholar
Rauschecker JP, Scott SK. Maps and streams in the auditory cortex: nonhuman primates illuminate human speech processing. Nat Neurosci. 2009;12:718–24. https://doi.org/10.1038/nn.2331.
Article PubMed PubMed Central CAS Google Scholar
Hardy CJD, Agustus JL, Marshall CR, Clark CN, Russell LL, Brotherhood EV, et al. Functional neuroanatomy of speech signal decoding in primary progressive aphasias. Neurobiol Aging. 2017;56:190–201. https://doi.org/10.1016/j.neurobiolaging.2017.04.026.
Article PubMed PubMed Central Google Scholar
Dubois B, Feldman HH, Jacova C, Cummings JL, DeKosky ST, Barberger-Gateau P, et al. Revising the definition of Alzheimer’s disease: a new lexicon. Lancet Neurol. 2010;9:1118–27. https://doi.org/10.1016/S1474-4422(10)70223-4.
Article PubMed Google Scholar
Remez RE, Fellowes JM, Nagel DS. On the perception of similarity among talkers. J Acoust Soc Am. 2007;122:3688–96. https://doi.org/10.1121/1.2799903.
Article PubMed Google Scholar
Souza P, Gehani N, Wright R, McCloy D. The advantage of knowing the talker. J Am Acad Audiol. 2013;24:689–700. https://doi.org/10.3766/jaaa.24.8.6.
Article PubMed PubMed Central Google Scholar
Oba SI, Fu QJ, Galvin JJ 3rd. Digit training in noise can improve cochlear implant users’ speech understanding in noise. Ear Hear. 2011;32:573–81. https://doi.org/10.1097/AUD.0b013e31820fc821.
Article PubMed PubMed Central Google Scholar
Hickok G, Poeppel D. The cortical organization of speech processing. Nat Rev Neurosci. 2007;8:393–402. https://doi.org/10.1038/nrn2113.
Article PubMed CAS Google Scholar
Scott SK, Blank CC, Rosen S, Wise RJS. Identification of a pathway for intelligible speech in the left temporal lobe. Brain. 2000;123:2400–6.
Article PubMed PubMed Central Google Scholar
Davis MH, Johnsrude IS. Hearing speech sounds: top-down influences on the interface between audition and speech perception. Hear Res. 2007;229:132–47.
Article PubMed Google Scholar
Gori S, Facoetti A. Perceptual learning as a possible new approach for remediation and prevention of developmental dyslexia. Vis Res. 2014;99:78–87. https://doi.org/10.1016/j.visres.2013.11.011.
Article PubMed Google Scholar
Ahissar M, Nahum M, Nelken I, Hochstein S. Reverse hierarchies and sensory learning. Philos Trans R Soc B Biol Sci. 2009;364:285–99. https://doi.org/10.1098/rstb.2008.0253.
Article Google Scholar
Warren JD, Rohrer JD, Schott JM, Fox NC, Hardy J, Rossor MN. Molecular nexopathies: a new paradigm of neurodegenerative disease. Trends Neurosci. 2013;36:561–9. https://doi.org/10.1016/j.tins.2013.06.007.
Article PubMed PubMed Central CAS Google Scholar
Bejanin A, Schonhaut DR, La Joie R, Kramer JH, Baker SL, Sosa N, et al. Tau pathology and neurodegeneration contribute to cognitive impairment in Alzheimer’s disease. Brain. 2017;140:3286–300. https://doi.org/10.1093/brain/awx243.
Article PubMed Google Scholar
Giannini LAA, Irwin DJ, McMillan CT, Ash S, Rascovsky K, Wolk DA, et al. Clinical marker for Alzheimer disease pathology in logopenic primary progressive aphasia. Neurology. 2017;88:2276–84. https://doi.org/10.1212/WNL.0000000000004034.
Article PubMed PubMed Central CAS Google Scholar
Shahin AJ, Bishop CW, Miller LM. Neural mechanisms for illusory filling-in of degraded speech. NeuroImage. 2009;44:1133–43. https://doi.org/10.1016/j.neuroimage.2008.09.045.
Article PubMed Google Scholar
Obleser J, Kotz SA. Expectancy constraints in degraded speech modulate the language comprehension network. Cereb Cortex. 2010;20:633–40. https://doi.org/10.1093/cercor/bhp128.
Article PubMed Google Scholar
Golestani N, Hervais-Adelman A, Obleser J, Scott SK. Semantic versus perceptual interactions in neural processing of speech-in-noise. NeuroImage. 2013;79:52–61. https://doi.org/10.1016/J.NEUROIMAGE.2013.04.049.
Article PubMed Google Scholar
Hartwigsen G, Golombek T, Obleser J. Repetitive transcranial magnetic stimulation over left angular gyrus modulates the predictability gain in degraded speech comprehension. Cortex. 2015;68:100–10. https://doi.org/10.1016/j.cortex.2014.08.027.
Article PubMed Google Scholar
Smith JF, Braun AR, Alexander GE, Chen K, Horwitz B. Separating lexical-semantic access from other mnemonic processes in picture-name verification. Front Psychol. 2013;4:706. https://doi.org/10.3389/fpsyg.2013.00706.
Article PubMed PubMed Central Google Scholar
Clark CN, Nicholas JM, Agustus JL, Hardy C, Russell L, Brotherhood E, et al. Auditory conflict and congruence in frontotemporal dementia. Neuropsychologia. 2017;104:144–56.
Article PubMed PubMed Central Google Scholar
Rohrer JD, Ridgway GR, Modat M, Ourselin S, Mead S, Fox NC, et al. Distinct profiles of brain atrophy in frontotemporal lobar degeneration caused by progranulin and tau mutations. NeuroImage. 2010;53:1070–6. https://doi.org/10.1016/j.neuroimage.2009.12.088.
Article PubMed PubMed Central CAS Google Scholar
Dehaene-Lambertz G, Pallier C, Serniclaes W, Sprenger-Charolles L, Jobert A, Dehaene S. Neural correlates of switching from auditory to speech perception. NeuroImage. 2005;24:21–33. https://doi.org/10.1016/j.neuroimage.2004.09.039.
Article PubMed Google Scholar
Henry ML, Hubbard HI, Grasso SM, Mandelli ML, Wilson SM, Sathishkumar MT, et al. Retraining speech production and fluency in non-fluent/agrammatic primary progressive aphasia. Brain. 2018;141:1799–814. https://doi.org/10.1093/brain/awy101.
Article PubMed Google Scholar
Goodglass & Weintraub. Kaplan E, Goodglass H, Weintraub S. Boston Naming Test. Philadelphia: Lea & Febiger. 1983.
Dunn LM & Whetton C. British Picture Vocabulary Scale. Windsor, England: NFER-Nelson. 1997.
Wechsler D. Wechsler memory scale: Revised. San Antonio, TX: The Psychological Corporation. 1987.
Jackson M, Warrington EK. Arithmetic skills in patients with unilateral cerebral lesions. Cortex. 1986;22:611–20
Mckenna P, Warrington EK. Testing for nominal dysphasia. J Neurol Neurosurgery, Psychiatry. 1987;43:781–8.
Folstein M, Folstein S, McHugh P. “Mini-mental state”: a practical method for grading the cognitive state of patients for the clinician. J Psychiatr Res. 1975;12:189–98.
Kay J, Lesser R, Coltheart M. Psycholinguistic Assessments of Language Processing in Aphasia. Hove, UK: Lawrence Erlbaum Associates; 1996.
Warrington EK. Recognition memory test. Windsor: NFER-Nelson; 1984.
Warrington E, Mckenna P, Orpwood L. Single Word Comprehension: A Concrete and Abstract Word Synonym Test. Neuropsychol Rehabil. 1998;8:143–54.
Tombaugh TN. Trail Making Test A and B: Normative data stratified by age and education. Arch Clin Neuropsychol. 2004;19:203–14.
Warrington EK, James M. The Visual Object and Space Perception Battery. Bury St Edmunds, UK: Thames Valley Test Company; 1991.
Wechsler D. Wechsler Adult Intelligence Scale-Revised. New York: Psychological Corporation; 1981.

Download references

Acknowledgements

The authors are grateful to all participants for their involvement. We thank Andrew Clark for technical support during speech recordings and Professor Elizabeth Warrington for helpful discussion.

Funding

The Dementia Research Centre is supported by Alzheimer’s Research UK, Brain Research Trust, and The Wolfson Foundation. This work was supported by the Alzheimer’s Society (AS-PG-16-007), the National Institute for Health Research University College London Hospitals Biomedical Research Centre, the UCL Leonard Wolfson Experimental Neurology Centre (PR/ylr/18575) and the Economic and Social Research Council (ES/K006711/1). Individual authors were supported by the Medical Research Council (PhD Studentship to CJDH and RLB; MRC Clinician Scientist Fellowship to JDR), the Wolfson Foundation (Clinical Research Fellowship to CRM), Alzheimer’s Research UK (ART-SRF2010-3 to SJC) and the Wellcome Trust (091673/Z/10/Z to JDW).

Availability of data and materials

The datasets used and/or analysed during the present study are available from the corresponding author on reasonable request.

Author information

Authors and Affiliations

Dementia Research Centre, Department of Neurodegenerative Disease, UCL Institute of Neurology, Queen Square, London, WC1N 3BG, UK
Chris J. D. Hardy, Charles R. Marshall, Rebecca L. Bond, Lucy L. Russell, Katrina Dick, Cono Ariti, Sonya J. Ross, Jennifer L. Agustus, Sebastian J. Crutch, Jonathan D. Rohrer & Jason D. Warren
London School of Hygiene and Tropical Medicine, London, UK
Cono Ariti
Neuroradiological Academic Unit, Department of Brain Repair and Rehabilitation, UCL Institute of Neurology, London, UK
David L. Thomas
Leonard Wolfson Experimental Neurology Centre, UCL Institute of Neurology, London, UK
David L. Thomas
UCL Ear Institute and UCLH Biomedical Research Centre, National Institute for Health Research, London, UK
Doris-Eva Bamiou

Authors

Chris J. D. Hardy
View author publications
You can also search for this author in PubMed Google Scholar
Charles R. Marshall
View author publications
You can also search for this author in PubMed Google Scholar
Rebecca L. Bond
View author publications
You can also search for this author in PubMed Google Scholar
Lucy L. Russell
View author publications
You can also search for this author in PubMed Google Scholar
Katrina Dick
View author publications
You can also search for this author in PubMed Google Scholar
Cono Ariti
View author publications
You can also search for this author in PubMed Google Scholar
David L. Thomas
View author publications
You can also search for this author in PubMed Google Scholar
Sonya J. Ross
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer L. Agustus
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian J. Crutch
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan D. Rohrer
View author publications
You can also search for this author in PubMed Google Scholar
Doris-Eva Bamiou
View author publications
You can also search for this author in PubMed Google Scholar
Jason D. Warren
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

CJDH and JDW were responsible for the conception and design of the study. CJDH, JDW, CRM, RLB, LLR, KD, CA, DLT, SJR, JLA, SJC, JDR and DEB were responsible for the analysis and interpretation of data. CJDH and JDW were responsible for drafting the manuscript and revising it critically for important intellectual content. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Jason D. Warren.

Ethics declarations

Ethics approval and consent to participate

All participants gave informed consent for their involvement in the study. Ethical approval was granted by the University College London and National Hospital for Neurology and Neurosurgery Research Ethics Committees in accordance with Declaration of Helsinki guidelines.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

Clear place name. Natural speech version of SJR saying “Germany”. (WAV 99 kb)

Additional file 2:

Sine-wave place name. Sine wave speech version of SJR saying “Germany”. (WAV 77 kb)

Additional file 3:

Clear number. Natural speech version of CJDH saying “Nine hundred and sixty five”. (WAV 201 kb)

Additional file 4:

Sine-wave number. Sine-wave speech version of CJDH saying “Nine hundred and sixty five”. (WAV 163 kb)

Additional file 5:

ROIs. Representative brain MRI sections showing the neuroanatomical region (delineated in red) used to correct for multiple voxel-wise comparisons, based on prior anatomical hypotheses (see text). This region comprised the inferior frontal gyrus (triangularis + opercularis), anterior temporal lobe, temporal pole, posterior superior temporal gyrus, planum temporale, angular gyrus, supramarginal gyrus and inferior portions of the pre-central and post-central gyri. (PNG 138 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Hardy, C.J.D., Marshall, C.R., Bond, R.L. et al. Retained capacity for perceptual learning of degraded speech in primary progressive aphasia and Alzheimer’s disease. Alz Res Therapy 10, 70 (2018). https://doi.org/10.1186/s13195-018-0399-2

Download citation

Received: 20 April 2018
Accepted: 27 June 2018
Published: 25 July 2018
DOI: https://doi.org/10.1186/s13195-018-0399-2

Retained capacity for perceptual learning of degraded speech in primary progressive aphasia and Alzheimer’s disease

Abstract

Background

Methods

Results

Conclusions

Background

Methods

Participants

Experimental stimuli and procedures

Analysis of clinical and behavioural data

Brain image acquisition and analysis

Results

General participant group characteristics

Experimental behavioural data

Neuroanatomical data

Discussion

Conclusions

Abbreviations

References

Acknowledgements

Funding

Availability of data and materials

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Publisher’s Note

Additional files

Additional file 1:

Additional file 2:

Additional file 3:

Additional file 4:

Additional file 5:

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Alzheimer's Research & Therapy

Contact us