Music in the Brain: From Listening to PlayingView this Special Issue
Interaction of Musicianship and Aging: A Comparison of Cortical Auditory Evoked Potentials
Objective. The goal of this study was to begin to explore whether the beneficial auditory neural effects of early music training persist throughout life and influence age-related changes in neurophysiological processing of sound. Design. Cortical auditory evoked potentials (CAEPs) elicited by harmonic tone complexes were examined, including P1-N1-P2, mismatch negativity (MMN), and P3a. Study Sample. Data from older adult musicians () and nonmusicians () (ages 55–70 years) were compared to previous data from young adult musicians () and nonmusicians () (ages 18–33 years). Results. P1-N1-P2 amplitudes and latencies did not differ between older adult musicians and nonmusicians; however, MMN and P3a latencies for harmonic tone deviances were earlier for older musicians than older nonmusicians. Comparisons of P1-N1-P2, MMN, and P3a components between older and young adult musicians and nonmusicians suggest that P1 and P2 latencies are significantly affected by age, but not musicianship, while MMN and P3a appear to be more sensitive to effects of musicianship than aging. Conclusions. Findings support beneficial influences of musicianship on central auditory function and suggest a positive interaction between aging and musicianship on the auditory neural system.
The perception of music infuses the human brain with a rich auditory sensory experience and is a cognitively complex task requiring the integration of multiple cortical levels and neural systems. Because of the intense training and skill acquisition a musician receives from an early age, a musician’s brain provides unique opportunities to explore the impact of music perception and music training on neural structural and functional adaptation and development.
Ground-breaking anatomic studies using functional magnetic resonance imaging (fMRI) [1, 2] revealed that musicians compared to nonmusicians have 5% greater cerebellar volume, significantly larger anterior corpus callosum, and increased gray matter volume in the left Heschl’s gyrus and left inferior frontal gyrus. In addition to altered cortical structure, electroencephalographic (EEG) and magnetoencephalographic (MEG) recordings of cortical auditory evoked potentials (CAEPs) parallel anatomic studies showing cortical enlargement of auditory areas important for music perception [3, 4] and reveal superior preattentive auditory sensory memory representations for musicians compared to nonmusicians [5–14]. Specifically, findings suggest that musicians relative to nonmusicians (a) discriminate auditory differences that are undetectable for nonmusicians at a preattentive processing level implying that sensory memory traces containing auditory information may be enhanced by musical expertise , (b) discriminate embedded pitch shifts faster (shorter latencies) within familiar and unfamiliar interval patterns , (c) have larger amplitude responses to changes in relative pitch structure, such as melodic contour and interval , rhythmic deviation , and timbre of expertise instrument , (d) respond faster (shorter latencies) to pitch deviances of harmonic tone complexes [9, 12], and (e) enhanced sensitivity to acoustic changes of harmonic complexes and speech syllables [10, 13, 14]. Interestingly, effects of musicianship on the plasticity of CAEPs can be seen even after short-term (e.g., two weeks) musical training (e.g., ).
These remarkable effects of music training on auditory neural structure and function have been observed from childhood through young adulthood; however, research is limited when it comes to exploring whether these auditory neural effects of music training extend throughout the life of the musician. Given their history of extraordinary exposure to and experience with spectrally and temporally rich auditory stimuli at an early age, it is questioned whether older adult musicians might experience age-related changes in central auditory processing differently than older nonmusicians.
It is a well-known fact that older adults often complain that although they can hear speech, they cannot understand it. Age-related physiological changes typically alter the way in which spectral and temporal information are encoded . Older adults often exhibit neural activation patterns that are qualitatively different and more frontally oriented than those of younger adults [19–21]. These patterns suggest relatively slower neural transit time and altered auditory inhibition/arousal by irrelevant stimuli for older adults and characterize an inefficient aging auditory system that contributes to poor speech understanding in noisy, real-world listening environments . Given that young adult musicians have altered auditory neural structures and superior auditory neural processing for music and speech, is it then possible that older adult musicians may have a more efficient auditory system than their nonmusician cohorts?
Investigations of age-related changes affecting auditory neural processing have used both behavioral and electrophysiological measures (e.g., EEG, MEG). Psychoacoustic measures provide information about the listener’s perception of sound and reflect a conscious attentional process. Recent behavioral studies exploring the rate of age-related decline on peripheral and central auditory processing between older adult musicians and age-comparable nonmusicians (ages 45–65) reported that older musicians demonstrate increased auditory working memory  and less age-related decline for gap-detection and speech-in-noise thresholds than nonmusicians . Findings suggest that musical training may diminish the impact of age-related auditory decline  and, further, that musicians may experience less age-related decline in central auditory processing than nonmusicians . We question what impact, if any, the interaction between musicianship and aging has on preattentive neurophysiological processing of sound for older adults.
Preattentive cortical auditory evoked responses (e.g., P1-N1-P2, MMN, and P3a), the focus of the current study, are particularly suitable when investigating responses of older adults because they reflect the automatic detection and discrimination of acoustic sensory memories prior to attentional focus without being contaminated by attention, motivation, or cognitive demands of the task [24, 25]. CAEPs provide excellent temporal resolution and allow for a noninvasive evaluation of the various stages of auditory processing from preattentive sensory perception to later cognitive levels. To date, CAEP studies comparing older musicians and nonmusicians are unknown.
CAEP studies investigating age-related changes of central auditory processes in older nonmusician listeners have reported similar aging effects on P1-N1-P2 elicited by pure tones  and gap-detection . The P1-N1-P2 complex reflects the physiological detection of audible stimulus energy . In both studies comparing older and younger adults with normal hearing, there was a pattern of larger P1 amplitudes and slower P2 latencies for the older adults. For gap-detection, P1 latency was earlier for older adults than younger; however, there was no P1 latency difference between older and younger adults for detection of pure tones. In neither study did N1 or P2 amplitudes or the N1 latency differ by age group. It may be speculated that larger P1 amplitude and earlier P1 latency as well as longer P2 latencies may represent an adaptive neural response to aging. That is to say, compared to younger adults, older adults may activate or recruit more or different neural resources to increase the efficiency of stimulus detection and inhibition and may have slower processing or extended neural conduction time once the auditory stimulus is detected.
Mismatch negativity (MNN) is a passively elicited negative potential typically considered to be independent of attention and higher cognitive processing . In general, the MMN is thought to be a preperceptual or preattentive central auditory response to an acoustic deviation based on the detection of an auditory regularity in a preceding sound sequence and an automatic sensory memory-based comparison process [30, 31]. Indexing in sensory memory as reflected by the MMN is considered to be a preattentive because it has been shown to occur in the absence of attention and even conscious awareness . Note that, while the MMN can be elicited outside of attention, it can also be influenced by attentional modulation (see  for modeling evidence of temporal attention enhancing the MMN).
While the MMN can occur preattentively, the P3a, which often follows the MMN, is associated with an involuntary and automatic shift of attention to and conscious perception of a deviant or new stimulus [30, 32, 33]. Typically, the P3a is elicited in response to infrequent task-irrelevant stimuli that are unexpected (e.g., pitch “C”) in a sequence of frequently presented standard stimuli (e.g., pitch “A”) within which the listener is actively attending to the detection of infrequent target stimuli (pitch “B”). The infrequent presence of pitch “C” automatically shifts the listener’s attention to that tone, resulting in a P3a.
Age-related studies using CAEPs to elicit MMN and P3a are limited; however, results are consistent. Comparisons of older and younger adults suggest that MMN is affected more by stimulus contrast and presentation rate than age alone [19, 26, 34]. Age alone did not affect the MMN latency or amplitude for stimulus-change detection of frequency  or stimulus duration with a short interstimulus interval (ISI = 0.5 sec); rather, as the time between the stimuli increased, the MMN attenuated more for older listeners than younger suggesting that MMN may reflect the gradual decay of the stimulus trace in the auditory system and that this trace decays faster for older than younger adults [26, 34]. We are aware of only two studies of P3a and aging for auditory stimuli. Those studies indicate that P3a is absent  or delayed  for older as compared to younger adults, indicating that older adults are slower to switch attention to and evaluate distracting auditory stimulus outside of their focused attention.
Our current research sought to examine cortical auditory evoked responses of older musicians and is based on the framework of our earlier research with young adult musicians [9, 10]. It was an exploratory study to investigate the interaction between the aging process and possible effects of early music training and the possible impact of this interaction on the neurophysiological processing of sound. Preattentive CAEPs elicited by harmonic tones are compared between normal hearing older musicians and their nonmusician cohorts. In addition, CAEP data from older musicians and nonmusicians is compared to that of their younger counterparts  to further elucidate effects of age-related changes on central auditory processing. Specifically, the P1-N1-P2 complex was elicited by a standard harmonic tone stimulus to establish the physiological detection of sound at the level of the auditory cortex. MMN and P3a were elicited by small deviant frequencies (i.e., 1.5% and 6% lower in frequency than a standard) of a harmonic tone complex. Based on previous research of the P1-N1-P2 potentials [10, 19, 21] and the assumption that larger P1 amplitude reflects increased neural recruitment for regulating incoming auditory stimuli in older adults, it was predicted that, given their musical expertise and training, older musicians may have smaller P1 amplitudes for harmonic tones than older nonmusicians and similar group latencies. We also predicted a shorter P2 latency for older musicians compared to older nonmusicians, given the general increase in P2 latency with age and reduction effects of musicianship on aging.
Based on previous electrophysiological research among young adult musicians and nonmusicians, it was predicted that older musicians may have shorter MMN and P3a latencies and larger MMN amplitudes than age-comparable nonmusicians for harmonic complexes; however, no significant effects of musicianship were predicted for P3a amplitudes. MMN and P3a amplitude were predicted to increase as the size of frequency deviance increased. Based on the literature described above, the following aging effects were predicted for P1-N1-P2, MMN, and P3a: (1) P1 amplitude would be larger; (2) P1 latency is earlier and P2 latency later for older adults compared to younger; (3) neither P2, N1, or MMN amplitudes nor MMN latency would be affected by age; and (4) P3a amplitude would be smaller and latency later for older compared to younger adults.
Eight older adult musicians (mean age = 62.5 yrs, range = 58–67 yrs) and eight older nonmusicians (mean = 61.4 yrs, range = 55–70 yrs) were recruited from local communities, community music organizations, and the University of South Florida (USF). Musicians averaged 11.5 years of formal music training that began between the ages of 8 and 16 (see Table 1 for musical history). Nonmusicians had fewer than 12 months of music training. For the purpose of this investigation, formal music training refers to a minimum of 6 years of professionally directed and implemented music instruction and technical exercises provided by a professional musician and/or music educator . All participants were right-handed and native speakers of English and had normal pure-tone thresholds (less than or equal to 25 dB HL bilaterally) for the frequencies of interest in the present study (≤3000 Hz). Performance on the Words-in-Noise (WIN) [37, 38] test, in terms of the dB signal-to-noise ratio for 50% correct word recognition in noise, was age-appropriate for all participants. A summary of hearing thresholds and WIN scores, shown in Table 2, indicates that the hearing sensitivity of the two older adult groups was highly similar. None had a history of neurological impairment, absolute pitch ability, exposure to tone languages, or previous participation in psychoacoustic experiments. The study was approved by the USF Institutional Review Board and documented informed consent was obtained from all subjects. Participants were reimbursed $15 per hour of participation.
The identical inclusion and exclusion criteria were used for our earlier recruitment of 40 young adult musicians and 20 nonmusicians . Young adult musicians were between the ages of 18 and 33 (mean age = 22 years) and had an average of 9.8 years (median = 9.18 yrs, range = 5 to 17 yrs) of formal music training. Nonmusicians were between the ages of 20 and 34 (mean age = 23) and had fewer than 12 months of music training.
Harmonic complexes were digitally generated (sampling rate = 50 kHz), controlled, and presented using a Tucker-Davis Technologies (TDT) RP2 Real-Time Processor with model HB 7 headphone buffer. Duration of all stimuli was 200 ms and shaped with a cos2 window to create a 10 ms rise/fall time. Interstimulus interval (ISI) was 500 ms. All stimuli were presented bilaterally via Etymotic Research (ER2) insert earphones at 75 dB SPL.
The present study is a continuation of a series of studies investigating the relationship between pitch perception and vocal production by types of musicians (i.e., vocalists, string instrumentalists, and wind instrumentalists) and nonmusicians . Thus, auditory stimuli consisted of harmonic tone complexes containing fundamental frequencies () that occur within the mid-frequency range of the female vocal register, Hz to 392 Hz (C4 to G4) . The standard tone was G4, Hz, and was chosen from a behavioral task in a previous study because this tone elicited the best overall difference limen for frequency (DLF) across groups . The two deviant harmonic complexes, Hz (HMD1.5), 1.5% difference from the standard tone (an eighth tone difference) and Hz (HMD6), 6% difference from the standard tone (a semitone difference), were selected based on DLFs measured behaviorally in a companion study and represented a range of behavioral performance. In physical terms, the interval between adjacent whole tones is a 12% difference between the fundamental frequencies of each tone. The interval between a reference tone and its semitones is a fundamental frequency difference of roughly 6% (e.g., G4 to # equals 392 Hz to 370 Hz). Each harmonic tone stimulus contained a and three harmonics. The amplitude of each harmonic was divided by its harmonic number to create a natural amplitude contour in the frequency spectrum.
2.3.1. Electroencephalographic Recording
CAEPs were recorded and analyzed using a Compumedics Neuroscan EEG system with a SynAmps 2 amplifier and Neuroscan Scan 4.3 acquisition software. A cap of 64 sintered electrodes was placed on the subject’s head with additional electrodes above and below the left eye and at the outer canthus of each eye to monitor eyeblink activity. The nose served as reference and the forehead served as ground. To minimize any auditory attentive behavior, the participant was comfortably seated in a sound-attenuated booth and instructed to watch a closed-caption movie of choice and to ignore the auditory stimuli. All impedances were kept below 30 kΩ. The acquisition of EEG data was by continuous sampling, recorded at an AD (analog to digital) sampling rate of 1000 Hz and stored on the computer for offline averaging. The raw signal was amplified within a frequency band of 0.05–100 Hz.
The harmonic tone condition was designed as a multideviant oddball paradigm to elicit MMN and P3a. The protocol consisted of one standard tone ( Hz) and two infrequently occurring deviant tones (HMD6 and HMD1.5). The standard tone occurred on a minimum of 75% (1080 minimum) of the trials and the two deviant frequencies occurred on 25% (180 per deviant, 360 total deviants) of the trials. Stimuli were presented in a pseudorandom sequence with at least three standard stimuli separating presentations of deviant stimuli; thus, two deviant stimuli did not occur in succession. The response to the standard stimulus was analyzed to establish physiological detection of the auditory stimuli (P1-N1-P2).
2.3.2. Data Analysis
Offline analysis of the continuous EEG waveforms was conducted using Neuroscan Scan 4.3 Edit software and began with manual artifact rejection. As a precaution for data analysis, the first 10 CAEP responses were omitted from the averaging process to exclude the variation of the N1 amplitude (i.e., the refractoriness) associated with the start of the stimulation sequence [40, 41]. EEG epochs of 700 ms (−100 to 600 ms) were obtained, baseline corrected (−100–0 ms), and averaged separately for the standard and deviant stimuli. To eliminate ocular movement contamination, epochs containing artifacts exceeding ±80 μV in the horizontal and vertical eye channels were rejected from averaging. CAEP waves were digitally band-pass filtered at 0.1–30 Hz with a squared Butterworth zero-phase filter (12 dB/octave roll-off). In order to maximize signal-to-noise ratio at Fz, all of the processed average files for MMN/P3a analysis were individually rereferenced to the mastoids.
P1-N1-P2 response had the largest amplitudes measured at Fz; thus, reported measures and statistical analysis are based on CAEP responses measured from the Fz electrode for the standard stimulus. P1 was identified as the first positive peak occurring between 25 and 90 ms in the group average CAEP waveforms . The N1 was defined as the largest negativity occurring between 70 and 140 ms. The P2 was defined as the largest positivity occurring between 140 and 255 ms. Latency windows of ±25 ms around each amplitude peak for each group were determined and individual peak amplitudes and latencies were quantified within these preselected windows using scripts within the Neuroscan Scan 4.3 Edit software; the script selected the time point with the highest (for P1 and P2) or lowest (for N1) amplitude value within the designated window. Visual inspection of individual waveforms was conducted to ensure that clear peaks did not fall outside of the designated windows for any participant.
The MMN is illustrated by a difference wave obtained by subtracting the averaged CAEP elicited by the standard stimulus from the averaged CAEP elicited by a deviant stimulus . Difference waveforms to illustrate MMN and P3a were calculated for each deviant stimulus condition. The MMN response was largest at electrodes Fz and Cz with the largest amplitudes measured at Fz. Thus, reported measures and statistical analysis are based on CAEP responses measured from Fz. The MMN was verified by polarity inversion at the mastoids prior to rereferencing all individual files to the mastoids. MMN has been shown to invert in polarity at electrodes below the level of the Sylvian fissure . Polarity inversion at the mastoids is an accepted method to verify the MMN response to tonal changes . CAEP amplitudes were quantified by first determining the peak latencies from the grand-average difference waves separately for each deviant as the largest peak between 100 and 300 ms at Fz for MMN . The P3a was chosen as the first positive peak following the MMN.
Grand average group difference waveforms for each harmonic tone deviant stimulus were derived from the Fz electrode for purposes of illustrating MMN and P3a and for selecting latency windows. Latency windows of ±25 ms around each amplitude peak for each group and deviant condition were determined . MMN and P3a peak amplitudes and latencies for each participant were quantified within these preselected windows for each harmonic deviant condition using scripts within the Neuroscan Scan 4.3 Edit software to select the time point with the highest (for P3a) or lowest (for MMN) amplitude value within the designated window. Visual inspection of individual waveforms was conducted to ensure that clear peaks did not fall outside of the designated windows for any participant.
Peak amplitude and latency data were analyzed using IBM SPSS Statistics software (version 21.0). To determine the effect of age and musicianship on P1-N1-P2, MMN, and P3a, univariate analyses of variance (ANOVAs) were conducted on individual mean component latencies and amplitudes. An alpha level of 0.05 was used for all statistical tests.
3.1. Effects of Musicianship within Older Adults
3.1.1. P1-N1-P2 Components
Analysis of P1-N1-P2 was conducted at Fz for the standard harmonic tone stimulus ( Hz). The peaks are designated in Figure 1 for visual inspection and the latency and amplitude means and standard deviations are reported in Table 3. There were no significant group differences of P1, N1, and P2 latencies and amplitudes between older adult musicians and nonmusicians, (1,14) = 1.19, for P1 latency; (1,14) = 0.18, for P1 amplitude; (1,14) = 0.71, for N1 latency; (1,14) = 0.34, for N1 amplitude; (1,14) = 0.11, for P2 latency; and (1,14) = 0.23, for P2 amplitude.
3.1.2. Mismatch Negativity (MMN)
The MMN and P3a were elicited from older musicians and nonmusicians by two harmonic tone pitch deviants and are illustrated by comparing CAEPs to the standard stimulus and CAEPs to the deviant stimuli elicited from the oddball paradigm (Figure 2). The MMN and P3a latency and amplitude means and standard deviations are reported in Table 4. The grand average group CAEP difference waveforms reflect the MMN and P3a for the two frequency deviances (Figure 3). Parallel two-way repeated-measures ANOVAs were conducted on individual mean MMN and P3a latencies and amplitudes for each stimulus condition with the within-subject factor Stimulus (HMD1.5 [ Hz], HMD6 [ Hz]) and the between-subject factor Musicianship (older nonmusicians, older musicians). MMN latencies to harmonic tone pitch deviants were significantly shorter for older adult musicians than older nonmusicians, (1,14) = 7.96, . Further, there was a significant effect of deviant magnitude, (1,14) = 200.68, for MMN latency. Across groups, as the magnitude of the frequency deviance increased, response latency decreased (Figure 3; see Table 4 for mean group latency values). The interaction between Musicianship and Stimulus was also significant, (1,14) = 7.57, for MMN latency. A post hoc analysis of the significant interaction revealed that the effect of Stimulus on MMN latency was significant for both older nonmusicians () and older musicians (). The effect of Musicianship on MMN latency was also significant for both HMD1.5 () and HMD6 ().
In contrast, MMN amplitude did not differ significantly by Musicianship, (1,14) = 1.66, ; however, there was a positive effect of deviant magnitude, (1,14) = 5.35, . For all, as the magnitude of the frequency deviance increased, response amplitude increased. Interaction between Musicianship and Stimulus was not significant, (1,14) = 1.04, for MMN amplitude.
3.1.3. P3a Component
Overall, P3a latency occurred significantly earlier for older adult musicians than older nonmusicians, (1,14) = 12.65, (Figure 3). For all participants, the larger frequency deviance elicited a shorter latency response, (1,14) = 114.90, (Table 4). There was no interaction between group membership and the magnitude of the frequency deviant, (1,14) = 2.64, . P3a amplitude was not dependent on Musicianship, (1,14) = 0.59, , nor magnitude of deviance, (1,14) = 3.07, . The interaction was not significant, (1,14) = 0.50, .
3.2. Effects of Age and Musicianship
Amplitude and latency of CAEPs P1-N1-P2, MMN, and P3a, elicited from older adult musicians and nonmusicians by harmonic tone complexes, were compared to CAEPs previously elicited from young adult musicians and nonmusicians  in univariate ANOVAs with Age (younger, older) and Musicianship (musicians, nonmusicians) as the between-subject factors. CAEP data from older and younger musicians is compared to further examine and elucidate the effects of age-related changes on central auditory processing. All of the previously collected data from young adult musicians and nonmusicians was reanalyzed using methods described in this paper.
3.2.1. P1-N1-P2 Components
P1-N1-P2 latency and amplitude means and standard deviations for all four groups are reported in Table 3. P1 latencies were significantly affected by Age, (1,72) = 14.93, , with earlier latencies for older adults than younger listeners, but was not affected by Musicianship, . There were no significant effects of Age or Musicianship or interaction between the two for P1 amplitude, N1 latency, or N1 amplitude, . P2 latencies were significantly affected by Age, (1,72) = 64.75, , with significantly earlier latencies for younger listeners compared to older listeners, but was not affected by Musicianship or the interaction of Age and Musicianship, . P2 amplitude was not affected by Age, (1,72) = 2.44, , or Musicianship or the interaction of the two, . Averaged group waveforms for P1-N1-P2 components are illustrated in Figure 4.
3.2.2. Mismatch Negativity (MMN)
MMN latency and amplitude means and standard deviations are reported in Table 4. There were significant effects of Age for MMN latency: younger participants had shorter latencies than older participants for HMD1.5, (1,72) = 155.81, , and HMD6, (1,72) = 10.81, . Younger participants also had significantly larger MMN amplitudes than older participants for HMD1.5, (1,72) = 6.21, , and for HMD6 (1,72) = 5.43, . There were also significant effects of Musicianship for MMN latency: musicians had shorter latencies than nonmusicians for HMD1.5, (1,72) = 60.75, , and HMD6, (1,72) = 33.74, . Effects of Musicianship on MMN amplitude for both stimuli were not significant, .
MMN latency effects for HMD1.5 were qualified by an Age × Musicianship interaction, (1,72) = 42.44, . The two young adult groups (YM, YNM) had very similar, short latencies; the OMs had slightly longer latencies and the ONMs had the longest latencies of all. The latencies of the YM and YNM groups were not significantly different, (1,72) = 1.77, , but all other groups showed latencies significantly different from each other, . MMN amplitude effects for HMD6 were qualified by an Age × Musicianship interaction, (1,72) = 4.45, . The largest amplitude overall was found for the YNM group while the amplitudes of the two musician groups (YM, OM) were very similar and smaller than that of the YNM. The ONM group showed the smallest overall amplitude. The difference between the YMs and OMs was not significant, , but YNMs had significantly larger amplitudes than ONMs, (1,72) = 9.15, .
3.2.3. P3a Component
P3a latency and amplitude means and standard deviations are reported in Table 4. There were significant effects of Age on P3a latency for HMD1.5, (1,72) = 55.00, , and on P3a amplitude for HMD6, (1,72) = 8.90, = 0.004. There were also significant effects of Musicianship for P3a latency: musicians had shorter latencies than nonmusicians for HMD1.5, (1,72) = 11.52, , and HMD6, (1,72) = 21.06, . Effects of Musicianship on P3a amplitude for both stimuli were not significant, . P3a latency effects for HMD6 were qualified by an Age × Musicianship interaction, (1,72) = 9.21, . P3a latency for HMD6 was shortest for the OM group, followed closely by the YM group, and longest latencies were observed for the YNM and ONM groups, respectively. P3a latency for the YM group was not significantly different from that of the OM or YNM groups but all other group comparisons were significant.
Musicians from early childhood through young adulthood have been shown to have enhanced auditory sensory-memory and sensitivity to acoustic changes for music and speech; however, there is a paucity of research exploring whether older musicians retain this acoustic processing advantage throughout life. This was an exploratory study using preattentive CAEPs, specifically the P1-N1-P2, MMN, and P3a components, to compare responses between formally trained older musicians and an age-comparable group of nonmusicians. Moreover, to investigate the interaction between aging and musicianship, CAEPs from the older musicians and nonmusicians in the current study were compared to CAEPs of young adult musicians and nonmusicians from an earlier study . As an exploratory study, the sample size for the older adult groups was small (’s = 8) compared to the younger adult group previously published (musicians ; nonmusicians ) [9, 10]; however, the data are useful in showing neurophysiological responses of musicianship that are age-agnostic as well as responses that are age-specific.
Consistent with our hypotheses and similar to our findings among young adults, preattentive neurophysiological responses to music stimuli do distinguish older adult musicians from older nonmusicians. Unique to this investigation is the added comparison of preattentive CAEPs between groups of young and older musicians and nonmusicians and findings that further differentiate the obligatory evoked potentials from the change-detection potentials. It appears that the early obligatory P1-N1-P2 complex may be more affected by aging than musicianship, whereas the inverse is true of the preattentive responses representing the change-detection paradigm. That is, MMN and P3a derived components appear to be influenced by musicianship rather than aging.
4.1. P1-N1-P2 Complex
As predicted, older adults regardless of music background detected the presence of acoustic energy earlier than younger adults (i.e., shorter P1 latency); yet once arousal was triggered, neural conduction was longer from P1 to P2 (i.e., later P2 latency) for older adults than the younger. Findings are consistent with others [19, 21] and suggest perhaps an age-related adaptive neural response that increases neural recruitment to initiate auditory arousal to the presence of acoustic energy. Once aroused, the delayed time-course may be explained by age-related refractory-time differences between younger and older slowing of synchronous neural firing in the central auditory pathways. That is, once the neurons are activated, the older system takes longer to recover before the neurons can fire again. This increase in refractory time results in slower neural travel time.
We expected an effect of both aging and musicianship on P1 amplitude and found neither. We also expected an effect of musicianship on P2 latency in older adults, which was not present in this data set. As for the effect of aging on P1 amplitude, it is well documented that older listeners typically demonstrate an increase in the strength of the obligatory response (P1 amplitude) as a result of an adaptive increase in neural recruitment for the physiological detection of the auditory stimulus [18, 19, 21]. Thus, it was predicted that, as a group, the older listeners would have larger P1 amplitude than the young listeners. As for the influence of musicianship, smaller P1 amplitude has been shown to reflect reduced processing demands . Therefore, it was speculated that highly trained listeners, such as musicians, would be more efficient at inhibiting irrelevant stimuli and regulating familiar stimuli (i.e., harmonic tones) than untrained listeners and, thus, musicians were predicted to have smaller P1 amplitude than nonmusicians. One plausible explanation for the absence of significant differences may be the choice of stimulus. The P1-N1-P2 complex was elicited by a single repetitive harmonic tone. A review of aging studies indicates that both the latency and amplitude patterns of P1, N1, and P2 may be altered depending on the level and spectrum of the stimulus as well as the presentation paradigm (for review see ). It is plausible that an adaptive increase in neural activation (i.e., larger P1 amplitude) may have been unnecessary to perceive this less complex auditory stimulus; whereas, previously reported P1 amplitude findings have been linked to more complex stimuli such as neural detection of gaps in noise , pure tones with and without noise , and detection of consonant-vowel speech syllables . Another possible explanation may be due to the small number of older participants compared to the larger group of young participants. Future research will have larger numbers of older participants more compatible in size to the younger groups.
4.2. MMN and P3a Components
MMN and P3a derived components appear to be more sensitive to neural effects of musicianship than aging. Consistent with our hypotheses, MMN and P3a elicited by small deviant frequencies in harmonic tones distinguished older adult musicians from older nonmusicians. Prior to attentional focus, older musicians discriminated small changes of pitch in harmonic tone complexes faster (i.e., shorter MMN latencies) than older untrained listeners indicating superior sensory memory-based comparison processes and supporting the premise that early and intensive music training may affect central auditory processing throughout life. Consistent with the literature, for both groups of older listeners, as the magnitude of stimulus deviance increased, MMN latency decreased and amplitude increased [5, 9, 24]. As predicted, MMN amplitude did not significantly differ between the two older groups suggesting that older musicians discriminated changes in pitch faster than nonmusicians without an increase in neural activation.
P3a latency and amplitude patterns for the older adult groups were similar to the MMN, but not identical. Older musicians switched attention faster (i.e., shorter P3a latency) to acoustic changes in music without an increase in neural activation. This difference between trained and untrained listeners speaks to the musician’s extensive auditory experience and exposure to multiple, concurrent, and sequential music stimuli. Consequently, even a minor change in pitch elicited a swift shift of attention toward the distraction. Like the MMN latency, all older listeners responded faster to the larger frequency deviance (earlier P3a latency).
To further elucidate the interaction between aging and musicianship, MMN and P3a data were compared among four groups: young musicians (YM), young nonmusicians (YNM), older musicians (OM), and older nonmusicians (ONM). For the larger frequency deviance (i.e., 6% or 22 Hz), musicianship appears to have the advantage; that is, all the musicians (YM and OM) reacted to the pitch change faster with comparable neural effort than the nonmusicians (YNM and ONM). Since attentional modulation of the MMN is possible , this effect of musical expertise on the MMN could in part be due to a difference in the attentional demand of the stimuli since tone stimuli might be inherently more “interesting” for musicians, or musicians might be more used to directing their attention to the sounds.
The YNM had the strongest neural activation (largest MMN amplitude), while the ONM had the smallest. Findings support previous literature suggesting that preattentive change-detection (MMN) may be affected more by stimulus contrast and presentation rate than age alone [19, 26, 34]. However, despite the fact that the older adults had normal pure-tone thresholds up to 3 kHz and that low-frequency stimulus contrasts were used to minimize the confounding effects of age-related high-frequency hearing loss, the possibility remains that physiological discrimination of the 1.5% pitch deviance (6 Hz difference) may have been compromised by aging effects.
P3a latency patterns were similar to those of the MMN. Like the MMN response to the larger harmonic tone deviant, musicianship appears to have prevailed. All musicians (YM and OM) compared to the untrained listeners (YNM and ONM) demonstrated faster preattentive registration of a deviant harmonic stimulus outside attentional focus and shifted their attention toward the distraction. This reflexive auditory-neural response may be a residual adaptation related to music training and experience during childhood. Consistent with the MMN response to the smaller 1.5% pitch deviance, older adults were slower to register this distraction outside of their attentional focus than the younger adults and neural effort was not a distinguishing factor. In summary, MMN and P3a components appear to be more influenced by musicianship rather than aging. Further, it appears that early and intensive music training may alter aging effects on central auditory processing throughout life.
Within the neuroscience of music, this study was an initial foray into the investigation of the effects of aging and musicianship on the auditory neural system. Youth and musicianship appear to be an advantageous combination for efficient and enhanced preattentive auditory neural processing. In terms of physiological detection of sound, P1-N1-P2 appears to be more sensitive to effects of aging rather than musicianship. All older listeners, regardless of musicianship, demonstrated faster auditory arousal to the presence of acoustic energy than all younger listeners suggesting an adaptive increase in neural recruitment; yet once arousal was initiated, neural conduction of acoustic energy was slower for the older adults than younger. In terms of automatic acoustic discrimination, MMN and P3a appear to be more affected by musicianship than aging. All musicians, regardless of age group, demonstrated a pattern of enhanced auditory sensory-memory-based comparison processes for harmonic tone stimuli and exceptional sensitivity to and involuntary distraction by an acoustic change of music. Findings suggest that musicianship has beneficial neurophysiological consequences on central auditory processing throughout life, and, further, some of these neurophysiological effects may be independent of age-related changes.
The lifelong ability to adapt to environmental demands and sensory stimulation is based on the dynamic capacity of the human brain to modify and alter its structure and function. Formal music training has been shown to facilitate and enhance encoding of the acoustic signal, shape subcortical, and early cortical stages of acoustic perception and may retard age-related neural changes and facilitate adaptive neural function. The benefits of music training are socially, clinically, and educationally relevant. Music training and practice may be used as an educational tool or a rehabilitative strategy to facilitate neurophysiological processing of sound. The possibilities for off-setting age-related physiological changes of the auditory neural system through music training have far-reaching effects not only for the field of neuroscience and music education, but also for gerontology, speech-language pathology, and hearing science. Regardless of the remarkable technological advances in hearing devices such as programmable digital hearing aids and cochlear implants, none can duplicate our original auditory system. Consequently, it is essential to investigate other means by which our auditory abilities may be enhanced and protected.
This was an introductory study to explore possibilities that early music training may influence auditory processing in later life and to assess the methodological challenges faced when working with an older population. Implications from this study have been interpreted with caution and generalizations were kept to a minimum due to the small sample of older musicians and the intrasubject variability that occurs when working with older adults. Future recommendations for studies of older adults include recruiting larger numbers to offset individual variability and allowing for frequent breaks during the experiment to offset participant fatigue. To further elucidate the effects of aging and musicianship, it would be beneficial to include higher-level listening tasks using more complex auditory stimuli such as speech stimuli and to also include psychoacoustic measures of auditory perception. Finally, while the information presented suggests that formal music training and extensive auditory sensory exposure facilitate neurophysiological responses throughout life, the influence of genetic factors and inherent musical ability cannot be dismissed. It is yet unknown whether musical abilities and cortical structural differences are due to learning or whether these differences reflect innate abilities and capacities that are advanced by early exposure to music.
|ANOVA:||Analysis of variance|
|CAEPs:||Cortical auditory evoked potentials|
|DLF:||Difference limen for frequency|
|fMRI:||Functional magnetic resonance imaging|
|HMD1.5:||Stimulus with 1.5% difference from the standard tone|
|HMD6:||Stimulus with 6% difference from the standard tone|
|USF:||University of South Florida|
Portions of the work described here were presented as follows: (1) D. Nikjeh and J. Lister (2012): Preattentive CAEPs: Exploring Effects of Musicianship and Aging in American Speech-Language-Hearing Association conference, Atlanta, GA; (2) J. Lister, J. Edwards, J. O’Brien, D. Nikjeh, and V. Williams (2011): Effects of Auditory Training and Music Background: A CAEP Study of Older Adults in Aging and Speech Communication—An International and Interdisciplinary Conference, Bloomington, IN; and (3) D. Nikjeh, J. Lister, and K. Weinbaum (2010): Aging Musicians: Influence of Early Music Training on Cortical Auditory Evoked Potentials in AHEAD III (Assessment of Hearing in the Elderly: Aging and Degeneration-Integration through Immediate Intervention) Adult Hearing Screening conference, Cernobbio, Italy.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
G. Schlaug, “The brain of musicians: a model for functional and structural adaptation,” Annals of the New York Academy of Sciences, vol. 930, pp. 281–299, 2001.View at: Google Scholar
C. Gaser and G. Schlaug, “Brain structures differ between musicians and non-musicians,” The Journal of Neuroscience, vol. 23, no. 27, pp. 9240–9245, 2003.View at: Google Scholar
C. Pantev, A. Engelien, V. Candia, and T. Elbert, “Representational cortex in musicians: plastic alterations in response to musical practice,” Annals of the New York Academy of Sciences, vol. 930, pp. 300–314, 2001.View at: Google Scholar
S. Bertoli, J. Smurzynski, and R. Probst, “Effects of age, age-related hearing loss, and contralateral cafeteria noise on the discrimination of small frequency changes: psychoacoustic and electrophysiological measures,” Journal of the Association for Research in Otolaryngology, vol. 6, no. 3, pp. 207–222, 2005.View at: Publisher Site | Google Scholar
H. Menning, L. Roberts, and C. Pantev, “Plastic changes in the auditory cortex induced by intensive frequency discrimination training,” NeuroReport, vol. 11, no. 4, pp. 817–822, 2000.View at: Google Scholar
C. McNamara, Ways to Look at Training and Development Processes: Informal/Formal and Self-Directed/Other-Directed, Free Management Library, 2005, http://www.managementhelp.org/trng_dev/ways/ways.htm.
R. H. Wilson, “Development of a speech-in-multitalker-babble paradigm to assess word-recognition performance,” Journal of the American Academy of Audiology, vol. 14, no. 9, pp. 453–470, 2003.View at: Google Scholar
M. Hirano, Clinical Examination of Voice, Springer, New York, NY, USA, 1981.
M. Tervaniemi, S. Kruck, W. de Baene, E. Schröger, K. Alter, and A. D. Friederici, “Top-down modulation of auditory processing: effects of sound context, musical expertise and attentional focus,” European Journal of Neuroscience, vol. 30, no. 8, pp. 1636–1642, 2009.View at: Publisher Site | Google Scholar