20047084	Compared to high school graduates, adolescents who drop out of school are more likely to have a range of negative outcomes, including lower verbal capacities; however, the true nature of this association is not well-understood. Dropping out of school could have an important effect on reducing verbal skills, or the link between dropping out of school and diminished verbal skills could be a spurious association that is the result of unmeasured confounding variables. The current study tested these two competing perspectives by using propensity-score-matching (PSM) to unpack the association between school dropout and verbal skills among 7,317 respondents from the National Longitudinal Study of Adolescent Health (51% female, 49% male; 62% Caucasian, 38% minority). The results of the PSM models indicated a small yet meaningful statistically significant effect of dropout on verbal skills in adulthood even after taking into account a range of confounders. We conclude by discussing the implications of our results.	t	\N
20236795	To determine the effectiveness of gore-tex medialization thyroplasty for the management of glottic incompetence (GI) in patients with mobile vocal folds. Twenty patients with glottic incompetence (GI) and mobile vocal folds were retrospectively analyzed after gore-tex medialization laryngoplasty. Pre- and postoperative outcome measures including grade, roughness, breathiness, asthenia, strain of the voice (GRBAS), glottal function index (GFI), and voice-related quality of life (VRQOL) were compared to detect surgical effectiveness. Two anesthetic subgroups were identified and compared: general anesthesia, via laryngeal mask airway (LMA) anesthetic, and local anesthesia. Statistically significant differences were identified between pre- and postoperative VRQOL (P<0.0001), GFI (P<0.01), and composite GRBAS (P<0.0001) after a mean follow-up time of 7.8 months. Both the LMA and the local anesthetic subgroups demonstrated similar significance across these measures. GFI and VRQOL scores demonstrate a moderate correlation (ρ=0.71). Perceptual voice quality (GRBAS) correlates slightly better with VRQOL scores (ρ=-0.6; P<0.01) than qualitative measures of glottal function (GFI) (ρ=0.43). Gore-tex thyroplasty provides reliable medium-term improvement in both perceptual and subjective voice parameters in the setting of GI with mobile vocal folds.	t	\N
20388591	Electroencephalogram (EEG) recordings, and especially the Mu-rhythm over the sensorimotor cortex that relates to the activation of the mirror neuron system (MNS), were acquired from two subject groups (orchestral musicians and nonmusicians), in order to explore action representation processes involved in the perception and performance of musical pieces. Two types of stimuli were used, i.e., an auditory one consisting of an excerpt of Beethoven's fifth symphony and a visual one presenting a conductor directing an orchestra performing the same excerpt of the piece. Three tasks were conducted including auditory stimulation, audiovisual stimulation, and visual stimulation only, and the acquired signals were processed using fractal [time-dependent fractal dimension (FD) estimation] and statistical analysis (analysis of variance, Mann-Whitney). Experimental results showed significant differences between the two groups while desychronization of the Mu-rhythm, which can be linked to MNS activation, was observed during all tasks for the musicians' group, as opposed to the nonmusicians' group who exhibited similar response only when the visual stimulus was present. The mobility of the conductor was also correlated to the estimated FD signals, showing significantly higher correlation for the case of musicians compared to nonmusicians' one. The present study sheds light upon the difference in action representation in auditory perception between musicians and nonmusicians and paves the way for better comprehension of the underlying mechanisms of the MNS.	t	\N
20447726	The purpose of the study was to compare the speech recognition capacity between listeners with and without acoustic reflex using different types of noises and intensities. We studied 18 women allocated to 2 groups: acoustic reflex present (20 ears) and absent (16 ears). They were presented with 180 disyllable words (90 to each ear), emitted randomly at a fixed intensity of 40 dB above the pure tone average hearing level. At the same time, 3 types of noises were presented ipsilaterally (white, pink, and speech), one at a time, at 3 intensities: 40, 50, and 60 dB above the pure tone average hearing level. The ages and auditory thresholds were statistically equal between the groups. There was a significant difference in mean number of hits between the 2 groups for the 3 types of noises used. There was also a significant difference in mean number of hits for noise type and intensity when white and pink noise was used at 40 and 50 dB and for all the intensities when speech was used. Acoustic reflex helps communication in high-noise environments and is more efficient for speech sounds.	t	\N
20507830	On its 43rd anniversary the Simon effect can look back at a long and varied history. First treated as a curious observation with implications for human factors research, it slowly evolved not only into a valuable target of psychological theorizing itself but also into a handy means to investigate attentional operations, the representation of space and of one's body, the cognitive representation of intentional action, and executive control. This article discusses the major characteristics of the Simon effect and the Simon task that laid the ground for this success and reviews the major lines of research, theoretical developments, and ongoing controversies on and around the Simon Effect and the cognitive processes it reflects.	t	\N
20557486	Predominantly, the impact of environmental noise is measured using sound level, ignoring the influence of other factors on subjective experience. The present study tested physiological responses to natural urban soundscapes, using functional magnetic resonance imaging and vector cardiogram. City-based recordings were matched in overall sound level (71 decibel A-weighted scale), but differed on ratings of pleasantness and vibrancy. Listening to soundscapes evoked significant activity in a number of auditory brain regions. Compared with soundscapes that evoked no (neutral) emotional response, those evoking a pleasant or unpleasant emotional response engaged an additional neural circuit including the right amygdala. Ratings of vibrancy had little effect overall, and brain responses were more sensitive to pleasantness than was heart rate. A novel finding is that urban soundscapes with similar loudness can have dramatically different effects on the brain's response to the environment.	t	\N
20562170	Estrogen may be involved in schizophrenia by inhibiting serotonin-1A (5-HT(1A)) receptor function. We examined the effects of estrogen pre-treatment on modulation of loudness dependence of the auditory evoked potential (LDAEP) and mismatch negativity by the 5-HT(1A) receptor partial agonist, buspirone. Using a double-blind, placebo-controlled, repeated-measures design in healthy female volunteers, we observed that buspirone treatment significantly increased LDAEP slope. Estrogen increased LDAEP slope on its own, and a further LDAEP increase by buspirone was not seen after estrogen pre-treatment. Similar results were observed for mismatch negativity, where buspirone caused a small increase of latency, although not amplitude, after placebo but not estrogen pre-treatment, which enhanced mismatch negativity latency on its own. These results are in line with our previous findings on prepulse inhibition showing an inhibitory effect of estrogen on the action of buspirone. Taken together, these data suggest that estrogen may inhibit 5-HT(1A) receptor-mediated disruptions of auditory processing.	t	\N
20570253	An increasing number of neuroimaging studies in music cognition research suggest that "language areas" are involved in the processing of musical syntax, but none of these studies clarified whether these areas are a prerequisite for normal syntax processing in music. The present electrophysiological experiment tested whether patients with lesions in Broca's area (N=6) or in the left anterior temporal lobe (N=7) exhibit deficits in the processing of structure in music compared to matched healthy controls (N=13). A chord sequence paradigm was applied, and the amplitude and scalp topography of the Early Right Anterior Negativity (ERAN) was examined, an electrophysiological marker of musical syntax processing that correlates with activity in Broca's area and its right hemisphere homotope. Left inferior frontal gyrus (IFG) (but not anterior superior temporal gyrus - aSTG) patients with lesions older than 4 years showed an ERAN with abnormal scalp distribution, and subtle behavioural deficits in detecting music-syntactic irregularities. In one IFG patient tested 7 months post-stroke, the ERAN was extinguished and the behavioural performance remained at chance level. These combined results suggest that the left IFG, known to be crucial for syntax processing in language, plays also a functional role in the processing of musical syntax. Hence, the present findings are consistent with the notion that Broca's area supports the processing of syntax in a rather domain-general way.	t	\N
20672879	The purpose of the present study was to investigate the relationship between perception and production of metrical patterns in Swedish children with language impairment (LI), in order to add to the knowledge on underlying explanations of LI. A further aim was to explore whether omissions are mainly related to prosodic aspects or to linguistic function. Children with LI omitted significantly more unstressed syllables than did children with typical language development. Exploration of the relationship between perception and production of phrasal stress patterns demonstrated that children with LI might be divided into three subgroups: Group a: children who perform better on perception than production; Group b: children who perform better on production than perception; and Group c: children with rather poor results on both perception and production.	t	\N
20689026	To use eye tracking to investigate age differences in real-time lexical processing in quiet and in noise in light of the fact that older adults find it more difficult than younger adults to understand conversations in noisy situations. Twenty-four younger and 24 older adults followed spoken instructions referring to depicted objects, for example, "Look at the candle." Eye movements captured listeners' ability to differentiate the target noun (candle) from a similar-sounding phonological competitor (e.g., candy or sandal). Manipulations included the presence/absence of noise, the type of phonological overlap in target-competitor pairs, and the number of syllables. Having controlled for age-related differences in word recognition accuracy (by tailoring noise levels), similar online processing profiles were found for younger and older adults when targets were discriminated from competitors that shared onset sounds. Age-related differences were found when target words were differentiated from rhyming competitors and were more extensive in noise. Real-time spoken word recognition processes appear similar for younger and older adults in most conditions; however, age-related differences may be found in the discrimination of rhyming words (especially in noise), even when there are no age differences in word recognition accuracy. These results highlight the utility of eye movement methodologies for studying speech processing across the life span.	t	\N
20689036	In this study, the authors investigated the effects of age on the use of fundamental frequency differences (ΔF(0)) in the perception of competing synthesized vowels in simulations of electroacoustic and cochlear-implant hearing. Twelve younger listeners with normal hearing and 13 older listeners with (near) normal hearing were evaluated in their use of ΔF(0) in the perception of competing synthesized vowels for 3 conditions: unprocessed synthesized vowels (UNP), envelope-vocoded synthesized vowels that simulated a cochlear implant (VOC), and synthesized vowels processed to simulate electroacoustic stimulation (EAS) hearing. Tasks included (a) multiplicity, which required listeners to identify whether a stimulus contained 1 or 2 sounds and (b) double-vowel identification, which required listeners to attach phonemic labels to the competing synthesized vowels. Multiplicity perception was facilitated by ΔF(0) in UNP and EAS but not in VOC, with no age-related deficits evident. Double-vowel identification was facilitated by ΔF(0), with ΔF(0) benefit largest in UNP, reduced in EAS, and absent in VOC. Age adversely affected overall identification and ΔF(0) benefit on the double-vowel task. Some but not all older listeners derived ΔF(0) benefit in EAS hearing. This variability may partly be due to how listeners are able to draw on higher-level processing resources in extracting and integrating cues in EAS hearing.	t	\N
20695697	In object substitution masking (OSM) a sparse, temporally trailing 4-dot mask impairs target identification, even though it has different contours from, and does not spatially overlap with the target. Here, we demonstrate a previously unknown characteristic of OSM: Observers show reduced masking at prolonged (e.g., 640 ms) relative to intermediate mask durations (e.g., 240 ms). We propose that with prolonged exposure, the mask's visual representation is consolidated, which allows processing of the lingering target icon to be reinitiated, thereby improving performance. Our findings suggest that when the visual system is confronted with 2 temporally contiguous stimuli, although one may initially gain access to consciousness above the other, the "losing" stimulus is not irreversibly lost to awareness.	t	\N
20810622	Multisensory events in our natural environment unfold at multiple temporal scales over extended periods of time. This functional magnetic resonance imaging study investigated whether the brain uses transient (onset, offset) or sustained temporal codes to effectively integrate incoming visual and auditory signals within the cortical hierarchy. Subjects were presented with 1) velocity-modulated radial motion, 2) amplitude-modulated sound, or 3) an in phase combination of both in blocks of variable durations to dissociate transient and sustained blood oxygen level-dependent responses. Audiovisual interactions emerged primarily for transient onset and offset responses highlighting the importance of rapid stimulus transitions for multisensory integration. Strikingly, audiovisual interactions for onset and offset transients were dissociable at the functional and anatomical level. Low-level sensory areas integrated audiovisual inputs at stimulus onset in a superadditive fashion to enhance stimulus salience. In contrast, higher order association areas showed subadditive integration profiles at stimulus offset possibly reflecting the formation of higher order representations. In conclusion, multisensory integration emerges at multiple levels of the cortical hierarchy using different temporal codes and integration profiles. From a methodological perspective, these results highlight the limitations of conventional event related or block designs that cannot characterize these rich dynamics of audiovisual integration.	t	\N
20814962	Auditory functional magnetic resonance imaging (fMRI) was used to assess neural activation in the human auditory brainstem (AB) and cortex (AC) as a function of bandwidth (BW). We recorded brain activation of 22 normal hearing listeners induced by band pass filtered pink noise stimuli with equal sound pressure level of 70 dB SPL. Tested bandwidths were 50, 500, 1,500, 3,000, 6,000, and 8,000 Hz. The center frequency was 4,000 Hz. Categorical loudness scaling had been performed in a silent booth with all of these stimuli. Loudness as a function of bandwidth followed a concave-shaped curve which reflected the influence of spectral loudness summation (SLS) for higher BW and the influence of large amplitude fluctuations for very low BW, which itself could be explained by peak-listening. While neural activation of the AB, as measured by the percent signal change from baseline (PSC), was tuned to the physical BW of the stimuli in a straight linear fashion, the trend of perceived loudness as a function of BW was reflected in several aspects by corresponding neural activation in the primary auditory cortex (PAC). Finally, from the absolute differences of the PSC between PAC and AB, gains in perceived loudness associated with SLS and the effect of large amplitude fluctuations could be predicted with an accuracy of 1-2 dB for the whole group of participants.	t	\N
20829245	Several perisylvian brain regions show preferential activation for spoken language above and beyond other complex sounds. These "speech-selective" effects might be driven by regions' intrinsic biases for processing the acoustical or informational properties of speech. Alternatively, such speech selectivity might emerge through extensive experience in perceiving and producing speech sounds. This functional magnetic resonance imaging (fMRI) study disambiguated such audiomotor expertise from speech selectivity by comparing activation for listening to speech and music in female professional violinists and actors. Audiomotor expertise effects were identified in several right and left superior temporal regions that responded to speech in all participants and music in violinists more than actresses. Regions associated with the acoustic/information content of speech were identified along the entire length of the superior temporal sulci bilaterally where activation was greater for speech than music in all participants. Finally, an effect of performing arts training was identified in bilateral premotor regions commonly activated by finger and mouth movements as well as in right hemisphere "language regions." These results distinguish the seemingly speech-specific neural responses that can be abolished and even reversed by long-term audiomotor experience.	t	\N
20838845	We examined horizontal directional hearing in patients with acquired severe unilateral conductive hearing loss (UCHL). All patients (n = 12) had been fitted with a bone conduction device (BCD) to restore bilateral hearing. The patients were tested in the unaided (monaural) and aided (binaural) hearing condition. Five listeners without hearing loss were tested as a control group while listening with a monaural plug and earmuff, or with both ears (binaural). We randomly varied stimulus presentation levels to assess whether listeners relied on the acoustic head-shadow effect (HSE) for horizontal (azimuth) localization. Moreover, to prevent sound localization on the basis of monaural spectral shape cues from head and pinna, subjects were exposed to narrow band (1/3 octave) noises. We demonstrate that the BCD significantly improved sound localization in 8/12 of the UCHL patients. Interestingly, under monaural hearing (BCD off), we observed fairly good unaided azimuth localization performance in 4/12 of the patients. Our multiple regression analysis shows that all patients relied on the ambiguous HSE for localization. In contrast, acutely plugged control listeners did not employ the HSE. Our data confirm and further extend results of recent studies on the use of sound localization cues in chronic and acute monaural listening.	t	\N
20858647	There have been few reports showing a correlation between hearing levels and life style in young people. In this study, we succeeded in sensitively evaluating hearing levels in 51 young male adults of 21-23 years in age by 12 k Hz extra-high-frequency auditory thresholds, which cannot be measured by usual audiometry devices for clinical use. Noise exposure, alcohol consumption and sleeping time did not affect hearing levels in young adults. Auditory thresholds of 12 kHz frequency in smokers were significantly (p < 0.05) higher than those in non-smokers, while there were no differences in 1 kHz, 4 kHz and 8 kHz frequencies of hearing levels between smokers and non-smokers. Since the Brinkman Index (BI; cigarettes/day multiplied by number of years) of smokers in this study was from 12 to 60, our results suggest that even light smoking of less than 20 cigarettes/day for 3 years can result in the development of hearing loss of 12 kHz frequency in young adults. Binary logistic regression analysis again showed a correlation between hearing loss (≥ 40 dB of auditory thresholds in 12 kHz frequency) and light smoking (12 ≤ BI ≤ 60). Thus, this study showed that auditory threshold at 12 kHz frequency could be a sensitive marker for hearing in young adults. More importantly, we for the first time provided epidemiological evidence that light smoking might affect hearing level at 12 kHz frequency and revealed a new risk of light smoking.	t	\N
20862531	In this paper we use information theory to quantify the information in the output spike trains of modeled cochlear nucleus globular bushy cells (GBCs). GBCs are part of the sound localization pathway. They are known for their precise temporal processing, and they code amplitude modulations with high fidelity. Here we investigated the information transmission for a natural sound, a recorded vowel. We conclude that the maximum information transmission rate for a single neuron was close to 1,050 bits/s, which corresponds to a value of approximately 5.8 bits per spike. For quasi-periodic signals like voiced speech, the transmitted information saturated as word duration increased. In general, approximately 80% of the available information from the spike trains was transmitted within about 20 ms. Transmitted information for speech signals concentrated around formant frequency regions. The efficiency of neural coding was above 60% up to the highest temporal resolution we investigated (20 μs). The increase in transmitted information to that precision indicates that these neurons are able to code information with extremely high fidelity, which is required for sound localization. On the other hand, only 20% of the information was captured when the temporal resolution was reduced to 4 ms. As the temporal resolution of most speech recognition systems is limited to less than 10 ms, this massive information loss might be one of the reasons which are responsible for the lack of noise robustness of these systems.	t	\N
20869305	The present study examined the differential effects of voice auditory feedback perturbation direction and magnitude on voice fundamental frequency (F(0)) responses and event-related potentials (ERPs) from EEG electrodes on the scalp. The voice F(0) responses and N1 and P2 components of ERPs were examined from 12 right-handed speakers when they sustained a vowel phonation and their mid-utterance voice pitch feedback was shifted ±100, ±200, and ±500 cents with 200 ms duration. Downward voice pitch feedback perturbations led to larger voice F(0) responses than upward perturbations. The amplitudes of N1 and P2 components were larger for downward compared with upward pitch-shifts for 200 and 500 cents stimulus magnitudes. Shorter N1 and P2 latencies were also associated with larger magnitudes of pitch feedback perturbations. Corresponding changes in vocal and neural responses to upward and downward voice pitch feedback perturbations suggest that the N1 and P2 components of ERPs reflect neural concomitants of the vocal responses. The findings of interactive effects between the magnitude and direction of voice feedback pitch perturbation on N1 and P2 ERP components indicate that the neural mechanisms underlying error detection and correction in voice pitch auditory feedback are differentially sensitive to both the magnitude and direction of pitch perturbations.	t	\N
20878201	Practice can lead to dramatic improvements in the discrimination of auditory stimuli. In this study, we investigated changes of the frequency-following response (FFR), a subcortical component of the auditory evoked potentials, after a period of pitch discrimination training. Twenty-seven adult listeners were trained for 10 h on a pitch discrimination task using one of three different complex tone stimuli. One had a static pitch contour, one had a rising pitch contour, and one had a falling pitch contour. Behavioral measures of pitch discrimination and FFRs for all the stimuli were measured before and after the training phase for these participants, as well as for an untrained control group (n = 12). Trained participants showed significant improvements in pitch discrimination compared to the control group for all three trained stimuli. These improvements were partly specific for stimuli with the same pitch modulation (dynamic vs. static) and with the same pitch trajectory (rising vs. falling) as the trained stimulus. Also, the robustness of FFR neural phase locking to the sound envelope increased significantly more in trained participants compared to the control group for the static and rising contour, but not for the falling contour. Changes in FFR strength were partly specific for stimuli with the same pitch modulation (dynamic vs. static) of the trained stimulus. Changes in FFR strength, however, were not specific for stimuli with the same pitch trajectory (rising vs. falling) as the trained stimulus. These findings indicate that even relatively low-level processes in the mature auditory system are subject to experience-related change.	t	\N
20883593	This study aimed to evaluate the effect of lengthening the transition duration of selected speech segments upon the perception of those segments in individuals with auditory dys-synchrony. Thirty individuals with auditory dys-synchrony participated in the study, along with 30 age-matched normal hearing listeners. Eight consonant-vowel syllables were used as auditory stimuli. Two experiments were conducted. Experiment one measured the 'just noticeable difference' time: the smallest prolongation of the speech sound transition duration which was noticeable by the subject. In experiment two, speech sounds were modified by lengthening the transition duration by multiples of the just noticeable difference time, and subjects' speech identification scores for the modified speech sounds were assessed. Subjects with auditory dys-synchrony demonstrated poor processing of temporal auditory information. Lengthening of speech sound transition duration improved these subjects' perception of both the placement and voicing features of the speech syllables used. These results suggest that innovative speech processing strategies which enhance temporal cues may benefit individuals with auditory dys-synchrony.	t	\N
20884355	Developmental differences in phonological and orthographic processing of Chinese spoken words were examined in 9-year-olds, 11-year-olds and adults using functional magnetic resonance imaging (fMRI). Rhyming and spelling judgments were made to two-character words presented sequentially in the auditory modality. Developmental comparisons between adults and both groups of children combined showed that age-related changes in activation in visuo-orthographic regions depended on a task. There were developmental increases in the left inferior temporal gyrus and the right inferior occipital gyrus in the spelling task, suggesting more extensive visuo-orthographic processing in a task that required access to these representations. Conversely, there were developmental decreases in activation in the left fusiform gyrus and left middle occipital gyrus in the rhyming task, suggesting that the development of reading is marked by reduced involvement of orthography in a spoken language task that does not require access to these orthographic representations. Developmental decreases may arise from the existence of extensive homophony (auditory words that have multiple spellings) in Chinese. In addition, we found that 11-year-olds and adults showed similar activation in the left superior temporal gyrus across tasks, with both groups showing greater activation than 9-year-olds. This pattern suggests early development of perceptual representations of phonology. In contrast, 11-year-olds and 9-year-olds showed similar activation in the left inferior frontal gyrus across tasks, with both groups showing weaker activation than adults. This pattern suggests late development of controlled retrieval and selection of lexical representations. Altogether, this study suggests differential effects of character acquisition on development of components of the language network in Chinese as compared to previous reports on alphabetic languages.	t	\N
20888628	Mechanisms of cortical reorganization underlying the enhancement of speech processing have been poorly investigated. In the present study, we addressed changes in functional and effective connectivity induced in subjects who learned to deliberately increase activation in the right inferior frontal gyrus (rIFG), and improved their ability to identify emotional intonations by using a real-time fMRI Brain-Computer Interface. At the beginning of their training process, we observed a massive connectivity of the rIFG to a widespread network of frontal and temporal areas, which decreased and lateralized to the right hemisphere with practice. Volitional control of activation strengthened connectivity of this brain region to the right prefrontal cortex, whereas training increased its connectivity to bilateral precentral gyri. These findings suggest that changes of connectivity in a functionally specific manner play an important role in the enhancement of speech processing. Also, these findings support previous accounts suggesting that motor circuits play a role in the comprehension of speech.	t	\N
20933093	The reproducibility of three different aspects of fMRI activations-namely binary activation maps, effect size and spatial distribution of local maxima-was evaluated for an auditory sentence comprehension task with high attention demand on a group of 17 subjects that were scanned on five different occasions. While in the scanner subjects were asked to listen to a series of six short everyday sentences from the CUNY sentence test. Comprehension and attention to the stimuli were monitored after each listen condition epoch by having subjects answer a series of multiple-choice questions. Statistical maps of activation for the listen condition were computed at three different levels: overall results for all imaging sessions, group-level/single-session results for each of the five imaging occasions, and single-subject/single-session results computed for each subject and each scanning occasion independently. The experimental task recruited a distributed bilateral network with processing nodes located in the lateral temporal cortex, inferior frontal cortex, medial BA6, medial occipital cortex and subcortical structures such as the putamen and the thalamus. Reproducibility of these activations at the group level was high (83.95% of the imaged volume was consistently classified as active/inactive across all five imaging sessions), indicating that sites of neuronal activity associated with auditory comprehension can reliably be detected with fMRI in healthy subjects, across repeated measures after group averaging. At the single-subject level reproducibility ranged from moderate to high, although no significant differences were found on behavioral measures across subjects or sessions. This result suggests that contextual differences-i.e., those specific to each imaging session, can modulate our ability to detect fMRI activations associated with speech comprehension in individual subjects.	t	\N
20951812	Auditory event-related potentials (ERPs) have been used to understand how the brain processes auditory input, and to track developmental change in sensory systems. Localizing ERP generators can provide invaluable insights into how and where auditory information is processed. However, age-appropriate infant brain templates have not been available to aid such developmental mapping. In this study, auditory change detection responses of brain ERPs were examined in 6-month-old infants using discrete and distributed source localization methods mapped onto age-appropriate magnetic resonance images. Infants received a passive oddball paradigm using fast-rate non-linguistic auditory stimuli (tone doublets) with the deviant incorporating a pitch change for the second tone. Data was processed using two different high-pass filters. When a 0.5 Hz filter was used, the response to the pitch change was a large frontocentral positive component. When a 3 Hz filter was applied, two temporally consecutive components associated with change detection were seen: one with negative voltage, and another with positive voltage over frontocentral areas. Both components were localized close to the auditory cortex with an additional source near to the anterior cingulate cortex. The sources for the negative response had a more tangential orientation relative to the supratemporal plane compared to the positive response, which showed a more lateral, oblique orientation. The results described here suggest that at 6 months of age infants generate similar response patterns and use analogous cortical areas to that of adults to detect changes in the auditory environment. Moreover, the source locations and orientations, together with waveform topography and morphology provide evidence in infants for feature-specific change detection followed by involuntary switching of attention.	t	\N
20961171	The present study addresses the question whether accentuation and prosodic phrasing can have a similar function, namely, to group words in a sentence together. Participants listened to locally ambiguous sentences containing object- and subject-control verbs while ERPs were measured. In Experiment 1, these sentences contained a prosodic break, which can create a certain syntactic grouping of words, or no prosodic break. At the disambiguation, an N400 effect occurred when the disambiguation was in conflict with the syntactic grouping created by the break. We found a similar N400 effect without the break, indicating that the break did not strengthen an already existing preference. This pattern held for both object- and subject-control items. In Experiment 2, the same sentences contained a break and a pitch accent on the noun following the break. We argue that the pitch accent indicates a broad focus covering two words [see Gussenhoven, C. On the limits of focus projection in English. In P. Bosch & R. van der Sandt (Eds.), Focus: Linguistic, cognitive, and computational perspectives. Cambridge: University Press, 1999], thus grouping these words together. For object-control items, this was semantically possible, which led to a "good-enough" interpretation of the sentence. Therefore, both sentences were interpreted equally well and the N400 effect found in Experiment 1 was absent. In contrast, for subject-control items, a corresponding grouping of the words was impossible, both semantically and syntactically, leading to processing difficulty in the form of an N400 effect and a late positivity. In conclusion, accentuation can group words together on the level of information structure, leading to either a semantically "good-enough" interpretation or a processing problem when such a semantic interpretation is not possible.	t	\N
20966383	Delayed auditory feedback (DAF) of a speaker's voice disturbs normal speech production. Various traditional theories assume that the content of the delayed feedback signal interferes with the actual production of a particular speech unit (phonemic content hypothesis). The displaced rhythm hypothesis as an alternative explanation suggests that speech disturbances arise from a disruptive rhythm that is produced by the delayed speech signal. The present experimental study directly contrasted the role of rhythm and speech content in a DAF task using speech units as stimuli. One hundred fifty-one participants read aloud 4 different sequences of double syllables that varied in phonemic content and rhythm while auditory feedback was either nondelayed or delayed by 200 or 400 ms. In line with previous studies, the authors found a peak of disturbances at a delay of about 200 ms, independent of speech rate. More important, the present results clearly support the displaced rhythm hypothesis. A speech rate dependency of this effect was also found. Rhythm seems to be a significant criterion of speech monitoring, and hence a mismatch between spoken words and auditory feedback realized by DAF induces obvious speech problems on rhythmic level regardless of phonemic discrepancy at the same time.	t	\N
21054432	Performance improvement during an hour of auditory perceptual training is accompanied by rapid physiological changes. These changes may reflect learning or simply task repetition independent of learning. We assessed the contribution of learning and task repetition to changes in auditory evoked potentials during a difficult speech identification task and an easy tone identification task. We posited that only task repetition effects would occur in the tone task but that task repetition and learning would interact in the speech task. Speech identification improved with practice (increased sensitivity d' with a constant response bias β). This behavioral improvement coincided with a decrease in the amplitude of sensory evoked responses (N1, P2) and a decrease in the amplitude of a slow wave (peak=320 ms after onset) over the left frontal and parietal sites. Results show rapid physiological changes associated with learning, distinct from changes related to task repetition.	t	\N
21060139	Improved speech recognition in binaurally combined acoustic-electric stimulation (otherwise known as bimodal hearing) could arise when listeners integrate speech cues from the acoustic and electric hearing. The aims of this study were (a) to identify speech cues extracted in electric hearing and residual acoustic hearing in the low-frequency region and (b) to investigate cochlear implant (CI) users' ability to integrate speech cues across frequencies. Normal-hearing (NH) and CI subjects participated in consonant and vowel identification tasks. Each subject was tested in 3 listening conditions: CI alone (vocoder speech for NH), hearing aid (HA) alone (low-pass filtered speech for NH), and both. Integration ability for each subject was evaluated using a model of optimal integration--the PreLabeling integration model (Braida, 1991). Only a few CI listeners demonstrated bimodal benefit for phoneme identification in quiet. Speech cues extracted from the CI and the HA were highly redundant for consonants but were complementary for vowels. CI listeners also exhibited reduced integration ability for both consonant and vowel identification compared with their NH counterparts. These findings suggest that reduced bimodal benefits in CI listeners are due to insufficient complementary speech cues across ears, a decrease in integration ability, or both.	t	\N
21068039	Noise-vocoded (NV) speech is often regarded as conveying phonetic information primarily through temporal-envelope cues rather than spectral cues. However, listeners may infer the formant frequencies in the vocal-tract output-a key source of phonetic detail-from across-band differences in amplitude when speech is processed through a small number of channels. The potential utility of this spectral information was assessed for NV speech created by filtering sentences into six frequency bands, and using the amplitude envelope of each band (≤30 Hz) to modulate a matched noise-band carrier (N). Bands were paired, corresponding to F1 (≈N1 + N2), F2 (≈N3 + N4) and the higher formants (F3' ≈ N5 + N6), such that the frequency contour of each formant was implied by variations in relative amplitude between bands within the corresponding pair. Three-formant analogues (F0 = 150 Hz) of the NV stimuli were synthesized using frame-by-frame reconstruction of the frequency and amplitude of each formant. These analogues were less intelligible than the NV stimuli or analogues created using contours extracted from spectrograms of the original sentences, but more intelligible than when the frequency contours were replaced with constant (mean) values. Across-band comparisons of amplitude envelopes in NV speech can provide phonetically important information about the frequency contours of the underlying formants.	t	\N
21086218	Two experiments investigated the role that different face regions play in a variety of social judgements that are commonly made from facial appearance (sex, age, distinctiveness, attractiveness, approachability, trustworthiness, and intelligence). These judgements lie along a continuum from those with a clear physical basis and high consequent accuracy (sex, age) to judgements that can achieve a degree of consensus between observers despite having little known validity (intelligence, trustworthiness). Results from Experiment 1 indicated that the face's internal features (eyes, nose, and mouth) provide information that is more useful for social inferences than the external features (hair, face shape, ears, and chin), especially when judging traits such as approachability and trustworthiness. Experiment 2 investigated how judgement agreement was affected when the upper head, eye, nose, or mouth regions were presented in isolation or when these regions were obscured. A different pattern of results emerged for different characteristics, indicating that different types of facial information are used in the various judgements. Moreover, the informativeness of a particular region/feature depends on whether it is presented alone or in the context of the whole face. These findings provide evidence for the importance of holistic processing in making social attributions from facial appearance.	t	\N
21092939	The ratio of scalp-recorded brain responses occurring 50 msec after paired clicks (S2-evoked P50/S1-evoked P50) serves as a measure of sensory gating. An abnormally large ratio is commonly found in schizophrenia and is considered as a sign of reduced sensory gating or otherwise dysfunctional organization of the auditory/verbal system as a factor contributing to psychopathology and cognitive dysfunction in schizophrenia. This initial randomized clinical trial compared the efficacy of two 4-week, computer-based cognitive training methods that emphasize either auditory discrimination and verbal memory or a broader range of cognitive functions in schizophrenia. Thirty-nine schizophrenia patients (ICD-F20.0 diagnosis) were assigned to Cognitive Exercises (CE) or Cognitive Package (Cogpack). The M50, the magnetoencephalographic analogue of electroencephalographic P50, and performance on verbal learning and memory tests were used to evaluate training effects. As expected, patients exhibited higher pretreatment gating ratios than 28 age-matched healthy comparison participants. Gating ratios decreased after CE but not after Cogpack. Cognitive test performance improved more after CE than after Cogpack. Appropriately specific psychological training changes the neural performance in schizophrenia, normalizing sensory and cognitive function.	t	\N
21105829	The problem of multimodal clustering arises whenever the data are gathered with several physically different sensors. Observations from different modalities are not necessarily aligned in the sense there there is no obvious way to associate or compare them in some common space. A solution may consist in considering multiple clustering tasks independently for each modality. The main difficulty with such an approach is to guarantee that the unimodal clusterings are mutually consistent. In this letter, we show that multimodal clustering can be addressed within a novel framework: conjugate mixture models. These models exploit the explicit transformations that are often available between an unobserved parameter space (objects) and each of the observation spaces (sensors). We formulate the problem as a likelihood maximization task and derive the associated conjugate expectation-maximization algorithm. The convergence properties of the proposed algorithm are thoroughly investigated. Several local and global optimization techniques are proposed in order to increase its convergence speed. Two initialization strategies are proposed and compared. A consistent model selection criterion is proposed. The algorithm and its variants are tested and evaluated within the task of 3D localization of several speakers using both auditory and visual data.	t	\N
21106695	To examine resource allocation and sentence processing, this study examined the effects of auditory distraction on grammaticality judgment (GJ) of sentences varied by semantics (reversibility) and short-term memory requirements. Experiment 1: Typical young adult females (N = 60) completed a whole-sentence GJ task in distraction (Quiet, Noise, or Talk). Participants judged grammaticality of Passive sentences varied by sentence (length), grammaticality, and reversibility. Reaction time (RT) data were analyzed using a mixed analysis of variance. Experiment 2: A similar group completed a self-paced reading GJ task using the similar materials. Experiment 1: Participants responded faster to Bad and to Nonreversible sentences, and in the Talk distraction. The slowest RTs were noted for Good-Reversible-Padded sentences in the Quiet condition. Experiment 2: Distraction did not differentially affect RTs for sentence components. Verb RTs were slower for Reversible sentences. Results suggest that narrative distraction affected GJ, but by speeding responses, not slowing them. Sentence variables of memory and reversibility slowed RTs, but narrative distraction resulted in faster processing times regardless of individual sentence variables. More explicit, deliberate tasks (self-paced reading) resulted in less effect from distraction. Results are discussed in terms of recent theories about auditory distraction.	t	\N
21112404	Visual dominance refers to the observation that in bimodal environments vision often has an advantage over other senses in human. Therefore, a better memory performance for visual compared to, e.g., auditory material is assumed. However, the reason for this preferential processing and the relation to the memory formation is largely unknown. In this fMRI experiment, we manipulated cross-modal competition and attention, two factors that both modulate bimodal stimulus processing and can affect memory formation. Pictures and sounds of objects were presented simultaneously in two levels of recognisability, thus manipulating the amount of cross-modal competition. Attention was manipulated via task instruction and directed either to the visual or the auditory modality. The factorial design allowed a direct comparison of the effects between both modalities. The resulting memory performance showed that visual dominance was limited to a distinct task setting. Visual was superior to auditory object memory only when allocating attention towards the competing modality. During encoding, cross-modal competition and attention towards the opponent domain reduced fMRI signals in both neural systems, but cross-modal competition was more pronounced in the auditory system and only in auditory cortex this competition was further modulated by attention. Furthermore, neural activity reduction in auditory cortex during encoding was closely related to the behavioural auditory memory impairment. These results indicate that visual dominance emerges from a less pronounced vulnerability of the visual system against competition from the auditory domain.	t	\N
21133642	Tonsillar hypertrophy is common in young children and affects several aspects of the speech such as distortions of the dento-alveolar consonants. The study objective was to assess /s/-articulation, perceptually and acoustically, in children with tonsillar hypertrophy and compare effects of two types of surgery, total tonsillectomy and tonsillotomy. Sixty-seven children, aged 50-65 months, on the waiting list for surgery, were randomized to tonsillectomy or tonsillotomy. The speech material was collected preoperatively and 6 months postoperatively. Two groups of age-matched children were controls. /S/-articulation was affected acoustically with lower spectral peak locations and perceptually with less distinct /s/-production before surgery, in comparison to controls. After surgery /s/-articulation was normalized perceptually, but acoustic differences remained. No significant differences between surgical methods were found.	t	\N
21145901	The allocation of visual processing capacity is a key topic in studies and theories of visual attention. The load theory of Lavie (1995) proposes that allocation happens in two steps where processing resources are first allocated to task-relevant stimuli and secondly remaining capacity 'spills over' to task-irrelevant distractors. In contrast, the Theory of Visual Attention (TVA) proposed by Bundesen (1990) assumes that allocation happens in a single step where processing capacity is allocated to all stimuli, both task-relevant and task-irrelevant, in proportion to their relative attentional weight. Here we present data from two partial report experiments where we varied the number and discriminability of the task-irrelevant stimuli (Experiment 1) and perceptual load (Experiment 2). The TVA fitted the data of the two experiments well thus favoring the simple explanation with a single step of capacity allocation. We also show that the effects of varying perceptual load can only be explained by a combined effect of allocation of processing capacity as well as limits in visual working memory. Finally, we link the results to processing capacity understood at the neural level based on the neural theory of visual attention by Bundesen et al. (2005).	t	\N
21147178	We investigated the effects of non-native language (English) exposure on event-related potentials (ERPs) in first- and second-year (four- and five-year-old) preschool Japanese native speakers while they listened to semantically congruent and incongruent Japanese sentences. The children were divided into a non-native language exposed group (exposed group) and a group without such experiences (control group) on the basis of their exposure to non-native language. We compared the ERPs recorded from the two groups in each of the two preschool years. N400 was observed both in the first- and second-year preschoolers. Differences owing to exposure to non-native language appeared in the second-year preschoolers but not in the first-year preschoolers. In the second-year preschoolers, the N400 onset in the exposed group was shorter than that in the control group, but there was no difference in the N400 offset between the exposed and control groups. Furthermore, the scalp distribution of the N400 in the exposed group was broader than that in the control group. These results indicate that the time course and scalp distribution of semantic processing for native language sentences in young children fluctuated depending on exposure to non-native language.	t	\N
21148089	Numerous studies have demonstrated effects of spatial attention within single sensory modalities (within-modal spatial attention) and the effect of directing attention to one sense compared with the other senses (intermodal attention) on cortical neuronal activity. Furthermore, recent studies have been revealing that the effects of spatial attention directed to a certain location in a certain sense spread to the other senses at the same location in space (cross-modal spatial attention). The present study used magnetoencephalography to examine the temporal dynamics of the effects of within-modal and cross-modal spatial and intermodal attention on cortical processes responsive to visual stimuli. Visual or tactile stimuli were randomly presented on the left or right side at a random interstimulus interval and subjects directed attention to the left or right when vision or touch was a task-relevant modality. Sensor-space analysis showed that a response around the occipitotemporal region at around 150 ms after visual stimulation was significantly enhanced by within-modal, cross-modal spatial, and intermodal attention. A later response over the right frontal region at around 200 ms was enhanced by within-modal spatial and intermodal attention, but not by cross-modal spatial attention. These effects were estimated to originate from the occipitotemporal and lateral frontal areas, respectively. Thus the results suggest different spatiotemporal dynamics of neural representations of cross-modal attention and intermodal or within-modal attention.	t	\N
21185855	External noise paradigms have been widely used to probe different levels of visual processing (Pelli & Farell, 1999). A basic assumption of this paradigm is that the processing strategy is noise-invariant, remaining the same in low and high external noise. We tested this assumption by examining crowding in a detection task where traditionally crowding has no effect. In the first experiment, we measured detection thresholds for a vertically oriented sine wave grating (target) surrounded by four sine wave gratings (flankers) that were either vertically or horizontally oriented. At low noise levels, the detection threshold for the target was unaffected by the orientation of the flankers--there was no crowding. Surprisingly, however, there was crowding for detection at high noise levels: the threshold increased for the similarly-oriented flankers. This suggests that high noise triggered a change in processing strategy, increasing the range of space or features over which the visual signal was sampled. In a second experiment, we evaluated the impact of the spatial and temporal window of the noise on this crowding effect. Although crowding was observed for detection when the spatial and/or temporal window of the noise was localized (i.e. identical to the signal window), no crowding was observed when the noise was spatially and temporally extended (i.e. continuously displayed, full screen dynamic noise). Our results show that certain spatiotemporal distributions of external noise can elicit a change in processing strategy, invalidating the noise-invariant assumption that underlies external noise paradigms. In contrast, spatiotemporally extended noise maintains the required noise-indifference, perhaps because it matches the characteristics of the internal noise that determines the contrast threshold in low noise.	t	\N
21187749	Sentence stress is a vital attribute of speech since it indicates the importance of specific words within an utterance. Basic acoustic correlates of stress are syllable duration, intensity, and fundamental frequency (F0). Objectives of the study were to determine cochlear implant (CI) users' perception of the acoustic correlates and to uncover which cues are used for stress identification. Several experiments addressed the discrimination of changes in syllable duration, intensity, and F0 as well as stress identification based on these cues. Moreover, the discrimination of combined cues and identification of stress in conversational speech was examined. Both natural utterances and artificial manipulations of the acoustic cues were used as stimuli. Discrimination of syllable duration did not differ significantly between CI recipients and a control group of normal-hearing listeners. In contrast, CI users performed significantly worse on tasks of discrimination and stress identification based on F0 as well as on intensity. Results from these measurements were significantly correlated with the ability to identify stress in conversational speech. Discrimination performance for covarying F0 and intensity changes was more strongly correlated to identification performance than was found for discrimination of either F0 or intensity alone. Syllable duration was not related to stress identification in natural utterances. The outcome emphasizes the importance of both F0 and intensity for CI users' identification of sentence-based stress. Both cues were used separately for stress perception, but combining the cues provided extra benefit for most of the subjects.	t	\N
21187751	The current study was designed to see how hearing-impaired individuals judge level differences between speech sounds with and without hearing amplification. It was hypothesized that hearing aid compression should adversely affect the user's ability to judge level differences. Thirty-eight hearing-impaired participants performed an adaptive tracking procedure to determine their level-discrimination thresholds for different word and sentence tokens, as well as speech-spectrum noise, with and without their hearing aids. Eight normal-hearing participants performed the same task for comparison. Level discrimination for different word and sentence tokens was more difficult than the discrimination of stationary noises. Word level discrimination was significantly more difficult than sentence level discrimination. There were no significant differences, however, between mean performance with and without hearing aids and no correlations between performance and various hearing aid measurements. There is a clear difficulty in judging the level differences between words or sentences relative to differences between broadband noises, but this difficulty was found for both hearing-impaired and normal-hearing individuals and had no relation to hearing aid compression measures. The lack of a clear adverse effect of hearing aid compression on level discrimination is suggested to be due to the low effective compression ratios of currently fit hearing aids.	t	\N
21190394	The purpose of the present study was to test the hypothesis that cochlear implant (CI) users' music perception is correlated with their lexical tone perception, and the two types of perception share similar mechanisms in electric hearing. A lexical tone perception test and a pitch interval discrimination test were administered to a group of CI users and a group of normal-hearing (NH) listeners. SAMPLE STUDY: Nineteen adult CI users and 10 NH listeners who are native-Mandarin-Chinese speakers participated in the study. Tone-perception performance of the CI group was, on average, 58.3% correct (± 19.78% correct), and performance of the NH group was near perfect. The CI group had a mean threshold of 5.66 semitones (± 5.57 semitones) in pitch discrimination as compared to the threshold of 0.44 semitone from the NH group. There was a strong correlation between the CI users' tone-perception performance and their pitch discrimination threshold (r = -0.75, p < 0.001). Musical and lexical pitch perceptions are strongly correlated with each other and they might share similar mechanisms in electric hearing.	t	\N
21206391	The round window application of the Vibrant Sound bridge, the so-called round window vibroplasty, is gaining increasing popularity for hearing rehabilitation of patients with mixed hearing loss or conductive hearing loss. In these patients, conventional hearing amplification and/or surgical restoration is either not possible or has failed because of chronic ear disease, extensive otosclerosis, or malformations. The exact mechanisms of direct cochlear stimulation via the round window membrane are not yet completely understood. It is unclear what kind and what degree of contact is required between the floating mass transducer (FMT) and the round window membrane (RWM) to elicit a functional hearing perception with the implant. We investigated the coupling efficiency between the FMT and the RWM and how the efficiency is altered by the FMT position, the degree of FMT-RWM contact, and the use of a soft tissue coupler. Prospective cohort study. Tertiary referral center in Western Australia. Patients undergoing round window vibroplasty for a mixed or conductive hearing loss otherwise not aidable. Patients underwent round window vibroplasty and received audiological and coupling analysis in the follow-up. These data were then correlated with FMT positioning and the extent of FMT-RWM interface as determined by postoperative high-resolution temporal bone computed tomography. Coupling and hearing levels in relation to FMT positioning and degree of FMT-RWM contact. Of 10 patients, 8 were available for vibroplasty behavioral threshold testing. In 2 patients, testing could not be done because of wound breakdown requiring device explantation in 1 case, and in the other case, the bone conduction thresholds dropped 2 months after implantation, thus falling out of the performance range of the device. Postoperative FMT migration occurred in 50% of the patients (3/6) with recurrent chronic ear disease and status after multiple previous ear operations. All patients, including the 3 patients requiring surgical repositioning of the FMT, attained significantly improved speech in quiet and speech in noise when compared with the preoperatively best aided performance. All patients showed significantly improved average Abbreviated Profile of Hearing Benefit scores with the use of the FMT. Direct (partial or complete) contact with the RWM resulted in good coupling efficiency; soft tissue coupling resulted in a reduced coupling efficiency.	t	\N
21258612	This study had three goals: (1) to document the literacy skills of deaf adolescents who received cochlear implants (CIs) as preschoolers; (2) to examine reading growth from elementary grades to high school; (3) to assess the contribution of early literacy levels and phonological processing skills, among other factors, to literacy levels in high school. A battery of reading, spelling, expository writing, and phonological processing assessments were administered to 112 high school (CI-HS) students, ages 15.5 to 18.5 yrs, who had participated in a reading assessment battery in early elementary grades (CI-E), ages 8.0 to 9.9 yrs. The CI-HS students' performance was compared with either a control group of hearing peers (N = 46) or hearing norms provided by the assessment developer. Many of the CI-HS students (47 to 66%) performed within or above the average range for hearing peers on reading tests. When compared with their CI-E performance, good early readers were also good readers in high school. Importantly, the majority of CI-HS students maintained their reading levels over time compared with hearing peers, indicating that the gap in performance was, at the very least, not widening for most students. Written expression and phonological processing tasks posed a great deal of difficulty for the CI-HS students. They were poorer spellers, poorer expository writers, and displayed poorer phonological knowledge than hearing age-mates. Phonological processing skills were a critical predictor of high school literacy skills (reading, spelling, and expository writing), accounting for 39% of variance remaining after controlling for child, family, and implant characteristics. Many children who receive CIs as preschoolers achieve age-appropriate literacy levels as adolescents. However, significant delays in spelling and written expression are evident compared with hearing peers. For children with CIs, the development of phonological processing skills is not just important for early reading skills, such as decoding, but is critical for later literacy success as well.	t	\N
21261633	Humans must often focus attention onto relevant sensory signals in the presence of simultaneous irrelevant signals. This type of attention has been explored in vision with the N2pc component, and the present study sought to find an analogous auditory effect. In Experiment 1, two 750-ms sounds were presented simultaneously, one from each of two lateral speakers. On each trial, participants indicated whether one of the two sounds was a pre-defined target. We found that targets elicited an N2ac component: a negativity in the N2 latency range at anterior contralateral electrodes. We also observed a later and more posterior contralateral positivity. Experiment 2 replicated these effects and demonstrated that they arose from competition between attended and unattended tones rather than reflecting lateralized effects of attention for individual tones. The N2ac component may provide a useful tool for studying selective attention within auditory scenes.	t	\N
21264649	In two experiments, each including a simple reaction time (RT) task, a localization task, and a passive oddball paradigm, the physical similarity between two dichotically presented auditory stimuli was manipulated. In both experiments, a redundant signals effect (RSE), high localization performance, and a reliable mismatch negativity (MMN) was observed for largely differing stimuli, suggesting that these are coded separately in auditory memory. In contrast, no RSE and a localization rate close to chance level (experiment 1) or at chance (experiment 2) were observed for stimuli differing to a lesser degree. Crucially, for such stimuli a small (experiment 1) or no (experiment 2) MMN were observed. These MMN results indicate that such stimuli tend to fuse into a single percept and that this fusion occurs rather early within information processing.	t	\N
21264730	Fundamental limitations in performing multiple tasks concurrently are well illustrated by the attentional blink (AB) deficit, which refers to the difficulty in reporting a second target (T2) when it is presented shortly after a first target (T1). Surprisingly, recent studies have shown that the AB, which is often thought of as a manifestation of capacity limitations in central processing, can be reduced when the AB task is performed simultaneously with concurrent distracting activities. In the present study, we sought to investigate whether such concurrency benefits would also be observed when the AB task was performed concurrently with a central demanding timing task. The AB was reduced under concurrent-task conditions, as compared with single-AB-task conditions, even though T1 performance was unaffected by the concurrent task. Moreover, shifts in decision criteria were found to be associated with the concurrency benefit effect.	t	\N
21277208	Crowding by nearby features causes identification failures in the peripheral visual field. However, prominent visual features can sometimes fail to reach awareness, causing scenes to be incorrectly interpreted. Here we examine whether awareness of the flanking features is necessary for crowding to occur. Flankers that were physically present were rendered perceptually absent with adaptation-induced blindness. In a letter identification task, targets were presented unflanked or with up to four flanker letters. On each trial, observers reported both the number of letters they perceived and the identity of a target letter. This paradigm allowed trial-by-trial assessment of awareness and crowding and ensured that both targets and flankers were attended. Target-letter identification performance was correlated with the number of flanking letters that were perceived on a given trial, regardless of the number that were physically present. Our data demonstrate that crowding can be released when flanking elements at attended locations are suppressed from visual awareness.	t	\N
21299272	We argue that 4 fundamental gestalt phenomena in perception apply to the control of motor action. First, a motor gestalt, like a perceptual gestalt, is holistic in the sense that it is processed as a single unit. This notion is consistent with reaction time results indicating that all gestures for a brief unit of action must be programmed prior to initiation of any part of the movement. Additional reaction time results related to initiation of longer responses are consistent with processing in terms of a sequence of indivisible motor gestalts. Some actions (e.g., many involving coordination of the hands) can be carried out effectively only if represented as a unitary gestalt. Second, a perceptual gestalt is independent of specific sensory receptors, as evidenced by perceptual constancy. In a similar manner a motor gestalt can be represented independently of specific muscular effectors, thereby allowing motor constancy. Third, just as a perceptual pattern (e.g., a Necker cube) is exclusively structured into only 1 of its possible configurations at any moment in time, processing prior to action is limited to 1 motor gestalt. Fourth, grouping in apparent motion leads to stream segregation in visual and auditory perception; this segregation is present in motor action and is dependent on the temporal rate. We discuss congruence of gestalt phenomena across perception and motor action (a) in relation to a unitary perceptual-motor code, (b) with respect to differences in the role of awareness, and (c) in conjunction with separate neural pathways for conscious perception and motor control.	t	\N
21300112	Children's foreign-language (FL) learning is a matter of much social as well as scientific debate. Previous behavioral research indicates that starting language learning late in life can lead to problems in phonological processing. Inadequate phonological capacity may impede lexical learning and semantic processing (phonological bottleneck hypothesis). Using both behavioral and neuroimaging data, here we examine the effects of age of first exposure (AOFE) and total hours of exposure (HOE) to English, on 350 Japanese primary-school children's semantic processing of spoken English. Children's English proficiency scores and N400 event-related brain potentials (ERPs) were analyzed in multiple regression analyses. The results showed (1) that later, rather than earlier, AOFE led to higher English proficiency and larger N400 amplitudes, when HOE was controlled for; and (2) that longer HOE led to higher English proficiency and larger N400 amplitudes, whether AOFE was controlled for or not. These data highlight the important role of amount of exposure in FL learning, and cast doubt on the view that starting FL learning earlier always produces better results.	t	\N
21308886	Anxiety Sensitivity (AS), the tendency to fear the thoughts, symptoms, and social consequences associated with the experience of anxiety, is associated with increased risk for developing anxiety disorders. Some evidence suggests that higher scores on the Anxiety Sensitivity Index (ASI), a measure of the AS construct, are associated with activation of the anterior insular cortex during overt emotion perception. Although the ASI provides subscale scores measuring Physical, Mental Incapacitation, and Social Concerns of AS, no study has examined the relationship between these factors and regional brain activation during affect processing. We hypothesized that insular responses to fear-related stimuli would be primarily related to the Physical Concerns subscale of the ASI, particularly for a sample of subjects with specific phobias. Adult healthy controls (HC; n = 22) and individuals with specific phobia, small animal subtype (SAP; n = 17), completed the ASI and underwent functional magnetic resonance imaging while engaged in a backward-masked affect perception task that presents emotional facial stimuli below the threshold of conscious perception. Groups did not differ in ASI, state or trait anxiety scores, or insula activation. Total ASI scores were positively correlated with activation in the right middle/anterior insula for the combined sample and for the HC and SAP groups separately. Multiple regression analysis revealed that the relationship between AS and insular activation was primarily accounted for by Physical Concerns only. Findings support the hypothesized role of the right anterior insula in the visceral/interoceptive aspects of AS, even in response to masked affective stimuli.	t	\N
21315438	Using the mismatch negativity (MMN) response, we examined how Standard French and Southern French speakers access the meaning of words ending in /e/ or /ε/ vowels which are contrastive in Standard French but not in Southern French. In Standard French speakers, there was a significant difference in the amplitude of the brain response after the deviant-minus-standard subtraction between the frontocentral (FC) and right lateral (RL) recording sites for the final-/ε/ word but not the final-/e/ word. In contrast, the difference in the amplitude of the brain response between the FC and RL recording sites did not significantly vary as a function of the word's final vowel in Southern French speakers. Our findings provide evidence that access to lexical meaning in spoken word recognition depends on the speaker's native regional accent.	t	\N
21315875	The present study examined the occurrence and content of auditory hallucinatory experiences in 41 non-clinical participants scoring high or low on the Oxford-Liverpool Inventory of Feelings and Experiences (brief version; OLIFE-B) measure of schizotypy. Participants listened to 10 1-min recordings of white noise, some of which contained embedded concrete or abstract words, and were asked to record the words that they had heard. High scorers on the unusual experiences (UE) scale of the OLIFE-B reported hearing more words, not actually present, relative to low scorers on that measure. In addition, high UE scorers showed a bias toward making hallucinatory reports of an abstract type over a concrete type. These results suggest a bias toward more auditory hallucinatory reports in high scorers in schizotypy, and particularly to those of an abstract type.	t	\N
21319937	A Danish version of the hearing in noise test (HINT) has been developed and evaluated in normal-hearing (NH) and hearing-impaired (HI) listeners. The speech material originated from Nielsen & Dau (2009) where a sentence-based intelligibility equalization method was presented. In the present study, the speech material was evaluated for naturalness and a subset of sentences selected. The new sentence lists were validated, and after three weeks retested. An additional experiment investigated how recollection of sentences affected the listeners' performance. 16 NH and 16 HI listeners participated in the validation and retest. Twelve HI listeners participated in the experiment on recollection. The average speech recognition threshold in noise (SRT(N)) for the NH listeners was -2.52 dB, with an overall standard deviation of 0.87 dB. The within-subject standard deviation was similar for the NH and the HI listeners. In the retest, the SRT(N) decreased by 0.4 dB in both groups. The Danish HINT consists of 10 test lists and three practice lists each containing 20 sentences. The validation results are comparable to those of other versions of HINT. The test seems equally reliable for NH and HI listeners. After three weeks, reliable results can be obtained when sentence lists are reused with the same listeners.	t	\N
21324531	In recent years, new speech coding strategies have been developed with the aim of improving the transmission of temporal fine structure to cochlear implant recipients. This study reports on the implementation of one such strategy (fine structure processing, FSP) in children. This was a prospective study investigating the upgrade to a new speech processor. The upgrade used a repeated measures design with an alternating order of conditions (A-B-A-B design). Twelve pre- and perilingually deaf children with MED-EL C40+ cochlear implants were enrolled in the study. Patients were upgraded from their Tempo+ speech processor, which used continuous interleaved sampling (CIS) in combination with a frequency spectrum of 200-8500 Hz, to an Opus speech processor, which used FSP with an extended frequency spectrum of 70-8500 Hz. The primary means of testing was an HSM (Hochmair, Schulz and Moser) sentence test at 65 and 80 dB in quiet. In addition, the "Mainzer Kindersprachtest" (Mainz audiometric speech test for children) was applied at 65 and 70 dB. When the new FSP speech processor was used together with the extended low frequency range, HSM sentence tests at 65 and 80 dB resulted in scores indicating statistically significant improvements of 7.1 and 9.9 percentage points, respectively. Scores in the "Mainzer Kindersprachtest" at 65 and 70 dB indicated statistically significant improvements of 9.3 and 6.1 percentage points, respectively. The present study clearly shows that children benefit from the fine structure speech coding strategy in combination with an extended frequency spectrum in the low frequencies, as is offered by the Opus speech processors. This should be taken into consideration when fitting pre- and perilingually deaf children implanted almost a decade previously.	t	\N
21329941	Children with speech sound disorder (SSD) and reading disability (RD) have poor phonological awareness, a problem believed to arise largely from deficits in processing the sensory information in speech, specifically individual acoustic cues. However, such cues are details of acoustic structure. Recent theories suggest that listeners also need to be able to integrate those details to perceive linguistically relevant form. This study examined abilities of children with SSD, RD, and SSD+RD not only to process acoustic cues but also to recover linguistically relevant form from the speech signal. Ten- to 11-year-olds with SSD (n=17), RD (n=16), SSD+RD (n=17), and Controls (n=16) were tested to examine their sensitivity to (1) voice onset times (VOT); (2) spectral structure in fricative-vowel syllables; and (3) vocoded sentences. Children in all groups performed similarly with VOT stimuli, but children with disorders showed delays on other tasks, although the specifics of their performance varied. Children with poor phonemic awareness not only lack sensitivity to acoustic details, but are also less able to recover linguistically relevant forms. This is contrary to one of the main current theories of the relation between spoken and written language development. Readers will be able to (1) understand the role speech perception plays in phonological awareness, (2) distinguish between segmental and global structure analysis of speech perception, (3) describe differences and similarities in speech perception among children with speech sound disorder and/or reading disability, and (4) recognize the importance of broadening clinical interventions to focus on recognizing structure at all levels of speech analysis.	t	\N
21330649	English proficiency must be considered when a bilingual individual is to be evaluated clinically with English speech material. This study describes the minimum level of self-reported English proficiency that identifies bilingual individuals who may perform on par with monolingual listeners on an English word recognition test. A total of 125 normal hearing bilingual listeners rated their English proficiency in listening, speaking, and reading on an 11-point scale. Other related linguistic variables were also obtained. A randomly selected Northwestern University Auditory Test No. 6 (NU-6) list (50 English monosyllabic words) was presented to all participants at 45 dB HL in quiet. Over 90% of the listeners self-rated to have at least "good" proficiency in English listening, speaking, or reading. Of these participants, more than 30% did not achieve a monolingual normative level in English as delimited by binomial distribution. Composite proficiency ratings across language domains better predicted word recognition performance than self-ratings for listening proficiency only. Combining language dominance and age of English acquisition with proficiency ratings further improved prediction specificity. Self-rated English proficiency can predict bilingual listeners' performance on the NU-6 test. For desirable sensitivity and specificity in predicting monolingual-like performance, a minimum rating of 8 out of 10 across all language domains is recommended.	t	\N
21331781	Data are limited on the role of psychotherapy in the treatment of Ménière disease. We sought to document the effect of a psychotherapeutic technique known as autogenic training on clinical outcome in Ménière disease. Six patients with Ménière disease were studied. Retrospective chart review was conducted. All patients were refractory to conventional therapy and completed a course of autogenic training, which was offered as a complementary treatment. Autogenic training with initial psychological counseling was conducted by a clinical psychologist during 45-min sessions. Outcome measures assessed were the frequency of vertigo and functional levels 2 years after initiation of autogenic training. Functional levels were evaluated according to the 1995 guidelines of the American Academy of Otolaryngology-Head Neck Surgery (AAO-HNS). As a personality measure, we used the Maudsley Personality Inventory (MPI), devised by Eysenck, which measures neuroticism (N), extraversion (E), and propensity to lie (L). Five of six patients showed improved functional level after three to eight sessions of psychotherapy; hearing level did not change. The score of the N scale of the MPI was closely related to the number of psychotherapy sessions. Prognosis was evaluated based on the AAO-HNS reporting guidelines, as follows: A = 3, B = 1, C = 1, F = 1. The value of N in MPI was closely related to the number of psychological counseling sessions (R = 0.97, P < 0.05). In conclusion, autogenic training may enhance the mental well-being of patients with Ménière disease and improve clinical outcome.	t	\N
21332488	Although a deficit perceiving phonemes, as indexed by the mismatch negativity (MMN), is apparent in developmental dyslexia (DD), studies have not yet addressed whether this deficit might be a result of deficient native language speech representations. The present study examines how a native-vowel prototype and an atypical vowel are discriminated by 9-year-old children with (n = 14) and without (n = 12) DD. MMN was elicited in all conditions in both groups. The control group revealed enhanced MMN to the native-vowel prototype in comparison to the atypical vowel. Children with DD did not show enhanced MMN amplitude to the native-vowel prototype, suggesting impaired tuning to native language speech representations. Furthermore, higher MMN amplitudes to the native-vowel prototype correlated with more advanced reading (r = - .47) and spelling skills (r = - .52).	t	\N
21336137	Werner and Bargones (1991) observed that a 4-10-kHz noise band can mask a 1-kHz signal during infancy. The purpose of this study was to examine whether remote-noise masking extends into the school-aged years. Listeners were 4-6-yr-olds, 7-9-yr-olds, and adults. Detection thresholds were measured for the 1-kHz signal in quiet and in the presence of the remote-frequency noise. In separate conditions, masker level was either 40 or 60 dB SPL. On average, thresholds for the 1-kHz signal were elevated in the presence of the remote-frequency noise for 4-6-yr-olds, but not for 7-9-yr-olds or adults. Group average thresholds were similar across masker levels, indicating nonperipheral effects. Susceptibility to remote-frequency masking in children extends to 4-6 yrs of age for some children.	t	\N
21340665	The songbird model is widely established in a number of laboratories for the investigation of the neurobiology and development of vocal learning. While vocal learning is rare in the animal kingdom, it is a trait that songbirds share with humans. The neuroanatomical and physiological organization of the brain circuitry that controls learned vocalizations has been extensively characterized, particularly in zebra finches (Taeniopygia guttata). Recently, several powerful molecular and genomic tools have become available in this organism, making it an attractive choice for neurobiologists interested in the neural and genetic basis of a complex learned behavior. Here, we briefly review some of the main features of vocal learning and associated brain structures in zebra finches and comment on some examples that illustrate how themes related to nutrition and addiction can be explored using this model organism.	t	\N
21346208	Research on the control of visually guided limb movements indicates that the brain learns and continuously updates an internal model that maps the relationship between motor commands and sensory feedback. A growing body of work suggests that an internal model that relates motor commands to sensory feedback also supports vocal control. There is evidence from arm-reaching studies that shows that when provided with a contextual cue, the motor system can acquire multiple internal models, which allows an animal to adapt to different perturbations in diverse contexts. In this study we show that trained singers can rapidly acquire multiple internal models regarding voice fundamental frequency (F(0)). These models accommodate different perturbations to ongoing auditory feedback. Participants heard three musical notes and reproduced each one in succession. The musical targets could serve as a contextual cue to indicate which direction (up or down) feedback would be altered on each trial; however, participants were not explicitly instructed to use this strategy. When participants were gradually exposed to altered feedback adaptation was observed immediately following vocal onset. Aftereffects were target specific and did not influence vocal productions on subsequent trials. When target notes were no longer a contextual cue, adaptation occurred during altered feedback trials and evidence for trial-by-trial adaptation was found. These findings indicate that the brain is exceptionally sensitive to the deviations between auditory feedback and the predicted consequence of a motor command during vocalization. Moreover, these results indicate that, with contextual cues, the vocal control system may maintain multiple internal models that are capable of independent modification during different tasks or environments.	t	\N
21354285	This study investigates behavioural and objective measures of temporal auditory processing and their relation to the ability to understand speech in noise. The experiments were carried out on a homogeneous group of seven hearing-impaired listeners with normal sensitivity at low frequencies (up to 1 kHz) and steeply sloping hearing losses above 1 kHz. For comparison, data were also collected for five normal-hearing listeners. Temporal processing was addressed at low frequencies by means of psychoacoustical frequency discrimination, binaural masked detection and amplitude modulation (AM) detection. In addition, auditory brainstem responses (ABRs) to clicks and broadband rising chirps were recorded. Furthermore, speech reception thresholds (SRTs) were determined for Danish sentences in speech-shaped noise. The main findings were: (1) SRTs were neither correlated with hearing sensitivity as reflected in the audiogram nor with the AM detection thresholds which represent an envelope-based measure of temporal resolution; (2) SRTs were correlated with frequency discrimination and binaural masked detection which are associated with temporal fine-structure coding; (3) The wave-V thresholds for the chirp-evoked ABRs indicated a relation to SRTs and the ability to process temporal fine structure. Overall, the results demonstrate the importance of low-frequency temporal processing for speech reception which can be affected even if pure-tone sensitivity is close to normal.	t	\N
21355809	This study examined listeners' endorsement of cognitive, linguistic, segmental, and suprasegmental strategies employed when listening to speakers with dysarthria. The study also examined whether strategy endorsement differed between listeners who earned the highest and lowest intelligibility scores. Speakers were eight individuals with dysarthria and cerebral palsy. Listeners were 80 individuals who transcribed speech stimuli and rated their use of each of 24 listening strategies on a 4-point scale. Results showed that cognitive and linguistic strategies were most highly endorsed. Use of listening strategies did not differ between listeners with the highest and lowest intelligibility scores. Results suggest that there may be a core of strategies common to listeners of speakers with dysarthria that may be supplemented by additional strategies, based on characteristics of the speaker and speech signal.	t	\N
21368051	Certain features of objects or events can be represented by more than a single sensory system, such as roughness of a surface (sight, sound, and touch), the location of a speaker (audition and sight), and the rhythm or duration of an event (by all three major sensory systems). Thus, these properties can be said to be sensory-independent or amodal. A key question is whether common multisensory cortical regions process these amodal features, or does each sensory system contain its own specialized region(s) for processing common features? We tackled this issue by investigating simple duration-detection mechanisms across audition and touch; these systems were chosen because fine duration discriminations are possible in both. The mismatch negativity (MMN) component of the human event-related potential provides a sensitive metric of duration processing and has been elicited independently during both auditory and somatosensory investigations. Employing high-density electroencephalographic recordings in conjunction with intracranial subdural recordings, we asked whether fine duration discriminations, represented by the MMN, were generated in the same cortical regions regardless of the sensory modality being probed. Scalp recordings pointed to statistically distinct MMN topographies across senses, implying differential underlying cortical generator configurations. Intracranial recordings confirmed these noninvasive findings, showing generators of the auditory MMN along the superior temporal gyrus with no evidence of a somatosensory MMN in this region, whereas a robust somatosensory MMN was recorded from postcentral gyrus in the absence of an auditory MMN. The current data clearly argue against a common circuitry account for amodal duration processing.	t	\N
21375597	The neural processing of auditory motion information shows a pronounced interhemispheric asymmetry. In previous electrophysiological studies, the so-called motion-onset response (MOR), a prominent auditory-evoked potential to the onset of sound motion, was stronger over the hemisphere contralateral to the side of motion. Here, effects of lateral-onset position and direction of motion on MOR contralaterality were investigated. Eighteen listeners were presented with free-field sound stimuli that, after an initial stationary phase at a lateral spatial position within the left or right hemifield, started to move either left- or rightward. The early part of the MOR, the so-called change-N1, exhibited contralaterality that depended on the lateral motion-onset position with stronger activations over the hemisphere contralateral to the side of motion onset, whereas the contralaterality of the later part of the MOR, the so-called change-P2, merely depended on the direction of motion. Cortical source localization indicated that this pattern of contralaterality primarily resulted from asymmetric activation in primary auditory cortex and insula. These findings suggest that the early and late parts of the MOR reflect different phases in auditory motion perception, supporting the notion of a modular organization of discrete processing stages.	t	\N
21382389	Evidence from event-related potential (ERP) analyses of English spoken words suggests that the time course of English word recognition in monosyllables is cumulative. Different types of phonological competitors (i.e., rhymes and cohorts) modulate the temporal grain of ERP components differentially (Desroches, Newman, & Joanisse, 2009). The time course of Chinese monosyllabic spoken word recognition could be different from that of English due to the differences in syllable structure between the two languages (e.g., lexical tones). The present study investigated the time course of Chinese monosyllabic spoken word recognition using ERPs to record brain responses online while subjects listened to spoken words. During the experiment, participants were asked to compare a target picture with a subsequent picture by judging whether or not these two pictures belonged to the same semantic category. The spoken word was presented between the two pictures, and participants were not required to respond during its presentation. We manipulated phonological competition by presenting spoken words that either matched or mismatched the target picture in one of the following four ways: onset mismatch, rime mismatch, tone mismatch, or syllable mismatch. In contrast to the English findings, our findings showed that the three partial mismatches (onset, rime, and tone mismatches) equally modulated the amplitudes and time courses of the N400 (a negative component that peaks about 400ms after the spoken word), whereas, the syllable mismatched words elicited an earlier and stronger N400 than the three partial mismatched words. The results shed light on the important role of syllable-level awareness in Chinese spoken word recognition and also imply that the recognition of Chinese monosyllabic words might rely more on global similarity of the whole syllable structure or syllable-based holistic processing rather than phonemic segment-based processing. We interpret the differences in spoken word processing between Chinese and English listeners as being due to morphosyllabic structural differences between the two languages.	t	\N
21382438	Emotions signal the particular relevance of situations, threatening or rewarding, and influence perception and behaviour accordingly. Research to date has predominantly investigated the impact of negative emotional stimuli. However, rapid reactions to positive emotional stimuli are similarly adaptive. Here, we tested the influence of positive emotional stimuli on attentional control, which enables reacting to conflicting stimulation. We therefore presented positive emotional and neutral words in an auditory Simon task. Reaction times revealed faster resolution of conflict when target stimuli were positive compared to neutral words. Also, emotion modulated the first conflict-sensitive event-related brain potential, a negativity at 420 ms, indicating an influence on early stages of conflict processing. These results complement recent data on negative stimuli and suggest that positive stimuli are equally salient. The rapid impact on attentional control is evolutionary highly adaptive as it reduces the time that conflict yields an organism incapable of responding to reward-signalling stimuli.	t	\N
21385014	The objective of this study was to describe the auditory evoked response to silent gaps for a group of older adults using stimulus conditions identical to those used in psychophysical studies of gap detection. The P1-N1-P2 response to the onsets of stimuli (markers) defining a silent gap for within-channel (spectrally identical markers) and across-channel (spectrally different markers) conditions was examined using four perceptually-equated gap durations. A group of 24 older adults (mean age = 63 years) with normal hearing or minimal hearing loss participated. Older adults exhibited neural activation patterns that were qualitatively different and more frontally oriented than those observed in a previous study (Lister et al., 2007) of younger listeners. Older adults showed longer P2 latencies and larger P1 amplitudes than younger adults, suggesting relatively slower neural travel time and altered auditory inhibition/arousal by irrelevant stimuli. Older adults appeared to recruit later-occurring T-complex-like generators for gap processing, compared to earlier-occurring T-complex-like generators by the younger group. Early and continued processing of channel cues with later processing of gap cues may represent the inefficiency of the aging auditory system and may contribute to poor speech understanding in noisy, real-world listening environments.	t	\N
21387016	Important sounds can be easily missed or misidentified in the presence of extraneous noise. We describe an auditory illusion in which a continuous ongoing tone becomes inaudible during a brief, non-masking noise burst more than one octave away, which is unexpected given the frequency resolution of human hearing. Participants strongly susceptible to this illusory discontinuity did not perceive illusory auditory continuity (in which a sound subjectively continues during a burst of masking noise) when the noises were short, yet did so at longer noise durations. Participants who were not prone to illusory discontinuity showed robust early electroencephalographic responses at 40-66 ms after noise burst onset, whereas those prone to the illusion lacked these early responses. These data suggest that short-latency neural responses to auditory scene components reflect subsequent individual differences in the parsing of auditory scenes.	t	\N
21391255	Semantic knowledge is supported by a widely distributed neuronal network, with differential patterns of activation depending upon experimental stimulus or task demands. Despite a wide body of knowledge on semantic object processing from the visual modality, the response of this semantic network to environmental sounds remains relatively unknown. Here, we used fMRI to investigate how access to different conceptual attributes from environmental sound input modulates this semantic network. Using a range of living and manmade sounds, we scanned participants whilst they carried out an object attribute verification task. Specifically, we tested visual perceptual, encyclopedic, and categorical attributes about living and manmade objects relative to a high-level auditory perceptual baseline to investigate the differential patterns of response to these contrasting types of object-related attributes, whilst keeping stimulus input constant across conditions. Within the bilateral distributed network engaged for processing environmental sounds across all conditions, we report here a highly significant dissociation within the left hemisphere between the processing of visual perceptual and encyclopedic attributes of objects.	t	\N
21401449	There is no significant difference in speech recognition scores obtained with the Vibrant Soundbridge and the open-fit hearing aid. However, the Vibrant Soundbridge may be superior to open-fit hearing aids in improving hearing at high frequencies (4-8 kHz). To assess whether an improvement in speech recognition conferred by Vibrant Soundbridge is more marked than that afforded by open-fit hearing aids in patients with sloping high frequency sensorineural hearing loss (SNHL). This study had a self-control prospective design. Seven patients aged 21-62 years with sloping high frequency SNHL were recruited into the study. Each patient received a Vibrant Soundbridge middle ear implant (Vibrant MED-EL) and wore an open-fit hearing aid (Danavox, DOT 10). Speech recognition tests were performed according to protocols suggested by árpád Götze's speech definition test in Hungarian language. In the first session, conventional hearing thresholds (unaided pure tone thresholds) were measured. In the second session, the aided sound-field threshold, speech understanding score and functional gain obtained using middle ear implants and open-fit hearing aids were determined after programming of the devices. Regarding speech recognition scores, there were no significant differences between data obtained with the middle ear implant and those with the open-fit hearing aid.	t	\N
21417673	The effects of asymmetric directional microphone fittings (i.e., an omnidirectional microphone on one ear and a directional microphone on the other) on speech understanding in noise and acceptance of background noise were investigated in 15 full-time hearing aid users. Subjects were fitted binaurally with four directional microphone conditions (i.e., binaural omnidirectional, right asymmetric directional, left asymmetric directional and binaural directional microphones) using Siemens Intuis Directional behind-the-ear hearing aids. Speech understanding in noise was assessed using the Hearing in Noise Test, and acceptance of background noise was assessed using the Acceptable Noise Level procedure. Speech was presented from 0° while noise was presented from 180° azimuth. The results revealed that speech understanding in noise improved when using asymmetric directional microphones compared to binaural omnidirectional microphone fittings and was not significantly hindered compared to binaural directional microphone fittings. The results also revealed that listeners accepted more background noise when fitted with asymmetric directional microphones as compared to binaural omnidirectional microphones. Lastly, the results revealed that the acceptance of noise was further increased for the binaural directional microphones when compared to the asymmetric directional microphones, maximizing listeners' willingness to accept background noise in the presence of noise. Clinical implications will be discussed.	t	\N
21425399	Multisensory integration assists us to identify objects by providing multiple cues with respect to object category and spatial location. We used a semantic audiovisual object matching task to determine the effect of spatial congruency on response behavior and fMRI brain activation. Fifteen subjects responded in a four-alternative response paradigm, which visual quadrant contained the object best matched to the sound presented. Realistic sounds based on head-related transfer functions were presented binaurally with the simulated sound source corresponding to one of the four quadrants. Following a random sequence, the location of the sound corresponded to the quadrant containing the semantically congruent target on half the trials, whereas on other trials the sound arose from an incongruent location. We examined the effects of spatial congruency on response latencies, hit-rates and fMRI responses. Preliminary behavioral results revealed a significant effect of spatial congruency on response latency or performance for stimuli with noise added. In the fMRI experiment, spatial congruency had a significant effect on the BOLD response. A cluster in the right middle and superior temporal gyrus was more activated when the auditory sound sources were spatially congruent with the semantically matching visual stimulus. In an exploratory post-hoc analysis, in which we correlated the BOLD signal with the subjects' ability to locate the sound sources, we found a significant cluster in the left inferior frontal cortex, where the BOLD response increased with sound-source localization performance. Thus spatial congruency appears to enhance the semantic integration of audiovisual object information in these brain regions.	t	\N
21425900	Absolute pitch is a rare pitch-naming ability with unknown etiology. Some scientists maintain that its manifestation depends solely on environmental factors, while others suggest that genetic factors contribute to it. We sought to further investigate the hypothesis that genetic factors support the acquisition of absolute pitch and to better elucidate the inheritance pattern of this trait. To this end, we conducted a twin study and a segregation analysis using data collected from a large population of absolute pitch possessors. The casewise concordance rate of 14 monozygotic twin pairs, 78.6%, was significantly different from that of 31 dizygotic twin pairs, 45.2%, assuming single ascertainment (x(2) = 5.57, 1 df, p = .018), supporting a role for genetics in the development of absolute pitch. Segregation analysis of 1463 families, assuming single ascertainment, produced a segregation ratio p(D) = .089 with SEp(D) = 0.006. Unlike an earlier segregation analysis on a small number of absolute pitch probands from musically educated families, our study indicates that absolute pitch is not inherited in a simple Mendelian fashion. Based on these data, absolute pitch is likely genetically heterogeneous, with environmental, epigenetic, and stochastic factors also perhaps contributing to its genesis. These findings are in agreement with the results of our recent linkage analysis.	t	\N
21425948	Acute migraine could be associated with neurophysiological and cognitive changes. This study evaluates the neurophysiological changes in auditory information processing in adolescents with acute migraine by means of magnetoencephalography. The multifeature sound mismatch negativity (MMN) paradigm was used to study nine adolescents with an acute migraine and nine age- and gender-matched healthy controls. Latencies and amplitudes of M100, M150, M200, and MMNm responses were evaluated. Migraine subjects had smaller M150 amplitudes than healthy subjects. The latencies of MMNm response for the frequency change were delayed in both hemispheres in migraine subjects, as compared with healthy controls. Our results indicate that the function of neural substrates, responsible for different stages of auditory information processing, is impaired during the acute migraine. The identification of underlying cortical dysfunction during an acute migraine can lead to future identification of neurophysiological biomarkers for studying acute migraine and response to treatment.	t	\N
21426312	To examine the social cognitive vulnerabilities mediating delusion formation in children presenting with hallucinatory experiences. A sample of 259 12- and 13-year-old children, from a baseline case-control sample of children with and without auditory hallucinations (AH), were re-assessed after 5 years for presence of AH. Presence of delusions and theory of mind (ToM) were also assessed, to examine the hypothesized moderating role of ToM in delusion formation in children hearing voices. In children with AH at age 7-8 and/or 12-13 years, the risk of delusion formation was significantly higher (P interaction = 0.027) in those with lower ToM skills (OR = 4.3, 95% CI 1.9-9.9, P = 0.000), compared to those with higher ToM skills (OR 1.6, 95% CI 0.7-3.7, P = 0.26), independently from secondary school level. The results suggest that better mentalizing abilities confer protection against delusion formation in children experiencing perceptual anomalies, not reducible to general cognitive ability.	t	\N
21429648	Dichotic listening (DL) techniques have been used extensively as a non-invasive procedure to assess language lateralization among children with and without learning disabilities (LD), and with individuals who have other auditory system related brain disorders. Results of studies using DL have indicated that language is lateralized in children with LD and that the lateralized language asymmetries do not develop after age 6 nor are they affected by gender. Observed differences in lateralized language processes between control children and those with LD were found not due to delayed cerebral dominance, but rather to deficits in selective attention. In addition, attention factors have a greater influence on auditory processing of verbal than nonverbal stimuli for children with LD, and children with LD exhibit a general processing bias to the same hemisphere unlike control children. Furthermore, employing directed attention conditions in DL experiments has played an important role in explaining learning disabled children's performance on DL tasks. We conclude that auditory perceptual asymmetries as assessed by DL with children who experience LD are the result of the interaction of hemispheric capability and attention factors.	t	\N
21439268	Studies using event related potentials have shown that men are more likely than women to rely on semantic cues when understanding emotional speech. In a previous functional Magnetic Resonance Imaging (fMRI) study, using an affective sentence classification task, we were able to separate areas involved in semantic processing and areas involved in the processing of affective prosody (Beaucousin et al., 2007). Here we searched for sex-related differences in the neural networks active during emotional speech processing in groups of men and women. The ortholinguistic abilities of the participants did not differ when evaluated with a large battery of tests. Although the neural networks engaged by men and women during emotional sentence classification were largely overlapping, sex-dependent modulations were detected during emotional sentence classification, but not during grammatical sentence classification. Greater activity was observed in men, compared with women, in inferior frontal cortical areas involved in emotional labeling and in attentional areas. In conclusion, at equivalent linguistic abilities and performances, men activate semantic and attentional cortical areas to a larger extent than women during emotional speech processing.	t	\N
21439350	Bone-conducted ultrasound (BCU) modulated by speech sound is recognized as speech sound and activates the auditory cortex similar to audible sound. To investigate the mechanisms of perception, the effects of stimulus duration on N1m were compared among air-conducted audible speech sound (AC speech), AC speech with carrier BCU and speech-modulated BCU in eight native Japanese with normal hearing. The Japanese vowel sound /a/ was used as a stimulus with durations of 10, 15, 20, 30, 40 and 60 ms. Comparison between AC speech with and without carrier showed that the presentation of carrier had no effect on N1m evoked by AC speech. Comparison among the three conditions showed that N1m amplitude for speech-modulated BCU differed from that for the two AC speeches. Moreover, N1m amplitude growth saturated at 40 ms for speech-modulated BCU, and at 20 ms for two AC speeches. These results suggest a difference in temporal integration of N1m between speech-modulated BCU and AC speech. Considering these results, it is reasonable to conclude that N1m evoked by speech-modulated BCU is influenced mainly by the ultrasonic component rather than demodulated audible sound. Given this finding, the notion needs to be considered that the mechanisms underlying perception and recognition of speech-modulated BCU depend on the ultrasonic component to some extent.	t	\N
21441013	Many reports have described that individuals with Alzheimer's disease show neural breakdown in the brainstem nuclei, hippocampus and auditory cortex in the temporal lobe of the brain. However, it is still unclear whether auditory skills mediated by these areas differ across individuals with and without Alzheimer's disease and how these auditory skills are further confounded by reduction in cognitive function in individuals with AD. The aim of this study is to discover the hidden and nonlinear associations among higher-order auditory and cognitive processes in individuals with and without Alzheimer's disease through artificial neural network analysis. The analyses were based on auditory test data obtained from nine clinically confirmed Alzheimer's disease cases and nine age-matched controls. Hearing threshold sensitivity was equivalent across groups, indicating similar peripheral auditory function. Auditory function was evaluated by standardized tests of auditory closure, auditory attention, and auditory figure ground in listeners with and without Alzheimer's disease. The inputs used for analyses were cognitive status and auditory function. The dependent variables were RAU scores computed from scores of auditory tests. Artificial neural networks showed a complex relationship between the input variables (cognitive status and auditory function) that cannot be predicted simply on the basis of cognition differences between individuals with and without Alzheimer's disease. The results of this study suggest that central auditory function declines with age, regardless of changes in cognitive function.	t	\N
21452943	In this work, we show that electrophysiological responses during pitch perception are best explained by distributed activity in a hierarchy of cortical sources and, crucially, that the effective connectivity between these sources is modulated with pitch strength. Local field potentials were recorded in two subjects from primary auditory cortex and adjacent auditory cortical areas along the axis of Heschl's gyrus (HG) while they listened to stimuli of varying pitch strength. Dynamic causal modeling was used to compare system architectures that might explain the recorded activity. The data show that representation of pitch requires an interaction between nonprimary and primary auditory cortex along HG that is consistent with the principle of predictive coding.	t	\N
21455973	The loudness dependence of auditory evoked potentials (LDAEP) has been described as a measure of central serotonergic activity. Single-electrode estimation and dipole source analysis (DSA) are the most utilized methods for the estimation of LDAEP. To date, it is assumed that both methods are equally reliable. Nevertheless, according to our knowledge, the advantage of either method has not yet been shown directly. The aim of our study was to compare single-electrode estimation and dipole source analysis in the determination of the LDAEP. Tones of five different intensities were presented binaurally to 10 healthy volunteers. Amplitudes of N1/P2 and LDAEP were determined at the central electrode site referenced to average and to linked mastoids and with DSA in the left and the right hemispheres. Scores were normalized (z-scores), compared, and correlated. Contrary to our expectations, we found a significant difference between scores obtained with single-electrode estimation and with DSA. The difference may be caused by confounding activation of a frontal source in the single-electrode method. The single-electrode approach cannot be equated with DSA in the determination of the LDAEP. This should be considered when comparing the results of different LDAEP studies using only one of these methods.	t	\N
21458862	The purposes of the present study were to establish the Standard-Chinese version of Lexical Neighborhood Test (LNT) and to examine the lexical and age effects on spoken-word recognition in normal-hearing children. Six lists of monosyllabic and six lists of disyllabic words (20 words/list) were selected from the database of daily speech materials for normal-hearing (NH) children of ages 3-5 years. The lists were further divided into "easy" and "hard" halves according to the word frequency and neighborhood density in the database based on the theory of Neighborhood Activation Model (NAM). Ninety-six NH children (age ranged between 4.0 and 7.0 years) were divided into three different age groups of 1-year intervals. Speech-perception tests were conducted using the Standard-Chinese monosyllabic and disyllabic LNT. The inter-list performance was found to be equivalent and inter-rater reliability was high with 92.5-95% consistency. Results of word-recognition scores showed that the lexical effects were all significant. Children scored higher with disyllabic words than with monosyllabic words. "Easy" words scored higher than "hard" words. The word-recognition performance also increased with age in each lexical category. A multiple linear regression analysis showed that neighborhood density, age, and word frequency appeared to have increasingly more contributions to Chinese word recognition. The results of the present study indicated that performances of Chinese word recognition were influenced by word frequency, age, and neighborhood density, with word frequency playing a major role. These results were consistent with those in other languages, supporting the application of NAM in the Chinese language. The development of Standard-Chinese version of LNT and the establishment of a database of children of 4-6 years old can provide a reliable means for spoken-word recognition test in children with hearing impairment.	t	\N
21461784	Four eyetracking experiments examined whether semantic and visual-shape representations are routinely retrieved from printed word displays and used during language-mediated visual search. Participants listened to sentences containing target words that were similar semantically or in shape to concepts invoked by concurrently displayed printed words. In Experiment 1, the displays contained semantic and shape competitors of the targets along with two unrelated words. There were significant shifts in eye gaze as targets were heard toward semantic but not toward shape competitors. In Experiments 2-4, semantic competitors were replaced with unrelated words, semantically richer sentences were presented to encourage visual imagery, or participants rated the shape similarity of the stimuli before doing the eyetracking task. In all cases, there were no immediate shifts in eye gaze to shape competitors, even though, in response to the Experiment 1 spoken materials, participants looked to these competitors when they were presented as pictures (Huettig & McQueen, 2007). There was a late shape-competitor bias (more than 2,500 ms after target onset) in all experiments. These data show that shape information is not used in online search of printed word displays (whereas it is used with picture displays). The nature of the visual environment appears to induce implicit biases toward particular modes of processing during language-mediated visual search.	t	\N
21467806	Vital-sign checks and physical assessment have been performed by physicians and nurses among medical staff in particular. However, pharmacists must also have basic skills of vital-sign checking and physical assessment to evaluate the patient condition/drug efficacy or prevent adverse reactions to drugs. To promote the acquisition of these skills, we prepared simulation programs with an emergency-care simulator, which facilitate the reproduction of excess-dose drug administration/condition changes. We used an emergency-care simulator equipped with a personal computer. General condition was established using the blinking velocity, cardiac/respiratory sounds and blood pH as parameters. As a results, concerning drug administration, the simulation programs facilitated the reproduction of symptoms related to the excess-dose insulin administration. With respect to changes in the condition, it facilitated the reproduction of asthma, hyperglycemia, and hemorrhage. This facilitated the palpation-, visual perception-, and auditory perception-mediated understanding of changes in the patient condition through fingertips and warnings/alarms on the monitor. Evaluation of the student for these program contents increased significantly (p<0.01). These programs can be downloaded via the Internet. Experience regarding excess-dose drug administration/condition changes with an emergency-care simulator is useful for checking patients' vital signs, evaluating the drug efficacy, and confirming adverse reactions to drugs. By the practice of these programs, we can teach pharmacy students how to check for vital signs (pulse palpation, auscultation, blood pressure measurement, and electrocardiography) in a school setting, not a hospital setting. Mastering these techniques may allow pharmacy students to determine the efficacy of a drug and adverse reactions.	t	\N
21476651	Recently, it has been suggested that the annoyance of residents at a given aircraft noise exposure level increases over the years. The objective of the present study was to verify the hypothesized trend and to identify its possible causes. To this end, the large database used to establish earlier exposure-response relationships on aircraft noise was updated with original data from several recent surveys, yielding a database with data from 34 separate airports. Multilevel grouped regression was used to determine the annoyance response per airport, after which meta-regression was used to investigate whether study characteristics could explain the heterogeneity in annoyance response between airports. A significant increase over the years was observed in annoyance at a given level of aircraft noise exposure. Furthermore, the type of annoyance scale, the type of contact, and the response percentage were found to be sources of heterogeneity. Of these, only the scale factor could statistically account for the trend, although other findings rule it out as a satisfactory explanation. No evidence was found for increased self-reported noise sensitivity. The results are of importance to the applicability of current exposure-annoyance relationships for aircraft noise and provide a basis for decisions on whether these need to be updated.	t	\N
21476675	Key features of the voice--fundamental frequency (F(0)) and formant frequencies (Fn)--can vary extensively among individuals. Some of this variation might cue fitness-related, biosocial dimensions of speakers. Three experiments tested the independent, joint and relative effects of F(0) and Fn on listeners' assessments of the body size, masculinity (or femininity), and attractiveness of male and female speakers. Experiment 1 replicated previous findings concerning the joint and independent effects of F(0) and Fn on these assessments. Experiment 2 established frequency discrimination thresholds (or just-noticeable differences, JND's) for both vocal features to use in subsequent tests of their relative salience. JND's for F(0) and Fn were consistent in the range of 5%-6% for each sex. Experiment 3 put the two voice features in conflict by equally discriminable amounts and found that listeners consistently tracked Fn over F(0) in rating all three dimensions. Several non-exclusive possibilities for this outcome are considered, including that voice Fn provides more reliable cues to one or more dimensions and that listeners' assessments of the different dimensions are partially interdependent. Results highlight the value of first establishing JND's for discrimination of specific features of natural voices in future work examining their effects on voice-based social judgments.	t	\N
21477130	Evaluation of clinical characteristics, bacteriology and hearing in paediatric patients with and without chronic suppurative otitis media (CSOM) in Luanda, Angola. Interview, clinical examination, ear-discharge culture, open air pure-tone audiometry and brainstem auditory-evoked potentials of 23 outpatients with CSOM and 23 controls in a paediatric hospital. Of the CSOM vs. control children, 35% vs. 26% had running water, 70% vs. 70% electricity, 64% vs. 0% HIV (p<0.0001) and 36% vs. 0% tuberculosis in history (p=0.002). Ten (43%) children had bilateral CSOM. The major ear-discharge pathogens were Proteus spp. (44%) and Pseudomonas (22%). Hearing impairment of >25 dB was present in 52% of CSOM-affected ears and bilateral hearing loss in 7 (30%) CSOM children vs. zero control child (p=0.009). Only one hearing-impaired child's family had previously detected the handicap. CSOM occurred in children with high co-morbidity. Persistent otorrhoea was usually caused by Proteus spp. or Pseudomonas, and often suggestive of either HIV or hearing impairment. In the developing countries, prompt diagnosis and treatment of CSOM would enhance the children's linguistic and academic development.	t	\N
21477198	To investigate the interaction between segmental and supra-segmental stress-related information in early word learning, two experiments were conducted with 20- to 24-month-old English-learning children. In an adaptation of the object categorization study designed by Nazzi and Gopnik (2001), children were presented with pairs of novel objects whose labels differed by their initial consonant (Experiment 1) or their medial consonant (Experiment 2). Words were produced with a stress initial (trochaic) or a stress final (iambic) pattern. In both experiments successful word learning was established when the to-be-remembered contrast was embedded in a stressed syllable, but not when embedded in unstressed syllables. This was independent of the overall word pattern, trochaic or iambic, or the location of the phonemic contrast, word-initial or -medial. Results are discussed in light of the use of phonetic information in early lexical acquisition, highlighting the role of lexical stress and ambisyllabicity in early word processing.	t	\N
21479656	It is well established that sounds can enhance visual-target detection, but the mechanisms that govern these cross-sensory effects, as well as the neural pathways involved, are largely unknown. Here, we tested behavioral predictions stemming from the neurophysiologic and neuroanatomic literature. Participants detected near-threshold visual targets presented either at central fixation or peripheral to central fixation that were sometimes paired with sounds that originated from widely misaligned locations (up to 104° from the visual target). Our results demonstrate that co-occurring sounds improve the likelihood of visual-target detection (1) regardless of retinal eccentricity and (2) despite wide audiovisual misalignments. With regard to the first point, these findings suggest that auditory facilitation of visual-target detection is unlikely to operate through previously described corticocortical pathways from auditory cortex that predominantly terminate in regions of visual cortex that process peripheral visual space. With regard to the second point, auditory facilitation of visual-target detection seems to operate through a spatially non-specific modulation of visual processing.	t	\N
21483666	Acute stress is a stereotypical, but multimodal response to a present or imminent challenge overcharging an organism. Among the different branches of this multimodal response, the consequences of glucocorticoid secretion have been extensively investigated, mostly in connection with long-term memory (LTM). However, stress responses comprise other endocrine signaling and altered neuronal activity wholly independent of pituitary regulation. To date, knowledge of the impact of such "paracorticoidal" stress responses on higher cognitive functions is scarce. We investigated the impact of an ecological stressor on the ability to direct selective attention using event-related potentials in humans. Based on research in rodents, we assumed that a stress-induced imbalance of catecholaminergic transmission would impair this ability. The stressor consisted of a single cold pressor test. Auditory negative difference (Nd) and mismatch negativity (MMN) were recorded in a tonal dichotic listening task. A time series of such tasks confirmed an increased distractibility occurring 4-7 minutes after onset of the stressor as reflected by an attenuated Nd. Salivary cortisol began to rise 8-11 minutes after onset when no further modulations in the event-related potentials (ERP) occurred, thus precluding a causal relationship. This effect may be attributed to a stress-induced activation of mesofrontal dopaminergic projections. It may also be attributed to an activation of noradrenergic projections. Known characteristics of the modulation of ERP by different stress-related ligands were used for further disambiguation of causality. The conjuncture of an attenuated Nd and an increased MMN might be interpreted as indicating a dopaminergic influence. The selective effect on the late portion of the Nd provides another tentative clue for this. Prior studies have deliberately tracked the adrenocortical influence on cognition, as it has proven most influential with respect to LTM. However, current cortisol-optimized study designs would have failed to detect the present findings regarding attention.	t	\N
21487700	Anxious individuals have been shown to interpret others' emotional states negatively. Since most studies have used facial expressions as emotional cues, we examined whether trait anxiety affects the recognition of emotion in a dynamic face and voice that were presented in synchrony. The face and voice cues conveyed either matched (e.g., happy face and voice) or mismatched emotions (e.g., happy face and angry voice). Participants with high or low trait anxiety were to indicate the perceived emotion using one of the cues while ignoring the other. The results showed that individuals with high trait anxiety were more likely to interpret others' emotions in a negative manner, putting more weight on the to-be-ignored angry cues. This interpretation bias was found regardless of the cue modality (i.e., face or voice). Since trait anxiety did not affect recognition of the face or voice cues presented in isolation, this interpretation bias appears to reflect an altered integration of the face and voice cues among anxious individuals.	t	\N
21491276	In order to determine how the interior of a car should sound, automotive manufacturers often rely on obtaining data from individual evaluations of vehicle sounds. Company identity could play a role in these appraisals, particularly when individuals are comparing cars from opposite ends of the performance spectrum. This research addressed the question: does company identity influence the evaluation of automotive sounds belonging to cars of a similar performance level and from the same market segment? Participants listened to car sounds from two competing manufacturers, together with control sounds. Before listening to each sound, participants were presented with the correct company identity for that sound, the incorrect identity or were given no information about the identity of the sound. The results showed that company identity did not influence appraisals of high performance cars belonging to different manufacturers. These results have positive implications for methodologies employed to capture the perceptions of individuals. STATEMENT OF RELEVANCE: A challenge in automotive design is to set appropriate targets for vehicle sounds, relying on understanding subjective reactions of individuals to such sounds. This paper assesses the role of company identity in influencing these subjective reactions and will guide sound evaluation studies, in which the manufacturer is often apparent.	t	\N
21493243	To determine whether patterns of functional connectivity of cortical regions responsible for auditory processing and executive functions differ in children with unilateral hearing loss (UHL) versus their normal-hearing (NH) siblings. Prospective observational study. Academic medical center. Children with severe-to-profound UHL (9 right UHL, 7 left UHL) and 10 NH sibling controls were imaged using resting state functional connectivity magnetic resonance imaging (rs-fcMRI). All MRI images were transformed to a single common atlas; regions of interest (ROI) were chosen based on previous literature and unpublished results. Mean regionwise correlations and conjunction analyses were performed across 34 seed ROIs to identify temporally synchronized, low-frequency spontaneous fluctuations in the resting state blood oxygenation level-dependent signal that reveal functionally related regions. Mean regionwise Student t tests found a left posterior opercular region with more correlated resting state activity with the inferior parietal lobule seed in the children with both left and right UHL than NH. In conjunction analysis, 4 regions showed different resting-state functional interactions between the NH and both UHL groups. These differences were in left medial globus pallidus, left middle temporal gyrus, right parahippocampal gyrus, and mid-cingulate cortex. These regions include areas associated with auditory processing, executive function, and memory formation. Resting state fcMRI identified differences in brain network interconnections between children with UHL and NH and may inform further investigation into the educational and behavioral difficulties experienced by children with UHL.	t	\N
21495790	To assess factors that contribute to Tinnitus Handicap Inventory (THI) scores in Japan. Case series with chart review. Two hundred and eighty-five tinnitus patients at tertiary referral center, who completed the Japanese version of the THI, the Self-rating Depression Scale (SDS), and the State Trait Anxiety Inventory (STAI). In multiple regression analysis, the SDS score contributed the most to the THI score. The state section of the STAI score and pure tone average (PTA) at four high frequencies also contributed significantly, but to lesser degrees. The other following factors were not statistically significant: age, gender, time from the onset of tinnitus to the first clinical visit, PTA at three mid frequencies, and trait section of the STAI score. This model may account for approximately 45% of THI score variability. The THI scores may be influenced by depressive symptoms, state anxiety, and pure tone thresholds in Japan.	t	\N
21502927	Mismatch negativity (MMN) is an event-related potential that is elicited by deviant sounds that are presented along with frequent sounds in the absence of attention. Auditory MMN is generated by the comparison process between sensory memory trace of a frequent auditory event and a deviant event. It is well known that frequent sounds are encoded in memory trace and processed as a single unit within 160-170 ms. This study examined whether deviant sound would be similarly processed as a temporal unit. Twelve healthy men were presented with relatively short standard sounds and relatively long deviant sounds that contained an omitted (i.e. silent) part. Three types of deviant sounds were designed to vary in duration. The MMN amplitude was gradually enhanced from the short to long duration deviant events that contained an omitted part. In contrast, MMN latency showed no significant differences among the deviants. These findings show that deviant sounds are also processed as a unitary event.	t	\N
21503649	Primate multisensory object perception involves distributed brain regions. To investigate the network character of these regions of the human brain, we applied data-driven group spatial independent component analysis (ICA) to a functional magnetic resonance imaging (fMRI) data set acquired during a passive audio-visual (AV) experiment with common object stimuli. We labeled three group-level independent component (IC) maps as auditory (A), visual (V), and AV, based on their spatial layouts and activation time courses. The overlap between these IC maps served as definition of a distributed network of multisensory candidate regions including superior temporal, ventral occipito-temporal, posterior parietal and prefrontal regions. During an independent second fMRI experiment, we explicitly tested their involvement in AV integration. Activations in nine out of these twelve regions met the max-criterion (A < AV > V) for multisensory integration. Comparison of this approach with a general linear model-based region-of-interest definition revealed its complementary value for multisensory neuroimaging. In conclusion, we estimated functional networks of uni- and multisensory functional connectivity from one dataset and validated their functional roles in an independent dataset. These findings demonstrate the particular value of ICA for multisensory neuroimaging research and using independent datasets to test hypotheses generated from a data-driven analysis.	t	\N
21507543	Dichotic listening originally was a means of studying attention. Half a century ago Doreen Kimura parlayed the dichotic method into a noninvasive indicator of lateralized cerebral language representation. The ubiquitous right-ear advantage (REA) for verbal material was accepted as a concomitant of left-sided language lateralization and preferential conduction of right-ear messages to the left hemisphere. As evidence has accumulated over the past 50years showing the REA to be dynamic and modifiable, the concept of attention has become essential for interpreting the findings. Progress in understanding the role of attention has been manifested as a transition from efforts to document attention effects to efforts to characterize their mechanisms. We summarize the relevant evidence, trace the evolution of explanatory models, and outline contemporary accounts of the role of attention in dichotic listening.	t	\N
21508086	Recent evidence demonstrates that perceptual rivalry rate can be modulated by perturbation of the serotonergic system. Specifically, pharmacologically lowering the availability of serotonin results in slower rivalry rates. As it has been suggested that brain serotonin is low during the interictal phase of migraine, we hypothesized that perceptual rivalry rates would be reduced in individuals with migraine. Visual and auditory perceptual rivalry measures were obtained for a group of 30 participants with migraine (15 migraine with aura, 15 migraine without aura) and 20 non-headache control individuals. Our experiments reveal fewer perceptual rivalry switches within both visual and auditory domains for our migraine without aura group, while the with-aura group performed similarly to non-headache controls. Dividing the data by headache frequency rather than headache subtype classification revealed fewer perceptual switches in those with more frequent headaches. Our data provides further support for interictal differences in brain sensory reactivity in migraine, with the observed effects being in the same direction as those caused by pharmacologically reducing brain availability of serotonin in normal observers.	t	\N
21509624	Elderly persons frequently complain about problems with speech understanding especially in complex acoustic situations. Besides hearing impairment the decline of cognitive functions might explain these problems. In 12 normal hearing young subjects and 14 elderly listeners with extraordinarily good hearing speech perception was measured in a broad range of different acoustic situations. Cognitive functioning was evaluated with different neuropsychological tests. Despite comparable pure tone thresholds the elderly listeners revealed worse speech discrimination than the young subjects in almost all test situations. Largest differences were found in situations with fluctuating maskers and competing talkers. Most of the speech perception results revealed significant correlations with the outcome from a neuropsychological test addressing declarative verbal memory. In complex listening situations elderly persons reveal worse speech understanding than younger subjects. Differences in speech perception can partly be attributed to cognitive abilities. In particular, working memory seems to be an important factor.	t	\N
21512805	The purpose of this study was to investigate whether multi-stimulus auditory steady-state responses were capable of estimating hearing thresholds in high-risk infants. A retrospective chart review study. Three tertiary referral centers. Infants born between January 2004 and December 2006 who met the criteria for risk factors of congenital hearing loss were enrolled in the study. While under sedation, the multi-stimulus auditory steady-state response was used to determine multi-channel auditory steady-state response thresholds for high-risk infants younger than 13 months. Conditioned play audiometry was then applied to these children at 23-48 months of age to obtain pure tone audiograms. Auditory steady-state response thresholds and pure tone thresholds were then compared. A total of 249 high-risk infants were enrolled in the study. 39 infants were lost during follow-up. The remaining 216 infants completed both examinations. The Pearson correlation coefficients (r) between the ASSR levels and pure tone thresholds were 0.88, 0.94, 0.94 and 0.97 at 500, 1,000, 2,000 and 4,000 Hz, respectively. The strength of the relationship between the auditory steady-state responses and pure tone thresholds increased with more severe degrees of hearing loss and higher frequencies. We conclude that initial multichannel ASSR thresholds measured under sedation are highly correlated with pure tone thresholds obtained 2 or 3 years later. ASSR can be used to predict the frequency-specific hearing thresholds of high-risk infants and can provide information for early hearing intervention.	t	\N
21524014	Recent research provides evidence that individuals shift in their perception of variants depending on social characteristics attributed to the speaker.This paper reports on a speech perception experiment designed to test the degree to which the age attributed to a speaker influences the perception of vowels undergoing a chain shift. As a result of the shift, speakers from different generations produce different variants from one another. Results from the experiment indicate that a speaker's perceived age can influence vowel categorization in the expected direction. However, only older participants are influenced by perceived speaker age.This suggests that social characteristics attributed to a speaker affect speech perception differently depending on the salience of the relationship between the variant and the characteristic.The results also provide evidence of an unexpected interaction between the sex of the participant and the sex of the stimulus.The interaction is interpreted as an effect of the participants' previous exposure with male and female speakers.The results are analyzed under an exemplar model of speech production and perception where social information is indexed to acoustic information and the weight of the connection varies depending on the perceived salience of sociophonetic trends.	t	\N
21526589	In a sample of 18 European nations, suicide rates were positively associated with the proportion of low notes in the national anthems and, albeit less strongly, with students' ratings of how gloomy and how sad the anthems sounded, supporting a hypothesis proposed by Rihmer.	t	\N
21534716	We suggest that cochlear implantation (CI) should be a good therapeutic modality for hearing restoration in patients with common cavity malformed ears. To analyze hearing improvement from CI performed in common cavity malformed cochleae. A total of 11 patients (5 male and 6 female, mean age 4.5 ± 2.8 years) and 12 ears were enrolled in this study. During the insertion of electrodes, we used C-arm fluoroscopy to avoid intrameatal placement. We evaluated hearing improvement every 6 months and the mean follow-up period was 80.5 ± 24.1 months (53-125 months). During the operation, there were only four cases with fully inserted electrodes. Cerebrospinal fluid gushed out in two cases during the cochleostomy and postoperative meningitis occurred in two patients. One patient had to undergo reimplantation 4 years later due to device failure and recurrent meningitis. During the 48 months follow-up hearing evaluation, the ability of hearing increased along with the age. The final average MAIS, CAP, SIR, and open set one- and two-syllable word scores were 90.3 ± 18.1%, 4.9 ± 1.6, 3.1 ± 0.9, 24.1 ± 25.9%, and 48.6 ± 38.7%, respectively.	t	\N
21537615	Profound hearing loss is a disability that affects personality and when it involves teenagers before language acquisition, these bio-psychosocial conflicts can be exacerbated, requiring careful evaluation and choice of them for cochlear implant. To evaluate speech perception by adolescents with profound hearing loss, users of cochlear implants. Prospective. Twenty-five individuals with severe or profound pre-lingual hearing loss who underwent cochlear implantation during adolescence, between 10 to 17 years and 11 months, who went through speech perception tests before the implant and 2 years after device activation. For comparison and analysis we used the results from tests of four choice, recognition of vowels and recognition of sentences in a closed setting and the open environment. The average percentage of correct answers in the four choice test before the implant was 46.9% and after 24 months of device use, this value went up to 86.1% in the vowels recognition test, the average difference was 45.13% to 83.13% and the sentences recognition test together in closed and open settings was 19.3% to 60.6% and 1.08% to 20.47% respectively. All patients, although with mixed results, achieved statistical improvement in all speech tests that were employed.	t	\N
21545766	This paper presents longitudinal case studies of children who received (cochlear implants) CIs and a controlled sample of children with normal hearing (NH). Phoneme discrimination (i.e., /sa-ma/, /a-i/, /a-u/, /u-i/, /ta-da/, /pa-ka/) was assessed prior to receiving CIs and monthly for 3 mo following CI activation. Case studies. Three cochlear implant recipients and seven NH control participants were recruited through the University of Colorado Hospital and the University of Colorado, Boulder. The study utilized the visual reinforcement audiometry and interactive play assessment of speech pattern contrasts (VRASPAC) algorithm. A comparison of scoring was conducted using Cohen's kappa to determine interrater reliability. Findings from this study revealed that CI recipients could discriminate at least three out of five phoneme contrasts at mastery level (≥90%) by 2 mo of device use. None of the CI recipients reached mastery prior to implantation. Following 3 mo of CI use there was no difference in contrast discrimination performance between the CI users and their NH age-matched peers (with the exception of /pa-ka/ for one CI user. The CI users in this case study, who were implanted between 12 and 16 mo of age, were able to master the phoneme contrasts regardless of bilateral or unilateral CI, socioeconomic status, or language spoken at home.	t	\N
21552274	Responses of neurons that integrate multiple sensory inputs are traditionally characterized in terms of a set of empirical principles. However, a simple computational framework that accounts for these empirical features of multisensory integration has not been established. We propose that divisive normalization, acting at the stage of multisensory integration, can account for many of the empirical principles of multisensory integration shown by single neurons, such as the principle of inverse effectiveness and the spatial principle. This model, which uses a simple functional operation (normalization) for which there is considerable experimental support, also accounts for the recent observation that the mathematical rule by which multisensory neurons combine their inputs changes with cue reliability. The normalization model, which makes a strong testable prediction regarding cross-modal suppression, may therefore provide a simple unifying computational account of the important features of multisensory integration by neurons.	t	\N
21553997	Using a novel variant of dichotic selective listening, we examined the control of auditory selective attention. In our task, subjects had to respond selectively to one of two simultaneously presented auditory stimuli (number words), always spoken by a female and a male speaker, by performing a numerical size categorization. The gender of the task-relevant speaker could change, as indicated by a visual cue prior to auditory stimulus onset. Three experiments show clear performance costs with instructed attention switches. Experiment 2 varied the cuing interval to examine advance preparation for an attention switch. Experiment 3 additionally isolated auditory switch costs from visual cue priming by using two cues for each gender, so that gender repetition could be indicated by a changed cue. Experiment 2 showed that switch costs decreased with prolonged cuing intervals, but Experiment 3 revealed that preparation did not affect auditory switch costs but only visual cue priming. Moreover, incongruent numerical categories in competing auditory stimuli produced interference and substantially increased error rates, suggesting continued processing of task-relevant information that often leads to responding to the incorrect auditory source. Together, the data show clear limitations in advance preparation of auditory attention switches and suggest a considerable degree of inertia in intentional control of auditory selection criteria.	t	\N
21559468	In humans, emotions from music serve important communicative roles. Despite a growing interest in the neural basis of music perception, action and emotion, the majority of previous studies in this area have focused on the auditory aspects of music performances. Here we investigate how the brain processes the emotions elicited by audiovisual music performances. We used event-related functional magnetic resonance imaging, and in Experiment 1 we defined the areas responding to audiovisual (musician's movements with music), visual (musician's movements only), and auditory emotional (music only) displays. Subsequently a region of interest analysis was performed to examine if any of the areas detected in Experiment 1 showed greater activation for emotionally mismatching performances (combining the musician's movements with mismatching emotional sound) than for emotionally matching music performances (combining the musician's movements with matching emotional sound) as presented in Experiment 2 to the same participants. The insula and the left thalamus were found to respond consistently to visual, auditory and audiovisual emotional information and to have increased activation for emotionally mismatching displays in comparison with emotionally matching displays. In contrast, the right thalamus was found to respond to audiovisual emotional displays and to have similar activation for emotionally matching and mismatching displays. These results suggest that the insula and left thalamus have an active role in detecting emotional correspondence between auditory and visual information during music performances, whereas the right thalamus has a different role.	t	\N
21564122	We studied whether a multifeature mismatch negativity (MMN) paradigm using naturally produced speech stimuli is feasible for studies of auditory discrimination accuracy of adult participants. A naturally produced trisyllabic pseudoword was used in the paradigm, and MMNs were recorded to changes that were acoustic (changes in fundamental frequency or intensity) or potentially phonological (changes in vowel identity or vowel duration). All the different changes were presented in three different word segments (initial, middle, or final syllable). All changes elicited an MMN response, but the vowel duration change elicited a different response pattern than the other deviant types. Changes in vowel duration and identity also had an effect on MMN lateralization. Our results show that assessing speech sound discrimination of several features in word context is possible in a short recording time (30 min) with the multifeature paradigm.	t	\N
21568161	Although receptive priming has long been used as a way to examine lexical access in adults, few studies have applied this method to children and rarely in an auditory modality. We compared auditory associative priming in children and adults. A testing battery and a Lexical Decision (LD) task was administered to 42 adults and 27 children (8;1-10; 11 years-old) from Spain. They listened to Spanish word pairs (semantically related/unrelated word pairs and word-pseudoword pairs), and tone pairs. Then participants pressed one key for word pairs, and another for pairs with a word and a pseudoword. They also had to press the two keys alternatively for tone pairs as a basic auditory control. Both groups of participants, children and adults, exhibited semantic priming, with significantly faster Reaction Times (RTs) to semantically related word pairs than to unrelated pairs and to the two word-pseudoword sets. The priming effect was twice as large in the adults compared to children, and the children (not the adults) were significantly slower in their response to word-pseudoword pairs than to the unrelated word pairs. Moreover, accuracy was somewhat higher in adults than children for each word pair type, but especially in the word-pseudoword pairs. As expected, children were significantly slower than adults in the RTs for all stimulus types, and their RTs decreased significantly from 8 to 10 years of age and they also decreased in relation to some of their language abilities development (e.g., relative clauses comprehension). In both age groups, the Reaction Time average for tone pairs was lower than for speech pairs, but only all adults obtained 100% accuracy (which was slightly lower in children). Auditory processing and semantic networks are still developing in 8-10 year old children.	t	\N
21568376	The corruption of intonation contours has detrimental effects on sentence-based speech recognition in normal-hearing listeners Binns and Culling [(2007). J. Acoust. Soc. Am. 122, 1765-1776]. This paper examines whether this finding also applies to cochlear implant (CI) recipients. The subjects' F0-discrimination and speech perception in the presence of noise were measured, using sentences with regular and inverted F0-contours. The results revealed that speech recognition for regular contours was significantly better than for inverted contours. This difference was related to the subjects' F0-discrimination providing further evidence that the perception of intonation patterns is important for the CI-mediated speech recognition in noise.	t	\N
21568411	This study examined the time course of cochlear suppression using a tone-burst suppressor to measure decrement of distortion-product otoacoustic emissions (DPOAEs). Seven normal-hearing subjects with ages ranging from 19 to 28 yr participated in the study. Each subject had audiometric thresholds ≤ 15 dB HL [re ANSI (2004) Specifications for Audiometers] for standard octave and inter-octave frequencies from 0.25 to 8 kHz. DPOAEs were elicited by primary tones with f(2) = 4.0 kHz and f(1) = 3.333 kHz (f(2)/f(1) = 1.2). For the f(2), L(2) combination, suppression was measured for three suppressor frequencies: One suppressor below f(2) (3.834 kHz) and two above f(2) (4.166 and 4.282 kHz) at three levels (55, 60, and 65 dB SPL). DPOAE decrement as a function of L(3) for the tone-burst suppressor was similar to decrements obtained with longer duration suppressors. Onset- and setoff- latencies were ≤ 4 ms, in agreement with previous physiological findings in auditory-nerve fiber studies that suggest suppression results from a nearly instantaneous compression of the waveform. Persistence of suppression was absent for the below-frequency suppressor (f(3) = 3.834 kHz) and was ≤ 3 ms for the two above-frequency suppressors (f(3) = 4.166 and 4.282 kHz).	t	\N
21568423	The octave or Deutsch illusion occurs when two tones, separated by about one octave, are presented simultaneously but alternating between ears, such that when the low tone is presented to the left ear the high tone is presented to the right ear and vice versa. Most subjects hear a single tone that alternates both between ears and in pitch; i.e., they hear a low pitched tone in one ear alternating with a high pitched tone in the other ear. The present study examined whether the illusion can be elicited by aperiodic signals consisting of low-frequency band-pass filtered noises with overlapping spectra. The amount of spectral overlap was held constant, but the high- and low-frequency content of the signals was systematically varied. The majority of subjects perceived an auditory illusion in terms of a dominant ear for pitch and lateralization by frequency, as proposed by Deutsch [(1975a) Sci. Am. 233, 92-104]. Furthermore, the salience of the illusion increased as the high frequency of the content in the signal increased. Since no harmonics were present in the stimuli, it is highly unlikely that this illusion is perceived on the basis of binaural diplacusis or harmonic binaural fusion.	t	\N
21569617	After a prolonged exposure to a paired presentation of different types of signals (e.g., color and motion), one of the signals (color) becomes a driver for the other signal (motion). This phenomenon, which is known as contingent motion aftereffect, indicates that the brain can establish new neural representations even in the adult's brain. However, contingent motion aftereffect has been reported only in visual or auditory domain. Here, we demonstrate that a visual motion aftereffect can be contingent on a specific sound. Dynamic random dots moving in an alternating right or left direction were presented to the participants. Each direction of motion was accompanied by an auditory tone of a unique and specific frequency. After a 3-minutes exposure, the tones began to exert marked influence on the visual motion perception, and the percentage of dots required to trigger motion perception systematically changed depending on the tones. Furthermore, this effect lasted for at least 2 days. These results indicate that a new neural representation can be rapidly established between auditory and visual modalities.	t	\N
21598839	This study examined the relationship between proactive learning in hypnosis, post-hypnotic suggestion, and academic performance. Participants (N = 56) were randomly assigned to a control group or a treatment group. The treatment group was hypnotized and read a passage while in hypnosis. Concurrently, they were given a post-hypnotic suggestion, which attempted to aid recognition and performance on a test immediately following the hypnosis session. Both groups completed a multiple-choice test based on the aforementioned passage. An analysis of covariance discerned the effect of proactive learning and post-hypnotic suggestion on test performance, while controlling for the variance introduced by scholastic aptitude as measured by the ACT. Results indicated that the hypnosis sessions predicted significantly impaired test performance.	t	\N
21599613	To develop and evaluate the Mandarin speech signal content on the acceptable noise level (ANL) test in listeners with normal hearing in mainland China. The Mandarin ANL tests were conducted using three different sets of Mandarin running speech materials which were chosen from textbooks for primary school, secondary school, and high school, respectively. For each discourse, two ANL measurements were obtained and averaged for each experimental condition using ANL test procedures. Thirty-one normal-hearing listeners participated in this study. There were significant differences for ANLs among the normal-hearing listeners, but no differences were found for MCLs and ANLs for the three sets of test materials. The Pearson correlations suggested significant correlations between MCL and ANL among the three test materials; also the results showed that the correlation coefficient between MCL-ANL of the primary material was much better than other two materials. (1) The contents of different Mandarin running speeches may not affect the acceptable noise level in Mandarin normal-hearing listeners; (2) The running speech selected from the primary school ought to be used as the Mandarin acceptable noise level test material to evaluate the outcomes of hearing aid fitting.	t	\N
21604474	To enhance speech recognition in noise, as well as tone recognition, we presented a new kind of speech coding strategy, called one-octave wavelet transform zero-crossing stimulation (WTZS), for cochlear implants based on amplitude and frequency modulation. We selected 15 volunteers with normal hearing ability to carry out hearing simulation experiments by picking up the amplitude (amplitude modulation, AM), zero-crossings (frequency modulation, FM) and gradient parameters from processed speech signal in the domain of one-octave wavelet transform to synthesize the stimulating pulstile series. The experimental results demonstrated that the phonetic recognition in quiet surroundings with amplitude modulation only strategy (CIS) is similar to that of amplitude and frequency modulations strategies (FAME and WTZS), while the tone perception of CIS is inferior to that of FAME and WTZS strategies. However, in noisy environment, the phonetic recognition, tone perception, as well as sentence recognition of WTZS strategy are better than those of CIS and FAME strategies. WTZS strategy, utilizing amplitude (AM), zero-crossings (FM) and gradient parameters to synthesize stimulus, can enhance the phonetic and tonal language recognition in noise environment effectively, and could be used in cochlear implant system for speech processor design after arithmetic optimization.	t	\N
21613467	There is an increasing interest to integrate electrophysiological and hemodynamic measures for characterizing spatial and temporal aspects of cortical processing. However, an informative combination of responses that have markedly different sensitivities to the underlying neural activity is not straightforward, especially in complex cognitive tasks. Here, we used parametric stimulus manipulation in magnetoencephalography (MEG) and functional magnetic resonance imaging (fMRI) recordings on the same subjects, to study effects of noise on processing of spoken words and environmental sounds. The added noise influenced MEG response strengths in the bilateral supratemporal auditory cortex, at different times for the different stimulus types. Specifically for spoken words, the effect of noise on the electrophysiological response was remarkably nonlinear. Therefore, we used the single-subject MEG responses to construct parametrization for fMRI data analysis and obtained notably higher sensitivity than with conventional stimulus-based parametrization. fMRI results showed that partly different temporal areas were involved in noise-sensitive processing of words and environmental sounds. These results indicate that cortical processing of sounds in background noise is stimulus specific in both timing and location and provide a new functionally meaningful platform for combining information obtained with electrophysiological and hemodynamic measures of brain function.	t	\N
21613485	Growing evidence from electrophysiological data in animal and human studies suggests that multisensory interaction is not exclusively a higher-order process, but also takes place in primary sensory cortices. Such early multisensory interaction is thought to be mediated by means of phase resetting. The presentation of a stimulus to one sensory modality resets the phase of ongoing oscillations in another modality such that processing in the latter modality is modulated. In humans, evidence for such a mechanism is still sparse. In the current study, the influence of an auditory stimulus on visual processing was investigated by measuring the electroencephalogram (EEG) and behavioral responses of humans to visual, auditory, and audiovisual stimulation with varying stimulus-onset asynchrony (SOA). We observed three distinct oscillatory EEG responses in our data. An initial gamma-band response around 50 Hz was followed by a beta-band response around 25 Hz, and a theta response around 6 Hz. The latter was enhanced in response to cross-modal stimuli as compared to either unimodal stimuli. Interestingly, the beta response to unimodal auditory stimuli was dominant in electrodes over visual areas. The SOA between auditory and visual stimuli--albeit not consciously perceived--had a modulatory impact on the multisensory evoked beta-band responses; i.e., the amplitude depended on SOA in a sinusoidal fashion, suggesting a phase reset. These findings further support the notion that parameters of brain oscillations such as amplitude and phase are essential predictors of subsequent brain responses and might be one of the mechanisms underlying multisensory integration.	t	\N
21615286	Research into speech perception by nonhuman animals can be crucially informative in assessing whether specific perceptual phenomena in humans have evolved to decode speech, or reflect more general traits. Birds share with humans not only the capacity to use complex vocalizations for communication but also many characteristics of its underlying developmental and mechanistic processes; thus, birds are a particularly interesting group for comparative study. This review first discusses commonalities between birds and humans in perception of speech sounds. Several psychoacoustic studies have shown striking parallels in seemingly speech-specific perceptual phenomena, such as categorical perception of voice-onset-time variation, categorization of consonants that lack phonetic invariance, and compensation for coarticulation. Such findings are often regarded as evidence for the idea that the objects of human speech perception are auditory or acoustic events rather than articulations. Next, I highlight recent research on the production side of avian communication that has revealed the existence of vocal tract filtering and articulation in bird species-specific vocalization, which has traditionally been considered a hallmark of human speech production. Together, findings in birds show that many of characteristics of human speech perception are not uniquely human but also that a comparative approach to the question of what are the objects of perception--articulatory or auditory events--requires careful consideration of species-specific vocal production mechanisms.	t	\N
21616132	Presbycusis, or age-related hearing loss is a growing problem as the general population ages. In this longitudinal study, the influence of noise or styrene exposure on presbycusis was investigated in Brown Norway rats. Animals were exposed at 6 months of age, either to a band noise centered at 8 kHz at a Lex,8h = 85 dB (86.2 dB SPL for 6 h), or to 300 ppm of styrene for 6 h per day, five days per week, for four weeks. Cubic distortion product otoacoustic emissions (2f1-f2 DPOAEs) were used to test the capacity of the auditory receptor over the lifespan of the animals. 2f1-f2DPOAE measurements are easy to implement and efficiently track the age-related deterioration of mid- and high-frequencies. They are good indicators of temporary auditory threshold shift, especially with a level of primaries close to 60 dB SPL. Post-exposure hearing defects are best identified using moderate, rather than high, levels of primaries. Like many aging humans, aging rats lose sensitivity to high-frequencies faster than to medium-frequencies. Although the results obtained with the styrene exposure were not entirely conclusive, histopathological data showed the presbycusis process to be enhanced. Noise-exposed rats exhibit a loss of spiral ganglion cells from 12 months and a 7 dB drop in 2f1-f2DPOAEs at 24 months, indicating that even moderate-intensity noise can accelerate the presbycusis process. Even though the results obtained with the styrene exposure are less conclusive, the histopathological data show an enhancement of the presbycusis process.	t	\N
21630061	Since air-conducted (AC) and clinical (mastoid) bone-conducted (BC) sounds interact in the cochlea (e.g. pitch, cancellation, masking, beats), it has been thought that both AC and BC stimulations lead to a mechanical wave in the cochlea. However, there are also "non-osseous" forms of BC, i.e. auditory sensation produced when the clinical bone vibrator is applied to "non-osseous" soft tissue sites. In the present study, such "non-osseous" sites were identified (e.g. eye, cheek, neck) and they interacted with AC and osseous BC (pitch matching, beats, masking), indicating that all of these forms of auditory stimulation converge in the cochlea, producing the same pattern of mechanical activity, leading to their interactions.	t	\N
21632920	The integration of multisensory information has been shown to be guided by spatial and temporal proximity, as well as to be influenced by attention. Here we used neural measures of the multisensory spread of attention to investigate the spatial and temporal linking of synchronous versus near-synchronous auditory and visual events. Human participants attended selectively to one of two lateralized visual-stimulus streams while task-irrelevant tones were presented centrally. Electrophysiological measures of brain activity showed that tones occurring simultaneously or delayed by 100 ms were temporally linked to an attended visual stimulus, as reflected by robust cross-modal spreading-of-attention activity, but not when delayed by 300 ms. The neural data also indicated a ventriloquist-like spatial linking of the auditory to the attended visual stimuli, but only when occurring simultaneously. These neurophysiological results thus provide unique insight into the temporal and spatial principles of multisensory feature integration and the fundamental role attention plays in such integration.	t	\N
21632945	Contemporary models of the neural system that supports reading propose that activity in a ventral occipitotemporal area (vOT) drives activity in higher-order language areas, for example, those in the posterior superior temporal sulcus (pSTS) and anterior superior temporal sulcus (aSTS). We used fMRI with dynamic causal modeling (DCM) to investigate evidence for other routes from visual cortex to the left temporal lobe language areas. First we identified activations in posterior inferior occipital (iO) and vOT areas that were more activated for silent reading than listening to words and sentences; and in pSTS and aSTS areas that were commonly activated for reading relative to false-fonts and listening to words relative to reversed words. Second, in three different DCM analyses, we tested whether visual processing of words modulates activity from the following: (1) iO→vOT, iO→pSTS, both, or neither; (2) vOT→pSTS, iO→pSTS, both or neither; and (3) pSTS→aSTS, vOT→aSTS, both, or neither. We found that reading words increased connectivity (1) from iO to both pSTS and vOT; (2) to pSTS from both iO and vOT; and (3) to aSTS from both vOT and pSTS. These results highlight three potential processing streams in the occipitotemporal cortex: iO→pSTS→aSTS; iO→vOT→aSTS; and iO→vOT→pSTS→aSTS. We discuss these results in terms of cognitive models of reading and propose that efficient reading relies on the integrity of all these pathways.	t	\N
21637344	Previous cue integration studies have examined continuous perceptual dimensions (e.g., size) and have shown that human cue integration is well described by a normative model in which cues are weighted in proportion to their sensory reliability, as estimated from single-cue performance. However, this normative model may not be applicable to categorical perceptual dimensions (e.g., phonemes). In tasks defined over categorical perceptual dimensions, optimal cue weights should depend not only on the sensory variance affecting the perception of each cue but also on the environmental variance inherent in each task-relevant category. Here, we present a computational and experimental investigation of cue integration in a categorical audio-visual (articulatory) speech perception task. Our results show that human performance during audio-visual phonemic labeling is qualitatively consistent with the behavior of a Bayes-optimal observer. Specifically, we show that the participants in our task are sensitive, on a trial-by-trial basis, to the sensory uncertainty associated with the auditory and visual cues, during phonemic categorization. In addition, we show that while sensory uncertainty is a significant factor in determining cue weights, it is not the only one and participants' performance is consistent with an optimal model in which environmental, within category variability also plays a role in determining cue weights. Furthermore, we show that in our task, the sensory variability affecting the visual modality during cue-combination is not well estimated from single-cue performance, but can be estimated from multi-cue performance. The findings and computational principles described here represent a principled first step towards characterizing the mechanisms underlying human cue integration in categorical tasks.	t	\N
21638105	In two experiments, we examined the impact of the degree of match between sequential auditory perceptual organization processes and the demands of a short-term memory task (memory for order vs. item information). When a spoken sequence of digits was presented so as to promote its perceptual partitioning into two distinct streams by conveying it in alternating female (F) and male (M) voices (FMFMFMFM)--thereby disturbing the perception of true temporal order--recall of item order was greatly impaired (as compared to recall of item identity). Moreover, an order error type consistent with the formation of voice-based streams was committed more quickly in the alternating-voice condition (Exp. 1). In contrast, when the perceptual organization of the sequence mapped well onto an optimal two-group serial rehearsal strategy--by presenting the two voices in discrete clusters (FFFFMMMM)--order, but not item, recall was enhanced (Exp. 2). The results are consistent with the view that the degree of compatibility between perceptual and deliberate sequencing processes is a key determinant of serial short-term memory performance. Alternative accounts of talker variability effects in short-term memory, based on the concept of a dedicated phonological short-term store and a capacity-limited focus of attention, are also reviewed.	t	\N
21646423	To determine the rate of word learning for children with hearing loss (HL) in quiet and in noise compared to normal-hearing (NH) peers. The effects of digital noise reduction (DNR) were examined for children with HL. Forty-one children with NH and 26 children with HL were grouped by age (8-9 years and 11-12 years). The children learned novel words associated with novel objects through a process of trial and error. Functions relating performance across trials were calculated for each child in each listening condition and were compared. Significant effects were observed for age (older > younger) in the children with NH and listening condition (quiet > noise) in the children with HL. Significant effects of hearing status were also observed across groups (NH > HL), indicating that the children with HL required more trials to learn the new words. However, word learning improved significantly in noise with the use of DNR for the older but not for the younger children with HL. Hearing aid history and signal-to-noise ratio did not contribute to performance. Word learning was significantly reduced in younger children, in noise, and in the presence of hearing loss. Age-related benefits of DNR were apparent for children over 10 years of age.	t	\N
21646931	The perception of pitch has recently gained attention. At present, clinical audiologic tests to assess this are hardly available. This article reports on the development of a clinical test using harmonic intonation (HI) and disharmonic intonation (DI). Prospective collection of normative data and pilot study in hearing-impaired subjects. Tertiary referral center. Normative data were collected from 90 normal-hearing subjects recruited from 3 different language backgrounds. The pilot study was conducted on 18 hearing-impaired individuals who were selected into 3 pathologic groups: high-frequency hearing loss (HF), low-frequency hearing loss (LF), and cochlear implant users (CI). Normative data collection and exploratory diagnostics by means of the newly constructed HI/DI tests using intonation patterns to find the just noticeable difference (JND) for pitch discrimination in low-frequency harmonic complex sounds presented in a same-different task. JND for pitch discrimination using HI/DI tests in the hearing population and pathologic groups. Normative data are presented in 5 parameter statistics and box-and-whisker plots showing median JNDs of 2 (HI) and 3 Hz (DI). The results on both tests are statistically abnormal in LF and CI subjects, whereas they are not significantly abnormal in the HF group. The HI and DI tests allow the clinical assessment of low-frequency pitch perception. The data obtained in this study define the normal zone for both tests. Preliminary results indicate possible abnormal TFS perception in some hearing-impaired subjects.	t	\N
21664147	Homonyms, i.e. ambiguous words like 'score', have different meanings in different contexts. Previous research indicates that all potential meanings of a homonym are first accessed in parallel before one of the meanings is selected in a competitive race. If these processes are automatic, these processes of selection should even be observed when homonyms are shown subliminally. This study measured the time course of subliminal and supraliminal priming by homonyms with a frequent (dominant) and a rare (subordinate) meaning in a neutral context, using a lexical decision task. In the subliminal condition, priming across prime-target asynchronies ranging from 100 ms to 1.5 s indicated that the dominant meaning of homonyms was facilitated and the subordinate meaning was inhibited. This indicates that selection of meaning was much faster with subliminal presentation than with supraliminal presentation. Awareness of a prime might decelerate an otherwise rapid selection process.	t	\N
21666515	The cortical processing of musical sounds is influenced by listeners' sensitivity to the structural regularities of music, and particularly by sensitivity to harmonic relationships. As subcortical and cortical processing dynamically interact to shape auditory perception in an experience-dependent manner, we asked whether subcortical processing of musical sounds would be sensitive to harmonic relationships. We examined auditory brainstem responses to a chord that was preceded either by a harmonically related chord, by an unrelated chord, or was repeated. We observed higher spectral response magnitudes in the related than in the unrelated or repeated conditions, for both musician and nonmusician listeners. Our results suggest that listeners' implicit knowledge of musical regularities influences subcortical auditory processing.	t	\N
21669238	The fast detection of novel or deviant stimuli is a striking property of the auditory processing which reflects basic organizational principles of the auditory system and at the same time is of high practical significance. In human electrophysiology, deviance detection has been related to the occurrence of the mismatch negativity (MMN)--a component of the event-related potential (ERP) evoked 100 to 250 ms after the occurrence of a rare irregular sound. Recently, it has been shown in animal studies that a considerable portion of neurons in the auditory pathway exhibits a property called stimulus-specific adaptation enabling them to encode inter-sound relationships and to discharge at higher rates to rare changes in the acoustic stimulation. These neural responses have been linked to the deviant-evoked potential measured at the human scalp, but such responses occur at lower levels anatomically (e.g. the primary auditory cortex as well as the inferior colliculi) and are elicited earlier (20-30 ms after sound onset) in comparison to MMN. Further, they are not considerable enough in size to be interpreted as a direct neural correlate of the MMN. We review here a series of recent findings that provides a first step toward filling this gap between animal and human recordings by showing that comparably early modulations due to a sound's deviancy can be observed in humans, particularly in the middle-latency portion of the ERP within the first 50 ms after sound onset. The existence of those early indices of deviance detection preceding the well-studied MMN component strongly supports the idea that the encoding of regularities and the detection of violations is a basic principle of human auditory processing acting on multiple levels. This sustains the notion of a hierarchically organized novelty and deviance detection system in the human auditory system.	t	\N
21680846	The attentional effects triggered by emotional stimuli in humans have been substantially investigated, but little is known about the impact of affective valence on the processing of meaning. Here, we used a cross-modal priming paradigm involving visually presented adjective-noun dyads and environmental sounds of controlled affective valence to test the contributions of conceptual relatedness and emotional congruence to priming. Participants undergoing event-related potential recording indicated whether target environmental sounds were related in meaning to adjective-noun dyads presented as primes. We tested spontaneous emotional priming by manipulating the congruence between the affective valence of the adjective in the prime and that of the sound. While the N400 was significantly reduced in amplitude by both conceptual relatedness and emotional congruence, there was no interaction between the 2 factors. The same pattern of results was found when participants judged the emotional congruence between environmental sounds and adjective-noun dyads. These results support the hypothesis that conceptual and emotional processes are functionally independent regardless of the specific cognitive focus of the comprehender.	t	\N
21682421	Cross-channel envelope correlations are hypothesized to influence speech intelligibility, particularly in adverse conditions. Acoustic analyses suggest speech envelope correlations differ for syllabic and phonemic ranges of modulation frequency. The influence of cochlear filtering was examined here by predicting cross-channel envelope correlations in different speech modulation ranges for normal and impaired auditory-nerve (AN) responses. Neural cross-correlation coefficients quantified across-fiber envelope coding in syllabic (0-5 Hz), phonemic (5-64 Hz), and periodicity (64-300 Hz) modulation ranges. Spike trains were generated from a physiologically based AN model. Correlations were also computed using the model with selective hair-cell damage. Neural predictions revealed that envelope cross-correlation decreased with increased characteristic-frequency separation for all modulation ranges (with greater syllabic-envelope correlation than phonemic or periodicity). Syllabic envelope was highly correlated across many spectral channels, whereas phonemic and periodicity envelopes were correlated mainly between adjacent channels. Outer-hair-cell impairment increased the degree of cross-channel correlation for phonemic and periodicity ranges for speech in quiet and in noise, thereby reducing the number of independent neural information channels for envelope coding. In contrast, outer-hair-cell impairment was predicted to decrease cross-channel correlation for syllabic envelopes in noise, which may partially account for the reduced ability of hearing-impaired listeners to segregate speech in complex backgrounds.	t	\N
21683500	Responses to threat occur via two known independent processing routes. We propose that early, reflexive processing is predominantly tuned to the detection of congruent combinations of facial cues that signal threat, whereas later, reflective processing is predominantly tuned to incongruent combinations of threat. To test this prediction, we examined responses to threat-gaze expression pairs (anger versus fear expression by direct versus averted gaze). We report on two functional magnetic resonance imaging (fMRI) studies, one employing prolonged presentations (2s) of threat-gaze pairs to allow for reflective processing (Study 1), and one employing severely restricted (33 ms), backward masked presentations of threat-gaze pairs to isolate reflexive neural responding (Study 2). Our findings offer initial support for the conclusion that early, reflexive responses to threat are predominantly tuned to congruent threat-gaze pairings, whereas later reflective responses are predominantly tuned to ambiguous threat-gaze pairings. These findings highlight a distinct dual function in threat perception.	t	\N
21683995	Electrophysiological studies investigating similarities between music and language perception have relied exclusively on the signal averaging technique, which does not adequately represent oscillatory aspects of electrical brain activity that are relevant for higher cognition. The current study investigated the patterns of brain oscillations during simultaneous processing of music and language using visually presented sentences and auditorily presented chord sequences. Music-syntactically regular or irregular chord functions were presented in sync with syntactically or semantically correct or incorrect words. Irregular chord functions (presented simultaneously with a syntactically correct word) produced an early (150-250 ms) spectral power decrease over anterior frontal regions in the theta band (5-7 Hz) and a late (350-700 ms) power increase in both the delta and the theta band (2-7 Hz) over parietal regions. Syntactically incorrect words (presented simultaneously with a regular chord) elicited a similar late power increase in delta-theta band over parietal sites, but no early effect. Interestingly, the late effect was significantly diminished when the language-syntactic and music-syntactic irregularities occurred at the same time. Further, the presence of a semantic violation occurring simultaneously with regular chords produced a significant increase in later delta-theta power at posterior regions; this effect was marginally decreased when the identical semantic violation occurred simultaneously with a music syntactical violation. Altogether, these results show that low frequency oscillatory networks get activated during the syntactic processing of both music and language, and further, these networks may possibly be shared.	t	\N
21689048	The purpose of this study was to develop a music quality rating test battery (MQRTB) and pilot test it by comparing appraisal ratings from cochlear implant (CI) recipients using the fine-structure processing (FSP) and high-definition continuous interleaved sampling (HDCIS) speech processing strategies. The development of the MQRTB involved three stages: (1) Selection of test items for the MQRTB; (2) Verification of its length and complexity with normally-hearing individuals; and (3) Pilot testing with CI recipients. Part 1 involved 65 adult listeners, Part 2 involved 10 normally-hearing adults, and Part 3 involved five adult MED-EL CI recipients. The MQRTB consisted of ten songs, with ratings made on scales assessing pleasantness, naturalness, richness, fullness, sharpness, and roughness. Results of the pilot study, which compared FSP and HDCIS for music, indicated that acclimatization to a strategy had a significant effect on ratings (p < 0.05). When acclimatized to FSP, the group rated FSP as closer to 'exactly as I want it to sound' than HDCIS (p < 0.05), and that HDCIS sounded significantly sharper and rougher than FSP. However when acclimatized to HDCIS, there were no significant differences between ratings. There was no effect of song familiarity or genre on ratings. Overall the results suggest that the use of FSP as the default strategy for MED-EL recipients would have a positive effect on music appreciation, and that the MQRTB is an effective tool for assessing music sound quality.	t	\N
21692141	The aim of this functional magnetic resonance imaging (fMRI) study was to identify human brain areas that are sensitive to the direction of auditory motion. Such directional sensitivity was assessed in a hypothesis-free manner by analyzing fMRI response patterns across the entire brain volume using a spherical-searchlight approach. In addition, we assessed directional sensitivity in three predefined brain areas that have been associated with auditory motion perception in previous neuroimaging studies. These were the primary auditory cortex, the planum temporale and the visual motion complex (hMT/V5+). Our whole-brain analysis revealed that the direction of sound-source movement could be decoded from fMRI response patterns in the right auditory cortex and in a high-level visual area located in the right lateral occipital cortex. Our region-of-interest-based analysis showed that the decoding of the direction of auditory motion was most reliable with activation patterns of the left and right planum temporale. Auditory motion direction could not be decoded from activation patterns in hMT/V5+. These findings provide further evidence for the planum temporale playing a central role in supporting auditory motion perception. In addition, our findings suggest a cross-modal transfer of directional information to high-level visual cortex in healthy humans.	t	\N
21701947	Phenomena in a variety of verbal tasks--for example, masked priming, lexical decision, and word naming--are typically explained in terms of similarity between word-forms. Despite the apparent commonalities between these sets of phenomena, the representations and similarity measures used to account for them are not often related. To show how this gap might be bridged, we build on the work of Hannagan, Dupoux, and Christophe, Cognitive Science 35:79-118, (2011) to explore several methods of representing visual word-forms using holographic reduced representations and to evaluate them on their ability to account for a wide range of effects in masked form priming, as well as data from lexical decision and word naming. A representation that assumes that word-internal letter groups are encoded relative to word-terminal letter groups is found to predict qualitative patterns in masked priming, as well as lexical decision and naming latencies. We then show how this representation can be integrated with the BEAGLE model of lexical semantics (Jones & Mewhort, Psychological Review 114:1-37, 2007) to enable the model to encompass a wider range of verbal tasks.	t	\N
21707205	In reaction time research, there has been an increasing appreciation that response-initiation processes are sensitive to recent experience and, in particular, the difficulty of previous trials. From this perspective, the authors propose an explanation for a perplexing property of masked priming: Although primes are not consciously identified, facilitation of target processing by a related prime is magnified in a block containing a high proportion of related primes and a low proportion of unrelated primes relative to a block containing the opposite mix (Bodner & Masson, 2001). In the present study, this phenomenon is explored with a parity (even/odd) decision task in which a prime (e.g., 2) precedes a target that can be either congruent (e.g., 4) or incongruent (e.g., 3). It is shown that the effect of congruence proportion with masked primes cannot be explained in terms of the blockwise prime-target contingency. Specifically, with masked primes, there is no congruency disadvantage in a block containing a high proportion of incongruent primes, but there is a congruency advantage when the block contains an equal proportion of congruent and incongruent primes. In qualitative contrast, visible primes are sensitive to the blockwise prime-target contingency. The authors explain the relatedness proportion effect found with masked primes in terms of a model according to which response-initiation processes adapt to the statistical structure of the environment, specifically the difficulty of recent trials. This account is supported with an analysis at the level of individual trials using the linear mixed effects model.	t	\N
21707266	We report a single case study of a synesthete (PS) who has complex visual experiences from sounds, including human voices. Different vowel sounds from different speakers and modified to be of different pitch (f0) were presented to PS and controls who were asked to draw an (abstract) visual image of the sound noting colors, sizes, and locations. PS tended to be more consistent over time than controls. For both PS and controls, the pitch of the vowel influenced the choice of luminance (higher pitch being lighter) and vertical position (higher pitch being higher in space). However, the gender of the speaker influenced the size of the 'image' independently of pitch (vowels from males being larger).	t	\N
21710713	To investigate the contralateral suppressions of transient evoked otoacoustic emissions (TEOAE) in diabetes mellitus patients with normal hearing. The pure tone audiometry, acoustic immittance and TEOAE tests were performed in 30 diabetes mellitus patients with normal hearing and 30 healthy controls. The efferent system functions were evaluated by contralateral suppression of TEOAE. There were no significant differences of pure tone thresholds and amplitudes of TEOAE between the two groups. The contralateral suppressions of TEOAE in diabetes mellitus patients were significantly lower than that in controls (P<0.05 at 2000 and 4000 Hz respectively). The nerve functions of central nerve system can be damaged in diabetes mellitus patients with normal hearing.	t	\N
21714708	We investigated if linguistic complexity contributes to the variation of the speech reception threshold in noise (SRTN) and thus should be employed as an additional design criterion in sentence tests used for audiometry. Three test lists were established with sentences from the Göttingen sentence test ( Kollmeier & Wesselkamp, 1997 ). One list contained linguistically simple sentences, the other two lists contained sentences with two types of linguistic complexity. For each listener the SRTN was determined for each list. Younger and older listeners with normal hearing and older listeners with hearing impairment were tested. Younger listeners with normal hearing showed significantly worse SRTNs on the complex lists than on the simple list. This difference could not be found for either of the older groups. The effect of linguistic complexity on speech recognition seems to depend on age and/or hearing status. Hence, pending further research, linguistic complexity seems less relevant as a sentence test design criterion for clinical-audiological purposes, but we argue that a test with larger variation in linguistic complexity across sentences might show a relation between linguistic complexity and speech recognition even in a clinical population.	t	\N
21719027	Web based speech training for dysarthric speakers, such as E-learning based Speech Therapy (EST), puts considerable demands on auditory discrimination abilities. To discuss the development and the evaluation of an auditory discrimination test (ADT) for the assessment of auditory speech discrimination skills in Dutch adult dysarthric speakers as a prelude to EST. Five ADT subtests were developed, each addressing a vital speech dimension in speech therapy: articulation (segmental elements), intensity, overall pitch, speech rate and intonation. A healthy control group of 36 participants performed a 'same-different task' in each subtest. ADT items yielding scores of at least 80% but below 100% correctly responding healthy controls were considered sensitive to diminished auditory discrimination. Subsequently, the ADT was carried out by 14 neurological patients with dysarthric speech and 14 matched healthy controls. Score percentages, sensitivity indices and reaction times (ms) on only sensitive items were compared. The majority of the ADT items met the 'minimal 80% to below 100% criterion' in the healthy control group. The neurological participants performed lower on all outcome measures across all subtests than the healthy controls, although not all of these differences achieved statistical significance. The results of the healthy control group show that the majority of the ADT items meet our criterion for sensitivity to diminished auditory discrimination. The poorer performance of dysarthric patients across all subtests supports the sensitivity of the ADT. However, further research involving larger and more homogeneous groups of neurological patients is required. Readers will be encouraged to (1) identify potential factors that may hinder web based speech training and (2) estimate the value of assessing auditory discrimination skills as a vital condition for (web based) speech training in dysarthric patients.	t	\N
21723130	Human voices play a fundamental role in social communication, and areas of the adult "social brain" show specialization for processing voices and their emotional content (superior temporal sulcus, inferior prefrontal cortex, premotor cortical regions, amygdala, and insula). However, it is unclear when this specialization develops. Functional magnetic resonance (fMRI) studies suggest that the infant temporal cortex does not differentiate speech from music or backward speech, but a prior study with functional near-infrared spectroscopy revealed preferential activation for human voices in 7-month-olds, in a more posterior location of the temporal cortex than in adults. However, the brain networks involved in processing nonspeech human vocalizations in early development are still unknown. To address this issue, in the present fMRI study, 3- to 7-month-olds were presented with adult nonspeech vocalizations (emotionally neutral, emotionally positive, and emotionally negative) and nonvocal environmental sounds. Infants displayed significant differential activation in the anterior portion of the temporal cortex, similarly to adults. Moreover, sad vocalizations modulated the activity of brain regions involved in processing affective stimuli such as the orbitofrontal cortex and insula. These results suggest remarkably early functional specialization for processing human voice and negative emotions.	t	\N
21723376	Older human listeners demonstrate perceptual deficits in temporal processing even when audibility has been controlled. These age-related auditory deficits in temporal processing are thought to originate in the central auditory pathway. Precise temporal processing is necessary to detect and discriminate auditory cues such as modulation frequency, modulation depth and envelope shape which are critical for perception of speech and environmental sounds. This study aims to further understanding of temporal processing in aging using non-invasive electrophysiological measurements. Amplitude modulation following responses (AMFRs) and frequency modulation following responses (FMFRs) were recorded from aged (92-95-weeks old) and young (9-12-weeks old) Fischer-344 (F-344) rats for sinusoidally amplitude modulated (sAM) tones, sinusoidally frequency modulated (sFM) tones and ramped and damped amplitude modulation (AM) stimuli which differ in their envelope shapes. The modulation depth for the sAM and sFM stimuli and envelope shape for the ramped and damped stimuli were systematically varied. There was a monotonic decrease in AMFR and FMFR amplitudes with decreases in modulation depth across age for sAM and sFM stimuli. There was no significant difference between the response amplitudes of the young and aged animals for the largest modulation depths. However, a reduction in modulation depth resulted in a significant decrease in the response amplitudes and higher modulation detection thresholds for sAM and sFM stimuli with age. The aged animals showed significantly lower response amplitudes for ramped stimuli but not for damped stimuli. Cross correlating the responses with the ramped, symmetric, or damped stimulus envelopes revealed a decreased fidelity in encoding envelope shapes with age. These results indicate that age related temporal processing deficits become apparent only with reduced modulation depths or when discriminating envelope shapes. This has implications for psychophysical or diagnostic testing as well as for constraining potential cellular and network mechanisms responsible for these deficits.	t	\N
21724244	In a neuroimaging study focusing on young bilinguals, we explored the brains of bilingual and monolingual babies across two age groups (younger 4-6 months, older 10-12 months), using fNIRS in a new event-related design, as babies processed linguistic phonetic (Native English, Non-Native Hindi) and non-linguistic Tone stimuli. We found that phonetic processing in bilingual and monolingual babies is accomplished with the same language-specific brain areas classically observed in adults, including the left superior temporal gyrus (associated with phonetic processing) and the left inferior frontal cortex (associated with the search and retrieval of information about meanings, and syntactic and phonological patterning), with intriguing developmental timing differences: left superior temporal gyrus activation was observed early and remained stably active over time, while left inferior frontal cortex showed greater increase in neural activation in older babies notably at the precise age when babies' enter the universal first-word milestone, thus revealing a first-time focal brain correlate that may mediate a universal behavioral milestone in early human language acquisition. A difference was observed in the older bilingual babies' resilient neural and behavioral sensitivity to Non-Native phonetic contrasts at a time when monolingual babies can no longer make such discriminations. We advance the "Perceptual Wedge Hypothesis" as one possible explanation for how exposure to greater than one language may alter neural and language processing in ways that we suggest are advantageous to language users. The brains of bilinguals and multilinguals may provide the most powerful window into the full neural "extent and variability" that our human species' language processing brain areas could potentially achieve.	t	\N
21728463	Gestalt phenomena are often so powerful that mere demonstrations can confirm their existence, but Gestalts have proven hard to define and measure. Here we outline a theory of basic Gestalts (TBG) that defines Gestalts as emergent features (EFs). The logic relies on discovering wholes that are more discriminable than are the parts from which they are built. These wholes contain EFs that can act as basic features in human vision. As context is added to a visual stimulus, a hierarchy of EFs appears. Starting with a single dot and adding a second yields the first two potential EFs: the proximity (distance) and orientation (angle) between the two dots. A third dot introduces two more potential EFs: symmetry and linearity; a fourth dot produces surroundedness. This hierarchy may extend to collinearity, parallelism, closure, and more. We use the magnitude of Configural Superiority Effects to measure the salience of EFs on a common scale, potentially letting us compare the strengths of various grouping principles. TBG appears promising, with our initial experiments establishing and quantifying at least three basic EFs in human vision.	t	\N
21730860	Auditory temporal processing frequently appears more affected in old subjects than would be predicted by the degree of peripheral hearing loss, pointing to an age-dependent central processing deficit. In parallel, an age-dependent decline of inhibitory function has been demonstrated in the auditory pathway, suggesting a causal relationship between temporal processing and inhibition. Gabapentin has been specifically synthesized as a potential gamma-amino-butyric-acid (GABA) mimetic with the capability to cross the blood-brain barrier. Gabapentin treatment ameliorated tinnitus in a rat model and improved tinnitus annoyance in humans with acoustic trauma. Consequently, the present study evaluated the effect of gabapentin on auditory temporal processing in the gerbil model. Psychometric functions were collected for different test paradigms. (A) "Gap detection": The detection of a gap in the middle of a 800 msec broadband noise pulse was determined either at 15 or at 30 dB SL. (B) "Forward masking": The detection of a 20 msec probe stimulus following 2.5 msec after a 400 msec 40 dB SPL masker was determined with masker and probe frequency at 2.85 kHz. The effect of gabapentin was evaluated by collecting gap detection and forward masking functions before, during, and after treating gerbils with gabapentin doses of 115 or 350 mg/kg/day administered via drinking water. Data under different experimental conditions were collected for groups of 3 to 5 young (<2 years) and 6 to 10 old (>2 years) gerbils. Two-way analyses of variance for the factors age groups and treatment groups with subsequent pairwise comparisons for significant effects were used for the statistical evaluation of the data. For gap detection, mean thresholds were significantly increased in the group of old as compared with the young gerbils at 30 dB SL (young 2.0 msec; old 3.2 msec) and at 15 dB SL (young 2.9 msec; old 9.1 msec). Gabapentin had no significant effect on gap detection, and there was no significant interaction between age group and gabapentin treatment. Mean thresholds in the forward masking paradigm were significantly elevated in old (45.5 dB SPL) as compared with young (35.0 dB SPL) gerbils. Overall, gabapentin had no significant effect on masked thresholds; however, there was a significant interaction between treatment and age. Subsequent pairwise comparisons revealed no significant effect on masked thresholds in old gerbils but showed significantly elevated thresholds of young gerbils during 350 mg/kg gabapentin (38.3 dB SPL) compared with thresholds obtained in young gerbils before (32.3 dB SPL) and after (33.5 dB SPL) treatment. Gabapentin did not exert a therapeutic effect on impaired gap detection and forward masking in old gerbils. The lack of an effect of gabapentin on impaired auditory temporal processing in old gerbils and the finding of elevated masked thresholds in young gerbils can be reconciled with reports of only moderate GABAergic effects compared with other drugs (e.g., comparing elevation of GABA levels in the brain by gabapentin and vigabatrin) and effects due to binding of gabapentin to alpha-2-delta units of voltage-gated calcium channels.	t	\N
21732223	Increasing the target-field luminance aids detection for a simultaneously presented black target disc and a black masking annulus. At an intermediate interval separating the onset of the target from the mask, increasing the target-field luminance reduces target detection. This decrease in performance occurs with both temporal and spatial forced choice tasks. With a spatial forced choice, an observer's performance can fall below chance. We associate below-chance performance with a brightness reversal of the black target disc, such that the target disc appears brighter than its surround. The occurrence of brightness reversals follows from our model of the Broca-Sulzer effect, and nonmonotone masking functions result from a generalization of luminance summation.	t	\N
21736456	The aim of this study was to examine the influence of musical expertise in 9-year-old children on passive (as reflected by MMN) and active (as reflected by discrimination accuracy) processing of speech sounds. Musician and nonmusician children were presented with a sequence of syllables that included standards and deviants in vowel frequency, vowel duration, and VOT. Both the passive and the active processing of duration and VOT deviants were enhanced in musician compared with nonmusician children. Moreover, although no effect was found on the passive processing of frequency, active frequency discrimination was enhanced in musician children. These findings are discussed in terms of common processing of acoustic features in music and speech and of positive transfer of training from music to the more abstract phonological representations of speech units (syllables).	t	\N
21736927	We used a new virtual program in two experiments to prepare subjects to perform the Morris water task (www.nesplora.com). The subjects were Psychology students; they were trained to locate a safe platform amidst the presence of four pinpoint landmarks spaced around the edge of the pool (i.e., two landmarks relatively near the platform and two landmarks relatively distant away from it). At the end of the training phase, we administered one test trial without the platform and recorded the amount of time that the students had spent in the platform quadrant. In Experiment 1, we conducted the test trial in the presence of one or two of the distant landmarks. When only one landmark was present during testing, performance fell to chance. However, the men outperformed the women when the two distant landmarks were both present. Experiment 2 replicated the previous results and extended it by showing that no sex differences exist when the searching process is based on the near landmarks. Both the men and the women had similarly good performances when the landmarks were present both individually and together. When present together, an addition effect was found. Far landmark tests favor configural learning processes, whereas near landmark tests favor elemental learning. Our findings suggest that other factors in addition to the use of directional cues can underlie the sex differences in the spatial learning process. Thus, we expand upon previous research in the field.	t	\N
21752361	A controversial issue in the cognitive neuroscience of language is the question whether independent lexical representations need to be included in cognitive models. Recent models claim to account for the available data without including phonological or orthographic lexicons. These models base their lexical decision ("Is it a word or not?") either on familiarity of the input string or alternatively, on semantic information. These two alternatives were evaluated in a series of experiments with an individual suffering from word-meaning deafness. This is a rare disorder of auditory word comprehension which affects mapping of a word's phonology to its meaning. The participant, BB, was unaffected by the 'word-likeness' of nonwords with comparable accuracy for plausible and abstruse nonwords. She was further able to make lexical decisions despite her severe impairment in comprehending the word's meaning. Lexical and semantic processing were assessed on an item-specific basis providing a methodological advancement over previous studies. The comprehension tasks involved word-picture matching as well as definition tasks. The results suggest that BB's lexical decisions are based neither on familiarity of the input string nor on semantic information, which was largely unavailable. The only alternative are lexical representations on which she could base her decisions.	t	\N
21753000	Feeling the beat and meter is fundamental to the experience of music. However, how these periodicities are represented in the brain remains largely unknown. Here, we test whether this function emerges from the entrainment of neurons resonating to the beat and meter. We recorded the electroencephalogram while participants listened to a musical beat and imagined a binary or a ternary meter on this beat (i.e., a march or a waltz). We found that the beat elicits a sustained periodic EEG response tuned to the beat frequency. Most importantly, we found that meter imagery elicits an additional frequency tuned to the corresponding metric interpretation of this beat. These results provide compelling evidence that neural entrainment to beat and meter can be captured directly in the electroencephalogram. More generally, our results suggest that music constitutes a unique context to explore entrainment phenomena in dynamic cognitive processing at the level of neural networks.	t	\N
21756455	Cochlear implant (CI) devices are the choice of treatment for individuals with severe to profound hearing loss. The CI devices provide the opportunity for children who are deaf to perceive sound by electrical stimulation of the auditory nerve, with the goal of optimizing oral communication. A natural benefit of acquiring hearing using CIs is the ability to hear, and perhaps enjoy, music. Music is a non-verbal auditory stimulus and a powerful tool for transmitting emotion. Identifying emotional cues is an important part of normal social development and communication and thus music may play an important role in establishing these skills during development. To date, it is not known whether children who use cochlear implants to hear can identify the emotional content carried in music. Our objective in the present study was to determine whether children who have been deaf from infancy and are experienced CI users have acquired the ability to identify emotion in musical phrases. Study participants were 18 CI users (ages 7-13 years) who received right unilateral CIs (mean age at CI activation of 2.9 years) and 18 age-and gender-matched controls. Participants were asked to judge 32 brief musical excerpts as happy or sad by pointing to simple graphics of a smiling or frowning face. Children using CIs were able to correctly distinguish happy versus sad music well above chance levels, but performed more poorly on this task than their peers with typical hearing. Age at CI activation and time since CI activation were both uncorrelated with outcome measures. Children with CIs show the ability to perceive emotion in music but do so less accurately than typically hearing peers.	t	\N
21756504	To describe the rate of occurrence of unsuspected decrease in sensitivity of the sound processor microphone and to evaluate its effect on the patient's audiological performance in terms of reduction in speech recognition scores. Speech processor microphones were tested by connecting the speech processor acoustic monitor circuit to a hearing aid analyzer. The response curves were compared with those obtained from fully working microphones. During a 6-month investigation period, microphone response curves were measured from a group of cochlear implant recipients who had not reported any problems. Despite the absence of any subjective problem, some microphones were found to show a loss of sensitivity. Their users, aged between 4 and 67 years, were tested both with the defective and a working microphone in order to calculate the correlation between the degree of microphone failure and the decline in audiological performance. To quantify the effect of microphone failure, patients' speech recognition skills were measured by live voice connected discourse tracking series administered in different conditions and by recorded sentences lists. A total of 120 apparently fully functioning sound processors were tested in the investigation: 33 (27.5%) were affected by a subjectively unreported sensitivity decrease. Speech-tracking scores correlated significantly with the loss of microphone sensitivity in all test conditions (r = 0.69-0.77, P < 0.05). A high degree of correlation was also found for speech audiometry tests (r = 0.70-0.73, P < 0.05). Microphone sensitivity loss affected speech recognition skills, especially without lip reading and in the presence of background noise. The results indicate that any reduction in sound processor microphone sensitivity causes a degree of hearing decline that negatively affects the cochlear recipient's clinical performance. Microphone faults are often unreported events, and their occurrence rate is underestimated. To establish that the microphone is providing correct input to the speech processor a standard control procedure, including technical and clinical checks, is needed in clinical practice.	t	\N
21756841	The overall goal of the current study was to examine the relationships among uncompensated admittance (Ya) at ambient pressure extracted from tympanograms, energy reflectance (ER) measures at ambient pressure from wideband acoustic transfer functions (WATF), and distortion product otoacoustic emissions (DPOAEs). If WATF measures of Ya are comparable to tympanometric measures of Ya at ambient pressure, it would be further evidence that the two systems provide comparable information at ambient pressure. Such a relationship could be used as a cross-check or validation for WATF measures and support the use of WATFs in lieu of tympanograms in some applications. Finally, if WATF measures of Ya and/or ER at DPOAE stimulus frequencies can account for some of the variability observed in DPOAE levels and/or signal-to-noise ratios (SNRs) in ears with normal hearing, the relationships could be used to improve hearing screening procedures. The hypotheses were as follows: (1) measures of Ya at ambient pressure are significantly correlated as measured with tympanometric and WATF procedures and (2) measures of Ya and ER at DPOAE stimulus frequencies are significantly correlated with DPOAE level and SNR. Repeated measures in human adults. Forty ears of 20 adults with normal hearing and middle ear function were included in the final sample. Tympanograms were obtained using probe frequencies of 226, 678, and 1000 Hz; WATFs were obtained using a click probe, and DPOAEs were elicited with f2's of 1000, 2000, and 4000 Hz. A repeated measures analysis of variance (RM-ANOVA) was completed to identify significant differences between ears and among probe frequencies for Ya measured at ambient pressure from the Tympstar and for Ya measured at the three closest frequencies on the WATFs. Lines were fit to the comparison of Ya from the Tympstar and WATF, and percent variance accounted for (r2) was calculated. Ya and ER were extracted at all stimulus frequencies that were used to elicit DPOAEs and were compared to DPOAE levels and SNRs. RM-ANOVAs were completed to identify any significant differences in DPOAE level and SNR between ears and among f2 conditions. Lines were fit to the comparison of Ya and ER measures at f1 and f2 with the DPOAE levels and SNRs. The relationship between behavioral air conduction threshold at each f2 and DPOAE level (and SNR) was examined with regression analysis. Ya was significantly correlated between the tympanometric and WATF measures at all three probe-tone frequencies. Ya and ER at f1 and f2 were significantly correlated with DPOAE level for f2 = 4000 Hz. The implications are as follows: (1) WATFs, which can be obtained with the same probe microphone system as DPOAEs, could be used as a supplement to tympanometry in a diagnostic test battery, and the relationship between Ya measured on the two systems could be used for verification, and (2) Ya and ER measures from WATFs at both DPOAE stimulus frequencies account for some of the variability observed in DPOAE levels at f2 = 4000 Hz in normal ears.	t	\N
21760512	Good test-retest reliability of high frequency (≥ 8 kHz) pure-tone audiometry (HFPTA) is essential to detect significant changes in hearing threshold caused by ototoxicity. While the test-retest reliability of HFPTA in adults has been extensively studied, such investigations in children are scant. This study aimed to evaluate the test-retest reliability of the HFPTA in normal-hearing children with particular reference to the criteria for ototoxicity. Participants were 125 children aged between 4 and 13 yr, with normal hearing in the 0.25 to 4 kHz range and normal tympanometric findings. The participants were divided into three age groups, 4 to 6 yr (16M; 16F); 7 to 9 yr (22M; 30F); and 10 to 13 yr (24M; 17F), for investigating possible age effects in the test-retest reliability of HFPTA. The instrumentation for the HFPTA procedure was an Interacoustics AC40 audiometer with Koss R/80 high-frequency headphones, calibrated to meet Australian standards. Hearing thresholds at 8, 9, 10, 11.2, 12.5, 14, and 16 kHz were measured in a sound-treated chamber using a modified Hughson-Westlake procedure with a 5 dB step size. Testing began with an ear and test frequency selected at random; the subsequent test frequencies were randomly selected. After acquisition of hearing threshold data at all frequencies, the other ear was tested using the same procedure. After the first HFPTA test, the headphones were removed and carefully replaced. A second HFPTA test was performed with the ear order reversed. The order of testing the ear for the next participant was reversed. Good test-retest reliability of HFPTA was achieved with no significant difference in mean HFPTA thresholds across test and retest conditions for all age groups. An age effect in the test-retest reliability of HFPTA was evident with the 4- to 6-yr-old, 7- to 9-yr-old, and 10- to 13-yr-old children demonstrating normal variability of thresholds (within ±10 dB) in 89.9%, 93.0%, and 97% of ears tested, respectively. When the variability of test-retest thresholds was assessed at each frequency, the 4 to 6 yr group showed significantly lower percentage of normal variability at 14 kHz. In identifying significant deterioration of hearing thresholds across test-retest conditions in relation to the ASHA (1994) ototoxicity criteria, the three age groups (youngest to oldest) demonstrated false-positive rates of 24.6%, 11%, and 7.6%, respectively. : Overall, this study demonstrated high test-retest reliability of HFPTA in all but the 4 to 6 yr group. With a false-positive rate of 24.6% for ototoxicity for the youngest group, it is recommended that the HFPTA should not be used alone in assessing the possibility of a genuine threshold shift for this age group. If possible, the HFPTA should be supplemented with an objective test of auditory function to confirm the diagnosis. For children aged 7 yr or older, the HFPTA test is promising as a useful tool to identify hearing impairment in the extended high-frequency range (>8 kHz). However, interpretation of HFPTA findings in serial testing for monitoring hearing in a child should be made with due attention being given to the frequency of the stimulus, age of the child, and the associated nonzero false-positive rates.	t	\N
21763342	During the last decades music neuroscience has become a rapidly growing field within the area of neuroscience. Music is particularly well suited for studying neuronal plasticity in the human brain because musical training is more complex and multimodal than most other daily life activities, and because prospective and professional musicians usually pursue the training with high and long-lasting commitment. Therefore, music has increasingly been used as a tool for the investigation of human cognition and its underlying brain mechanisms. Music relates to many brain functions like perception, action, cognition, emotion, learning and memory and therefore music is an ideal tool to investigate how the human brain is working and how different brain functions interact. Novel findings have been obtained in the field of induced cortical plasticity by musical training. The positive effects, which music in its various forms has in the healthy human brain are not only important in the framework of basic neuroscience, but they also will strongly affect the practices in neuro-rehabilitation.	t	\N
21769535	Previous work has demonstrated that the ability to keep track of moving objects is improved when the objects have unique visual features, such as color or shape. In the present study, we investigated how orientation information is used during the tracking of objects. Orientation is an interesting feature to explore in moving objects because it is directional and is often informative of the direction of motion. Most objects move forward, in the direction they are oriented. In the present experiments, participants tracked a subset of moving isosceles triangles whose orientations were constant, related, or unrelated to the direction of motion. In the standard multiple object tracking (MOT) task, tracking performance improved when orientations were unique and remained constant, but not when orientation and direction of motion were aligned. In the target recovery task, in which MOT was interrupted by a brief blanking of the display, performance did improve when orientation and direction were aligned. In the final experiment, results showed that orientation was not used before the blank to predict future target locations, but was instead used after the blank. We concluded that people use orientation to compare a stored representation to target position for recovery of lost targets.	t	\N
21771217	There is strong evidence that bilinguals have a deficit in speech perception for their second language compared with monolingual speakers under unfavourable listening conditions (e.g., noise or reverberation), despite performing similarly to monolingual speakers under quiet conditions. This deficit persists for speakers highly proficient in their second language and is greater in those who learned the language later in life. These findings have important educational implications because the number of multilingual children is increasing worldwide, and many of these children are being taught in their non-native language under poor classroom acoustic conditions. The performance of monolingual, bilingual and trilingual speakers on an English speech perception task was examined in both quiet and noisy conditions. Trilingual performance was compared with that of monolingual and bilingual speakers. Monolingual speakers of English and early bilingual and trilingual speakers (i.e., acquired English as a second/third language before the age of 6 years) were recruited. Their fluency in English was tested by interview and by a questionnaire assessing their knowledge and use of the language. Audiological evaluation confirmed normal hearing in all participants. English speech perception was tested in quiet and in different levels of noise (50, 55, 60, 65 and 70 dB SPL) using the Speech Perception in Noise (SPIN) Test. Bilingual and trilingual listeners performed similarly to monolingual listeners in quiet conditions, but their performance declined more rapidly in noise and was significantly poorer at 65 and 70 dB SPL. Trilingual listeners performed less well than bilinguals at these noise levels, but not significantly so. A subgroup of five bilingual speakers who learned Arabic and English simultaneously since birth were poorer at higher levels of noise than monolinguals, but not significantly so. The results replicate previous findings of poorer speech perception in noise with bilingual speakers compared with monolinguals and extend the findings to trilingual speakers.	t	\N
21786870	The ability to obtain reliable phonetic information from a talker's face during speech perception is an important skill. However, lip-reading abilities vary considerably across individuals. There is currently a lack of normative data on lip-reading abilities in young normal-hearing listeners. This letter describes results obtained from a visual-only sentence recognition experiment using CUNY sentences and provides the mean number of words correct and the standard deviation for different sentence lengths. Additionally, the method for calculating T-scores is provided to facilitate the conversion between raw and standardized scores. This metric can be utilized by clinicians and researchers in lip-reading studies. This statistic provides a useful benchmark for determining whether an individual's lip-reading score falls within the normal range, or whether it is above or below this range.	t	\N
21786905	Spectral ripple discrimination thresholds were measured in 15 cochlear-implant users with broadband (350-5600 Hz) and octave-band noise stimuli. The results were compared with spatial tuning curve (STC) bandwidths previously obtained from the same subjects. Spatial tuning curve bandwidths did not correlate significantly with broadband spectral ripple discrimination thresholds but did correlate significantly with ripple discrimination thresholds when the rippled noise was confined to an octave-wide passband, centered on the STC's probe electrode frequency allocation. Ripple discrimination thresholds were also measured for octave-band stimuli in four contiguous octaves, with center frequencies from 500 Hz to 4000 Hz. Substantial variations in thresholds with center frequency were found in individuals, but no general trends of increasing or decreasing resolution from apex to base were observed in the pooled data. Neither ripple nor STC measures correlated consistently with speech measures in noise and quiet in the sample of subjects in this study. Overall, the results suggest that spectral ripple discrimination measures provide a reasonable measure of spectral resolution that correlates well with more direct, but more time-consuming, measures of spectral resolution, but that such measures do not always provide a clear and robust predictor of performance in speech perception tasks.	t	\N
21790494	Three experiments tested how the physical format and information content of forward and backward masks affected the extent of visual pattern masking. This involved using different types of forward and backward masks with target discrimination measured by percentage correct in the first experiment (with a fixed target duration) and by an adaptive threshold procedure in the last two. The rationale behind the manipulation of the content of the masks stemmed from masking theories emphasizing attentional and/or conceptual factors rather than visual ones. Experiment 1 used word masks and showed that masking was reduced (a masking reduction effect) when the forward and backward masks were the same word (although in different case) compared to when the masks were different words. Experiment 2 tested the extent to which a reduction in masking might occur due to the physical similarity between the forward and backward masks by comparing the effect of the same content of the masks in the same versus different case. The result showed a significant reduction in masking for same content masks but no significant effect of case. The last experiment examined whether the reduction in masking effect would be observed with nonword masks--that is, having no high-level representation. No reduction in masking was found from same compared to different nonword masks (Experiment 3). These results support the view that the conscious perception of a rapidly displayed target stimulus is in part determined by high-level perceptual/cognitive factors concerned with masking stimulus grouping and attention.	t	\N
21796943	The spray drying technique was used to obtain the roxithromycin containing microcapsules with high taste masking efficiency. Eudragit L30D-55 was chosen as a barrier coating. The taste was evaluated by an electronic tongue, and taste-masking effect in water lasted at least several dozen hours.	t	\N
21799207	We investigated training-related improvements in listening in noise and the biological mechanisms mediating these improvements. Training-related malleability was examined using a program that incorporates cognitively based listening exercises to improve speech-in-noise perception. Before and after training, auditory brainstem responses to a speech syllable were recorded in quiet and multitalker noise from adults who ranged in their speech-in-noise perceptual ability. Controls did not undergo training but were tested at intervals equivalent to the trained subjects. Trained subjects exhibited significant improvements in speech-in-noise perception that were retained 6 months later. Subcortical responses in noise demonstrated training-related enhancements in the encoding of pitch-related cues (the fundamental frequency and the second harmonic), particularly for the time-varying portion of the syllable that is most vulnerable to perceptual disruption (the formant transition region). Subjects with the largest strength of pitch encoding at pretest showed the greatest perceptual improvement. Controls exhibited neither neurophysiological nor perceptual changes. We provide the first demonstration that short-term training can improve the neural representation of cues important for speech-in-noise perception. These results implicate and delineate biological mechanisms contributing to learning success, and they provide a conceptual advance to our understanding of the kind of training experiences that can influence sensory processing in adulthood.	t	\N
21801242	We investigated the effect of long-term musical training on the time course of development of neuronal representations within the auditory cortex by means of magnetoencephalography. In musicians but not in nonmusicians, pre-attentive encoding of a complex regularity within a tone sequence was evident by a constant increase of the pattern mismatch negativity within < 10 min. The group difference was more pronounced in the left hemisphere, indicating stronger plastic changes in its structures supporting temporal analysis and sound pattern encoding. The results suggest an effect of long-term musical training on short-term auditory learning processes. This has implications not only for cognitive neuroscience in showing how short-term and long-term neuronal plasticity can interact within the auditory cortex, but also for educational and clinical applications of implicit auditory learning where beneficial effects of (musical) experience might be exploited.	t	\N
21805200	It has recently been shown that spatially uninformative sounds can cause a visual stimulus to pop out from an array of similar distractor stimuli when that sound is presented in temporal proximity to a feature change in the visual stimulus. Until now, this effect has predominantly been demonstrated by using stationary stimuli. Here, we extended these results by showing that auditory stimuli can also improve the sensitivity of visual motion change detection. To accomplish this, we presented moving visual stimuli (small dots) on a computer screen. At a random moment during a trial, one of these stimuli could abruptly move in an orthogonal direction. Participants' task was to indicate whether such an abrupt motion change occurred or not by making a corresponding button press. If a sound (a short 1,000 Hz tone pip) co-occurred with the abrupt motion change, participants were able to detect this motion change more frequently than when the sound was not present. Using measures derived from signal detection theory, we were able to demonstrate that the effect on accuracy was due to increased sensitivity rather than to changes in response bias.	t	\N
21817932	The transmission of fine structure information to cochlear implant users is an expanding area of research. Previous studies comparing the fine structure processing (FSP) speech coding strategy to the envelope-based continuous interleaved sampling (CIS) strategy indicated improved speech perception when using the fine structure strategy. Those investigations were performed with an extended frequency spectrum in the low frequencies together with the fine structure strategy. The current study addresses the question whether these improvements are due to the presentation of fine structure per se or rather the extended frequency spectrum. Hence, this cross over study compares the two strategies using an identical frequency spectrum. Randomized crossover study. 31 patients were randomly assigned to two groups. One group was fitted with a CIS map for 4 weeks, tested and subsequently fitted with a FSP map for 4 weeks. The other group followed the same pattern in reverse. Test material consisted of sentence tests in noise, monosyllables in quiet and melody recognition. No statistical significance was noted between the different speech coding strategies at an identical frequency spectrum. This study shows that there is no difference in speech perception with FSP compared to CIS at an extended frequency spectrum. Therefore, the extended frequency spectrum in the low frequencies might explain a benefit of FSP observed in previous studies.	t	\N
21818631	When a single brief flash is accompanied by two auditory beeps, participants often report perceiving two flashes. The present experiment examined whether the perception of illusory redundant flashes can result in faster responses as compared to the perception of a single flash, because previous research has shown such a redundancy gain for physical stimuli. To this end, participants were asked to respond as rapidly as possible to the onset of any flash. Following their response, they additionally indicated whether they perceived a single flash or a double flash. Most importantly, we observed significant shorter reaction times in response to redundant flashes, irrespective of whether they were physically presented or illusorily perceived. Taken together, our results suggest that an illusory percept can affect simple reaction time in much the same manner as the corresponding physical stimulation.	t	\N
21826003	A number of clinical measures of directivity, including the front-to-back ratio (FBR) and front-to-side ratio (FSR), have been suggested to audiologists to monitor the functionality of hearing aids with directional microphones. These suggestions, however, are based on the assumption that directivity measured clinically changes monotonically when compared with changes measured using the directivity index (DI) and perceptual directional benefit. The objective of the present study was to empirically examine this assumption. In addition, the reliability of the clinical directivity measure was estimated to establish a referral threshold for defective directional microphone hearing aids. The directivity of the directional microphones of two behind-the-ear hearing aids was systematically degraded by plugging the microphone ports. The directivity was measured using four clinical measures: the FBR and FSR performed in the test chamber of a hearing aid analyzer and in sound field. Each measure was repeated four times in each directivity-degraded condition. The degraded directivity was also assessed using the DI measure in an anechoic chamber. The perceptual directional benefit in each directivity-degraded condition was obtained by testing 10 hearing-impaired adults in a sound field with diffuse noise using the Hearing in Noise Test (HINT). The results of the DI and HINT measures showed strong correlation between the two FSRs (test chamber and sound field), while the two FBRs showed no correlation. The directivity generated by the FBRs could remain unchanged even when the directional microphone had lost more than 50% of its directivity. The results further indicated that the measures performed in the sound field were more reliable than those performed in the test chamber. Based on the results of the reliability measures, a 30% change in directivity was suggested as the referral threshold signifying defective directional systems. Because the FSR predicts the DI and HINT measurements more accurately than does the FBR, it is suggested that clinicians use the FSR to monitor hearing aid directivity. By using the FSR measure and informed by the suggested referral threshold, clinicians would be able to correctly identify defective directional microphone hearing aids at an early stage, rather than at a point when the directivity has been diminished completely or even reversed.	t	\N
21826552	Strong cross-modal interactions exist between visual and auditory processing. The relative contributions of perceptual versus decision-related processes to such interactions are only beginning to be understood. We used methodological and statistical approaches to control for potential decision-related contributions such as response interference, decisional criterion shift, and strategy selection. Participants were presented with rising-, falling-, and constant-amplitude sounds and were asked to detect change (increase or decrease) in sound amplitude while ignoring an irrelevant visual cue of a disk that grew, shrank, or stayed constant in size. Across two experiments, testing context was manipulated by varying the grouping of visual cues during testing, and cross-modal congruency showed independent perceptual and decision-related effects. Whereas a change in testing context greatly affected criterion shifts, cross-modal effects on perceptual sensitivity remained relatively consistent. In general, participants were more sensitive to increases in sound amplitude and less sensitive to sounds paired with dynamic visual cues. As compared with incongruent visual cues, congruent cues enhanced detection of amplitude decreases, but not increases. These findings suggest that the relative contributions of perceptual and decisional processing and the impacts of these processes on cross-modal interactions can vary significantly depending on asymmetries in within-modal processing, as well as consistencies in cross-modal dynamics.	t	\N
21833562	A bone-anchored hearing aid (Baha) is used in patients with single-sided sensorineural deafness (SSD) to overcome the head shadow effect. Of all the patients with SSD, treated at our hospital, 196 patients used a Baha on trial between November 2001 and April 2010. The objective of this study is to evaluate what factors determine the decision of a SSD patient whether or not to opt for a Baha device following a Baha trial period. 196 patients with SSD were enrolled for a trial period of 2 weeks at the Antwerp University Hospital, a tertiary referral centre. 93% of these patients suffered from an acquired hearing loss. 44% of all the patients (87/196) chose to wear a Baha device after the trail period, either on an abutment or on a headband. The collected data were analysed to determine correlations between the decision of a patient following a Baha trial period on the one hand, and Fletcher Index ipsi- and contralaterally, bone conduction hearing thresholds at the better hearing ear, aetiology, age at the start of the trial period, duration of hearing loss at the start of the trial period and the type of device used during the trial period, on the other hand. Although 66% of all the patients (109/196) declined the Baha after a trial, reasons not to choose a Baha were diverse and no crucial factors could be found that determine the success of a Baha trial period. Lack of improvement concerning speech understanding in noise was the most important reason mentioned by patients who declined the Baha. The authors advocate that all patients, suffering from SSD, should be offered the opportunity to try a Baha device as no factors could be found that determine the decision of a patient following the trial period.	t	\N
21844339	"Normal hearing" is typically defined by threshold audibility, even though everyday communication relies on extracting key features of easily audible sound, not on sound detection. Anecdotally, many normal-hearing listeners report difficulty communicating in settings where there are competing sound sources, but the reasons for such difficulties are debated: Do these difficulties originate from deficits in cognitive processing, or differences in peripheral, sensory encoding? Here we show that listeners with clinically normal thresholds exhibit very large individual differences on a task requiring them to focus spatial selective auditory attention to understand one speech stream when there are similar, competing speech streams coming from other directions. These individual differences in selective auditory attention ability are unrelated to age, reading span (a measure of cognitive function), and minor differences in absolute hearing threshold; however, selective attention ability correlates with the ability to detect simple frequency modulation in a clearly audible tone. Importantly, we also find that selective attention performance correlates with physiological measures of how well the periodic, temporal structure of sounds above the threshold of audibility are encoded in early, subcortical portions of the auditory pathway. These results suggest that the fidelity of early sensory encoding of the temporal structure in suprathreshold sounds influences the ability to communicate in challenging settings. Tests like these may help tease apart how peripheral and central deficits contribute to communication impairments, ultimately leading to new approaches to combat the social isolation that often ensues.	t	\N
21860975	The electrical stimulation generated by the Cochlear Implant (CI) may improve the neural synchrony and hence contribute to the development of auditory skills in patients with Auditory Neuropathy/Auditory Dyssynchrony (AN/AD). Prospective cohort cross-sectional study to evaluate the auditory performance and the characteristics of the electrically evoked compound action potential (ECAP) in 18 children with AN/AD and cochlear implants. The auditory perception was evaluated by sound field thresholds and speech perception tests. To evaluate ECAP's characteristics, the threshold and amplitude of neural response were evaluated at 80 Hz and 35 Hz. No significant statistical difference was found concerning the development of auditory skills. The ECAP's characteristics differences at 80 and 35 Hz stimulation rate were also not statistically significant. The CI was seen as an efficient resource to develop auditory skills in 94% of the AN/AD patients studied. The auditory perception benefits and the possibility to measure ECAP showed that the electrical stimulation could compensate for the neural dyssynchrony caused by the AN/AD. However, a unique clinical procedure cannot be proposed at this point. Therefore, a careful and complete evaluation of each AN/AD patient before recommending a Cochlear Implant is advised. Clinical Trials: NCT01023932.	t	\N
21860978	Hearing has an important role in human development and social adaptation in blind people. To evaluate the performance of temporal auditory processing in blind people; to characterize the temporal resolution ability; to characterize the temporal ordinance ability and to compare the performance of the study population in the applied tests. Fifteen blind adults participated in this study. A cross-sectional study was undertaken; approval was obtained from the Pernambuco Catholic University Ethics Committee, no. 003/2008. Temporal auditory processing was excellent--the average composed threshold in the original RGDT version was 4. 98 ms; it was 50 ms for all frequencies in the expanded version. PPS and DPS results ranged from 95% to 100%. There were no quantitative differences in the comparison of tests; but oral reports suggested that the original RGDT original version was more difficult. The study sample performed well in temporal auditory processing; it also performed well in temporal resolution and ordinance abilities.	t	\N
21862680	To evaluate potential contributions of broadband spectral integration in the perception of static vowels. Specifically, can the auditory system infer formant frequency information from changes in the intensity weighting across harmonics when the formant itself is missing? Does this type of integration produce the same results in the lower (first formant [F1]) and higher (second formant [F2]) regions? Does the spacing between the spectral components affect a listener's ability to integrate the acoustic cues? Twenty young listeners with normal hearing identified synthesized vowel-like stimuli created for adjustments in the F1 region (/Λ/-/α/, /i/-/ε/) and in the F2 region (/Λ/-/æ/). There were 2 types of stimuli: (a) 2-formant tokens and (b) tokens in which 1 formant was removed and 2 pairs of sine waves were inserted below and above the missing formant; the intensities of these harmonics were modified to cause variations in their spectral center of gravity (COG). The COG effects were tested over a wide range of frequencies. Obtained patterns were consistent with calculated changes to the spectral COG, in both the F1 and F2 regions. The spacing of the sine waves did not affect listeners' responses. The auditory system may perform broadband integration as a type of auditory wideband spectral analysis.	t	\N
21877769	This study investigated the ability to use temporal-envelope (E) cues in a consonant identification task when presented within one or two frequency bands. Syllables were split into five bands spanning the range 70-7300 Hz with each band processed to preserve E cues and degrade temporal fine-structure cues. Identification scores were measured for normal-hearing listeners in quiet for individual processed bands and for pairs of bands. Consistent patterns of results were obtained in both the single- and dual-band conditions: identification scores increased systematically with band center frequency, showing that E cues in the higher bands (1.8-7.3 kHz) convey greater information.	t	\N
21877806	A functional simulation of hearing loss was evaluated in its ability to reproduce the temporal masking functions for eight listeners with mild to severe sensorineural hearing loss. Each audiometric loss was simulated in a group of age-matched normal-hearing listeners through a combination of spectrally-shaped masking noise and multi-band expansion. Temporal-masking functions were obtained in both groups of listeners using a forward-masking paradigm in which the level of a 110-ms masker required to just mask a 10-ms fixed-level probe (5-10 dB SL) was measured as a function of the time delay between the masker offset and probe onset. At each of four probe frequencies (500, 1000, 2000, and 4000 Hz), temporal-masking functions were obtained using maskers that were 0.55, 1.0, and 1.15 times the probe frequency. The slopes and y-intercepts of the masking functions were not significantly different for listeners with real and simulated hearing loss. The y-intercepts were positively correlated with level of hearing loss while the slopes were negatively correlated. The ratio of the slopes obtained with the low-frequency maskers relative to the on-frequency maskers was similar for both groups of listeners and indicated a smaller compressive effect than that observed in normal-hearing listeners.	t	\N
21877812	The auditory system takes advantage of early reflections (ERs) in a room by integrating them with the direct sound (DS) and thereby increasing the effective speech level. In the present paper the benefit from realistic ERs on speech intelligibility in diffuse speech-shaped noise was investigated for normal-hearing and hearing-impaired listeners. Monaural and binaural speech intelligibility tests were performed in a virtual auditory environment where the spectral characteristics of ERs from a simulated room could be preserved. The useful ER energy was derived from the speech intelligibility results and the efficiency of the ERs was determined as the ratio of the useful ER energy to the total ER energy. Even though ER energy contributed to speech intelligibility, DS energy was always more efficient, leading to better speech intelligibility for both groups of listeners. The efficiency loss for the ERs was mainly ascribed to their altered spectrum compared to the DS and to the filtering by the torso, head, and pinna. No binaural processing other than a binaural summation effect could be observed.	t	\N
21878379	Otoacoustic emission (OAE) amplitude can be reduced by acoustic stimulation. This effect is produced by the medial olivocochlear (MOC) reflex. Past studies have shown that the MOC reflex is related to listening in noise and attention. In the present study, the relationship between strength of the contralateral MOC reflex and masked threshold was investigated in 19 adults. Detection thresholds were determined for 1000-Hz, 300-ms tone presented simultaneously with one repetition of a 300-ms masker in an ongoing train of masker bursts. Three masking conditions were tested: 1) broadband noise 2) a fixed-frequency 4-tone complex masker and 3) a random-frequency 4-tone complex masker. Broadband noise was expected to produce energetic masking and the tonal maskers were expected to produce informational masking in some listeners. DPOAEs were recorded at fine frequency intervals from 500 to 4000 Hz, with and without contralateral acoustic stimulation. MOC reflex strength was estimated as a reduction in baseline level and a shift in frequency of DPOAE fine-structure maxima near 1000-Hz. MOC reflex and psychophysical testing were completed in separate sessions. Individuals with poorer thresholds in broadband noise and in random-frequency maskers were found to have stronger MOC reflexes.	t	\N
21886362	Event-related potential (ERPs) provide an exquisite means to monitor the extent of processing of external stimulus input during sleep. The processing of relatively high intensity stimuli has been well documented. Sleep normally occurs in much less noisy environments. The present study therefore employed ERPs to examine the extent of processing of very low intensity (near-hearing threshold) stimuli. Brief duration 1000 Hz auditory tone bursts varying in intensity at random from -5 to +45 dB from normative hearing level (nHL) in 10 dB steps were presented every 1.5 to 2.5 s when the subject was awake and reading a book and again during all-night sleep. n = 10 healthy young adults. In the waking state, the auditory stimuli elicited a negative-going deflection, N1, peaking at about 100 ms, followed by a smaller positivity, P2, peaking at about 180 ms. N1-P2 gradually decreased in amplitude with decreases in stimulus intensity and remained visible at near-hearing threshold levels. During NREM sleep, the amplitude of N1 was at baseline level and was reduced to only 15% to 20% of its waking amplitude during REM sleep. P2 was much larger in sleep than in wakefulness. Importantly, during sleep, P2 could be reliably elicited by the auditory stimuli to within 15 dB of threshold. During NREM, a large amplitude negativity peaking at about 350 ms was elicited by the higher intensity stimuli. This N350 was much reduced in amplitude during REM sleep. A significant N350 was not, however, elicited when stimuli intensity levels were below 25 dB nHL. Auditory stimuli that are only slightly above hearing threshold appear to be processed extensively during a 200 to 400 ms interval in both NREM and REM sleep. The nature of this processing is, however, very different compared to the waking state.	t	\N
21889805	Auditory training has been advocated as a management strategy for children with hearing, listening or language difficulties. Because poor speech-in-noise perception is commonly reported, previous research has focused on the use of complex (word/sentence) stimuli as auditory training material to improve sentence-in-noise perception. However, some evidence suggests that engagement with the training stimuli is more important than the type of stimuli used for training. The aim of this experiment was to assess if sentence-in-noise perception could be improved using simpler auditory training stimuli. We recruited 41 typically developing, normal-hearing children aged 8-10 years divided into four groups. Groups 1-3 trained over 4 weeks (12 × 30 min sessions) on either: (1) pure-tone frequency discrimination (FD), (2) FD in a modulated noise (FDN) or, (3) mono-syllabic words in a modulated noise (WN). Group 4 was an untrained Control. In the training tasks, either tone frequency (Group 1), or tone (Group 2) or speech (Group 3) level was varied adaptively. All children completed pre- and post-training tests of sentence perception in modulated (SMN) and unmodulated (SUN) noise and a probe measure of each training task. All trained groups improved significantly on the trained tasks. Transfer of training occurred between FDN training and FD, WN and SMN testing, and between WN training and SMN testing. A significant performance suppression on the SUN test resulted from FD and FDN training. The pattern of training-induced improvement, relative to Controls, suggests that transfer of training is more likely when some stimulus dimensions (tone frequency, speech, modulated noise) are shared between training tasks and outcomes. This and the finding of suppressed post-training performance, relative to Controls, between tasks not sharing a stimulus dimension both favour the use of outcome-specific material for auditory training.	t	\N
21895071	Human listening tests were conducted to investigate if participants could distinguish between samples of target echoes and clutter obtained from a broadband active sonar experiment. For each echo, the listeners assigned a rating based on how confident they were that it was a target echo or clutter. The measure of performance was the area under the binormal receiver-operating-characteristic (ROC) curve, A(z). The mean performance was A(z)=0.95 ± 0.04 when signals were presented with their full available acoustic bandwidth of approximately 0-2 kHz. It was A(z)=0.77 ± 0.08 when the bandwidth was reduced to 0.5-2 kHz. The error bounds are stated as 95% confidence intervals. These results show that the listeners could definitely hear differences, but their performance was significantly degraded when the low-frequency signal information was removed. The performance of an automatic aural classifier was compared against this human-performance baseline. Results of statistical tests showed that it outperformed 2 of 13 listeners and 5 of 9 human listeners in the full-bandwidth and reduced-bandwidth tests, respectively, and performed similarly to the other listeners. Given its performance, the automatic aural classifier may prove beneficial to Navy sonar systems.	t	\N
21895094	In cochlear implants (CIs), simultaneous or sequential stimulation of adjacent electrodes can produce intermediate pitch percepts between those of the component electrodes. However, it is unclear whether simultaneous and sequential virtual channels (VCs) can be discriminated. In this study, CI users were asked to discriminate simultaneous and sequential VCs; discrimination was measured for monopolar (MP) and bipolar + 1 stimulation (BP + 1), i.e., relatively broad and focused stimulation modes. For sequential VCs, the interpulse interval (IPI) varied between 0.0 and 1.8 ms. All stimuli were presented at comfortably loud, loudness-balanced levels at a 250 pulse per second per electrode (ppse) stimulation rate. On average, CI subjects were able to reliably discriminate between sequential and simultaneous VCs. While there was no significant effect of IPI or stimulation mode on VC discrimination, some subjects exhibited better VC discrimination with BP + 1 stimulation. Subjects' discrimination between sequential and simultaneous VCs was correlated with electrode discrimination, suggesting that spatial selectivity may influence perception of sequential VCs. To maintain equal loudness, sequential VC amplitudes were nearly double those of simultaneous VCs, presumably resulting in a broader spread of excitation. These results suggest that perceptual differences between simultaneous and sequential VCs might be explained by differences in the spread of excitation.	t	\N
21895095	Three experiments were designed to examine temporal envelope processing by cochlear implant (CI) listeners. In experiment 1, the hypothesis that listeners' modulation sensitivity would in part determine their ability to discriminate between temporal modulation rates was examined. Temporal modulation transfer functions (TMTFs) obtained in an amplitude modulation detection (AMD) task were compared to threshold functions obtained in an amplitude modulation rate discrimination (AMRD) task. Statistically significant nonlinear correlations were observed between the two measures. In experiment 2, results of loudness-balancing showed small increases in the loudness of modulated over unmodulated stimuli beyond a modulation depth of 16%. Results of experiment 3 indicated small but statistically significant effects of level-roving on the overall gain of the TMTF, but no impact of level-roving on the average shape of the TMTF across subjects. This suggested that level-roving simply increased the task difficulty for most listeners, but did not indicate increased use of intensity cues under more challenging conditions. Data obtained with one subject, however, suggested that the most sensitive listeners may derive some benefit from intensity cues in these tasks. Overall, results indicated that intensity cues did not play an important role in temporal envelope processing by the average CI listener.	t	\N
21912929	Previous research examining cross-modal conflicts in object recognition has often made use of animal vocalizations and images, which may be considered natural and ecologically valid, thus strengthening the association in the congruent condition. The current research tested whether the same cross-modal conflict would exist for man-made object sounds as well as comparing the speed and accuracy of auditory processing across the two object categories. Participants were required to attend to a sound paired with a visual stimulus and then respond to a verification item (e.g., "Dog?"). Sounds were congruent (same object), neutral (unidentifiable image), or incongruent (different object) with the images presented. In the congruent and neutral condition, animals were recognized significantly faster and with greater accuracy than man-made objects. It was hypothesized that in the incongruent condition, no difference in reaction time or error rate would be found between animals and man-made objects. This prediction was not supported, indicating that the association between an object's sound and image may not be that disparate when comparing animals to man-made objects. The findings further support cross-modal conflict research for both the animal and man-made object category. The most important finding, however, was that auditory processing is enhanced for living compared to nonliving objects, a difference only previously found in visual processing. Implications relevant to both the neuropsychological literature and sound research are discussed.	t	\N
21913927	In prior studies, we have used a conditional linkage between rare deviations in a regular sound pattern to determine if the auditory system can use the first deviation to anticipate the probable features of the second deviation (i.e., make a conditional inference). This study was designed to test two hypotheses about why the mismatch negativity (MMN) to a duration deviant sound seems more susceptible to conditional inference effects. The MMNs to duration and frequency glide deviant sounds were significantly smaller when their occurrence was conditionally linked to the identity of a prior deviant as opposed to when they occurred randomly in a sequence. Results provide support for the learned conditional inference interpretation of reduced MMN size to linked deviants. We discuss alternate explanations and conclude that conditional inference studies could provide insight into the dynamics of probability-based prediction in the auditory system.	t	\N
21920411	Tinnitus is described as an auditory perception in the absence of any external sound source. Tinnitus loudness has been correlated to sustained high frequency gamma-band activity in auditory cortex. It remains unknown whether unilateral tinnitus is always generated in the left auditory cortex, irrespective of the side on which the tinnitus is perceived, or in the contralateral auditory cortex. In order to solve this enigma source localized electroencephalographic (EEG) recordings of a homogenous group of unilateral left and right-sided tinnitus patients presenting with noise-like tinnitus was analyzed. Based on a region of interest analysis, the most important result of this study is that tinnitus lateralization depended on the gamma-band activity of the contralateral parahippocampal area. As for the auditory cortex no differences were found between left-sided and right-sided tinnitus patients. However, in comparison to a control group both left and right-sided tinnitus patients had an increased gamma-band activity in both the left and right primary and secondary auditory cortex. Thus whereas in tinnitus the primary and secondary auditory cortices of both sides are characterized by increased gamma-band activity, the side on which the tinnitus is perceived relates to gamma-band activity in the contralateral parahippocampal area.	t	\N
21926628	The purpose of this study was to determine the reliability of the electrophysiological binaural beat steady state response as a gauge of temporal fine structure coding, particularly as it relates to the aging auditory system. The hypothesis was that the response would be more robust in a lower, than in a higher, frequency region and in younger, than in older, adults. Two experiments were undertaken. The first measured the 40 Hz binaural beat steady state response elicited by tone pairs in two frequency regions: lower (390 and 430 Hz tone pair) and higher (810 and 850 Hz tone pair). Frequency following responses (FFRs) evoked by the tones were also recorded. Ten young adults with normal hearing participated. The second experiment measured the binaural beat and FFRs in older adults but only in the lower frequency region. Fourteen older adults with relatively normal hearing participated. Response metrics in both experiments included response component signal-to-noise ratio (F statistic) and magnitude-squared coherence. Experiment 1 showed that FFRs were elicited in both frequency regions but were more robust in the lower frequency region. Binaural beat responses elicited by the lower frequency pair of tones showed greater amplitude fluctuation within a participant than the respective FFRs. Experiment 2 showed that older adults exhibited similar FFRs to younger adults, but proportionally fewer older participants showed binaural beat responses. Age differences in onset responses were also observed. The lower prevalence of the binaural beat response in older adults, despite the presence of FFRs, provides tentative support for the sensitivity of this measure to age-related deficits in temporal processing. However, the lability of the binaural beat response advocates caution in its use as an objective measure of fine structure coding.	t	\N
21928934	An experiment is reported that investigates the relation between the suffix effect and the effect of irrelevant sound on the serial recall of short sequences of spoken material. The main issue was whether there is an effect of irrelevant sound under articulatory suppression in the presence of a spoken suffix. As in Hanley and Bakopoulou (2003), the irrelevant sound comprised speech that was presented during the retention interval. When a spoken suffix appeared at the end of the list, a significant effect of irrelevant sound remained when participants were able to rehearse list items. However, it disappeared under articulatory suppression. The effects of irrelevant sound remained significant under suppression when the suffix was an auditory tone but was confined to the final position of the serial position curve. These results parallel those reported by Jones, Macken, and Nicholls (2004) and Jones, Hughes, and Macken (2006) when they examined the effect of articulatory suppression on the phonological similarity effect. The results are consistent with Jones et al.'s (2006, 2004) view that an acoustic-perceptual representation of the terminal list items is the source of the effects of irrelevant sound and phonological similarity when they occur in the presence of articulatory suppression.	t	\N
21928935	Serial recall from working memory is known to be impaired by the presence of irrelevant background speech, but several prior studies have concluded that the magnitude of the impairment is independent of the phonological relationship between to-be-remembered (TBR) and to-be-ignored (TBI) sources of information. In the present study, we examined the influence of between-stream phonological similarity in serial recall while attending to a heretofore uncontrolled variable, the phonetic feature. We found that TBI items sharing many phonetic features with TBR items produced significantly stronger working-memory impairments than TBI items with minimal phonetic feature overlap. In addition, participants were more likely to report remembering incorrect items that incorporated phonological characteristics of the TBI stream in the high-overlap condition. These findings provide evidence for subphonemic between-stream interactions and suggest that multiple parallel processes contribute to the irrelevant speech effect. We propose that a 2-component model, which combines the assumptions of process- and content-based accounts for the irrelevant speech effect, offers the best explanation for these findings.	t	\N
21930137	In daily communication, we often use indirect speech to convey our intention. However, little is known about the brain mechanisms that underlie the comprehension of indirect speech. In this study, we conducted a functional MRI experiment using a scenario reading task to compare the neural activity induced by an indirect reply (a type of indirect speech) and a literal sentence. Participants read a short scenario consisting of three sentences. The first two sentences explained the situation of the protagonists, whereas the third sentence had an indirect, literal, or unconnected meaning. The indirect reply condition primarily activated the bilateral fronto-temporal networks (Brodmann's Areas (BA) 47 and 21) and the dorso-medial prefrontal cortex (dmPFC). In the literal sentence condition, only the left fronto-temporal network (BA 45 and 21) and the dmPFC (posterior region) were activated. In addition, we found greater activation resulting from comprehension of an indirect reply than from literal sentence comprehension in the dmPFC, the left middle frontal area (BA 9), the bilateral inferior frontal area (BA 9/47), and the right middle temporal area (BA 21). Our findings indicate that the right and left fronto-temporal networks play a crucial role in detecting contextual violations, whereas the medial frontal cortex is important for generating inferences to make sense of remarks within a context.	t	\N
21932257	To examine how musical expertise tunes the brain to subtle metric anomalies in an ecological musical context, we presented piano compositions ending on standard and deviant cadences (endings) to expert pianists and musical laymen, while high-density EEG was recorded. Temporal expectancies were manipulated by substituting standard "masculine" cadences at metrically strong positions with deviant, metrically unaccented, "feminine" cadences. Experts detected metrically deviant cadences better than laymen. Analyses of event-related potentials demonstrated that an early P3a-like component (~150-300 ms), elicited by musical closure, was significantly enhanced at frontal and parietal electrodes in response to deviant endings in experts, whereas a reduced response to deviance occurred in laymen. Putative neuronal sources contributing to the modulation of this component were localized in a network of brain regions including bilateral supplementary motor areas, middle and posterior cingulate cortex, precuneus, associative visual areas, as well as in the right amygdala and insula. In all these regions, experts showed enhanced responses to metric deviance. Later effects demonstrated enhanced activations within the same brain network, as well as higher processing speed for experts. These results suggest that early brain responses to metric deviance in experts may rely on motor representations mediated by the supplementary motor area and motor cingulate regions, in addition to areas involved in self-referential imagery and relevance detection. Such motor representations could play a role in temporal sensory prediction evolved from musical training and suggests that rhythm evokes action more strongly in highly trained instrumentalists.	t	\N
21934538	The processing of case-marking and argument structures was investigated in children at the age of 3 years, 4 years and 6 months, and 6 years. Two event-related potential (ERP) experiments were conducted in a case-marked language, i.e. German, comparing the processing of (a) double-nominative violations with subject-initial structures and (b) double-accusative violations with object-initial structures. It is known that for both violation types, adults display a biphasic N400/P600 ERP response, reflecting thematic-semantic, and syntactic processes. For double-nominative violations, 3-year-old children already show an adult-like processing pattern revealing their abilities to repair the tested structure. For double-accusative violations, ERP results indicate developmental processing differences with even 6-year-old children not showing an adult pattern. This suggests a late development of the complete function of the accusative case.	t	\N
21936745	To evaluate the feasibility, the duration and results of sedation by intrarectal pentobarbital and oral alimemazine for auditory brain stem responses (ABR) and auditory steady-state responses (ASSR) recordings in children aged 2 to 5 years. Prospective study. 180 consecutive children aged 2 to 5 years, referred for language retardation and/or behavioral problems, who could not be tested by behavioral methods, underwent ABR and ASSR recordings. The children who did not spontaneously nap were sedated by intrarectal pentobarbital eventually potentiated by oral alimemazine. A spontaneous nap was obtained in only 23 cases, 72 children received only pentobarbital, and 85 received both pentobarbital and alimemazine. Even so, recording was impossible in 16 cases, and interrupted before completion of the ASSR recordings in 45 cases. Children went to sleep in average 64 min +/- 40. The average recording time for the ABR was 20 minutes, and for the ASSR 25 minutes. Sedation by pentobarbital, eventually completed by oral alimemazine, allows ABR and/or ASSR recordings in 89.8% of the children who did not nap in the recording room, and is therefore a good alternative to general anesthesia in these children.	t	\N
21940982	The goal of this study was to assess the functional utility of transient noise reduction (TNR) algorithms available in hearing aids via speech intelligibility and user preferences. Two pairs of hearing aids, 1 pair each from Siemens and Unitron, were programmed for 17 hearing impaired individuals after a hearing evaluation. Intelligibility was measured for each participant for sentences presented in quiet, with 2 types of transient noise, multitalker babble, and in a combination of each type of transient noise and multitalker babble. Each condition was tested with TNR activated and TNR deactivated in a counterbalanced, single-blinded format. Subjective ratings of overall speech understanding, comfort, and sound quality were obtained for each condition. A significant improvement in speech intelligibility was measured with the TNR activated when speech was presented in multitalker babble, in the presence of chair clang transient noises, and when combining these noises. Activation of the TNR algorithm did not result in significant improvements for any of the subjective ratings. While improvements were limited to certain conditions, specifically those with the chair clang transient and/or multitalker babble, TNR appears to offer an incremental step in improving the listening experience for hearing aid users.	t	\N
21945467	The different response characteristics of the different auditory cortical responses under conventional central masking conditions were examined by comparing the effects of contralateral white noise on the cortical component of 40-Hz auditory steady state fields (ASSFs) and the N100 m component in auditory evoked fields (AEFs) for tone bursts using a helmet-shaped magnetoencephalography system in 8 healthy volunteers (7 males, mean age 32.6 years). The ASSFs were elicited by monaural 1000 Hz amplitude modulation tones at 80 dB SPL, with the amplitude modulated at 39 Hz. The AEFs were elicited by monaural 1000 Hz tone bursts of 60 ms duration (rise and fall times of 10 ms, plateau time of 40 ms) at 80 dB SPL. The results indicated that continuous white noise at 70 dB SPL presented to the contralateral ear did not suppress the N100 m response in either hemisphere, but significantly reduced the amplitude of the 40-Hz ASSF in both hemispheres with asymmetry in that suppression of the 40-Hz ASSF was greater in the right hemisphere. Different effects of contralateral white noise on these two responses may reflect different functional auditory processes in the cortices.	t	\N
21954871	This study examined the development of uncertainty monitoring in early childhood. Specifically, this study tested the prediction that preschoolers can reflect on their sense of certainty about the likely accuracy of their decisions, and it examined whether this ability differs across domains. Three-, 4-, and 5-year-olds (N = 74) completed a perceptual identification and a lexical identification task in which they reported whether they were certain or uncertain about their answers. Results showed that even 3-year-olds provided confidence judgments that discriminated accurate from inaccurate responses, but this discrimination increased with age. Furthermore, results suggest that 3-year-olds primarily rely on response latency to assess certainty, whereas older preschoolers do not. Overall, these findings suggest that uncertainty monitoring emerges and develops during the preschool years.	t	\N
21962945	While grammatical aspects of language are preserved, executive deficits are prominent in Lewy body spectrum disorder (LBSD), including Parkinson's disease (PD), Parkinson's dementia (PDD) and dementia with Lewy bodies (DLB). We examined executive control during sentence processing in LBSD by assessing temporary structural ambiguities. Using an on-line word detection procedure, patients heard sentences with a syntactic structure that has high-compatibility or low-compatibility with the main verb's statistically preferred syntactic structure, and half of the sentences were lengthened strategically between the onset of the ambiguity and its resolution. We found selectively slowed processing of lengthened ambiguous sentences in the PDD/DLB subgroup. This correlated with impairments on measures of executive control. Regression analyses related the working memory deficit during ambiguous sentence processing to significant cortical thinning in frontal and parietal regions. These findings emphasize the role of prefrontal disease in the executive limitations that interfere with processing ambiguous sentences in LBSD.	t	\N
21967269	Six experiments studied relative frequency judgment and recall of sequentially presented items drawn from 2 distinct categories (i.e., city and animal). The experiments show that judged frequencies of categories of sequentially encountered stimuli are affected by certain properties of the sequence configuration. We found (a) a first-run effect whereby people overestimated the frequency of a given category when that category was the first repeated category to occur in the sequence and (b) a dissociation between judgments and recall; respondents may judge 1 event more likely than the other and yet recall more instances of the latter. Specifically, the distribution of recalled items does not correspond to the frequency estimates for the event categories, indicating that participants do not make frequency judgments by sampling their memory for individual items as implied by other accounts such as the availability heuristic (Tversky & Kahneman, 1973) and the availability process model (Hastie & Park, 1986). We interpret these findings as reflecting the operation of a judgment heuristic sensitive to sequential patterns and offer an account for the relationship between memory and judged frequencies of sequentially encountered stimuli.	t	\N
21973363	Spectral-ripple discrimination has been used widely for psychoacoustical studies in normal-hearing, hearing-impaired, and cochlear implant listeners. The present study investigated the perceptual mechanism for spectral-ripple discrimination in cochlear implant listeners. The main goal of this study was to determine whether cochlear implant listeners use a local intensity cue or global spectral shape for spectral-ripple discrimination. The effect of electrode separation on spectral-ripple discrimination was also evaluated. Results showed that it is highly unlikely that cochlear implant listeners depend on a local intensity cue for spectral-ripple discrimination. A phenomenological model of spectral-ripple discrimination, as an "ideal observer," showed that a perceptual mechanism based on discrimination of a single intensity difference cannot account for performance of cochlear implant listeners. Spectral modulation depth and electrode separation were found to significantly affect spectral-ripple discrimination. The evidence supports the hypothesis that spectral-ripple discrimination involves integrating information from multiple channels.	t	\N
21974502	Thresholds of vowel formant discrimination for F1 and F2 of isolated vowels with full and partial vowel spectra were measured for normal-hearing listeners at fixed and roving speech levels. Performance of formant discrimination was significantly better for fixed levels than for roving levels with both full and partial spectra. The effect of vowel spectral range was present only for roving levels, but not for fixed levels. These results, consistent with studies of profile analysis, indicated different perceptual mechanisms for listeners to discriminate vowel formant frequency at fixed and roving levels.	t	\N
21986211	Hearing loss is routinely estimated from the audiogram, even though this measure gives only a rough approximation of hearing. Indeed, cochlear regions functioning poorly, if at all, called dead regions, are not detected by a simple audiogram. To detect cochlear dead regions, additional measurements of psychophysical tuning curves or thresholds in background noise (TEN test) are required. A first aim of this study was to assess the presence of dead regions after impulse noise trauma using psychophysical tuning curves. The procedure we used was based on a compromise between the need to collect reliable estimates of psychophysical tuning curves and the limited time available to obtain these estimates in a hospital setting. Psychophysical tuning curves were measured using simultaneous masking with a 2-alternative forced choice paradigm, where the target was randomly placed in one of the two masker presentations. It is well known that some components of noise-induced hearing loss are reversible. A second aim of this study was to examine the potential recovery of dead regions after acoustic trauma. A third issue addressed in this article was the relationship between noise-induced dead regions and tinnitus. We found that 70% of the subjects had dead regions after noise trauma, while 88% reported tinnitus. Moreover, we found that the extent of dead regions probably diminished in about 50% of subjects, which highlights the ability of the human auditory system to recover from noise-induced hearing loss.	t	\N
21993050	Prescriptive methods have been at the core of modern hearing aid fittings for the past several decades. Every decade or so, there have been revisions to existing methods and/or the emergence of new methods that become widely used. In 2001 Byrne et al provided a comparison of insertion gain for generic prescriptive methods available at that time. The purpose of this article was to compare National Acoustic Laboratories-Non-linear 1 (NAL-NL1), National Acoustic Laboratories-Non-linear 2 (NAL-NL2), Desired Sensation Level Multistage Input/Output (DSL m[i/o]), and Cambridge Method for Loudness Equalization 2-High-Frequency (CAMEQ2-HF) prescriptive methods for adults on the amplification characteristics of prescribed insertion gain and compression ratio. Following the differences observed in prescribed insertion gain among the four prescriptive methods, analyses of predicted specific loudness, overall loudness, and bandwidth of cochlear excitation and effective audibility as well as speech intelligibility of the international long-term average speech spectrum (ILTASS) at an average conversational input level were completed. These analyses allow for the discussion of similarities and differences among the present-day prescriptive methods. The impact of insertion gain differences among the methods is examined for seven hypothetical hearing loss configurations using models of loudness perception and speech intelligibility. Hearing loss configurations for adults of various types and degrees were selected, five of which represent sensorineural impairment and were used by Byrne et al; the other two hearing losses provide an example of mixed and conductive impairment. Prescribed insertion gain data were calculated in 1/3-octave frequency bands for each of the seven hearing losses from the software application of each prescriptive method over multiple input levels. The insertion gain data along with a diffuse field-to-eardrum transfer function were used to calculate output levels at the eardrums of the hypothetical listeners. Levels of hearing loss and output were then used in the Moore and Glasberg loudness model and the ANSI S3.5-1997 Speech Intelligibility Index model. NAL-NL2 and DSL m[i/o] provided comparable overall loudness of approximately 8 sones for the five sensorineural hearing losses for a 65 dB SPL ILTASS input. This loudness was notably less than that perceived by a normal-hearing person for the same input signal, 18.6 sones. NAL-NL2 and DSL m[i/o] also provided comparable predicted speech intelligibility in quiet and noise. CAMEQ2-HF provided a greater average loudness, similar to NAL-NL1, with more high-frequency bandwidth but no significant improvement to predicted speech intelligibility. Definite variation in prescribed insertion gain was present among the prescriptive methods. These differences when averaged across the hearing losses were, by and large, negligible with regard to predicted speech intelligibility at normal conversational speech levels. With regard to loudness, DSL m[i/o] and NAL-NL2 provided the least overall loudness, followed by CAMEQ2-HF and NAL-NL1 providing the most loudness. CAMEQ2-HF provided the most audibility at high frequencies; even so, the audibility became less effective for improving speech intelligibility as hearing loss severity increased.	t	\N
22013241	Interactions between auditory and somatosensory information are relevant to the neural processing of speech since speech processes and certainly speech production involves both auditory information and inputs that arise from the muscles and tissues of the vocal tract. We previously demonstrated that somatosensory inputs associated with facial skin deformation alter the perceptual processing of speech sounds. We show here that the reverse is also true, that speech sounds alter the perception of facial somatosensory inputs. As a somatosensory task, we used a robotic device to create patterns of facial skin deformation that would normally accompany speech production. We found that the perception of the facial skin deformation was altered by speech sounds in a manner that reflects the way in which auditory and somatosensory effects are linked in speech production. The modulation of orofacial somatosensory processing by auditory inputs was specific to speech and likewise to facial skin deformation. Somatosensory judgments were not affected when the skin deformation was delivered to the forearm or palm or when the facial skin deformation accompanied nonspeech sounds. The perceptual modulation that we observed in conjunction with speech sounds shows that speech sounds specifically affect neural processing in the facial somatosensory system and suggest the involvement of the somatosensory system in both the production and perceptual processing of speech.	t	\N
22023487	The aim of this study was to assess differences between real ear insertion gains (REIG) measured with the modified pressure concurrent equalization (MPCE) and modified pressure stored equalization (MPSE) methods for open fittings in a typical audiology patient population. REIGs were compared for the two methods using a warble tone sweep at 65 dB SPL. The differences between the two methods at 0.25, 0.5, 1, 2, 3, 4 and 6 kHz were recorded. Eighty-three ears of a consecutive sample of 48 candidates for open-fit hearing aids were included. The mean difference between MPSE and MPCE REIGs was less than 1 dB at all frequencies. Analysis of variance showed that the main effect of method was not significant, and there was no significant interaction between method and frequency. The results for the MPSE and MPCE methods did not differ significantly for the patients with mild-to-moderate hearing losses tested here, for whom REIGs were generally less than 20 dB. Further research is needed to identify the REIG values at which the differences between MPCE and MPSE methods become clinically significant.	t	\N
22030970	Frontonasal dysplasia (FND) is a rare malformative complex affecting the frontal portion of the face, the eyes and the nose; it may occur singly or associated with other clinical signs. No systematic studies describing hearing in this condition were found. To evaluate hearing sensitivity and sound stimulus conduction from cochlea to brainstem in patients with clinical signs of FND. 21 patients with isolated or syndromic FND were submitted to a clinical (otological/vestibular antecedents and otoscopy) and instrumental (pure tone and speech audiometry, tympanometry and brainstem auditory evoked response) hearing evaluation. A clinical, cross-sectional observational prospective study. Hearing thresholds were normal in 15 (70%) patients, abnormal in 5 (25%), mostly with conductive hearing loss; one patient did not cooperate with testing. The tympanometric curve was type A in 30 (72%) ears, type C in 5 (12%), type As in 4 (9%) and type B in 3 (7%). The auditory brainstem response (ABR) showed no abnormalities. Patients with FND showed no abnormalities in the auditory system from cochlea to brainstem in this study. Mild conductive hearing loss found in some is probably related to cleft palate. Further evaluation of hearing pathways at higher levels is recommended.	t	\N
22030972	Accurate information about type, degree, and configuration of hearing loss are necessary for successful audiological early interventions. Auditory brainstem response with tone burst stimuli (TB ABR) and auditory steady-state response (ASSR) exams provide this information. To analyze the clinical applicability of TB ABR and ASSR at 2 kHz in infants, comparing responses in full-term and premature neonates. The study was cross-sectional, clinical and experimental. Subjects consisted of 17 premature infants and 19 full-term infants. TB ABR and ASSR exams at 2000 Hz were done during natural sleep. The electrophysiological minimum response obtained with TB ABR was 32.4 dBnHL (52.4 dBSPL); the ASSR minimum was 13.8 dBHL (26.4 dBSPL). The exams required 21.1 min and 22 min, respectively. Premature and full-term infant responses showed no statistically significant differences, except for auditory steady-state response duration. Both exams have clinical applicability at 2 kHz in infants, with 20 min of duration, on average. In general, there are no differences between premature and full-term individuals.	t	\N
22033983	We are constantly exposed to our own face and voice, and we identify our own faces and voices as familiar. However, the influence of self-identity upon self-speech perception is still uncertain. Speech perception is a synthesis of both auditory and visual inputs; although we hear our own voice when we speak, we rarely see the dynamic movements of our own face. If visual speech and identity are processed independently, no processing advantage would obtain in viewing one's own highly familiar face. In the present experiment, the relative contributions of facial and vocal inputs to speech perception were evaluated with an audiovisual illusion. Our results indicate that auditory self-speech conveys a processing advantage, whereas visual self-speech does not. The data thereby support a model of visual speech as dynamic movement processed separately from speaker recognition.	t	\N
22035565	Frequency modulation (FM) is an important building block of communication signals for animals and human. Attempts to predict the response of central neurons to FM sounds have not been very successful, though achieving successful results could bring insights regarding the underlying neural mechanisms. Here we proposed a new method to predict responses of FM-sensitive neurons in the auditory midbrain. First we recorded single unit responses in anesthetized rats using a random FM tone to construct their spectro-temporal receptive fields (STRFs). Training of neurons in the artificial neural network to respond to a second random FM tone was based on the temporal information derived from the STRF. Specifically, the time window covered by the presumed trigger feature and its delay time to spike occurrence were used to train a finite impulse response neural network (FIRNN) to respond to this random FM. Finally we tested the model performance in predicting the response to another similar FM stimuli (third random FM tone). We found good performance in predicting the time of responses if not also the response magnitudes. Furthermore, the weighting function of the FIRNN showed temporal 'bumps' suggesting temporal integration of synaptic inputs from different frequency laminae. This article is part of a Special Issue entitled: Neural Coding.	t	\N
22037719	The aim of this study was to investigate audio-vestibular function in patients with essential tremor. Twenty-three patients with essential tremor (46 ears) and 21 health control subjects (42 ears) were included in the present study. Patients and comparison subjects were matched for age and gender. All patient and control subjects underwent pure tone audiometric test, tympanogram, transient-evoked otoacoustic emissions and auditory brainstem response. Vestibular system was evaluated by bitermal caloric test. Comparison of variables between the groups was performed. Investigation of the relationship between parameters about ET disease and hearing levels were also studied. Pure tone thresholds significantly differed between patients and controls in 250 and 500 Hz frequencies (p < 0.05). There was no statistically significant difference in 1,000, 2,000, 4,000, and 6,000 Hz frequencies in essential tremor patients in comparison to the control subjects. A correlation between tremor severity and audiometric scores in low frequencies was not found. In addition, statistical analysis did not demonstrate a correlation between audiometric scores and tremor duration. The otoacoustic emission responses were found significantly different in patient and control group. The latencies of waves I, V and I-V inter-peak latencies on the ABR were not different between the groups. Our findings indicated that, abnormalities are due to cochlea rather than the retro cochlear pathology which is responsible for hearing loss associated with essential tremor.	t	\N
22056210	When speaking or producing music, people rely in part on auditory feedback - the sounds associated with the performed action. Three experiments investigated the degree to which alterations of auditory feedback (AAF) during music performances influence the experience of agency (i.e., the sense that your actions led to auditory events) and the possible link between agency and the disruptive effect of AAF on production. Participants performed short novel melodies from memory on a keyboard. Auditory feedback during performances was manipulated with respect to its pitch contents and/or its synchrony with actions. Participants rated their experience of agency after each trial. In all experiments, AAF reduced judgments of agency across conditions. Performance was most disrupted (measured by error rates and slowing) when AAF led to an ambiguous experience of agency, suggesting that there may be some causal relationship between agency and disruption. However, analyses revealed that these two effects were probably independent. A control experiment verified that performers can make veridical judgments of agency.	t	\N
22057984	Binaural hearing in cochlear implant (CI) users can be achieved either by bilateral implantation or bimodally with a contralateral hearing aid (HA). Binaural-bimodal hearing has the advantage of complementing the high-frequency electric information from the CI by low-frequency acoustic information from the HA. We examined the contribution of a contralateral HA in 25 adult implantees to their perception of fundamental frequency-cued speech characteristics (initial consonant voicing, intonation, and emotions). Testing with CI alone, HA alone, and bimodal hearing showed that all three characteristics were best perceived under the bimodal condition. Significant differences were recorded between bimodal and HA conditions in the initial voicing test, between bimodal and CI conditions in the intonation test, and between both bimodal and CI conditions and between bimodal and HA conditions in the emotion-in-speech test. These findings confirmed that such binaural-bimodal hearing enhances perception of these speech characteristics and suggest that implantees with residual hearing in the contralateral ear may benefit from a HA in that ear.	t	\N
22067074	The irrelevant sound effect (ISE) describes reduced verbal short-term memory during irrelevant changing-state sounds which consist of different and distinct auditory tokens. Steady-state sounds lack such changing-state features and do not impair performance. An EEG experiment (N=16) explored the distinguishing neurophysiological aspects of detrimental changing-state speech (3-token sequence) compared to ineffective steady-state speech (1-token sequence) on serial recall performance. We analyzed evoked and induced activity related to the memory items as well as spectral activity during the retention phase. The main finding is that the behavioral sound effect was exclusively reflected by attenuated token-induced gamma activation most pronounced between 50-60 Hz and 50-100 ms post-stimulus onset. Changing-state speech seems to disrupt a behaviorally relevant ongoing process during target presentation (e.g., the serial binding of the items).	t	\N
22073929	The majority of the patients with unilateral auditory neuropathy spectrum disorder (UANSD) were pediatric and mostly showed a great degree of hearing loss when diagnosed. Abnormal auditory brainstem response (ABR) and preserved otoacoustic emissions (OAEs) and/or cochlear microphonics (CM) were important features to differentiate it from common sensorineural deafness and central nerve hearing loss. To identify the clinical characteristics of patients with UANSD. This was a retrospective study involving 14 patients diagnosed as having UANSD between 2004 and 2010 in the Chinese PLA Hospital. In all, 50% of the cases were males (1:1 sex ratio) and the average age of onset was 4.1 years. Of the 14 affected ears with UANSD in these cases, 6 were left-sided, while 8 were right-sided. Of the 14 contralateral ears, 4 presented with sensorineural hearing loss, while the other 10 showed normal hearing. The degree of hearing loss in the 14 affected ears varied, including moderate in 1, moderately severe in 4, severe in 5, and profound in 4. ABRs were absent in the 14 affected ears, while the OAEs, and/or CM were present.	t	\N
22074449	Stapedotomy is, in our opinion, the technique of choice in stapes surgery. The precision of this technique allows clinicians to perform the surgical procedure in day surgery under local anesthesia. There is a strong emphasis on increasing the number of elective day surgery cases, especially in the patients' best interest, as it decreases the likelihood of late cancellation and hospital-acquired morbidity. A prospective study was performed to determine whether stapes surgery for otosclerosis could be performed safely in an outpatient setting. We present a series of stapes surgery cases for otosclerosis performed on a day-case basis. We performed a classic stapedotomy in 9 patients, a reverse classic step stapedotomy in 2 patients, a partial reverse classic step stapedotomy in 11 patients, and a hemi-stapedectomy in two patients. Three of 24 patients (12.5%) treated with classic stapedotomy, 1 patient with partial reverse classic step stapedotomy, and 1 patient with hemi-stapedectomy were formally admitted to the hospital after surgery (length of stay, 23 h). The indications were vertigo (two patients) and asthenia (one patient). These patients were treated under general anesthesia. Two of these patients resided more than 250 km away from the hospital.	t	\N
22082232	The present study investigated the lexical representations underlying the production of English schwa words. Two types of schwa words were compared: words with a schwa in poststress position (e.g., mackerel), whose schwa and reduced variants differ in a categorical way, and words with a schwa in prestress position (e.g., salami), whose variants differ in a noncategorical way. Participants named pseudohomophones and matched pseudowords corresponding to schwa and reduced variants of these words. Results revealed an advantage for pseudohomophones over matched pseudowords for both variants of poststress schwa words but only for schwa variants of prestress schwa words. As the pseudohomophone advantage is assumed to reflect the activation of a phonologically matching stored phonological representation, these results suggest that both variants of poststress schwa words are lexically represented while only schwa variants of prestress schwa words are. This result extends the proposal that words with two categorically distinct variants are stored in the production lexicon with 2 representations to another language and demonstrates that this 2-lexeme account does not generalize to pronunciation variants differing from one another in a noncategorical fashion. This finding challenges one of the widely shared assumption of generative models of word production: that content words have only 1 phonological representation. On the other hand, it provides further evidence in favor of another fundamental assumption of these models: that lexical representations are abstract sets of segments rather than fully detailed exemplars.	t	\N
22087888	To examine the role of perceived gender on fricative identification, a study was conducted in which listeners identified /s/-/∫/ and /s/-/θ/ continua combined with vowels produced by a man and a woman. These were acoustically modified to be consistent with different-sized vocal tracts (VT), and were presented with pictures of men or women. Listeners identified more tokens of /s/ in the /s/-/∫/ and more tokens of /θ/ in the /s/-/θ/ continuum when these sounds were combined with men's vowels, with vowels consistent with a 17 cm VT, and with pictures of men. Results support the hypothesis that listeners incorporate information about talker gender during fricative perception.	t	\N
22087912	Cochlear hearing loss is often associated with a loss of basilar-membrane (BM) compression, which in turn may contribute to degraded processing of suprathreshold stimuli. Behavioral estimates of compression may therefore be useful as long as they are valid over a wide range of levels and frequencies. Additivity of forward masking (AFM) may provide such a measure, but research to date lacks normative data from normal-hearing (NH) listeners at high sound levels, which is necessary to evaluate data from hearing-impaired (HI) listeners. The present study measured AFM in six NH listeners for signal frequencies of 500, 1500, and 4000 Hz in the presence of background noise, designed to elevate signal thresholds to levels similar to those experienced by HI listeners. Results consistent with compressive BM responses were found for all six listeners at 500 Hz, five listeners at 1500 Hz, but only two listeners at 4000 Hz. Further measurements in the absence of background noise also indicated a lack of consistent compression at 4000 Hz at higher signal levels, in contrast to earlier results collected at lower levels. A better understanding of this issue will be required before AFM can be used as a general behavioral estimate of BM compression.	t	\N
22087920	The factors influencing the stream segregation of discrete tones and the perceived continuity of discrete tones as continuing through an interrupting masker are well understood as separate phenomena. Two experiments tested whether perceived continuity can influence the build-up of stream segregation by manipulating the perception of continuity during an induction sequence and measuring streaming in a subsequent test sequence comprising three triplets of low and high frequency tones (LHL-[ellipsis (horizontal)]). For experiment 1, a 1.2-s standard induction sequence comprising six 100-ms L-tones strongly promoted segregation, whereas a single extended L-inducer (1.1 s plus 100-ms silence) did not. Segregation was similar to that following the single extended inducer when perceived continuity was evoked by inserting noise bursts between the individual tones. Reported segregation increased when the noise level was reduced such that perceived continuity no longer occurred. Experiment 2 presented a 1.3-s continuous inducer created by bridging the 100-ms silence between an extended L-inducer and the first test-sequence tone. This configuration strongly promoted segregation. Segregation was also increased by filling the silence after the extended inducer with noise, such that it was perceived like a bridging inducer. Like physical continuity, perceived continuity can promote or reduce test-sequence streaming, depending on stimulus context.	t	\N
22087922	This work was aimed at determining whether binaural interference occurs in electric hearing, and if so, whether it occurs as a consequence of perceptual grouping (central explanation) or if it is related to the spread of excitation in the cochlea (peripheral explanation). Six bilateral cochlear-implant listeners completed a series of experiments in which they judged the lateral position of a target pulse train, lateralized via interaural time or level differences, in the presence of an interfering diotic pulse train. The target and interferer were presented at widely separated electrode pairs (one basal and one apical). The results are broadly similar to those reported for acoustic hearing. All listeners but one showed significant binaural interference in at least one of the stimulus conditions. In all cases of interference, a robust recovery was observed when the interferer was presented as part of an ongoing stream of identical pulse trains, suggesting that the interference was at least partly centrally mediated. Overall, the results suggest that both simultaneous and sequential grouping mechanisms operate in electric hearing, at least for stimuli with a wide tonotopic separation.	t	\N
22087923	The present study examined the effect of combined spectral and temporal enhancement on speech recognition by cochlear-implant (CI) users in quiet and in noise. The spectral enhancement was achieved by expanding the short-term Fourier amplitudes in the input signal. Additionally, a variation of the Transient Emphasis Spectral Maxima (TESM) strategy was applied to enhance the short-duration consonant cues that are otherwise suppressed when processed with spectral expansion. Nine CI users were tested on phoneme recognition tasks and ten CI users were tested on sentence recognition tasks both in quiet and in steady, speech-spectrum-shaped noise. Vowel and consonant recognition in noise were significantly improved with spectral expansion combined with TESM. Sentence recognition improved with both spectral expansion and spectral expansion combined with TESM. The amount of improvement varied with individual CI users. Overall the present results suggest that customized processing is needed to optimize performance according to not only individual users but also listening conditions.	t	\N
22093438	In auditory-visual synaesthesia, all kinds of sound can induce additional visual experiences. To identify the brain regions mainly involved in this form of synaesthesia, functional magnetic resonance imaging (fMRI) has been used during non-linguistic sound perception (chords and pure tones) in synaesthetes and non-synaesthetes. Synaesthetes showed increased activation in the left inferior parietal cortex (IPC), an area involved in multimodal integration, feature binding and attention guidance. No significant group-differences could be detected in area V4, which is known to be related to colour vision and form processing. The results support the idea of the parietal cortex acting as sensory nexus area in auditory-visual synaesthesia, and as a common neural correlate for different types of synaesthesia.	t	\N
22095215	The behavioral and neurofunctional consequences of blindness often include performance enhancements and recruitment of occipital regions for nonvisual tasks. How the neuroanatomical changes resulting from this sensory loss relate to these functional changes is, however, less clear. Previous studies using cortical thickness (CT) measures have shown thicker occipital cortex in early-blind (EB) individuals compared with sighted controls. We hypothesized that this finding reflects the crossmodal plasticity often observed in blind individuals and thus could reflect behavioral adaptations. To address this issue, CT measures in blind (early and late) and sighted subjects were obtained along with several auditory behavioral measures in an attempt to relate behavioral and neuroanatomical changes. Group contrasts confirmed previous results in showing thicker occipital cortex in the EB. Regression analyses between CT measures across the whole brain of all blind individuals with the behavioral scores from 2 tasks in which EB subjects were superior (pitch and melody discrimination) showed that CT of occipital areas was directly related to behavioral enhancements. These findings constitute a compelling demonstration that anatomical changes in occipital areas are directly related to heightened behavioral abilities in the blind and hence support the idea that these anatomical features reflect adaptive compensatory plasticity.	t	\N
22097102	Daily life in today's society is filled with moments of high state anxiety. State anxiety is the amount of anxiety one is experiencing in the present moment; it is not a personality trait. Many people need simple, cost-effective ways to self-regulate themselves so they can sleep, be more productive, and attend to their activities of daily life. Therefore, the present study questioned whether listening to a steady beat will decrease feelings of state anxiety in healthy subjects. Participants (N=36) between the ages of 20 and 50 volunteered for the study. During this study, experimental anxiety was induced and the Visual Analog Scale served as the measurement tool for both tests. Subjects in the control group sat in silence, while subjects in the experimental group listened to a steady beat of 66 beats per minute. The results of an independent-samples t test indicated significant differences between the groups on the posttest measure, t (34)=2.41, p=.02. Subjects who listened to the steady beat reported less anxiety than subjects who sat in silence. This study suggests that steady beat alone can reduce state anxiety, thus providing a cost-effective and accessible means for self-regulation in the midst of high state anxiety.	t	\N
22101495	The purpose of the present study was to establish whether the validity effect produced by masked eye gaze cues should be attributed to strictly reflexive mechanisms or to volitional top-down mechanisms. While we find that masked eye gaze cues are effective in producing a validity effect in a central cueing paradigm, we also find that the efficacy of masked gaze cues is sharply constrained by the experimental context. Specifically, masked gaze cues only produced a validity effect when they appeared in the context of unmasked and predictive gaze cues. Unmasked gaze cues, in contrast, produced reliable validity effects across a range of experimental contexts, including Experiment 4 where 80% of the cues were invalid (counter-predictive). Taken together, these results suggest that the effective processing of masked gaze cues requires volitional control, whereas the processing of unmasked (clearly visible) gaze cues appears to benefit from both reflexive and top-down mechanisms.	t	\N
22102362	The neural events that lead to successful or failed detection of suprathreshold sounds are not well established. In this experiment, event-related potentials (ERPs) and functional magnetic resonance imaging (fMRI) were recorded while participants performed two tasks: a primary difficult duration judgment task on a sequence of tones presented to one ear, and a secondary target detection task on an auditory oddball stream presented to the other ear. The paradigm was designed to elicit competition and variability in detection of auditory targets despite identical input. Successful detection of auditory targets was associated mainly with greater fMRI activity in superior parietal cortex and thalamus. In the ERPs, successful detection was linked with a larger fronto-central negativity at 200-400 ms, and a later centro-posterior positivity. Failure to detect targets was associated with greater fMRI signal in the default mode network, a significantly smaller electrical fronto-central negativity and no late positivity. These findings demonstrate that variability in auditory detection is related to modulation of activity in multimodal parietal and frontal networks active ∼ 200 ms after target onset. Results are consistent with a limited capacity and late selection view of attention.	t	\N
22107896	Auditory perception and learning take place during the third trimester of gestation. Fetuses and newborns who lack typical auditory experience can go on to develop typical socioemotional attachment and language, given a supportive environment. For hospitalized preterm infants in developmentally sensitive neonatal intensive care units, detrimental effects of deviant early auditory experience may be remediated by later experience, but much is unknown about the causes of language deficits of prematurity. Prenatal auditory stimulation programs that incorporate audio speakers against the maternal belly should be discouraged because of possible overstimulation effects on the developing auditory system and sleep/wake state organization.	t	\N
22108539	Tracing the temporal structure of acoustic events is crucial in order to efficiently adapt to dynamic changes in the environment. In turn, regularity in temporal structure may facilitate tracing of the acoustic signal and its likely spatial source. However, temporal processing in audition extends beyond a domain-general facilitatory function. Temporal regularity and temporal order of auditory events correspond to contextually extracted, statistically sampled relations among sounds. These relations are the backbone of prediction in audition, determining both when an event is likely to occur (temporal structure) and also what type of event can be expected at a specific point in time (formal structure, e.g. spectral information). Here, we develop a model of temporal processing in audition and speech that involves a division of labor between the cerebellum and the basal ganglia in tracing acoustic events in time. As for the cerebellum and its associated thalamo-cortical connections, we refer to its role in the automatic encoding of event-based temporal structure with high temporal precision, while the basal ganglia-thalamo-cortical system engages in the attention-dependent evaluation of longer-range intervals. Recent electrophysiological and neurofunctional evidence suggests that neocortical processing of spectral structure relies on concurrent extraction of event-based temporal information. We propose that spectrotemporal predictive processes may be facilitated by subcortical coding of relevant changes in sound energy as temporal event markers.	t	\N
22111542	The objectives were: (1) to examine the effects of a directional microphone with different directivity patterns and different microphone combinations on wind noise levels at the hearing aid output; and (2) to derive strategies appropriate for hearing aid selection and future designs. The in-situ frequency responses of a behind-the-ear hearing aid (BTE1) were matched when the hearing aid was programmed to dipole, hypercardioid, cardioids, or adaptive microphone mode. The in-situ frequency responses of another hearing aid (BTE2) were matched among an omnidirectional microphone (OMNI), an adaptive directional microphone (ADM), and a combination of an omnidirectional microphone at low frequencies and an adaptive directional microphone at high frequencies (MIXED). Flow noise was recorded at flow velocities of 0, 4.5, 9.0, and 13.5 m/s. Measurements were repeated for the hypercardioid pattern of BTE1. Flow noise recorded using directional microphones with four directivity patterns and using OMNI, ADM, and MIXED. Directional microphones with different directivity patterns generated similar flow noise levels. ADM yielded higher overall levels than OMNI and MIXED, which had similar overall levels. The adaptive directional microphone is the most versatile microphone for use in wind. The mixed microphone mode is a viable wind noise reduction option.	t	\N
22115685	Cochlear implants have become a viable treatment option for individuals who present with severe to profound hearing loss. While there are several parameters that affect the successful use of this technology, quality programming of the cochlear implant system is crucial. This review chapter focuses on general device programming techniques, programming techniques specific to children, objective programming techniques, a brief overview of programming parameters of the currently commercially available multichannel systems, and managing patient complaints and device failures. The chapter also provides what the authors believe the future may hold for new programming techniques.	t	\N
22119935	The loudness dependence of the auditory evoked potential (LDAEP) is considered a noninvasive in vivo marker of central serotonergic functioning in humans. Nevertheless, results of genetic association studies point towards a modulation of this biomarker by dopaminergic neurotransmission. We examined the effect of dopaminergic modulation on the LDAEP using L-3,4-dihydroxyphenylalanine (levodopa)/benserazide (Madopar®) as a challenge agent in healthy volunteers. A double-blind placebo-controlled challenge design was chosen. Forty-two healthy participants (21 females and 21 males) underwent two LDAEP measurements, following a baseline LDAEP measurement either placebo or levodopa (levodopa 200 mg/benserazide 50 mg) were given orally. Changes in the amplitude and dipole source activity of the N1/P2 intensities (60, 70, 80, 90, and 100 dB) were analyzed. The participants of neither the levodopa nor the placebo group showed any significant LDAEP alterations compared to the baseline measurement. The test-retest reliability (Cronbachs Alpha) between baseline and intervention was 0.966 in the verum group and 0.759 in the placebo group, respectively. The administration of levodopa showed no effect on the LDAEP. These findings are in line with other trials using dopamine receptor agonists.	t	\N
22122114	In this article we report on listener categorization of meaningful environmental sounds. A starting point for this study was the phenomenological taxonomy proposed by Gaver (1993b). In the first experimental study, 15 participants classified 60 environmental sounds and indicated the properties shared by the sounds in each class. In a second experimental study, 30 participants classified and described 56 sounds exclusively made by solid objects. The participants were required to concentrate on the actions causing the sounds independent of the sound source. The classifications were analyzed with a specific hierarchical cluster technique that accounted for possible cross-classifications, and the verbalizations were submitted to statistical lexical analyses. The results of the first study highlighted 4 main categories of sounds: solids, liquids, gases, and machines. The results of the second study indicated a distinction between discrete interactions (e.g., impacts) and continuous interactions (e.g., tearing) and suggested that actions and objects were not independent organizational principles. We propose a general structure of environmental sound categorization based on the sounds' temporal patterning, which has practical implications for the automatic classification of environmental sounds.	t	\N
22122401	To assess the potential risk of hearing loss to young listeners, due to the use of personal listening devices (PLDs). The study included two parts: (1) A self-report questionnaire on music listening habits, and (2) Physical measurements of preferred listening levels, in quiet and in everyday background noise. Young teenagers aged 13 to 17 years. Part 1 included 289 participants with mean age of 14 years. Part 2 included 11 and 74 participants (2A and 2B) with a mean age of 15 years. Eleven listened to PLDs in quiet conditions (2A) and 74 in everyday background noise (2B). Questionnaire main findings indicated that most of the participants reported high or very high volume settings and demonstrated low awareness towards loud music listening consequences. Physical measurements corrected for diffuse field indicated mean preferred listening levels of: 82 (SD = 9) dBA in quiet, and 89 (SD = 9) dBA in the presence of background noise. The potential risk to hearing of PLDs users was calculated using the 8 hour equivalent level. More than 25% of the participants in the noisy condition were found to be at risk according to occupational damage risk criteria NIOSH, 1998.	t	\N
22123457	Evidence for cortical sensory activation in the human fetus at the beginning of the third trimester of pregnancy was provided in a recent imaging study. Although hearing is functional before birth, it is not clear whether recognition of the mother's voice is learned in utero or rapidly following delivery. We developed an original fMRI procedure that allows for the specific exploration of fetal brain response to auditory stimuli. This procedure provides the first in vivo evidence for the development of maternal voice recognition in utero between 33 and 34 weeks of gestation. This methodology could have crucial implications in the study of fetal cognition.	t	\N
22133495	A role for the cerebellum in cognition has been proposed based on studies suggesting a profile of cognitive deficits due to cerebellar stroke. Such studies are limited in the determination of the detailed organisation of cerebellar subregions that are critical for different aspects of cognition. In this study we examined the correlation between cognitive performance and cerebellar integrity in a specific degeneration of the cerebellar cortex: Spinocerebellar Ataxia type 6 (SCA6). The results demonstrate a critical relationship between verbal working memory and grey matter density in superior (bilateral lobules VI and crus I of lobule VII) and inferior (bilateral lobules VIIIa and VIIIb, and right lobule IX) parts of the cerebellum. We demonstrate that distinct cerebellar regions subserve different components of the prevalent psychological model for verbal working memory based on a phonological loop. The work confirms the involvement of the cerebellum in verbal working memory and defines specific subsystems for this within the cerebellum.	t	\N
22137677	We assessed the relationship between brain structure and function in 10 individuals with specific language impairment (SLI), compared to six unaffected siblings, and 16 unrelated control participants with typical language. Voxel-based morphometry indicated that grey matter in the SLI group, relative to controls, was increased in the left inferior frontal cortex and decreased in the right caudate nucleus and superior temporal cortex bilaterally. The unaffected siblings also showed reduced grey matter in the caudate nucleus relative to controls. In an auditory covert naming task, the SLI group showed reduced activation in the left inferior frontal cortex, right putamen, and in the superior temporal cortex bilaterally. Despite spatially coincident structural and functional abnormalities in frontal and temporal areas, the relationships between structure and function in these regions were different. These findings suggest multiple structural and functional abnormalities in SLI that are differently associated with receptive and expressive language processing.	t	\N
22143297	To demonstrate whether there are differences in the outcomes of children with cochlear implants using the techniques between mastoidectomy with posterior tympanotomy (MPTA) and suprameatal approach (SMA). A total of 44 congenitally deaf children who underwent cochlear implantation before the age of 5 years at the Sixth Hospital Affiliated to Shanghai Jiao tong University from January 2005 to March 2008 were included in this study. Children with severe mental retardation or with cochlear malformations were excluded. Each group had 22 patients; however, 4 patients in the SMA group and 5 in the MPTA group were not included in this study because the assessments for these patients were not available. All children were assessed before surgery and 6, 12, and 24 months after surgery with Categories of Auditory Performance (CAP) and Speech Intelligibility Rating (SIR). Each assessment was given by the child's speech therapist in the familiar environment of the child's home or school. At each interval, the CAP and SIR scores of each group were compared. For the both groups, CAP and SIR scores increased with the time of implant usage during the follow-up period after implantation. However, there were no significant differences in scores of CAP and SIR at the 4 time points between the SMA group and the MPTA group. Intelligible speech and auditory performance of children with cochlear implantation using the SMA were good. There were no differences between children implanted with the SMA technique versus the MPTA technique.	t	\N
22155324	Temporal congruency promotes perceptual binding of multisensory inputs. Here, we used EEG frequency-tagging to track cortical activities elicited by auditory and visual inputs separately, in the form of steady-state evoked potentials (SS-EPs). We tested whether SS-EPs could reveal a dynamic coupling of cortical activities related to the binding of auditory and visual inputs conveying synchronous vs. non-synchronous temporal periodicities, or beats. The temporally congruent audiovisual condition elicited markedly enhanced auditory and visual SS-EPs, as compared to the incongruent condition. Furthermore, an increased inter-trial phase coherence of both SS-EPs was observed in that condition. Taken together, these observations indicate that temporal congruency enhances the processing of multisensory inputs at sensory-specific stages of cortical processing, possibly through a dynamic binding by synchrony of the elicited activities and/or improved dynamic attending. Moreover, we show that EEG frequency-tagging with SS-EPs constitutes an effective tool to explore the neural dynamics of multisensory integration in the human brain.	t	\N
22163041	Insects often communicate by sound in mixed species choruses; like humans and many vertebrates in crowded social environments they thus have to solve cocktail-party-like problems in order to ensure successful communication with conspecifics. This is even more a problem in species-rich environments like tropical rainforests, where background noise levels of up to 60 dB SPL have been measured. Using neurophysiological methods we investigated the effect of natural background noise (masker) on signal detection thresholds in two tropical cricket species Paroecanthus podagrosus and Diatrypa sp., both in the laboratory and outdoors. We identified three 'bottom-up' mechanisms which contribute to an excellent neuronal representation of conspecific signals despite the masking background. First, the sharply tuned frequency selectivity of the receiver reduces the amount of masking energy around the species-specific calling song frequency. Laboratory experiments yielded an average signal-to-noise ratio (SNR) of -8 dB, when masker and signal were broadcast from the same side. Secondly, displacing the masker by 180° from the signal improved SNRs by further 6 to 9 dB, a phenomenon known as spatial release from masking. Surprisingly, experiments carried out directly in the nocturnal rainforest yielded SNRs of about -23 dB compared with those in the laboratory with the same masker, where SNRs reached only -14.5 and -16 dB in both species. Finally, a neuronal gain control mechanism enhances the contrast between the responses to signals and the masker, by inhibition of neuronal activity in interstimulus intervals. Thus, conventional speaker playbacks in the lab apparently do not properly reconstruct the masking noise situation in a spatially realistic manner, since under real world conditions multiple sound sources are spatially distributed in space. Our results also indicate that without knowledge of the receiver properties and the spatial release mechanisms the detrimental effect of noise may be strongly overestimated.	t	\N
22166679	To evaluate the effect of different lipid fractions on auditory brainstem evoked responses in hyperlipidaemia. We conducted a single institution (medical college), prospective, cross-sectional study of 25 hyperlipidaemic patients and 25 normolipidaemic controls, all with a normal hearing threshold on pure tone audiometry. Brainstem evoked response audiometry results were recorded in both groups. The hyperlipidaemic group were further divided into two subgroups, based on the serum value of each lipid fraction: those with less than and those with greater than the mean serum value. These two subgroups were further compared with the control group. The hyperlipidaemic and normolipidaemic groups had statistically significant differences for all audiometry waves apart from the wave I and the III-V interpeak latencies. The subgroups had a statistically significant difference in brainstem evoked responses. We found a statistically significant association between low-density lipoproteins and many waveforms in the hyperlipidaemic group. We found that low-density lipoproteins were significantly associated with many waveforms in hyperlipidaemic patients. Thus, low-density lipoproteins may be important in auditory dysfunction.	t	\N
22171054	Oscillatory entrainment mechanisms are invoked during attentional processing of rhythmically occurring stimuli, whereby their phase alignment regulates the excitability state of neurons coding for anticipated inputs. These mechanisms have been examined in the delta band (1-3 Hz), where entrainment frequency matches the stimulation rate. Here, we investigated entrainment for subdelta rhythmic stimulation, recording from intracranial electrodes over human auditory cortex during an intersensory audiovisual task. Audiovisual stimuli were presented at 0.67 Hz while participants detected targets within one sensory stream and ignored the other. It was found that entrainment operated at twice the stimulation rate (1.33 Hz), and this was reflected by higher amplitude values in the FFT spectrum, cyclic modulation of alpha-amplitude, and phase-amplitude coupling between delta phase and alpha power. In addition, we found that alpha-amplitude was relatively increased in auditory cortex coincident with to-be-ignored auditory stimuli during attention to vision. Thus, the data suggest that entrainment mechanisms operate within a delimited passband such that for subdelta task rhythms, oscillatory harmonics are invoked. The phase of these delta-entrained oscillations modulates alpha-band power. This may in turn increase or decrease responsiveness to relevant and irrelevant stimuli, respectively.	t	\N
22171970	Coupling of thalamocortical networks through synchronous oscillations at gamma frequencies (30-80 Hz) has been suggested as a mechanism for binding of auditory sensory information into an object representation, which then becomes accessible for perception and cognition. This study investigated whether contralateral noise interferes with this step of central auditory processing. Neuromagnetic 40-Hz oscillations were examined in young healthy participants while they listened to amplitude-modulated sound in one ear and a multi-talker masking noise in the contralateral ear. Participants were engaged in a gap-detection task, for which their behavioural performance declined under masking. The amplitude modulation of the stimulus elicited steady 40-Hz oscillations with sources in bilateral auditory cortices. Analysis of the temporal dynamics of phase synchrony between source activity and the stimulus revealed two oscillatory components; the first was indicated by an instant onset in phase synchrony with the stimulus while the second showed a 200-ms time constant of gradual increase in phase synchrony after phase resetting by the gap. Masking abolished only the second component. This coincided with masking-related decrease of the P2 wave of the transient auditory-evoked responses whereas the N1 wave, reflecting early sensory processing, was unaffected. Given that the P2 response has been associated with object representation, we propose that the first 40-Hz component is related to representation of low-level sensory input whereas the second is related to internal auditory processing in thalamocortical networks. The observed modulation of oscillatory activity is discussed as reflecting a neural mechanism critical for speech understanding in noise.	t	\N
22172546	Wernicke's aphasia is a condition which results in severely disrupted language comprehension following a lesion to the left temporo-parietal region. A phonological analysis deficit has traditionally been held to be at the root of the comprehension impairment in Wernicke's aphasia, a view consistent with current functional neuroimaging which finds areas in the superior temporal cortex responsive to phonological stimuli. However behavioural evidence to support the link between a phonological analysis deficit and auditory comprehension has not been yet shown. This study extends seminal work by Blumstein, Baker, and Goodglass (1977) to investigate the relationship between acoustic-phonological perception, measured through phonological discrimination, and auditory comprehension in a case series of Wernicke's aphasia participants. A novel adaptive phonological discrimination task was used to obtain reliable thresholds of the phonological perceptual distance required between nonwords before they could be discriminated. Wernicke's aphasia participants showed significantly elevated thresholds compared to age and hearing matched control participants. Acoustic-phonological thresholds correlated strongly with auditory comprehension abilities in Wernicke's aphasia. In contrast, nonverbal semantic skills showed no relationship with auditory comprehension. The results are evaluated in the context of recent neurobiological models of language and suggest that impaired acoustic-phonological perception underlies the comprehension impairment in Wernicke's aphasia and favour models of language which propose a leftward asymmetry in phonological analysis.	t	\N
22176307	The parents' evaluation of aural/oral performance of children (PEACH) scale was developed to assess the effectiveness of amplification for children, based on a systematic use of parents' observations of children's performance in real-world environments. The purpose of the present study was to adapt the PEACH scale into the Malay language, and to collect normative data on a group of children with normal hearing. The participants were parents of 74 children aged between 3 months and 13 years of age. Parents were requested to observe their children's auditory/oral behavior in everyday life and to record their observations in the PEACH booklet. High internal consistency (Cronbach's alpha = 0.93) and item-total correlation were found (0.52-0.85). Similar to the published norms for English-speaking children, near-perfect scores were achieved by Malaysian children around 40 months of age. The adapted version can be used to evaluate amplification for children in the Malay speaking environment. The normative curve relating age to scores for the Malay PEACH can be used as a reference against which functional aural/oral performance of hearing-impaired Malaysian children can be evaluated.	t	\N
22177319	To compare the audiologic outcome and feasibility of bone-anchored hearing aid (BAHA) and external auditory canal reconstruction (EACR) surgeries in pediatric patients presenting a congenital aural atresia (CAA). A retrospective chart review of 40 patients operated in our tertiary pediatric care center between 2002 and 2010 was performed. 20 patients underwent EACR, whereas another 20 patients were implanted with a BAHA device. Air conduction (AC), bone conduction (BC), pure tone average (PTA) and speech discrimination score (SDS) were compared preoperatively, and hearing gain (HG) postoperatively at 6 and at 12 months at frequencies of 500, 1000, 2000 and 4000 Hz. Operative time, complications and associated microtia were documented as well. EACR patients were graded retrospectively upon Jahrsdoerfer's classification. Preoperative AC were significantly different between groups, at 500, 1000 and 2000 Hz but not at 4000 Hz. BAHA group compared postoperatively to EACR group showed significantly a superior HG of 46.9 ± 7.0 dB (p<0.001) and of 39.8(7) ± 7.2(6.9)dB (p<0.001) at 6 months and at 1 year, respectively. Moreover, aided air thresholds from the EACR group revealed an audiologic status similar to those of the BAHA group patients, at 6 months and one year postoperatively. Both groups had a similar evolution of their BC, as well as of the incidence of complications. We report one case of transient facial paralysis in the EACR group. Total operative time is significantly lower (p<0.001) for a BAHA implantation (56 ± 21 min) than for EACR surgery (216 ± 174 min). No preoperative or postoperative correlation (Pearson correlation test; p>0.05) was found between patient's Jahrsdoerfer's score and their audiologic outcome. HG does not seem to be influenced by the presence of microtia. EACR, although constituting an attractive option, does not give acceptable results alone. It can however, when combined to conventional air conduction hearing aids, provide excellent audiologic outcomes comparable to BAHA. BAHA implantation is a reliable, safe and efficient therapeutic option that allows a significantly better audiologic outcome when compared to unaided EACR for patients with CAA.	t	\N
22178340	Determine the prevalence of 35delG mutation in GJB2 gene in patients with prelingual deafness of no defined etiology whose underwent cochlear implant in the Otolaryngology Department at the Hospital de Clínicas de Porto Alegre and compare the speech recognition index using an open-set of sentences according to the presence or absence of the 35delG mutation. Cross-sectional study nested in a cohort. Were analyzed 37 patients with indeterminate etiology for deafness that underwent to cochlear implant. DNA was extracted and the mutations were studied using Polymerase Chain Reaction followed by gene sequencing. The prevalence of 35delG mutation was 11%. The speech recognition index was 72% in the group with 35delG mutation, and 30% in the group without this mutation (p>0.05). Prevalence of 35delG mutation in this study confirmed findings in the Brazilian literature. There was a clinically significant difference in hearing performance in patients with 35delG. Absence of statistical significance in this result might be attributed to the small number of patients with 35delG in our sample.	t	\N
22178454	When speech is interrupted by noise, listeners often perceptually "fill-in" the degraded signal, giving an illusion of continuity and improving intelligibility. This phenomenon involves a neural process in which the auditory cortex (AC) response to onsets and offsets of acoustic interruptions is suppressed. Since meaningful visual cues behaviorally enhance this illusory filling-in, we hypothesized that during the illusion, lip movements congruent with acoustic speech should elicit a weaker AC response to interruptions relative to static (no movements) or incongruent visual speech. AC response to interruptions was measured as the power and inter-trial phase consistency of the auditory evoked theta band (4-8 Hz) activity of the electroencephalogram (EEG) and the N1 and P2 auditory evoked potentials (AEPs). A reduction in the N1 and P2 amplitudes and in theta phase-consistency reflected the perceptual illusion at the onset and/or offset of interruptions regardless of visual condition. These results suggest that the brain engages filling-in mechanisms throughout the interruption, which repairs degraded speech lasting up to ~250 ms following the onset of the degradation. Behaviorally, participants perceived speech continuity over longer interruptions for congruent compared to incongruent or static audiovisual streams. However, this specific behavioral profile was not mirrored in the neural markers of interest. We conclude that lip-reading enhances illusory perception of degraded speech not by altering the quality of the AC response, but by delaying it during degradations so that longer interruptions can be tolerated.	t	\N
22178980	When hearing thresholds are measured with high-frequency resolution there is a pseudo-periodic variation in thresholds across frequency of up to 15-20dB. This variation is called threshold fine structure (previously referred to as threshold microstructure). Consequently, estimates of auditory status based on threshold measures can depend greatly on the specific frequency evaluated. The impact of threshold fine structure on the prediction of auditory status was examined by measuring detection thresholds of pure tones (providing an indication of threshold fine structure) and comparing them with thresholds obtained using linear sweeps, sinusoidally frequency modulated tones, and narrow-band noise. Spontaneous otoacoustic emissions (SOAEs) were also obtained to confirm the established relationship between threshold fine structure and SOAEs. Thresholds obtained using linear sweeps and narrow-band noise provided stable threshold estimates indicating that such threshold estimates were less influenced by threshold fine structure. Consequently, thresholds obtained with these stimuli may provide estimates of cochlear status less dependent of the exact frequency being evaluated, permitting better prediction of performance on other psychoacoustic measures (such as cochlear tuning and loudness perception) and the properties of their more objective measures (such as otoacoustic emissions).	t	\N
22180019	The purpose of the study was to determine why perceived spatial separation provides a greater release from informational masking in Chinese than English when target sentences in each of the languages are masked by other talkers speaking the same language. Monolingual speakers of English and Mandarin Chinese listened to semantically anomalous sentences in their own language when 1 of 3 maskers was present (speech-spectrum noise, a 2-talker speech masker in the same language, and a 2-talker speech masker in the other language). Both groups benefitted equally from spatial separation when the maskers were speech-spectrum noise or cross-language. Chinese listeners benefitted less from spatial separation than did English listeners when a same-language masker was used. Performance was scored in terms of the number of target words correctly identified; because Chinese target words were composed of 2 "stand-alone" morphemes, the authors also scored Chinese target words as correct when either of the morphemes was correctly identified. When this was done, Chinese and English listeners benefitted equally from spatial separation in all conditions. These results support a model in which release from informational masking in both monolingual English and Chinese listeners occurs because spatial separation facilitates morpheme access in both languages.	t	\N
22183905	To evaluate the long-term results and predictive factors of a good outcome with the use of a total ossicular replacement prosthesis in children. Retrospective case review. Tertiary referral center. The study included 114 children (116 ears). A total of 116 ears underwent total ossicular chain reconstruction with a titanium prosthesis. Cartilage was always used for tympanic membrane reconstruction. Audiological results were evaluated according to the guidelines of the American Academy of Otolaryngology-Head and Neck Surgery. Predictive factors of audiological results were determined. Logistic regression and χ(2) tests were used for statistical analysis. The mean age at surgery was 9.8 years. Ossiculoplasty was performed during second-look surgery in 91 ears (78.4%) and during another stage in 25 ears (21.6%). The first-stage procedure was always performed for cholesteatoma. Audiometric results were available for 116 ears at 1 year, for 89 ears (76.7%) at 2 years, and for 42 ears (36.2%) at 5 years. Closure of the average air-bone gap (ABG) to within 20 dB was achieved in 65 ears (56%) at 1 year. The mean (SD) preoperative and postoperative (at 1 year) ABGs were 41.0 (9.5) dB and 22.4 (12.6) dB, respectively. There were no cases of extrusion, but 17 luxations of the prosthesis were confirmed by computed tomography. Luxation occurred on average at 31.4 months. Only three 4000-Hz degradations of bone conduction were reported, with no dead ears. We examined 3 predictive factors of auditory results: preoperative ABG, footplate status, and postoperative otoscopic findings. Total ossiculoplasty is a reliable technique in children. Long-term hearing outcomes are stable and satisfactory, but luxation can occur at any time. Preoperative ABG and footplate status are negative predictive factors of auditory results.	t	\N
22196341	It has recently been conjectured that dyslexia arises from abnormal auditory sampling. What sampling rate is altered and how it affects reading remains unclear. We hypothesized that by impairing phonemic parsing abnormal low-gamma sampling could yield phonemic representations of unusual format and disrupt phonological processing and verbal memory. Using magnetoencephalography and behavioral tests, we show in dyslexic subjects a reduced left-hemisphere bias for phonemic processing, reflected in less entrainment to ≈30 Hz acoustic modulations in left auditory cortex. This deficit correlates with measures of phonological processing and rapid naming. We further observed enhanced cortical entrainment at rates beyond 40 Hz in dyslexics and show that this particularity is associated with a verbal memory deficit. These data suggest that a single auditory anomaly, i.e., phonemic oversampling in left auditory cortex, accounts for three main facets of the linguistic deficit in dyslexia.	t	\N
22196745	Prosody can be expressed not only by modification to the timing, stress and intonation of auditory speech but also by modifying visual speech. Studies have shown that the production of visual cues to prosody is highly variable (both within and across speakers), however behavioural studies have shown that perceivers can effectively use such visual cues. The latter result suggests that people are sensitive to the type of prosody expressed despite cue variability. The current study investigated the extent to which perceivers can match visual cues to prosody from different speakers and from different face regions. Participants were presented two pairs of sentences (consisting of the same segmental content) and were required to decide which pair had the same prosody. Experiment 1 tested visual and auditory cues from the same speaker and Experiment 2 from different speakers. Experiment 3 used visual cues from the upper and the lower face of the same talker and Experiment 4 from different speakers. The results showed that perceivers could accurately match prosody even when signals were produced by different speakers. Furthermore, perceivers were able to match the prosodic cues both within and across modalities regardless of the face area presented. This ability to match prosody from very different visual cues suggests that perceivers cope with variation in the production of visual prosody by flexibly mapping specific tokens to abstract prosodic types.	t	\N
22199181	In perception studies, it is common to use vowel stimuli from standardized recordings or synthetic stimuli created using values from well-known published research. Although the use of standardized stimuli is convenient, unconsidered dialect and regional accent differences may introduce confounding effects. The goal of this study was to examine the effect of regional accent variation on vowel identification. The authors analyzed formant values of 8 monophthong vowels produced by 12 talkers from the region where the research took place and compared them with standardized vowels. Fifteen listeners with normal hearing identified synthesized vowels presented in varying levels of noise and at varying spectral distances from the local-dialect values. Acoustically, local vowels differed from standardized vowels, and distance varied across vowels. Perceptually, there was a robust effect of accent similarity such that identification was reduced for vowels at greater distances from local values. Researchers and clinicians should take care in choosing stimuli for perception experiments. It is recommended that regionally validated vowels be used instead of relying on standardized vowels in vowel perception tasks.	t	\N
22199197	The current study measured, objectively and subjectively, how changes in speech rate affect recognition of English passages in bilingual listeners. Ten native monolingual, 20 English-dominant bilingual, and 20 non-English-dominant bilingual listeners repeated target words in English passages at five speech rates (unprocessed, two expanded, and two compressed), in quiet and in noise. For noise conditions, performance was measured at a signal-to-noise ratio that was determined through an adaptive procedure to avoid ceiling and floor effects. Listeners also made subjective judgments of speech rate, speech clarity, and performance confidence. In noise, stepwise improvement was observed as rate slowed down. A similar effect was not found in quiet. This pattern in performance was largely comparable across listener groups but was most robust in English-dominant listeners. Changes in speech rate and presence of noise significantly affected listeners' subjective ratings; however, no intergroup differences were observed for any of the subjective ratings. Bilingual listeners benefited from slow speech rates, more evidently so in noise than in quiet. Their performance, however, did not reach a monolingual level, even at the most favorable rate. Nonetheless, all listeners reported comparable confidence when processing temporally manipulated English passages.	t	\N
22207311	We used eyetracking, perceptual discrimination, and production tasks to examine the influences of perceptual similarity and linguistic experience on word recognition in nonnative (L2) speech. Eye movements to printed words were tracked while German and Dutch learners of English heard words containing one of three pronunciation variants (/t/, /s/, or /f/) of the interdental fricative /θ/. Irrespective of whether the speaker was Dutch or German, looking preferences for target words with /θ/ matched the preferences for producing /s/ variants in German speakers and /t/ variants in Dutch speakers (as determined via the production task), while a control group of English participants showed no such preferences. The perceptually most similar and most confusable /f/ variant (as determined via the discrimination task) was never preferred as a match for /θ/. These results suggest that linguistic experience with L2 pronunciations facilitates recognition of variants in an L2, with effects of frequency outweighing effects of perceptual similarity.	t	\N
22212698	Auditory-evoked potentials represent the response of the auditory pathway to an auditory stimulus. Specific language impairment (SLI) children have delayed language development with difficulties in both understanding and producing spoken language. Hence, the purpose of this study was to determine whether a group of children with SLI had abnormal changes in the auditory middle latency response (AMLR). AMLR was obtained for 19 SLI children and they were studied and compared to normal. Audiological assessment and speech language tests were done for the study group. The results revealed no significant statistical differences between SLI children and the normal with regard to AMLR (P > 0.05). Our results suggest that children with SLI do not have abnormal auditory system response at the level measured by AMLR casting doubt on affection of the hypnotized origin of AMLR, mainly primary auditory cortex, as a cause for delayed language development in those children.	t	\N
22212767	The LiSN & Learn auditory training software was developed specifically to improve binaural processing skills in children with suspected central auditory processing disorder who were diagnosed as having a spatial processing disorder (SPD). SPD is defined here as a condition whereby individuals are deficient in their ability to use binaural cues to selectively attend to sounds arriving from one direction while simultaneously suppressing sounds arriving from another. As a result, children with SPD have difficulty understanding speech in noisy environments, such as in the classroom. To develop and evaluate the LiSN & Learn auditory training software for children diagnosed with the Listening in Spatialized Noise-Sentences Test (LiSN-S) as having an SPD. The LiSN-S is an adaptive speech-in-noise test designed to differentially diagnose spatial and pitch-processing deficits in children with suspected central auditory processing disorder. Participants were nine children (aged between 6 yr, 9 mo, and 11 yr, 4 mo) who performed outside normal limits on the LiSN-S. In a pre-post study of treatment outcomes, participants trained on the LiSN & Learn for 15 min per day for 12 weeks. Participants acted as their own control. Participants were assessed on the LiSN-S, as well as tests of attention and memory and a self-report questionnaire of listening ability. Performance on all tasks was reassessed after 3 mo where no further training occurred. The LiSN & Learn produces a three-dimensional auditory environment under headphones on the user's home computer. The child's task was to identify a word from a target sentence presented in background noise. A weighted up-down adaptive procedure was used to adjust the signal level of the target based on the participant's response. On average, speech reception thresholds on the LiSN & Learn improved by 10 dB over the course of training. As hypothesized, there were significant improvements in posttraining performance on the LiSN-S conditions where the target and distracter stimuli are spatially separated and which specifically evaluate binaural processing ability (p ranging from <.003 to .0001, η2 ranging from 0.694 to 0.873). In contrast, there was no improvement on the LiSN-S control conditions where the target and distracter stimuli emanate from the same direction (p ranging from .07 to .86, η2 ranging from 0.362 to 0.004). Significant improvements were found posttraining on measures of memory, on one measure of attention, and on self-reported ratings of listening ability. There were no significant differences between post- and 3 mo posttraining scores on any of the assessment tools. The initial LiSN & Learn study has shown that children as young as 6 yr of age are able to complete the training (although some coaxing was needed in a minority of cases). Both parents and children have reported benefits from the training, and feedback from the trial has resulted in extra features being added to the software. In order to further evaluate the efficacy of LiSN & Learn to remediate binaural processing deficits in children a clinical trial is currently under way utilizing a randomized blinded control group design.	t	\N
22212768	The Australian version of the Listening in Spatialized Noise-Sentences Test (LiSN-S) was originally developed to assess auditory stream segregation skills in children aged 6 to 11 yr with suspected central auditory processing disorder. The LiSN-S creates a three-dimensional auditory environment under headphones. A simple repetition-response protocol is used to assess a listener's speech reception threshold (SRT) for target sentences presented in competing speech maskers. Performance is measured as the improvement in SRT in decibels gained when either pitch, spatial, or both pitch and spatial cues are incorporated in the maskers. To collect additional normative data on the Australian LiSN-S for adolescents and adults up to 60 yr of age, to analyze the effects of age on LiSN-S performance, to examine retest reliability in the older population, and to extrapolate findings from the Australian data so that the North American version of the test can also be used clinically with older adults. In a descriptive design, normative and test-retest reliability data were collected from adolescents and adults and combined with previously published data from Australian children aged 6 to 11 yr. One hundred thirty-two participants with normal hearing aged 12 yr, 0 mo, to 60 yr, 7 mo, took part in the normative data study. Fifty-five participants returned between 2 and 4 mo after the initial assessment for retesting. Analysis of variance revealed a significant effect of age on LiSN-S performance (p < .01 for all LiSN-S measures, ηp2 ranging from 0.16 to 0.54). On the low and high cue SRT measures, planned contrasts revealed significant differences between adults and children aged 13 yr and younger, as well as between 50- to 60-yr-olds and younger adults aged 18-29 yr. Whereas there were significant differences between adults and children on the talker, spatial, and total advantage measures, there were no significant differences in performance in adults aged 18-60 yr. There was a small but significant improvement on retest ranging from 0.5 to 1.2 dB across the four LiSN-S test conditions (p ranging from .01 to <.001). However, there was no significant difference between test and retest on the advantage measures (p ranging from .143 to .768). Test-retest differences across all LiSN-S measures were significantly correlated (r ranging from 0.2 to 0.7, p ranging from .023 to <.00000001) and did not differ as a function of age (p ranging from .178 to .980). As there was no significant difference among adults aged 18-60 yr on the LiSN-S talker, spatial, and total advantage measures, it appears that the decline in ability to understand speech in noise experienced by 50- to 60-yr-olds is not related to their ability to use either spatial or pitch cues. This result suggests that some other factor/s contributes to the decline in speech perception in noise experienced by older adults that is reported in the literature and was demonstrated in this study on the LiSN-S low and high cue SRT measures.	t	\N
22213908	Contrasting results have been reported regarding the phonetic acquisition of bilinguals. A lack of discrimination has been observed for certain native contrasts in 8-month-old Catalan-Spanish bilingual infants (Bosch & Sebastián-Gallés, 2003a), though not in French-English bilingual infants (Burns, Yoshida, Hill & Werker, 2007; Sundara, Polka & Molnar, 2008). At present, the data for Catalan-Spanish bilinguals constitute an exception in the early language acquisition literature. This study contributes new findings that show that Catalan-Spanish bilingual infants do not lose the capacity to discriminate native contrasts. We used an adaptation of the anticipatory eye movement paradigm (AEM; McMurray & Aslin, 2004) to explore this question. In two experiments we tested the ability of infants from Catalan and Spanish monolingual families and from Catalan-Spanish bilingual families to discriminate a Spanish-Catalan common and a Catalan-specific vowel contrast. Results from both experiments revealed that Catalan-Spanish bilingual infants showed the same discrimination abilities as those shown by their monolingual peers, even in a phonetic contrast that had not been discriminated in previous studies. Our results demonstrate that discrimination can be observed in 8-month-old bilingual infants when tested with a measure not based on recovery of attention. The high ratio of cognates in Spanish and Catalan may underlie the reason why bilinguals failed to discriminate the native vowels when tested with the familiarization-preference procedure but succeeded with the AEM paradigm.	t	\N
22216557	The purpose of this research was twofold: firstly, to develop a music perception test (MPT) for hearing-aid users, and secondly, to evaluate the influence of non-linear frequency compression (NFC) on music perception with the use of the self-compiled test. This article focuses on the description of the development and validation of the MPT. To date, the main direction in frequency-lowering hearing-aid studies has been in relation to speech perception abilities. As hearing-aid technology has improved, interest has grown in musical perception as a dimension that could improve hearing-aid users' quality of life. The MPT was designed to evaluate different aspects of rhythm, timbre, pitch and melody. The development of the MPT could be described as design-based. Phase 1 of the study included test development and recording, while phase 2 entailed presentation of stimuli to normal hearing listeners (n = 15) and hearing-aid users (n = 4). Based on the findings of phase 2, item analysis was performed to eliminate or change stimuli that resulted in high error rates. During phase 3 the adapted version of the test was performed on a smaller group of normal hearing listeners (n = 4) and 20 hearing-aid users. Results proved that adults with normal hearing as well as adults using hearing aids were able to complete all the sub-tests of the MPT, although hearing-aid users scored lower on the various sub-tests than normal hearing listeners. For the rhythm section of the MPT normal hearing listeners scored on average 93.8% versus 75.5% of hearing-aid users; for the timbre section the scores were 83% versus 62.3% respectively. Normal hearing listeners obtained an average score of 86.3% for the pitch section and 88.2% for the melody section, compared with the 70.8% and 61.9% respectively obtained by hearing-aid users. This implies that the MPT can be used successfully for assessment of music perception in hearing-aid users within the South African context and may therefore result in more effective hearing-aid fittings taking place. The test can be used as a counselling tool to assist audiologists and patients in understanding the problems they experience regarding music perception, and might be used for future musical training in areas where participants experience problems in customising individual fittings.	t	\N
22217385	Various recent studies attest that reading involves creating an implicit prosodic representation of the written text which may systematically affect the resolution of syntactic ambiguities in sentence comprehension. Research up to now suggests that implicit prosody itself depends on a partial syntactic analysis of the text, raising the question of whether implicit prosody contributes to the parsing process, or whether it merely interprets the syntactic analysis. The present reading experiments examine the influence of stress-based linguistic rhythm on the resolution of local lexical-syntactic ambiguities in German. Both speech production data from unprepared oral reading and eye-tracking results from silent reading demonstrate that readers favor syntactic analyses that allow for a prosodic representation in which stressed and unstressed syllables alternate rhythmically. The findings contribute evidence confirming immediate and guiding effects of linguistic rhythm on the earliest stages of syntactic parsing in reading.	t	\N
22221469	Auditory sustained responses have been recently suggested to reflect neural processing of speech sounds in the auditory cortex. As periodic fluctuations below the pitch range are important for speech perception, it is necessary to investigate how low frequency periodic sounds are processed in the human auditory cortex. Auditory sustained responses have been shown to be sensitive to temporal regularity but the relationship between the amplitudes of auditory evoked sustained responses and the repetitive rates of auditory inputs remains elusive. As the temporal and spectral features of sounds enhance different components of sustained responses, previous studies with click trains and vowel stimuli presented diverging results. In order to investigate the effect of repetition rate on cortical responses, we analyzed the auditory sustained fields evoked by periodic and aperiodic noises using magnetoencephalography. Sustained fields were elicited by white noise and repeating frozen noise stimuli with repetition rates of 5-, 10-, 50-, 200- and 500 Hz. The sustained field amplitudes were significantly larger for all the periodic stimuli than for white noise. Although the sustained field amplitudes showed a rising and falling pattern within the repetition rate range, the response amplitudes to 5 Hz repetition rate were significantly larger than to 500 Hz. The enhanced sustained field responses to periodic noises show that cortical sensitivity to periodic sounds is maintained for a wide range of repetition rates. Persistence of periodicity sensitivity below the pitch range suggests that in addition to processing the fundamental frequency of voice, sustained field generators can also resolve low frequency temporal modulations in speech envelope.	t	\N
22224482	We reported a case of an elderly female patient affected by musical hallucinations (MHs) as the unique symptom of a right temporal ischemic stroke. A functional magnetic resonance imaging examination was performed in the patient and in five age- and sex-matched normal controls (NC) to detect the complex neural substrate subserving MHs in such a context. Although an activation pattern involving the primary auditory cortex and the temporal associative areas bilaterally was found in the patient and NC, a significant increased activation mostly located in right temporal cortex (in the ischemic area), was observed in the patient. Further functional neuroimaging studies should be performed to detect the complex neural pathways underlying MHs and to find out differences between these hallucinations and real music perception.	t	\N
22227006	Aging disrupts neural timing, reducing the nervous system's ability to precisely encode sound. Given that the neural representation of temporal features is strengthened with musical training in young adults, can musical training offset the negative impact of aging on neural processing? By comparing auditory brainstem timing in younger and older musicians and nonmusicians to a consonant-vowel speech sound /da/. we document a musician's resilience to age-related delays in neural timing.	t	\N
22232409	To compare the development of phonological skills in children with specific language impairment (SLI) with and without literacy delay and to examine whether kindergarten phonological skills could discriminate these 2 groups. In a longitudinal study, 8 children with SLI and literacy delay, 10 children with SLI and normal literacy, and 14 typically developing children were studied from the last year of kindergarten to the start of Grade 3. A wide range of phonological tasks (phonological awareness [PA], verbal short-term memory [vSTM], and rapid automatized naming [RAN]) were administered yearly. The SLI group with literacy delay scored significantly lower than the typically developing children on almost all phonological tasks in all grades, whereas the SLI group with normal literacy scored significantly lower only on demanding PA and vSTM tasks. A combination of kindergarten PA and RAN skills could correctly classify 75% of the children with SLI. By including vSTM, the discriminatory value did not increase. Children with SLI and normal literacy at age 8;1 [years;months] continued to have difficulties with demanding PA and vSTM tasks. Children with SLI and poor PA and RAN in kindergarten were at high risk of developing literacy problems in a transparent orthography.	t	\N
22237762	The objective of the study was to compare the performance of cochlear implantation between post-meningitic and non-meningitic patients, and to evaluate the impact on hearing outcome of technical advances in cochlear implant technology. Retrospective chart review was used as the study design. Twenty adults with post-meningitic profound hearing loss receiving unilateral or bilateral cochlear implants between 1990 and 2008 were tested. Results were compared to a control group of 46 adults implanted for a non-meningitic hearing loss, with the same pre-operative speech scores. Speech scores were poorer in post-meningitic patients compared to those of control group, whatever the duration after implantation (p < 0.0001). Speech scores of subjects implanted and fitted before 2001 were compared to those of subjects implanted after 2001, with the same duration of hearing loss. Performance improved with implants and processors available after 2001, with a magnitude of improvement higher in post-meningitic patients (p < 0.0001 and p < 0.05 in post-meningitic and control groups, respectively, two-way ANOVA). Consequently, speech scores of post-meningitic patients implanted after 2001 achieved those of control subjects (two-way ANOVA). Advances in cochlear implant technology and coding strategy improve hearing outcome in post-meningitic adult patients, who now achieve similar performance as those of non-meningitic patients.	t	\N
22246139	Dynamic-range compression is routinely used in bilaterally fitted hearing devices. The objective of this study was to investigate how compression applied independently at each ear affects spatial perception in normal-hearing listeners and to relate the effects to changes in binaural cues caused by the compression for different types of sound. A semantic-differential method was used to measure the spatial attributes of sounds. Eleven normal-hearing participants responded to questions addressing certainty of location, diffuseness, movement, image splits, and externalization of sounds. Responses were given on seven-point scales between pairs of opposing terms. Stimuli included speech and a range of synthetic sounds with varying characteristics. Head-related transfer functions were used to simulate a source at an azimuth of -60° or +60°. Three processing conditions were compared: (1) an unprocessed reference condition; (2) fast-acting, wide-dynamic-range compression operating independently at each ear; and (3) imposition of a static bias in interaural level difference (ILD) equivalent to that generated by the compression under steady state conditions. All processing was applied in a high-frequency channel above 2 kHz. The three processing conditions were compared separately in two bandwidth conditions: a high-pass condition in which the high-frequency channel was presented to listeners in isolation and a full-bandwidth condition in which the high-frequency channel was recombined with the unprocessed low-frequency channel. Hierarchical cluster analysis was used to group related questions based on similarity of participants' responses. This led to the calculation of composite scores for four spatial attributes: "diffuseness," "movement," "image split," and "externalization." Compared with the unprocessed condition, fast-acting compression significantly increased diffuseness, movement, and image-split scores and significantly reduced externalization scores. The effects of compression were greater when listeners heard the high-frequency channel in isolation than when it was recombined with the unprocessed low-frequency channel. The effects were apparent only for sounds containing gradual onsets and offsets, including speech. Dynamic compression had a much more pronounced effect on the spatial attributes of sounds than imposition of a static bias in ILD. Fast-acting compression at high frequencies operating independently at each ear can adversely affect the spatial attributes of sounds in normal-hearing listeners by increasing diffuseness, increasing or giving rise to a sense of movement, causing images to split, and affecting the externalization of sounds. The effects are reduced, but not eliminated, when listeners have access to undisturbed low-frequency cues. Sounds containing gradual onsets and offsets, including speech, are most affected. The effects arise primarily as a result of relatively slow changes in ILD that are generated as the sound level at one or both ears crosses the compression threshold. The results may have implications for the use of compression in bilaterally fitted hearing devices, specifically in relation to spatial perception in dynamic situations.	t	\N
22248574	Feedback connections among auditory cortical regions may play an important functional role in processing naturalistic speech, which is typically considered a problem solved through serial feed-forward processing stages. Here, we used fMRI to investigate whether activity within primary auditory cortex (PAC) is sensitive to the perceived clarity of degraded sentences. A region-of-interest analysis using probabilistic cytoarchitectonic maps of PAC revealed a modulation of activity, in the most primary-like subregion (area Te1.0), related to the intelligibility of naturalistic speech stimuli that cannot be driven by stimulus differences. Importantly, this effect was unique to those conditions accompanied by a perceptual increase in clarity. Connectivity analyses suggested sources of input to PAC are higher-order temporal, frontal and motor regions. These findings are incompatible with feed-forward models of speech perception, and suggest that this problem belongs amongst modern perceptual frameworks in which the brain actively predicts sensory input, rather than just passively receiving it.	t	\N
22251287	Previous behavioural research suggests that infants possess phonologically detailed representations of the vowels and consonants in familiar words. These tasks examine infants' sensitivity to mispronunciations of a target label in the presence of a target and distracter image. Sensitivity to the mispronunciation may, therefore, be contaminated by the degree of mismatch between the distracter label and the heard mispronounced label. Event-related potential (ERP) studies allow investigation of infants' sensitivity to the relationship between a heard label (correct or mispronounced) and the referent alone using single picture trials. ERPs also provide information about the timing of lexico-phonological activation in infant word recognition. The current study examined 14-month-olds' sensitivity to vowel mispronunciations of familiar words using ERP data from single picture trials. Infants were presented with familiar images followed by a correct pronunciation of its label, a vowel mispronunciation or a phonologically unrelated non-word. The results support and extend previous behavioural findings that 14-month-olds are sensitive to mispronunciations of the vowels in familiar words using an ERP task. We suggest that the presence of pictorial context reinforces infants' sensitivity to mispronunciations of words, and that mispronunciation sensitivity may rely on infants accessing the cross-modal associations between word forms and their meanings.	t	\N
22256834	This study investigated the impact of an established behavioural dysarthria treatment on acoustic and perceptual measures of speech in two adults with Down syndrome (DS) and dysarthria to obtain preliminary measures of treatment effect, effect size and treatment feasibility. A single-subject A-B-A experimental design was used to measure the effects of the Lee Silverman Voice treatment (LSVT®) on speech in two adults with DS and dysarthria. Dependent measures included vocal sound pressure level (dB SPL), phonatory stability and listener intelligibility scores. Statistically significant improvements (p < 0.05) in vocal dB SPL and phonatory stability were present following treatment in both participants. Speech intelligibility scores improved in one of the two participants. These data suggest that people with DS and dysarthria can respond positively to intensive speech treatment such as LSVT. Further investigations are needed to develop speech treatments specific to DS.	t	\N
22265373	We discuss experimental support for the existence of a corollary discharge signal of attention movement control and its formulation in terms of the corollary discharge of attention model of attention movement (CODAM). The data is from fMRI, MEG and EEG activity observed about 200 ms after stimulus onset in various attention paradigms and in which the activity is mainly sited in parietal and extra-striate visual areas. Moreover the data arises from neural activity observed before report of a subject's experience occurs. The overall experimental support for the existence of a copy of the attention movement control signal generates, it is suggested, a viable route to explore the relation between this signal and human consciousness, as concluded in the paper.	t	\N
22280585	Despite many studies investigating auditory spatial impressions in rooms, few have addressed the impact of simultaneous visual cues on localization and the perception of spaciousness. The current research presents an immersive audiovisual environment in which participants were instructed to make auditory width judgments in dynamic bi-modal settings. The results of these psychophysical tests suggest the importance of congruent audio visual presentation to the ecological interpretation of an auditory scene. Supporting data were accumulated in five rooms of ascending volumes and varying reverberation times. Participants were given an audiovisual matching test in which they were instructed to pan the auditory width of a performing ensemble to a varying set of audio and visual cues in rooms. Results show that both auditory and visual factors affect the collected responses and that the two sensory modalities coincide in distinct interactions. The greatest differences between the panned audio stimuli given a fixed visual width were found in the physical space with the largest volume and the greatest source distance. These results suggest, in this specific instance, a predominance of auditory cues in the spatial analysis of the bi-modal scene.	t	\N
22280586	When speech is in competition with interfering sources in rooms, monaural indicators of intelligibility fail to take account of the listener's abilities to separate target speech from interfering sounds using the binaural system. In order to incorporate these segregation abilities and their susceptibility to reverberation, Lavandier and Culling [J. Acoust. Soc. Am. 127, 387-399 (2010)] proposed a model which combines effects of better-ear listening and binaural unmasking. A computationally efficient version of this model is evaluated here under more realistic conditions that include head shadow, multiple stationary noise sources, and real-room acoustics. Three experiments are presented in which speech reception thresholds were measured in the presence of one to three interferers using real-room listening over headphones, simulated by convolving anechoic stimuli with binaural room impulse-responses measured with dummy-head transducers in five rooms. Without fitting any parameter of the model, there was close correspondence between measured and predicted differences in threshold across all tested conditions. The model's components of better-ear listening and binaural unmasking were validated both in isolation and in combination. The computational efficiency of this prediction method allows the generation of complex "intelligibility maps" from room designs.	t	\N
22280718	The high soprano range was investigated by acoustic and electroglottographic measurements of 12 sopranos and high-speed endoscopy of one of these. A single laryngeal transition was observed on glissandi above the primo passaggio. It supports the existence of two distinct laryngeal mechanisms in the high soprano range: M2 and M3, underlying head and whistle registers. The laryngeal transition occurred gradually over several tones within the interval D#5-D6. It occurred over a wider range and was completed at a higher pitch for trained than untrained sopranos. The upper limit of the laryngeal transition during glissandi was accompanied by pitch jumps or instabilities, but, for most singers, it did not coincide with the upper limit of R1:f(0) tuning (i.e., tuning the first resonance to the fundamental frequency). However, pitch jumps could also be associated with changes in resonance tuning. Four singers demonstrated an overlap range over which they could sing with a full head or fluty resonant quality. Glottal behaviors underlying these two qualities were similar to the M2 and M3 mechanisms respectively. Pitch jumps and discontinuous glottal and spectral changes characteristic of a M2-M3 laryngeal transition were observed on decrescendi produced within this overlap range.	t	\N
22289805	Human speech perception is highly resilient to acoustic distortions. In addition to distortions from external sound sources, degradation of the acoustic structure of the sound itself can substantially reduce the intelligibility of speech. The degradation of the internal structure of speech happens, for example, when the digital representation of the signal is impoverished by reducing its amplitude resolution. Further, the perception of speech is also influenced by whether the distortion is transient, coinciding with speech, or is heard continuously in the background. However, the complex effects of the acoustic structure and continuity of the distortion on the cortical processing of degraded speech are unclear. In the present magnetoencephalography study, we investigated how the cortical processing of degraded speech sounds as measured through the auditory N1m response is affected by variation of both the distortion type (internal, external) and the continuity of distortion (transient, continuous). We found that when the distortion was continuous, the N1m was significantly delayed, regardless of the type of distortion. The N1m amplitude, in turn, was affected only when speech sounds were degraded with transient internal distortion, which resulted in larger response amplitudes. The results suggest that external and internal distortions of speech result in divergent patterns of activity in the auditory cortex, and that the effects are modulated by the temporal continuity of the distortion.	t	\N
22292057	To the extent that sensorineural systems are efficient, redundancy should be extracted to optimize transmission of information, but perceptual evidence for this has been limited. Stilp and colleagues recently reported efficient coding of robust correlation (r = .97) among complex acoustic attributes (attack/decay, spectral shape) in novel sounds. Discrimination of sounds orthogonal to the correlation was initially inferior but later comparable to that of sounds obeying the correlation. These effects were attenuated for less-correlated stimuli (r = .54) for reasons that are unclear. Here, statistical properties of correlation among acoustic attributes essential for perceptual organization are investigated. Overall, simple strength of the principal correlation is inadequate to predict listener performance. Initial superiority of discrimination for statistically consistent sound pairs was relatively insensitive to decreased physical acoustic/psychoacoustic range of evidence supporting the correlation, and to more frequent presentations of the same orthogonal test pairs. However, increased range supporting an orthogonal dimension has substantial effects upon perceptual organization. Connectionist simulations and Eigenvalues from closed-form calculations of principal components analysis (PCA) reveal that perceptual organization is near-optimally weighted to shared versus unshared covariance in experienced sound distributions. Implications of reduced perceptual dimensionality for speech perception and plausible neural substrates are discussed.	t	\N
22292257	To investigate the effect of regular scuba diving on the hearing thresholds of sport divers who have no history of noise exposure or ear-related accidents. Comprehensive topographic examination of the peripheral hearing system of sport divers. Cross-sectional study. General sport diving community. 81 sport divers with a mean of 300 dives each were compared to a control group of 81 non-divers. Participants were classified into three age groups. Examination included microscopic otoscopy, tympanometry, pure-tone audiometry (PTA) including air and bone conduction, speech audiometry and otoacoustic emissions (OAE). PTA suggested significant differences of the hearing thresholds at several frequencies between sport divers and non-divers in all age groups, although a Bonferroni correction for multiple testing was applied. Interestingly, the results were contradictory. Divers obtained better hearing results in air conduction, whereas non-divers showed better results in bone conduction. Speech audiometry and OAE did not reveal significant differences. There are no published studies of the peripheral cochlear system of divers that have used a combination of PTA, speech audiometry and OAE. All studies suggesting hearing impairment in divers were based on PTA and might have been influenced by a lack of accuracy of PTA. Our results suggest that diving does not adversely affect the hearing system of sport divers. A thorough test battery of audiological methods implying PTA, speech audiometry and OAE may contribute to offer more reliable results to answer the question of whether commercial or military divers are at higher risk for hearing detoriation.	t	\N
22293014	More than 100 years ago, Huey (1908/1968) indicated that the upper part of words was more relevant for perception than the lower part. Here we examined whether mutilated words, in their upper/lower portions (e.g., ), can automatically access their word units in the mental lexicon. To that end, we conducted four masked repetition priming experiments with the lexical decision task. Results showed that mutilated primes produced a sizeable masked repetition priming effect. Furthermore, the magnitude of the masked repetition priming effect was greater when the upper part of the primes was preserved than when the lower portion was preserved-this was the case not only when the mutilated words were presented in lower case but also when the mutilated words were presented in upper case. Taken together, these findings suggest that the front-end of computational models of visual-word recognition should be modified to provide a more realistic account at the level of letter features.	t	\N
22305992	To form a unified percept of our environment, the human brain integrates information within and across the senses. This MEG study investigated interactions within and between sensory modalities using a frequency analysis of steady-state responses that are elicited time-locked to periodically modulated stimuli. Critically, in the frequency domain, interactions between sensory signals are indexed by crossmodulation terms (i.e. the sums and differences of the fundamental frequencies). The 3 × 2 factorial design, manipulated (1) modality: auditory, visual or audiovisual (2) steady-state modulation: the auditory and visual signals were modulated only in one sensory feature (e.g. visual gratings modulated in luminance at 6 Hz) or in two features (e.g. tones modulated in frequency at 40 Hz & amplitude at 0.2 Hz). This design enabled us to investigate crossmodulation frequencies that are elicited when two stimulus features are modulated concurrently (i) in one sensory modality or (ii) in auditory and visual modalities. In support of within-modality integration, we reliably identified crossmodulation frequencies when two stimulus features in one sensory modality were modulated at different frequencies. In contrast, no crossmodulation frequencies were identified when information needed to be combined from auditory and visual modalities. The absence of audiovisual crossmodulation frequencies suggests that the previously reported audiovisual interactions in primary sensory areas may mediate low level spatiotemporal coincidence detection that is prominent for stimulus transients but less relevant for sustained SSR responses. In conclusion, our results indicate that information in SSRs is integrated over multiple time scales within but not across sensory modalities at the primary cortical level.	t	\N
22306805	In human communication, direct speech (e.g., Mary said, "I'm hungry") is perceived as more vivid than indirect speech (e.g., Mary said that she was hungry). This vividness distinction has previously been found to underlie silent reading of quotations: Using functional magnetic resonance imaging (fMRI), we found that direct speech elicited higher brain activity in the temporal voice areas (TVA) of the auditory cortex than indirect speech, consistent with an "inner voice" experience in reading direct speech. Here we show that listening to monotonously spoken direct versus indirect speech quotations also engenders differential TVA activity. This suggests that individuals engage in top-down simulations or imagery of enriched supra-segmental acoustic representations while listening to monotonous direct speech. The findings shed new light on the acoustic nature of the "inner voice" in understanding direct speech.	t	\N
22310087	To review outcomes following paediatric cholesteatoma surgery performed between 1999 and 2009 in a tertiary paediatric ENT unit. Retrospective case note review. A total of 137 mastoid procedures were recorded. Fifty-four per cent of children were observed to have disease involving the entire middle-ear cleft and mastoid complex. The revision rate was 25 per cent. Time to recurrence was one to three years in 17 patients, three to six years in five patients, and six to nine years in three cases. Eight of 25 revision cases demonstrated spontaneous improvement in air conduction thresholds following primary surgery. A high facial ridge and inadequate meatoplasty correlated highly with disease recurrence. Children tend to present with aggressive disease. Disease extent and ossicular chain involvement are associated with a higher risk of recurrent disease. Spontaneous improvement in hearing thresholds following cholesteatoma surgery should alert the clinician to recurrent disease.	t	\N
22310374	The use of cochlear implants in the rehabilitation of hearing-impaired patients is now widely established and offers great benefits for children. To evaluate these benefits, the Evaluation of Auditory Responses to Speech (EARS®) assessment tool was designed in 1995. The objectives of this study were to assess the auditory perceptual skills in a large paediatric population over time, as well as to provide standardised values. An international multicentre study was conducted in 35 clinics from 1996 to 2009. During this period, 765 children were assessed with the EARS in 10 test intervals: pre-operative, first fitting and then 1, 3, 6, and 12 months after the first fitting and annually thereafter, up to a maximum period of 5 years. The auditory skills improved significantly over time for all speech perception tests and questionnaires as well as between single test intervals: pre-operative to 6 months, 6 months to 1 year and 1 year to 2 years. Standardised values of the children stratified according to their age at implantation were calculated. The auditory perceptual skills significantly improved over time. The standardised values will help professionals set therapeutic goals and counsel parents accordingly.	t	\N
22314920	Although cochlear implant (CI) users frequently report deterioration of sound quality when listening to music, few methods exist to quantify these subjective claims. 1) To design a novel research method for quantifying sound quality perception in CI users during music listening; 2) To validate this method by assessing one attribute of music perception, bass frequency perception, which is hypothesized to be relevant to overall musical sound quality perception. Limitations in bass frequency perception contribute to CI-mediated sound quality deteriorations. The proposed method will quantify this deterioration by measuring CI users' impaired ability to make sound quality discriminations among musical stimuli with variable amounts of bass frequency removal. A method commonly used in the audio industry (multiple stimulus with hidden reference and anchor [MUSHRA]) was adapted for CI users, referred to as CI-MUSHRA. CI users and normal hearing controls were presented with 7 sound quality versions of a musical segment: 5 high pass filter cutoff versions (200-, 400-, 600-, 800-, 1000-Hz) with decreasing amounts of bass information, an unaltered version ("hidden reference"), and a highly altered version (1,000-1,200 Hz band pass filter; "anchor"). Participants provided sound quality ratings between 0 (very poor) and 100 (excellent) for each version; ratings reflected differences in perceived sound quality among stimuli. CI users had greater difficulty making overall sound quality discriminations as a function of bass frequency loss than normal hearing controls, as demonstrated by a significantly weaker correlation between bass frequency content and sound quality ratings. In particular, CI users could not perceive sound quality difference among stimuli missing up to 400 Hz of bass frequency information. Bass frequency impairments contribute to sound quality deteriorations during music listening for CI users. CI-MUSHRA provided a systematic and quantitative assessment of this reduced sound quality. Although the effects of bass frequency removal were studied here, we advocate CI-MUSHRA as a user-friendly and versatile research tool to measure the effects of a wide range of acoustic manipulations on sound quality perception in CI users.	t	\N
22316007	To measure the acceptable noise level (ANL) with and without noise reduction algorithms (NRAs), and to predict ΔANL, i.e. the difference in acceptable noise level with and without NRAs. The ANL test was applied to three NRAs. Furthermore, the measured ΔANL was predicted using several methods based on either the calculation of the signal-to-noise ratio or correlation methods of the processed signals with an unprocessed reference signal. Ten normal-hearing and eleven hearing-impaired subjects accomplished the ANL test. In general, the ANL test could determine an increased acceptance of noise with some NRAs. However, great inter-individual differences also resulted that were attributed to audible distortions when an NRA was used. Prediction of the mean measured DANL was possible, but individual prediction of DANL failed due to inter-individual differences. Mean DANL was predicted more accurately for hearing-impaired subjects when individual hearing loss was taken into account. The ANL test is a suitable tool for measuring the advantage of one NRA. A prediction of the measured individual ΔANL failed. However, mean DANL could be predicted with some methods. Furthermore, the individual hearing loss should be taken into account for a more accurate prediction for hearing-impaired subjects.	t	\N
22325920	Factors affecting perceptions of occupational suitability were examined for speakers who stutter and speakers who do not stutter. In Experiment 1, 58 adults who do not stutter heard one of two audio recordings (less severe stuttering, more severe stuttering) of a speaker who stuttered. Participants rated the speaker's communicative functioning, personal attributes, and suitability for 32 occupations, along with perceptions of the occupations' speaking demands and educational requirements. Perceived speaking demand strongly affected occupational suitability ratings at both levels of stuttering severity. In Experiment 2, 58 additional adults who do not stutter heard a recording of another adult in one of two conditions (fluent speech, pseudo-stuttering), and provided the same ratings as in Experiment 1. In the pseudo-stuttering condition, participants' perceptions of occupational speaking demand again had a strong effect on occupational suitability ratings. In the fluent condition, suitability ratings were affected primarily by perceived educational demand; perceived speaking demand was of secondary importance. Across all participants in Experiment 2, occupational suitability ratings were associated with ratings of the speaker's personal attributes and communicative functioning. In both experiments, speakers who stuttered received lower suitability ratings for high speaking demand occupations than for low speaking demand occupations. Ratings for many high speaking occupations, however, fell just below the midpoint of the occupational suitability scale, suggesting that participants viewed these occupations as less appropriate, but not necessarily inappropriate, for people who stutter. Overall, the findings support the hypothesis that people who stutter may face occupational stereotyping and/or role entrapment in work settings. At the end of this activity the reader will be able to (a) summarize main findings on research related to the work-related experiences of people who stutter, (b) describe factors that affect perceptions of which occupations are best suited for speakers who stutter and speakers who do not stutter, and (c) discuss how findings from the present study relate to previous findings on occupational advice for people who stutter.	t	\N
22326589	The present study was aimed at investigating the relationship between the mismatch negativity (MMN) and psychoacoustical effects of sequential streaming on comodulation masking release (CMR). The influence of sequential streaming on CMR was investigated using a psychoacoustical alternative forced-choice procedure and electroencephalography (EEG) for the same group of subjects. The psychoacoustical data showed, that adding precursors comprising of only off-signal-frequency maskers abolished the CMR. Complementary EEG data showed an MMN irrespective of the masker envelope correlation across frequency when only the off-signal-frequency masker components were present. The addition of such precursors promotes a separation of the on- and off-frequency masker components into distinct auditory objects preventing the auditory system from using comodulation as an additional cue. A frequency-specific adaptation changing the representation of the flanking bands in the streaming conditions may also contribute to the reduction of CMR in the stream conditions, however, it is unlikely that adaptation is the primary reason for the streaming effect. A neurophysiological correlate of sequential streaming was found in EEG data using MMN, but the magnitude of the MMN was not correlated with the audibility of the signal in CMR experiments. Dipole source analysis indicated different cortical regions involved in processing auditory streaming and modulation detection. In particular, neural sources for processing auditory streaming include cortical regions involved in decision-making.	t	\N
22326590	The primary goal of this study was to characterize the variability in auditory-nerve temporal response patterns obtained with the electrically evoked compound action potential (ECAP) within and across a relatively large group of cochlear-implant recipients. ECAPs were recorded in response to each of 21 pulses in a pulse train for five rates (900, 1200, 1800, 2400, and 3500 pps) and three cochlear regions (basal, middle, and apical). An alternating amplitude pattern was typically observed across the pulse train for slower rates, reflecting refractory properties of individual nerve fibers. For faster rates, the alternation ceased and overall amplitudes were substantially lower relative to the first pulse in the train, reflecting cross-fiber desynchronization. The following specific parameters were examined: (1) the rate at which the alternating pattern ceased (termed stochastic rate), (2) the alternation depth and the rate at which the maximum alternation occurred, and (3) the average normalized ECAP amplitude across the pulse train (measure of overall adaptation/desynchronization). Data from 29 ears showed that stochastic rates for the group spanned the entire range of rates tested. The majority of subjects (79%) had different stochastic rates across the three cochlear regions. The stochastic rate occurred most frequently at 2400 pps for basal and middle electrodes, and at 3500 pps for apical electrodes. Stimulus level was significantly correlated with stochastic rate, where higher levels yielded faster stochastic rates. The maximum alternation depth averaged 19% of the amplitude for the first pulse. Maximum alternation occurred most often at 1800 pps for basal and apical electrodes, and at 1200 pps for middle electrodes. These differences suggest some independence between alternation depth and stochastic rate. Finally, the overall amount of adaptation or desynchronization ranged from 63% (for 900 pps) to 23% (for 3500 pps) of the amplitude for the first pulse. Differences in temporal response properties across the cochlea within subjects may have implications for developing new speech-processing strategies that employ varied rates across the array.	t	\N
22326591	There is evidence that humans represent numbers in the form of a mental number line (MNL). Here we show that the MNL modulates the representation of visual and haptic space both in healthy individuals and right-brain-damaged patients, both with and without left unilateral spatial neglect (USN). Participants were asked to estimate the midpoint of visually or haptically explored rods while listening to task-irrelevant stimuli: a small digit ("2"), a large digit ("8"), or a non-numerical auditory stimulus ("blah"). In a control silent condition, the bisection error of USN patients was biased rightwards (namely, the marker of USN) only in the visual modality. Regardless of the direction of the bisection error committed in silent trials, listening to the small digit shifted the perceived midline leftwards, and listening to the large digit shifted the perceived midline rightwards, compared to a control condition in which a neutral syllable ("blah") was presented. The shift induced by listening to numbers occurred independently of the modality of response (i.e., both in vision and haptics), and in every group of participants. Interestingly, the effect of auditory numbers processing on space estimation was overall larger for haptically than for visually explored space in all participants. In conclusion, the present data show that listening to irrelevant numbers affects space perception also in patients with left USN, indicating that the spatial representation and attention processes disrupted by USN are not involved in these numerical magnitude-spatial effects.	t	\N
22326876	Individuals diagnosed with schizophrenia show deficiencies of basic neurophysiological sorting mechanisms. This study further investigated this issue, focusing on the two phenomena, laterality of coding and auditory forward masking. A specific audiometric method for use in psychiatry was the measuring set up to register brain stem audiograms (ABRs). A sample of 49 schizophrenic patients was compared with three control groups consisting of healthy reference subjects (n=49), attention deficit hyperactivity disorder (ADHD) patients (n=29), Asperger syndrome (AS) patients (n=13) and drug-induced psychotic patients (n=14). Schizophrenic patients showed significant abnormal laterality of brainstem activity in wave II of the auditory brainstem response (ABR) in comparison with all other study groups. Forward masking effects in the superior olive complex were coded significantly differently by schizophrenic patients compared to control groups except for the AS group. The results suggest deficits in the coding of auditory stimuli in the lower parts of the auditory pathway in schizophrenia and indicate that increased peripheral lateral asymmetry and forward masking aberrances could be neurophysiological markers for the disorder.	t	\N
22327619	Visual cues are known to aid auditory processing when they provide direct information about signal content, as in lip reading. However, some studies hint that visual cues also aid auditory perception by guiding attention to the target in a mixture of similar sounds. The current study directly tests this idea for complex, nonspeech auditory signals, using a visual cue providing only timing information about the target. Listeners were asked to identify a target zebra finch bird song played at a random time within a longer, competing masker. Two different maskers were used: noise and a chorus of competing bird songs. On half of all trials, a visual cue indicated the timing of the target within the masker. For the noise masker, the visual cue did not affect performance when target and masker were from the same location, but improved performance when target and masker were in different locations. In contrast, for the chorus masker, visual cues improved performance only when target and masker were perceived as coming from the same direction. These results suggest that simple visual cues for when to listen improve target identification by enhancing sounds near the threshold of audibility when the target is energetically masked and by enhancing segregation when it is difficult to direct selective attention to the target. Visual cues help little when target and masker already differ in attributes that enable listeners to engage selective auditory attention effectively, including differences in spectrotemporal structure and in perceived location.	t	\N
22329567	To determine the optimal seating position in a noisy classroom for students with unilateral hearing loss (UHL) without any auditory rehabilitation as compared to normal-hearing adults and student peers. Speech discrimination scores (SDS) for babble noise at distances of 3, 4, 6, 8, and 10 m from a speaker were measured in a simulated classroom measuring 300 m3 (reverberation time = 0.43 s). Students with UHL (n = 25, 10-19 years old), normal-hearing students (n = 25), and normal-hearing adults (n = 25). The SDS for the normal-hearing adults at the 3, 4, 6, 8, and 10 m distances were 90.0±6.4%, 84.7±7.9%, 80.6±10.0%, 75.5±12.6%, and 68.8±13.0%, respectively. Those for the normal-hearing students were 90.1±6.2%, 78.1±9.4%, 66.4±10.7%, 61.8±11.2%, and 60.8±10.9%. Those for the UHL group were 81.7±9.0%, 70.2±12.4%, 62.1±17.2%, 52.4±17.1%, and 48.9±17.9%. The UHL group needed a seating position of 4.35 m to achieve an equivalent mean SDS as those for normal-hearing adults seated at 10 m. Likewise, the UHL group needed to be seated at 6.27 m to have an equivalent SDS as the normal-hearing students seated at 10 m. Students with UHL in noisy classrooms require seating ranging from 4.35 m to no further than 6.27 m away from a teacher to obtain a SDS comparable to normal hearing adults and student peers.	t	\N
22337493	Complete psychometric functions for phoneme and word recognition scores at 8 signal-to-noise ratios from -15 dB to 20 dB were generated for the first 10, 20, and 25, as well as all 50, three-word presentations of the Tri-Word or Computer Assisted Speech Recognition Assessment (CASRA) Test (Gelfand, 1998) based on the results of 12 normal-hearing young adult participants from the original study. The psychometric functions for both phoneme and word scores were very similar and essentially overlapping for all set sizes. Performance on the shortened tests accounted for 98.8% to 99.5% of the full (50-set) test variance with phoneme scoring, and 95.8% to 99.2% of the full test variance with word scoring. Shortening the tests accounted for little if any of the variance in the slopes of the functions. The psychometric functions for abbreviated versions of the Tri-Word speech recognition test using 10, 20, and 25 presentation sets were described and are comparable to those of the original 50-presentation approach for both phoneme and word scoring in healthy, normal-hearing, young adult participants.	t	\N
22341758	The neural mechanisms used in tone rises and falls in Mandarin were investigated. Nine participants were scanned while they named one-character pictures that required rising or falling tone responses in Mandarin: the left insula and right putamen showed stronger activation between rising and falling tones; the left brainstem showed weaker activation between rising and falling tones. Connectivity analysis showed that the significant projection from the laryngeal motor cortex to the brainstem which was present in rising tones was absent in falling tones. Additionally, there was a significant difference between the connection from the insula to the laryngeal motor cortex which was negative in rising tones but positive in falling tones. These results suggest that the significant projection from the laryngeal motor cortex to the brainstem used in rising tones was not active in falling tones. The connection from the left insula to the laryngeal motor cortex that differs between rising and falling tones may control whether the rise mechanism is active or not.	t	\N
22343068	Interaural intensity disparities (IIDs), the cues all animals use to localize high frequency sounds, are initially processed in the lateral superior olive (LSO) by a subtractive process where inputs from one ear excite and inputs from the other ear inhibit LSO neurons. Such cells are called excitatory-inhibitory (EI) neurons and are prominent not only in the LSO but also in higher nuclei, which include the dorsal nucleus of the lateral lemniscus (DNLL) and inferior colliculus (IC). The IC is of particular interest since its EI cells receive diverse innervation patterns from a large number of lower nuclei, which include the DNLLs and LSOs, and thus comprise a population with diverse binaural properties. The first part of this review focuses on the circuits that create EI cells in the LSO, DNLL and IC. The second section then turns to the responses evoked by dynamic IIDs that change over time, as with multiple sounds that emanate from different regions of space or moving sound sources. I show that many EI neurons in the IC respond to dynamic IIDs in ways that are not predictable from their responses to static IIDs, IIDs presented one at a time. In the final section, results from in vivo whole cell recording in the IC are presented and address the connectional basis for the responsiveness to dynamic IIDs. The principal conclusion is that EI cells comprise a diverse population. The diversity is created by the particular set of inputs each EI type receives and is expressed in the differences in the responses to dynamic IIDs that are generated by those inputs. These results show that the construction of EI neurons in the IC imparts features that not only encode the location of an individual sound source, but also that allow animals to determine the direction of a moving sound and to focus and localize a single sound in midst of many sounds, as typically occurs in the daily lives of all animals.	t	\N
22344314	To assess the effects of spatial frequency and phase alignment of mask components in pattern masking, target threshold vs. mask contrast (TvC) functions for a sine-wave grating (S) target were measured for five types of mask: a sine-wave grating (S), a square-wave grating (Q), a missing fundamental square-wave grating (M), harmonic complexes consisting of phase-scrambled harmonics of a square wave (Qp), and harmonic complexes consisting of phase-scrambled harmonics of a missing fundamental square wave (Mp). Target and masks had the same fundamental frequency (0.46 cpd) and the target was added in phase with the fundamental frequency component of the mask. Under monocular viewing conditions, the strength of masking depends on phase relationships among mask spatial frequencies far removed from that of the target, at least 3 times the target frequency, only when there are common target and mask spatial frequencies. Under dichoptic viewing conditions, S and Q masks produced similar masking to each other and the phase-scrambled masks (Qp and Mp) produced less masking. The results suggest that pattern masking is spatial frequency broadband in nature and sensitive to the phase alignments of spatial components.	t	\N
22352523	Knowledge-based speech recognition systems extract acoustic cues from the signal to identify speech characteristics. For channel-deteriorated telephone speech, acoustic cues, especially those for stop consonant place, are expected to be degraded or absent. To investigate the use of knowledge-based methods in degraded environments, feature extrapolation of acoustic-phonetic features based on Gaussian mixture models is examined. This process is applied to a stop place detection module that uses burst release and vowel onset cues for consonant-vowel tokens of English. Results show that classification performance is enhanced in telephone channel-degraded speech, with extrapolated acoustic-phonetic features reaching or exceeding performance using estimated Mel-frequency cepstral coefficients (MFCCs). Results also show acoustic-phonetic features may be combined with MFCCs for best performance, suggesting these features provide information complementary to MFCCs.	t	\N
22352610	Recent acoustic descriptions have shown that Spanish and Portuguese vowels are produced differently in Europe and Latin America. The present study investigates whether comparable between-variety differences exist in vowel perception. Spanish, Peruvian, Portuguese, and Brazilian listeners were tested in a vowel identification task with stimuli sampled from the whole vowel space. The mean perceived first (F1) and second formant (F2) of every vowel category were compared across varieties. For both languages, perception exhibited the same between-variety differences as production for F1 but not F2, which suggests correspondence between produced F1 and perceived vowel height but not between F2 and frontness.	t	\N
22352612	It is hypothesized that in sine-wave replicas of natural speech, lexical tone recognition would be severely impaired due to the loss of F0 information, but the linguistic information at the sentence level could be retrieved even with limited tone information. Forty-one native Mandarin-Chinese-speaking listeners participated in the experiments. Results showed that sine-wave tone-recognition performance was on average only 32.7% correct. However, sine-wave sentence-recognition performance was very accurate, approximately 92% correct on average. Therefore the functional load of lexical tones on sentence recognition is limited, and the high-level recognition of sine-wave sentences is likely attributed to the perceptual organization that is influenced by top-down processes.	t	\N
22352622	The combined effect of low-pass filtering (cut-off frequencies between 500 and 3000 Hz) and periodic interruptions (1.5 and 10 Hz) on speech intelligibility was investigated. When combined, intelligibility was lower than each manipulation alone, even in some conditions where there was no effect from a single manipulation (such as the fast interruption rate of 10 Hz). By using young normal-hearing listeners, potential suprathreshold deficits and aging effects that may occur due to hearing impairment were eliminated. Thus, the results imply that reduced audibility of high-frequency speech components may partially explain the reduced intelligibility of interrupted speech in hearing impaired persons.	t	\N
22352624	In the context of binaural audio rendering, choosing the best head-related transfer function (HRTF) for an individual from large databases poses several problems. This study proposes a method to reduce the size of a given HRTF database. Participants, 45 in total, were asked to rate the quality of binaural synthesis for 46 HRTFs. The lack of reciprocity in the ratings was noted. Results were used to create a perceptually optimized HRTF subset which satisfied all participants' judgments. The subset was validated using localization tests on a separate group of subjects with results showing reduced errors when subjects were given their best choice, rather than their worst choice HRTF.	t	\N
22353565	Exposure to synchronous but spatially discordant auditory and visual inputs produces adaptive recalibration of the respective localization processes, which manifest themselves in measurable aftereffects. Here we report two experiments that examined the time course of visual recalibration of apparent sound location in order to establish the build-up and dissipation of recalibration. In Experiment 1 participants performed a sound localization task before and during exposure to an auditory-visual discrepancy. In Experiment 2, participants performed a sound localization task before and after 60, 180 or 300 exposures to the discrepancy and aftereffects were measured across a series of post-adaptation sound localization trials. The results show that recalibration is very fast. Substantial aftereffects are obtained after only 18-24 exposures and asymptote appears to be reached between 60 and 180 exposures. The rate of adaptation was independent of the size of the discrepancy. The retention of the aftereffect was strong, as we found no dissipation, not even after as few as 60 exposure trials.	t	\N
22356177	Talkers hyperarticulate vowels when communicating with listeners that require increased speech intelligibility. Vowel hyperarticulation is said to be motivated by knowledge of the listener's linguistic needs because it typically occurs in speech to infants, foreigners and hearing-impaired listeners, but not to non-verbal pets. However, the degree to which vowel hyperarticulation is determined by feedback from the listener is surprisingly less well understood. This study examines whether mothers' speech input is driven by knowledge of the infant's linguistic competence, or by the infant's feedback cues. Specifically, we manipulated (i) mothers' knowledge of whether they believed their infants could hear them or not, and (ii) the audibility of the speech signal available to the infant (full or partial audibility, or inaudible). Remarkably, vowel hyperarticulation was completely unaffected by mothers' knowledge; instead, there was a reduction in the degree of hyperarticulation such that vowels were hyperarticulated to the greatest extent in the full audibility condition, there was reduced hyperarticulation in the partially audible condition, and no hyperarticulation in the inaudible condition. Thus, while it might be considered adaptive to hyperarticulate speech to the hearing-impaired adult or infant, when these two factors (infant and hearing difficulty) are coupled, vowel hyperarticulation is sacrificed. Our results imply that infant feedback drives talker behavior and raise implications for intervention strategies used with carers of hearing-impaired infants.	t	\N
22359341	To date, the underlying cognitive and neural mechanisms of absolute pitch (AP) have remained elusive. In the present fMRI study, we investigated verbal and tonal perception and working memory in musicians with and without absolute pitch. Stimuli were sine wave tones and syllables (names of the scale tones) presented simultaneously. Participants listened to sequences of five stimuli, and then rehearsed internally either the syllables or the tones. Finally participants indicated whether a test stimulus had been presented during the sequence. For an auditory stroop task, half of the tonal sequences were congruent (frequencies of tones corresponded to syllables which were the names of the scale tones) and half were incongruent (frequencies of tones did not correspond to syllables). Results indicate that first, verbal and tonal perception overlap strongly in the left superior temporal gyrus/sulcus (STG/STS) in AP musicians only. Second, AP is associated with the categorical perception of tones. Third, the left STG/STS is activated in AP musicians only for the detection of verbal-tonal incongruencies in the auditory stroop task. Finally, verbal labelling of tones in AP musicians seems to be automatic. Overall, a unique feature of AP appears to be the similarity between verbal and tonal perception.	t	\N
22361102	The ability of infants to develop recognition of a common stress pattern that is language specific has been tested mainly in trochaic languages with a strong-weak (SW) stress pattern. The goals of the present study were: (a) to test Hebrew-learning infants on their stress pattern preference in the Hebrew language, for which the weak-strong (WS) stress pattern is the common one, and (b) to test whether the infants would generalize any preference for the common stress pattern in Hebrew to English words, which belong to a different rhythmic class. Fifty-six 9-month-old Hebrew-learning infants were tested on their preference for SW and WS stress patterns using Hebrew and English bisyllabic words with the head-turn preference procedure. The infants showed preference for WS Hebrew words but not for SW English words. Hebrew-learning infants recognize the common stress pattern in their native language, supporting language-specific distributional learning by infants. This recognition, however, is not generalized to a foreign language with different prosodic characteristics.	t	\N
22361320	The aim of this study was to determine which level-dependent hearing aid digital signal-processing strategy (DSP) participants preferred when listening to music and/or performing a speech-in-noise task. Two receiver-in-the-ear hearing aids were compared: one using 32-channel adaptive dynamic range optimization (ADRO) and the other wide dynamic range compression (WDRC) incorporating dual fast (4 channel) and slow (15 channel) processing. The manufacturers' first-fit settings based on participants' audiograms were used in both cases. Results were obtained from 18 participants on a quick speech-in-noise (QuickSIN; Killion, Niquette, Gudmundsen, Revit, & Banerjee, 2004) task and for 3 music listening conditions (classical, jazz, and rock). Participants preferred the quality of music and performed better at the QuickSIN task using the hearing aids with ADRO processing. A potential reason for the better performance of the ADRO hearing aids was less fluctuation in output with change in sound dynamics. ADRO processing has advantages for both music quality and speech recognition in noise over the multichannel WDRC processing that was used in the study. Further evaluations of which DSP aspects contribute to listener preference are required.	t	\N
22362394	Schizophrenia is associated with deficits in the ability to perceive emotion based on tone of voice. The basis for this deficit remains unclear, however, and relevant assessment batteries remain limited. The authors evaluated performance in schizophrenia on a novel voice emotion recognition battery with well-characterized physical features, relative to impairments in more general emotional and cognitive functioning. The authors studied a primary sample of 92 patients and 73 comparison subjects. Stimuli were characterized according to both intended emotion and acoustic features (e.g., pitch, intensity) that contributed to the emotional percept. Parallel measures of visual emotion recognition, pitch perception, general cognition, and overall outcome were obtained. More limited measures were obtained in an independent replication sample of 36 patients, 31 age-matched comparison subjects, and 188 general comparison subjects. Patients showed statistically significant large-effect-size deficits in voice emotion recognition (d=1.1) and were preferentially impaired in recognition of emotion based on pitch features but not intensity features. Emotion recognition deficits were significantly correlated with pitch perception impairments both across (r=0.56) and within (r=0.47) groups. Path analysis showed both sensory-specific and general cognitive contributions to auditory emotion recognition deficits in schizophrenia. Similar patterns of results were observed in the replication sample. The results demonstrate that patients with schizophrenia show a significant deficit in the ability to recognize emotion based on tone of voice and that this deficit is related to impairment in detecting the underlying acoustic features, such as change in pitch, required for auditory emotion recognition. This study provides tools for, and highlights the need for, greater attention to physical features of stimuli used in studying social cognition in neuropsychiatric disorders.	t	\N
22366255	The aim of this study was to test a multimodal event-related potential (ERP) paradigm in chronic solvent encephalopathy (CSE) to develop a sensitive method for the clinical diagnostics to CSE. The study comprised 11 CSE patients and 13 healthy controls. We used three tasks: an auditory odd-ball (AUD), a visual detection (VIS), and a recognition memory (MEM) task. The auditory and visual stimuli were presented in single- and dual-task conditions. The auditory P300 amplitude in single-task condition was smaller in the patient group than in the control group at the parietal (Pz) but not at the frontal midline electrode location. The auditory P300 response in the dual task condition AUD+VIS was unrecognizable in 8 of 11 patients and in 1 of 13 controls and in the AUD+MEM condition in 10 of 11 patients and in 4 of 13 controls. In the AUD+MEM condition, the auditory P300 amplitude at Pz was smaller in the patient group than in the control group. Reaction time for auditory stimuli in both dual conditions as well as for visual stimuli in AUD+VIS condition were in the patient group prolonged. The ERP results indicate that CSE patients present with slowed performance speed and difficulties in allocation of attention. Based on ERP results, the disturbance in brain activity in CSE seems to affect posterior aspects of the frontoparietal continuity. The multimodal paradigm seems promising as a tool for the clinical diagnostics of CSE.	t	\N
22367093	A major focus of recent attempts to enhance cochlear implant (CI) systems has been to increase the rate at which pulses are delivered to the electrode array. One basis for these attempts has been the expectation that faster stimulation rates would lead to an enhanced representation of temporal modulation information. However, there is recent physiological and behavioral evidence to suggest that the reverse may be the case. Here, the effects of stimulation rate on the perception of amplitude modulation were assessed using both modulation detection and modulation frequency discrimination tasks for a range of pulse rates extending considerably higher than the highest rate tested in previous studies and for different speech-relevant modulation frequencies. Detection of sinusoidal amplitude modulation was assessed in five CI users using monopolar pulse trains presented to a single electrode at rates of 482, 723, 1447, 2894, and 5787 pulses per second (pps). Adaptive procedures were used to find the minimal detectable modulation depth at modulation frequencies of 10 and 100 Hz and at carrier levels of 25%, 50%, and 75% of the electrode's dynamic range. Discrimination of modulation frequency was examined for the same range of pulse rates for the highest carrier level. Similar adaptive procedures determined the minimum increase in modulation frequency that could be detected relative to reference modulation frequencies of 10, 100, and 200 Hz. In both tasks, level roving was implemented to minimize possible loudness cues. Consistent with previous evidence, modulation detection thresholds were better for higher carrier levels and lower modulation frequencies. When modulation depth at threshold was expressed in terms of the ratio of the depth of the modulation and the carrier level in dB (i.e., 20 log m), performance was significantly better at lower pulse rates. However, when modulation depth was expressed relative to dynamic range, the effect of pulse rate was no longer significant, reflecting the fact that dynamic range increases with pulse rate. Modulation frequency discrimination clearly worsened with increasing modulation frequency, but there was no significant effect of pulse rate. In contrast to some recent evidence, no clearly harmful effect of higher pulse rates on modulation perception was found. However, even with very fast stimulation rates, tested over a wide range of modulation frequencies and with two different tasks, there is no evidence of benefit from faster stimulation rates in the perception of amplitude modulation.	t	\N
22384211	Recent behavioral neuroscience research revealed that elementary reactive behavior can be improved in the case of cross-modal sensory interactions thanks to underlying multisensory integration mechanisms. Can this benefit be generalized to an ongoing coordination of movements under severe physical constraints? We choose a juggling task to examine this question. A central issue well-known in juggling lies in establishing and maintaining a specific temporal coordination among balls, hands, eyes and posture. Here, we tested whether providing additional timing information about the balls and hands motions by using external sound and tactile periodic stimulations, the later presented at the wrists, improved the behavior of jugglers. One specific combination of auditory and tactile metronome led to a decrease of the spatiotemporal variability of the juggler's performance: a simple sound associated to left and right tactile cues presented antiphase to each other, which corresponded to the temporal pattern of hands movement in the juggling task. A contrario, no improvements were obtained in the case of other auditory and tactile combinations. We even found a degraded performance when tactile events were presented alone. The nervous system thus appears able to integrate in efficient way environmental information brought by different sensory modalities, but only if the information specified matches specific features of the coordination pattern. We discuss the possible implications of these results for the understanding of the neuronal integration process implied in audio-tactile interaction in the context of complex voluntary movement, and considering the well-known gating effect of movement on vibrotactile perception.	t	\N
22401989	We used structural magnetic resonance imaging (MRI) and voxel based morphometry (VBM) to investigate whether the efficiency of word processing in the non-native language (lexical efficiency) and the number of non-native languages spoken (2+ versus 1) were related to local differences in the brain structure of bilingual and multilingual speakers. We dissociate two different correlates for non-native language processing. Firstly, multilinguals who spoke 2 or more non-native languages had higher grey matter density in the right posterior supramarginal gyrus compared to bilinguals who only spoke one non-native language. This is interpreted in relation to previous studies that have shown that grey matter density in this region is related to the number of words learnt in bilinguals relative to monolinguals and in monolingual adolescents with high versus low vocabulary. Our second result was that, in bilinguals, grey matter density in the left pars opercularis was positively related to lexical efficiency in second language use, as measured by the speed and accuracy of lexical decisions and the number of words produced in a timed verbal fluency task. Grey matter in the same region was also negatively related to the age at which the second language was acquired. This is interpreted in terms of previous findings that associated the left pars opercularis with phonetic expertise in the native language.	t	\N
22406661	Detection performance is impaired for a visual target presented in an apparent motion (AM) trajectory, and this AM interference weakens when orientation information is inconsistent between the target and AM stimuli. These indicate that the target is perceptually suppressed by internal object representations of AM stimuli established along the AM trajectory. Here, we showed that transient sounds presented together with AM stimuli could enhance the magnitude of AM interference. Furthermore, this auditory effect attenuated when frequencies of the sounds were inconsistent during AM. We also confirmed that the sounds wholly elevated the magnitude of AM interference irrespective of the inconsistency in orientation information between the target and AM stimuli when the saliency of the sounds was maintained. These results suggest that sounds can contribute to the robust establishment and spatiotemporal maintenance of the internal object representation of an AM stimulus.	t	\N
22410729	To compare temporal aspects of peripheral neural responses and central auditory perception between groups of younger adult and elderly cochlear implant users. Cohort study. Academic hospital and cochlear implant center. Adult cochlear implant users aged 28 to 57 years in the younger group (n = 5) and 61 to 89 years (n = 9) in the elderly group. All subjects used Advanced Bionics devices. Diagnostic. Time constants of neural (i.e., electrically evoked compound action potentials [ECAPs]) and perceptual recovery from forward masking. Interstimulus intervals (ISIs) were varied in both experiments. ECAP recovery rates were equivalent between groups, and no correlation was found between ECAP recovery and age. No correlations were found between ECAP recovery and speech perception. Psychophysical recovery was significantly slower in the elderly compared with the younger subjects (p < 0.0005), with a significant effect of age (R2 = 0.70, p < 0.0005). At the longest ISI (240 ms), elderly subjects experienced a mean maximum threshold shift of 35.2% (relative to 1 ms ISI) versus 14.8% for younger subjects. There was a significant positive relationship between psychophysical recovery and consonant-nucleus-consonant word scores (R2 = 0.62, p < 0.001), although no relationship was found with Hearing in Noise Test sentences. These findings suggest that difficulties observed in speech perception by elderly CI users may be due to age-related changes in the central rather than peripheral auditory system. With further study, these results may provide information to allow clinicians to assess patients' temporal processing abilities and facilitate setting program parameters that will maximize their auditory perceptual experience with a cochlear implant.	t	\N
22411592	Gender priming studies have demonstrated facilitation of noun production following pre-activation of a target noun's grammatical gender. Findings provide support for models in which syntactic information relating to words is stored within the lexicon and activated during lexical retrieval. Priming effects are observed in the context of determiner plus noun phrase production. Few studies demonstrate gender priming effects in bare noun production (i.e., nouns in isolation). We investigated the effects of English determiner primes on bare mass and count noun production. In two experiments, participants named pictures after exposure to primes involving congruent, incongruent and neutral determiners. Facilitation of noun production by congruent and neutral determiner primes was found in both experiments. The results suggest that noun phrase syntax is activated in lexical retrieval, even when not explicitly required for production. Post hoc analysis of the relative frequency of congruent and incongruent prime-target pairs provides support for a frequency-based interpretation of the data.	t	\N
22423705	The frequency response and sensitivity of the ER-3A and ER-2 insert earphones are measured in the occluded-ear simulator using three ear canal extensions. Compared to the other two extensions, the DB 0370 (Brüel & Kjær), which is recommended by the international standards, introduces a significant resonance peak around 4500 Hz. The ER-3A has an amplitude response like a band-pass filter (1400 Hz, 6 dB/octave -4000 Hz, -36 dB/octave), and a group delay with "ripples" of up to ±0.5 ms, while the ER-2 has an amplitude response, and a group delay which are flat and smooth up to above 10000 Hz. Both earphones are used to record auditory brainstem responses, ABRs, from 22 normal-hearing ears in response to two chirps and a click at levels from 20 to 80 dB nHL. While the click-ABRs are slightly larger for ER-2 than for ER-3A, the chirp-ABRs are much larger for ER-2 than for ER-3A at levels below 60 dB nHL. With a simulated amplitude response of the ER-3A and the smooth group delay of the ER-2 it is shown that the increased chirp-ABR amplitude with the ER-2 is caused by its broader amplitude response and not by its smoother group delay.	t	\N
22426232	Rapid information processing in the human brain is vital to survival in a highly dynamic environment. The key tool humans use to exchange information is spoken language, but the exact speed of the neuronal mechanisms underpinning speech comprehension is still unknown. Here we investigate the time course of neuro-lexical processing by analyzing neuromagnetic brain activity elicited in response to psycholinguistically and acoustically matched groups of words and pseudowords. We show an ultra-early dissociation in cortical activation elicited by these stimulus types, emerging ∼50 ms after acoustic information required for word identification first becomes available. This dissociation is the earliest brain signature of lexical processing of words so far reported, and may help explain the evolutionary advantage of human spoken language.	t	\N
22436117	Assessment of interaural asymmetry (IA) on dichotic listening tests becomes challenging when accuracy on one or both ears are at ceiling. Cognitive task demands are often increased to avoid this situation, raising the possibility that non-auditory-specific processes will further influence IA estimation. To investigate the utility of low-pass filtered dichotic speech stimuli (dichotic filtered words [DFWs]) as a potential auditory-specific technique for increasing task difficulty in the dichotic listening paradigm. A prospective experimental study investigating differences in IA observed for a group of young adults on three dichotic listening tasks differentiated on the basis of stimulus type (unfiltered words, DFWs) and/or mode of test administration (divided attention [DIV]; directed attention [DIR]). Thirty-two adults between 18 and 34 yr of age participated in the study. Accuracy scores on each ear and normalized values of IA (via laterality index) served as dependent measures evaluated in a repeated-measures ANOVA design. Correlation and regression analyses were carried out to investigate potential relationships between participants' behavioral accuracy to experimental stimuli and the magnitude of observed IA. Behavioral profiles of IA were constructed for each participant. Using a traditional DIV test mode, DFW stimuli produced a significantly larger right-ear advantage (REA) as compared to unfiltered speech. The magnitude of the average REA to DFW stimuli was not statistically different between DIV and DIR test modes for the group. Inspection of individual performances, however, revealed that out of the participants who demonstrated meaningful IA during DIV, approximately half produced the same or greater amount of IA during DIR whereas half showed a relative decrease. Participants' accuracy to left-sided stimuli was most related to the magnitude of observed IA across tasks. The utility of low-pass filtered speech may be helpful in the evaluation of IA obtained during dichotic testing.	t	\N
22436413	The aim of this study was to report speech performance in quiet and in noise, sound localization with cochlear implanted children bilaterally. Their performances were compared also in unilateral conditions. In addition, speech and language evaluation was analyzed. Twenty-three children implanted with Neurelec Digisonic SP devices in 3 tertiary centres were tested on a battery of speech perception tests in quiet and in noise. Localization was assessed by lateralization tasks (90° and 30°). Progress in speech and language development and subjective assessment of benefit were assessed using several rating scales and questionnaires (categories of auditory perception, speech intelligibility rating, family participating rating scale). Children scored better when tested in bilateral conditions rather than in unilateral conditions. In quiet, the mean scores for the poorer and better side were 52% and 73%, respectively. In the bilateral condition, the mean score increased to 83%. In noise, the mean scores were 39% and 57% respectively, which increased to a mean of 70% in the bilateral condition. Nine children (<9 years) completed the ±90° lateralization task. For both unilateral conditions performance was not significantly different from chance level. In the bilateral condition, the mean score was 86%. The ±30° lateralization score was completed by eight of the older children (>9 years). The scores in the unilateral conditions were closed to chance level, but significantly better in the bilateral condition (mean of 86%). Performances in bilateral conditions were significantly better than in unilateral conditions on speech perception in quiet and in noise. Localization was significantly better when tested in the bilateral condition for ±90° lateralization task for the younger children and the ±30° task for the older children. All these results supported the hypothesis than bilateral cochlear implantation is more beneficial than unilateral implantation in children.	t	\N
22436438	It has been demonstrated that written and spoken language processing are tightly linked. Here we focus on the development of this relationship at the time children start reading and writing. We hypothesize that the newly acquired knowledge about graphemes shapes lexical access in neural spoken word recognition. A group of preliterate children (six years old) and two groups of beginning readers (six and eight years old) were tested in a spoken word identification task. Using word onset priming we compared behavioural and neural facilitation for target words in identical prime-target pairs (e.g., mon-monster) and in prime target pairs that varied in the first speech sound (e.g., non-monster, Variation condition). In both groups of beginning readers priming was less effective in the Variation condition than in the Identity condition. This was indexed by less behavioural facilitation and enhanced P350 amplitudes in the event related potentials (ERPs). In the group of preliterate children, by contrast, both conditions did not differ. Together these results reveal that lexical access in beginning readers is based on more acoustic detail than lexical access in preliterate children. The results are discussed in the light of bidirectional speech and print interactions in readers.	t	\N
22445915	Developmental dyslexia is associated with impaired speech-in-noise perception. The goal of the present research was to further characterize this deficit in dyslexic adults. In order to specify the mechanisms and processing strategies used by adults with dyslexia during speech-in-noise perception, we explored the influence of background type, presenting single target-words against backgrounds made of cocktail party sounds, modulated speech-derived noise or stationary noise. We also evaluated the effect of three listening configurations differing in terms of the amount of spatial processing required. In a monaural condition, signal and noise were presented to the same ear while in a dichotic situation, target and concurrent sound were presented to two different ears, finally in a spatialised configuration, target and competing signals were presented as if they originated from slightly differing positions in the auditory scene. Our results confirm the presence of a speech-in-noise perception deficit in dyslexic adults, in particular when the competing signal is also speech, and when both signals are presented to the same ear, an observation potentially relating to phonological accounts of dyslexia. However, adult dyslexics demonstrated better levels of spatial release of masking than normal reading controls when the background was speech, suggesting that they are well able to rely on denoising strategies based on spatial auditory scene analysis strategies.	t	\N
22465105	Increased early identification of hearing loss has led to infants younger than 24 months of age being implanted with cochlear implants. The objective of this study was to assess early speech development in children implanted with a cochlear implant before 24 months of age using the German questionnaire Elternfragebogen für Risikokinder 2 (ELFRA-2), and to compare these results to normative data of the ELFRA-2 in order to determine any performance differences. Two groups of children were included in this study. The first group included 6 children with a mean age at implantation of 11 months (range: 8-14 months). These children were tested by their parents or caretakers with the ELFRA-2 at the chronological age of 24 months. The second group included 9 children with a mean age at implantation of 13 months (range: 6-20 months) who were tested with the ELFRA-2 after 24 months of cochlear implant use. Comparison analyses of children tested with the ELFRA-2 demonstrated a statistically significant difference in all ELFRA-2 scales between children with cochlear implants (CIs) at the chronological age of 24 months and the norm group (productive vocabulary: p=0.002; syntax: p=0.003; and morphology: p<0.001), and no significant difference between children with CIs at 24 months of device use and the norm group in all scales (productive vocabulary: p=0.335; syntax: p=0.965; and morphology: p=0.304). Children implanted before 24 months of age reach a speech production level after 24 months of device use that is comparable to that of their normal hearing peers.	t	\N
22477057	Response interference (or response conflict) refers to the phenomenon whereby response times to a target stimulus are longer in the presence of distractor stimuli that indicate contrary motor responses. Response interference has been observed even when the distractor stimuli cannot be discriminated above chance levels. These results raise the question of whether response interference might be driven automatically by the physical distractor stimuli, independently of one's subjective perception of the distractors. Using a modified version of the Eriksen flanker task, we applied metacontrast masks to the flanker stimuli and measured their subjective visibility after each trial. We found converging lines of evidence that the subjective perception of flankers contributed to response interference, over and above the contribution of automatic processing of the stimulus itself. A factorial analysis revealed that the objective, physical congruency of target and flankers and the subjective, perceptual congruency of target and flankers make additive, noninteracting contributions to target response interference, suggesting that the two interference effects originate from independent levels or stages of cognitive processing.	t	\N
22480024	The majority of research on prosody in conversation to date has focused on exploring the role of individual prosodic features, such as certain types of pitch accent, pitch register or voice quality, for the accomplishment of specified social actions. From this research the picture emerges that when it comes to the implementation of specific actions at specific sequential locations conversationalists employ prosodic features systematically, but also with considerable variation, and indeed flexibility. This paper suggests a further line of enquiry, which pursues a wider, more fundamental role of prosody for interaction, and which does not focus on individual prosodic practices or features, but on participants' collaborative use of prosody for the implementation of one of the most basic interactional decisions: whether to continue a previously established action trajectory, or whether to start a new one. The data and findings of recent research make it clear that prosody, and in fact talk-in-interaction as such, is not appropriately defined by reference to individual features, speakers, locations and actions alone, but must be approached as a resource and negotiating strategy for social interaction. Prosody, therefore, must be described according to its role for both the accomplishment, and the coordination of actions across turns and participants.	t	\N
22484251	It has been argued that the human visual system is optimized for identification of broadband objects embedded in stimuli possessing orientation averaged power spectra fall-offs that obey the 1/f(β) relationship typically observed in natural scene imagery (i.e., β=2.0 on logarithmic axes). Here, we were interested in whether individual spatial channels leading to recognition are functionally optimized for narrowband targets when masked by noise possessing naturalistic image statistics (β=2.0). The current study therefore explores the impact of variable β noise masks on the identification of narrowband target stimuli ranging in spatial complexity, while simultaneously controlling for physical or perceived differences between the masks. The results show that β=2.0 noise masks produce the largest identification thresholds regardless of target complexity, and thus do not seem to yield functionally optimized channel processing. The differential masking effects are discussed in the context of contrast gain control.	t	\N
22485043	To examine the role of morphology in verbal working memory. Forty nine children, all native speakers of Arabic from the same region and of the same dialect, performed a Listening Word Span Task, whereby they had to recall Arabic uninflected words (i.e., base words), inflected words with regular (possessive) morphology, or inflected words with irregular (broken plural) morphology. Each of these words was at the end of a sentence (henceforth, target word). The participant's task was to listen to a series of sentences and then recall the target words. Recall of inflected words was significantly poorer than uninflected words, and recall of words with regular morphology was significantly poorer than recall of words with irregular morphology. These findings, albeit preliminary, suggest a role of morphology in verbal working memory. They also suggest that, at least in Arabic, regular morphological forms are decomposed into their component elements and hence impose an extra load on the central executive and episodic buffer components of working memory. Furthermore, in concert with findings from other studies, they suggest that the effect of morphology on working memory is probably language-specific. The clinical implications of the present findings are addressed.	t	\N
22488914	The relationship between the evoked responses (ERPs/ERFs) and the event-related changes in EEG/MEG power that can be observed during sentence-level language comprehension is as yet unclear. This study addresses a possible relationship between MEG power changes and the N400m component of the event-related field. Whole-head MEG was recorded while subjects listened to spoken sentences with incongruent (IC) or congruent (C) sentence endings. A clear N400m was observed over the left hemisphere, and was larger for the IC sentences than for the C sentences. A time-frequency analysis of power revealed a decrease in alpha and beta power over the left hemisphere in roughly the same time range as the N400m for the IC relative to the C condition. A linear regression analysis revealed a positive linear relationship between N400m and beta power for the IC condition, not for the C condition. No such linear relation was found between N400m and alpha power for either condition. The sources of the beta decrease were estimated in the LIFG, a region known to be involved in semantic unification operations. One source of the N400m was estimated in the left superior temporal region, which has been related to lexical retrieval. We interpret our data within a framework in which beta oscillations are inversely related to the engagement of task-relevant brain networks. The source reconstructions of the beta power suppression and the N400m effect support the notion of a dynamic communication between the LIFG and the left superior temporal region during language comprehension.	t	\N
22501035	Masked detection thresholds can often be improved by introducing coherent masker amplitude modulation across frequency, a phenomenon referred to as comodulation masking release (CMR). While CMR can be large for detection, it is smaller for supra-threshold tasks, such as intensity discrimination. In this experiment, frequency discrimination for a 1000-Hz tone near threshold was found to be poorer in an amplitude-modulated than a steady bandpass noise. These results parallel previous findings for intensity discrimination. Although this study examined the relatively simple task of frequency discrimination, the results may have implications for more complex tasks, such as speech recognition in fluctuating noise.	t	\N
22502488	Simultaneous, on-frequency masking is commonly assumed to be linear with increasing noise intensity. However, some evidence suggests that, expressed in terms of signal-to-noise ratio changes with background level changes, masking slopes can vary from 0 dB/dB. These results and evidence from a large sample of subjects with normal and impaired hearing demonstrate level-dependent changes in masking, large individual differences in masking among subjects with similar thresholds in quiet, and significant correlations of masking slope with other estimates of auditory function measured in the same backgrounds.	t	\N
22503556	Several studies show that the perception of occlusion may affect various aspects of motion perception. Here we present data indicating that occlusion cues also influence the visual interpolation of sampled motion. Normally, sampled motion stimuli are perceived as less smooth and jerkier when the spatial gaps between successive presentations of the "moving" target stimulus increase. Adding surfaces occluding the spatial gaps, however, we found that the perceived smoothness of motion was not only better, but also independent of the gap width. We argue that this effect occurs because the visual system attributes the interruptions in the motion path to occlusion rather than to the moving object itself.	t	\N
22503904	Motion-induced blindness (MIB) is an illusory phenomenon, in which a static target surrounded by moving distracters is perceived to disappear. We determined the electrocorticographic (ECoG) correlates of MIB. While undergoing intracranial ECoG recording, a patient with focal epilepsy was instructed to report the transitions of a visual target, which was designed to illusorily or physically disappear and reappear. We then determined the neural modulations associated with illusory and physical transitions of the target. We also tested whether the phase of local delta activity could predict exclusively illusory transitions. High-gamma activity at 80-150 Hz was attenuated in the fusiform region prior to the reports of illusory and real visual target disappearance. Conversely, such high-gamma activity was augmented prior to the report of real target reappearance. Exclusively around illusory disappearance but not around real one, the delta phases in the fusiform region showed a highly skewed distribution with preference of the negative peak. Neuronal modulations in the fusiform region may be involved in visual awareness, while spontaneous fluctuations of neural states entrained on delta rhythm may be involved in generation of MIB. Our study increases our understanding of the mechanisms of visual awareness.	t	\N
22505222	Scholars have documented similarities in the way voice and music convey emotions. By using functional magnetic resonance imaging (fMRI) we explored whether these similarities imply overlapping processing substrates. We asked participants to trace changes in either the emotion or pitch of vocalizations and music using a joystick. Compared to music, vocalizations more strongly activated superior and middle temporal cortex, cuneus, and precuneus. However, despite these differences, overlapping rather than differing regions emerged when comparing emotion with pitch tracing for music and vocalizations, respectively. Relative to pitch tracing, emotion tracing activated medial superior frontal and anterior cingulate cortex regardless of stimulus type. Additionally, we observed emotion specific effects in primary and secondary auditory cortex as well as in medial frontal cortex that were comparable for voice and music. Together these results indicate that similar mechanisms support emotional inferences from vocalizations and music and that these mechanisms tap on a general system involved in social cognition.	t	\N
22509533	It has been proposed that OAEs be classified not on the basis of the stimuli used to evoke them, but on the mechanisms that produce them (Shera and Guinan, 1999). One branch of this taxonomy focuses on a coherent reflection model and explicitly describes interrelationships between spontaneous emissions (SOAEs) and stimulus-frequency emissions (SFOAEs). The present study empirically examines SOAEs and SFOAEs from individual ears within the context of model predictions, using a low stimulus level (20 dB SPL) to evoke SFOAEs. Emissions were recorded from ears of normal-hearing young adults, both with and without prominent SOAE activity. When spontaneous activity was observed, SFOAEs demonstrated a localized increase about the SOAE peaks. The converse was not necessarily true though, i.e., robust SFOAEs could be measured where no SOAE peaks were observed. There was no significant difference in SFOAE phase-gradient delays between those with and without observable SOAE activity. However, delays were larger for a 20 dB SPL stimulus level than those previously reported for 40 dB SPL. The total amount of SFOAE phase accumulation occurring between adjacent SOAE peaks tended to cluster about an integral number of cycles. Overall, the present data appear congruous with predictions stemming from the coherent reflection model and support the notion that such comparisons ideally are made with emissions evoked using relatively lower stimulus levels.	t	\N
22509989	This study investigated the effect of short-term visual deprivation on auditory steady-state response (ASSR) to amplitude-modulated tones. Magnetoencephalography data were acquired while subjects performed an auditory detection task under both monaural and dichotic presentation conditions. Analyses were performed on the spectral power, mean amplitudes and dipole positions of the ASSR at the onset of blindfolding, as well as after 2, 4 and 6 h of visual deprivation. Results show a modulation of the spectral power of the ASSR at the frequencies that were present in the stimulus after 6 h of sensory deprivation, and this was especially true for the dichotic condition. Moreover, participants showed two spectral peaks in the occipital cortex at the end of the visual deprivation period, a phenomenon normally observed in the auditory cortex. Our results shed light not only on the timeline associated with short-term crossmodal recruitment of input-deprived sensory cortices but also demonstrate that the visual cortex can display auditory cortex-like functioning in response to the ASSR. Importantly, our results also highlight the importance of taking into consideration individual differences when investigating crossmodal plastic phenomena. Indeed, the occipital spectral peaks were only observed in half the subjects following short-term deprivation.	t	\N
22511853	Several studies have reported optimal population decoding of sensory responses in two-alternative visual discrimination tasks. Such decoding involves integrating noisy neural responses into a more reliable representation of the likelihood that the stimuli under consideration evoked the observed responses. Importantly, an ideal observer must be able to evaluate likelihood with high precision and only consider the likelihood of the two relevant stimuli involved in the discrimination task. We report a new perceptual bias suggesting that observers read out the likelihood representation with remarkably low precision when discriminating grating spatial frequencies. Using spectrally filtered noise, we induced an asymmetry in the likelihood function of spatial frequency. This manipulation mainly affects the likelihood of spatial frequencies that are irrelevant to the task at hand. Nevertheless, we find a significant shift in perceived grating frequency, indicating that observers evaluate likelihoods of a broad range of irrelevant frequencies and discard prior knowledge of stimulus alternatives when performing two-alternative discrimination.	t	\N
22513340	Both functional magnetic resonance imaging (fMRI) and event-related brain potential (ERP) studies have shown that verbal working memory plays an important role during sentence processing. There is growing evidence from outside of sentence processing that human alpha oscillations (7-13 Hz) play a critical role in working memory. This study aims to link this to the sentence processing domain. Time-frequency analyses and source localization were performed on electroencephalography (EEG) data that were recorded during the processing of auditorily presented sentences involving either a short or a long distance between an argument (subject or object) and the respective sentence-final verb. We reasoned that oscillatory activity in the alpha band should increase during sentences with longer argument-verb distances, since decreased temporal proximity should result in increased memory demands. When verbal working memory-intensive long-dependency sentences were compared to short-dependency sentences, a sustained oscillatory enhancement at 10 Hz was found during storage prior to the sentence-final verb, turning into a transient power increase in the beta band (13-20 Hz) at the sentence-final verb. The sources of the alpha oscillations were localized to bilaterally occipital and left parietal cortices. Only the source activity in the left parietal cortex was negatively correlated with verbal working memory abilities. These findings indicate that the parsimonious role of alpha oscillations in domain-general working memory can be extended to language, that is, sentence processing. We suggest that the function of left parietal cortex underlying verbal working memory storage during sentence processing is to inhibit the premature release of verbal information that will subsequently be integrated.	t	\N
22517303	Professional and community concerns about the potentially dangerous noise levels for common leisure activities has led to increased interest on providing hearing health information to participants. However, noise reduction programmes aimed at leisure activities (such as music listening) face a unique difficulty. The noise source that is earmarked for reduction by hearing health professionals is often the same one that is viewed as pleasurable by participants. Furthermore, these activities often exist within a social setting, with additional peer influences that may influence behavior. The current study aimed to gain a better understanding of social-based factors that may influence an individual's motivation to engage in positive hearing health behaviors. Four hundred and eighty-four participants completed questionnaires examining their perceptions of the hearing risk associated with listening to music listening and asking for estimates of their own and their peer's music listening behaviors. Participants were generally aware of the potential risk posed by listening to personal stereo players (PSPs) and the volumes likely to be most dangerous. Approximately one in five participants reported using listening volumes at levels perceived to be dangerous, an incidence rate in keeping with other studies measuring actual PSP use. However, participants showed less awareness of peers' behavior, consistently overestimating the volumes at which they believed their friends listened. Misperceptions of social norms relating to listening behavior may decrease individuals' perceptions of susceptibility to hearing damage. The consequences of hearing health promotion are discussed, along with suggestions relating to the development of new programs.	t	\N
22521874	Although activity in premotor and motor cortices is commonly observed in neuroimaging studies of spoken language processing, the degree to which this activity is an obligatory part of everyday speech comprehension remains unclear. We hypothesised that rather than being a unitary phenomenon, the neural response to speech perception in motor regions would differ across listeners as a function of individual cognitive ability. To examine this possibility, we used functional magnetic resonance imaging (fMRI) to investigate the neural processes supporting speech perception by comparing active listening to pseudowords with matched tasks that involved reading aloud or repetition, all compared to acoustically matched control stimuli and matched baseline tasks. At a whole-brain level there was no evidence for recruitment of regions in premotor or motor cortex during speech perception. A focused region of interest analysis similarly failed to identify significant effects, although a subset of regions approached significance, with notable variability across participants. We then used performance on a battery of behavioural tests that assessed meta-phonological and verbal short-term memory abilities to investigate the reasons for this variability, and found that individual differences in particular in low phonotactic probability pseudoword repetition predicted participants' neural activation within regions in premotor and motor cortices during speech perception. We conclude that normal listeners vary in the degree to which they recruit premotor and motor cortex as a function of short-term memory ability. This is consistent with a resource-allocation approach in which recruitment of the dorsal speech processing pathway depends on both individual abilities and specific task demands.	t	\N
22522927	Humans possess a remarkable ability to attend to a single speaker's voice in a multi-talker background. How the auditory system manages to extract intelligible speech under such acoustically complex and adverse listening conditions is not known, and, indeed, it is not clear how attended speech is internally represented. Here, using multi-electrode surface recordings from the cortex of subjects engaged in a listening task with two simultaneous speakers, we demonstrate that population responses in non-primary human auditory cortex encode critical features of attended speech: speech spectrograms reconstructed based on cortical responses to the mixture of speakers reveal the salient spectral and temporal features of the attended speaker, as if subjects were listening to that speaker alone. A simple classifier trained solely on examples of single speakers can decode both attended words and speaker identity. We find that task performance is well predicted by a rapid increase in attention-modulated neural selectivity across both single-electrode and population-level cortical responses. These findings demonstrate that the cortical representation of speech does not merely reflect the external acoustic environment, but instead gives rise to the perceptual aspects relevant for the listener's intended goal.	t	\N
22524352	In various paradigms of modern neurosciences of music, experts of Western classical music have displayed superior brain architecture when compared with individuals without explicit training in music. In this paper, we show that chord violations embedded in musical cadences were neurally processed in a facilitated manner also by musicians trained in Finnish folk music. This result, obtained by using early right anterior negativity (ERAN) as an index of harmony processing, suggests that tonal processing is advanced in folk musicians by their long-term exposure to both Western and non-Western music.	t	\N
22524356	Recent research has shown that music training enhances music-related sensorimotor associations, such as the relationship between a key press on the keyboard and its associated musical pitch (auditory feedback). Such results suggest that the role of auditory feedback in performance may be based on learned associations that are task specific. Here, results from various studies will be presented that suggest that the real state of affairs is more complex. Several recent studies have shown similar effects of altered auditory feedback during piano performance for pianists and individuals with no piano training. Other recent research suggests dramatic differences between pianists and nonmusicians concerning the influence of auditory feedback on melody switching that suggest greater influence of auditory feedback among nonmusicians than pianists. Taken together, results suggest that musical training refines preexisting sensorimotor associations.	t	\N
22524361	This paper examines the role of mental imagery in music performance. Self-reports by musicians, and various other sources of anecdotal evidence, suggest that covert auditory, motor, and/or visual imagery facilitate multiple aspects of music performance. The cognitive and motor mechanisms that underlie such imagery include working memory, action simulation, and internal models. Together these mechanisms support the generation of anticipatory images that enable thorough action planning and movement execution that is characterized by efficiency, temporal precision, and biomechanical economy. In ensemble performance, anticipatory imagery may facilitate interpersonal coordination by enhancing online predictions about others' action timing. Overlap in brain regions subserving auditory imagery and temporal prediction is consistent with this view. It is concluded that individual differences in anticipatory imagery may be a source of variation in expressive performance excellence and the quality of ensemble cohesion. Engaging in effortful musical imagery is therefore justified when artistic perfection is the goal.	t	\N
22524378	Congenital amusia is a neurodevelopmental disorder of musical perception and production. Much research has focused on characterizing the deficits within this special population; however, it is also important from both a psychological and educational perspective to determine which aspects of the disorder may be subject to change because this will also constrain theorizing about the nature of the disorder, as well as facilitating possible future remediation programs. In this small-scale study, a professional singing teacher used a broad-brush intervention approach with five individuals diagnosed with congenital amusia. The compensatory elements were designed to enhance vocal efficiency and health, singing technique, musical understanding, pitch perception, and production. Improvements were observed in most individuals in perception, indexed via the Montreal Battery for the Evaluation of Amusia scale subtest and in the vocal performance of familiar songs. The workshop setting gave a unique opportunity for observation and discussion to inform further investigations of this disorder.	t	\N
22525854	When people experience an unchanging sensory input for a long period of time, their perception tends to switch stochastically and unavoidably between alternative interpretations of the sensation; a phenomenon known as perceptual bi-stability or multi-stability. The huge variability in the experimental data obtained in such paradigms makes it difficult to distinguish typical patterns of behaviour, or to identify differences between switching patterns. Here we propose a new approach to characterising switching behaviour based upon the extraction of transition matrices from the data, which provide a compact representation that is well-understood mathematically. On the basis of this representation we can characterise patterns of perceptual switching, visualise and simulate typical switching patterns, and calculate the likelihood of observing a particular switching pattern. The proposed method can support comparisons between different observers, experimental conditions and even experiments. We demonstrate the insights offered by this approach using examples from our experiments investigating multi-stability in auditory streaming. However, the methodology is generic and thus widely applicable in studies of multi-stability in any domain.	t	\N
22534571	Event Related Potentials (ERPs) were recorded from Spanish-English bilinguals (N=10) to test pre-attentive speech discrimination in two language contexts. ERPs were recorded while participants silently read magazines in English or Spanish. Two speech contrast conditions were recorded in each language context. In the phonemic in English condition, the speech sounds represented two different phonemic categories in English, but represented the same phonemic category in Spanish. In the phonemic in Spanish condition, the speech sounds represented two different phonemic categories in Spanish, but represented the same phonemic categories in English. Results showed pre-attentive discrimination when the acoustics/phonetics of the speech sounds match the language context (e.g., phonemic in English condition during the English language context). The results suggest that language contexts can affect pre-attentive auditory change detection. Specifically, bilinguals' mental processing of stop consonants relies on contextual linguistic information.	t	\N
22537033	To develop, optimize, and evaluate a new Spanish sentence test in noise. The test comprises a basic matrix of ten names, verbs, numerals, nouns, and adjectives. From this matrix, test lists of ten sentences with an equal syntactical structure can be formed at random, with each list containing the whole speech material. The speech material represents the phoneme distribution of the Spanish language. The test was optimized for measuring speech reception thresholds (SRTs) in noise by adjusting the presentation levels of the individual words. Subsequently, the test was evaluated by independent measurements investigating the training effects, the comparability of test lists, open-set vs. closed-set test format, and performance of listeners of different Spanish varieties. In total, 68 normal-hearing native Spanish-speaking listeners. SRTs measured using an adaptive procedure were 6.2 ± 0.8 dB SNR for the open-set and 7.2 ± 0.7 dB SNR for the closed-set test format. The residual training effect was less than 1 dB after using two double-lists before data collection. No significant differences were found for listeners of different Spanish varieties indicating that the test is applicable to Spanish as well as Latin American listeners. Test lists can be used interchangeably.	t	\N
22541369	Neuroimaging studies of English suggest that speech comprehension engages two interdependent systems: a bilateral fronto-temporal network responsible for general perceptual and cognitive processing, and a specialised left-lateralised network supporting specifically linguistic processing. Using fMRI we test this hypothesis in Polish, a Slavic language with rich and diverse morphology. We manipulated general perceptual complexity (presence or absence of an onset-embedded stem, e.g. kotlet 'cutlet' vs. kot 'cat') and specifically linguistic complexity (presence of an inflectional affix, e.g. dom 'house, Nom' vs. dom-u 'house, Gen'). Non-linguistic complexity activated a bilateral network, as in English, but we found no differences between inflected and uninflected nouns. Instead, all types of words activated left inferior frontal areas, suggesting that all Polish words can be considered linguistically 'complex' in processing terms. The results support a dual network hypothesis, but highlight differences between languages like English and Polish, and underline the importance of cross-linguistic comparisons.	t	\N
22542727	Impaired auditory sensitivity to amplitude rise time (ART) has been suggested to be a primary deficit in developmental dyslexia. The present study investigates whether impaired ART-sensitivity at a pre-reading age precedes and predicts later emerging reading problems in a sample of Dutch children. An oddball paradigm, with a deviant that differed from the standard stimulus in ART, was administered to 41-month-old children (30 genetically at-risk for developmental dyslexia and 14 controls) with concurrent EEG measurement. A second deviant that differed from the standard stimulus in frequency served as a control deviant. Grade two reading scores were used to divide the at-risks in a typical-reading and a dyslexic subgroup. We found that both ART- and frequency processing were related to later reading skill. We however also found that irrespective of reading level, the at-risks in general showed impaired basic auditory processing when compared to controls and that it was impossible to discriminate between the at-risk groups on basis of both auditory measures. A relatively higher quality of early expressive syntactic skills in the typical-reading at-risk group might indicate a protective factor against negative effects of impaired auditory processing on reading development. Based on these results we argue that ART- and frequency-processing measures, although they are related to reading skill, lack the power to be considered single-cause predictors of developmental dyslexia. More likely, they are genetically driven risk factors that may add to cumulative effects on processes that are critical for learning to read.	t	\N
22543320	The purpose of this study was to determine daily noise doses and 8-hour time weighted averages for rock band musicians, crew members, and spectators during a typical rehearsal and performance using both Occupational Safety and Health Administration (OSHA) and National Institute of Occupational Safety and Health (NIOSH) measurement criteria. Personal noise dosimetry was completed on five members of a rock band during one 2-hr rehearsal and one 4-hr performance. Time-weighted averages (TWA) and daily dose values were calculated using both OSHA and NIOSH criteria and compared to industry guidelines for enrollment in hearing conservation programs and the use of hearing protection devices. TWA values ranged from 84.3 to 90.4 dBA (OSHA) and from 90.0 to 96.4 dBA (NIOSH) during the rehearsal. The same values ranged from 91.0 to 99.7 dBA (OSHA) and 94.0 to 102.8 dBA (NIOSH) for the performance. During the rehearsal, daily noise doses ranged from 45.54% to 106.7% (OSHA) and from 317.74% to 1396.07% (NIOSH). During the performance, doses ranged from 114.66% to 382.49% (OSHA) and from 793.31% to 5970.15% (NIOSH). The musicians in this study were exposed to dangerously high levels of noise and should be enrolled in a hearing conservation programs. Hearing protection devices should be worn, especially during performances. The OSHA measurement criteria yielded values significantly more conservative than those produced by NIOSH criteria. Audiologists should counsel musician-patients about the hazards of excessive noise (music) exposure and how to protect their hearing.	t	\N
22559370	The "combined-stimulation advantage" refers to an improvement in speech recognition when cochlear-implant or vocoded stimulation is supplemented by low-frequency acoustic information. Previous studies have been interpreted as evidence for "super-additive" or "synergistic" effects in the combination of low-frequency and electric or vocoded speech information by human listeners. However, this conclusion was based on predictions of performance obtained using a suboptimal high-threshold model of information combination. The present study shows that a different model, based on Gaussian signal detection theory, can predict surprisingly large combined-stimulation advantages, even when performance with either information source alone is close to chance, without involving any synergistic interaction. A reanalysis of published data using this model reveals that previous results, which have been interpreted as evidence for super-additive effects in perception of combined speech stimuli, are actually consistent with a more parsimonious explanation, according to which the combined-stimulation advantage reflects an optimal combination of two independent sources of information. The present results do not rule out the possible existence of synergistic effects in combined stimulation; however, they emphasize the possibility that the combined-stimulation advantages observed in some studies can be explained simply by non-interactive combination of two information sources.	t	\N
22559371	Preliminary data [M. Epstein and M. Florentine, Ear. Hear. 30, 234-237 (2009)] obtained using speech stimuli from a visually present talker heard via loudspeakers in a sound-attenuating chamber indicate little difference in loudness when listening with one or two ears (i.e., significantly reduced binaural loudness summation, BLS), which is known as "binaural loudness constancy." These data challenge current understanding drawn from laboratory measurements that indicate a tone presented binaurally is louder than the same tone presented monaurally. Twelve normal listeners were presented recorded spondees, monaurally and binaurally across a wide range of levels via earphones and a loudspeaker with and without visual cues. Statistical analyses of binaural-to-monaural ratios of magnitude estimates indicate that the amount of BLS is significantly less for speech presented via a loudspeaker with visual cues than for stimuli with any other combination of test parameters (i.e., speech presented via earphones or a loudspeaker without visual cues, and speech presented via earphones with visual cues). These results indicate that the loudness of a visually present talker in daily environments is little affected by switching between binaural and monaural listening. This supports the phenomenon of binaural loudness constancy and underscores the importance of ecological validity in loudness research.	t	\N
22559380	Speech perception requires the integration of information from multiple phonetic and phonological dimensions. A sizable literature exists on the relationships between multiple phonetic dimensions and single phonological dimensions (e.g., spectral and temporal cues to stop consonant voicing). A much smaller body of work addresses relationships between phonological dimensions, and much of this has focused on sequences of phones. However, strong assumptions about the relevant set of acoustic cues and/or the (in)dependence between dimensions limit previous findings in important ways. Recent methodological developments in the general recognition theory framework enable tests of a number of these assumptions and provide a more complete model of distinct perceptual and decisional processes in speech sound identification. A hierarchical Bayesian Gaussian general recognition theory model was fit to data from two experiments investigating identification of English labial stop and fricative consonants in onset (syllable initial) and coda (syllable final) position. The results underscore the importance of distinguishing between conceptually distinct processing levels and indicate that, for individual subjects and at the group level, integration of phonological information is partially independent with respect to perception and that patterns of independence and interaction vary with syllable position.	t	\N
22559460	Peruvian Spanish (PS) and Iberian Spanish (IS) learners were tested on their ability to categorically discriminate and identify Dutch vowels. It was predicted that the acoustic differences between the vowel productions of the two dialects, which compare differently to Dutch vowels, would manifest in differential L2 perception for listeners of these two dialects. The results show that although PS learners had higher general L2 proficiency, IS learners were more accurate at discriminating all five contrasts and at identifying six of the L2 Dutch vowels. These findings confirm that acoustic differences in native vowel production lead to differential L2 vowel perception.	t	\N
22563091	Linguistic variables alone cannot fully account for bilingual listeners' perception of English-running speech. In the present study, the authors investigated how linguistic and attitudinal factors, in combination, affect bilingual processing of temporally degraded English passages in quiet and in noise. Thirty-six bilinguals with various linguistic and attitudinal characteristics participated in the study. Bilingual individuals completed questionnaires that assessed their language background, willingness to communicate (WTC), and self-perceived communication competency (SPCC) in English. Participants listened to English passage pairs from the Connected Speech Test, presented at 45 dB HL at 3 rates (unprocessed, expanded, compressed), in quiet and in noise. Language proficiency measures were the most significant linguistic variables, accounting for the largest amount of variance in performance across most conditions. Both WTC and SPCC were associated with performance and contributed to regression models. Subscales assessing listeners' WTC and SPCC in a group were more predictive of performance than communication in an interpersonal or public setting. Performance in noise was more difficult to predict than in quiet. Performance with compression was more difficult to predict than with expansion. To fully understand bilingual clients' perception of English speech, hearing professionals should consider their attitudinal characteristics in addition to language background.	t	\N
22564903	To evaluate whether a hypothesis suggesting that apraxia of speech results from phonological overspecification could be relevant for childhood apraxia of speech (CAS). High-density EEG was recorded from 5 children with CAS and 5 matched controls, ages 5-8 years, with and without CAS, as they listened to randomized sequences of CV syllables in two oddball paradigms: phonemic (/ba/, /pa/) and allophonic (/pa/, /p(h)a/). In the phonemic contrast condition, mismatch negativity (MMN) responses to oddball sounds were observed for the typically developing (comparison) group but not the CAS group, although a component similar to an immature mismatch response was apparent. The allophonic contrast did not elicit MMN responses in the comparison group, but in the CAS group, an MMN-like response was observed. The authors propose that these preliminary findings are consistent with a view of CAS as a disorder that not only affects motor planning but also has a phonological component.	t	\N
22565049	Primary facial nerve tumors (FNTs) present in varying ways. In this study, the authors present their institutional experience with the management of facial nerve tumors, including their recommendations for available therapies such as observation, microsurgical decompression or removal, and stereotactic radiation. They emphasize the auditory and facial nerve function outcomes. Retrospective case review. Tertiary referral center. Retrospective review of all cases of FNT seen at the authors' tertiary care academic medical center over a 10-year period (2002-2011). The clinical presentation, treatment modality, and outcome parameters of cochlear and facial nerve function were assessed. Twelve patients were identified. House-Brackmann grades on presentation were 4 grade I, 2 grade II, 2 grade III, 1 grade IV, and 3 grade V, with 2 grade V patients declining to grade VI shortly after presentation. Seven patients presented with serviceable hearing and 4 with nonserviceable hearing. Treatment options/arms included observation with serial clinicoradiological review (2 cases), stereotactic radiation with the CyberKnife (3 cases), wide fallopian canal decompression (3 cases), microsurgical excision and repair (3 cases), and biopsy followed by observation (1 case). At the end of the review period, facial nerve function was stable in 8 patients, improved in 3, and declined in 1, and none had documented worsening of hearing based on American Academy of Otolaryngology--Head and Neck Surgery Foundation classification. Management of FNT is largely based on the clinicoradiological picture. Each treatment arm is different, but overall auditory and facial function can be maintained.	t	\N
22571290	Vocal emblems, such as shh and brr, are speech sounds that have linguistic and nonlinguistic features; thus, it is unclear how they are processed in the brain. Five adult dextral individuals with left-brain damage and moderate-severe Wernicke's aphasia, five adult dextral individuals with right-brain damage, and five Controls participated in two tasks: (1) matching vocal emblems to photographs ('picture task') and (2) matching vocal emblems to verbal translations ('phrase task'). Cross-group statistical analyses on items on which the Controls performed at ceiling revealed lower accuracy by the group with left-brain damage (than by Controls) on both tasks, and lower accuracy by the group with right-brain damage (than by Controls) on the picture task. Additionally, the group with left-brain damage performed significantly less accurately than the group with right-brain damage on the phrase task only. Findings suggest that comprehension of vocal emblems recruits more left- than right-hemisphere processing.	t	\N
22584037	Music is an ancient and ubiquitous form of human expression. One important component for which music is sought after is its aesthetic value, whose appreciation has typically been associated with largely learned, culturally determined factors, such as education, exposure, and social pressure. However, neuroscientific evidence shows that the aesthetic response to music is often associated with automatic, physically- and biologically-grounded events, such as shivers, chills, increased heart rate, and motor synchronization, suggesting the existence of an underlying biological platform upon which contextual factors may act. Drawing on philosophical notions and neuroscientific evidence, I argue that, although there is no denying that social and cultural context play a substantial role in shaping the aesthetic response to music, these act upon largely universal, biological mechanisms involved with neural processing. I propose that the simultaneous presence of culturally-influenced and biologically-determined contributions to the aesthetic response to music epitomizes Baumgarten's equation of sensory perception with taste. Taking the argument one step further, I suggest that the heavily embodied aesthetic response to music bridges the cleavage between the two discrepant meanings-the one referring to sensory perception, the other referring to judgments of taste-traditionally attributed to the word "aesthetics" in the sciences and the humanities.	t	\N
22588234	To identify whether speech recognition outcomes are influenced by the choice of ear for cochlear implantation in adults with bilateral hearing loss who use a hearing aid in 1 ear but have long-term auditory deprivation in the other. Retrospective matched cohort study. Speech recognition results were examined in 30 adults with monaural sound deprivation. Fifteen received the implant in the sound-deprived ear and 15 in the aided ear. Tertiary referral centers with active cochlear implant programs. Adults with bilateral hearing loss and a minimum of 15 years of monaural sound deprivation who received a cochlear implant after meeting the traditional implantation criteria of the referral centers. Cochlear implantation with devices approved by the U.S. Food and Drug Administration. Paired comparisons of postoperative monosyllabic word recognition scores obtained with the implant alone and in the usual listening condition (CI alone or bimodal). With the cochlear implant alone, individuals who received the implant in a sound-deprived ear obtained poorer scores than individuals who received the implant in the aided ear. There was no significant difference, however, in speech recognition results for the 2 groups when tested in their usual listening condition. In particular, poorer speech recognition scores were obtained with the cochlear implant alone by individuals using bimodal hearing. Similar clinical outcomes of cochlear implantation can be achieved by adults with a long-term monaural sound deprivation when comparing the usual listening condition, irrespective of whether the implant is in the sound-deprived or in the aided ear.	t	\N
22588270	This study aimed to characterize horizontal plane sound localization in interfering noise at different signal-to-noise ratios (SNRs) and to compare performance across normal-hearing listeners and users of unilateral and bilateral cochlear implants (CIs). CI users report difficulties with listening in noisy environments. Although their difficulties with speech understanding have been investigated in several studies, the ability to localize sounds in background noise has not extensively been examined, despite the benefits of binaural hearing being greatest in noisy situations. Sound localization is a measure of binaural processing and is thus well suited to assessing the benefit of bilateral implantation. The results will inform clinicians and implant manufacturers how to focus their efforts to improve localization with CIs in noisy situations. Six normal-hearing listeners, four unilateral, and 10 bilateral CI users indicated the perceived location of sound sources using a light pointer method. Target sounds were noise pulses played from one of 11 loudspeakers placed between -80 and +80 degrees in the frontal horizontal plane in the free field. Localization was assessed in quiet and in diffuse background noise at SNRs between +10 and -7 dB. Speech reception thresholds were measured and their relation to the localization results examined. Localization performance declined with decreasing SNR: target sounds were perceived closer to the median plane and the standard deviation of responses increased. Localization performance across groups was compared using a measure of "Spatial Resolvability" (SR). This measure gives the angular separation between two sound sources that would enable an ideal observer to correctly distinguish them 69.1% of the time. For all participants SR increased with decreasing SNR, that is, at low SNRs the spatial separation between sound sources remained distinguishable only when it was larger. Normal-hearing participants performed best, with SR between 1.4 and 5.1 degrees in quiet. Bilateral CI users showed SR between 8.3 and 43.6 degrees in quiet, corresponding approximately to the spatial resolution of normal-hearing listeners at an SNR of -5 dB. Most bilateral CI users had lost the ability to correctly determine which side the sound came from at an SNR of -3 dB. Overall, the SNR had to be at least +7 dB to achieve localization performance near to that in quiet for all bilateral CI users. No significant correlation was found between spatial resolution and speech reception thresholds, but the speech processor sensitivity setting did significantly affect performance. Unilateral CI users showed the most severe localization problems, with only two of four participants being able to correctly determine which side sounds came from in quiet. This study is the first to examine sound localization with CIs at various SNRs and to compare it with normal hearing. The results confirm that localization with CIs is strongly disrupted in noisy situations. Bilateral CIs were shown to be clearly superior over unilateral CIs for localization in quiet and in noisy situations. With bilateral CIs, localization declined at moderately high absolute noise levels (>63 dB SPL), suggesting that an extension of the acoustic-dynamic range to higher levels would be beneficial. The absence of a relation between speech reception thresholds and spatial resolution highlights the need for additional clinical tests to assess the binaural benefit of a second implant.	t	\N
22591493	Recent oddball studies showed that auditory change detection responses exist in the first 50 ms after sound onset, upstream of mismatch negativity (MMN). We examined if these early responses could be elicited by feature-specific changes, meaning changes in the value of one attribute of a stimulus, regardless of whether other attributes of the stimulus are changing or not. We used a multifeature paradigm with four types of deviants: frequency, duration, intensity, and interaural time difference. In the middle latency range, only frequency deviants led to an enhanced Nb response. All four feature changes generated significant MMNs. Our results indicate that human brain is capable of detecting a feature-specific change for frequency attributes in the middle latency. The different levels of information being encoded in two separate event-related potential time ranges support the notion of a hierarchical organization of auditory deviance detection.	t	\N
22606932	We describe the audiometric results following surgery in a consecutive series of patients with a congenital ossicular middle ear disorder that was associated with a mobile stapes footplate. We performed a retrospective analysis of patient charts from a tertiary referral center. A total of 23 patients (23 ears) underwent exploratory tympanotomy and ossicular reconstruction between 1986 and 2001. The main outcome measure was the audiometric results. Overall, we observed a mean gain in air conduction pure tone average of 17 dB (from 47 dB to 30 dB), a sensorineural deterioration of 3 dB, and a mean postoperative air-bone gap of 19 dB (mean preoperative air-bone gap of 38 dB). The air-bone gap closure was 20 dB or less in 15 of the 23 cases (65%), in agreement with the few results reported in the literature. Moreover, the audiometric results remained stable. In the syndromic group, the mean gain in air conduction was only 13 dB, which was worse than that observed for the nonsyndromic ears. Surgery for congenital ossicular chain anomalies with a concomitant mobile stapes footplate provides positive audiometric outcomes. Most ears had some sensorineural impairment (10 to 20 dB), which influenced the final hearing level attained after surgery. Preoperative assessment is mandatory to search for syndromal diagnoses, which might be important for patient counseling and prognosis.	t	\N
22609773	This study investigated the bandwidth of phase sensitivity. Subjects discriminated amplitude-modulated tones (AM), and quasi-frequency-modulated tones (QFM) in a two-interval, forced-choice task. An adaptive threshold procedure was used to estimate the modulation depth needed to discriminate the stimuli as a function of carrier and modulation frequency. Non-monotonicities in threshold-bandwidth functions were often observed at higher modulation frequencies. The results are discussed in terms of two potential cues: (1) waveform envelope, (2) cubic distortion products. In order to degrade the information obtained from auditory distortions, the phase for the carrier frequency was randomly sampled from a uniform distribution, which diminished the non-monotonicities with minimal effect at lower modulation frequencies. Model simulations demonstrated that phase randomization degrades distortion product cues with only a modest effect on temporal cues. Final results show that maximum bandwidths for phase sensitivity (BW(max)) were not proportional to carrier frequencies.	t	\N
22615475	In this study, the authors evaluated the effect of frequency compression hearing aids on speech perception ability and the time course and magnitude of acclimatization-related changes. Participants included children ages 11-18 years. Speech perception ability was evaluated over well-controlled baseline, treatment, and withdrawal study phases. Study-worn hearing aids were individually fitted to all participants. The authors evaluated speech perception ability using outcomes of speech detection (/s/ and /[symbol in text]/ sounds), /s-[symbol in text]/ discrimination, and plural and consonant recognition. Indices of change were discussed on a case-by-case basis across all study phases. Significant treatment effects were measured for all cases, on at least one measure, with some listeners displaying significant acclimatization trends following a trial of frequency compression. Findings suggest that frequency compression provided varying outcomes, both in benefit and acclimatization, across listeners. For some, a period of acclimatization was necessary before change could be measured. For others, performance remained stable over the time course under evaluation, suggesting that some but not all children will experience improved speech recognition ability after a period of frequency compression hearing aid use.	t	\N
22615939	In natural audio-visual environments, a change in depth is usually correlated with a change in loudness. In the present study, we investigated whether correlating changes in disparity and loudness would provide a functional advantage in binding disparity and sound amplitude in a visual search paradigm. To test this hypothesis, we used a method similar to that used by van der Burg et al. to show that non-spatial transient (square-wave) modulations of loudness can drastically improve spatial visual search for a correlated luminance modulation. We used dynamic random-dot stereogram displays to produce pure disparity modulations. Target and distractors were small disparity-defined squares (either 6 or 10 in total). Each square moved back and forth in depth in front of the background plane at different phases. The target's depth modulation was synchronized with an amplitude-modulated auditory tone. Visual and auditory modulations were always congruent (both sine-wave or square-wave). In a speeded search task, five observers were asked to identify the target as quickly as possible. Results show a significant improvement in visual search times in the square-wave condition compared to the sine condition, suggesting that transient auditory information can efficiently drive visual search in the disparity domain. In a second experiment, participants performed the same task in the absence of sound and showed a clear set-size effect in both modulation conditions. In a third experiment, we correlated the sound with a distractor instead of the target. This produced longer search times, indicating that the correlation is not easily ignored.	t	\N
22616091	Sound localization in the horizontal (azimuth) plane relies mainly on interaural time differences (ITDs) and interaural level differences (ILDs). Both are distorted in listeners with acquired unilateral conductive hearing loss (UCHL), reducing their ability to localize sound. Several studies demonstrated that UCHL listeners had some ability to localize sound in azimuth. To test whether listeners with acquired UCHL use strongly perturbed binaural difference cues, we measured localization while they listened with a sound-attenuating earmuff over their impaired ear. We also tested the potential use of monaural pinna-induced spectral-shape cues for localization in azimuth and elevation, by filling the cavities of the pinna of their better-hearing ear with a mould. These conditions were tested while a bone-conduction device (BCD), fitted to all UCHL listeners in order to provide hearing from the impaired side, was turned off. We varied stimulus presentation levels to investigate whether UCHL listeners were using sound level as an azimuth cue. Furthermore, we examined whether horizontal sound-localization abilities improved when listeners used their BCD. Ten control listeners without hearing loss demonstrated a significant decrease in their localization abilities when they listened with a monaural plug and muff. In 4/13 UCHL listeners we observed good horizontal localization of 65 dB SPL broadband noises with their BCD turned off. Localization was strongly impaired when the impaired ear was covered with the muff. The mould in the good ear of listeners with UCHL deteriorated the localization of broadband sounds presented at 45 dB SPL. This demonstrates that they used pinna cues to localize sounds presented at low levels. Our data demonstrate that UCHL listeners have learned to adapt their localization strategies under a wide variety of hearing conditions and that sound-localization abilities improved with their BCD turned on.	t	\N
22617829	Lateral prefrontal cortex and basal ganglia work together to mediate working memory and top-down regulation of cognition. This circuit regulates the balance and interactions between automatic and high-order control responses. Using ultra-high-field high-resolution functional magnetic resonance imaging (7T-fMRI), the present study examined the role of subcortical structures in cognitive control during language processing. Participants were asked to judge the grammaticality of unambiguous, ungrammatical and ambiguous sentences. Grammatical unambiguous sentences should elicit an automatic response, while ambiguous and ungrammatical sentences should conflict with the automatic response and, hence, require a high-order control response. Within the control response domain, ambiguity and ungrammaticality represent two different dimensions of conflict resolution, while for a temporarily ambiguous sentence a correct interpretation is available, that is not the case for ungrammatical sentences. Our results reveal an anterior-posterior axis in the dorsomedial striatum with more rostral regions supporting higher levels of cognitive processing. This functional architecture mirrors the rostrocaudal hierarchical organization evidenced within the lateral prefrontal cortex.	t	\N
22621261	Categorical perception, an increased sensitivity to between- compared with within-category contrasts, is a stable property of native speech perception that emerges as language matures. Although recent research suggests that categorical responses to speech sounds can be found in left prefrontal as well as temporo-parietal areas, it is unclear how the neural system develops heightened sensitivity to between-category contrasts. In the current study, two groups of adult participants were trained to categorize speech sounds taken from a dental/retroflex/velar continuum according to two different boundary locations. Behavioral results suggest that for successful learners, categorization training led to increased discrimination accuracy for between-category contrasts with no concomitant increase for within-category contrasts. Neural responses to the learned category schemes were measured using a short-interval habituation design during fMRI scanning. Whereas both inferior frontal and temporal regions showed sensitivity to phonetic contrasts sampled from the continuum, only the bilateral middle frontal gyri exhibited a pattern consistent with encoding of the learned category scheme. Taken together, these results support a view in which top-down information about category membership may reshape perceptual sensitivities via attention or executive mechanisms in the frontal lobes.	t	\N
22646315	The study investigated the effect of immediate feedback in training listeners to perceive subtle differences in voice quality, a perceptual skill that is important for speech-language pathologists. Sixty naive listeners were randomly assigned to a feedback group (Group F), a no feedback group (Group NF), and a no training group acting as a control group (Group C). The task was to evaluate the severity of a perceptual voice quality (breathiness) by using a reference-matching paradigm. All participants took part in three rating sessions (pre-training, 2 days after training and 1 week after training). Group F and Group NF participated in a training session immediately after the first rating session, where Group F practiced with immediate feedback given and Group NF practice with no immediate feedback given. The results showed that Group F and Group NF had significant improvement after training, but Group F did not retain the improvement in the third rating session. The use of a reference-matching training paradigm without giving frequent immediate feedback is suggested for auditory-perceptual voice evaluation training. The most effective frequency of immediate feedback is yet to be determined.	t	\N
22648604	The "pip-and-pop effect" refers to the facilitation of search for a visual target (a horizontal or vertical bar whose color changes frequently) among multiple visual distractors (tilted bars also changing color unpredictably) by the presentation of a spatially uninformative auditory cue synchronized with the color change of the visual target. In the present study, the visual stimuli in the search display changed brightness instead of color, and the crossmodal congruency between the pitch of the auditory cue and the brightness of the visual target was manipulated. When cue presence and cue congruency were randomly varied between trials (Experiment 1), both congruent cues (low-frequency tones synchronized with dark target states or high-frequency tones synchronized with bright target states) and incongruent cues (the reversed mapping) facilitated visual search performance equally, relative to a no-cue baseline condition. However, when cue congruency was blocked and the participants were informed about the pitch-brightness mapping in the cue-present blocks (Experiment 2), performance was significantly enhanced when the cue and target were crossmodally congruent as compared to when they were incongruent. These results therefore suggest that the crossmodal congruency between auditory pitch and visual brightness can influence performance in the pip-and-pop task by means of top-down facilitation.	t	\N
22659285	The health impacts of environmental noise are a growing concern amongst both the general public and policy-makers in Europe. Environmental noise - especially from road transportation - is widely accepted as an important environmental impact factor that can be taken as a start for the process of evaluating the impact of annoyance on the exposed urban population. Extensive urbanisation and the increase of road transport define the main driving forces for the environmental noise exposure of the population. In urban conditions, it is rather common, regarding road transportation noise, to hear from people that, especially, PTW (Powered Two Wheelers) are annoying, and many times are actually the most annoying environmental noise sources introducing a degradation of the urban environment. In this research, in Athens city centre, both scooters and motorbikes operation patterns are analysed, in the basis of their environmental impact through ad-hoc tests to establish if specific features of their emitted noise are annoying and affect the quality of life. It resulted that PTW are a relevant cause of specific environmental annoyance on pedestrians when low background noise levels and sparse traffic flow allow identifying the PTW. Based on the results of a measurement campaign, both L(max) and roughness indices are identified as characteristic noise signatures of the PTW. Results are compared to laboratory studies on annoyance found in literature and to a specific set of interviews with a large number of pedestrians in selected sites. Annoyance caused by scooters and motorbikes is analysed in the findings and conclusions.	t	\N
22659582	How do perceivers apply knowledge to instances they have never experienced before? On one hand, listeners might use idealized representations that do not contain specific details. On the other, they might recognize and process information based on more detailed memory representations. The current study examined the latter possibility with respect to musical meter perception, previously thought to be computed based on highly-idealized (isochronous) internal representations. In six experiments, listeners heard sets of metrically-ambiguous melodies. Each melody was played in a simultaneous musical context with unambiguous metrical cues (3/4 or 6/8). Cross-melody similarity was manipulated by pairing certain cues-timbre (musical instrument) and motif content (2-6-note patterns)-with each meter, or distributing cues across meters. After multiple exposures, listeners heard each melody without context, and judged metrical continuations (all Experiments) or familiarity (Experiments 5-6). Responses were assessed for "metrical restoration"-the tendency to make metrical judgments that fit the melody's previously-heard metrical context. Cross-melody similarity affected the presence and degree of metrical restoration, and timbre affected familiarity. Results suggest that metrical processing may be calculated based on fairly detailed representations rather than idealized isochronous pulses, and is dissociated somewhat from familiarity judgments. Implications for theories of meter perception are discussed.	t	\N
22666424	Sensory attenuation refers to the observation that self-generated stimuli are attenuated, both in terms of their phenomenology and their cortical response compared to the same stimuli when generated externally. Accordingly, it has been assumed that sensory attenuation might help individuals to determine whether a sensory event was caused by themselves or not. In the present study, we investigated whether this dependency is reciprocal, namely whether sensory attenuation is modulated by prior beliefs of authorship. Participants had to judge the loudness of auditory effects that they believed were either self-generated or triggered by another person. However, in reality, the sounds were always triggered by the participants' actions. Participants perceived the tones' loudness attenuated when they believed that the sounds were self-generated compared to when they believed that they were generated by another person. Sensory attenuation is considered to contribute to the emergence of people's belief of authorship. Our results suggest that sensory attenuation is also a consequence of prior belief about the causal link between an action and a sensory change in the environment.	t	\N
22667466	Cochlear microphonic (CM) measurements may potentially become a supplementary approach to otoacoustic emission (OAE) measurements for assessing low-frequency cochlear functions in the clinic. The objective of this study was to investigate the measurement of CMs in subjects with high-frequency hearing loss. Currently, CMs can be measured using electrocochleography (ECochG or ECoG) techniques. Both CMs and OAEs are cochlear responses, while auditory brainstem responses (ABRs) are not. However, there are inherent limitations associated with OAE measurements such as acoustic noise, which can conceal low-frequency OAEs measured in the clinic. However, CM measurements may not have these limitations. CMs were measured in human subjects using an ear canal electrode. The CMs were compared between the high-frequency hearing loss group and the normal-hearing control group. Distortion product OAEs (DPOAEs) and audiogram were also measured. The DPOAE and audiogram measurements indicate that the subjects were correctly selected for the two groups. Low-frequency CM waveforms (CMWs) can be measured using ear canal electrodes in high-frequency hearing loss subjects. The difference in amplitudes of CMWs between the high-frequency hearing loss group and the normal-hearing group is insignificant at low frequencies but significant at high frequencies.	t	\N
22670350	Quinn and Watt (2006 Perception 35 267-280) showed that the optimal tempi for pieces of music (ie the appropriate speed for a melody) vary from melody to melody. The current study tested if such tempi responses depend on previous stimulus presentations. To do this, we ran a many-participants single-presentation version of the Quinn and Watt experiment. 616 visitors to the Glasgow Science Centre participated. We found that the results substantially matched those obtained earlier, and demonstrate that participants' responses are essentially independent of previously presented stimuli.	t	\N
22688920	Previous research suggests that young children have significant difficulty recognizing speech in the presence of background noise as compared with older children and adults. However, limited research exists that examines the developmental effects of speech recognition in noise in separate age groups of young children, especially in a classroom setting. The lack of research may relate to the limited number of tests with multiple, equally intelligible lists in noise that are also appropriate for young children. As a result, the goals of the present study include investigating (1) effects of age and (2) benefits of spatial separation of speech and noise sources on the speech recognition in noise performance of young children with normal-hearing sensitivity. A secondary goal of the study was to establish the validity and reliability of the Phrases in Noise Test (PINT) for assessing the 50% correct speech-in-noise threshold of young children. The investigators used a two-way repeated measures design to examine the main effects of age and spatial separation. Sixty-eight children in separate groups of 3-, 4-, 5-, and 6-year-olds and 17 adults completed two speech-recognition conditions with (1) speech and noise from the same loudspeaker at 0-degree azimuth (S0/N0) and (2) speech and noise from separate loudspeakers at 0- and 180-degree azimuth (S0/N180). Recruiting sites included local preschools and school districts for children and a university for adults. The results of this investigation suggest that younger children (<4 years of age) have significantly poorer speech-in-noise thresholds than older children and adults, and 4- and 5-year-old children also have significantly poorer performance than adults when speech and noise are presented from the same spatial location. All participants obtained significant spatial release from masking. On a parent and teacher screening questionnaire to assess educational risk, five of 12 children with at-risk behaviors had poor speech-in-noise thresholds relative to their peers. When two lists of the PINT are used, the test seems to be a valid and reliable measure for assessing young children's speech-in-noise thresholds. Young children exhibit significantly poorer speech recognition than do older children and adults in a classroom, especially when speech and noise are presented from the same location. Given the poor acoustics of typical classrooms, and the earlier age at which many children are educated, special modifications to classrooms may be necessary to improve listening through acoustic modifications or classroom amplification. A combination of a parent or teacher questionnaire and the PINT may be helpful in identifying children who are at risk for educational delays and listening difficulties in classrooms with typically poor acoustics.	t	\N
22693344	Hearing and feeling both rely upon the transduction of physical events into frequency-based neural codes, suggesting that the auditory system may be intimately related to the somatosensory system. Here, we provide evidence that the neural substrates for audition and somatosensation are anatomically linked. Using diffusion tensor imaging with both deterministic and probabilistic tractography to measure white matter connectivity, we show that there are extensive ipsilateral connections between the primary auditory cortex and the primary and secondary somatosensory regions in the human cerebral cortex. We further show that these cross-modal connections are exaggerated between the auditory and secondary somatosensory cortex in the lesioned hemisphere of a patient (SR) with acquired auditory-tactile synesthesia, in whom sounds alone produce bodily sensations. These results provide an anatomical basis for multisensory interactions between audition and somatosensation and suggest that cross-talk between these regions may explain why some sounds, such as nails screeching down a chalkboard or an audible mosquito, can induce feelings of touch, especially on the contralesional body surface of patient SR.	t	\N
22695875	The aim of this study is to compare the hearing thresholds of the diabetic patients with the age- and sex-matched normal control group regarding age, glucose level, duration and complications of diabetes with the degree of hearing impairment. Pure tone audiometry was performed in 58 patients with type 1 and 2 diabetes mellitus and compared with 58 age- and sex-matched controls. The patients were categorized into groups according to age, duration of diabetes, complications and control of diabetes. These observations were compared with those from the control subjects. 15-50 years age group with diabetes showed a significant high frequency hearing loss, as compared to the controls. Complicated and poorly controlled diabetics have significant degree of hearing loss in high frequencies as compared to those who were well controlled and uncomplicated. There was also a correlation between the level of hearing loss and duration of diabetes.	t	\N
22710551	Superficial siderosis is a progressive disease of the central nervous system associated with chronic subarachnoid hemorrhage. Sensorineural hearing loss occurs early in the disease typically progressing to a profound hearing loss during several years and ultimately affecting 95% of patients. There are published reports of variable outcomes regarding auditory performance for cochlear implantation in cases of superficial siderosis: the objective of this article was to systematically review this evidence. A systematic search of NHS Evidence electronic journal databases AMED (1985 to present), BNI (1985 to present), CINAHL (1981 to present), EMBASE (1980 to present), HEALTH BUSINESS ELITE, HMIC, MEDLINE (1950 to present), and PsycINFO (1806 to present) was performed. Further research using personal communication, Google Scholar, hand searching Otology & Neurotology (2008-2011), and assessment of reference lists identified in other relevant articles yielded additional articles. A total of 24 articles were short-listed based on relevance; no studies were excluded on a basis of quality. Of these 24 articles, 11 were excluded. The 13 articles included in this review report 15 cases of cochlear implantation in superficial siderosis. Of these 15 individual cases, 7 (47%) showed clear sustained benefit from cochlear implantation, 6 showed limited/no benefit from the onset, and the remaining 2 patients' initial benefit was not maintained. Outcomes will depend on the site of lesion and the degree of cochlear nerve functionality, as well as ongoing neural deterioration. Comprehensive assessment of the auditory pathway including electrical auditory brainstem response and magnetic resonance imaging as well as pre/postimplantation counseling is indicated, but these preoperative measures are imperfect predictors of outcomes. There are indications that, where the underlying disease is stable, cochlear implant performance may be sustained, and where there is disease progression (specifically regarding involvement of auditory brainstem nuclei), cochlear implant performance may deteriorate. Further data are needed in this regard; however, results suggest that earlier implantation would provide benefit for a longer period and increase cost-effectiveness.	t	\N
22712945	Frequency modulation detection limens (FMDLs) were measured for five hearing-impaired (HI) subjects for carrier frequencies f(c) = 1000, 4000, and 6000 Hz, using modulation frequencies f(m) = 2 and 10 Hz and levels of 20 dB sensation level and 90 dB SPL. FMDLs were smaller for f(m) = 10 than for f(m) = 2 Hz for the two higher f(c), but not for f(c) = 1000 Hz. FMDLs were also determined with additional random amplitude modulation (AM), to disrupt excitation-pattern cues. The disruptive effect was larger for f(m) = 10 than for f(m) = 2 Hz. The smallest disruption occurred for f(m) = 2 Hz and f(c) = 1000 Hz. AM detection thresholds for normal-hearing and HI subjects were measured for the same f(c) and f(m) values. Performance was better for the HI subjects for both f(m). AM detection was much better for f(m)  = 10 than for f(m) = 2 Hz. Additional tests showed that most HI subjects could discriminate temporal fine structure (TFS) at 800 Hz. The results are consistent with the idea that, for f(m) = 2 Hz and f(c) = 1000 Hz, frequency modulation (FM) detection was partly based on the use of TFS information. For higher carrier frequencies and for all carrier frequencies with f(m) = 10 Hz, FM detection was probably based on place cues.	t	\N
22712949	The musical scale is a basis for melodies and can be a simple melody by itself. The present study investigated magnetoencephalographic (MEG) responses to omissions of one tone out of the C major scale. The tone preceding the omitted "target" tone was either prolonged or repeated. In another series, the tone after the target tone was repeated. In "normal" oddball experiments, the complete C major scale was presented more frequently than an incomplete scale lacking one tone, and in "reverse" oddball experiments, the roles were exchanged. In the normal oddball experiments, omission of any tone produced a response significantly different in amplitude from the standard response in the group of non-musicians, although the responses differed depending on the types of omission. The leading tone (B in the C major scale) was shown to elicit a large response when omitted and also when its presence was emphasized. The Reverse oddball experiments showed that repeated presentation of an incomplete scale lacking one tone temporarily reduced the influence of the complete scale but could not even temporarily replace it working as "standard." In addition, an auxiliary study was done to see possible influence of rhythmic variations.	t	\N
22713383	We report a patient with non-fluent Primary Progressive Aphasia who was premorbidly literate in two alphabetic scripts, Hungarian (L1) and English (L2). Testing was performed over a two-year period to assess the impact of progressive illness on oral reading and repetition of single words. Results showed significant decline in oral reading in both languages, and an effect of language status in favour of oral reading in L1. Phonological complexity was a significant predictor of oral reading decline in both languages. Of interest, we observed an effect of language status on task performance whereby repetition was better in L2 than L1 but oral reading was better in L1 than L2. We conclude that language status has an effect on repetition and oral reading abilities for bilingual speakers with non-fluent Primary Progressive Aphasia.	t	\N
22714710	It is clear that the ability to learn new speech contrasts changes over development, such that learning to categorize speech sounds as native speakers of a language do is more difficult in adulthood than it is earlier in development. There is also a wealth of data concerning changes in the perception of speech sounds during infancy, such that infants quite rapidly progress from language-general to more language-specific perceptual biases. It is often suggested that the perceptual narrowing observed during infancy plays a causal role in the loss of plasticity observed in adulthood, but the relationship between these two phenomena is complicated. Here I consider the relationship between changes in sensitivity to speech sound categorization over the first 2 years of life, when they appear to reorganize quite rapidly, to the "long tail" of development throughout childhood, in the context of understanding the sensitive period for speech perception.	t	\N
22716266	This study aimed to discover whether general receptive vocabulary is qualitatively phenotypical in Down syndrome. Sixty-two participants with Down syndrome (M age=16.74 years, SD=3.28) were individually matched on general vocabulary raw total score with 62 participants with intellectual disability of undifferentiated etiology (M age=16.20 years, SD=3.08) and 62 typical children (M age=5.32 years, SD=0.82). Item analyses using the transformed item difficulties method to detect differential item functioning across groups showed that the groups' rank orders of item difficulty were highly similar. It was concluded that the general receptive vocabulary of older children and adolescents with Down syndrome is not qualitatively distinguished when its overall size is held constant. Methodological and theoretical implications of this finding are discussed.	t	\N
22717687	To develop a predictive model of spatial release from masking (SRM) for cochlear implantees, and validate this model against data from the literature. To establish the spatial configurations for which the model predicts a large advantage of bilateral over unilateral implantation. To collect data to support these predictions and generate predictions of more typical advantages of bilateral implantation. The model initially assumed that bilateral cochlear implantees had equally effective implants on each side, with which they could perform optimal better-ear listening. Predictions were compared with measurements of SRM, using one and two implants with up to three interfering noises. The effect of relaxing the assumption of equally effective implants was explored. Novel measurements of SRM for eight unilateral implantees were collected, including measurements using speech and noise at azimuths of ± 60 degrees, and compared with prediction. A spatial map of bilateral implant benefit was generated for a situation with one interfering noise in anechoic conditions, and predictions of benefit were generated from binaural room impulse responses in a variety of real rooms. The model accurately predicted data from a previous study for multiple interfering noises in a variety of spatial configurations, even when implants were assumed to be equally effective (r = 0.97). It predicted that the maximum benefit of bilateral implantation was 18 dB. Predictions were little affected if the implants were not assumed to be equally effective. The new measurements supported the 18 dB advantage prediction. The spatial map of predicted benefit showed that, for a listener facing the target voice, bilateral implantees could enjoy an advantage of about 10 dB over unilateral implantees in a wide range of situations. Predictions based on real-room measurements with speech and noise at 1 m showed that large benefits can occur even in reverberant spaces. In optimal conditions, the benefit of bilateral implantation to speech intelligibility in noise can be much larger than has previously been reported. This benefit is thus considerably larger than reported benefits of summation or squelch and is robust in reverberation when the interfering source is close.	t	\N
22721629	Understanding the temporal dynamics underlying cortical processing of auditory categories is complicated by difficulties in equating temporal and spectral features across stimulus classes. In the present magnetoencephalography (MEG) study, female voices and cat sounds were filtered so as to match in most of their acoustic properties, and the respective auditory evoked responses were investigated with a paradigm that allowed us to examine auditory cortical processing of two natural sound categories beyond the physical make-up of the stimuli. Three cat or human voice sounds were first presented to establish a categorical context. Subsequently, a probe sound that was congruent, incongruent, or ambiguous to this context was presented. As an index of a categorical mismatch, MEG responses to incongruent sounds were stronger than the responses to congruent sounds at ~250 ms in the right temporoparietal cortex, regardless of the sound category. Furthermore, probe sounds that could not be unambiguously attributed to any of the two categories ("cat" or "voice") evoked stronger responses after the voice than cat context at 200-250 ms, suggesting a stronger contextual effect for human voices. Our results suggest that categorical templates for human and animal vocalizations are established at ~250 ms in the right temporoparietal cortex, likely reflecting continuous online analysis of spectral stimulus features during auditory categorizing task.	t	\N
22726605	In a recent study, we reported that the accurate perception of beat structure in music ('perception of musical meter') accounted for over 40% of the variance in single word reading in children with and without dyslexia (Huss et al., 2011). Performance in the musical task was most strongly associated with the auditory processing of rise time, even though beat structure was varied by manipulating the duration of the musical notes. Here we administered the same musical task a year later to 88 children with and without dyslexia, and used new auditory processing measures to provide a more comprehensive picture of the auditory correlates of the beat structure task. We also measured reading comprehension and nonword reading in addition to single word reading. One year later, the children with dyslexia performed more poorly in the musical task than younger children reading at the same level, indicating a severe perceptual deficit for musical beat patterns. They now also had significantly poorer perception of sound rise time than younger children. Longitudinal analyses showed that the musical beat structure task was a significant longitudinal predictor of development in reading, accounting for over half of the variance in reading comprehension along with a linguistic measure of phonological awareness. The non-linguistic musical beat structure task is an important independent longitudinal and concurrent predictor of variance in reading attainment by children. The different longitudinal versus concurrent associations between musical beat perception and auditory processing suggest that individual differences in the perception of rhythmic timing are an important shared neural basis for individual differences in children in linguistic and musical processing.	t	\N
22732772	The purpose of this study was to determine how combinations of reverberation and noise, typical of environments in many elementary school classrooms, affect normal-hearing school-aged children's speech recognition in stationary and amplitude-modulated noise, and to compare their performance with that of normal-hearing young adults. In addition, the magnitude of release from masking in the modulated noise relative to that in stationary noise was compared across age groups in nonreverberant and reverberant listening conditions. Last, for all noise and reverberation combinations the degree of change in predicted performance at 70% correct was obtained for all age groups using a best-fit cubic polynomial. Bamford-Kowal-Bench sentences and noise were convolved with binaural room impulse responses representing nonreverberant and reverberant environments to create test materials representative of both audiology clinics and school classroom environments. Speech recognition of 48 school-aged children and 12 adults was measured in speech-shaped and amplitude-modulated speech-shaped noise, in the following three virtual listening environments: nonreverberant, reverberant at 2 m, and reverberant at 6 m. Speech recognition decreased in the reverberant conditions and with decreasing age. Release from masking in modulated noise relative to stationary noise decreased with age and was reduced by reverberation. In the nonreverberant condition, participants showed similar amounts of masking release across ages. The slopes of performance-intensity functions increased with age, with the exception of the nonreverberant modulated masker condition. The slopes were steeper in the stationary masker conditions, where they also decreased with reverberation and distance. In the presence of a modulated masker, the slopes did not differ between the two reverberant conditions. The results of this study reveal systematic developmental changes in speech recognition in noisy and reverberant environments for elementary-school-aged children. The overall pattern suggests that younger children require better acoustic conditions to achieve sentence recognition equivalent to their older peers and adults. In addition, this is the first study to report a reduction of masking release in children as a result of reverberation. Results support the importance of minimizing noise and reverberation in classrooms, and highlight the need to incorporate noise and reverberation into audiological speech-recognition testing to improve predictions of performance in the real world.	t	\N
22743078	This work is a preliminary study that sought to investigate and develop a method for defining and evaluating "success" in paediatric cochlear implantation (PCI) and to apply a process by which a clinical team could optimally achieve this aim. A pilot group of 25 profoundly deaf children who received a unilateral cochlear implant from 1995 to 2008 was used to develop the process. The cases displayed features that are commonly encountered in PCI. Individual case records were examined retrospectively for adverse factors that might impact on the implantation outcome with particular reference to the probability and severity of impact of each factor. Case prognosis was then rated on a 1-4 basis (1: excellent, 2: good, 3: fair, 4: poor). The subsequent outcomes were assessed using standardised speech (GFW, DEAP), language (PLS-4; CELF) and vocabulary (PPVT; EVT) assessments. Auditory performance outcomes were assessed using a new Categories of Auditory Performance Index (CAPI) that incorporated criteria, testing and scoring aspects. Family issues were also evaluated. Case outcomes were rated 1-4 as above and the prognoses and outcomes were then compared. Accurate prognostication was seen in 14 cases, 5 had better outcomes than expected and 6 obtained poorer results. "Success", where the outcome equalled or exceeded the prognosis, occurred in 19 (76%) of cases. The successful group contained some "limited gains" cases where the results were nonetheless in line with expectations and parental satisfaction. The detrimental effect of delayed implantation was evident; Connexin 26 (GJB2) mutation had little influence. Poor general medical condition and adverse family situations commonly produced poorer outcomes. Success in PCI is achieved when the outcome matches or exceeds the pre-operative expectations of the well-counselled family, without adverse side effects. The assessments achieved a good success rate, but further research is required to clearly identify potential problems and a skilled team is needed to evaluate their risk to the PCI outcome. Unforseen events may also intervene. Currently, differing outcome evaluation techniques impede comparison of studies, particularly in the speech and hearing domains. Rationalisation of these is recommended to facilitate future research.	t	\N
22744139	To determine if short-term computerized speech-in-noise training can produce significant improvements in speech-in-noise perception by cochlear implant (CI) recipients on standardized audiologic testing measures. Five adult postlingually deafened CI recipients participated in 4 speech-in-noise training sessions using the Seeing and Hearing Speech program (Sensimetrics; Malden, MA). Each participant completed lessons concentrating on consonant and vowel recognition at word, phrase, and sentence levels. Speech-in-noise abilities were assessed using the QuickSIN (Killion, Niquette, Gudmundsen, Revit, & Banerjee, 2004) and the Hearing in Noise Test (HINT ( Nilsson, Soli & Sullivan, 1994)). All listeners significantly improved key word identification on the HINT after training, albeit only at the most favorable signal-to-noise ratio (SNR). Listeners also showed a significant reduction in the degree of SNR loss on the QuickSIN after training. Short-term speech-in-noise training may improve speech-in-noise perception in postlingually deafened adult CI recipients.	t	\N
22760769	Sound localization is a computational process accomplished along the auditory pathway. Once the acoustic information received at each ear is analyzed independently (monaural cues) and comparatively (binaural cues), those cues are integrated to generate a coherent spatial percept. Using adult ferrets trained by positive conditioning in a spatial task, we aimed to study the role of the auditory cortex in the ability to localize sounds under both normal hearing and monaurally occluded conditions, the latter of which requires a reinterpretation of the values of the localization cues. Sound localization deficits were found after lesion or inactivation of the different auditory cortical regions, thereby indicating their participation in spatial processing. The differential impairments found in the approach-to-target and in the head movement responses reveal the complex relationship between cortex and midbrain which are putatively responsible for the voluntary and reflexive aspects of localization behaviour respectively. Furthermore, every auditory cortical region contributes to the adaptation process that follows monaural occlusion, indicating the key role that the auditory cortex plays in experience-dependent plasticity. Also, the selective lesion of the descending projections from the auditory cortex to the inferior colliculus by chromophore-targeted laser photolysis has revealed the essential function that descending pathways play in learning-induced localization plasticity.	t	\N
22772879	Synesthesia is an unusual condition in which stimulation of one sensory modality causes an experience in another sensory modality or when a sensation in one sensory modality causes another sensation within the same modality. We describe a previously unreported association of auditory-olfactory synesthesia coexisting with auditory-visual synesthesia. Given that many types of synesthesias involve vision, it is important that the clinician provide these patients with the necessary information and support that is available.	t	\N
22773777	Many neurons adapt their spike output to accommodate the prevailing sensory environment. Although such adaptation is thought to improve coding of relevant stimulus features, the relationship between adaptation at the neural and behavioral levels remains to be established. Here we describe improved discrimination performance for an auditory spatial cue (interaural time differences, ITDs) following adaptation to stimulus statistics. Physiological recordings in the midbrain of anesthetized guinea pigs and measurement of discrimination performance in humans both demonstrate improved coding of the most prevalent ITDs in a distribution, but with highest accuracy maintained for ITDs corresponding to frontal locations, suggesting the existence of a fovea for auditory space. A biologically plausible model accounting for the physiological data suggests that neural tuning is stabilized by inhibition to maintain high discriminability for frontal locations. The data support the notion that adaptive coding in the midbrain is a key element of behaviorally efficient sound localization in dynamic acoustic environments.	t	\N
22779451	Animals live in cluttered auditory environments, where sounds arrive at the two ears through several paths. Reflections make sound localization difficult, and it is thought that the auditory system deals with this issue by isolating the first wavefront and suppressing later signals. However, in many situations, reflections arrive too early to be suppressed, for example, reflections from the ground in small animals. This paper examines the implications of these early reflections on binaural cues to sound localization, using realistic models of reflecting surfaces and a spherical model of diffraction by the head. The fusion of direct and reflected signals at each ear results in interference patterns in binaural cues as a function of frequency. These cues are maximally modified at frequencies related to the delay between direct and reflected signals, and therefore to the spatial location of the sound source. Thus, natural binaural cues differ from anechoic cues. In particular, the range of interaural time differences is substantially larger than in anechoic environments. Reflections may potentially contribute binaural cues to distance and polar angle when the properties of the reflecting surface are known and stable, for example, for reflections on the ground.	t	\N
22779484	Previous studies have shown a loss in the precision of horizontal localization responses of older hearing-impaired (HI) individuals, along with potentially poorer neural representations of sound-source location. These deficits could be the result or corollary of greater difficulties in discriminating spatial images, and the insensitivity to punctate sound sources. This hypothesis was tested in three headphone-presentation experiments varying interaural coherence (IC), the cue most associated with apparent auditory source width. First, thresholds for differences in IC were measured for a broad sampling of participants. Older HI participants were significantly worse at discriminating IC across reference values than younger normal-hearing participants. These results are consistent with senescent increases in temporal jitter. Performance decreased with age, a finding corroborated in a second discrimination experiment using a separate group of participants matched for hearing loss. This group also completed a third, visual experiment, with both a cross-mapping task where they drew the size of the sound they heard and the identification task where they chose the image that best corresponded to what they heard. The results from the visual tasks indicate that older HI individuals do not hear punctate images and are relatively insensitive to changes in width based on IC.	t	\N
22783631	This study investigates the phonetic implementation of stress in American English compounds by measuring the interaction of stress cues with different intonation patterns. Participants in an experiment produced compounds and phrases such as greenhouse and green house in different prosodic positions and sentence types to elicit the contrast in a variety of intonational environments. Intonation patterns were labeled and carefully controlled for, and measurements of vowel duration, intensity (dB) and pitch (in semitones) were compared. The results of the experiment reveal clear patterns that distinguish compounds from phrases, and provide acoustic evidence of the characteristic prominence pattern for adjective + noun compounds. Intensity is a significant cue for compound stress in all but the rising intonation environments, such as questions. Duration differences are a reliable cue in exactly this intonational environment, and also in certain clause-final intonation patterns, which similarly contain a high boundary tone. Distinctive pitch patterns were also found. The results suggest that interactions with the intonational and prosodic environment determine which phonetic cues will signal the stress pattern of a compound in a given utterance.	t	\N
22786953	Auditory spatial perception plays a critical role in day-to-day communication. For instance, listeners utilize acoustic spatial information to segregate individual talkers into distinct auditory "streams" to improve speech intelligibility. However, spatial localization is an exceedingly difficult task in everyday listening environments with numerous distracting echoes from nearby surfaces, such as walls. Listeners' brains overcome this unique challenge by relying on acoustic timing and, quite surprisingly, visual spatial information to suppress short-latency (1-10 ms) echoes through a process known as "the precedence effect" or "echo suppression." In the present study, we employed electroencephalography (EEG) to investigate the neural time course of echo suppression both with and without the aid of coincident visual stimulation in human listeners. We find that echo suppression is a multistage process initialized during the auditory N1 (70-100 ms) and followed by space-specific suppression mechanisms from 150 to 250 ms. Additionally, we find a robust correlate of listeners' spatial perception (i.e., suppressing or not suppressing the echo) over central electrode sites from 300 to 500 ms. Contrary to our hypothesis, vision's powerful contribution to echo suppression occurs late in processing (250-400 ms), suggesting that vision contributes primarily during late sensory or decision making processes. Together, our findings support growing evidence that echo suppression is a slow, progressive mechanism modifiable by visual influences during late sensory and decision making stages. Furthermore, our findings suggest that audiovisual interactions are not limited to early, sensory-level modulations but extend well into late stages of cortical processing.	t	\N
22790547	Children and adolescents who live in situations of social vulnerability present a series of health problems. Nonetheless, affirmations that sensory and cognitive abnormalities are present are a matter of controversy. The aim of this study was to investigate aspects to auditory processing, through applying the brainstem auditory evoked potential (BAEP) and behavioral auditory processing tests to children living on the streets, and comparison with a control group. Cross-sectional study in the Laboratory of Auditory Processing, School of Medicine, Universidade de São Paulo. The auditory processing tests were applied to a group of 27 individuals, subdivided into 11 children (7 to 10 years old) and 16 adolescents (11 to 16 years old), of both sexes, in situations of social vulnerability, compared with an age-matched control group of 10 children and 11 adolescents without complaints. The BAEP test was also applied to investigate the integrity of the auditory pathway. For both children and adolescents, there were significant differences between the study and control groups in most of the tests applied, with significantly worse performance in the study group, except in the pediatric speech intelligibility test. Only one child had an abnormal result in the BAEP test. The results showed that the study group (children and adolescents) presented poor performance in the behavioral auditory processing tests, despite their unaltered auditory brainstem pathways, as shown by their normal results in the BAEP test.	t	\N
22813575	To separate neural signals from noise, brain responses measured in neuroimaging are routinely averaged across space and time. However, such procedures may obscure some properties of neural activity. Recently, multi-voxel pattern analysis methods have demonstrated that patterns of activity across voxels contain valuable information that is concealed by spatial averaging. Here we show that temporal patterns of neural activity contain information that can discriminate different stimuli, even within brain regions that show no net activation to that stimulus class. Furthermore, we find that in many brain regions, responses to natural stimuli are highly context dependent. In such cases, prototypical event-related responses do not even exist for individual stimuli, so that averaging responses to the same stimulus within different contexts may worsen the effective signal-to-noise. As a result, analysis of the temporal structures of single events can reveal aspects of neural dynamics which cannot be detected using standard event-related averaging methods.	t	\N
22821077	The decision about which location should be the goal of the next eye movement is known to be determined by the interaction between auditory and visual input. This interaction can be explained by the vector theory that states that each element (either visual or auditory) in a scene evokes a vector in the oculomotor system. These vectors determine the direction in which the eye movement is initiated. Because auditory input is lateralized and localizable in most studies, it is currently unclear how non-lateralized auditory input interacts with the vectors evoked by visual input. In the current study, we investigated the influence of a non-lateralized auditory non-target on saccade accuracy (saccade angle deviation from the target) and latency in a single-target condition in Experiment 1 and a double-target condition in Experiment 2. The visual targets in Experiment 2 were positioned in such a way that saccades on average landed in between the two targets (i.e., a global effect). There was no effect of the auditory input on saccade accuracy in the single-target condition, but auditory input did influence saccade accuracy in the double-target condition. In both experiments, saccade latency increased when auditory input accompanied the visual target(s). Together, these findings show that non-lateralized auditory input enhances all vectors evoked by visual input. The results will be discussed in terms of their possible neural substrates.	t	\N
22834336	The coherency characteristics of the brain electrical activity were studied in two groups of subjects: 1) with high and 2) low indexes of "emotional ear" (an ability to successfully recognize emotions in the speech). Comparison of the coherency links between two groups of subjects permitted the authors to make a conclusion that the persons of the first group had a much lower coherency level, especially in the alpha- and gamma-rhythms. The subjects of the second group were characterized by the opposite tendency: an increase in coherent links on the majority of frequency bands.	t	\N
22836173	Functional networks in the human brain give rise to complex cognitive and perceptual abilities. While the decrease of functional connectivity is linked to neurological and psychiatric disorders, less is known about the consequences of increased functional connectivity. One population that has exceptionally enhanced perceptual abilities is people with absolute pitch (AP) - an ability to categorize tones into pitch classes without reference. AP has been linked to exceptional talent as well as to psychiatric and neurological conditions. Here we show that AP possessors have increased functional activation during music listening, as well as increased degrees, clustering, and local efficiency of functional correlations, with the difference being highest around the left superior temporal gyrus. Our results provide the first evidence that increased functional connectivity in a small-world brain network is related to exceptional perceptual abilities in a healthy population.	t	\N
22842471	Task-irrelevant perceptual learning (TIPL) refers to the phenomenon where the stimulus features of a subject's task are learned when they are consistently presented at times when behaviorally relevant events occur. In this article, we addressed two points concerning TIPL. First, we address the question, are all behaviorally relevant events equal in their impact on encoding processes? Second, we address the hypothesis that TIPL involves mechanisms of the alerting attentional system. Two experiments of fast-TIPL were conducted in which the attentional state of participants was manipulated by using an alerting cue (visual or auditory) that informed participants of the arrival of an upcoming target. Images were presented with task-related stimuli (cues, targets and distractors) and subjects were tested on their memory of those images. Results indicate that memory for target-paired images was enhanced and cue-paired images were suppressed relative to that of distractor-paired images. The alerting cue increased the ability to recall target-paired images presented after this cue, although this result depended on the proportion of cued trials in a session. These results demonstrate a complex interplay between task-elements and the encoding of stimuli paired with them where both enhancement and suppression of task-paired stimuli can be found depending whether those stimuli are paired with task-targets or cues.	t	\N
22848464	When a second sound follows a long first sound, its location appears to be perceived away from the first one (the localization/lateralization aftereffect). This aftereffect has often been considered to reflect an efficient neural coding of sound locations in the auditory system. To understand determinants of the localization aftereffect, the current study examined whether it is induced by an interaural temporal difference (ITD) in the amplitude envelope of high frequency transposed tones (over 2 kHz), which is known to function as a sound localization cue. In Experiment 1, participants were required to adjust the position of a pointer to the perceived location of test stimuli before and after adaptation. Test and adapter stimuli were amplitude modulated (AM) sounds presented at high frequencies and their positional differences were manipulated solely by the envelope ITD. Results showed that the adapter's ITD systematically affected the perceived position of test sounds to the directions expected from the localization/lateralization aftereffect when the adapter was presented at ±600 µs ITD; a corresponding significant effect was not observed for a 0 µs ITD adapter. In Experiment 2, the observed adapter effect was confirmed using a forced-choice task. It was also found that adaptation to the AM sounds at high frequencies did not significantly change the perceived position of pure-tone test stimuli in the low frequency region (128 and 256 Hz). The findings in the current study indicate that ITD in the envelope at high frequencies induces the localization aftereffect. This suggests that ITD in the high frequency region is involved in adaptive plasticity of auditory localization processing.	t	\N
22858614	The purpose of this evidence-based systematic review was to evaluate the efficacy of digital noise reduction and directional microphones for outcome measures of audibility, speech recognition, speech and language, and self- or parent-report in pediatric hearing aid users. The authors searched 26 databases for experimental studies published after 1980 addressing one or more clinical questions and meeting all inclusion criteria. The authors evaluated studies for methodological quality and reported or calculated p values and effect sizes when possible. A systematic search of the literature resulted in the inclusion of 4 digital noise reduction and 7 directional microphone studies (in 9 journal articles) that addressed speech recognition, speech and language, and/or self- or parent-report outcomes. No digital noise reduction or directional microphone studies addressed audibility outcomes. On the basis of a moderate level of evidence, digital noise reduction was not found to improve or degrade speech understanding. Additional research is needed before conclusions can be drawn regarding the impact of digital noise reduction on important speech, language, hearing, and satisfaction outcomes. Moderate evidence also indicates that directional microphones resulted in improved speech recognition in controlled optimal settings; however, additional research is needed to determine the effectiveness of directional microphones in actual everyday listening environments.	t	\N
22859982	Pitch processing is a critical ability on which humans' tonal musical experience depends, and which is also of paramount importance for decoding prosody in speech. Congenital amusia refers to deficits in the ability to properly process musical pitch, and recent evidence has suggested that this musical pitch disorder may impact upon the processing of speech sounds. Here we present the first electrophysiological evidence demonstrating that individuals with amusia who speak Mandarin Chinese are impaired in classifying prosody as appropriate or inappropriate during a speech comprehension task. When presented with inappropriate prosody stimuli, control participants elicited a larger P600 and smaller N100 relative to the appropriate condition. In contrast, amusics did not show significant differences between the appropriate and inappropriate conditions in either the N100 or the P600 component. This provides further evidence that the pitch perception deficits associated with amusia may also affect intonation processing during speech comprehension in those who speak a tonal language such as Mandarin, and suggests music and language share some cognitive and neural resources.	t	\N
22866760	Many studies have examined the processes involved in recognizing types of human action through sound, but little is known about whether the physical characteristics of an action (such as kinetic and kinematic parameters) can be perceived and imitated from sound. Twelve young healthy adults listened to recordings of footsteps on a gravel path taken from walks of different stride lengths (SL) and cadences. In 1 protocol, participants performed a real-time reenactment of the walking action depicted in a sound sample. Second, participants listened to 2 different sound samples and discriminated differences in SL. In a 2nd experiment, these procedures were repeated using synthesized sounds derived from the kinetic interactions between the foot and walking surface. A 3rd experiment examined the influence of altered cadence on participants' ability to discriminate changes in SL. Participants significantly adapted their own SL and cadence according to those depicted in both real and synthesized sounds (p < .01). However, although participants accurately discriminated between large changes in SL, these perceptions were heavily influenced by temporal factors, that is, when cadence changed between samples. These findings show that spatial attributes of action sounds can be both mimicked and discriminated, even when only basic kinetic interactions present within the action are specified.	t	\N
22875083	The aim of the current study was to differentiate between neural activity that represents neural anomalies that are responsible for persistent developmental stuttering (PDS) from the activity that is a result of compensating for stuttering. This was done by investigating alterations to the intrinsic functional architecture of speech-language processes of patients with PDS before and after a short-term intervention. The resting-state functional connectivity (RSFC) and cortical thickness were examined before and after the intervention. The structural data were used to validate the functional results. Fifteen stuttering patients who received intervention (PDS+), 13 stuttering patients who did not receive intervention (PDS-), and 13 fluent controls participated. Before the intervention, both groups of PDS patients showed significant RSFC and cortical thickness reductions in the left pars-opercularis (PO) and RSFC increases in the cerebellum, as compared to fluent controls. The intervention was effective in reducing stuttering in PDS+ patients and lowering their RSFC in the cerebellum to the level of fluent controls. The intervention effect was specific to the PDS+ group (it was not evident in the PDS- group). The intervention did not change RSFC and cortical thickness in the left PO, which remained at its preintervention level. The results suggest that the left PO is a locus where the intrinsic functional architecture of speech-language processes is altered in PDS patients, suggesting an etiologic role of this region in PDS. The cerebellum showed intervention-induced neural reorganization, suggesting a compensatory response when stuttering occurs.	t	\N
22889186	Two experiments are reported concerning the perception of ground extent to discover whether prior reports of anisotropy between frontal extents and extents in depth were consistent across different measures (visual matching and pantomime walking) and test environments (outdoor environments and virtual environments). In Experiment 1 it was found that depth extents of up to 7 m are indeed perceptually compressed relative to frontal extents in an outdoor environment, and that perceptual matching provided more precise estimates than did pantomime walking. In Experiment 2, similar anisotropies were found using similar tasks in a similar (but virtual) environment. In both experiments pantomime walking measures seemed to additionally compress the range of responses. Experiment 3 supported the hypothesis that range compression in walking measures of perceived distance might be due to proactive interference (memory contamination). It is concluded that walking measures are calibrated for perceived egocentric distance, but that pantomime walking measures may suffer range compression. Depth extents along the ground are perceptually compressed relative to frontal ground extents in a manner consistent with the angular scale expansion hypothesis.	t	\N
22894208	Artificial bandwidth extension methods have been developed to improve the quality and intelligibility of narrowband telephone speech and to reduce the difference with wideband speech. Such methods have commonly been evaluated with objective measures or subjective listening-only tests, but conversational evaluations have been rare. This article presents a conversational evaluation of two methods for the artificial bandwidth extension of telephone speech. Bandwidth-extended narrowband speech is compared with narrowband and wideband speech in a test setting including a simulated telephone connection, realistic conversation tasks, and various background noise conditions. The responses of the subjects indicate that speech processed with one of the methods is preferred to narrowband speech in noise, but wideband speech is superior to both narrowband and bandwidth-extended speech. Bandwidth extension was found to be beneficial for telephone conversation in noisy listening conditions.	t	\N
22894315	Studies of spoken-word recognition have revealed that competition from embedded words differs in strength as a function of where in the carrier word the embedded word is found and have further shown embedding patterns to be skewed such that embeddings in initial position in carriers outnumber embeddings in final position. Lexico-statistical analyses show that this skew is highly attenuated in Japanese, a noninflectional language. Comparison of the extent of the asymmetry in the three Germanic languages English, Dutch, and German allows the source to be traced to a combination of suffixal morphology and vowel reduction in unstressed syllables.	t	\N
22897231	When asked how many animals of each kind Moses took on the Ark, most people respond with "two" despite the substituted name (Moses for Noah) in the question. Possible explanations for semantic illusions appear to be related to processing limitations such as those of working memory. Indeed, individual working memory capacity has an impact upon how sentences containing substitutions are processed. This experiment examined further the role of working memory in the occurrence of semantic illusions using a dual-task working memory load approach. Participants verified statements while engaging in either articulatory suppression or random number generation. Secondary task type had a significant effect on semantic illusion rate, but only when comparing the control condition to the two dual-task conditions. Furthermore, secondary task performance in the random number generation condition declined, suggesting a tradeoff between tasks. Response time analyses also showed a different pattern of processing across the conditions. The findings suggest that the phonological loop plays a role in representing semantic illusion sentences coherently and in monitoring for details, while the role of the central executive is to assist gist-processing of sentences. This usually efficient strategy leads to error in the case of semantic illusions.	t	\N
22915198	Scalar Expectancy Theory (SET) has been the leading theory in timing research, and has also influenced research into human timing. However, other timing theories exist, such as Learning to Time (LeT). The double bisection task was designed to test the SET and LeT theories in pigeons. The purpose of this experiment was to verify whether similar results emerge from a human adaptation of the double bisection task. The results indicated that humans perform the double bisection task in the same way as pigeons do. However, the assumptions inherent in LeT cannot be applied to humans. Two other explanations are also assessed here.	t	\N
22915276	A hearing screening among 14- to 15-year-old pupils was performed to estimate the number of hearing-impaired individuals in the adolescent population. A total of 1,298 pupils from 30 schools in Tyrol (Austria) participated in the screening. Hearing tests were performed in a silent room at the school. Sinus tones at frequencies 0.5/1/2/4/6 kHz and at levels of 25/20/20/20/20 dB HL, respectively, were delivered via headphones to either ear. Failure of screening was defined as not hearing one or more frequencies in one or both ears. The screening was failed by 16.3% of the pupils. There was a small but not significant difference between males and females (17.0 vs. 15.2%). Most of the pupils failed at only one frequency (9.6%). Failing at two or more frequencies in the same ear occurred in 3.9% of the pupils, thereof in 1.1% bilaterally. As the specificity of our screening is limited, false-positive results may result. Thus, the rate of hearing deficits in our sample is probably a bit lower than indicated by the figures above. Most of the adolescents who failed the screen failed at only one frequency. These subjects have a small elevation of their hearing threshold, not a hearing loss in the sense of a raised averaged threshold. A hearing loss in the latter sense is supposed to be present in only very few percent of adolescents, a bilateral hearing loss in perhaps less than 1%.	t	\N
22918112	Current strategies for functional rehabilitation of microtia-atresia patients with bone-anchored implants or surgical atresia repair have been extended by the feasibility of active middle ear implants. The aim of the present research is to evaluate a new flowchart of the treatment of these patients that considers active middle ear implants. Congenital aural atresia and microtia. Bilateral cases are provided with a conductive hearing aid after birth and implanted with an active middle ear implant within the second year. Unilateral cases are provided with a conductive hearing aid and implanted with a middle ear or bone-conduction device in early childhood. Unilateral cases without amplification in the vulnerable time after birth are carefully selected for late implantation. At age 8 to 10, the auricular reconstruction is completed. Feasibility of implantation irrespective of age, functional gain in audiometry. The results of early implantation are as good as those previously published for adolescents. Mean reaction threshold with the Vibrant Soundbridge was 21 dB. Mean functional gain was 48 dB. The local tissues are unaltered and ready for auricular reconstruction. Active middle ear implants allow early and selective stimulation of the auditory pathway in children with congenital conductive hearing loss and are expected to lead to the normal development of the binaural hearing functions. To date, it is the only option if the stimulation is to be started at the age of 12 to 18 months. This was implemented into a new flowchart for aural atresia-microtia.	t	\N
22921291	To use a randomized design to evaluate the effectiveness of voice training programs for telemarketers via multidimensional analysis. Forty-eight telemarketers were randomly assigned to two groups: voice training group (n=14) who underwent training over an 8-week period and a nontraining control group (n=34). Before and after training, recordings of the sustained vowel /ɛ/ and connected were collected for acoustic and perceptual analyses. Based on pre- and posttraining comparisons, the voice training group presented with a significant reduction in percent jitter (P=0.044). No other significant differences were observed, and inter-rater reliability varied from poor to fair. These findings suggest that voice training improved a single acoustic dimension, but do not change perceptual dimension of telemarketers' voices.	t	\N
22921292	This study investigated whether listener's experience, presence/absence of vibrato, formant frequency difference, or onset delay affect the ability of experienced and inexperienced listeners to segregate complex vocal stimuli. Repeated measures factorial design. Two sets of stimuli were constructed: one with no vibrato and another with vibrato. For each set, each stimulus was synthesized at four pitches: A3, E4, B4, and F5. Stimuli were synthesized using formant patterns appropriate for the vowel |ɑ|. Frequencies for formants one through four were systematically varied from lower to higher in an attempt to simulate the acoustic results of corresponding changes in vocal tract length. Four formant patterns were synthesized (patterns A-D). Three pairs were created at each pitch, pairing the formants AB (mezzo-soprano/mezzo-soprano), CD (soprano/soprano), and AD (mezzo-soprano/soprano). Each of these three pairs was constructed in three separate conditions: simultaneous onset; the first voice in the pair with an onset delay of 100 milliseconds; and the second voice in the pair with an onset delay of 100 milliseconds. Using a scroll bar, listeners rated how difficult it was for them to hear each stimulus pair as two separate voices. The most difficult combinations to segregate were produced with no vibrato and used simultaneous onset. The easiest conditions to segregate were combinations including a "soprano-like" formant pattern (D) in the vibrato condition. Overall, listener's experience did not affect the perceived difficulty of segregation; however, in the presence of vibrato cues, inexperienced listeners did not use delay cues as an aid in segregation in the same manner as did experienced listeners. Once vibrato was removed from the experimental context, inexperienced listeners were able to use delay to aid in segregation in a similar manner to experienced listeners. Presence/absence of vibrato, formant pattern difference, and onset delay interact in a complex manner to affect the perceived difficulty of voice segregation.	t	\N
22921368	In spite of its massively parallel architecture [1], the human brain is fundamentally limited if required to perform two tasks at the same time [2, 3]. This limitation can be studied with the psychological refractory period (PRP) paradigm, where two stimuli that require speeded responses occur in close succession [4]. Interference generally takes the form of a delay in the time to respond to the second stimulus [5]. Previous studies suggested that sensory decisions require the accumulation of sensory evidence [6, 7] and that the PRP reflects the inability to form more than one decision at a time [4, 8]. In the present study, we used a psychophysical reverse-correlation technique [9, 10] to measure the time-course of evidence accumulation during the PRP. We found that the accumulation of evidence could occur during the PRP albeit with a reduced efficiency, which implies that multiple decision processes can occur in parallel in the human brain. In addition to the reduced efficiency of evidence accumulation, our results uncover an additional delay in the routing of the decision to motor structures during the PRP, which implies that the process of sensory decision making is separable from the preparation of a motor response [11-13].	t	\N
22921779	To evaluate the auditory outcomes of children implanted in an ear with eighth nerve hypoplasia or aplasia and to determine whether preoperative characteristics are predictive of auditory performance achieved. retrospective case review. tertiary care medical center. ten children implanted in an ear with eighth nerve hypoplasia or aplasia, as determined by high resolution magnetic resonance imaging. Neural response test measurements, detection and speech awareness thresholds, Meaningful Auditory Integration Scale scores, as well as speech perception level achieved. Post-implantation, three children demonstrated little to no detection of sound, three had improved detection and awareness of environmental sounds, one developed closed-set speech perception and spoken language, and three developed open-set speech perception and spoken language. No imaging findings appeared related to outcomes. Significantly better implant detection thresholds and Meaningful Auditory Integration Scale scores were found in children who had preoperative aided auditory detection (p's ≤ 0.02-0.05). Some children with eighth nerve hypoplasia or aplasia may derive significant benefit from a cochlear implant. In our study high resolution magnetic resonance imaging was more sensitive than high resolution computer tomography in detecting neural deficiency. However, no imaging findings were predictive of auditory performance level achieved post-implantation.	t	\N
22928919	To investigate speech recognition performance in noise with bilateral open-fit hearing aids and as reference also with closed earmolds, in omnidirectional mode, directional mode, and directional mode in conjunction with noise reduction. A within-subject design with repeated measures across conditions was used. Speech recognition thresholds in noise were obtained for the different conditions. Twenty adults without prior experience with hearing aids. All had symmetric sensorineural mild hearing loss in the lower frequencies and moderate to severe hearing loss in the higher frequencies. Speech recognition performance in noise was not significantly better with an omnidirectional microphone compared to unaided, whereas performance was significantly better with a directional microphone (1.6 dB with open fitting and 4.4 dB with closed earmold) compared to unaided. With open fitting, no significant additional advantage was obtained by combining the directional microphone with a noise reduction algorithm, but with closed earmolds a significant additional advantage of 0.8 dB was obtained. The significant, though limited, advantage of directional microphones and the absence of additional significant improvement by a noise reduction algorithm should be considered when fitting open-fit hearing aids.	t	\N
22928962	In synaesthetes, stimulation of one sensory pathway provokes a sensory experience (e.g. a colour concurrent) in a different sensory modality or sub-modality. Results of synaesthetic Stroop and priming tests indicate that the perception of a colour concurrent interferes with the processing of a veridical colour in synaesthetes. We here examined the congruency between a stimulus' colour and the colour concurrent both in grapheme-colour synaesthetes and in non-synaesthetes trained on grapheme-colour associations. Electrophysiological (electroencephalogram) and behavioural measurements were collected during a priming task that included grapheme-grapheme and grapheme-colour patch pairs. To investigate covert bidirectional synaesthesia, an additional inverted colour patch-grapheme condition was included. Both groups of participants showed longer reaction time and more negative-going N300 and N400 event-related potential (ERP) components on incongruent trials. Whereas ERP effects in the non-synaesthetes were largely confined to the late cognitive components N300, P300 and N400, the synaesthetes also showed congruency-dependent modulation of the early sensory component N170. Our results suggest that early cognitive processes distinguish cross-modal synaesthetic perceptions from acquired associations. The involvement of both early- and late-stage cognitive components in bidirectional synaesthesia possibly indicates similar feature-binding mechanisms during processing of opposite flow directions of information, namely grapheme-colour and colour-grapheme.	t	\N
22937153	In everyday life, we need a capacity to flexibly shift attention between alternative sound sources. However, relatively little work has been done to elucidate the mechanisms of attention shifting in the auditory domain. Here, we used a mixed event-related/sparse-sampling fMRI approach to investigate this essential cognitive function. In each 10-sec trial, subjects were instructed to wait for an auditory "cue" signaling the location where a subsequent "target" sound was likely to be presented. The target was occasionally replaced by an unexpected "novel" sound in the uncued ear, to trigger involuntary attention shifting. To maximize the attention effects, cues, targets, and novels were embedded within dichotic 800-Hz vs. 1500-Hz pure-tone "standard" trains. The sound of clustered fMRI acquisition (starting at t = 7.82 sec) served as a controlled trial-end signal. Our approach revealed notable activation differences between the conditions. Cued voluntary attention shifting activated the superior intra--parietal sulcus (IPS), whereas novelty-triggered involuntary orienting activated the inferior IPS and certain subareas of the precuneus. Clearly more widespread activations were observed during voluntary than involuntary orienting in the premotor cortex, including the frontal eye fields. Moreover, we found -evidence for a frontoinsular-cingular attentional control network, consisting of the anterior insula, inferior frontal cortex, and medial frontal cortices, which were activated during both target discrimination and voluntary attention shifting. Finally, novels and targets activated much wider areas of superior temporal auditory cortices than shifting cues.	t	\N
22938066	There are different reports of the usefulness of the Nasometer™ as a complement to listening, often as correlation calculations between listening and nasalance measurements. Differences between findings have been attributed to listener experience and types of speech stimuli. To compare nasalance scores from the Nasometer with perceptual assessments, for the same and different Swedish speech stimuli, using three groups of listeners with differing levels of experience in judging speech nasality. To compare nasalance scores and blinded listener ratings of randomized recordings using three groups of listeners and two groups of speakers. Speakers were either classified as having hypernasal speech or speech with typical speech resonance. Listeners were speech-language pathologists (SLPs) working predominantly with resonance disorders, other SLPs and untrained listeners. Correlations (r(s)) between hypernasality ratings and nasalance scores for each listener group and speech stimuli were calculated. For both groups of SLPs all correlations between perceptual ratings and nasalance scores were significant at p= 0.01. The correlations between the nasalance scores and ratings by listeners in the SLP groups were higher than those for the untrained listener group regardless of stimulus type. Post-hoc Mann-Whitney U-tests showed that the only difference that was significant was expert SLP group versus untrained listener group. Secondly, correlations between perceptual ratings and oral stimulus nasalance scores were higher when the perceptual ratings were based on spontaneous speech rather than on the oral stimulus. However, a Wilcoxon signed rank test showed that the difference was not significant. A third finding was that correlations between oral stimulus nasalance scores and perceptual scores were higher than those between mixed stimulus nasalance scores and perceptual scores. A Wilcoxon signed rank test showed that the difference was significant. The Nasometer might be useful for the SLP with limited experience in assessing resonance disorders in differentiating between hyper- and hyponasality. With listener reliability for ratings of hypernasality still being an issue, the use of a nasalance score as a complement to the perceptual evaluation will also aid the expert SLP. It will give an alternative way of quantifying speech resonance and might help in especially hard to judge cases.	t	\N
22944369	Previous studies have demonstrated amygdala activation in response to fearful faces even if presented below the threshold of conscious visual perception. It has also been proposed that subcortical regions are selectively sensitive to low spatial frequency (LSF) information. However, chronic hyperarousal may reduce amygdala activation in panic disorder (PD). Our aim was to establish whether the amygdala is engaged by masked and LSF fearful faces in PD as compared to healthy subjects. Neutral faces were used as the mask stimulus. Thirteen PD patients (seven females, six males; mean age=29.1 (S.D: 5.9)) and 15 healthy volunteers (seven females, eight males; mean age=27.9 (S.D. 4.5)) underwent two passive viewing tasks during a 3T functional magnetic resonance imaging (fMRI) as follows: 1) presentation of faces with fearful versus neutral expressions (17ms) using a backward masking procedure and 2) presentation of the same faces whose spatial frequency contents had been manipulated by low-pass filtering. Level of awareness was confirmed by a forced choice fear-detection task. Whereas controls showed bilateral activation to fearful masked faces versus neutral faces, patients failed to show activation within the amygdala. LSF stimuli did not elicit amygdala response in either group, contrary to the view that LSF information plays a crucial role in the processing of facial expressions in the amygdala. Findings suggest maladaptive amygdala responses to potentially threatening visual stimuli in PD patients.	t	\N
22962856	It has been suggested that high alexithymia scorers have an 'augmenter' profile which amplifies their physiological and subjective responses to highly arousing stimuli. The aim of this study was to test this theory using several physiological measures. Participants listened to musical excerpts either in a 'weak-to-strong' or a 'strong-to-weak' order of arousing levels of stimuli. The results show that alexithymia was associated with an augmenter profile for subjective reports for the most arousing stimulus and with stronger skin conductance level responses in the 'strong-to-weak' order. These results partially support the augmenter profile and reveal that alexithymia may be associated with higher anticipation for the most arousing excerpt.	t	\N
22970141	Extensive research shows that inter-talker variability (i.e., changing the talker) affects recognition memory for speech signals. However, relatively little is known about the consequences of intra-talker variability (i.e. changes in speaking style within a talker) on the encoding of speech signals in memory. It is well established that speakers can modulate the characteristics of their own speech and produce a listener-oriented, intelligibility-enhancing speaking style in response to communication demands (e.g., when speaking to listeners with hearing impairment or non-native speakers of the language). Here we conducted two experiments to examine the role of speaking style variation in spoken language processing. First, we examined the extent to which clear speech provided benefits in challenging listening environments (i.e. speech-in-noise). Second, we compared recognition memory for sentences produced in conversational and clear speaking styles. In both experiments, semantically normal and anomalous sentences were included to investigate the role of higher-level linguistic information in the processing of speaking style variability. The results show that acoustic-phonetic modifications implemented in listener-oriented speech lead to improved speech recognition in challenging listening conditions and, crucially, to a substantial enhancement in recognition memory for sentences.	t	\N
22978875	Simulated room impulse responses have been proven to be both useful and indispensable for comprehensive testing of acoustic signal processing algorithms while controlling parameters such as the reverberation time, room dimensions, and source-array distance. In this work, a method is proposed for simulating the room impulse responses between a sound source and the microphones positioned on a spherical array. The method takes into account specular reflections of the source by employing the well-known image method, and scattering from the rigid sphere by employing spherical harmonic decomposition. Pseudocode for the proposed method is provided, taking into account various optimizations to reduce the computational complexity. The magnitude and phase errors that result from the finite order spherical harmonic decomposition are analyzed and general guidelines for the order selection are provided. Three examples are presented: an analysis of a diffuse reverberant sound field, a study of binaural cues in the presence of reverberation, and an illustration of the algorithm's use as a mouth simulator.	t	\N
22985274	Whether the mechanisms giving rise to pitch reflect spectral or temporal processing has long been debated. Generally, sounds having strong harmonic structures in their spectra have strong periodicities in their temporal structures. We found that when a wideband harmonic tone complex is passed through a noise vocoder, the resulting sound can have a harmonic structure with a large peak-to-valley ratio, but with little or no periodicity in the temporal structure. To test the role of harmonic structure in pitch perception for a nonhuman mammal, we measured behavioral responses to noise-vocoded tone complexes in chinchillas (Chinchilla laniger) using a stimulus generalization paradigm. Chinchillas discriminated either a harmonic tone complex or an iterated rippled noise from a 1-channel vocoded version of the tone complex. When tested with vocoded versions generated with 8, 16, 32, 64, and 128 channels, responses were similar to those of the 1-channel version. Behavioral responses could not be accounted for based on harmonic peak-to-valley ratio as the acoustic cue, but could be accounted for based on temporal properties of the autocorrelation functions such as periodicity strength or the height of the first peak. The results suggest that pitch perception does not arise through spectral processing in nonhuman mammals but rather through temporal processing. The conclusion that spectral processing contributes little to pitch in nonhuman mammals may reflect broader cochlear tuning than that described in humans.	t	\N
22986018	Structural priming paradigms have been influential in shaping theories of adult sentence processing and theories of syntactic development. However, until recently there have been few attempts to provide an integrated account that explains both adult and developmental data. The aim of the present paper was to begin the process of integration by taking a developmental approach to structural priming. Using a dialog comprehension-to-production paradigm, we primed participants (3-4year olds, 5-6year olds and adults) with double object datives (Wendy gave Bob a dog) and prepositional datives (Wendy gave a dog to Bob). Half the participants heard the same verb in prime and target (e.g. gave-gave) and half heard a different verb (e.g. sent-gave). The results revealed substantial differences in the magnitude of priming across development. First, there was a small but significant abstract structural priming effect across all age groups, but this effect was larger in younger children than in older children and adults. Second, adding verb overlap between prime and target prompted a large, significant increase in the priming effect in adults (a lexical boost), a small, marginally significant increase in the older children and no increase in the youngest children. The results support the idea that abstract syntactic knowledge can develop independently of verb-specific frames. They also support the idea that different mechanisms may be needed to explain abstract structural priming and lexical priming, as predicted by the implicit learning account (Bock, K., & Griffin, Z. M. (2000). The persistence of structural priming: Transient activation or implicit learning? Journal of Experimental Psychology - General, 129(2), 177-192). Finally, the results illustrate the value of an integrative developmental approach to both theories of adult sentence processing and theories of syntax acquisition.	t	\N
22989579	The perception of a melody is invariant to the absolute properties of its constituting notes, but depends on the relation between them-the melody's relative pitch profile. In fact, a melody's "Gestalt" is recognized regardless of the instrument or key used to play it. Pitch processing in general is assumed to occur at the level of the auditory cortex. However, it is unknown whether early auditory regions are able to encode pitch sequences integrated over time (i.e., melodies) and whether the resulting representations are invariant to specific keys. Here, we presented participants different melodies composed of the same 4 harmonic pitches during functional magnetic resonance imaging recordings. Additionally, we played the same melodies transposed in different keys and on different instruments. We found that melodies were invariantly represented by their blood oxygen level-dependent activation patterns in primary and secondary auditory cortices across instruments, and also across keys. Our findings extend common hierarchical models of auditory processing by showing that melodies are encoded independent of absolute pitch and based on their relative pitch profile as early as the primary auditory cortex.	t	\N
22989871	The function of consciousness was explored in two contexts of audio-visual speech, cross-modal visual attention guidance and McGurk cross-modal integration. Experiments 1, 2, and 3 utilized a novel cueing paradigm in which two different flash suppressed lip-streams cooccured with speech sounds matching one of these streams. A visual target was then presented at either the audio-visually congruent or incongruent location. Target recognition differed for the congruent versus incongruent trials, and the nature of this difference depended on the probabilities of a target appearing at these respective locations. Thus, even though the lip-streams were never consciously perceived, they were nevertheless meaningfully integrated with the consciously perceived sounds, and participants learned to guide their attention according to statistical regularities between targets and these unconsciously perceived cross-modal cues. In Experiment 4, McGurk stimuli were presented in which the lip-streams were either flash suppressed (4a) or unsuppressed (4b), and the McGurk effect was found to vanish under conditions of flash suppression. Overall, these results suggest a simple yet fundamental principle regarding the function of consciousness in multisensory integration - cross-modal effects can occur in the absence of consciousness, but the influencing modality must be consciously perceived for its information to cross modalities.	t	\N
22989933	Distraction is a strategy that is commonly used to cope with pain. Results concerning the efficacy of distraction from both experimental and clinical studies are variable, however, and indicate that its efficacy may depend on particular circumstances. Several models propose that distraction may be less effective for people who display a large attentional bias towards pain-related information. This hypothesis was tested in an experimental context with 53 pain-free volunteers. First, attentional bias towards cues signalling the occurrence of pain (electrocutaneous stimuli) and towards words describing the sensory experience of this painful stimulus was independently assessed by means of 2 behavioural paradigms (respectively, spatial cueing task and dot-probe task). This was followed by a subsequent distraction task during which the efficacy of distraction, by directing attention away from the electrocutaneous stimuli, was tested. In addition, state-trait anxiety, catastrophic thinking, and initial pain intensity were measured. Results indicated that people who display a large attentional bias towards predictive cues of pain or who initially experience the pain as more painful benefit less from distraction on a subsequent test. No effects were found between attentional bias towards pain words, state-trait anxiety, catastrophic thinking, and the efficacy of distraction. Current findings suggest that distraction should not be used as a 'one size fits all' method to control pain, but only under more specific conditions.	t	\N
22992016	The present study showed that elevation of bone-conduction (BC) thresholds at low frequencies might be a characteristic audiometric finding in cases with otitis media with effusion (OME) with high pepsinogen (PG) concentrations. The objective of this study was to investigate whether there is any characteristic audiometric finding in adult cases with otitis media with high PG compared to those with low PG. Twenty-four adult patients with unilateral OME of undetermined etiology and high PG concentrations (> 500 ng/ml) in their middle ear effusions (high PG group) were selected. The air-conduction and BC thresholds of pure tone audiometry were compared between the affected and healthy ears. Results were compared to those in 23 patients with low PG concentrations (< 50 ng/ml; low PG group). The average BC difference in the threshold at 0.25 kHz between the affected ear and the healthy ear was significantly higher in the high PG group than in the low PG group, with a significantly higher proportion of patients in the high PG group having BC thresholds at 0.25 kHz in the affected ear that were ≥ 15 dB higher than in the healthy ear.	t	\N
23000014	Music is a stimulus which may give rise to a wide range of emotional and cognitive responses. Therefore, brain reactivity to music has become a focus of interest in cognitive neuroscience. It is possible that individual preference moderates the effectof music on the brain. In the present study we examined whether there are common effects of listening to music even if each subject in a sample chooses their own piece of music. We invited 18 subjects to bring along their favorite relaxing music, and their favourite stimulating music. Additionally, a condition with tactile stimulation on the foot and a baseline condition (rest) without stimulation were used. The tactile stimulation was chosen to provide a simple, non-auditory condition which would be identical for all subjects. The electroencephalogram was recorded for each of the 3 conditions and during rest. We found responses in the alpha range mainly on parietal and occipital sites that were significant compared to baseline in 13 subjects during relaxing music, 15 subjects during activating music, and 16 subjects during tactile stimulation. Most subjects showed an alpha desynchronization in a lower alpha range followed by a synchronization in an upper frequency range. However, some subjects showed an increase in this area, whereas others showed a decrease only. In addition, many subjects showed reactivity in the beta range. Beta activity was especially increased while listening to activating music and during tactile stimulation in most subjects. We found interindividual differences in the response patterns even though the stimuli provoked comparable subjective emotions (relaxation, activation), and even if the stimulus was the same for all subjects (somatosensory stimulation). We suggest that brain responsivity to music should be examined individually by considering individual characteristics.	t	\N
23009003	Cochlear Implant (CI) is the most advanced method of treatment in receptive type of deafness and profound hearing loss. Good functioning auditory organ plays a significant role not only in auditory perception but also in the process of phonation, giving the possibility of good realization of the process of verbal communication. The aim of the study is to assess the quality of voice and life in adults with pre- and postlingual deafness treated using cochlear implant. Twenty six patients with pre- (group I) and postlingual (group II) deafness deriving no benefit from hearing aids were included into the study. Voice quality was assessed using subjective and objective methods. The endoscopic (VLS) and stroboscopic (VLSS) examination of the larynx and the acoustic voice analysis were carried out. The quality of life was assessed using the Nijmegen Cochlear Implant Questionnaire (NCIQ). Examinations were conducted before implantation and 6 months after activation of speech processor. In the subjective and objective assessment of voice quality the improvement was registered in both groups. The effects were less spectacular in prelingual patients. In this group of patients the subjective assessment of voice quality and stroboscopic examination confirmed the hypofunctional type of dysphonia. In postlingual patients the hyperfunctional type of dysphonia was registered what was confirmed by the analysis of acoustic parameters of voice. The improvement of quality of life was observed in both groups of patients after implantation. The voice quality was improved after implantation in both analyzed groups. In patients with postlingual deafness values of parameters of voice quality assessment were closed to physiological. Results of the subjective assessment of voice quality were confirmed by objective examinations and the acoustic voice analysis. Rehabilitation with cochlear implant gave the opportunity for active participation in private and social life, improving the quality of life in patients with pre- and postlingual deafness.	t	\N
23017812	Auditory impairments in schizophrenia have been demonstrated previously, especially for tasks requiring precise encoding of frequency, although it is unclear the extent to which they have difficulty using pitch information and other cues to segregate sounds. We determined the extent to which those with schizophrenia have difficulty using pitch information and other auditory cues to segregate sounds that are presented sequentially. Ten participants with schizophrenia and nine healthy/normal control participants completed a battery of tasks that tested for the ability to perform sequential auditory stream segregation using pitch, amplitude modulation, or inter-aural phase difference as cues to segregation. All three sequential segregation tasks showed reduced tendency for those with schizophrenia to perceive segregated sounds, compared to control participants. These findings extend prior research by demonstrating a general impairment on sequential sound segregation tasks in schizophrenia, and not just on tasks that require precise encoding of frequency. Together, the pattern of results provide evidence that auditory impairments in schizophrenia result from selective abnormalities in neural circuits that carry out specific computations necessary for stream segregation, as opposed to an impairment in processing specific cues.	t	\N
23025161	We examined whether and how sounds influence visually induced illusory self-motion (vection). Visual stimuli were presented for 40 s. They were made radially, expanding or contracting visual motion field and luminance-defined gratings drifting in a vertical or horizontal direction. Auditory stimuli were presented with the visual stimuli in most conditions; we employed sounds that increased or decreased in intensity, or ascended or descended in frequency. As a result, the sound which increased in intensity facilitated forward vection, and the sound which ascended/descended in frequency facilitated upward/downward vection. The perceptual plausibility of the sound for the corresponding self-motion seemed an important factor of enhancing vection.	t	\N
23025794	Preferences between low delays and phase-frequency responses of behind-the-ear, open-canal hearing aids were investigated with acoustic conditions deemed sensitive to delay effects by normal-hearing listeners. Hearing aids with the following selectable delay and phase response options were fitted at low insertion gain: (1) 1.4 ms delay, minimum phase; (2) 3.4 ms delay, minimum phase; and (3) 3.4 ms delay, linear phase. Blind paired comparisons were made between processing options and between each option and a muted hearing-aid output with two music stimuli. The three alternative forced choice responses were "Slightly prefer", "Prefer", or "Strongly prefer". Twelve hearing-impaired musicians. At the 3.4-ms delay, the minimum-phase response was significantly preferred to the linear-phase response for one music sample and vice-versa for the other sample with a sign test (p < 0.04) but not a Wilcoxon signed rank test that accounted for the low preference strength. Preferences between all other processing conditions were not significant. In acoustic conditions sensitive to delay effects, delays of 1.4 or 3.4 ms were either not detected or no less preferable than no delayed aided signal. It is unclear whether different phase-frequency responses may be preferred with different music stimuli.	t	\N
23027674	To examine the hypothesis that infants receiving a degraded auditory signal have more difficulty segmenting words from fluent speech if familiarized with the words presented in both speech and sign compared to familiarization with the words presented in speech only. Experiment utilizing an infant-controlled visual preference procedure. Twenty 8.5-month-old normal-hearing infants completed testing. Infants were familiarized with repetitions of words in either the speech + sign (n = 10) or the speech only (n = 10) condition. Infants were then presented with four six-sentence passages using an infant-controlled visual preference procedure. Every sentence in two of the passages contained the words presented in the familiarization phase, whereas none of the sentences in the other two passages contained familiar words. Infants exposed to the speech + sign condition looked at familiar word passages for 15.3 seconds and at nonfamiliar word passages for 15.6 seconds, t (9) = -0.130, p = .45. Infants exposed to the speech only condition looked at familiar word passages for 20.9 seconds and to nonfamiliar word passages for 15.9 seconds. This difference was statistically significant, t (9) = 2.076, p = .03. Infants' ability to segment words from degraded speech is negatively affected when these words are initially presented in simultaneous speech and sign. The current study suggests that a decreased ability to segment words from fluent speech may contribute towards the poorer performance of pediatric cochlear implant recipients in total communication settings on a wide range of spoken language outcome measures.	t	\N
23030713	The tip-of-the-tongue (TOT) state is a common experience, usually coupled with a frustrating feeling caused by the incapability of retrieving a familiar word. It is thought that TOTs occur when the semantic and syntactic information of the word is retrieved but not its phonology. This study aims to further understand the role of phonology in TOT resolution. Specifically, using a syllabic pseudohomophone priming paradigm, we aim to analyse the role of the phonological syllabic position (first vs. last) and the number of syllables in TOT states resolution. TOT was elicited by a picture naming task, after which a lexical decision task was presented. Here, first, last, or none of the phonological syllables of the target word were embedded in pseudohomophone primes. Results showed a significant syllabic pseudohomophone priming effect facilitating TOT resolution. The effect was stronger for four-syllable words, especially when the last syllable was used as prime. These results seem to reinforce the importance of phonology in TOT states resolution, particularly the role of the syllable as an important sublexical unit in speech processing.	t	\N
23035086	Humans and other animals with foveate vision make saccadic eye movements to prioritize the visual analysis of behaviorally relevant information. Even before movement onset, visual processing is selectively enhanced at the target of a saccade, presumably gated by brain areas controlling eye movements. Here we assess concurrent changes in visual performance and perceived contrast before saccades, and show that saccade preparation enhances perception rapidly, altering early visual processing in a manner akin to increasing the physical contrast of the visual input. Observers compared orientation and contrast of a test stimulus, appearing briefly before a saccade, to a standard stimulus, presented previously during a fixation period. We found simultaneous progressive enhancement in both orientation discrimination performance and perceived contrast as time approached saccade onset. These effects were robust as early as 60 ms after the eye movement was cued, much faster than the voluntary deployment of covert attention (without eye movements), which takes ∼300 ms. Our results link the dynamics of saccade preparation, visual performance, and subjective experience and show that upcoming eye movements alter visual processing by increasing the signal strength.	t	\N
23041297	Speech prosody conveys information about important aspects of communication: the meaning of the sentence and the emotional state or intention of the speaker. The present study addressed processing of emotional prosodic changes in natural speech stimuli in school-age children (mean age 10 years) by recording the electroencephalogram, facial electromyography, and behavioral responses. The stimulus was a semantically neutral Finnish word uttered with four different emotional connotations: neutral, commanding, sad, and scornful. In the behavioral sound-discrimination task the reaction times were fastest for the commanding stimulus and longest for the scornful stimulus, and faster for the neutral than for the sad stimulus. EEG and EMG responses were measured during non-attentive oddball paradigm. Prosodic changes elicited a negative-going, fronto-centrally distributed neural response peaking at about 500 ms from the onset of the stimulus, followed by a fronto-central positive deflection, peaking at about 740 ms. For the commanding stimulus also a rapid negative deflection peaking at about 290 ms from stimulus onset was elicited. No reliable stimulus type specific rapid facial reactions were found. The results show that prosodic changes in natural speech stimuli activate pre-attentive neural change-detection mechanisms in school-age children. However, the results do not support the suggestion of automaticity of emotion specific facial muscle responses to non-attended emotional speech stimuli in children.	t	\N
23043968	The current study examined auditory processing deficits in dyslexia using a dichotic pitch stimulus and functional MRI. Cortical activation by the dichotic pitch task occurred in bilateral Heschl's gyri, right planum temporale, and right superior temporal sulcus. Adolescents with dyslexia, relative to age-matched controls, illustrated greater activity in left Heschl's gyrus for random noise, less activity in right Heschl's gyrus for all auditory conditions, and less activity in right superior temporal sulcus for a dichotic melody. Subsequent analyses showed that these group differences were attributable to dyslexic readers who performed poorly on the psychophysical task. Furthermore, behavioral performance on phonological reading was correlated to activity from dichotic conditions in right Heschl's gyrus and right superior temporal sulcus. It is postulated that these differences between reader groups is primarily due to a noise exclusion deficit shown previously in dyslexia.	t	\N
23046141	Object-substitution masking (OSM) occurs when a mask, such as four dots that surround a brief target item, onsets simultaneously with the target and offsets a short time after the target, rather than simultaneously with it. OSM is a reduction in accuracy of reporting the target with the temporally trailing mask, compared with the simultaneously offsetting mask. It has been thought that OSM occurs only if attention cannot be rapidly focused, or prefocused, on the target location. One line of evidence for this is a reported interaction between target display set size and the duration of the trailing mask. We analyze the evidence for this interaction and suggest it occurs only as an artifact of data being compressed by a ceiling effect. We report six experiments that support this interpretation by showing that the interaction is always absent unless a ceiling effect is induced. We go on to analyze other evidence to support the notion that attention modulates OSM, and argue that in each case, the data either reflect a ceiling effect or can be explained in another way. Our data and our analyses of the existing literature have strong implications for how OSM should be conceptualized.	t	\N
23046461	It has been repeatedly shown that the auditory N1 is enhanced for sounds presented at an attended time point. The present study investigated the underlying mechanisms using a temporal cuing paradigm. In each trial, an auditory cue indicated at which time point a second sound could be relevant for response selection. Crucially, in addition to temporal attention, two physical sound features with known effects on the sensory N1 were manipulated: location and intensity. Positive evidence for conjoint effects of attention and location or attention and intensity would corroborate the notion that the sensory N1 was modulated by temporal attention, thus supporting a gain mechanism. However, the N1 effect of temporal attention was not similarly lateralized as the sensory N1, and, moreover, it was independent of sound intensity. Thus, the present results do not provide compelling evidence that temporal attention involves an increase in sensory gain.	t	\N
23047261	Acoustically evoked neural and hair cell potentials can be measured from the round window (RW) intraoperatively in the general population of cochlear implant recipients. Cochlear implant performance varies greatly among patients. Improved methods to assess and monitor functional hair cell and neural substrate before and during implantation could potentially aid in enhanced nontraumatic intracochlear electrode placement and subsequent improved outcomes. Subjects (1-80 yr) undergoing cochlear implantation were included. A monopolar probe was placed at the RW after surgical access was obtained. The cochlear microphonic (CM), summating potential (SP), compound action potential (CAP), and auditory nerve neurophonic (ANN) were recorded in response to tone bursts at frequencies of 0.25 to 4 kHz at various levels. Measurable hair cell/neural potentials were detected to 1 or more frequencies in 23 of 25 subjects. The greatest proportion and magnitude of cochlear responses were to low frequencies (<1,000 Hz). At these low frequencies, the ANN, when present, contributed to the ongoing response at the stimulus frequency. In many subjects, the ANN was small or absent, whereas hair cell responses remained. In cochlear implant recipients, acoustically evoked cochlear potentials are detectable even if hearing is extremely limited. Sensitive measures of cochlear and neural status can characterize the state of hair cell and neural function before implantation. Whether this information correlates with speech performance outcomes or can help in tailoring electrode type, placement or audiometric fitting, can be determined in future studies.	t	\N
23049884	Accumulating evidence suggests that basic visual information processing is impaired in schizophrenia. However, deficits in peripheral vision remain largely unexplored. Here we hypothesized that sensory processing of information in the visual periphery would be impaired in schizophrenia patients and, as a result, crowding - the breakdown in target recognition that occurs in cluttered visual environments - would be stronger. Therefore, we assessed visual crowding in the peripheral vision of schizophrenia patients and healthy controls. Subjects were asked to identify a target letter that was surrounded by distracter letters of similar appearance. Targets and distracters were displayed at 8° and 10° of visual angle from the fixation point (eccentricity), and target-distracter spacing was 2°, 3°, 4°, 5°, 6°, 7° or 8° of visual angle. Eccentricity and target-distracter spacing were randomly varied. Accuracy was defined as the proportion of correctly identified targets. Critical spacing was defined as the spacing at which target identification accuracy began to deteriorate, and was assessed at viewing eccentricities of 8° and 10°. Schizophrenia patients were less accurate and showed a larger critical spacing than healthy individuals. These results indicate that crowding is stronger and sensory processing of information in the visual periphery is impaired in schizophrenia. This is in line with previous reports of preferential magnocellular dysfunction in schizophrenia. Thus, deficits in peripheral vision may account for perceptual alterations and contribute to cognitive dysfunction in schizophrenia.	t	\N
23055485	Older adults frequently report they can hear what is said but cannot understand the meaning, especially in noise. This difficulty may arise from the inability to process rapidly changing elements of speech. Aging is accompanied by a general slowing of neural processing and decreased neural inhibition, both of which likely interfere with temporal processing in auditory and other sensory domains. Age-related reductions in inhibitory neurotransmitter levels and delayed neural recovery can contribute to decreases in the temporal precision of the auditory system. Decreased precision may lead to neural timing delays, reductions in neural response magnitude, and a disadvantage in processing the rapid acoustic changes in speech. The auditory brainstem response (ABR), a scalp-recorded electrical potential, is known for its ability to capture precise neural synchrony within subcortical auditory nuclei; therefore, we hypothesized that a loss of temporal precision results in subcortical timing delays and decreases in response consistency and magnitude. To assess this hypothesis, we recorded ABRs to the speech syllable /da/ in normal hearing younger (18-30 years old) and older (60-67 years old) adult humans. Older adults had delayed ABRs, especially in response to the rapidly changing formant transition, and greater response variability. We also found that older adults had decreased phase locking and smaller response magnitudes than younger adults. Together, our results support the theory that older adults have a loss of temporal precision in the subcortical encoding of sound, which may account, at least in part, for their difficulties with speech perception.	t	\N
23057507	There is strong evidence of shared acoustic profiles common to the expression of emotions in music and speech, yet relatively limited understanding of the specific psychoacoustic features involved. This study combined a controlled experiment and computational modelling to investigate the perceptual codes associated with the expression of emotion in the acoustic domain. The empirical stage of the study provided continuous human ratings of emotions perceived in excerpts of film music and natural speech samples. The computational stage created a computer model that retrieves the relevant information from the acoustic stimuli and makes predictions about the emotional expressiveness of speech and music close to the responses of human subjects. We show that a significant part of the listeners' second-by-second reported emotions to music and speech prosody can be predicted from a set of seven psychoacoustic features: loudness, tempo/speech rate, melody/prosody contour, spectral centroid, spectral flux, sharpness, and roughness. The implications of these results are discussed in the context of cross-modal similarities in the communication of emotion in the acoustic domain.	t	\N
23063935	Individuals who have been exposed to two different musical cultures (bimusicals) can be differentiated from those exposed to only one musical culture (monomusicals). Just as bilingual speakers handle the distinct language-syntactic rules of each of two languages, bimusical listeners handle two distinct musical-syntactic rules (e.g., tonal schemas) in each musical culture. This study sought to determine specific brain activities that contribute to differentiating two culture-specific tonal structures. We recorded magnetoencephalogram (MEG) responses of bimusical Japanese nonmusicians and amateur musicians as they monitored unfamiliar Western melodies and unfamiliar, but traditional, Japanese melodies, both of which contained tonal deviants (out-of-key tones). Previous studies with Western monomusicals have shown that tonal deviants elicit an early right anterior negativity (mERAN) originating in the inferior frontal cortex. In the present study, tonal deviants in both Western and Japanese melodies elicited mERANs with characteristics fitted by dipoles around the inferior frontal gyrus in the right hemisphere and the premotor cortex in the left hemisphere. Comparisons of the nature of mERAN activity to Western and Japanese melodies showed differences in the dipoles' locations but not in their peak latency or dipole strength. These results suggest that the differentiation between a tonal structure of one culture and that of another culture correlates with localization differences in brain subregions around the inferior frontal cortex and the premotor cortex.	t	\N
23064383	1) Describe the association between hearing loss and dysfunction of each of the 5 vestibular end-organs--the horizontal, superior, and posterior semicircular canals; saccule; and utricle--in older individuals. 2) Evaluate whether hearing loss and vestibular end-organ deficits share any risk factors. Cross-sectional study. Academic medical center. Fifty-one individuals age 70 years or older. Audiometry, head-thrust dynamic visual acuity (htDVA), sound-evoked cervical vestibular-evoked myogenic potential (cVEMP), and tap-evoked ocular VEMP (oVEMP). Audiometric pure-tone averages (PTA), htDVA LogMAR scores as a measure of semicircular canal function in each canal plane, and cVEMP and oVEMP amplitudes as a measure of saccular and utricular function, respectively. We observed a significant correlation between hearing loss at high frequencies and reduced cVEMP amplitudes (or reduced saccular function; r = -0.37, p < 0.0001) in subjects age 70 years or older. In contrast, hearing loss was not associated with oVEMP amplitudes (or utricular function), or htDVA LogMAR scores (or semicircular canal function) in any of the canal planes. Age and noise exposure were significantly associated with measures of both cochlear and saccular dysfunction. The concomitant decline in the cochlear and saccular function associated with aging may reflect their common embryologic origin in the pars inferior of the labyrinth. Noise exposure seems to be related to both saccular and cochlear dysfunction. These findings suggest a potential benefit of screening individuals with presbycusis-particularly those with significant noise exposure history-for saccular dysfunction, which may contribute to fall risk in the elderly.	t	\N
23067726	The ability to control external events through our own actions is a fundamental aspect of human experience. Both the subjective experience of agency, and its neural correlates, remain poorly understood. Previous studies show that the angular gyrus is activated when participants explicitly judge that they lack agency. In contrast, the positive sense of agency over external events is associated with distortions of time perception. Here, we show that the perceived interval between actions and a subsequent tone is shorter than the perceived interval between a physically comparable passive movement and a tone, replicating the 'intentional binding' effect reported previously. We considered this as a potential implicit marker of agency, and investigated its neural basis, by using parametric analyses to identify brain areas whose activation correlated more strongly with the perceived action-tone interval in the action condition, than in the passive condition. Small volume corrections were used to test specific hypotheses about the contribution of the angular gyrus, and of the supplementary motor area (SMA), based on previous literature. We found no correlation between angular gyrus and our temporal measure of sense of agency. In contrast, we found that a lateral, caudal region within the SMA proper was more strongly associated with the perceived action-tone interval than with perception of a control interval following a passive movement. We suggest that the supplementary motor complex contributes to the subjective experience of temporal flow that accompanies goal-directed voluntary actions.	t	\N
23076114	Following prolonged exposure to adaptor sounds moving in a single direction, participants may perceive stationary-probe sounds as moving in the opposite direction [direction-selective auditory motion aftereffect (aMAE)] and be less sensitive to motion of any probe sounds that are actually moving (motion-sensitive aMAE). The neural mechanisms of aMAEs, and notably whether they are due to adaptation of direction-selective motion detectors, as found in vision, is presently unknown and would provide critical insight into auditory motion processing. We measured human behavioral responses and auditory evoked potentials to probe sounds following four types of moving-adaptor sounds: leftward and rightward unidirectional, bidirectional, and stationary. Behavioral data replicated both direction-selective and motion-sensitive aMAEs. Electrical neuroimaging analyses of auditory evoked potentials to stationary probes revealed no significant difference in either global field power (GFP) or scalp topography between leftward and rightward conditions, suggesting that aMAEs are not based on adaptation of direction-selective motion detectors. By contrast, the bidirectional and stationary conditions differed significantly in the stationary-probe GFP at 200 ms poststimulus onset without concomitant topographic modulation, indicative of a difference in the response strength between statistically indistinguishable intracranial generators. The magnitude of this GFP difference was positively correlated with the magnitude of the motion-sensitive aMAE, supporting the functional relevance of the neurophysiological measures. Electrical source estimations revealed that the GFP difference followed from a modulation of activity in predominantly right hemisphere frontal-temporal-parietal brain regions previously implicated in auditory motion processing. Our collective results suggest that auditory motion processing relies on motion-sensitive, but, in contrast to vision, non-direction-selective mechanisms.	t	\N
23088548	This study investigates how newly learned words are integrated into the first-language lexicon using masked priming. Two lexical decision experiments are reported, with the aim of establishing whether newly learned words behave like real words in a masked form priming experiment. If they do, they should show a prime lexicality effect (PLE), in which less priming is obtained due to form similarity when the prime is a word. In the first experiment, subjects were taught the meanings of novel words that were neighbors of real words, but no PLE was observed; that is, equally strong form priming was obtained for both trained and untrained novel primes. In the second experiment, 4 training sessions were spread over 4 weeks, and under these conditions, a clear PLE was obtained in the final session. It is concluded that lexicalization requires multiple training sessions. Possible explanations of the PLE are discussed.	t	\N
23088832	The neuroscience of tinnitus represents an ideal model to explore central issues in brain functioning such as the formation of auditory percepts, in addition to opening up new treatment avenues for the condition in the long-term. The present review discusses the origin and nature of tinnitus-related neural activity. First, we review evidence for the hypothesis that tinnitus is caused by the central nervous system changes induced by sensory deprivation, even when hearing loss is not visible in the audiogram. Second, we suggest that changes in neural activity in individual central structures may not be sufficient to underlie the tinnitus percept. Instead, we propose that tinnitus may arise from functional alterations at multiple levels which promote abnormal propagation of neural activity throughout the network involved in auditory perception. In this context, functional coupling within and between central auditory structures may be especially important to consider. Investigating how sensory deprivation affects functional coupling between areas, which might be reflected in changes in temporal coherence of intrinsic ongoing activity patterns, may give critical insights into the mechanisms of tinnitus.	t	\N
23095305	To update a 15-year-old study of 800 postlinguistically deaf adult patients showing how duration of severe to profound hearing loss, age at cochlear implantation (CI), age at onset of severe to profound hearing loss, etiology and CI experience affected CI outcome. Retrospective multicenter study. Data from 2251 adult patients implanted since 2003 in 15 international centers were collected and speech scores in quiet were converted to percentile ranks to remove differences between centers. The negative effect of long duration of severe to profound hearing loss was less important in the new data than in 1996; the effects of age at CI and age at onset of severe to profound hearing loss were delayed until older ages; etiology had a smaller effect, and the effect of CI experience was greater with a steeper learning curve. Patients with longer durations of severe to profound hearing loss were less likely to improve with CI experience than patients with shorter duration of severe to profound hearing loss. The factors that were relevant in 1996 were still relevant in 2011, although their relative importance had changed. Relaxed patient selection criteria, improved clinical management of hearing loss, modifications of surgical practice, and improved devices may explain the differences.	t	\N
23102807	Age-related hearing impairment (ARHI), or presbycusis, is a common condition of the elderly that results in significant communication difficulties in daily life. Clinically, it has been defined as a progressive loss of sensitivity to sound, starting at the high frequencies, inability to understand speech, lengthening of the minimum discernable temporal gap in sounds, and a decrease in the ability to filter out background noise. The causes of presbycusis are likely a combination of environmental and genetic factors. Previous research into the genetics of presbycusis has focused solely on hearing as measured by pure-tone thresholds. A few loci have been identified, based on a best ear pure-tone average phenotype, as having a likely role in susceptibility to this type of hearing loss; and GRM7 is the only gene that has achieved genome-wide significance. We examined the association of GRM7 variants identified from the previous study, which used an European cohort with Z-scores based on pure-tone thresholds, in a European-American population from Rochester, NY (N = 687), and used novel phenotypes of presbycusis. In the present study mixed modeling analyses were used to explore the relationship of GRM7 haplotype and SNP genotypes with various measures of auditory perception. Here we show that GRM7 alleles are associated primarily with peripheral measures of hearing loss, and particularly with speech detection in older adults.	t	\N
23109085	People often fail to detect changes between successively presented tactile patterns, a phenomenon known as tactile change blindness. In this study, we investigated whether changes introduced to tactile patterns are detected better when a participant's attention is focused on the location where the change occurs. Across two experiments, participants (N = 55) were instructed to detect changes between two consecutively presented tactile patterns. In half of the trials, the stimulated body sites in the two patterns were identical. In the other half of the trials, one of the stimulated body locations differed between the two patterns. Endogenous (or voluntary) attention was manipulated by instructing participants which new bodily location was most likely to be stimulated. We found that changes at the attended location were detected more accurately than changes at bodily locations that were unattended. This finding demonstrates that attention can effectively modulate tactile change detection. We discuss the value of this experimental paradigm for investigating excessive attentional focus or hypervigilance to particular regions of the body in various clinical populations.	t	\N
23116413	Aiming to further our understanding of fundamental mechanisms of auditory working memory (WM), the present study compared performance for three auditory materials (words, tones, timbres). In a forward recognition task (Experiment 1) participants indicated whether the order of the items in the second sequence was the same as in the first sequence. In a backward recognition task (Experiment 2) participants indicated whether the items of the second sequence were played in the correct backward order. In Experiment 3 participants performed an articulatory suppression task during the retention delay of the backward task. To investigate potential length effects the number of items per sequence was manipulated. Overall findings underline the benefit of a cross-material experimental approach and suggest that human auditory WM is not a unitary system. Whereas WM processes for timbres differed from those for tones and words, similarities and differences were observed for words and tones: Both types of stimuli appear to rely on rehearsal mechanisms, but might differ in the involved sensorimotor codes.	t	\N
23117535	Noise sensitivity is considered to be a self-perceived indicator of vulnerability to stressors in general and not noise alone. Multiple chemical sensitivity (MCS) has to some extent been accompanied by noise sensitivity, indicating a moderate correspondence between them. The aim of this study is to investigate if the Weinstein's Noise Sensitivity Scale and Quick Environmental Exposure and Sensitivity Inventory's (QEESI) Chemical Intolerance Subscale can differentiate noise sensitivity and MCS as different entities, and if there are overlaps in the characteristics of noise sensitivity and MCS. In 2002, 327 individuals (166 men, 161 women; age range 45 - 66 years) from the Finnish Twin Cohort answered a questionnaire on noise-related and MCS items. Somatic, psychological, and lifestyle factors were obtained through earlier questionnaires for the same individuals. Both confirmatory and exploratory factor analyses (CFA and EFA) of the questionnaire items on the Weinstein's Noise Sensitivity Scale and QEESI's Chemical Intolerance Subscale indicated the presence of three factors - Noise Sensitivity, Chemical Sensitivity, and Ability to Concentrate factors - arising from the forming of two factors from the items of the Weinstein's scale. In the regression analyses, among all subjects, the Noise Sensitivity Factor was associated with neuroticism and smoking, and the Chemical Sensitivity Factor was associated with allergies and alcohol use. The study indicates that the Weinstein's Noise Sensitivity Scale and QEESI's Chemical Intolerance Subscale differentiate noise sensitivity and MCS as different entities.	t	\N
23122629	To evaluate through a multidimensional protocol voice changes after voice therapy in patients with benign vocal fold lesions. 65 consecutive patients affected by benign vocal fold lesions were enrolled. Depending on videolaryngostroboscopy the patients were divided into 3 groups: 23 patients with Reinke's oedema, 22 patients with vocal fold cysts and 20 patients with gelatinous polyp. Each subject received 10 voice therapy sessions and was evaluated, before and after voice therapy, through a multidimensional protocol including videolaryngostroboscopy, perception, acoustics, aerodynamics and self-rating by the patient. Data were compared using Wilcoxon signed-rank test. Kruskal-Wallis test was used to analyse the mean variation difference between the three groups of patients. Mann-Whitney test was used for post hoc analysis. Only in 11 cases videolaryngostroboscopy revealed an improvement of the initial pathology. However a significant improvement was observed in perceptual, acoustic and self-assessment ratings in the 3 groups of patients. In particular the parameters of G, R and A of the GIRBAS scale, and the noise to harmonic ratio, Jitter and shimmer scores improved after rehabilitation. A significant improvement of all the parameters of Voice Handicap Index after rehabilitation treatment was found. No significant difference among the three groups of patients was visible, except for self-assessment ratings. Voice therapy may provide a significant improvement in perceptual, acoustic and self-assessed voice quality in patients with benign glottal lesions. Utilization of voice therapy may allow some patients to avoid surgical intervention.	t	\N
23128585	Both the Stroop and the Simon paradigms are often used in research on cognitive control, however, there is evidence that dissociable control processes are involved in these tasks: While conflicts in the Stroop task may be resolved mainly by enhanced task-relevant stimulus processing, conflicts in the Simon task may be resolved rather by suppressing the influence of task-irrelevant information on response selection. In the present study, we show that these control mechanisms interact in different ways with the presentation of accessory stimuli. Accessory stimuli do not affect cognitive control in the Simon task, but they impair the efficiency of cross-trial control processes in the Stroop task. Our findings underline the importance of differentiating between different types of conflicts and mechanisms of cognitive control.	t	\N
23135617	The purpose of this study was to examine differences between older and younger listeners in the ability to sequentially attend to and ignore words. Participants (n = 13 older adults and 13 younger adults) completed a temporally interleaved word recognition task. On each trial, 10 words were presented, and participants were instructed to repeat back every other word while ignoring the intervening words. Three variables were examined: (1) whether the word strings that were to be attended and to be ignored created syntactically correct sentences; (2) whether the to-be-attended and to-be-ignored words were presented from the same or from different spatial locations; and (3) whether the five target words in each trial (and the five distractor words in each trial) were spoken by a single talker or by five different talkers. In addition, digit-span forward and digit-span backward were measured and used as variables in correlation analyses. As a group, the younger participants outperformed the older listeners, particularly when the to-be-attended and to-be-ignored words were presented from the same spatial location (versus when they were presented with spatial separation). Compared with the younger participants, older listeners also made more error responses that were to-be-ignored words, although the proportion of errors that were not responses involving masking words did not significantly differ between groups. Scores on the digit-span-forward test (but not digit-span-backward scores or the degree of hearing loss) were associated with older individuals' performance on this temporally interleaved speech-recognition task. The overall pattern of results suggests that factors other than threshold elevation contribute to speech-understanding problems experienced by older listeners. However, although younger adults outperformed older listeners on this interleaved sentence task, older and younger adults benefited, to a similar extent, from spatial separation of the to-be-attended and to-be-ignored words, and from having a consistent target talker within a trial.	t	\N
23138762	Posterior cortical atrophy (PCA) is typically considered to be a visual syndrome, primarily characterised by progressive impairment of visuoperceptual and visuospatial skills. However, patients commonly describe early difficulties with word retrieval. This paper details the first systematic analysis of linguistic function in PCA. Characterising and quantifying the aphasia associated with PCA is important for clarifying diagnostic and selection criteria for clinical and research studies. 15 patients with PCA, seven patients with logopenic/phonological aphasia (LPA) and 18 age matched healthy participants completed a detailed battery of linguistic tests evaluating auditory input processing, repetition and working memory, lexical and grammatical comprehension, single word retrieval and fluency, and spontaneous speech. Relative to healthy controls, PCA patients exhibited language impairments across all of the domains examined, but with anomia, reduced phonemic fluency and slowed speech rate the most prominent deficits. PCA performance most closely resembled that of LPA patients on tests of auditory input processing, repetition and digit span, but was relatively stronger on tasks of comprehension and spontaneous speech. The study demonstrates that in addition to the well reported degradation of vision, literacy and numeracy, PCA is characterised by progressive oral language dysfunction with prominent word retrieval difficulties. Overlap in the linguistic profiles of PCA and LPA, which are both most commonly caused by Alzheimer's disease, further emphasises the notion of a phenotypic continuum between typical and atypical manifestations of the disease. Clarifying the boundaries between Alzheimer's disease phenotypes has important implications for diagnosis, clinical trial recruitment and investigations into biological factors driving phenotypic heterogeneity in Alzheimer's disease. Rehabilitation strategies to ameliorate the phonological deficit in PCA are required.	t	\N
23143506	This retrospective study aimed to evaluate the efficacy of the Esteem(®) middle ear implant in sensorineural hearing loss (SNHL) of different degree as well as to compare it with that obtained with conventional hearing aids. Fifteen out of 30 adults patients who received an Esteem(®) middle ear device for rehabilitation of sensorineural hearing loss met the primary eligibility criterion of prior, continuous use of conventional hearing aids. Study population included moderate-to-severe SNHL (8 patients) and severe-to-profound SNHL (7 patients). Audiometric measurements included free-field pure-tone and speech audiometry in Esteem(®)-aided, HA-aided, and baseline threshold. For speech audiometry, speech reception threshold (SRT) and word recognition score (WRS) were assessed. Subjective benefit was evaluated by Client Oriented Scale of Improvement (COSI) questionnaire. In all the subjects, SRT and WRS showed improvement both with conventional HA and Esteem(®) in respect to the unaided situation. Although not statistically significant, a slight prevalence of the Esteem(®) performances was recorded both audiometrically and as subjective satisfaction score. The Esteem(®) middle ear device demonstrated appreciable benefit for rehabilitation of SNHL of different degree, comparable to what can be achieved by conventional hearing aids. In addition, this rehabilitative process may enable also individuals presenting with severe-to-profound SNHL to achieve remarkable functional outcomes.	t	\N
23145620	A modulation-based index is proposed for predicting speech intelligibility by cochlear implant (CI) listeners. The input to the proposed index are speech envelopes extracted using the individual CI user's daily strategy, and as such, this approach incorporates information about the number of active electrodes, shape of the compression function and electrical dynamic range. High correlation (r = 0.96) was achieved with the proposed index when evaluated with speech-reception thresholds (SRTs) obtained by CI users in steady and speech-masker conditions. This outcome suggests that the information contained in electrodograms seems to be sufficient for reliably predicting CI user's performance in noise. The proposed index can be used by clinicians to optimize the selection of fitting parameters of individual CI users for better performance in noise.	t	\N
23150094	To report the results of cochlear implantation via the middle fossa approach in 4 patients, discuss the complications, and present a detailed description of the programming specifications in these cases. Retrospective case review. Tertiary-care referral center with a well-established cochlear implant program. Four patients with bilateral canal wall down mastoid cavities who underwent the middle fossa approach for cochlear implantation. Cochlear implantation and subsequent rehabilitation. A middle fossa approach with cochleostomy was successfully performed on the most superficial part of the apical turn in 4 patients. A Nucleus 24 cochlear implant system was used in 3 patients and a MED-EL Sonata Medium device in 1 patient. The single electrode array was inserted through a cochleostomy from the cochlear apex and occupied the apical, middle, and basal turns. Telemetry and intraoperative impedance recordings were performed at the end of surgery. A CT scan of the temporal bones was performed to document electrode insertion for all of the patients. Complications, hearing thresholds, and speech perception outcomes were evaluated. Neural response telemetry showed present responses in all but 1 patient, who demonstrated facial nerve stimulation during the test. Open-set speech perception varied from 30% to 100%, despite the frequency allocation order of the MAP. Cochlear implantation via the middle cranial fossa is a safe approach, although it is a challenging procedure, even for experienced surgeons.	t	\N
23152235	This is an update of a Cochrane Review first published in The Cochrane Library in Issue 12, 2010.Tinnitus is described as the perception of sound or noise in the absence of real acoustic stimulation. Numerous management strategies have been tried for this potentially debilitating, heterogeneous symptom. External noise has been used as a management tool for tinnitus, in different capacities and with different philosophical intent, for over a century. To assess the effectiveness of sound-creating devices (including hearing aids) in the management of tinnitus in adults. Primary outcome measures were changes in the loudness or severity of tinnitus and/or impact on quality of life. Secondary outcome measures were change in pure-tone auditory thresholds and adverse effects of treatment. We searched the Cochrane ENT Group Trials Register; CENTRAL; PubMed; EMBASE; CINAHL; Web of Science; BIOSIS Previews; Cambridge Scientific Abstracts; ICTRP and additional sources for published and unpublished trials. The date of the most recent search was 8 February 2012. Prospective randomised controlled trials recruiting adults with persistent, distressing, subjective tinnitus of any aetiology in which the management strategy included maskers, noise-generating device and/or hearing aids, used either as the sole management tool or in combination with other strategies, including counselling. Two authors independently examined the 387 search results to identify studies for inclusion in the review, of which 33 were potentially relevant. The update searches in 2012 retrieved no further potentially relevant studies. Both authors extracted data independently. Six trials (553 participants) are included in this review. Studies were varied in design, with significant heterogeneity in the evaluation of subjective tinnitus perception, with different scores, scales, tests and questionnaires as well as variance in the outcome measures used to assess the improvement in tinnitus sensation/quality of life. This precluded meta-analysis of the data. There was no long-term follow-up. We assessed the risk of bias as medium in three and high in three studies. Following analysis of the data, no significant change was seen in the loudness of tinnitus or the overall severity of tinnitus following the use of sound therapy compared to other interventions such as patient education, 'relaxation techniques', 'tinnitus coping strategies', counselling, 'tinnitus retraining' and exposure to environmental sounds. No side effects or significant morbidity were reported from the use of sound-creating devices. The limited data from the included studies failed to show strong evidence of the efficacy of sound therapy in tinnitus management. The absence of conclusive evidence should not be interpreted as evidence of lack of effectiveness. The lack of quality research in this area, in addition to the common use of combined approaches (hearing therapy plus counselling) in the management of tinnitus are, in part, responsible for the lack of conclusive evidence. Other combined forms of management, such as tinnitus retraining therapy, have been subject to a Cochrane Review. Optimal management may involve multiple strategies.	t	\N
23155729	Auditory stimuli are known to improve visual target recognition and detection when both are presented in the same spatial location. However, most studies have focused on crossmodal spatial congruency along the horizontal plane and the effects of audio-visual spatial congruency in depth (i.e., along the depth axis) are relatively less well understood. In the following experiments we presented a visual (face) or auditory (voice) target stimulus in a location on a spatial array which was either spatially congruent or incongruent in depth (i.e., positioned directly in front or behind) with a crossmodal stimulus. The participant's task was to determine whether a visual (experiments 1 and 3) or auditory (experiment 2) target was located in the foreground or background of this array. We found that both visual and auditory targets were less accurately located when crossmodal stimuli were presented from different, compared to congruent, locations in depth. Moreover, this effect was particularly found for visual targets located in the periphery, although spatial incongruency affected the location of auditory targets across both locations. The relative distance of the array to the observer did not seem to modulate this congruency effect (experiment 3). Our results add to the growing evidence for multisensory influences on search performance and extend these findings to the localisation of targets in the depth plane.	t	\N
23160796	Small songbirds have a difficult analysis problem: their head is small compared to the wavelengths of sounds used for communication providing only small interaural time and level differences. Klump and Larsen (1992) measured the physical binaural cues in the European starling (Sturnus vulgaris) that allow the comparison of acoustical cues and perception. We determined the starling's minimum audible angle (MAA) in an operant Go/NoGo procedure for different spectral and temporal stimulus conditions. The MAA for broadband noise with closed-loop localization reached 17°, while the starling's MAA for open-loop localization of broadband noise reached 29°. No substantial difference between open-loop and closed-loop localization was found in 2 kHz pure tones. The closed-loop MAA improved from 26° to 19° with an increase in pure tone frequency from 1 to 4 kHz. This finding is in line with the physical cues available. While the starlings can only make use of interaural time difference cues at lower frequencies (e.g., 1 and 2 kHz), additional interaural level difference cues become available at higher frequencies (e.g., 4 kHz or higher, Klump and Larsen 1992). An improvement of the starling's MAA with an increasing number of standard stimulus presentations prior to the test stimulus has important implications for determining relative (MAA) localization thresholds.	t	\N
23163413	Music performance requires control of two sequential structures: the ordering of pitches and the temporal intervals between successive pitches. Whether pitch and temporal structures are processed as separate or integrated features remains unclear. A repetition suppression paradigm compared neural and behavioral correlates of mapping pitch sequences and temporal sequences to motor movements in music performance. Fourteen pianists listened to and performed novel melodies on an MR-compatible piano keyboard during fMRI scanning. The pitch or temporal patterns in the melodies either changed or repeated (remained the same) across consecutive trials. We expected decreased neural response to the patterns (pitch or temporal) that repeated across trials relative to patterns that changed. Pitch and temporal accuracy were high, and pitch accuracy improved when either pitch or temporal sequences repeated over trials. Repetition of either pitch or temporal sequences was associated with linear BOLD decrease in frontal-parietal brain regions including dorsal and ventral premotor cortex, pre-SMA, and superior parietal cortex. Pitch sequence repetition (in contrast to temporal sequence repetition) was associated with linear BOLD decrease in the intraparietal sulcus (IPS) while pianists listened to melodies they were about to perform. Decreased BOLD response in IPS also predicted increase in pitch accuracy only when pitch sequences repeated. Thus, behavioral performance and neural response in sensorimotor mapping networks were sensitive to both pitch and temporal structure, suggesting that pitch and temporal structure are largely integrated in auditory-motor transformations. IPS may be involved in transforming pitch sequences into spatial coordinates for accurate piano performance.	t	\N
23169193	An estimated 36 million US citizens have impaired hearing, but nearly half of them have never had a hearing test. As noted by a recent National Institutes of Health/National Institute on Deafness and Other Communication Disorders (NIH/NIDCD) Working Group, "In the United States (in contrast to many other nations) there are no readily accessible low cost hearing screening programs…" (Donahue et al, 2010, p. 2). Since 2004, telephone administered screening tests utilizing three-digit sequences presented in noise have been developed, validated, and implemented in seven countries. Each of these tests has been based on a test protocol conceived by Smits and colleagues in The Netherlands. Investigators from Communication Disorders Technology, Inc., Indiana University, and VU University Medical Center of Amsterdam agreed to collaborate in the development and validation of a screening test for hearing impairment suitable for delivery over the telephone, for use in the United States. This test, utilizing spoken three-digit sequences (triplets), was to be based on the design of Smits and his colleagues. A version of the digits-in-noise test was developed utilizing digit triplets spoken in Middle American dialect. The stimuli were individually adjusted to speech-to-noise ratio (SNR) values yielding 50% correct identification, on the basis of data collected from a group of 10 young adult listeners with normal hearing. A final set of 64 homogeneous stimuli were selected from an original 160 recorded triplets. Each test consisted of a series of 40 triplets drawn at random, presented in a noise background. The SNR threshold for 50% correct identification of the triplets was determined by a one-down, one-up adaptive procedure. The test was implemented by telephone, and administered to listeners with varying levels of hearing impairment. The listeners were then evaluated with pure-tone tests and other audiometric measures as clinically appropriate. Ninety participants included 72 who were volunteers from the regular client population at the Indiana University Hearing Clinic, and 18 who were recruited with a newspaper ad offering a free hearing test. Of the 90 participants, 49 were later determined to have mean pure-tone thresholds greater than 20 dB hearing level (HL). The primary data analyses were correlations between telephone test thresholds and other measures, including pure-tone thresholds and speech recognition tests, collected for the same participants. The correlation between the telephone test and pure-tone thresholds (r = 0.74) was within the range of correlations observed with successful telephone screening tests in use in other countries. Thresholds based on the average of only 21 trials (trials five through 25 of the 40-trial tracking history) yielded sensitivity and specificity values of 0.80 and 0.83, respectively, using pure-tone average((0.5, 1.0, 2.0 kHz)) >20 dB HL as the criterion measure. This US version of the digits-in-noise telephone screening test is sufficiently valid to be implemented for use by the general public. Its properties are quite similar to those telephone screening tests currently in use in most European countries. Telephone tests provide efficient, easy to use, and valid screening for functional hearing impairment. The results of this test are a reasonable basis for advising those who fail to seek a comprehensive hearing evaluation by an audiologist.	t	\N
23169195	Speech recognition in noise testing has been conducted at least since the 1940s (Dickson et al, 1946). The ability to recognize speech in noise is a distinct function of the auditory system (Plomp, 1978). According to Kochkin (2002), difficulty recognizing speech in noise is the primary complaint of hearing aid users. However, speech recognition in noise testing has not found widespread use in the field of audiology (Mueller, 2003; Strom, 2003; Tannenbaum and Rosenfeld, 1996). The audiogram has been used as the "gold standard" for hearing ability. However, the audiogram is a poor indicator of speech recognition in noise ability. This study investigates the relationship between pure-tone thresholds, the articulation index, and the ability to recognize speech in quiet and in noise. Pure-tone thresholds were measured for audiometric frequencies 250-6000 Hz. Pure-tone threshold groups were created. These included a normal threshold group and slight, mild, severe, and profound high-frequency pure-tone threshold groups. Speech recognition thresholds in quiet and in noise were obtained using the Hearing in Noise Test (HINT) (Nilsson et al, 1994; Vermiglio, 2008). The articulation index was determined by using Pavlovic's method with pure-tone thresholds (Pavlovic, 1989, 1991). Two hundred seventy-eight participants were tested. All participants were native speakers of American English. Sixty-three of the original participants were removed in order to create groups of participants with normal low-frequency pure-tone thresholds and relatively symmetrical high-frequency pure-tone threshold groups. The final set of 215 participants had a mean age of 33 yr with a range of 17-59 yr. Pure-tone threshold data were collected using the Hughson-Weslake procedure. Speech recognition data were collected using a Windows-based HINT software system. Statistical analyses were conducted using descriptive, correlational, and multivariate analysis of covariance (MANCOVA) statistics. The MANCOVA analysis (where the effect of age was statistically removed) indicated that there were no significant differences in HINT performances between groups of participants with normal audiograms and those groups with slight, mild, moderate, or severe high-frequency hearing losses. With all of the data combined across groups, correlational analyses revealed significant correlations between pure-tone averages and speech recognition in quiet performance. Nonsignificant or significant but weak correlations were found between pure-tone averages and HINT thresholds. The ability to recognize speech in steady-state noise cannot be predicted from the audiogram. A new classification scheme of hearing impairment based on the audiogram and the speech reception in noise thresholds, as measured with the HINT, may be useful for the characterization of the hearing ability in the global sense. This classification scheme is consistent with Plomp's two aspects of hearing ability (Plomp, 1978).	t	\N
23173635	A 42-year-old man suffered damage to the left supra-sylvian areas due to a stroke and presented with verbal short-term memory (STM) deficits. He occasionally could not recall even a single syllable that he had heard one second before. A study of mismatch negativity using magnetoencephalography suggested that the duration of auditory sensory (echoic) memory traces was reduced on the affected side of the brain. His maximum digit span was four with auditory presentation (equivalent to the 1st percentile for normal subjects), whereas it was up to six with visual presentation (almost within the normal range). He simply showed partial recall in the digit span task, and there was no self correction or incorrect reproduction. From these findings, reduced echoic memory was thought to have affected his verbal short-term retention. Thus, the impairment of verbal short-term memory observed in this patient was "pure auditory" unlike previously reported patients with deficits of the phonological short-term store (STS), which is the next higher-order memory system. We report this case to present physiological and behavioral data suggesting impaired short-term storage of verbal information, and to demonstrate the influence of deterioration of echoic memory on verbal STM.	t	\N
23177349	This research investigates how early learning about native language sound structure affects how infants associate sounds with meanings during word learning. Infants (19-month-olds) were presented with bisyllabic labels with high or low phonotactic probability (i.e., sequences of frequent or infrequent phonemes in English). The labels were produced with the predominant English trochaic (strong/weak) stress pattern or the less common iambic (weak/strong) pattern. Using the habituation-based Switch Task to test label learning, we found that infants readily learned high probability trochaic labels. However, they failed to learn low probability labels, regardless of stress, and failed to learn iambic labels, regardless of phonotactics. Thus, infants required support from both common phoneme sequences and a common stress pattern to map the labels to objects. These findings demonstrate that early word learning is shaped by prior knowledge of native language phonological regularities and provide support for the role of statistical learning in language acquisition.	t	\N
23177756	A plethora of investigations have studied the acoustic characteristics of vibrato such as the rate, extent, onset (time from initiation of phonation until the first peak of vibrato), and periodicity. Despite extensive research, the degree to which various parameters of vibrato contribute to its acceptability remain unclear. The present study sought to determine the psychoacoustical relationship of mean fundamental frequency (f(0)), modulation frequency (f(f0m)), modulation depth (d(f0m)), and intensity to the appropriateness or inappropriateness of vibrato. Phonation samples of eight voice majors singing at low, middle, and high pitches were obtained. A high fidelity vocoder (STRAIGHT; Kawahara, 1997) was used to resynthesize these vowels with systematic manipulations of f(f0m) and d(f0m) of the f(0) contours resulting in a total of 600 stimuli (8 singers×3 pitches×5 f(f0m) levels×5 d(f0m) levels). Nine listeners (four experts and five students) evaluated these stimuli for appropriateness of vibrato at two different presentation levels (70 and 90 dB sound pressure level). Statistical analyses of the perceptual data suggest that appropriateness of vibrato tends to increase with mean f(0) and decrease with d(f0m.) Appropriateness of vibrato is greatest for f(f0m) value of 6 Hz, but decreases both above and below this frequency. perceived appropriateness of vibrato results from an interaction of mean f(0), f(f0m), and d(f0m) of the vowel waveform.	t	\N
23202431	The investigation of brain activity using naturalistic, ecologically-valid stimuli is becoming an important challenge for neuroscience research. Several approaches have been proposed, primarily relying on data-driven methods (e.g. independent component analysis, ICA). However, data-driven methods often require some post-hoc interpretation of the imaging results to draw inferences about the underlying sensory, motor or cognitive functions. Here, we propose using a biologically-plausible computational model to extract (multi-)sensory stimulus statistics that can be used for standard hypothesis-driven analyses (general linear model, GLM). We ran two separate fMRI experiments, which both involved subjects watching an episode of a TV-series. In Exp 1, we manipulated the presentation by switching on-and-off color, motion and/or sound at variable intervals, whereas in Exp 2, the video was played in the original version, with all the consequent continuous changes of the different sensory features intact. Both for vision and audition, we extracted stimulus statistics corresponding to spatial and temporal discontinuities of low-level features, as well as a combined measure related to the overall stimulus saliency. Results showed that activity in occipital visual cortex and the superior temporal auditory cortex co-varied with changes of low-level features. Visual saliency was found to further boost activity in extra-striate visual cortex plus posterior parietal cortex, while auditory saliency was found to enhance activity in the superior temporal cortex. Data-driven ICA analyses of the same datasets also identified "sensory" networks comprising visual and auditory areas, but without providing specific information about the possible underlying processes, e.g., these processes could relate to modality, stimulus features and/or saliency. We conclude that the combination of computational modeling and GLM enables the tracking of the impact of bottom-up signals on brain activity during viewing of complex and dynamic multisensory stimuli, beyond the capability of purely data-driven approaches.	t	\N
23205712	To estimate the prevalence of severe and profound hearing loss in a clinical population and to report the audiological and hearing-aid characteristics for this group, as well as outcome measures from use of hearing aids. A retrospective observational study initially, followed by a postal Glasgow health status inventory (GHSI) to establish the patients functional outcomes. A clinical database of 32 781 cases was interrogated from which 2199 cases of severe /profound hearing loss were identified. From these, an adult sample stratified in terms of age and gender of n = 302 was contacted. An estimated 6.7% of the local clinical population and 0.7% of the general population were found to have hearing > 70 dB averaged over 0.5, 1, and 2 kHz. Most patients were fitted with bilateral hearing aids, using a non-linear prescription, and as a group they reported a high level of social support. This study has estimated the prevalence of severe and profound hearing loss as 6.7% of the clinical population, and 0.7% of the general population. This is consistent with previous work, although it probably underestimates the prevalence. Further work is indicated to strengthen the estimate.	t	\N
23207574	Synchronizing movements with auditory beats, compared to visual flashes, yields divergent activation in timing-related brain areas as well as more stable tapping synchronization. The differences in timing-related brain activation could reflect differences in tapping synchronization stability, rather than differences between modality (i.e., audio-motor vs. visuo-motor integration). In the current fMRI study, participants synchronized their finger taps with four types of visual and auditory pacing sequences: flashes and a moving bar, as well as beeps and a frequency-modulated 'siren'. Behavioral tapping results showed that visuo-motor synchronization improved with moving targets, whereas audio-motor synchronization degraded with frequency-modulated sirens. Consequently, a modality difference in synchronization occurred between the discrete beeps and flashes, but not between the novel continuous siren and moving bar. Imaging results showed that activation in the putamen, a key timing area, paralleled the behavioral results: putamen activation was highest for beeps, intermediate for the continuous siren and moving bar, and was lowest for the flashes. Putamen activation differed between modalities for beeps and flashes, but not for the novel moving bar and siren. By dissociating synchronization performance from modality, we show that activation in the basal ganglia is associated with sensorimotor synchronization stability rather than modality-specificity in this task. Synchronization stability is apparently contingent upon the modality's processing affinity: discrete auditory and moving visual signals are modality appropriate, and can be encoded reliably for integration with the motor system.	t	\N
23216089	Results for speech recognition in noise tests when using a new research coding strategy designed to introduce the virtual channel effect provided no advantage over MP3(000™). Although statistically significant smaller just noticeable differences (JNDs) were obtained, the findings for pitch ranking proved to have little clinical impact. The aim of this study was to explore whether modifications to MP3000 by including sequential virtual channel stimulation would lead to further improvements in hearing, particularly for speech recognition in background noise and in competing-talker conditions, and to compare results for pitch perception and melody recognition, as well as informally collect subjective impressions on strategy preference. Nine experienced cochlear implant subjects were recruited for the prospective study. Two variants of the experimental strategy were compared to MP3000. The study design was a single-blinded ABCCBA cross-over trial paradigm with 3 weeks of take-home experience for each user condition. Comparing results of pitch-ranking, a significantly reduced JND was identified. No significant effect of coding strategy on speech understanding in noise or competing-talker materials was found. Melody recognition skills were the same under all user conditions.	t	\N
23218176	The use of the stapes coupling technique, employed in the Vibrant Soundbridge system, is technically less demanding than the vibroplasty technique, and is more likely to generate a positive outcome without significantly changing residual hearing or resulting in medical or surgical complication. We report a patient with repeated left ossiculoplasty failure, who was successfully implanted with a Vibrant Soundbridge. We believe that the stapes coupling technique can provide natural stimulation to the inner ear, resulting in a better perceived sound quality.	t	\N
23219981	Prior work with Wright and others demonstrated that rhesus monkeys recognized the relative relationships of notes in common melodies. As an extension of tests of pattern similarities, tamarins were habituated to 3-sound unit patterns in an AAB or ABB form that were human phonemes, piano notes, or monkey calls. The subjects were tested with novel sounds in each category constructed either to match the prior pattern or to violate the prior habituated pattern. The monkeys attended significantly more to a violation of their habituated pattern to a new pattern when human phonemes were used, and there was a trend difference in attention toward pattern violations with melodies. Monkey call patterns generated a variety of behavioral responses, were less likely to show habituation, and did not generate a strong attention reaction to changes in the patterns. Monkeys can extract abstract rules and patterns from auditory stimuli but the stimuli, by their nature, may generate competing responses which block processing of abstract regularities.	t	\N
23220120	Normal-hearing listeners can perceptually segregate concurrent sound sources, but listeners with significant hearing loss or who wear a cochlear implant (CI) lag behind in this ability. Perceptual grouping mechanisms are essential to segregate concurrent sound sources and affect comodulation masking release (CMR). Thus, CMR measurements in CI users could shed light on segregation cues needed for forming and grouping of auditory objects. CMR illustrates the fact that detection of a target sound embedded in a fluctuating masker is improved by the addition of masker energy remote from the target frequency, provided the envelope fluctuations across masker components are coherent. We modified such a CMR experiment to electrically-induced hearing using direct stimulation and measured the effect in 21 CI users. Cluster analysis of our data revealed two groups: one showed no or only small CMR of 0.1 dB ± 2.7 (N = 14) and a second group achieved a CMR of 10.7 dB ± 3.2 (N = 7), a value that is close to the enhancement observed in a comparable acoustic experiment in normal-hearing listeners (12.9 dB ± 2.6, N = 6). Interestingly, we observed that CMR in CI users may relate to hearing etiology and duration of hearing loss pre-implantation. Our study demonstrates for the first time that a substantial minority of cochlear-implant listeners (about a third) can show significant CMR. This outcome motivates the development of physiologically inspired multi-band gain control and/or different coding strategies for these groups in order to better preserve coherent modulation and thus to take advantage of the individual remaining capabilities to analyze spectro-temporal patterns.	t	\N
23224782	Past research has identified an event-related potential (ERP) marker for vocal emotional encoding and has highlighted vocal-processing differences between male and female listeners. We further investigated this ERP vocal-encoding effect in order to determine whether it predicts voice-related changes in listeners' memory for verbal interaction content. Additionally, we explored whether sex differences in vocal processing would affect such changes. To these ends, we presented participants with a series of neutral words spoken with a neutral or a sad voice. The participants subsequently encountered these words, together with new words, in a visual word recognition test. In addition to making old/new decisions, the participants rated the emotional valence of each test word. During the encoding of spoken words, sad voices elicited a greater P200 in the ERP than did neutral voices. While the P200 effect was unrelated to a subsequent recognition advantage for test words previously heard with a neutral as compared to a sad voice, the P200 did significantly predict differences between these words in a concurrent late positive ERP component. Additionally, the P200 effect predicted voice-related changes in word valence. As compared to words studied with a neutral voice, words studied with a sad voice were rated more negatively, and this rating difference was larger, the larger the P200 encoding effect was. While some of these results were comparable in male and female participants, the latter group showed a stronger P200 encoding effect and qualitatively different ERP responses during word retrieval. Estrogen measurements suggested the possibility that these sex differences have a genetic basis.	t	\N
23231206	Earlier work using sine-wave and noise-vocoded signals suggests that dynamic spectral structure plays a greater role in speech recognition for children than adults [Nittrouer and Lowenstein. (2010). J. Acoust. Soc. Am. 127, 1624-1635], but questions arise concerning whether outcomes can be compared because sine waves and wide noise bands are different in nature. The current study addressed that question using narrow noise bands for both signals, and applying a difference ratio to index the contribution made by dynamic spectral structure. Results replicated earlier findings, supporting the idea that dynamic spectral structure plays a critical role in speech recognition, especially for children.	t	\N
23241225	According to classical theories, automatic processes operate independently of attention. Recent evidence, however, shows that masked visuomotor priming, an example of an automatic process, depends on attention to visual form versus semantics. In a continuation of this approach, we probed feature-specific attention within the perceptual domain and tested in two event-related potential (ERP) studies whether masked visuomotor priming in a shape decision task specifically depends on attentional sensitization of visual pathways for shape in contrast to color. Prior to the masked priming procedure, a shape or a color decision task served to induce corresponding task sets. ERP analyses revealed visuomotor priming effects over the occipitoparietal scalp only after the shape, but not after the color induction task. Thus, top-down control coordinates automatic processing streams in congruency with higher-level goals even at a fine-grained level.	t	\N
23246616	Speech recognition in a multi-talker situation poses high demands on attentional and other central resources. This study examines the relationship between age, cognition and speech recognition in tasks that require selective or divided attention in a multi-talker setting. Two groups of normal-hearing adults (one younger and one older group) were asked to repeat utterances from either one or two concurrent speakers. Cognitive abilities were then inspected by neuropsychological tests. Speech recognition scores approached its ceiling and did not significantly differ between age groups for tasks that demanded selective attention. However, when divided attention was required, performance in older listeners was reduced as compared to the younger group. When selective attention was required, speech recognition was strongly related to working memory skills, as determined by a regression model. In comparison, speech recognition for tests requiring divided attention could be more strongly determined by neuropsychological probes of fluid intelligence. The findings of this study indicate that - apart from hearing impairment - cognitive aspects account for the typical difficulties of older listeners in a multi-speaker setting. Our results are discussed in the context of evidence showing that frontal lobe functions in terms of working memory and fluid intelligence generally decline with age.	t	\N
23269517	The most prominent models of numerical representation posit that numerical symbols are converted into a single internal, abstract representation prior to estimation and comparison processing. Here, we (1) provide a mathematical analysis of the predictions of the abstract-representation hypothesis, assuming the validity of the analog-representation hypothesis, (2) run a simulation to assess the patterns of data that result from our mathematical analysis, and (3) conduct two experiments to test the predictions of our model, using relative frequencies as inputs. We assess relative frequencies in a typical numerical distance task, whereby participants are presented with two relative frequencies and asked to identify the one that represents the larger quantity. Our data reveal that relative frequencies' numerical representations (1) are analog and (2) are scale-specific (i.e., nonabstract).	t	\N
23275409	In this study, the authors explored whether French-learning infants use nonadjacent phonotactic regularities in their native language, which they learn between the ages of 7 and 10 months, to segment words from fluent speech. Two groups of 20 French-learning infants were tested using the head-turn preference procedure at 10 and 13 months of age. In Experiment 1, infants were familiarized with 2 passages: 1 containing a target word with a frequent nonadjacent phonotactic structure and the other containing a target word with an infrequent nonadjacent phonotactic structure in French. During the test phase, infants were presented with 4 word lists: 2 containing the target words presented during familiarization and 2 other control words with the same phonotactic structure. In Experiment 2, the authors retested infants' ability to segment words with the infrequent phonotactic structure. Ten- and 13-month-olds were able to segment words with the frequent phonotactic structure, but it is only by 13 months, and only under the circumstances of Experiment 2, that infants could segment words with the infrequent phonotactic structure. These results provide new evidence showing that infant word segmentation is influenced by prior nonadjacent phonotactic knowledge.	t	\N
23275416	In this article, the authors describe the development of a new instrument, the Test of Child Speechreading (ToCS), which was specifically designed for use with deaf and hearing children. Speechreading is a skill that is required for deaf children to access the language of the hearing community. ToCS is a deaf-friendly, computer-based test that measures child speechreading (silent lipreading) at 3 psycholinguistic levels: (a) Words, (b) Sentences, and (c) Short Stories. The aims of the study were to standardize the ToCS with deaf and hearing children and to investigate the effects of hearing status, age, and linguistic complexity on speechreading ability. Eighty-six severely and profoundly deaf children and 91 hearing children participated. All children were between the ages of 5 and 14 years. The deaf children were from a range of language and communication backgrounds, and their preferred mode of communication varied. Speechreading skills significantly improved with age for both groups of children. There was no effect of hearing status on speechreading ability, and children from both groups showed similar performance across all subtests of the ToCS. The ToCS is a valid and reliable assessment of speechreading ability in school-age children that can be used to measure individual differences in performance in speechreading ability.	t	\N
23275425	In this study, the authors aimed to investigate how differences in language ability relate to differences in processing talker information in the native language and an unfamiliar language by comparing performance for different ages and for groups with impaired language. Three groups of native English listeners with typical language development (TLD; ages 7-9, ages 10-12, adults) and 2 groups with specific language impairment (SLI; ages 7-9, ages 10-12) participated in the study. Listeners heard pairs of words in both English and German (unfamiliar language) and were asked to determine whether the words were produced by the same or different talkers. In English, talker discrimination improved with age. In German, performance improved with age for the school-age children but was worse for adult listeners. No differences were found between TLD and SLI children. These results show that as listeners' language skills develop, there is a trade-off between more general perceptual abilities useful for processing talker information in any language and those that are relevant to their everyday language experiences and, thus, tied to the phonology. The lack of differences between the children with and without language impairments suggests that general auditory processing may be intact in at least some children with SLI.	t	\N
23275583	This study was designed to evaluate the effect of a pinna compensation (PC) algorithm on localization performance in the horizontal plane and speech intelligibility in noise. Nine and 18 experienced hearing aid users with bilaterally symmetrical sensorineural hearing loss participated in the localization study and the speech-in-noise study, respectively. Performance was evaluated unaided, aided with a behind-the-ear (BTE) hearing aid with an omnidirectional microphone (Omni), and aided with the same hearing aid with the PC algorithm (Omni+PC). Localization performance was measured using 12 loudspeakers spaced 30° apart on a horizontal plane. Speech-in-noise performance was measured with speech presented from 0° or 180°. A single-blinded, repeated measures design was used. Significant improvement in localization accuracy was found when comparing the Omni+PC condition to the Omni condition. Also, the Omni+PC condition improved the signal-to-noise ratio by 2.4 dB when compared to the Omni condition when speech was presented from the front in a diffuse noise background. Use of the PC algorithm improved localization on the horizontal plane and speech-in-noise performance. These results support use of the PC algorithm in BTE hearing aid fittings.	t	\N
23276111	Recent research on eye movements during scene viewing has primarily focused on where the eyes fixate. But eye fixations also differ in their durations. Here we investigated whether fixation durations in scene viewing are under the direct and immediate control of the current visual input. Subjects freely viewed photographs of scenes in preparation for a later memory test while their eye movements were recorded. Using a novel scene degradation paradigm based on a saccade-contingent display change method, scenes were reduced in luminance during saccades ending in critical fixations. Results from two experiments showed that the durations of the critical fixations were immediately affected by scene luminance, with a monotonic relationship between luminance reduction and fixation duration. The results are the first to demonstrate that fixation durations in scene viewing are immediately influenced by the ease of processing of the image currently in view. These results are consistent with the CRISP (a timer-Controlled Random-Walk with Inhibition for Saccade Planning) computational model of saccade generation in scenes, proposing that difficulty in moment-by-moment visual and cognitive processing of the scene modulates fixation durations.	t	\N
23281939	Recent human behavioral studies have shown semantic and/or lexical processing for stimuli presented below the auditory perception threshold. Here, we investigated electroencephalographic responses to words, pseudo-words and complex sounds, in conditions where phonological and lexical categorizations were behaviorally successful (categorized stimuli) or unsuccessful (uncategorized stimuli). Data showed a greater decrease in low-beta power at left-hemisphere temporal electrodes for categorized non-lexical sounds (complex sounds and pseudo-words) than for categorized lexical sounds (words), consistent with the signature of a failure in lexical access. Similar differences between lexical and non-lexical sounds were observed for uncategorized stimuli, although these stimuli did not yield evoked potentials or theta activity. The results of the present study suggest that behaviorally uncategorized stimuli were processed at the lexical level, and provide evidence of the neural bases of the results observed in previous behavioral studies investigating auditory perception in the absence of stimulus awareness.	t	\N
23285949	To study the long-term rehabilitation effects for treating tinnitus by tinnitus masking combined with counseling. Complete the tinnitus handicap inventory(THI) for participants before treatment and after six months, 1 year, 2 years, 3 years. Eighty-six tinnitus patients participated. Nine participants and sixteen, twenty-six, sixteen participants drop-up during the follow up after half-year, 1 year, 2 years, 3 years separately. The percentage of participants whose THI score decreased more than or equal to 20 are 66%, 56%, 40%, 48% after half year, 1 year, 2 years, 3 years respectively. The tinnitus related handicap improved by tinnitus masking combined with counseling after 1 year. The drop-out rate was increase after follow-up 1 year. The long-term rehabilitation effects for tinnitus treatment is still needed.	t	\N
23288656	Three experiments investigated whether extrinsic vowel normalization takes place largely at a categorical or a precategorical level of processing. Traditional vowel normalization effects in categorization were replicated in Experiment 1: Vowels taken from an [I]-[ε] continuum were more often interpreted as /I/ (which has a low first formant, F(1)) when the vowels were heard in contexts that had a raised F(1) than when the contexts had a lowered F(1). This was established with contexts that consisted of only two syllables. These short contexts were necessary for Experiment 2, a discrimination task that encouraged listeners to focus on the perceptual properties of vowels at a precategorical level. Vowel normalization was again found: Ambiguous vowels were more easily discriminated from an endpoint [ε] than from an endpoint [I] in a high-F(1) context, whereas the opposite was true in a low-F(1) context. Experiment 3 measured discriminability between pairs of steps along the [I]-[ε] continuum. Contextual influences were again found, but without discrimination peaks, contrary to what was predicted from the same participants' categorization behavior. Extrinsic vowel normalization therefore appears to be a process that takes place at least in part at a precategorical processing level.	t	\N
23294198	Thirteen (5.4%) of patients with idiopathic sudden sensorineural hearing loss (ISSNHL) had benign paroxysmal positional vertigo (BPPV). Most of the patients showed profound hearing loss and had BPPV of the lateral canal. BPPV in patients with ISSNHL may have no influence on hearing recovery. BPPV occurs in 8.6-12.7% of patients with ISSNHL; however, the role of BPPV in hearing recovery remains controversial. Therefore, we investigated hearing outcomes in the patients, including the distribution of initial hearing threshold, the type of canal involved, and the number of repositioning maneuvers performed. Of 241 patients with ISSNHL who presented to a dizziness clinic between March 2008 and May 2012, 13 with both ISSNHL and BPPV were recruited for this study. The patients were evaluated for their initial hearing threshold, type of canal involved, response to repositioning maneuvers, and hearing outcome. Age- and hearing threshold-matched patients with ISSNHL but without BPPV were randomly sampled and included for a hearing outcome comparison. Of 13 patients with ISSNHL and BPPV, 11 showed profound hearing loss. The lateral canal was involved in 11 patients, including 3 who had multi-canal involvement. Ten patients underwent a single repositioning maneuver. Those patients did not display a significant difference in hearing recovery compared with those patients having ISSNHL only.	t	\N
23294284	Following cochlear implantation, hearing-impaired listeners must adapt to speech as heard through their prosthesis. Visual speech information (VSI; the lip and facial movements of speech) is typically available in everyday conversation. Here, we investigate whether learning to understand a popular auditory simulation of speech as transduced by a cochlear implant (noise-vocoded [NV] speech) is enhanced by the provision of VSI. Experiment 1 demonstrates that provision of VSI concurrently with a clear auditory form of an utterance as feedback after each NV utterance during training does not enhance learning over clear auditory feedback alone, suggesting that VSI does not play a special role in retuning of perceptual representations of speech. Experiment 2 demonstrates that provision of VSI concurrently with NV speech (a simulation of typical real-world experience) facilitates perceptual learning of NV speech, but only when an NV-only repetition of each utterance is presented after the composite NV/VSI form during training. Experiment 3 shows that this more efficient learning of NV speech is probably due to the additional listening effort required to comprehend the utterance when clear feedback is never provided and is not specifically due to the provision of VSI. Our results suggest that rehabilitation after cochlear implantation does not necessarily require naturalistic audiovisual input, but may be most effective when (a) training utterances are relatively intelligible (approximately 85% of words reported correctly during effortful listening), and (b) the individual has the opportunity to map what they know of an utterance's linguistic content onto the degraded form.	t	\N
23294551	The aim of the present study was to examine patterns of neural activity in response to variations in scale notes and alterations from a scale note to a non-scale note. Event-related potentials (ERPs) were recorded in response to scale and non-scale violin notes using an odd-ball mismatch negativity (MMN) paradigm. Standard stimuli were set to the scale note A4 (440 Hz). Deviant stimuli included two scale notes (scale-B, B4 = 494 Hz; scale-C, C5 = 523 Hz) and a non-scale note halfway between them (non-scale, B4+42¢ = 506 Hz). MMN amplitude elicited by the non-scale was significantly larger than that elicited by the scale-B and scale-C, which did not differ significantly from one another. The current results suggest that the human brain may possess pre-attentive mechanisms for extracting relational aspects among sounds of the musical scale. The results indicate that non-scale notes may be processed in a different way even in the pre-attentive stage than scale notes.	t	\N
23296187	Sounds can modulate visual perception as well as neural activity in retinotopic cortex. Most studies in this context investigated how sounds change neural amplitude and oscillatory phase reset in visual cortex. However, recent studies in macaque monkeys show that congruence of audio-visual stimuli also modulates the amount of stimulus information carried by spiking activity of primary auditory and visual neurons. Here, we used naturalistic video stimuli and recorded the spatial patterns of functional MRI signals in human retinotopic cortex to test whether the discriminability of such patterns varied with the presence and congruence of co-occurring sounds. We found that incongruent sounds significantly impaired stimulus decoding from area V2 and there was a similar trend for V3. This effect was associated with reduced inter-trial reliability of patterns (i.e. higher levels of noise), but was not accompanied by any detectable modulation of overall signal amplitude. We conclude that sounds modulate naturalistic stimulus encoding in early human retinotopic cortex without affecting overall signal amplitude. Subthreshold modulation, oscillatory phase reset and dynamic attentional modulation are candidate neural and cognitive mechanisms mediating these effects.	t	\N
23297908	The relative contributions of within-channel and across-channel processes to perceptual comodulation masking release (CMR) were investigated in the framework of an auditory processing model. A generalized version of the computational auditory signal processing and perception model [CASP; Jepsen et al., J. Acoust. Soc. Am. 124, 422-438 (2008)] was used and extended by an across-channel modulation processing stage according to Piechowiak et al. [J. Acoust. Soc. Am. 121, 2111-2126 (2007)]. Five experimental paradigms were considered: CMR with a broadband noise masker as a function of the masker spectrum level; CMR with four widely spaced flanking bands (FBs) varying in overall level; CMR with one FB varying in frequency and level relative to the on-frequency band (OFB); CMR with one FB varying in frequency; and CMR as a function of the number of FBs. The predictions suggest that at least three different mechanisms contribute to overall CMR in the considered conditions: (1) a within-channel process based on changes in the envelope characteristic due to the addition of the signal to the masker; (2) a within-channel process based on nonlinear peripheral processing of the OFB's envelope caused by the FB(s); and (3) an across-channel process that is robust across presentation levels but relatively small (2-5 dB).	t	\N
23297915	This study used the same methodology in Wong [J. Speech Lang. Hear. Res. 55, 1423-1437 (2012b)] to examine the perceived accuracy of monosyllabic Mandarin tones produced by 4- and 5-year-old Mandarin-speaking children growing up in Taiwan and combined the findings with those of 3-year-olds reported in Wong [J. Speech Lang. Hear. Res. 55, 1423-1437 (2012b)] to track the development of monosyllabic tone production in preschool children. Tone productions of adults and children were collected in a picture naming task and low-pass filtered to remove lexical information and reserve tone information. Five native-speakers categorized the target tones in the filtered productions. Children's tone accuracy was compared to adults' to determine mastery and developmental changes. The results showed that preschool children in Taiwan have not fully mastered the production of monosyllabic Mandarin tones. None of the tones produced by the children in the three age groups reached adult-like accuracy. Little developmental change was found in children's tone accuracy during the preschool years. A similar order of accuracy of the tones was observed across the three age groups and the order appeared to follow the order of articulatory complexity in producing the tones. The findings suggest a protracted course of development in children's acquisition of Mandarin tones and that tone development may be constrained by physiological factors.	t	\N
23299125	Phantom electrode (PE) stimulation consists of out-of-phase stimulation of two electrodes. When presented at the apex of the electrode array, phantom stimulation is known to produce a lower pitch sensation than monopolar (MP) stimulation on the most apical electrode. The ratio of the current between the primary electrode (PEL) and the compensating electrode (CEL) is represented by the coefficient σ, which ranges from 0 (monopolar) to 1 (full bipolar). The exact mechanism by which PE stimulation produces a lower pitch sensation is unclear. In the present study, unmasked and masked thresholds were obtained using a forward masking paradigm to estimate the spread of current for MP and PE stimulation. Masked thresholds were measured for two phantom electrode configurations (1) PEL = 4, CEL = 5 (lower pitch phantom) and (2) PEL = 4, CEL = 3 (higher pitch phantom). The unmasked thresholds were subtracted from the masked thresholds to obtain masking patterns which were normalized to their peak. The masking patterns reveal (1) differences in the spread of excitation that are consistent with the direction of pitch shift produced by PE stimulation, and (2) narrower spread of electrical excitation for PE stimulation relative to MP stimulation.	t	\N
23301659	The last decade has offered a multitude of instant fit coupling systems to be fitted with behind-the-ear (BTE) hearing aids. The impact of these designs on the reliability of real ear measurements (REMs) has not been reported. The purpose of this study was to obtain REM reliability data for instant fit coupling systems. REM reliability data was obtained for four different instant-fit coupling systems and for standard size 13 tubing and custom earmolds. REMs were performed for all five coupling systems two times and by two examiners. Ten normal-hearing individuals (20 ears) served as participants. The REM test-retest reliability is high for the four instant fit coupling systems as well as for the custom earmolds. The REM inter-examiner reliability is high for three of the four instant fit coupling systems. Carrying out REMs with instant fit coupling systems appears to be fundamentally no different than performing REMs with conventional hearing aids. For either, care should be taken in probe tube placement in terms of insertion depth and maintaining the probe tube placement, and other best practices regarding test environment and test setup should be observed.	t	\N
23308266	Computational and experimental research has revealed that auditory sensory predictions are derived from regularities of the current environment by using internal generative models. However, so far, what has not been addressed is how the auditory system handles situations giving rise to redundant or even contradictory predictions derived from different sources of information. To this end, we measured error signals in the event-related brain potentials (ERPs) in response to violations of auditory predictions. Sounds could be predicted on the basis of overall probability, i.e., one sound was presented frequently and another sound rarely. Furthermore, each sound was predicted by an informative visual cue. Participants' task was to use the cue and to discriminate the two sounds as fast as possible. Violations of the probability based prediction (i.e., a rare sound) as well as violations of the visual-auditory prediction (i.e., an incongruent sound) elicited error signals in the ERPs (Mismatch Negativity [MMN] and Incongruency Response [IR]). Particular error signals were observed even in case the overall probability and the visual symbol predicted different sounds. That is, the auditory system concurrently maintains and tests contradictory predictions. Moreover, if the same sound was predicted, we observed an additive error signal (scalp potential and primary current density) equaling the sum of the specific error signals. Thus, the auditory system maintains and tolerates functionally independently represented redundant and contradictory predictions. We argue that the auditory system exploits all currently active regularities in order to optimally prepare for future events.	t	\N
23317386	Non-word repetition (NWR) tasks have been found to correlate with language skills and to discriminate between groups of typically developing (TD) children and children with specific language impairment (SLI) across languages. The main aim was to develop an easily-administered NWR screening test that could discriminate between Slovak-speaking TD children and children with SLI. The second aim was to establish if the novel scoring methods for NWR tasks were equally effective at differentiating between the TD versus SLI groups. As Slovak vowels are not reduced in unstressed syllables, it was also sought to establish whether scoring vowels (in addition to the consonants usually assessed in English language tests) would be informative. The paper evaluated the performance of a new NWR task for Slovak-speaking children. Study 1 compared the performance of 60 TD children in three age groups: 3-year-olds (N = 20), 4-your-olds (N = 20) and 5-year-olds (N = 20). Five types of scoring methods were examined: whole-item, number of syllables, syllable structure, consonants, and vowels. Study 2 compared performance on the NWR task administered in Study 1 across three groups of child participants: an SLI group (N = 16), a TD age-matched group (N = 16), and a TD language-matched group (N = 14). Study 1 found an age effect in the TD sample for three out of five of the scoring methods tested (number of syllables, syllable structure, and consonants). Study 2 showed that all five of the scoring methods discriminated between: (1) the group of children with SLI and the TD language-matched (TDLM) sample and (2) the group of children with SLI and the TD age-matched (TDAM) group. The novel NWR tool reliably differentiated between children with SLI and TD children. Scoring cut-off points which demonstrated high rates of success at verifying true-positives (93.75%) and true-negatives (100%) are provided. The most informative scoring methods for Slavic languages (whole-item scoring and vowels correct) are identified and discussed.	t	\N
23333667	Spontaneous beat gestures are an integral part of the paralinguistic context during face-to-face conversations. Here we investigated the time course of beat-speech integration in speech perception by measuring ERPs evoked by words pronounced with or without an accompanying beat gesture, while participants watched a spoken discourse. Words accompanied by beats elicited a positive shift in ERPs at an early sensory stage (before 100 ms) and at a later time window coinciding with the auditory component P2. The same word tokens produced no ERP differences when participants listened to the discourse without view of the speaker. We conclude that beat gestures are integrated with speech early on in time and modulate sensory/phonological levels of processing. The present results support the possible role of beats as a highlighter, helping the listener to direct the focus of attention to important information and modulate the parsing of the speech stream.	t	\N
23333869	The present study investigates the status of rhythmic irregularities occurring in natural speech and the importance of rhythmic alternations in cognitive processing. Previous studies showed the relevance of rhythm for language processing, but there has been only little research using the method of event-related potentials to investigate this phenomenon in a natural metrical context. To this end, an experiment was conducted in which the so-called Rhythm Rule (alternation of stressed and unstressed syllables) was either met or violated by stress clashes or stress lapses which are known to occur in German. The comparison of rhythmic well-formed conditions with the conditions including rhythmic irregularities revealed biphasic EEG-patterns for rhythmically marked structures, i.e., stress clashes and lapses. The present results show that irregular but possible rhythmic variants are costly in language processing, reflected by an early negativity and an N400 in contrast to the well-formed control conditions. Supposedly, the early negativity reflects error detection in rhythmical structure and supports the view that the brain is sensitive to subtle violations of rhythmical structure. A late positive component reflects the evaluation process related to the task requirements. The study shows that subtle rhythmical deviations from the Rhythm Rule are perceived and treated differently from well-formed structures during processing, even if the deviation in question is permitted and can therefore occur in language production.	t	\N
23335579	Recent work suggests that people predict how objects interact in a manner consistent with Newtonian physics, but with additional uncertainty. However, the sources of uncertainty have not been examined. In this study, we measure perceptual noise in initial conditions and stochasticity in the physical model used to make predictions. Participants predicted the trajectory of a moving object through occluded motion and bounces, and we compared their behavior to an ideal observer model. We found that human judgments cannot be captured by simple heuristics and must incorporate noisy dynamics. Moreover, these judgments are biased consistently with a prior expectation on object destinations, suggesting that people use simple expectations about outcomes to compensate for uncertainty about their physical models.	t	\N
23337441	Here, we explore the sensitivity of different awareness scales in revealing conscious reports on visual emotion perception. Participants were exposed to a backward masking task involving fearful faces and asked to rate their conscious awareness in perceiving emotion in facial expression using three different subjective measures: confidence ratings (CRs), with the conventional taxonomy of certainty, the perceptual awareness scale (PAS), through which participants categorize "raw" visual experience, and post-decision wagering (PDW), which involves economic categorization. Our results show that the CR measure was the most exhaustive and the most graded. In contrast, the PAS and PDW measures suggested instead that consciousness of emotional stimuli is dichotomous. Possible explanations of the inconsistency were discussed. Finally, our results also indicate that PDW biases awareness ratings by enhancing first-order accuracy of emotion perception. This effect was possibly a result of higher motivation induced by monetary incentives.	t	\N
23338560	Our study estimates detection thresholds for tones of different durations and frequencies in Great Tits (Parus major) with operant procedures. We employ signals covering the duration and frequency range of communication signals of this species (40-1,010 ms; 2, 4, 6.3 kHz), and we measure threshold level-duration (TLD) function (relating threshold level to signal duration) in silence as well as under behaviorally relevant environmental noise conditions (urban noise, woodland noise). Detection thresholds decreased with increasing signal duration. Thresholds at any given duration were a function of signal frequency and were elevated in background noise, but the shape of Great Tit TLD functions was independent of signal frequency and background condition. To enable comparisons of our Great Tit data to those from other species, TLD functions were first fitted with a traditional leaky-integrator model. We then applied a probabilistic model to interpret the trade-off between signal amplitude and duration at threshold. Great Tit TLD functions exhibit features that are similar across species. The current results, however, cannot explain why Great Tits in noisy urban environments produce shorter song elements or faster songs than those in quieter woodland environments, as detection thresholds are lower for longer elements also under noisy conditions.	t	\N
23342146	This study investigated a theoretically challenging dissociation between good production and poor perception of tones among neurologically unimpaired native speakers of Cantonese. The dissociation is referred to as the near-merger phenomenon in sociolinguistic studies of sound change. In a passive oddball paradigm, lexical and nonlexical syllables of the T1/T6 and T4/T6 contrasts were presented to elicit the mismatch negativity (MMN) and P3a from two groups of participants, those who could produce and distinguish all tones in the language (Control) and those who could produce all tones but specifically failed to distinguish between T4 and T6 in perception (Dissociation). The presence of MMN to T1/T6 and null response to T4/T6 of lexical syllables in the dissociation group confirmed the near-merger phenomenon. The observation that the control participants exhibited a statistically reliable MMN to lexical syllables of T1/T6, weaker responses to nonlexical syllables of T1/T6 and lexical syllables of T4/T6, and finally null response to nonlexical syllables of T4/T6, suggests the involvement of top-down processing in speech perception. Furthermore, the stronger P3a response of the control group, compared with the dissociation group in the same experimental conditions, may be taken to indicate higher cognitive capability in attention switching, auditory attention or memory in the control participants. This cognitive difference, together with our speculation that constant top-down predictions without complete bottom-up analysis of acoustic signals in speech recognition may reduce one's sensitivity to small acoustic contrasts, account for the occurrence of dissociation in some individuals but not others.	t	\N
23354998	Complex sounds vary along a number of acoustic dimensions. These dimensions may exhibit correlations that are familiar to listeners due to their frequent occurrence in natural sounds-namely, speech. However, the precise mechanisms that enable the integration of these dimensions are not well understood. In this study, we examined the categorization of novel auditory stimuli that differed in the correlations of their acoustic dimensions, using decision bound theory. Decision bound theory assumes that stimuli are categorized on the basis of either a single dimension (rule based) or the combination of more than one dimension (information integration) and provides tools for assessing successful integration across multiple acoustic dimensions. In two experiments, we manipulated the stimulus distributions such that in Experiment 1, optimal categorization could be accomplished by either a rule-based or an information integration strategy, while in Experiment 2, optimal categorization was possible only by using an information integration strategy. In both experiments, the pattern of results demonstrated that unidimensional strategies were strongly preferred. Listeners focused on the acoustic dimension most closely related to pitch, suggesting that pitch-based categorization was given preference over timbre-based categorization. Importantly, in Experiment 2, listeners also relied on a two-dimensional information integration strategy, if there was immediate feedback. Furthermore, this strategy was used more often for distributions defined by a negative spectral correlation between stimulus dimensions, as compared with distributions with a positive correlation. These results suggest that prior experience with such correlations might shape short-term auditory category learning.	t	\N
23357804	Frequency-lowering signal processing in hearing aids has re-emerged as an option to improve audibility of the high frequencies by expanding the input bandwidth. Few studies have investigated the usefulness of the scheme as an option for bimodal users (i.e., combined use of a cochlear implant and a contralateral hearing aid). In this study, that question was posed. The purposes of this study were (1) to determine if frequency compression was a better bimodal option than conventional amplification and (2) to determine the impact of a frequency-compression hearing aid on speech recognition abilities. There were two separate experiments in this study. The first experiment investigated the contribution of a frequency-compression hearing aid to contralateral cochlear implant (CI) performance for localization and speech perception in noise. The second experiment assessed monaural consonant and vowel perception in quiet using the frequency-compression and conventional hearing aid without the use of a contralateral CI or hearing aid. Ten subjects fitted with a cochlear implant and hearing aid participated in the first experiment. Seventeen adult subjects with a cochlear implant and hearing aid or two hearing aids participated in the second experiment. To be included, subjects had to have a history of postlingual deafness, a moderate or moderate-to-severe hearing loss, and have not worn this type of frequency-lowering hearing aid previously. In the first experiment, performance using the frequency-compression and conventional hearing aids was assessed on tests of sound localization, speech perception in a background of noise, and two self-report questionnaires. In the second experiment, consonant and vowel perception in quiet was assessed monaurally for the two conditions. In both experiments, subjects alternated daily between a frequency-compression and conventional hearing aid for 2 mo. The parameters of frequency compression were set individually for each subject, and audibility was measured for the frequency compression and conventional hearing aid programs by comparing estimations of the Speech Intelligibility Index (SII) using a modified algorithm (Bentler et al, 2011). In both experiments, the outcome measures were administered following the hearing aid fitting to assess performance at baseline and after 2 mo of use. For this group of subjects, the results revealed no significant difference between the frequency-compression and conventional hearing aid on tests of localization and consonant recognition. Spondee-in-noise and vowel perception scores were significantly higher with the conventional hearing aid compared to the frequency-compression hearing aid after 2 mo of use. These results suggest that, for the subjects in this study, frequency compression is not a better bimodal option than conventional amplification. In addition, speech perception may be negatively influenced by frequency compression because formant frequencies are too severely compressed and can no longer be distinguished.	t	\N
23357807	Johnson and Dillon (2011) provided a model-based comparison of current generic hearing aid prescriptive methods for adults with hearing loss based on the attributes of speech intelligibility, loudness, and bandwidth. This study compared the National Acoustic Laboratories-Non-linear 2 (NAL-NL2) and Cambridge Method for Loudness Equalization 2-High-Frequency (CAM2) prescriptive methods using adult participants with less high-frequency hearing loss than Johnson and Dillon (2011). Of study interest was quantification of prescribed audibility, speech intelligibility, and loudness. The preferences of participants for either NAL-NL2 or CAM2 and preferred deviations from prescribed settings are also reported. Using a single-blind, counter-balanced, randomized design, preference judgments for the prescriptive methods with regard to sound quality of speech and music stimuli were obtained. Preferred gain adjustments from the prescription within the 4-10 kHz frequency range were also obtained from each participant. Speech intelligibility and loudness model calculations were completed on the prescribed and adjusted amplification. Fourteen male Veterans, whose average age was 65 yr and whose hearing sensitivity averaged normal to borderline normal through 1000 Hz sloping to a moderately severe sensorineural loss, served as participants. Following a brief listening time (~10 min), typical of an initial fitting visit, the participants made paired comparison of sound quality between the NAL-NL2 and CAM2 prescriptive settings. Participants were also asked to modify each prescription in the range of 4-10 kHz using an overall gain control and make subsequent comparisons of sound quality preference between prescriptive and adjusted settings. Participant preferences were examined with respect to quantitative analysis of loudness modeling, speech intelligibility modeling, and measured high-frequency bandwidth audibility. Consistent with the lack of difference in predicted speech intelligibility between the two prescriptions, sound quality preferences on the basis of clarity were split across participants while some participants did not have a discernable preference. Considering sound quality judgments of pleasantness, the majority of participants preferred the sound quality of the NAL-NL2 (8 of 14) prescription instead of the CAM2 prescription (2 of 14). Four of the 14 participants showed no preference on the basis of pleasantness for either prescription. Individual subject preferences were supported by loudness modeling that indicated NAL-NL2 was the softer of the two prescriptions and CAM2 was the louder. CAM2 did provide more audibility to the higher frequencies (5-8 kHz) than NAL-NL2. Participants turned the 4-10 kHz gain recommendation of CAM2 lower, on average, by a significant amount of 4 dB when making adjustments while no significant adjustment was made to the initial NAL-NL2 recommendation. NAL-NL2 prescribed gains were more often preferred at the initial fitting by the majority of participating veterans. For those patients with preference for a louder fitting than NAL-NL2, CAM2 is a good alternative. When the participant adjustment from the prescription between 4 and 10 kHz exceeded 4 dB from either NAL-NL2 (2 of 14) or CAM2 (11 of 14), the participants demonstrated a later preference for that adjustment 69% of the time. These findings are viewed as limited evidence that some individuals may have a preference for high-frequency gain that differs from the starting prescription.	t	\N
23363117	Recently introduced hearing devices allow dynamic-range compression to be coordinated at the two ears through a wireless link. This study investigates how linking compression across the ears might improve speech intelligibility in the presence of a spatially separated steady noise. An analysis of the compressors' behavior shows how linked compression can preserve interaural level differences (ILDs) and, compared to compression operating independently at each ear, improve the long-term apparent speech-to-noise ratio (SNR) at the ear with the better SNR. Speech intelligibility for normal-hearing listeners was significantly better with linked than with unlinked compression. The performance with linked compression was similar to that without any compression. The benefit of linked over unlinked compression was the same for binaural listening and for monaural listening to the ear with the better SNR, indicating that the benefit was due to changes to the signal at this ear and not to the preservation of ILDs. Differences in performance across experimental conditions were qualitatively consistent with changes in apparent SNR at the better ear. Predictions made using a speech intelligibility model suggest that linked compression could potentially provide a user of bilateral hearing aids with an improvement in intelligibility of up to approximately ten percentage points.	t	\N
23374605	To date, the nature of the phonological deficit in developmental dyslexia is still debated. We concur with possible impairments in the representations of the universal phonological constraints that universally govern how phonemes co-occur as a source of this deficit. We were interested in whether-and how-dyslexic children have sensitivity to sonority-related markedness constraints. We tested 10 French dyslexic children compared with 20 typically developing chronological age-matched and reading level-matched controls. All were tested with two aurally administered syllable counting tasks that manipulated well-formedness of unattested consonant clusters, as determined by universal phonological sonority-related markedness constraints (onset clusters in Experiment 1; intervocalic clusters in Experiment 2). Surprisingly, dyslexic children's response patterns were similar to those in both control groups; as universal phonological sonority-related markedness increased, dyslexic children increasingly perceptually confused and phonologically repaired clusters with an illusory epenthetic vowel (e.g., /ʁəbal/). Although dyslexic children were systematically slower, like both control groups, they were influenced by universal sonority-related markedness constraints and hierarchically ranked constraints specific to French over evident acoustic-phonetic contrasts or sonority-unrelated cues. Our results are counterintuitive but innovative and compete to question an impaired universal phonological grammar because dyslexic children were found to have normal universal phonological constraints and were skilled to restore phonotactically legal syllable structures with a language-specific illusory epenthetic vowel (i.e., /ə/-like vowel). We discuss them regarding active phonological decoding and recoding processes within the framework of the optimality theory.	t	\N
23384511	We tested whether incoming sounds are processed differently depending on how the preceding sound sequence has been interpreted by the brain. Sequences of a regularly repeating three-tone pattern, the perceived organization of which spontaneously switched back and forth between two alternative interpretations, were delivered to listeners. Occasionally, a regular tone was exchanged for a slightly or moderately lower one (deviants). The electroencephalogram (EEG) was recorded while listeners continuously marked their perception of the sound sequence. We found that for both the regular and the deviant tones, the early exogenous P1 and N1 amplitudes varied together with the perceived sound organization. Percept-dependent effects on the late endogenous N2 and P3a amplitudes were only found for deviant tones. These results suggest that the perceived sound organization affects sound processing both by modulating what information is extracted from incoming sounds as well as by influencing how deviant sound events are evaluated for further processing.	t	\N
23384530	Prosodic aspects of speech such as pitch, duration and amplitude constitute nonverbal cues that supplement or modify the meaning of the spoken word, to provide valuable clues as to a speakers' state of mind. It can thus indicate what emotion a person is feeling (emotional prosody), or their attitude towards an event, person or object (attitudinal prosody). Whilst the study of emotional prosody has gathered pace, attitudinal prosody now deserves equal attention. In social cognition, understanding attitudinal prosody is important in its own right, since it can convey powerful constructs such as confidence, persuasion, sarcasm and superiority. In this review, it is examined what prosody is, how it conveys attitudes, and which attitudes prosody can convey. The review finishes by considering the neuroanatomy associated with attitudinal prosody, and put forward the hypothesis that this cognition is mediated by the right cerebral hemisphere, particularly posterior superior lateral temporal cortex, with an additional role for the basal ganglia, and limbic regions such as the medial prefrontal cortex and amygdala. It is suggested that further exploration of its functional neuroanatomy is greatly needed, since it could provide valuable clues about the value of current prosody nomenclature and its separability from other types of prosody at the behavioural level.	t	\N
23386124	The sight of a speaker's facial movements during the perception of a spoken message can benefit speech processing through online predictive mechanisms. Recent evidence suggests that these predictive mechanisms can operate across sensory modalities, that is, vision and audition. However, to date, behavioral and electrophysiological demonstrations of cross-modal prediction in speech have considered only the speaker's native language. Here, we address a question of current debate, namely whether the level of representation involved in cross-modal prediction is phonological or pre-phonological. We do this by testing participants in an unfamiliar language. If cross-modal prediction is predominantly based on phonological representations tuned to the phonemic categories of the native language of the listener, then it should be more effective in the listener's native language than in an unfamiliar one. We tested Spanish and English native speakers in an audiovisual matching paradigm that allowed us to evaluate visual-to-auditory prediction, using sentences in the participant's native language and in an unfamiliar language. The benefits of cross-modal prediction were only seen in the native language, regardless of the particular language or participant's linguistic background. This pattern of results implies that cross-modal visual-to-auditory prediction during speech processing makes strong use of phonological representations, rather than low-level spatiotemporal correlations across facial movements and sounds.	t	\N
23387878	Velopharyngeal incompetence is a contributing factor to speech disorders and implies the presence of hypernasality, inappropriate nasal escape, and decreased air pressure during speech. One prosthetic treatment is a rehabilitative procedure employing a palatal lift prosthesis (PLP), which reduces hypernasality by approximating the incompetent soft palate to the posterior pharyngeal wall and consists of two parts, the anterior denture base and the palatal lifting plate, which are connected with steel wires; however, it seems difficult to reproduce the mobility of the soft palate in speaking, and it is therefore likely that the palatal lifting plate stimulates or oppresses the tissue of the soft palate and hinders rather than assists articulatory function. To avoid these disturbances we devised an adjustable PLP with a flexible conjunction between the denture base and the palatal lifting plate to obtain the optimal vertical lifting angle. The palatal plate was adapted to conform in a passive manner to the soft palate with light-cured resin. The designed PLP simplified the procedure and reduced the number of adjustments and visits.	t	\N
23389426	The human visual attention system is geared toward detecting the most salient and relevant events in an overwhelming stream of information. There has been great interest in measuring how many visual events can be processed at a time, and most of the work has suggested that the limit is three to four. However, attention to a visual stimulus can also be driven by a synchronous auditory event. The present work indicates that a fundamentally different limit applies to audiovisual processing, such that at most only a single audiovisual event can be processed at a time. This limited capacity is not due to a limitation in visual selection; participants were able to process about four visual objects simultaneously. Instead, we propose that audiovisual orienting is subject to a fundamentally different capacity limit than pure visual selection is.	t	\N
23395711	In the current ERP study, an active oddball task was carried out, testing pure tones and auditory, visual and audiovisual syllables. For pure tones, an MMN, an N2b, and a P3 were found, confirming traditional findings. Auditory syllables evoked an N2 and a P3. We found that the amplitude of the P3 depended on the distance between standard and deviant. A smaller distance required more attention, which was reflected in a larger amplitude. An analysis of audiovisual material, after correction for visual activity, showed that McGurk type stimuli evoked brain responses that differed from both the standard and the congruent deviants. Finally, we found that congruent audiovisual stimuli elicited an N2 with a shorter latency and a P3 with a smaller amplitude than auditory stimuli. The current ERP study, thus, shows that for audiovisual processing the whole is more than the sum of its parts.	t	\N
23395774	This paper examines the judgment of segmented temporal intervals, using short tone sequences as a convenient test case. In four experiments, we investigate how the relative lengths, arrangement, and pitches of the tones in a sequence affect judgments of sequence duration, and ask whether the data can be described by a simple weighted sum of segments model. The model incorporates three basic assumptions: (i) the judgment of each segment is a negatively accelerated function of its duration, (ii) the judgment of the overall interval is produced by summing the judgments of each segment, and (iii) more recent segments are weighted more heavily. We also assume that higher-pitched tones are judged to last longer. Empirically, sequences with equal-sized segments were consistently judged longer than those with accelerating or decelerating structures. Furthermore, temporal structure interacted with duration, such that accelerating sequences were judged longer than decelerating ones at short durations but the effect reversed at longer durations. These effects were modulated by the number of tones in the sequence, the rate of acceleration/deceleration, and whether the sequence had ascending or descending pitch, and were well-described by the weighted sum model. The data provide strong constraints on theories of temporal judgment, and the weighted sum of segments model offers a useful basis for future theoretical and empirical investigation.	t	\N
23397035	We used a face-gender repetition priming paradigm to precisely map the spatial frequencies (SFs) that influence observers' responses under different prime awareness conditions. A visible prime condition was set up by presenting the stimulus sequence mask-blank-prime-blank-mask-target and an invisible prime condition by switching the order of the masks and the blanks (see also Dehaene et al., 2001). The prime faces (~4.6° × 3.1°) were randomly filtered trial-by-trial according to the SF bubbles technique (Willenbockel, Fiset et al., 2010). Classification vectors, derived by summing the SF filters from each trial weighted by observers' transformed response times, revealed that SFs around 12 cycles per face width modulated responses in both prime awareness conditions. The significant SFs closely matched those optimal for accurate performance in a direct face-gender classification paradigm. Surprisingly, the significant SFs facilitated observers' responses in the visible prime condition, whereas they slowed responses in the invisible prime condition. Our findings suggest that SF tuning per se remains robust under different prime awareness conditions but that diagnostic visual cues might be utilized in a qualitatively different fashion as a function of awareness.	t	\N
23400826	In two experiments, we used an interruption-and-recall (IAR) task to explore listeners' ability to monitor the capacity of working memory as new information arrived in real time. In this task, listeners heard recorded word lists with instructions to interrupt the input at the maximum point that would still allow for perfect recall. Experiment 1 demonstrated that the most commonly selected segment size closely matched participants' memory span, as measured in a baseline span test. Experiment 2 showed that reducing the sound level of presented word lists to a suprathreshold but effortful listening level disrupted the accuracy of matching selected segment sizes with participants' memory spans. The results are discussed in terms of whether online capacity monitoring may be subsumed under other, already enumerated working memory executive functions (inhibition, set shifting, and memory updating).	t	\N
23403808	The objective of this study was to examine the role of the acoustic stapedius reflex in the protection of speech recognition from the upward spread of masking arising from low-frequency background noise. Speech recognition scores were measured for nine control participants (19-34 years) and six patients with transected stapedius tendons poststapedotomy (39-57 years) as a function of the amplitude of a low-frequency masker, presented at nominal signal to noise ratios of +5 dB, -5 dB, and -15 dB. All participants had pure-tone hearing thresholds in the normal range. Continuous high-pass noise was present in all conditions to avoid ceiling effects; this reduced performance in quiet to approximately 85% for all participants. Scores were measured for soft and loud nonsense syllables (average third octave band levels of 35 and 65 dB SPL), so that a comparison of the low-frequency noise masking functions at the two levels would provide information about the effects of the reflex on speech intelligibility in noise. A third group of nine control participants (19-22 years) listened in the presence of a low-frequency masker gated to come on 1 sec before stimulus onset, to reduce the likelihood of reflex adaptation. The Speech-Intelligibility Index was used to quantify the amount of speech information available in each condition. Patients with transected tendons performed more poorly than control participants as a function of Speech-Intelligibility Index in all conditions, even at levels that were too soft for reflex activation. This could be because of postsurgical differences in sensitivity, the more advanced age of poststapedotomy group, or differences in medial olivocochlear inhibition. For loud speech, patient performance fell nearly linearly with increases in the low-frequency masker, but control participants' performance declined little as the signal to noise ratio declined from +5 to -5 dB, and then fell rapidly as the ratio declined to -15 dB. This plateau in the masking function did not occur for the patients. Masking functions obtained with the gated low-frequency masker were either highly similar or poorer to those obtained with a continuous masker, suggesting that the use of a continuous low frequency masker did not result in significant reflex adaptation. The stapedius reflex appears to offer some protection from the upward spread of masking of speech by background low-frequency noise at moderate levels, but not at high levels.	t	\N
23406991	Around 10% of people experience subjective tinnitus (the perception of sound, only audible to the patient, in the absence of an external auditory stimulus).(1-3) It may be associated with hearing loss, anxiety, depression, sleep disturbance, concentration problems or reduced quality of life; for around 0.5% it is extremely disturbing.(1-4) Risk factors include aging, significant noise exposure, drug therapy (e.g. aminoglycosides, NSAIDs, diuretics), or disorders of the outer, middle or inner ear or auditory nerve (e.g. ear wax, infections, vestibular schwannoma, otosclerosis).(1,2,4) It may be due to excessive spontaneous activity in the auditory system and brain; if the signal (normally suppressed by the subconscious) becomes noticed it becomes more intrusive and annoying in a vicious cycle.(5) Here, we discuss symptomatic drug and non-drug treatments for subjective tinnitus in adults. We do not cover treatment of underlying causes of tinnitus.	t	\N
23408389	Coding for the degree of disorder in a temporally unfolding sensory input allows for optimized encoding of these inputs via information compression and predictive processing. Prior neuroimaging work has examined sensitivity to statistical regularities within single sensory modalities and has associated this function with the hippocampus, anterior cingulate, and lateral temporal cortex. Here we investigated to what extent sensitivity to input disorder, quantified by Markov entropy, is subserved by modality-general or modality-specific neural systems when participants are not required to monitor the input. Participants were presented with rapid (3.3 Hz) auditory and visual series varying over four levels of entropy, while monitoring an infrequently changing fixation cross. For visual series, sensitivity to the magnitude of disorder was found in early visual cortex, the anterior cingulate, and the intraparietal sulcus. For auditory series, sensitivity was found in inferior frontal, lateral temporal, and supplementary motor regions implicated in speech perception and sequencing. Ventral premotor and central cingulate cortices were identified as possible candidates for modality-general uncertainty processing, exhibiting marginal sensitivity to disorder in both modalities. The right temporal pole differentiated the highest and lowest levels of disorder in both modalities, but did not show general sensitivity to the parametric manipulation of disorder. Our results indicate that neural sensitivity to input disorder relies largely on modality-specific systems embedded in extended sensory cortices, though uncertainty-related processing in frontal regions may be driven by both input modalities.	t	\N
23413000	In the present experiments, participants had to verify properties of concepts but, depending on the trial condition, concept-property pairs were presented via headphones or on the screen. The results showed that participants took longer and were less accurate at verifying conceptual properties when the channel used to present the CONCEPT-property pair and the type of property matched in sensory modality (e.g., LEMON-yellow on screen; BLENDER-loud in headphones) compared to when properties and channel did not match (e.g., LEMON-yellow in headphones; BLENDER-loud on screen). Such interference is consistent with theories of embodied cognition holding that knowledge is grounded in modality-specific systems (Barsalou in Behav Brain Sci 22:577-660, 1999). When the resources of one modality are burdened during the task, processing costs are incurred in a conceptual task (Vermeulen et al. in Cognition 109:287-294, 2008).	t	\N
23425567	Previous studies on the role of vowel harmony in word segmentation are based on artificial languages where harmonic cues reliably signal word boundaries. In this corpus study run on the data available at CHILDES, we investigated whether natural languages provide a learner with reliable segmentation cues similar to the ones created artificially. We observed that in harmonic languages (child-directed speech to thirty-five Turkish and three Hungarian children), but not in non-harmonic ones (child-directed speech to one Farsi and four Polish children), harmonic vowel sequences are more likely to appear within words, and non-harmonic ones mostly appear across word boundaries, suggesting that natural harmonic languages provide a learner with regular cues that could potentially be used for word segmentation along with other cues.	t	\N
23432759	The time frame for infants' acquisition of language constancy was probed, using the phonetic variation in a rarely heard accent (South African English) or a frequently heard accent (American English). A total of 156 Australian infants were tested. Six-month-olds looked longer to Australian English than less commonly heard South African accent, but at 9 months, showed similar looking times. With the more frequently heard American accent, 3-month-olds looked longer to Australian and American English, whereas 6-month-olds looked equally. Together these results imply that in the 1st year, differential attention to native versus nonnative accents decreases as infants develop a sense of language constancy for the common native language. However, experience with the nonnative accent can expedite this process.	t	\N
23433243	Patients with alien hand syndrome (AHS) experience making apparently deliberate and purposeful movements with their hand against their will. However, the mechanisms contributing to these involuntary actions remain poorly understood. Here, we describe two experimental investigations in a patient with corticobasal syndrome (CBS) with alien hand behaviour in her right hand. First, we show that responses with the alien hand are made significantly more quickly to images of objects which afford an action with that hand compared to objects which afford an action with the unaffected hand. This finding suggests that involuntary grasping behaviours in AHS might be due to exaggerated, automatic motor activation evoked by objects which afford actions with that limb. Second, using a backwards masked priming task, we found normal automatic inhibition of primed responses in the patient's unaffected hand, but importantly there was no evidence of such suppression in the alien limb. Taken together, these findings suggest that grasping behaviours in AHS may result from exaggerated object affordance effects, which might potentially arise from disrupted inhibition of automatically evoked responses.	t	\N
23440517	Users of bilateral cochlear implants (CIs) experience difficulties localizing sounds in reverberant rooms, even in rooms where normal-hearing listeners would hardly notice the reverberation. We measured the localization ability of seven bilateral CI users listening with their own devices in anechoic space and in a simulated reverberant room. To determine factors affecting performance in reverberant space we measured the sensitivity to interaural time differences (ITDs), interaural level differences (ILDs), and forward masking in the same participants using direct computer control of the electric stimulation in their CIs. Localization performance, quantified by the coefficient of determination r(2) and the root mean squared error, was significantly worse in the reverberant room than in anechoic conditions. Localization performance in the anechoic room, expressed as r(2), was best predicted by subject's sensitivity to ILDs. However, the decrease in localization performance caused by reverberation was better predicted by the sensitivity to envelope ITDs measured on single electrode pairs, with a correlation coefficient of 0.92. The CI users who were highly sensitive to envelope ITDs also better maintained their localization ability in reverberant space. Results in the forward masking task added only marginally to the predictions of localization performance in both environments. The results indicate that envelope ITDs provided by CI processors support localization in reverberant space. Thus, methods that improve perceptual access to envelope ITDs could help improve localization with bilateral CIs in everyday listening situations.	t	\N
23440859	Children with reading disability and normal reading development were compared in their ability to discriminate native (English) and novel language (Mandarin) from nonlinguistic sounds. Children's preference for native versus novel language sounds and for disyllables containing dominant trochaic versus non-dominant iambic stress patterns was also assessed. Participants included second and third grade monolingual native English speakers with reading disability (N = 18) and normal reading development (N = 18). Children selected from pairs of novel, native, and nonlinguistic sounds that was more like language. Both groups discriminated disyllabic linguistic sounds (native and novel) from nonlinguistic sounds. Both groups showed preference for the dominant English trochaic stress pattern over the non-dominant iambic stress pattern. Implications for development of prosodic sensitivity in relation to reading skills and future research are discussed.	t	\N
23442569	The relative impact of early intervention approach on speech perception and language skills was examined in these 3 well-matched groups of children using cochlear implants. Eight children from an auditory verbal intervention program were identified. From a pediatric database, researchers blind to the outcome data, identified 23 children from auditory oral programs and 8 children from bilingual-bicultural programs with the same inclusion criteria and equivalent demographic factors. All child participants were male, had congenital profound hearing loss (pure tone average >80 dBHL), no additional disabilities, were within the normal IQ range, were monolingual English speakers, had no unusual findings on computed tomography/magnetic resonance imaging, and received hearing aids and cochlear implants at a similar age and before 4 years of age. Open-set speech perception (consonant-nucleus-consonant [CNC] words and Bamford-Kowal-Bench [BKB] sentences) and the Peabody Picture Vocabulary Test (PPVT) were administered. The mean age at cochlear implant was 1.7 years (range, 0.8-3.9; SD, 0.7), mean test age was 5.4 years (range, 2.5-10.1; SD, 1.7), and mean device experience was 3.7 years (range, 0.7-7.9; SD, 1.8). Results indicate mean CNC scores of 60%, 43%, and 24% and BKB scores of 77%, 77%, and 56% for the auditory-verbal (AV), aural-oral (AO), and bilingual-bicultural (BB) groups, respectively. The mean PPVT delay was 13, 19, and 26 months for AV, AO, and BB groups, respectively. Despite equivalent child demographic characteristics at the outset of this study, by 3 years postimplant, there were significant differences in AV, AO, and BB groups. Results support consistent emphasis on oral/aural input to achieve optimum spoken communication outcomes for children using cochlear implants.	t	\N
23445327	Metabolic syndrome is a risk factor for age-related hearing impairment (ARHI). There are metabolic differences between abdominal adipose tissue present in subcutaneous and visceral areas. In this study, we investigated the association between abdominal fat composition, measured by computerized tomography (CT), and hearing thresholds. We recruited 662 adults aged 40-82 years with normal or symmetrical sensorineural hearing loss who underwent fat measurement by CT. Linear regression models were used to address the association between risk factors, including abdominal fat composition, and average hearing levels at low and high frequencies. After adjusting for age, systemic disease and other variables, a positive association between visceral adipose tissue (VAT) area and average hearing threshold was observed in women. In men, there was no significant association between abdominal fat composition and hearing threshold. Our findings show an association between VAT and hearing impairment in women. A reduction in visceral adiposity may help to prevent hearing loss in women.	t	\N
23446225	The aim of this study was to assess the benefit of having preserved acoustic hearing in the implanted ear for speech recognition in complex listening environments. The present study included a within-subjects, repeated-measures design including 21 English-speaking and 17 Polish-speaking cochlear implant (CI) recipients with preserved acoustic hearing in the implanted ear. The patients were implanted with electrodes that varied in insertion depth from 10 to 31 mm. Mean preoperative low-frequency thresholds (average of 125, 250, and 500 Hz) in the implanted ear were 39.3 and 23.4 dB HL for the English- and Polish-speaking participants, respectively. In one condition, speech perception was assessed in an eight-loudspeaker environment in which the speech signals were presented from one loudspeaker and restaurant noise was presented from all loudspeakers. In another condition, the signals were presented in a simulation of a reverberant environment with a reverberation time of 0.6 sec. The response measures included speech reception thresholds (SRTs) and percent correct sentence understanding for two test conditions: CI plus low-frequency hearing in the contralateral ear (bimodal condition) and CI plus low-frequency hearing in both ears (best-aided condition). A subset of six English-speaking listeners were also assessed on measures of interaural time difference thresholds for a 250-Hz signal. Small, but significant, improvements in performance (1.7-2.1 dB and 6-10 percentage points) were found for the best-aided condition versus the bimodal condition. Postoperative thresholds in the implanted ear were correlated with the degree of electric and acoustic stimulation (EAS) benefit for speech recognition in diffuse noise. There was no reliable relationship among measures of audiometric threshold in the implanted ear nor elevation in threshold after surgery and improvement in speech understanding in reverberation. There was a significant correlation between interaural time difference threshold at 250 Hz and EAS-related benefit for the adaptive speech reception threshold. The findings of this study suggest that (1) preserved low-frequency hearing improves speech understanding for CI recipients, (2) testing in complex listening environments, in which binaural timing cues differ for signal and noise, may best demonstrate the value of having two ears with low-frequency acoustic hearing, and (3) preservation of binaural timing cues, although poorer than observed for individuals with normal hearing, is possible after unilateral cochlear implantation with hearing preservation and is associated with EAS benefit. The results of this study demonstrate significant communicative benefit for hearing preservation in the implanted ear and provide support for the expansion of CI criteria to include individuals with low-frequency thresholds in even the normal to near-normal range.	t	\N
23446226	Understanding speech in acoustically degraded environments can place significant cognitive demands on school-age children who are developing the cognitive and linguistic skills needed to support this process. Previous studies suggest the speech understanding, word learning, and academic performance can be negatively impacted by background noise, but the effect of limited audibility on cognitive processes in children has not been directly studied. The aim of the present study was to evaluate the impact of limited audibility on speech understanding and working memory tasks in school-age children with normal hearing. Seventeen children with normal hearing between 6 and 12 years of age participated in the present study. Repetition of nonword consonant-vowel-consonant stimuli was measured under conditions with combinations of two different signal to noise ratios (SNRs; 3 and 9 dB) and two low-pass filter settings (3.2 and 5.6 kHz). Verbal processing time was calculated based on the time from the onset of the stimulus to the onset of the child's response. Monosyllabic word repetition and recall were also measured in conditions with a full bandwidth and 5.6 kHz low-pass cutoff. Nonword repetition scores decreased as audibility decreased. Verbal processing time increased as audibility decreased, consistent with predictions based on increased listening effort. Although monosyllabic word repetition did not vary between the full bandwidth and 5.6 kHz low-pass filter condition, recall was significantly poorer in the condition with limited bandwidth (low pass at 5.6 kHz). Age and expressive language scores predicted performance on word recall tasks, but did not predict nonword repetition accuracy or verbal processing time. Decreased audibility was associated with reduced accuracy for nonword repetition and increased verbal processing time in children with normal hearing. Deficits in free recall were observed even under conditions where word repetition was not affected. The negative effects of reduced audibility may occur even under conditions where speech repetition is not impacted. Limited stimulus audibility may result in greater cognitive effort for verbal rehearsal in working memory and may limit the availability of cognitive resources to allocate to working memory and other processes.	t	\N
23464029	Aided consonant and vowel identification was measured in 13 listeners with high-frequency sloping hearing losses. To investigate the influence of compression-channel analysis bandwidth on identification performance independent of the number of channels, performance was compared for three 17-channel compression systems that differed only in terms of their channel bandwidths. One compressor had narrow channels, one had widely overlapping channels, and the third had level-dependent channels. Measurements were done in quiet, in speech-shaped noise, and in a three-talker background. The results showed no effect of channel bandwidth, neither on consonant nor on vowel identification scores. This suggests that channel bandwidth per se has little influence on speech intelligibility when individually prescribed, frequency-varying compressive gain is provided.	t	\N
23466188	It is well established that individuals with schizophrenia demonstrate alterations in auditory perception beginning at the very earliest stages of information processing. However, it is not clear how these impairments in basic information processing translate into high-order cognitive deficits. Auditory scene analysis allows listeners to group auditory information into meaningful objects, and as such provides an important link between low-level auditory processing and higher cognitive abilities. In the present study we investigated whether low-level impairments in the processing of binaural temporal information impact upon auditory scene analysis ability. Binaural temporal processing ability was investigated in 19 individuals with schizophrenia and 19 matched controls. Individuals with schizophrenia showed impaired binaural temporal processing ability on an inter-aural time difference (ITD) discrimination task. In addition, patients demonstrated impairment in two measures of auditory scene analysis. Specifically, patients had reduced ability to use binaural temporal cues to extract signal from noise in a masking level difference paradigm, and to separate the location of a source sound in the presence of an echo in the precedence effect paradigm. These findings demonstrate that individuals with schizophrenia have impairments in the accuracy with which simple binaural temporal information is encoded in the auditory system, and furthermore, this impairment has functional consequences in terms of the use of these cues to extract information in complex auditory environments.	t	\N
23467261	We tested whether congenital amusics, who exhibit pitch perception deficits, nevertheless adjust the pitch of their voice in response to a sudden pitch shift applied to vocal feedback. Nine amusics and matched controls imitated their own previously-recorded speech or singing, while the online feedback they received was shifted mid-utterance by 25 or 200 cents. While a few amusics failed to show pitch-shift effects, a majority showed a pitch-shift response and nearly half showed a normal response to both large and small shifts, with similar magnitudes and response times as controls. The size and presence of the shift response to small shifts were significantly predicted by participants' vocal pitch matching accuracy, rather than their ability to perceive small pitch changes. The observed dissociation between the ability to consciously perceive small pitch changes and to produce and monitor vocal pitch provides evidence for a dual-route model of pitch processing in the brain.	t	\N
23472959	The more a novel word conforms to the phonotactics of the language, the more wordlike it is and the easier it is to learn. It is unknown to what extent children with hearing loss (CHL) take advantage of phonotactic cues to support word learning. This study investigated whether CHL had similar sensitivities to wordlikeness during a word-learning task as children with normal hearing (CNH). Sixteen CHL and 24 CNH participated in a novel word-learning task. Novel words varied by English wordlikeness. Recall was tested using a forced-choice identification task wherein foils for each trial related semantically, lexically or not at all. Receptive vocabulary and working memory were also assessed. All children were able to identify high wordlike novel words more accurately than low wordlike novel words. The number of errors on identification of words that were moderate in wordlikeness was inversely correlated to vocabulary size (not working memory) and CHL had smaller vocabularies than CNH. When in error, CHL were more likely than CNH to select a semantically related foil. Although they are sensitive to extremes in wordlikeness, compared with their peers with normal hearing, CHL present with subtle differences in word learning. Clinical implications for exploiting wordlikeness in service of word learning assessment and intervention are presented.	t	\N
23473799	The Attention Network Test (ANT) assesses the networks of attention (alerting, orienting, and executive control). The utility of measuring attention network performances with the repeated administration of the ANT in clinical populations has not yet been explored. In the present study, we administered a variant of the ANT (ANT-I) to 11 multiple sclerosis (MS) patients and 11 controls over six monthly sessions to examine the stability, isolability, robustness, and reliability of the ANT-I. Participants responded through button presses to indicate the direction of a target arrow presented either above or below a fixation cross. The target was accompanied by distracting arrows, pointing either in the same or opposite direction of the target arrow. Target arrows were preceded in some trials by alerting auditory signals and/or by uninformative visual cues. Network scores were calculated using orthogonal subtractions of performance under selected conditions. The MS patients were slower than the controls, though group differences in network performance were rare. Even after five test sessions, the network scores remained stable, despite some practice effects that were the largest under conditions that tested the executive network. The reliabilities of the orienting and executive network effects were good in both groups and increased across sessions, especially with the MS patients. The alerting network was the least reliable, especially for MS patients, but it also became more reliable across sessions. The results suggest that the ANT-I can be used to measure the integrity of attention networks for MS patients in applications that require repeated testing.	t	\N
23479475	The human visual system possesses the remarkable ability to pick out salient objects in images. Even more impressive is its ability to do the very same in the presence of disturbances. In particular, the ability persists despite the presence of noise, poor weather, and other impediments to perfect vision. Meanwhile, noise can significantly degrade the accuracy of automated computational saliency detection algorithms. In this article, we set out to remedy this shortcoming. Existing computational saliency models generally assume that the given image is clean, and a fundamental and explicit treatment of saliency in noisy images is missing from the literature. Here we propose a novel and statistically sound method for estimating saliency based on a nonparametric regression framework and investigate the stability of saliency models for noisy images and analyze how state-of-the-art computational models respond to noisy visual stimuli. The proposed model of saliency at a pixel of interest is a data-dependent weighted average of dissimilarities between a center patch around that pixel and other patches. To further enhance the degree of accuracy in predicting the human fixations and of stability to noise, we incorporate a global and multiscale approach by extending the local analysis window to the entire input image, even further to multiple scaled copies of the image. Our method consistently outperforms six other state-of-the-art models (Bruce & Tsotsos, 2009; Garcia-Diaz, Fdez-Vidal, Pardo, & Dosil, 2012; Goferman, Zelnik-Manor, & Tal, 2010; Hou & Zhang, 2007; Seo & Milanfar, 2009; Zhang, Tong, & Marks, 2008) for both noise-free and noisy cases.	t	\N
23479544	The NIH Toolbox project has assembled measurement tools to assess a wide range of human perception and ability across the lifespan. As part of this initiative, a small but comprehensive battery of auditory tests has been assembled. The main tool of this battery, pure-tone thresholds, measures the ability of people to hear at specific frequencies. Pure-tone thresholds have long been considered the "gold standard" of auditory testing, and are normally obtained in a clinical setting by highly trained audiologists. For the purposes of the Toolbox project, an automated procedure (NIH Toolbox Threshold Hearing Test) was developed that allows nonspecialists to administer the test reliably. Three supplemental auditory tests are also included in the Toolbox auditory test battery: assessment of middle-ear function (tympanometry), speech perception in noise (the NIH Toolbox Words-in-Noise Test), and self-assessment of hearing impairment (the NIH Toolbox Hearing Handicap Inventory Ages 18-64 and the NIH Toolbox Hearing Handicap Inventory Ages 64+). Tympanometry can help differentiate conductive from sensorineural pathology. The NIH Toolbox Words-in-Noise Test measures a listener's ability to perceive words in noisy situations. This ability is not necessarily predicted by a person's pure-tone thresholds; some people with normal hearing have difficulty extracting meaning from speech sounds heard in a noisy context. The NIH Toolbox Hearing Handicap Inventory focuses on how a person's perceived hearing status affects daily life. The test was constructed to include emotional and social/situational subscales, with specific questions about how hearing impairment may affect one's emotional state or limit participation in specific activities. The 4 auditory tests included in the Toolbox auditory test battery cover a range of auditory abilities and provide a snapshot of a participant's auditory capacity.	t	\N
23483527	We used the visual world paradigm to examine interlingual lexical competition when Dutch-English bilinguals listened to low-constraining sentences in their nonnative (L2; Experiment 1) and native (L1; Experiment 2) languages. Additionally, we investigated the influence of the degree of cross-lingual phonological similarity. When listening in L2, participants fixated more on competitor pictures of which the onset of the name was phonologically related to the onset of the name of the target in the nontarget language (e.g., fles, "bottle", given target flower) than on phonologically unrelated distractor pictures. Even when they listened in L1, this effect was also observed when the onsets of the names of the target picture (in L1) and the competitor picture (in L2) were phonologically very similar. These findings provide evidence for interlingual competition during the comprehension of spoken sentences, both in L2 and in L1.	t	\N
23485447	With an incidence of 1:29 000 among Caucasians, Friedreich's ataxia (FRDA) is the most common inherited ataxia, leading to both sensory and motor degeneration. Despite many FRDA patients exhibiting normal or near normal sound detection thresholds, many individuals show abnormal neural conduction along their central auditory pathways. Electrophysiological testing can show abnormal or absent cochlear nerve and auditory brainstem recordings in the presence of normal pre-neural cochlear function (otoacoustic emissions or cochlear microphonics). This pattern of normal pre-neural cochlear function and disrupted neural conduction has been termed auditory neuropathy spectrum disorder (ANSD). Studies of FRDA patients with ANSD have shown that they exhibit severe deficits in temporal processing, impaired frequency discrimination, and deficits in speech perception. Rehabilitation of these auditory percept deficits remains difficult, as hearing aids may amplify sounds without adding clarity to the temporally disrupted or distorted signal that FRDA patients with ANSD may receive. There is limited data on the best intervention for patients with FRDA with ANSD, although personal radio aids (FM systems) have been shown to be beneficial. We report a case, where cochlear implantation has led to a dramatic improvement in speech perception in an individual with FRDA and ANSD. The majority of the literature on ANSD treatment has focused on paediatric patients with the 'dyssynchrony' type of ANSD, rather than the true neuropathy type underlying the hearing loss in FRDA patients.	t	\N
23497238	Auditory feedback is important for accurate control of voice fundamental frequency (F(0)). The purpose of this study was to address whether task instructions could influence the compensatory responding and sensorimotor adaptation that has been previously found when participants are presented with a series of frequency-altered feedback (FAF) trials. Trained singers and musically untrained participants (nonsingers) were informed that their auditory feedback would be manipulated in pitch while they sang the target vowel [/α /]. Participants were instructed to either 'compensate' for, or 'ignore' the changes in auditory feedback. Whole utterance auditory feedback manipulations were either gradually presented ('ramp') in -2 cent increments down to -100 cents (1 semitone) or were suddenly ('constant') shifted down by 1 semitone. Results indicated that singers and nonsingers could not suppress their compensatory responses to FAF, nor could they reduce the sensorimotor adaptation observed during both the ramp and constant FAF trials. Compared to previous research, these data suggest that musical training is effective in suppressing compensatory responses only when FAF occurs after vocal onset (500-2500 ms). Moreover, our data suggest that compensation and adaptation are automatic and are influenced little by conscious control.	t	\N
23506084	Although adults rapidly adjust to accented speakers' pronunciation of words, young children appear to struggle when confronted with unfamiliar variants of their native language (e.g., American English-learning 15-month-olds cannot recognize familiar words spoken in Jamaican English; Best et al., 2009). It is currently unclear, however, why this is the case, or how infants overcome this apparent inability. Here, we begin to address these crucial questions. Experiments 1 and 2 confirm with a new population that infants are initially unable to recognize familiar words produced in unfamiliar accents. That is, Canadian English-learning infants cannot recognize familiar words spoken in Australian English until they near their second birthday. However, Experiments 3 and 4 show that this early inability to recognize accented words can readily be overcome when infants are exposed to a story read in the unfamiliar accent prior to test. Importantly, this adaptation only occurs when the story is highly familiar, consistent with the idea that top-down lexical feedback may enable the adaptation process. We conclude that infants, like adults, have the cognitive capacity to rapidly deduce the mapping between their own and an unfamiliar variant of their native language. Thus, the essential machinery underlying spoken language communication is in place much earlier than previous studies have suggested.	t	\N
23507048	Event related potentials (ERP) associated with early sensory information processing have been proposed as possible vulnerability markers for psychosis. Compared to other ERPs reported in schizophrenia research, like Mismatch Negativity (MMN), little is known about P3a, an ERP related to novelty detection. The aim of this study was to analyze the MMN-P3a complex in 20 antipsychotic naïve first-episode psychosis patients (FEP), 23 antipsychotic naïve individuals at clinical high-risk for psychosis (CHR) and 24 healthy controls. The MMN-P3a amplitudes and latencies were obtained during a passive auditory mismatch frequency deviant ERP paradigm and analyzed in frontal and central scalp regions. There were no significant differences in MMN amplitude between groups. There was a significant group difference in P3a due to reduced amplitude (F[2,64] = 3.7, p = 0.03) in both CHR and FEP groups (Mean difference (MD) = 0.39, p = 0.04 and MD = 0.49, p = 0.02, respectively) compared to the control group and this effect was most prominent on the right side (Group × laterality effect: MD = 0.57, p < 0.01 and MD = 0.58, p < 0.01, respectively). No significant differences were observed for MMN or P3a latencies between groups. Although a P3a decrement in chronic schizophrenia and FEP has been previously reported, our results suggest that this novelty detection impairment is present even in pre-psychosis stages in antipsychotic naïve subjects. This study supports the evidence that P3a could represent a neurophysiological vulnerability marker for the development of psychosis.	t	\N
23510647	Rehabilitation success in cochlear implant patients is influenced by many factors. Influence of different etiologies of deafness on rehabilitation outcome is assessed. Retrospective survey of patients. University hospital. Secondary referral center. One hundred and sixty-three between 1996 and 2008 implanted children (0-18 years, mean 5.17 years). Evaluation of patients' data: origin of deafness, hearing and speech test results. Access Data Base; Wilcoxon and t-test. Mean follow-up: 65.4 months. Etiology of deafness in children found in 104 cases: 69 (51.9%) suffered from hereditary hearing loss. All children showed improvement in their auditory performance, children with connexin-26 mutation performed significantly best, Usher and CHARGE-syndrome children significantly worst. Post-meningitic and post-septic children developed slower but reached same levels later. Primary cause of deafness, among other factors, has a considerable impact on outcome of rehabilitation. This offers possibilities to influence the outcome by etiology-adjusted therapy modules.	t	\N
23517653	Numerous cortical disorders affect language. We explore the connection between the observed language behavior and the underlying substrates by adopting a neurocomputational approach. To represent the observed trajectories of the discourse in patients with disorganized speech and in healthy participants, we design a graphical representation for the discourse as a trajectory that allows us to visualize and measure the degree of order in the discourse as a function of the disorder of the trajectories. Our work assumes that many of the properties of language production and comprehension can be understood in terms of the dynamics of modular networks of neural associative memories. Based upon this assumption, we connect three theoretical and empirical domains: (1) neural models of language processing and production, (2) statistical methods used in the construction of functional brain images, and (3) corpus linguistic tools, such as Latent Semantic Analysis (henceforth LSA), that are used to discover the topic organization of language. We show how the neurocomputational models intertwine with LSA and the mathematical basis of functional neuroimaging. Within this framework we describe the properties of a context-dependent neural model, based on matrix associative memories, that performs goal-oriented linguistic behavior. We link these matrix associative memory models with the mathematics that underlie functional neuroimaging techniques and present the "functional brain images" emerging from the model. This provides us with a completely "transparent box" with which to analyze the implication of some statistical images. Finally, we use these models to explore the possibility that functional synaptic disconnection can lead to an increase in connectivity between the representations of concepts that could explain some of the alterations in discourse displayed by patients with schizophrenia.	t	\N
23522843	Aging usually affects the ability to focus attention on a given task and to ignore distractors. However, aging is also associated with increased between-subject variability, and it is unclear in which features of processing older high-performing and low-performing human beings may differ in goal-directed behavior. To study involuntary shifts in attention to task-irrelevant deviant stimuli and subsequent reorientation, we used an auditory distraction task and analyzed event-related potential measures (mismatch negativity), P3a and reorienting negativity) of 35 younger, 32 older high-performing, and 32 older low-performing participants. Although both high and low performing elderly individuals showed a delayed reorienting to the primary stimulus feature, relative to young participants, poor performance of the elderly participants in processing of deviant stimuli was associated with strong involuntary attention capture by task-irrelevant features. In contrast, high performance of the elderly group was associated with intensified attentional shifting toward the target features. Thus, it appears that performance deficits in aging are due to higher distractibility in combination with deficits in the orienting-reorienting mechanisms.	t	\N
23526255	In this study, we present the normative values of the adaptation of the International Affective Digitized Sounds (IADS-2; Bradley & Lang, 2007a) for European Portuguese (EP). The IADS-2 is a standardized database of 167 naturally occurring sounds that is widely used in the study of emotions. The sounds were rated by 300 college students who were native speakers of EP, in the three affective dimensions of valence, arousal, and dominance, by using the Self-Assessment Manikin (SAM). The aims of this adaptation were threefold: (1) to provide researchers with standardized and normatively rated affective sounds to be used with an EP population; (2) to investigate sex and cultural differences in the ratings of affective dimensions of auditory stimuli between EP and the American (Bradley & Lang, 2007a) and Spanish (Fernández-Abascal et al., Psicothema 20:104-113 2008; Redondo, Fraga, Padrón, & Piñeiro, Behavior Research Methods 40:784-790 2008) standardizations; and (3) to promote research on auditory affective processing in Portugal. Our results indicated that the IADS-2 is a valid and useful database of digitized sounds for the study of emotions in a Portuguese context, allowing for comparisons of its results with those of other international studies that have used the same database for stimulus selection. The normative values of the EP adaptation of the IADS-2 database can be downloaded along with the online version of this article.	t	\N
23528307	KCNA10 is a voltage gated potassium channel that is expressed in the inner ear. The localization and function of KCNA10 was studied in a mutant mouse, B6-Kcna10(TM45), in which the single protein coding exon of Kcna10 was replaced with a beta-galactosidase reporter cassette. Under the regulatory control of the endogenous Kcna10 promoter and enhancers, beta-galactosidase was expressed in hair cells of the vestibular organs and the organ of Corti. KCNA10 expression develops in opposite tonotopic gradients in the inner and outer hair cells. Kcna10(TM45) homozygotes display only a mild elevation in pure tone hearing thresholds as measured by auditory brainstem response (ABR), while heterozygotes are normal. However, Kcna10(TM45) homozygotes have absent vestibular evoked potentials (VsEPs) or elevated VsEP thresholds with prolonged peak latencies, indicating significant vestibular dysfunction despite the lack of any overt imbalance behaviors. Our results suggest that Kcna10 is expressed primarily in hair cells of the inner ear, with little evidence of expression in other organs. The Kcna10(TM45) targeted allele may be a model of human nonsyndromic vestibulopathy.	t	\N
23534195	The author investigated the longitudinal relations between theory of mind (ToM) understanding and perceptions of self and social conversations in 17 school-aged children (12 girls, 5 boys, age 8-12 years). ToM was assessed at Time 1 (T1; M age = 8 years 5 months, SD = 8.7 months, and perceptions of self and conversational experiences assessed two years later at Time 2 (T2; M age = 10 years 4 months, SD = 7.9 months. Most importantly, longitudinal findings showed that children who scored relatively high on ToM at T1 reported relatively lower perceptions of self-worth and higher number of mental states verbs in their perceptions of peer and family conversations at T2. Significant negative longitudinal associations were found between children's number of siblings and their perceptions of self-worth (T1) and number of cognitive terms in their perceptions of peer and family conversations (T2). Frequency analysis suggested that girls' perceptions of conversations referred to more social and psychological aspects of self and relationships, whereas boys focused mainly on physical activities. Most children were more likely to prefer listening to talking during social conversations. The majority of children reported feelings of mixed or ambiguous emotions during experiences of silence. Implications for socioemotional and cognitive development in early adolescents are discussed.	t	\N
23538131	A total of 64 loci for autosomal dominant non-syndromic hearing impairment have been described, and the causative genes have been identified for 24 of these. The present study reports on the clinical characteristics of an autosomal dominantly inherited hearing impairment that is linked to a region within the DFNA60 locus located on chromosome 2 in q22.1-24.1. A pedigree spanning four generations was established with 13 affected individuals. Linkage analysis demonstrated that the locus extended over a 2.96 Mb region flanked by markers D2S2335 and D2S2275. The audiograms mainly showed a distinctive U-shaped configuration. Deterioration of hearing started at a wide age range, from 12 to 40 years. Cross-sectional analysis showed rapid progression of hearing impairment from mild to severe, between the ages of 40 and 60 years, a phenomenon that is also observed in DFNA9 patients. The results of the individual longitudinal analyses were generally in line with those obtained by the cross-sectional analysis. Speech recognition scores related to the level of hearing impairment (PTA1,2,4 kHz) appeared to be fairly similar to those of presbyacusis patients. It is speculated that hearing impairment starting in mid-life, as shown by DFNA60 patients, could play a role in the development of presbyacusis. Furthermore, speech recognition did not deteriorate appreciably before the sixth decade of life. We conclude that DFNA60 should be considered in hearing impaired patients who undergo a rapid progression in middle age and are negative for DFNA9. Furthermore, cochlear implantation resulted in good rehabilitation in two DFNA60 patients.	t	\N
23538912	Experiences of adversity in the early years of life alter the developing brain. However, evidence documenting this relationship often focuses on severe stressors and relies on peripheral measures of neurobiological functioning during infancy. In the present study, we employed functional MRI during natural sleep to examine associations between a more moderate environmental stressor (nonphysical interparental conflict) and 6- to 12-month-old infants' neural processing of emotional tone of voice. The primary question was whether interparental conflict experienced by infants is associated with neural responses to emotional tone of voice, particularly very angry speech. Results indicated that maternal report of higher interparental conflict was associated with infants' greater neural responses to very angry relative to neutral speech across several brain regions implicated in emotion and stress reactivity and regulation (including rostral anterior cingulate cortex, caudate, thalamus, and hypothalamus). These findings suggest that even moderate environmental stress may be associated with brain functioning during infancy.	t	\N
23551061	The N1 and P2 event-related potentials (ERPs) are attenuated when the eliciting sounds coincide with our own actions. Although this ERP attenuation could be caused by central processes, it may also reflect a peripheral mechanism: the coactivation of the stapedius muscle with the task-relevant effector, which reduces signal transmission efficiency in the middle ear, reducing the effective intensity of concurrently presented tones, which, in turn, elicit lower amplitude auditory ERPs. Because stapedius muscle contraction attenuates frequencies below 2 kHz, no attenuation should occur at frequencies above 2 kHz. A self-induced tone paradigm was administered with 0.5, 2.0, and 8.0 kHz pure tones. Self-induced tones elicited attenuated N1 and P2 ERPs, but the magnitude of attenuation was not affected by tone frequency. This result does not support the hypothesis that ERP attenuation to self-induced tones are caused by stapedius muscle contractions.	t	\N
23553325	Sound pressure level delivered through personal listening devices (PLDs) and reaching the ear drum might be affected by body size and jaw movements. This study aimed to investigate whether jaw movement and/or smaller body mass index (BMI) resulted in decrease of sound pressure level within the ear canals of PLD users via an earbud earphone. Case series. Forty-five normal-hearing subjects (16 males; mean age, 23.3 years) participated in this study. A probe-microphone system was used to measure sound pressure level in the external ear canal with music delivered from a media player via an earbud earphone. Test materials consisted of two 20-second excerpts from a heavy metal music piece. Subjects were instructed to adjust the volume of the media player to conform to three conditions for sound pressure measurement: comfortable, loud, and maximum. Measurements were then repeated while subjects mimicked chewing action under the same listening conditions. Sound pressure levels were significantly lower when measured with jaw movement than without jaw movement (P < .05). Sound pressure levels monitored with/without jaw movement were generally lower in subjects with a BMI<23 than those with a BMI ≥ 23 (P < .05). Jaw movement and low BMI (<23) reduced the overall sound level of PLDs at the ear canal. Sound pressure levels detected in the external ear canal of our subjects using earbud earphones were significantly lower under conditions of jaw movement/BMI <23. Our research invites further studies on a larger group of PLD users to correlate these variables with hearing threshold shifts over time.	t	\N
23556595	Although bilateral cochlear implantation has the potential to improve sound localization and speech understanding in noise, obstacles exist in presenting maximally useful binaural information to bilateral cochlear-implant (CI) users. One obstacle is that electrode arrays may differ in cochlear position by several millimeters, thereby stimulating different neural populations. Effects of interaural frequency mismatch on binaural processing were studied in normal-hearing (NH) listeners using band-limited pulse trains, thereby avoiding confounding factors that may occur in CI users. In experiment 1, binaural image fusion was measured to capture perceptual number, location, and compactness. Subjects heard a single, compact image on 73% of the trials. In experiment 2, intracranial image location was measured for different interaural time differences (ITDs) and interaural level differences (ILDs). For larger mismatch, locations perceptually shifted towards the ear with the higher carrier frequency. In experiment 3, ITD and ILD just-noticeable differences (JNDs) were measured. JNDs increased with decreasing bandwidth and increasing mismatch, but were always measurable up to 3 mm of mismatch. If binaural-hearing mechanisms are similar between NH and CI subjects, these results may explain reduced sensitivity of ITDs and ILDs in CI users. Large mismatches may lead to distorted spatial maps and reduced binaural image fusion.	t	\N
23556693	Talking silently to ourselves occupies much of our mental lives, yet the mechanisms underlying this experience remain unclear. The following experiments provide behavioral evidence that the auditory content of inner speech is provided by corollary discharge. Corollary discharge is the motor system's prediction of the sensory consequences of its actions. This prediction can bias perception of other sensations, pushing percepts to match with prediction. The two experiments below show this bias induced by inner speech, demonstrating that inner speech causes external sounds to be heard as similar to the imagined speech, and that this bias operates on subphonemic content.	t	\N
23565267	Psychophysiological evidence suggests that music and language are intimately coupled such that experience/training in one domain can influence processing required in the other domain. While the influence of music on language processing is now well-documented, evidence of language-to-music effects have yet to be firmly established. Here, using a cross-sectional design, we compared the performance of musicians to that of tone-language (Cantonese) speakers on tasks of auditory pitch acuity, music perception, and general cognitive ability (e.g., fluid intelligence, working memory). While musicians demonstrated superior performance on all auditory measures, comparable perceptual enhancements were observed for Cantonese participants, relative to English-speaking nonmusicians. These results provide evidence that tone-language background is associated with higher auditory perceptual performance for music listening. Musicians and Cantonese speakers also showed superior working memory capacity relative to nonmusician controls, suggesting that in addition to basic perceptual enhancements, tone-language background and music training might also be associated with enhanced general cognitive abilities. Our findings support the notion that tone language speakers and musically trained individuals have higher performance than English-speaking listeners for the perceptual-cognitive processing necessary for basic auditory as well as complex music perception. These results illustrate bidirectional influences between the domains of music and language.	t	\N
23571299	A variety of noises have been employed for decades in an effort to facilitate habituation, mask, or suppress tinnitus. Many of these sounds have reportedly provided benefit, but success has not been universal. More recently, musical stimuli have been added as a sound therapy component. The potential advantages of using such stimuli, in particular fractal tones, in combination with amplification are discussed in this paper.	t	\N
23571301	The past decade has seen an escalating enthusiasm to comprehend chronic tinnitus from the perspective of both scientific understanding and clinical management. At the same time, there is a significant interest and commercial investment in providing targeted and individualized approaches to care, which incorporate novel sound-based technologies, with standard audiological and psychological strategies. Commercially produced sound-based devices for the tinnitus market include Co-ordinated Reset Neuromodulation ® , Neuromonics © , Serenade ® , and Widex ® Zen. Additionally, experimental interventions such as those based on frequency-discrimination training are of current interest. Many of these interventions overtly claim to target the underlying neurological causes of tinnitus. Here, we briefly summarize current perspectives on the pathophysiology of tinnitus and evaluate claims made by the device supporters from a critical point of view. We provide an opinion on how future research in the field of individualized sound-based interventions might best provide a reliable evidence-base in this growing area of translational medicine.	t	\N
23577828	Mismatch negativity (MMN) overlaps with other auditory event-related potential (ERP) components. We examined the ERPs of 50 9- to 11-year-old children for vowels /i/, /y/ and equivalent complex tones. The goal was to separate MMN from obligatory ERP components using principal component analysis and equal probability control condition. In addition to the contrast of the deviant minus standard response, we employed the contrast of the deviant minus control response, to see whether the obligatory processing contributes to MMN in children. When looking for differences in speech deviant minus standard contrast, MMN starts around 112 ms. However, when both contrasts are examined, MMN emerges for speech at 160 ms whereas for nonspeech MMN is observed at 112 ms regardless of contrast. We argue that this discriminative response to speech stimuli at 112 ms is obligatory in nature rather than reflecting change detection processing.	t	\N
23578016	Music is a cultural universal and a rich part of the human experience. However, little is known about common brain systems that support the processing and integration of extended, naturalistic 'real-world' music stimuli. We examined this question by presenting extended excerpts of symphonic music, and two pseudomusical stimuli in which the temporal and spectral structure of the Natural Music condition were disrupted, to non-musician participants undergoing functional brain imaging and analysing synchronized spatiotemporal activity patterns between listeners. We found that music synchronizes brain responses across listeners in bilateral auditory midbrain and thalamus, primary auditory and auditory association cortex, right-lateralized structures in frontal and parietal cortex, and motor planning regions of the brain. These effects were greater for natural music compared to the pseudo-musical control conditions. Remarkably, inter-subject synchronization in the inferior colliculus and medial geniculate nucleus was also greater for the natural music condition, indicating that synchronization at these early stages of auditory processing is not simply driven by spectro-temporal features of the stimulus. Increased synchronization during music listening was also evident in a right-hemisphere fronto-parietal attention network and bilateral cortical regions involved in motor planning. While these brain structures have previously been implicated in various aspects of musical processing, our results are the first to show that these regions track structural elements of a musical stimulus over extended time periods lasting minutes. Our results show that a hierarchical distributed network is synchronized between individuals during the processing of extended musical sequences, and provide new insight into the temporal integration of complex and biologically salient auditory sequences.	t	\N
23580330	Schumacher et al. Psychological Science 12:101-108, (2001) demonstrated the elimination of most dual-task costs ("perfect time-sharing") after extensive dual-task practice of a visual and an auditory task in combination. For the present research, we used a transfer methodology to examine this practice effect in more detail, asking what task-processing stages were sped up by this dual-task practice. Such research will be essential to specify mechanisms associated with the practice-related elimination of dual-task costs. In three experiments, we introduced postpractice transfer probes focusing on the perception, central response-selection, and final motor-response stages. The results indicated that the major change achieved by dual-task practice was a speed-up in the central response-selection stages of both tasks. Additionally, perceptual-stage shortening of the auditory task was found to contribute to the improvements in time-sharing. For a better understanding of such time-sharing, we discuss the contributions of the present findings in relation to models of practiced dual-task performance.	t	\N
23585888	To localize the neural generators of the musically elicited mismatch negativity with high temporal resolution we conducted a beamformer analysis (Synthetic Aperture Magnetometry, SAM) on magnetoencephalography (MEG) data from a previous musical mismatch study. The stimuli consisted of a six-tone melodic sequence comprising broken chords in C- and G-major. The musical sequence was presented within an oddball paradigm in which the last tone was lowered occasionally (20%) by a minor third. The beamforming analysis revealed significant right hemispheric neural activation in the superior temporal (STC), inferior frontal (IFC), superior frontal (SFC) and orbitofrontal (OFC) cortices within a time window of 100-200 ms after the occurrence of a deviant tone. IFC and SFC activation was also observed in the left hemisphere. The pronounced early right inferior frontal activation of the auditory mismatch negativity has not been shown in MEG studies so far. The activation in STC and IFC is consistent with earlier electroencephalography (EEG), optical imaging and functional magnetic resonance imaging (fMRI) studies that reveal the auditory and inferior frontal cortices as main generators of the auditory MMN. The observed right hemispheric IFC is also in line with some previous music studies showing similar activation patterns after harmonic syntactic violations. The results demonstrate that a deviant tone within a musical sequence recruits immediately a distributed neural network in frontal and prefrontal areas suggesting that top-down processes are involved when expectation violation occurs within well-known stimuli.	t	\N
23586418	The neural dys-synchrony associated with auditory neuropathy spectrum disorder (ANSD) causes a temporal impairment that could degrade spatial hearing, particularly sound localization accuracy (SLA) and spatial release from masking (SRM). Unilateral cochlear implantation has become an accepted treatment for ANSD but treatment options for the contralateral ear remain controversial. We report spatial hearing measures in a child with ANSD before and after receiving a second cochlear implant (CI). An 11-year-7-month old boy with ANSD and expressive and receptive language delay received a second CI eight years after his first implant. The SLA and SRM were measured four months before sequential bilateral CIs (with the contralateral ear plugged and unplugged), and after nine months using both CIs. Testing done before the second CI, with the first CI alone, suggested that residual hearing in the contralateral ear contributed to sound localization accuracy, but not word recognition in quiet or noise. Nine-months after receiving a second CI, SLA improved by 12.76° and SRM increased to 3.8-4.2 dB relative to pre-operative performance. Results were compared to published outcomes for children with bilateral CIs. The addition of a second CI in this child with ANSD improved spatial hearing.	t	\N
23587808	Whether visual subliminal processing involves semantic processing is still being debated. To examine this, we combined a passive electroencephalogram (EEG) study with an application of transcranial direct current stimulation (tDCS). In the masked-face priming paradigm, we presented a subliminal prime preceding the target stimulus. Participants were asked to determine whether the target face was a famous face, indicated by a button press. The prime and target pair were either the same person's face (congruent) or different person's faces (incongruent), and were always both famous or both non-famous faces. Experiments were performed over 2 days: 1 day for a real tDCS session and another for a sham session as a control condition. In the sham session, a priming effect, reflected in the difference in amplitude of the late positive component (250-500 ms to target onset), was observed only in the famous prime condition. According to a previous study, this effect might indicate a subliminal semantic process [10]. Alternatively, a priming effect toward famous primes disappeared after tDCS stimulation. Our results suggested that a subliminal process might not be limited to processes in the occipital and temporal areas, but may proceed to the semantic level processed in prefrontal cortex.	t	\N
23616587	Congenital amusia is a lifelong disorder of music perception and production. The present study investigated the cerebral bases of impaired pitch perception and memory in congenital amusia using behavioural measures, magnetoencephalography and voxel-based morphometry. Congenital amusics and matched control subjects performed two melodic tasks (a melodic contour task and an easier transposition task); they had to indicate whether sequences of six tones (presented in pairs) were the same or different. Behavioural data indicated that in comparison with control participants, amusics' short-term memory was impaired for the melodic contour task, but not for the transposition task. The major finding was that pitch processing and short-term memory deficits can be traced down to amusics' early brain responses during encoding of the melodic information. Temporal and frontal generators of the N100m evoked by each note of the melody were abnormally recruited in the amusic brain. Dynamic causal modelling of the N100m further revealed decreased intrinsic connectivity in both auditory cortices, increased lateral connectivity between auditory cortices as well as a decreased right fronto-temporal backward connectivity in amusics relative to control subjects. Abnormal functioning of this fronto-temporal network was also shown during the retention interval and the retrieval of melodic information. In particular, induced gamma oscillations in right frontal areas were decreased in amusics during the retention interval. Using voxel-based morphometry, we confirmed morphological brain anomalies in terms of white and grey matter concentration in the right inferior frontal gyrus and the right superior temporal gyrus in the amusic brain. The convergence between functional and structural brain differences strengthens the hypothesis of abnormalities in the fronto-temporal pathway of the amusic brain. Our data provide first evidence of altered functioning of the auditory cortices during pitch perception and memory in congenital amusia. They further support the hypothesis that in neurodevelopmental disorders impacting high-level functions (here musical abilities), abnormalities in cerebral processing can be observed in early brain responses.	t	\N
23621479	In some cochlear implant users, success is not achieved in spite of optimal clinical factors (including age at implantation, duration of rehabilitation and post-implant hearing level), which may be attributed to disorders at higher levels of the auditory pathway. We used cortical auditory evoked potentials to investigate the ability to perceive and discriminate auditory stimuli in 10 unsuccessful implant users aged 8-10 years (CI) and 10 healthy age-matched controls with normal hearing (NH). Pure tones (1 and 2 kHz) and double consonant-vowel syllables were applied. The stimuli were presented in an oddball paradigm that required the subjects to react consciously. The latencies and amplitudes of the P1, N1, P2, N2 and P3 waves were analyzed, in addition to reaction times and number of responses. Significant differences in the average response times and number of responses were observed between the CI and NH groups. The latencies also indicate that the CI group took longer to perceive and discriminate between tonal and speech auditory stimuli than the NH group.	t	\N
23625644	In this study we provide evidence that unconscious priming can be obtained as a result of the processing of the salient region (SR) of illusory figures and without that of illusory contours (ICs). We used a metacontrast masking paradigm where illusory figures were masked by real figures. In Experiment 1 we found a clear priming effect when participants were asked to discriminate between square and diamond masks preceded by congruent or incongruent illusory square or diamond primes. It is likely that metacontrast impairs the processing of ICs but not of the SR; therefore the above result strongly suggests that the priming effect was specifically related to the processing of the SR. In Experiment 2 participants were tested in the same task as in Experiment 1 with additional primes in which the inducers were presented in the same locations but their shapes were changed so as to modify the global configuration. We termed these primes High, Low, and No Salient Region (HSR, LSR, and NSR, respectively). The HSR condition replicated Experiment 1, whereas in the LSR and NSR conditions the priming effect got progressively smaller. The results of Experiment 1 were replicated with the priming effect significantly larger in the HSR than in all other conditions. It was also larger in the HSR than in LSR condition and smallest but still present in the NSR condition. Taken together, these results indicate that the unconscious processing of only the SR yields a priming effect and that a reduction of the saliency of the SR leads to a reduction of the priming effect, while its elimination does not abolish it.	t	\N
23627625	Fully auditory Brain-computer interfaces based on the dichotic listening task (DL-BCIs) are suited for users unable to do any muscular movement, which includes gazing, exploration or coordination of their eyes looking for inputs in form of feedback, stimulation or visual support. However, one of their disadvantages, in contrast with the visual BCIs, is their lower performance that makes them not adequate in applications that require a high accuracy. To overcome this disadvantage, we employed a Bayesian approach in which the DL-BCI was modeled as a Binary phase shift keying receiver for which the accuracy can be estimated a priori as a function of the signal-to-noise ratio. The results showed the measured accuracy to match the predefined target accuracy, thus validating this model that made possible to estimate in advance the classification accuracy on a trial-by-trial basis. This constitutes a novel methodology in the design of fully auditory DL-BCIs that let us first, define the target accuracy for a specific application and second, classify when the signal-to-noise ratio guarantees that target accuracy.	t	\N
23633439	This study combines functional and structural magnetic resonance imaging to test the "asymmetric sampling in time" (AST) hypothesis, which makes assertions about the symmetrical and asymmetrical representation of speech in the primary and nonprimary auditory cortex. Twenty-three volunteers participated in this parametric clustered-sparse fMRI study. The availability of slowly changing acoustic cues in spoken sentences was systematically reduced over continuous segments with varying lengths (100, 150, 200, 250 ms) by utilizing local time-reversion. As predicted by the hypothesis, functional lateralization in Heschl's gyrus could not be observed. Lateralization in the planum temporale and posterior superior temporal gyrus shifted towards the right hemisphere with decreasing suprasegmental temporal integrity. Cortical thickness of the planum temporale was automatically measured. Participants with an L > R cortical thickness performed better on the in-scanner auditory pattern-matching task. Taken together, these findings support the AST hypothesis and provide substantial novel insight into the division of labor between left and right nonprimary auditory cortex functions during comprehension of spoken utterances. In addition, the present data yield support for a structural-behavioral relationship in the nonprimary auditory cortex.	t	\N
23633643	The purpose of this study was to determine the effect of oral reading expressiveness on the comprehension of storybooks by 4- and 5-year-old prekindergarten children. The possible impact of prosody on listening comprehension was explored. Ninety-two prekindergarten children (M age = 57.26 months, SD = 3.89 months) listened to an expressive or inexpressive recording of 1 of 2 similar stories. Story comprehension was tested using assessments of both free recall and cued recall. Children showed statistically significantly better cued recall for the expressive readings of stories compared to the inexpressive readings of stories. This effect generalized across stories and when story length was controlled across both expressive and inexpressive versions. The effect of expressiveness on children's free recall was not significant. Highly expressive readings resulted in better comprehension of storybooks by prekindergarten children. Further, because recordings were used, this effect might be attributed to the facilitation of language processing rather than to enhanced social interaction between the reader and the child.	t	\N
23645715	Even though language allows us to say exactly what we mean, we often use language to say things indirectly, in a way that depends on the specific communicative context. For example, we can use an apparently straightforward sentence like "It is hard to give a good presentation" to convey deeper meanings, like "Your talk was a mess!" One of the big puzzles in language science is how listeners work out what speakers really mean, which is a skill absolutely central to communication. However, most neuroimaging studies of language comprehension have focused on the arguably much simpler, context-independent process of understanding direct utterances. To examine the neural systems involved in getting at contextually constrained indirect meaning, we used functional magnetic resonance imaging as people listened to indirect replies in spoken dialog. Relative to direct control utterances, indirect replies engaged dorsomedial prefrontal cortex, right temporo-parietal junction and insula, as well as bilateral inferior frontal gyrus and right medial temporal gyrus. This suggests that listeners take the speaker's perspective on both cognitive (theory of mind) and affective (empathy-like) levels. In line with classic pragmatic theories, our results also indicate that currently popular "simulationist" accounts of language comprehension fail to explain how listeners understand the speaker's intended message.	t	\N
23648960	Speech perception requires the effortless mapping from smooth, seemingly continuous changes in sound features into discrete perceptual units, a conversion exemplified in the phenomenon of categorical perception. Explaining how/when the human brain performs this acoustic-phonetic transformation remains an elusive problem in current models and theories of speech perception. In previous attempts to decipher the neural basis of speech perception, it is often unclear whether the alleged brain correlates reflect an underlying percept or merely changes in neural activity that covary with parameters of the stimulus. Here, we recorded neuroelectric activity generated at both cortical and subcortical levels of the auditory pathway elicited by a speech vowel continuum whose percept varied categorically from /u/ to /a/. This integrative approach allows us to characterize how various auditory structures code, transform, and ultimately render the perception of speech material as well as dissociate brain responses reflecting changes in stimulus acoustics from those that index true internalized percepts. We find that activity from the brainstem mirrors properties of the speech waveform with remarkable fidelity, reflecting progressive changes in speech acoustics but not the discrete phonetic classes reported behaviorally. In comparison, patterns of late cortical evoked activity contain information reflecting distinct perceptual categories and predict the abstract phonetic speech boundaries heard by listeners. Our findings demonstrate a critical transformation in neural speech representations between brainstem and early auditory cortex analogous to an acoustic-phonetic mapping necessary to generate categorical speech percepts. Analytic modeling demonstrates that a simple nonlinearity accounts for the transformation between early (subcortical) brain activity and subsequent cortical/behavioral responses to speech (>150-200 ms) thereby describing a plausible mechanism by which the brain achieves its acoustic-to-phonetic mapping. Results provide evidence that the neurophysiological underpinnings of categorical speech are present cortically by ~175 ms after sound enters the ear.	t	\N
23651462	To develop and evaluate a 12-item version of the Speech, Spatial and Qualities of Hearing scale for use in clinical research and rehabilitation settings, and provide a formula for converting scores between the full (SSQ49) and abbreviated (SSQ12) versions. Items were selected independently at the three centres (Eriksholm Research Centre, MRC Institute of Hearing Research, University of New England) to be representative of the complete scale. A consensus was achieved after discussion. The data set (n = 1220) used for a factor analysis (Akeroyd et al, submitted) was re-analysed to compare original SSQ scores (SSQ49) with scores on the short version (SSQ12). A scatter-plot of SSQ12 scores against SSQ49 scores showed that SSQ12 score was about 0.6 of a scale point lower than the SSQ49 (0-10 scale) in the re-analysis of the Akeroyd et al data. SSQ12 scores lay on a slightly steeper slope than scores on the SSQ49. The SSQ12 provides similar results to SSQ49 in a large clinical research sample. The slightly lower average SSQ12 score and the slightly steeper slope reflect the composition of this short form relative to the SSQ49.	t	\N
23652327	To compare speech understanding of the BAHA BP110 and BAHA Intenso sound processors. Prospective experimental study. Tertiary referral center. Twenty experienced user of osseointegrated auditory implants with conductive or mixed hearing loss. In a first session, half of the participants were fitted with an Intenso, the other half with a BP110. After 1 month of use, aided speech understanding in quiet and in noise was measured, and the other test processor was fitted. One month later, speech understanding with the second sound processor was assessed. Speech understanding in quiet and in noise, with noise arriving either from the front, the rear, or the side of the user with the osseointegrated bone conductor. Significant improvements were found for both processors for speech understanding in quiet (+9.6 to +34.8 percent points; p = 0.02 to 0.001) and in noise (+6.2 to +13.8 dB, p < 0.001). No significant differences were found between the 2 devices for speech in quiet. For noise from the rear, subjects were able to understand speech at signal-to-noise ratios which were lower (less favorable) by -5.1 dB (p < 0.001) when compared with the Intenso. Speech understanding is substantially improved by both devices, with no significant differences between the sound processors in quiet. In noise, speech understanding is significantly better with the BP110 when compared to the Intenso for noise from the rear.	t	\N
23654396	Chen et al. [J. Acoust. Soc. Am. 131, 2987-2998 (2012)] evaluated the effectiveness of an algorithm for enhancing spectral changes over time in improving the intelligibility of speech in background sounds for hearing-impaired subjects. The processing improved intelligibility for speech in steady speech-spectrum noise (SSN) but tended to impair intelligibility in a background of two-talker speech (TTS). Large individual differences were found. The present study assessed whether the effectiveness of the algorithm was improved when the parameters that controlled the degree and type of enhancement were chosen individually for each subject, using a genetic algorithm based on subjective preferences for speech clarity. The parameter values selected by the genetic algorithm varied markedly across subjects. Speech intelligibility was measured for unprocessed stimuli and stimuli processed using the selected parameters, with SSN and TTS maskers and two signal-to-masker ratios (SMRs) for each subject. The intelligibility of speech in the SSN masker at the lower SMR was improved about 14 percentage points by the processing. The overall improvement produced by the processing was significantly larger than the improvement observed in the previous study when the parameter values were fixed across subjects, indicating that use of the genetic algorithm was beneficial.	t	\N
23659894	Attentional problems are commonly reported as a feature of the behavioural profile in both Williams syndrome (WS) and Down's syndrome (DS). Recent studies have begun to investigate these impairments empirically, acknowledging the need for an approach that considers cross-syndrome comparisons and developmental changes across the different component functions of attention. The present study assessed children with WS and DS using a new preschool attention battery (ECAB: early childhood attention battery), designed to be suitable for mental age 3-6 years including groups with developmental disorders. The ECAB has the advantage of giving an individual profile of attentional abilities for each child, covering different components of attention. In relation to test norms for their mental age, both groups showed a profile of strengths and weaknesses in the attention domain. Both syndrome groups performed relatively well on tests of sustained attention and poorly on aspects of selective attention and attentional control (executive function). The DS group showed a specific strength in auditory sustained attention, whilst the WS group showed a particular deficit in visuo-spatial response control. There was also evidence for considerable differences in the developmental trajectory of these abilities across the two groups. The results provide evidence for syndrome-specific patterns of impairment, and distinct profiles of strengths and weaknesses that may be useful in understanding the nature of everyday attention difficulties in these groups and tailoring interventions to meet these needs.	t	\N
23664833	Here, we applied a multi-feature mismatch negativity (MMN) paradigm in order to systematically investigate the neuronal representation of vowels and temporally manipulated CV syllables in a homogeneous sample of string players and non-musicians. Based on previous work indicating an increased sensitivity of the musicians' auditory system, we expected to find that musically trained subjects will elicit increased MMN amplitudes in response to temporal variations in CV syllables, namely voice-onset time (VOT) and duration. In addition, since different vowels are principally distinguished by means of frequency information and musicians are superior in extracting tonal (and thus frequency) information from an acoustic stream, we also expected to provide evidence for an increased auditory representation of vowels in the experts. In line with our hypothesis, we could show that musicians are not only advantaged in the pre-attentive encoding of temporal speech cues, but most notably also in processing vowels. Additional "just noticeable difference" measurements suggested that the musicians' perceptual advantage in encoding speech sounds was more likely driven by the generic constitutional properties of a highly trained auditory system, rather than by its specialisation for speech representations per se. These results shed light on the origin of the often reported advantage of musicians in processing a variety of speech sounds.	t	\N
23683806	The present study aimed to investigate the vocal tract and glottal function during and after phonation into a tube and a stirring straw. A male classically trained singer was assessed. Computerized tomography (CT) was performed when the subject produced [a:] at comfortable speaking pitch, phonated into the resonance tube and when repeating [a:] after the exercise. Similar procedure was performed with a narrow straw after 15 minutes silence. Anatomic distances and area measures were obtained from CT midsagittal and transversal images. Acoustic, perceptual, electroglottographic (EGG), and subglottic pressure measures were also obtained. During and after phonation into the tube or straw, the velum closed the nasal passage better, the larynx position lowered, and hypopharynx area widened. Moreover, the ratio between the inlet of the lower pharynx and the outlet of the epilaryngeal tube became larger during and after tube/straw phonation. Acoustic results revealed a stronger spectral prominence in the singer/speaker's formant cluster region after exercising. Listening test demonstrated better voice quality after straw/tube than before. Contact quotient derived from EGG decreased during both tube and straw and remained lower after exercising. Subglottic pressure increased during straw and remained somewhat higher after it. CT and acoustic results indicated that vocal exercises with increased vocal tract impedance lead to increased vocal efficiency and economy. One of the major changes was the more prominent singer's/speaker's formant cluster. Vocal tract and glottal modifications were more prominent during and after straw exercising compared with tube phonation.	t	\N
23684420	Bilateral cochlear implant (CI) users encounter difficulties in localizing sound sources in everyday environments, especially in the presence of background noise and reverberation. They tend to show large directional errors and front-back confusions compared to normal hearing (NH) subjects in the same conditions. In this study, the ability of bilateral CI users to use head movements to improve sound source localization was evaluated. Speech sentences of 0.5, 2, and 4.5 seconds were presented in noise to the listeners in conditions with and without head movements. The results show that for middle and long signal durations, the CI users could significantly reduce the number of front-back confusions. The angular accuracy, however, did not improve. Analysis of head trajectories showed that the CI users had great difficulties in moving their head towards the position of the source, whereas the NH listeners targeted the source loudspeaker correctly.	t	\N
23686398	Morphology of the human brain predicts the speed at which individuals learn to distinguish novel foreign speech sounds after laboratory training. However, little is known about the neuroanatomical basis of individual differences in speech perception when a second language (L2) has been learned in natural environments for extended periods of time. In the present study, two samples of highly proficient bilinguals were selected according to their ability to distinguish between very similar L2 sounds, either isolated (prelexical) or within words (lexical). Structural MRI was acquired and processed to estimate vertex-wise indices of cortical thickness (CT) and surface area (CSA), and the association between cortical morphology and behavioral performance was inspected. Results revealed that performance in the lexical task was negatively associated with the thickness of the left temporal cortex and angular gyrus, as well as with the surface area of the left precuneus. Our findings, consistently with previous fMRI studies, demonstrate that morphology of the reported areas is relevant for word recognition based on phonological information. Further, we discuss the possibility that increased CT and CSA in sound-to-meaning mapping regions, found for poor non-native speech sounds perceivers, would have plastically arisen after extended periods of increased functional activity during L2 exposure.	t	\N
23688330	A sequence of constant-frequency tones can promote streaming in a subsequent sequence of alternating-frequency tones, but why this effect occurs is not fully understood and its time course has not been investigated. Experiment 1 used a 2.0-s-long constant-frequency inducer (10 repetitions of a low-frequency pure tone) to promote segregation in a subsequent, 1.2-s test sequence of alternating low- and high-frequency tones. Replacing the final inducer tone with silence substantially reduced reported test-sequence segregation. This reduction did not occur when either the 4th or 7th inducer was replaced with silence. This suggests that a change at the induction/test-sequence boundary actively resets build-up, rather than less segregation occurring simply because fewer inducer tones were presented. Furthermore, Experiment 2 found that a constant-frequency inducer produced its maximum segregation-promoting effect after only three tones--this contrasts with the more gradual build-up typically observed for alternating-frequency sequences. Experiment 3 required listeners to judge continuously the grouping of 20-s test sequences. Constant-frequency inducers were considerably more effective at promoting segregation than alternating ones; this difference persisted for ∼10 s. In addition, resetting arising from a single deviant (longer tone) was associated only with constant-frequency inducers. Overall, the results suggest that constant-frequency inducers promote segregation by capturing one subset of test-sequence tones into an ongoing, preestablished stream, and that a deviant tone may reduce segregation by disrupting this capture. These findings offer new insight into the dynamics of stream segregation, and have implications for the neural basis of streaming and the role of attention in stream formation.	t	\N
23689300	Evaluation of speech perception in noisy environments for normally hearing children was conducted in order to provide normal data for speech perception testing in children with hearing impairments thus improving early intervention alternatives for Mandarin-speaking children with hearing impairments. The speech perception abilities of 174 developmentally normal children ranging aged 2-5 years, in four age groups, were evaluated in environments that were quiet or with high levels of competing noise using the Mandarin pediatric speech intelligibility (MPSI) test. The mean score of MPSI between the four age groups showed notable statistical differences, including a variation in mean score between the four age groups, clearly indicating that the speech perception abilities of young children in noisy environments improved greatly with age, most notably between the ages of 3 and 4 years old. Speech perception ability in noisy environments was shown to be significantly, but weakly, related to age, implying the presence of other, possibly environment factors, in speech perception development. Furthermore, no statistically significant difference between boys and girls was noted in the experimental MPSI scores. The ability of children to increasingly perceive speech in environments containing high competing noise levels was shown to gradually and progressively increase with age. These results indicated that the developing Mandarin speech perception abilities in noisy environments in normal hearing children develops substantially after the age of 3-4 years, suggesting that similar age ranges may be even more critical intervention points for children with hearing impairments. More studies are still needed to confirm that.	t	\N
23696279	The constituent elements and dynamics of the networks responsible for word production are a central issue to understanding human language. Of particular interest is their dependency on lexical category, particularly the possible segregation of nouns and verbs into separate processing streams. We applied a novel mixed-effects, multilevel analysis to electrocorticographic data collected from 19 patients (1942 electrodes) to examine the activity of broadly disseminated cortical networks during the retrieval of distinct lexical categories. This approach was designed to overcome the issues of sparse sampling and individual variability inherent to invasive electrophysiology. Both noun and verb generation evoked overlapping, yet distinct nonhierarchical processes favoring ventral and dorsal visual streams, respectively. Notable differences in activity patterns were noted in Broca's area and superior lateral temporo-occipital regions (verb > noun) and in parahippocampal and fusiform cortices (noun > verb). Comparisons with functional magnetic resonance imaging (fMRI) results yielded a strong correlation of blood oxygen level-dependent signal and gamma power and an independent estimate of group size needed for fMRI studies of cognition. Our findings imply parallel, lexical category-specific processes and reconcile discrepancies between lesional and functional imaging studies.	t	\N
23700131	Recent research has reported that spatial modulation effects of audiotactile interactions tend to be limited to the space and body parts around the head. The present study investigated the generality of this finding by manipulating body parts stimulated and spatial relationships between the body parts and sounds. In Experiment 1, tactile stimuli were presented randomly to either left or right cheek, hand (palm or back) placed near the head, and knee while auditory stimuli were presented to either the same or opposite side from loudspeakers close to the head. Participants made speeded spatial discrimination responses regarding the side (left versus right) of the tactile stimulation. For any body part stimulated, the performance was worse when the auditory stimuli were presented from the opposite side rather than from the same side. Experiment 2 demonstrated that the spatial modulation effects for the palm or the back of the hand occurred irrespective of hand position (near or far from the head) and sound position (near or far from the head). The sounds delivered from near the head exerted a greater influence on tactile spatial discrimination performance as compared with the sound delivered from far from the head. Furthermore, the back of the hand was more influenced by the auditory stimuli than the palm when the hands were placed near the sounds. These results suggest that the spatial modulation effects of audiotactile interactions can occur beyond the space and body surface around the head.	t	\N
23703134	To compare hearing performance relating to the peripheral and central auditory system between solvent-exposed and non-exposed workers. Forty-eight workers exposed to a mixture of solvents and 48 non-exposed control subjects of matched age, gender and educational level were selected to participate in the study. The evaluation procedures included: pure-tone audiometry (500 - 8,000 Hz), to investigate the peripheral auditory system; the Random Gap Detection test, to assess the central auditory system; and the Amsterdam Inventory for Auditory Disability and Handicap, to investigate subjects' self-reported hearing performance in daily-life activities. A Student t test and analyses of covariance (ANCOVA) were computed to determine possible significant differences between solvent-exposed and non-exposed subjects for the hearing level, Random Gap Detection test and Amsterdam Inventory for Auditory Disability and Handicap. Pearson correlations among the three measures were also calculated. Solvent-exposed subjects exhibited significantly poorer hearing thresholds for the right ear than non-exposed subjects. Also, solvent-exposed subjects exhibited poorer results for the Random Gap Detection test and self-reported poorer listening performance than non-exposed subjects. Results of the Amsterdam Inventory for Auditory Disability and Handicap were significantly correlated with the binaural average of subject pure-tone thresholds and Random Gap Detection test performance. Solvent exposure is associated with poorer hearing performance in daily life activities that relate to the function of the peripheral and central auditory system.	t	\N
23705636	This article explores enhancing sincerity, honesty, or truthfulness in computer-generated synthetic speech by accompanying it with music. Sincerity is important if we are to respond positively to any voice, whether human or artificial. What is sincerity in the artificial disembodied voice? Studies in musical expression and performance may illuminate aspects of the 'musically spoken' or sung voice in rendering deeper levels of expression that may include sincerity. We consider one response to this notion in an especially composed melodrama (music accompanying a (synthetic) spoken voice) designed to convey sincerity.	t	\N
23709064	Spatial ventriloquism refers to the phenomenon that a visual stimulus such as a flash can attract the perceived location of a spatially discordant but temporally synchronous sound. An analogous example of mutual attraction between audition and vision has been found in the temporal domain, where temporal aspects of a visual event, such as its onset, frequency, or duration, can be biased by a slightly asynchronous sound. In this review, we examine various manifestations of spatial and temporal attraction between the senses (both direct effects and aftereffects), and we discuss important constraints on the occurrence of these effects. Factors that potentially modulate ventriloquism-such as attention, synesthetic correspondence, and other cognitive factors-are described. We trace theories and models of spatial and temporal ventriloquism, from the traditional unity assumption and modality appropriateness hypothesis to more recent Bayesian and neural network approaches. Finally, we summarize recent evidence probing the underlying neural mechanisms of spatial and temporal ventriloquism.	t	\N
23711533	Auditory cortices can be separated into dissociable processing pathways similar to those observed in the visual domain. Emotional stimuli elicit enhanced neural activation within sensory cortices when compared to neutral stimuli. This effect is particularly notable in the ventral visual stream. Little is known, however, about how emotion interacts with dorsal processing streams, and essentially nothing is known about the impact of emotion on auditory stimulus localization. In the current study, we used fMRI in concert with individualized auditory virtual environments to investigate the effect of emotion during an auditory stimulus localization task. Surprisingly, participants were significantly slower to localize emotional relative to neutral sounds. A separate localizer scan was performed to isolate neural regions sensitive to stimulus location independent of emotion. When applied to the main experimental task, a significant main effect of location, but not emotion, was found in this ROI. A whole-brain analysis of the data revealed that posterior-medial regions of auditory cortex were modulated by sound location; however, additional anterior-lateral areas of auditory cortex demonstrated enhanced neural activity to emotional compared to neutral stimuli. The latter region resembled areas described in dual pathway models of auditory processing as the 'what' processing stream, prompting a follow-up task to generate an identity-sensitive ROI (the 'what' pathway) independent of location and emotion. Within this region, significant main effects of location and emotion were identified, as well as a significant interaction. These results suggest that emotion modulates activity in the 'what,' but not the 'where,' auditory processing pathway.	t	\N
23716219	The purpose of this study is to investigate the sufficient "similarity" between consecutive auditory events for the auditory system to define the fundamental period for pitch perception. It is possible to contaminate the periodicity of harmonic complex tones by scaling the impulse response in the time domain at every other cycle. Scale-alternating wavelet sequences (SAWS) in which two impulse responses with different scaling factors alternated were generated based on impulse responses obtained from Japanese vowels spoken by a male speaker. Preliminary listening to such signals indicated that the perceived pitch went down an octave relative to the original when the scaling factor exceeded a certain degree. In the first experiment, pitch matching was measured as a function of the scaling factor by the method of adjustment where the comparison stimuli were completely periodic with adjustable base periods. The pitch shift was discontinuous against the base period, chromatic continuum. In the second experiment, pitch matching was investigated with comparison stimuli whose odd harmonics were attenuated. This procedure provides a stimulus continuum where the pitch moved up an octave without changing its pitch chroma. The attenuation of the odd harmonics needed to match the SAWS varied systematically as a function of the degree of scaling. The relation between pitch matching and the peak height along the time interval axis of the stabilized auditory image is discussed.	t	\N
23716244	This study investigated monaural envelope correlation perception (Richards 1987) for noise bandwidths ranging from 25 to 1,600 Hz. The high-frequency side of the low band was fixed at 3,000 Hz and the low-frequency side of the high band was fixed at 3,500 Hz. When comodulated, the magnitude spectra of the pair of noise bands were either identical or reflected around the midpoint. Six listeners with normal hearing participated. Listeners showed similar performance for identical and reflected-spectrum conditions, with best performance usually occurring for bandwidths between 200 and 800 Hz. Results were considered in terms of envelope comparisons of waveforms at the outputs of multiple peripheral filters or envelope comparisons of waveforms at the outputs of central filters set to the bandwidths of the noise stimuli. Some aspects of the results were incompatible with the account based on multiple peripheral filters. However, the results of a supplementary condition involving the gating of band subregions indicated that this incompatibility could be accounted for by nonoptimal weighting of peripheral filter outputs.	t	\N
23716245	Envelope fluctuations of complex sounds carry information that is -essential for many types of discrimination and for detection in noise. To study the neural representation of envelope information and mechanisms for processing of this temporal aspect of sounds, it is useful to identify an animal model that can -sensitively detect amplitude modulations (AM). Low modulation frequencies, which dominate speech sounds, are of particular interest. Yet, most animal -models studied previously are relatively insensitive to AM at low modulation -frequencies. Rabbits have high thresholds for low-frequency modulations, -especially for tone carriers. Rhesus macaques are less sensitive than humans to low-frequency -modulations of wideband noise (O'Conner et al. Hear Res 277, 37-43, 2011). Rats and -chinchilla also have higher thresholds than humans for amplitude -modulations of noise (Kelly et al. J Comp Psychol 120, 98-105, 2006; Henderson et al. J Acoust Soc Am 75, -1177-1183, 1984). In contrast, the budgerigar has thresholds for AM detection of wideband noise similar to those of human listeners at low -modulation frequencies (Dooling and Searcy. Percept Psychophys 46, 65-71, 1981). A -one-interval, two-alternative operant conditioning procedure was used to estimate AM -detection thresholds for 4-kHz tone carriers at low modulation -frequencies (4-256 Hz). Budgerigar thresholds are comparable to those of human subjects in a comparable task. Implications of these comparative results for temporal coding of complex sounds are discussed. Comparative results for masked AM detection are also presented.	t	\N
23716246	Compared to humans, Mongolian gerbils (Meriones unguiculatus) are much more sensitive at detecting mistuning of frequency components of a harmonic complex (Klinge and Klump. J Acoust Soc Am 128:280-290, 2010). One processing mechanism suggested to result in the high sensitivity involves evaluating the phase shift that gradually develops between the mistuned and the remaining components in the same or separate auditory filters. To investigate if this processing mechanism may explain the observed sensitivity, we determined the gerbils' thresholds to detect a constant phase shift in a component of a harmonic complex that is introduced without a frequency shift. The gerbils' detection thresholds for constant phase shifts were considerably lower for a high-frequency component (6,400 Hz) than for a low-frequency component (400 Hz) of a 200-Hz harmonic complex and increased with decreasing stimulus duration. Compared to the phase shifts calculated from the mistuning detection thresholds, the detection thresholds for constant phase shifts were similar to those for gradual phase shifts for the low-frequency harmonic but considerably lower for the high-frequency harmonic. A simulation of the processing of harmonic complexes by the gerbil's peripheral auditory filters when components are phase shifted shows waveform changes comparable to those assessed for mistuning detection Klinge and Klump (J Acoust Soc Am 128:280-290, 2010) and provides evidence that detection of the gradual phase shifts may underlie mistuning detection.	t	\N
23716247	Detecting rare and surprising events is a useful strategy for sensory -systems. In the human auditory system, deviance detection is indexed by an important component of the auditory event-related potentials, the mismatch negativity (MMN). Responses of single neurons in the inferior colliculus, medial geniculate body, and auditory cortex of mammals (cats, rats, and mice) show responses that share some properties with MMN: they are evoked by rare events, are preattentive (in as much as they occur in anesthetized animals), and, at least at the level of primary auditory cortex, cannot be accounted for by simple fatigue of the incoming sensory information. Here we extend these results to deviations beyond tone frequency. Recording in rat primary auditory cortex and using oddball sequences consisting of two frozen tokens of broadband noise samples, we found differences between the responses to the same token when used as the common and when used as the deviant, showing an exquisite sensitivity to the small differences between two spectro-temporally similar sounds. Similarly, differential adaptation can be demonstrated when using two word-like stimuli that have been derived from human speech but adapted to the rat auditory system. Thus, differential adaptation to common and rare sounds is present also with sounds whose complexity mirrors that of natural environments.	t	\N
23716255	Many previous studies have shown that a tone that is momentarily -interrupted can be perceived as continuous if the interruption is completely masked by noise. It has been suggested this "continuity illusion" occurs only when peripheral neural responses contain no evidence that the signal was interrupted. In this study, we used a combination of psychophysical measures and computational simulations of peripheral auditory responses to examine whether the continuity illusion can be experienced under conditions where peripheral neural responses contain evidence that the signal did not continue through the masker. Our results provide an example of a salient continuity illusion despite evidence of an interruption in the peripheral representation, indicating that the illusion may depend more on global features of the interrupting sound, such as its long-term specific loudness, than on its fine-grained temporal structure.	t	\N
23717403	The present study of KCNQ4 mutations was carried out to 1) determine the prevalence by unbiased population-based genetic screening, 2) clarify the mutation spectrum and genotype/phenotype correlations, and 3) summarize clinical characteristics. In addition, a review of the reported mutations was performed for better understanding of this deafness gene. The screening using 287 probands from unbiased Japanese autosomal dominant nonsyndromic hearing loss (ADNSHL) families identified 19 families with 7 different disease causing mutations, indicating that the frequency is 6.62% (19/287). While the majority were private mutations, one particular recurrent mutation, c.211delC, was observed in 13 unrelated families. Haplotype analysis in the vicinity of c.211delC suggests existence of a common ancestor. The majority of the patients showed all frequency, but high-frequency predominant, sensorineural hearing loss. The present study adds a new typical audiogram configuration characterized by mid-frequency predominant hearing loss caused by the p.V230E mutation. A variant at the N-terminal site (c. 211delC) showed typical ski-slope type audiogram configuration. Concerning clinical features, onset age was from 3 to 40 years old, and mostly in the teens, and hearing loss was gradually progressive. Progressive nature is a common feature of patients with KCNQ4 mutations regardless of the mutation type. In conclusion, KCNQ4 mutations are frequent among ADNSHL patients, and therefore screening of the gene and molecular confirmation of these mutations have become important in the diagnosis of these conditions.	t	\N
23727883	Visual motion aftereffects can occur contingent on arbitrary sounds. Two circles, placed side by side, were alternately presented, and the onsets were accompanied by tone bursts of high and low frequencies, respectively. After a few minutes of exposure to the visual apparent motion with the tones, a circle blinking at a fixed location was perceived as a lateral motion in the same direction as the previously exposed apparent motion (Teramoto et al. in PLoS One 5:e12255, 2010). In the present study, we attempted to reverse this contingency (pitch aftereffects contingent on visual information). Results showed that after prolonged exposure to the audio-visual stimuli, the apparent visual motion systematically affected the perceived pitch of the auditory stimuli. When the leftward apparent visual motion was paired with the high-low-frequency sequence during the adaptation phase, a test tone sequence was more frequently perceived as a high-low-pitch sequence when the leftward apparent visual motion was presented and vice versa. Furthermore, the effect was specific for the exposed visual field and did not transfer to the other side, thus ruling out an explanation in terms of simple response bias. These results suggest that new audiovisual associations can be established within a short time, and visual information processing and auditory processing can mutually influence each other.	t	\N
23740184	Age-related hearing loss (presbyacusis) has a complex etiology. Results from animal models detailing the effects of specific cochlear injuries on audiometric profiles may be used to understand the mechanisms underlying hearing loss in older humans and predict cochlear pathologies associated with certain audiometric configurations ("audiometric phenotypes"). Patterns of hearing loss associated with cochlear pathology in animal models were used to define schematic boundaries of human audiograms. Pathologies included evidence for metabolic, sensory, and a mixed metabolic + sensory phenotype; an older normal phenotype without threshold elevation was also defined. Audiograms from a large sample of older adults were then searched by a human expert for "exemplars" (best examples) of these phenotypes, without knowledge of the human subject demographic information. Mean thresholds and slopes of higher frequency thresholds of the audiograms assigned to the four phenotypes were consistent with the predefined schematic boundaries and differed significantly from each other. Significant differences in age, gender, and noise exposure history provided external validity for the four phenotypes. Three supervised machine learning classifiers were then used to assess reliability of the exemplar training set to estimate the probability that newly obtained audiograms exhibited one of the four phenotypes. These procedures classified the exemplars with a high degree of accuracy; classifications of the remaining cases were consistent with the exemplars with respect to average thresholds and demographic information. These results suggest that animal models of age-related hearing loss can be used to predict human cochlear pathology by classifying audiograms into phenotypic classifications that reflect probable etiologies for hearing loss in older humans.	t	\N
23741046	Many manual tasks involve object manipulation and are achieved by an evolving series of actions, or action phases, recruited to achieve task subgoals. The ability to effectively link action phases is an important component of manual dexterity. However, our understanding of how the effective linking of sequential action phases develops with skill learning is limited. Here, we addressed this issue using a task in which participants applied forces to a handle to move a cursor on a computer screen to successively acquire visual targets. Target acquisition required actively holding the cursor within the target zone (hold phase) for a required duration before moving to the next target (transport phase). If the transport phase was initiated prematurely, before the end of the required hold duration, participants had to return to the target to acquire it. The goal was to acquire targets as quickly as possible. Distinct visual and auditory sensory events marked goal completion of each action phase. During initial task performance, the transport phase was reactively triggered by sensory events signaling hold phase completion. However, with practice, participants learned to initiate the transport phase based on a prediction of the time of hold phase completion. Simulations revealed that participants learned to near-optimally compensate for temporal uncertainty, presumably related to estimation of time intervals and execution of motor commands, so as to reduce the average latency between the end of the required hold phase duration and the start of the transport phase, while avoiding an excess of premature exits.	t	\N
23742366	Estimates of human basilar membrane gain and compression obtained using temporal masking curve (TMC) and additivity of forward masking (AFM) methods with long-duration maskers or long masker-signal silent intervals may be affected by olivocochlear efferent activation, which reduces basilar membrane gain. The present study introduces a fixed-duration masking curve (FDMC) method, which involves a comparison of off- and on-frequency forward masker levels at threshold as a function of masker and signal duration, with the total masker-signal duration fixed at 25 ms to minimize efferent effects. Gain and compression estimates from the FDMC technique were compared with those from TMC (104-ms maskers) and AFM (10- and 200-ms maskers) methods. Compression estimates over an input-masker range of 40-60 dB sound pressure level were similar for the four methods. Maximum compression occurred at a lower input level for the FDMC compared to the TMC method. Estimates of gain were similar for TMC and FDMC methods. The FDMC method may provide a more reliable estimate of BM gain and compression in the absence of efferent activation and could be a useful method for estimating effects of efferent activity when used with a precursor sound (to trigger efferent activation), presented prior to the combined masker-signal stimulus.	t	\N
23742368	Masking functions and fixed-signal functions were constructed using a narrow range of pedestal intensities for 10-ms, 1000-Hz gated tones. Data from three experiments agreed with previously reported data, clearly demonstrating negative masking and the pedestal effect. The data extend earlier findings by showing (1) the resilience of the pedestal effect when a background noise masker is introduced; (2) a possible indifference of the fixed-signal function to stimulus duration; (3) the ability of a set of psychometric functions to produce both masking and fixed-signal functions; (4) depending on method, the impact of unit choice on the interpretation of both the pedestal effect and negative masking data. Results are discussed in relation to current psychophysical models, and suggest that accounting for the auditory system's sensitivity to differences in low-level sounds remains a challenge.	t	\N
23745759	This study investigates the influence of stress grouping on verbal short-term memory (STM). English speakers show a preference to combine syllables into trochaic groups, both lexically and in continuous speech. In two serial recall experiments, auditory lists of nonsense syllables were presented with either trochaic (STRONG-weak) or iambic (weak-STRONG) stress patterns, or in monotone. The acoustic correlates that carry stress were also manipulated in order to examine the relationship between input and output processes during recall. In Experiment 1, stressed and unstressed syllables differed in intensity and pitch but were matched for spoken duration. Significantly more syllables were recalled in the trochaic stress pattern condition than in the iambic and monotone conditions, which did not differ. In Experiment 2, spoken duration and pitch were manipulated but intensity was held constant. No effects of stress grouping were observed, suggesting that intensity is a critical acoustic factor for trochaic grouping. Acoustic analyses demonstrated that speech output was not identical to the auditory input, but that participants generated correct stress patterns by manipulating acoustic correlates in the same way in both experiments. These data challenge the idea of a language-independent STM store and support the notion of separable phonological input and output processes.	t	\N
23751011	Cognitive theories on consciousness propose a strong link between consciousness and working memory (WM). This link is also present at the neural level: Both consciousness and WM have been implicated in a prefrontal parietal network. However, the link remains empirically unexplored. The present study investigates the relation between consciousness and WM by studying the impact of WM load on one aspect of consciousness, the threshold for subjective visibility. At the same time, we investigated how WM affects cognitive control, a function that has been implicated to consciousness. Results showed an increase in the threshold of subjective visibility when WM load increased. Furthermore, the impact of a prime stimulus on the response to the target was also modulated by WM load. We propose that the observed interference is caused by the functional and neural overlap of these functions.	t	\N
23751171	The extent of visual perceptual processing that occurs in the absence of awareness is as yet unclear. Here we examined event-related-potential (ERP) indices of visual and cognitive processes as awareness was manipulated through object-substitution masking (OSM), an awareness-disrupting effect that has been hypothesized to result from the disruption of reentrant signaling to low-level visual cortical areas. In OSM, a visual stimulus array is briefly presented that includes a parafoveal visual target denoted by a cue, typically consisting of several surrounding dots. When the offset of the target-surrounding cue dots is delayed relative to the rest of the array, a striking reduction in the perception of the target image surrounded by the dots is observed. Using faces and houses as the target stimuli, we found that successful OSM reduced or eliminated all the measured electrophysiological indices of visual processing stages after 130ms post-stimulus. More specifically, when targets were missed within the masked condition (i.e., on trials with effective OSM that disrupted awareness), we observed fully intact early feed-forward processing up through the visual extrastriate P1 ERP component peaking at 100ms, followed by reduced low-level activity over the occipital pole 130-170ms post-stimulus, reduced ERP indices of lateralized shifts of attention toward the parafoveal target, reduced object-generic visual processing, abolished object-category-specific (face-specific) processing, and reduced late visual short-term-memory processing activity. The results provide a comprehensive electrophysiological account of the neurocognitive underpinnings of effective OSM of visual-object images, including evidence for central roles of early reentrant signal disruption and insufficient visual attentional deployment.	t	\N
23757308	Most people cannot name the musical note that corresponds to a particular pitch without being provided a reference note, but those people with absolute pitch (AP) can do this accurately. Early experience during a developmental period is often thought to convey identity and stability of the note categories in people with AP, but the plasticity of these categories has not been investigated. Here we provide the first evidence that the note categories of adults with AP can change with listening experience. Participants with AP showed shifts in perception in direct accord with prior exposure to music detuned by a fraction of a semitone. This suggests that the apparent stability of AP categories is conferred not by early experience but rather by the cultural norms adopted for tuning music.	t	\N
23758506	Levitin's findings that nonmusicians could produce from memory the absolute pitches of self-selected pop songs have been widely cited in the music psychology literature. These findings suggest that latent absolute pitch (AP) memory may be a more widespread trait within the population than traditional AP labelling ability. However, it has been left unclear what factors may facilitate absolute pitch retention for familiar pieces of music. The aim of the present paper was to investigate factors that may contribute to latent AP memory using Levitin's sung production paradigm for AP memory and comparing results to the outcomes of a pitch labelling task, a relative pitch memory test, measures of music-induced emotions, and various measures of participants' musical backgrounds. Our results suggest that relative pitch memory and the quality and degree of music-elicited emotions impact on latent AP memory.	t	\N
23774001	Auditory alarm misperception is one of the critical events that lead aircraft pilots to an erroneous flying decision. The rarity of these alarms associated with their possible unreliability may play a role in this misperception. In order to investigate this hypothesis, we manipulated both audiovisual conflict and sound rarity in a simplified landing task. Behavioral data and event related potentials (ERPs) of thirteen healthy participants were analyzed. We found that the presentation of a rare auditory signal (i.e., an alarm), incongruent with visual information, led to a smaller amplitude of the auditory N100 (i.e., less negative) compared to the condition in which both signals were congruent. Moreover, the incongruity between the visual information and the rare sound did not significantly affect reaction times, suggesting that the rare sound was neglected. We propose that the lower N100 amplitude reflects an early visual-to-auditory gating that depends on the rarity of the sound. In complex aircraft environments, this early effect might be partly responsible for auditory alarm insensitivity. Our results provide a new basis for future aeronautic studies and the development of countermeasures.	t	\N
23776244	Individuals with autism spectrum disorders (ASDs) often show insensitivity to the human voice, a deficit that is thought to play a key role in communication deficits in this population. The social motivation theory of ASD predicts that impaired function of reward and emotional systems impedes children with ASD from actively engaging with speech. Here we explore this theory by investigating distributed brain systems underlying human voice perception in children with ASD. Using resting-state functional MRI data acquired from 20 children with ASD and 19 age- and intelligence quotient-matched typically developing children, we examined intrinsic functional connectivity of voice-selective bilateral posterior superior temporal sulcus (pSTS). Children with ASD showed a striking pattern of underconnectivity between left-hemisphere pSTS and distributed nodes of the dopaminergic reward pathway, including bilateral ventral tegmental areas and nucleus accumbens, left-hemisphere insula, orbitofrontal cortex, and ventromedial prefrontal cortex. Children with ASD also showed underconnectivity between right-hemisphere pSTS, a region known for processing speech prosody, and the orbitofrontal cortex and amygdala, brain regions critical for emotion-related associative learning. The degree of underconnectivity between voice-selective cortex and reward pathways predicted symptom severity for communication deficits in children with ASD. Our results suggest that weak connectivity of voice-selective cortex and brain structures involved in reward and emotion may impair the ability of children with ASD to experience speech as a pleasurable stimulus, thereby impacting language and social skill development in this population. Our study provides support for the social motivation theory of ASD.	t	\N
23778471	The audibility thresholds for the sound frequency of 137 upward- and downward-sloping audiograms showing sensorineural hearing loss were selected and analyzed in conjunction with speech recognition thresholds obtained from individuals seen at a public otolaryngology clinic to determine which frequencies in slope audiograms best represent speech recognition thresholds. The linear regression model and mean square error were used to determine the associations between the threshold values. The mean square error identified larger errors when using thresholds of 500, 1000, and 2000 Hz than when using audibility thresholds of 500, 1000, 2000, and 4000 Hz. The linear regression model showed a higher correlation (91%) between the audiogram thresholds for frequencies of 500, 1000, 2000, and 4000 Hz than for the frequencies of 500, 1000, and 2000 Hz (88%). Frequencies of 500, 1000, 2000, and 4000 Hz were the most significant in predicting the speech recognition threshold.	t	\N
23784072	Several sources of evidence point toward a link between asymmetry of prefrontal brain activity and approach-withdrawal tendencies. Here, we tested the causal nature of this link and examined if the categorization of an ambiguous approach- or withdrawal-related vocal signal can be biased by manipulating left and right frontal neural activity. We used voice morphing of affective non-verbal vocalizations to create individually tailored affectively ambiguous stimuli on an Anger-Fear continuum-two emotions that represent extremes on the approach-withdrawal dimension. We tested perception of these stimuli after 10 min of low-frequency repetitive transcranial magnetic stimulation over left or right dorsolateral prefrontal cortex or over the vertex (control), a technique that has transient inhibitory effects on the targeted brain region. As expected, ambiguous stimuli were more likely perceived as expressing Anger (approach) than Fear (withdrawal) after right prefrontal compared with left prefrontal or control stimulation. These results provide the first evidence that the manipulation of asymmetrical activity in prefrontal cortex can change the explicit categorization of ambiguous emotional signals.	t	\N
23785181	To evaluate child-adult differences for consonant identification in a noise or a 2-talker masker. Error patterns were compared across age and masker type to test the hypothesis that errors with the noise masker reflect limitations in the peripheral encoding of speech, whereas errors with the 2-talker masker reflect target-masker confusions within the central auditory system. A repeated-measures design compared the performance of children (5-13 years) and adults in continuous speech-shaped noise or a 2-talker masker. Consonants were identified from a closed set of 12 using a picture-pointing response. In speech-shaped noise, children under age 10 years performed more poorly than adults, but performance was adultlike for 11- to 13-year-olds. In the 2-talker masker, significant child-adult differences were observed in even the oldest group of children. Systematic clusters of consonant errors were observed for children in the noise masker and for adults in both maskers, but not for children in the 2-talker masker. These results suggest a more prolonged time course of development for consonant identification in a 2-talker masker than in a noise masker. Differences in error patterns between the maskers support the hypothesis that errors with the 2-talker masker reflect failures of sound segregation.	t	\N
23785182	In this study, the authors assessed the influence of masking level (29% or 71% sentence perception) and test modality on the processing load during language perception as reflected by the pupil response. In addition, the authors administered a delayed cued stimulus recall test to examine whether processing load affected the encoding of the stimuli in memory. Participants performed speech and text reception threshold tests, during which the pupil response was measured. In the cued recall test, the first half of correctly perceived sentences was presented, and participants were asked to complete the sentences. Reading and listening span tests of working memory capacity were presented as well. Regardless of test modality, the pupil response indicated higher processing load in the 29% condition than in the 71% correct condition. Cued recall was better for the 29% condition. The consistent effect of masking level on the pupil response during listening and reading support the validity of the pupil response as a measure of processing load during language perception. The absent relation between pupil response and cued recall may suggest that cued recall is not directly related to processing load, as reflected by the pupil response.	t	\N
23786392	Exposure to loud noise can impair cochlear microcirculation and cause noise-induced hearing loss (NIHL). TNF-α signaling has been shown to be activated in NIHL and to control spiral modiolar artery vasoconstriction that regulates cochlear microcirculation. It was the aim of this experimental study to analyse the effects of the TNF-α inhibitor etanercept on cochlear microcirculation and hearing threshold shift in NIHL in vivo. After assessment of normacusis using ABR, loud noise (106 dB SPL, 30 minutes) was applied on both ears in guinea pigs. Etanercept was administered systemically after loud noise exposure while control animals received a saline solution. In vivo fluorescence microscopy of strial capillaries was performed after surgical exposure of the cochlea for microcirculatory analysis. ABR measurements were derived from the contralateral ear. Guinea pigs (n = 6, per group). Compared to controls, cochlear blood flow in strial capillary segments was significantly increased in etanercept-treated animals. Additionally, hearing threshold was preserved in animals receiving the TNF-α inhibitor in contrast to a significant threshold raising in controls. TNF-α inhibition using etanercept improves cochlear microcirculation and protects hearing levels after loud noise exposure and appears as a promising treatment strategy for human NIHL.	t	\N
23786395	This study investigates the relation between diagnosis of dead regions based on the off-frequency psychophysical tuning curve (PTC) tip and the frequency and level of the probe tone. A previously developed functional model of auditory processing was used to simulate the complete loss of inner hair cells (IHC), dysfunction of outer hair cells (OHC), complete loss of IHCs in combination with OHC dysfunction, and IHC insensitivity. The model predictions were verified through comparison with experimental data. This study compares PTC data of five normal-hearing listeners and six hearing-impaired listeners with model-simulated PTC data. It was shown that OHC activity and IHC insensitivity may significantly alter the shift of PTC tips with increasing probe level. Model results suggest that OHC activity and IHC insensitivity can change the outcome of dead region diagnosis using PTCs. Supplementary to PTC dead region diagnostic information, model results may provide additional information regarding the edge frequency of a dead region and OHC function.	t	\N
23789623	Pickering & Garrod's (P&G's) integrated model of production and comprehension includes no explicit role for nonlinguistic cognitive processes. Yet, how domain-general cognitive functions contribute to language processing has become clearer with well-specified theories and supporting data. We therefore believe that their account can benefit by incorporating functions like working memory and cognitive control into a unified model of language processing.	t	\N
23789938	Pickering & Garrod (P&G) consider the possibility that inner speech might be a product of forward production models. Here I consider the idea of inner speech as a forward model in light of empirical work from the past few decades, concluding that, while forward models could contribute to it, inner speech nonetheless requires activity from the implementers.	t	\N
23790092	The purpose of this study was to evaluate the effect of lengthening of voice onset time and burst duration of selected speech stimuli on perception by individuals with auditory dys-synchrony. This is the second of a series of articles reporting the effect of signal enhancing strategies on speech perception by such individuals. Two experiments were conducted: (1) assessment of the 'just-noticeable difference' for voice onset time and burst duration of speech sounds; and (2) assessment of speech identification scores when speech sounds were modified by lengthening the voice onset time and the burst duration in units of one just-noticeable difference, both in isolation and in combination with each other plus transition duration modification. Lengthening of voice onset time as well as burst duration improved perception of voicing. However, the effect of voice onset time modification was greater than that of burst duration modification. Although combined lengthening of voice onset time, burst duration and transition duration resulted in improved speech perception, the improvement was less than that due to lengthening of transition duration alone. These results suggest that innovative speech processing strategies that enhance temporal cues may benefit individuals with auditory dys-synchrony.	t	\N
23790958	To determine the prevalence of otitis media with effusion (OME) in children with Down syndrome (DS), and the associated to hearing loss at the age of 8 years. A national population based clinical study of all children with DS born in Norway in 2002. OME was found in 20 out of 52 (38%) children. Those with OME had a significant lower hearing level with a mean pure tone average (PTA) of 33.4 dB HL compared to children with no OME whose mean PTA was 21.7 dB HL (p < 0.0001). Verified hearing loss above 25 dB HL in the better hearing ear was found in 12 out of the 20 with OME, compared to 5 out 31 without OME. The findings of this present study uncover the increased risk of OME in eight year old children with DS as current otitis media was found in one of three. This reduced hearing ability in children with DS due to OME at age of 8 strongly emphasizes the need for optimal treatment and follow up to optimize hearing rehabilitation. The findings are further supported by the population based study design, the focus on the narrow age band and the high response rate.	t	\N
23801322	When a deviant (oddball) stimulus is presented within a series of otherwise identical (standard) stimuli, the duration of the oddball tends to be overestimated. Two experiments investigated factors affecting systematic distortions in the perceived duration of oddball stimuli. Both experiments used an auditory oddball paradigm where oddball tones varied in both their pitch distance from the pitch of a standard tone and their likelihood of occurrence. Experiment 1 revealed that (1) far-pitch oddballs were perceived to be longer than near-pitch oddballs, (2) effects of pitch distance were greater in low-likelihood conditions, and (3) oddballs in later serial positions were perceived to be longer than oddballs in earlier serial positions. The above effects held regardless of whether oddballs were higher or lower in pitch than the standard. Experiment 2 revealed a pattern of response times in an oddball detection task that generally paralleled the pattern of data observed in Experiment 1; across conditions, there was a negative correlation between detection times and perceived duration. Taken together, the results suggest that the observed effects of oddball pitch, likelihood, and position on perceived duration are at least partly driven by how quickly individuals are able to initiate timing the oddball following its onset. Implications for different theoretical accounts of the oddball effect are discussed.	t	\N
23817922	For this research, we used a dual-task approach to investigate the involvement of working memory in following written instructions. In two experiments, participants read instructions to perform a series of actions on objects and then recalled the instructions either by spoken repetition or performance of the action sequence. Participants engaged in concurrent articulatory suppression, backward-counting, and spatial-tapping tasks during the presentation of the instructions, in order to disrupt the phonological-loop, central-executive, and visuospatial-sketchpad components of working memory, respectively. Recall accuracy was substantially disrupted by all three concurrent tasks, indicating that encoding and retaining verbal instructions depends on multiple components of working memory. The accuracy of recalling the instructions was greater when the actions were performed than when the instructions were repeated, and this advantage was unaffected by the concurrent tasks, suggesting that the benefit of enactment over oral repetition does not cost additional working memory resources.	t	\N
23831479	Mismatch negativity (MMN) is a promising window on how the functional integrity of auditory sensory memory and change discrimination is modulated by age and relevant clinical conditions. However, the effects of aging on MMN have remained somewhat elusive, particularly at short interstimulus intervals (ISIs). We performed a meta-analysis of peer-reviewed MMN studies that had targeted both young and elderly adults to estimate the mean effect size. Nine studies, consisting of 29 individual investigations, were included and the final total study population consisted of 182 young and 165 elderly subjects. The effects of different deviant types and duration of ISIs on the effect size were assessed. The overall mean effect size was 0.63 (95% CI at 0.43-0.82). The effect sizes for long ISI (>2s, effect size 0.68, 95% CI at 0.31-1.06) and short ISI (<2s, effect size 0.61, 95% CI at 0.39-0.84) were both considered moderate. A further analysis showed a prominent aging-related decrease in MMN responses to duration and frequency changes at short ISIs. It was also interesting to note that the effect size was about 25% larger for duration deviant condition compared to the frequency deviant condition. In conclusion, a reduced MMN response to duration and frequency deviants is a robust feature among the aged adults, which suggests that there has been a decline in the functional integrity of central auditory processing in this population.	t	\N
23842602	The aim of this study was to investigate the influence of impulse noise on age-related hearing loss. The study consisted of two groups. Each group contained 109 men. Group I comprised veterans with normal hearing at the end of 1979 sino-vietnamese war. All these veterans were randomly selected from Guangzhou Military Command. Group II were men with no military experience randomly chosen from the health examination center of Guangzhou General Hospital of Guangzhou Military Command. Pure-tone thresholds of these two groups were measured and compared. The pure-tone thresholds of Group I were poorer than those of Group II at the frequencies of 4, 6 and 8 kHz. Thus, impulse noise accelerates age-related hearing loss.	t	\N
23855495	Recent evidence suggests that humans can become fearful after exposure to olfactory fear signals, yet these studies have reported the effects of fear chemosignals without examining emotion-relevant input from traditional communication modalities (i.e., vision, audition). The question that we pursued here was therefore: How significant is an olfactory fear signal in the broader context of audiovisual input that either confirms or contradicts olfactory information? To test this, we manipulated olfactory (fear, no fear) and audiovisual (fear, no fear) information and demonstrated that olfactory fear signals were as potent as audiovisual fear signals in eliciting a fearful facial expression. Irrespective of confirmatory or contradictory audiovisual information, olfactory fear signals produced by senders induced fear in receivers outside of conscious access. These findings run counter to traditional views that emotions are communicated exclusively via visual and linguistic channels.	t	\N
23864263	Similarities have been observed in the localization of the final position of moving visual and moving auditory stimuli: Perceived endpoints that are judged to be farther in the direction of motion in both modalities likely reflect extrapolation of the trajectory, mediated by predictive mechanisms at higher cognitive levels. However, actual comparisons of the magnitudes of displacement between visual tasks and auditory tasks using the same experimental setup are rare. As such, the purpose of the present free-field study was to investigate the influences of the spatial location of motion offset, stimulus velocity, and motion direction on the localization of the final positions of moving auditory stimuli (Experiment 1 and 2) and moving visual stimuli (Experiment 3). To assess whether auditory performance is affected by dynamically changing binaural cues that are used for the localization of moving auditory stimuli (interaural time differences for low-frequency sounds and interaural intensity differences for high-frequency sounds), two distinct noise bands were employed in Experiments 1 and 2. In all three experiments, less precise encoding of spatial coordinates in paralateral space resulted in larger forward displacements, but this effect was drowned out by the underestimation of target eccentricity in the extreme periphery. Furthermore, our results revealed clear differences between visual and auditory tasks. Displacements in the visual task were dependent on velocity and the spatial location of the final position, but an additional influence of motion direction was observed in the auditory tasks. Together, these findings indicate that the modality-specific processing of motion parameters affects the extrapolation of the trajectory.	t	\N
23866207	Although much research has been done to study the working memory structure in children in their first school years, the relation of cognitive constructs involved in this process remains uncertain. In particular, it is unclear whether working memory is a domain general construct that coordinates separate codes of verbal and visuospatial storage or whether it is a domain-specific construct with distinct resources of verbal and visuospatial information. This paper investigates the structure of working memory, by using the Working Memory Test Battery for Children (WMTB-C) and by doing confirmatory factor analyses (CFAs) on a sample of Portuguese children (n = 103) between 8 and 9 years of age. The results of the confirmatory factor analyses that provide the best fit of the data correspond to the model that includes Central Executive and Visuospatial Sketchpad in the same factor, co-varying with a Phonological Loop factor. Moreover, the traditional working memory tripartite structure--based on the Baddeley and Hitch Model--revealed good fit to the data.	t	\N
23871868	Accounts of the functional role of the frontal cortex in pre-attentive auditory change detection include attention switching, response inhibition, contrast enhancement, and activation of a predictive model. These accounts assume different sequential activation patterns between the temporal and frontal cortices: Change detection in the auditory areas of the superior temporal cortex (STC) followed by inferior frontal cortex (IFC) activation for attention switching and response inhibition; STC preceded by IFC activation for contrast enhancement; and an IFC-STC-IFC activation sequence for the predictive model. We used the event-related optical signal (EROS), which provides a temporal resolution of milliseconds and a spatial resolution of 5 to 10mm, combined with lagged correlation path modeling to examine the response of the right frontal and temporal cortices to auditory duration deviants of varying magnitude. Event-related potentials (ERPs) were also recorded, as was the slow optical (hemodynamic) brain response. The data analyses revealed temporal-frontal, frontal-temporal-frontal, and temporal-frontal activation patterns when the deviants represented relatively large, medium, and small changes from the standard stimulus, respectively. These results indicate that the degree of deviance modulates spatio-temporal dynamics within the STC-IFC auditory change detection network.	t	\N
23874705	Besides the influence of dopaminergic neurotransmission on negative symptoms in schizophrenia, there is evidence that alterations of serotonin (5-HT) system functioning also play a crucial role in the pathophysiology of these disabling symptoms. From post mortem and genetic studies on patients with negative symptoms a 5-HT dysfunction is documented. In addition atypical neuroleptics and some antidepressants improve negative symptoms via serotonergic action. So far no research has been done to directly clarify the association between the serotonergic functioning and the extent of negative symptoms. Therefore, we examined the status of brain 5-HT level in negative symptoms in schizophrenia by means of the loudness dependence of auditory evoked potentials (LDAEP). The LDAEP provides a well established and non-invasive in vivo marker of the central 5-HT activity. We investigated 13 patients with schizophrenia with predominant negative symptoms treated with atypical neuroleptics and 13 healthy age and gender matched controls with a 32-channel EEG. The LDAEP of the N1/P2 component was evaluated by dipole source analysis and single electrode estimation at Cz. Psychopathological parameters, nicotine use and medication were assessed to control for additional influencing factors. Schizophrenic patients showed significantly higher LDAEP in both hemispheres than controls. Furthermore, the LDAEP in the right hemisphere in patients was related to higher scores in scales assessing negative symptoms. A relationship with positive symptoms was not found. These data might suggest a diminished central serotonergic neurotransmission in patients with predominant negative symptoms.	t	\N
23876942	The goal of the present study was to determine if students can be trained to reliably perceive dysphonia using the Dysphonic Severity Percentage (DSP) scale, a perceptual measure shown to have high interrater reliability when used by speech-language pathologists experienced with voice disorders. Because the DSP scale was found to be useful as a research tool in the measurement of dysphonia, using it to train students to recognize dysphonia can enhance their education as future clinicians and researchers. This method involved having five inexperienced speech-language pathology students listen to voice samples in two conditions (spontaneous speech and paragraph reading) of 10 clients with moderate to severe dysphonia (phonotrauma); the students simultaneously tallied the nondysphonic syllables on written content of the samples to obtain a DSP for both conditions. Because the clients' dysphonias were moderate to severe, and there were many dysphonic syllables, it was more efficient and advantageous for the students' perception and training to tally the nondysphonic syllables, leaving the dysphonic syllables to calculate the DSP. By tallying the nondysphonic syllables, the students were still recognizing which syllables were dysphonic by not choosing them, thus increasing their perception of both normal and disordered syllables. Statistical analysis using the intraclass correlation coefficient revealed high interrater reliability and high correlations among the trained students for both spontaneous speech and paragraph reading, thus indicating similar training experiences and perceptions. This method appeared to be a more quantitative measure of perceptual ratings than current scales, which use general gradations of dysphonic severity. Moreover, the DSPs were similar between the newly trained students and experienced raters for spontaneous speech, indicating that the students could be trained in the direction experienced by voice clinicians. There was, however, a significant difference between the two groups for paragraph reading, which will be discussed. It was concluded that the DSP method was an effective technique to train students to recognize dysphonia.	t	\N
23882007	In this article, the authors examine (a) the effect of semantic context on accentedness, comprehensibility, and intelligibility of Spanish-accented American English (AE) as judged by monolingual AE listeners and (b) the interaction of semantic context and accentedness on comprehensibility and intelligibility. Twenty adult native (L1) Spanish speakers proficient in AE and 4 L1 AE speakers (controls) read 48 statements consisting of true-false, semantically meaningful, and semantically anomalous sentences. Eighty monolingual AE listeners assessed accentedness, comprehensibility, and intelligibility of the statements. A significant main effect was found for semantic category on all 3 dependent variables. Accents were perceived to be stronger, and both comprehensibility and intelligibility were worse, in semantically anomalous contexts. Speaker data were grouped into strong, mid-level, and mild accents. The interaction between semantic category and accent was significant for both comprehensibility and intelligibility. The effect of semantic context was strongest for strong accents. Intelligibility was excellent for speakers with mid-level accents in true-false and semantically meaningful contexts, and it was excellent for mild accents in all contexts. Listeners access semantic information, in addition to phonetic and phonotactic features, in the perception of nonnative speech. Both accent level and semantic context are important in research on foreign-accented speech.	t	\N
23883307	The irrelevant sound effect (ISE) is the finding that serial recall performance is impaired under complex auditory backgrounds such as speech as compared to white noise or silence. Several findings have demonstrated that ISE occurs with nonspeech backgrounds and that the changing-state complexity of the background stimuli is critical to ISE. In a pair of experiments, we investigate whether speech-like qualities of the irrelevant background have an effect beyond their changing-state complexity. We do so by using two kinds of transformations of speech with identical changing-state complexity: one kind that preserved speech-like information (sinewave speech and fully reversed sinewave speech) and others in which this information was distorted (two selectively reversed sinewave speech conditions). Our results indicate that even when changing-state complexity is held constant, sinewave speech conditions in which speech-like interformant relationships are disrupted, produce less ISE than those in which these relationships are preserved. This indicates that speech-like properties of the background are important beyond their changing-state complexity for ISE.	t	\N
23885549	In patients with epileptic lesions in the hippocampus as well as in the temporal lobe and hippocampus simultaneously, studies were made on the perception of sound signals imitating sound source movement. It was established that hippocampal lesion results in disturbance of estimation of sound spatial characteristics which manifests in a change accuracy of localization and shortening of subjective sound image movement trajectory. Maximum disturbances of localization function are observed during lesions of hippocampus and temporal lobe. Possible neurophysiological mechanism underling observed disturbances are considered.	t	\N
23886425	Communication while traveling in an automobile often is very difficult for hearing aid users. This is because the automobile/road noise level is usually high, and listeners/drivers often do not have access to visual cues. Since the talker of interest usually is not located in front of the listener/driver, conventional directional processing that places the directivity beam toward the listener's front may not be helpful and, in fact, could have a negative impact on speech recognition (when compared to omnidirectional processing). Recently, technologies have become available in commercial hearing aids that are designed to improve speech recognition and/or listening effort in noisy conditions where talkers are located behind or beside the listener. These technologies include (1) a directional microphone system that uses a backward-facing directivity pattern (Back-DIR processing), (2) a technology that transmits audio signals from the ear with the better signal-to-noise ratio (SNR) to the ear with the poorer SNR (Side-Transmission processing), and (3) a signal processing scheme that suppresses the noise at the ear with the poorer SNR (Side-Suppression processing). The purpose of the current study was to determine the effect of (1) conventional directional microphones and (2) newer signal processing schemes (Back-DIR, Side-Transmission, and Side-Suppression) on listener's speech recognition performance and preference for communication in a traveling automobile. A single-blinded, repeated-measures design was used. Twenty-five adults with bilateral symmetrical sensorineural hearing loss aged 44 through 84 yr participated in the study. The automobile/road noise and sentences of the Connected Speech Test (CST) were recorded through hearing aids in a standard van moving at a speed of 70 mph on a paved highway. The hearing aids were programmed to omnidirectional microphone, conventional adaptive directional microphone, and the three newer schemes. CST sentences were presented from the side and back of the hearing aids, which were placed on the ears of a manikin. The recorded stimuli were presented to listeners via earphones in a sound-treated booth to assess speech recognition performance and preference with each programmed condition. Compared to omnidirectional microphones, conventional adaptive directional processing had a detrimental effect on speech recognition when speech was presented from the back or side of the listener. Back-DIR and Side-Transmission processing improved speech recognition performance (relative to both omnidirectional and adaptive directional processing) when speech was from the back and side, respectively. The performance with Side-Suppression processing was better than with adaptive directional processing when speech was from the side. The participants' preferences for a given processing scheme were generally consistent with speech recognition results. The finding that performance with adaptive directional processing was poorer than with omnidirectional microphones demonstrates the importance of selecting the correct microphone technology for different listening situations. The results also suggest the feasibility of using hearing aid technologies to provide a better listening experience for hearing aid users in automobiles.	t	\N
23888869	Binding is key in multisensory perception. This study investigated the audio-visual (A-V) temporal binding window in 4-, 5-, and 6-year-old children (total N = 120). Children watched a person uttering a syllable whose auditory and visual components were either temporally synchronized or desynchronized by 366, 500, or 666 ms. They were asked whether the voice and face went together (Experiment 1) or whether the desynchronized videos differed from the synchronized one (Experiment 2). Four-year-olds detected the 666-ms asynchrony, 5-year-olds detected the 666- and 500-ms asynchrony, and 6-year-olds detected all asynchronies. These results show that the A-V temporal binding window narrows slowly during early childhood and that it is still wider at 6 years of age than in older children and adults.	t	\N
23891107	Biologically salient sounds, including speech, are rarely heard in isolation. Our brains must therefore organize the input arising from multiple sources into separate "streams" and, in the case of speech, map the acoustic components of the target signal onto meaning. These auditory and linguistic processes have traditionally been considered to occur sequentially and are typically studied independently [1, 2]. However, evidence that streaming is modified or reset by attention [3], and that lexical knowledge can affect reports of speech sound identity [4, 5], suggests that higher-level factors may influence perceptual organization. In two experiments, listeners heard sequences of repeated words or acoustically matched nonwords. After several presentations, they reported that the initial /s/ sound in each syllable formed a separate stream; the percept then fluctuated between the streamed and fused states in a bistable manner. In addition to measuring these verbal transformations, we assessed streaming objectively by requiring listeners to detect occasional targets-syllables containing a gap after the initial /s/. Performance was better when streaming caused the syllables preceding the target to transform from words into nonwords, rather than from nonwords into words. Our results show that auditory stream formation is influenced not only by the acoustic properties of speech sounds, but also by higher-level processes involved in recognizing familiar words.	t	\N
23893201	Mimicking the human ear on the basis of auditory models has become a viable approach in many applications by now. However, only a few attempts have been made to extend the scope of physiological ear models to be employed in cochlear implants (CI). Contemporary CI systems rely on much simpler filter banks and simulate the natural signal processing of a healthy cochlea to only a very limited extent. When looking at rehabilitation outcomes, current systems seem to have reached their peak potential, which signals the need for better algorithms and/or technologies. In this paper, we present a novel sound processing strategy, SAM (Stimulation based on Auditory Modeling), that is based on neurophysiological models of the human ear and can be employed in auditory prostheses. It incorporates active cochlear filtering (basilar membrane and outer hair cells) along with the mechanoelectrical transduction of the inner hair cells, so that several psychoacoustic phenomena are accounted for inherently. Although possible, current implementation does not make use of parallel stimulation of the electrodes, which matches state-of-the-art CI hardware. This paper elaborates on SAM's signal processing and provides a computational evaluation of the strategy. Results show that aspects of normal cochlear processing that are missing in common strategies can be replicated by SAM. This is supposed to improve overall CI user performance, which we have at least partly proven in a pilot study with implantees.	t	\N
23893677	Several studies suggest that auditory perception in general and the perception of speech in noise in particular continue to develop until late childhood or early adolescence. It remains unclear, however, whether this prolonged development results from the maturation of the cognitive factors associated with the performance of auditory tasks or from the slow development of auditory sensory processing. We investigated the perception of monosyllabic words embedded in white noise in Hebrew-speaking school-age children and adults. Although identification thresholds did not become adult-like until 11 years of age, we found no evidence linking this prolonged development to non-sensory factors associated with performance consistency. Therefore, we suggest that similar to the development of amplitude and frequency modulation detection thresholds, this protracted development is related to the maturation of auditory sensory processing.	t	\N
23893940	Videos can be used as didactic tools for self-learning under several circumstances, including those cases in which students are responsible for the development of this resource as an audiovisual notebook. We compared students' and teachers' perceptions regarding the main features that an audiovisual notebook should include. Four questionnaires with items about information, images, text and music, and filmmaking were used to investigate students' (n = 115) and teachers' perceptions (n = 28) regarding the development of a video focused on a histological technique. The results show that both students and teachers significantly prioritize informative components, images and filmmaking more than text and music. The scores were significantly higher for teachers than for students for all four components analyzed. The highest scores were given to items related to practical and medically oriented elements, and the lowest values were given to theoretical and complementary elements. For most items, there were no differences between genders. A strong positive correlation was found between the scores given to each item by teachers and students. These results show that both students' and teachers' perceptions tend to coincide for most items, and suggest that audiovisual notebooks developed by students would emphasize the same items as those perceived by teachers to be the most relevant. Further, these findings suggest that the use of video as an audiovisual learning notebook would not only preserve the curricular objectives but would also offer the advantages of self-learning processes.	t	\N
23905279	Recent studies in the field of intonational phonology have shown that information-seeking questions can be distinguished from confirmation-seeking questions by prosodic means in a variety of languages (Armstrong, 2010, for Puerto Rican Spanish; Grice & Savino, 1997, for Bari Italian; Kügler, 2003, for Leipzig German; Mata & Santos, 2010, for European Portuguese; Vanrell, Mascaró, Prieto, & Torres-Tamarit, 2010, for Catalan). However, all these studies have relied on production experiments and little is known about the perceptual relevance of these intonational cues. This paper explores whether Majorcan Catalan listeners distinguish information- and confirmation-seeking questions by means of two distinct nuclear falling pitch accents. Three behavioral tasks were conducted with 20 Majorcan Catalan subjects, namely a semantic congruity test, a rating test, and a classical categorical perception identification/discrimination test. The results show that a difference in pitch scaling on the leading H tone of the H+L* nuclear pitch accent is the main cue used by Majorcan Catalan listeners to distinguish confirmation questions from information-seeking questions. Thus, while a iH+L* pitch accent signals an information-seeking question (i.e., the speaker has no expectation about the nature of the answer), the H+L* pitch accent indicates that the speaker is asking about mutually shared information. We argue that these results have implications in representing the distinctions of tonal height in Catalan. The results also support the claim that phonological contrasts in intonation, together with other linguistic strategies, can signal the speakers' beliefs about the certainty of the proposition expressed.	t	\N
23906967	Training people on temporal discrimination can substantially improve performance in the trained modality but also in untrained modalities. A pretest-training-posttest design was used to investigate whether consolidation plays a crucial role for training effects within the trained modality and its transfer to another modality. In the pretest, both auditory and visual discrimination performance was assessed. In the training phase, participants performed only the auditory task. After a consolidation interval of either 5 min or 24h, participants were again tested in both the auditory and visual tasks. Irrespective of the consolidation interval, performance improved from the pretest to the posttest in both modalities. Most importantly, the training effect for the trained auditory modality was independent of the consolidation interval whereas the transfer effect to the visual modality was larger after 24h than after 5 min. This finding shows that transfer effects benefit from extended consolidation.	t	\N
23921930	To confirm the clinical efficacy and safety of a direct acoustic cochlear implant. Prospective multicenter study. The study was performed at 3 university hospitals in Europe (Germany, The Netherlands, and Switzerland). Fifteen patients with severe-to-profound mixed hearing loss because of otosclerosis or previous failed stapes surgery. Implantation with a Codacs direct acoustic cochlear implant investigational device (ID) combined with a stapedotomy with a conventional stapes prosthesis Preoperative and postoperative (3 months after activation of the investigational direct acoustic cochlear implant) audiometric evaluation measuring conventional pure tone and speech audiometry, tympanometry, aided thresholds in sound field and hearing difficulty by the Abbreviated Profile of Hearing Aid Benefit questionnaire. The preoperative and postoperative air and bone conduction thresholds did not change significantly by the implantation with the investigational Direct Acoustic Cochlear Implant. The mean sound field thresholds (0.25-8 kHz) improved significantly by 48 dB. The word recognition scores (WRS) at 50, 65, and 80 dB SPL improved significantly by 30.4%, 75%, and 78.2%, respectively, after implantation with the investigational direct acoustic cochlear implant compared with the preoperative unaided condition. The difficulty in hearing, measured by the Abbreviated Profile of Hearing Aid Benefit, decreased by 27% after implantation with the investigational direct acoustic cochlear implant. Patients with moderate-to-severe mixed hearing loss because of otosclerosis can benefit substantially using the Codacs investigational device.	t	\N
23927112	The present study evaluated the influence of suppressor frequency (fs) and level (Ls) on stimulus-frequency otoacoustic emissions (SFOAEs) recorded using the amplitude-modulated (AM) suppressor technique described by Neely et al. [J. Acoust. Soc. Am. 118, 2124-2127 (2005a)]. Data were collected in normal-hearing subjects, with data collection occurring in two phases. In phase 1, SFOAEs were recorded with probe frequency (fp) = 1, 2, and 4 kHz and probe levels (Lp) ranging from 0 to 60 dB sound pressure level (SPL). At each fp, Ls ranged from Ls = Lp to Ls = Lp + 30 dB. Additionally, nine relationships between fs and fp were evaluated, ranging from fs/fp = 0.80 to fs/fp = 1.16. Results indicated that for low suppressor levels, suppressors higher in frequency than fp (fs > fp) resulted in higher AM-SFOAE levels than suppressors lower in frequency than fp (fs < fp). At higher suppressor levels, suppressors both higher and lower in frequency than fp produced similar AM-SFOAE levels, and, in many cases, low-frequency suppressors produced the largest response. Recommendations for stimulus parameters that maximize AM-SFOAE level were derived from these data. In phase 2, AM-SFOAEs were recorded using these parameters for fp = 0.7-8 kHz and Lp = 20-60 dB SPL. Robust AM-SFOAE responses were recorded in this group of subjects using the parameters developed in phase 1.	t	\N
23927121	Real-world sound sources are usually perceived as externalized and thus properly localized in both direction and distance. This is largely due to (1) the acoustic filtering by the head, torso, and pinna, resulting in modifications of the signal spectrum and thereby a frequency-dependent shaping of interaural cues and (2) interaural cues provided by the reverberation inside an enclosed space. This study first investigated the effect of room reverberation on the spectro-temporal behavior of interaural level differences (ILDs) by analyzing dummy-head recordings of speech played at different distances in a standard listening room. Next, the effect of ILD fluctuations on the degree of externalization was investigated in a psychoacoustic experiment performed in the same listening room. Individual binaural impulse responses were used to simulate a distant sound source delivered via headphones. The ILDs were altered using a gammatone filterbank for analysis and resynthesis, where the envelopes of the left and right-ear signals were modified such that the naturally occurring fluctuations of the ILDs were restricted. This manipulation reduced the perceived degree of externalization. This was consistent with the analysis of short-term ILDs at different distances showing that a decreased distance to the sound source also reduced the ILD fluctuations.	t	\N
23927128	The perturbation of acoustic features in a speaker's auditory feedback elicits rapid compensatory responses that demonstrate the importance of auditory feedback for control of speech output. The current study investigated whether responses to a perturbation of speech auditory feedback vary depending on the importance of the perturbed feature to perception of the vowel being produced. Auditory feedback of speakers' first formant frequency (F1) was shifted upward by 130 mels in randomly selected trials during the speakers' production of consonant-vowel-consonant words containing either the vowel /Λ/ or the vowel /ɝ/. Although these vowels exhibit comparable F1 frequencies, the contribution of F1 to perception of /Λ/ is greater than its contribution to perception of /ɝ/. Compensation to the F1 perturbation was observed during production of both vowels, but compensatory responses during /Λ/ occurred at significantly shorter latencies and exhibited significantly larger magnitudes than compensatory responses during /ɝ/. The finding that perturbation of vowel F1 during /Λ/ and /ɝ/ yielded compensatory differences that mirrored the contributions of F1 to perception of these vowels indicates that some portion of feedback control is weighted toward monitoring and preservation of acoustic cues for speech perception.	t	\N
23933139	A series of five experiments investigated the extent of subliminal processing of negation. Participants were presented with a subliminal instruction to either pick or not pick an accompanying noun, followed by a choice of two nouns. By employing subjective measures to determine individual thresholds of subliminal priming, the results of these studies indicated that participants were able to identify the correct noun of the pair--even when the correct noun was specified by negation. Furthermore, using a grey-scale contrast method of masking, Experiment 5 confirmed that these priming effects were evidenced in the absence of partial awareness, and without the effect being attributed to the retrieval of stimulus-response links established during conscious rehearsal.	t	\N
23937006	To analyse the impact of tinnitus loudness, tinnitus frequency, hearing loss, tinnitus subjective loudness on the life of tinnitus patient. To inspect the 154 tinnitus patients with pure tone audiometry, tinnitus matching, tinnitus classification questionnaire and THI scale. This study applies THI scale to evaluate the impact of tinnitus on the life of tinnitus patient. Using statistical methods to analyse the relationship between tinnitus loudness, tinnitus frequency, hearing loss, tinnitus subjective loudness and the impact of tinnitus on the life of tinnitus patient. (1) Tinnitus frequency is closed with the frequency of hearing loss. (2) There is no significant correlation between tinnitus loudness and the impact of tinnitus on the life of tinnitus patient. (3) There is no distinction between hearing loss and the THI scores. (4) The patient gets more scores in subgroup of THI with the increase of tinnitus subjective loudness classification. The impact of tinnitus loudness, hearing loss on the life of tinnitus patient is not very clearly, while the impact of tinnitus subjective loudness classification on the life of tinnitus patient is significant. In clinical, we can not evaluate the effect of the tinnitus treatment relying on tinnitus loudness and hearing loss simply.The finding provides us individual treatment to tinnitus patients.	t	\N
23937689	Several accounts of speech perception propose that the areas involved in producing language are also involved in perceiving it. In line with this view, neuroimaging studies show activation of premotor cortex (PMC) during phoneme judgment tasks; however, there is debate about whether speech perception necessarily involves motor processes, across all task contexts, or whether the contribution of PMC is restricted to tasks requiring explicit phoneme awareness. Some aspects of speech processing, such as mapping sounds onto meaning, may proceed without the involvement of motor speech areas if PMC specifically contributes to the manipulation and categorical perception of phonemes. We applied TMS to three sites-PMC, posterior superior temporal gyrus, and occipital pole-and for the first time within the TMS literature, directly contrasted two speech perception tasks that required explicit phoneme decisions and mapping of speech sounds onto semantic categories, respectively. TMS to PMC disrupted explicit phonological judgments but not access to meaning for the same speech stimuli. TMS to two further sites confirmed that this pattern was site specific and did not reflect a generic difference in the susceptibility of our experimental tasks to TMS: stimulation of pSTG, a site involved in auditory processing, disrupted performance in both language tasks, whereas stimulation of occipital pole had no effect on performance in either task. These findings demonstrate that, although PMC is important for explicit phonological judgments, crucially, PMC is not necessary for mapping speech onto meanings.	t	\N
23943499	To investigate the effect of semantic congruity on audiovisual target responses, participants detected a semantic concept that was embedded in a series of rapidly presented stimuli. The target concept appeared as a picture, an environmental sound, or both; and in bimodal trials, the audiovisual events were either consistent or inconsistent in their representation of a semantic concept. The results showed faster detection latencies to bimodal than to unimodal targets and a higher rate of missed targets when visual distractors were presented together with auditory targets, in comparison to auditory targets presented alone. The findings of Experiment 2 showed a cross-modal asymmetry, such that visual distractors were found to interfere with the accuracy of auditory target detection, but auditory distractors had no effect on either the speed or the accuracy of visual target detection. The biased-competition theory of attention (Desimone & Duncan Annual Review of Neuroscience 18: 1995; Duncan, Humphreys, & Ward Current Opinion in Neurobiology 7: 255-261 1997) was used to explain the findings because, when the saliency of the visual stimuli was reduced by the addition of a noise filter in Experiment 4, visual interference on auditory target detection was diminished. Additionally, the results showed faster and more accurate target detection when semantic concepts were represented in a visual rather than an auditory format.	t	\N
23966690	Aging listeners experience greater difficulty understanding speech in adverse listening conditions and exhibit degraded temporal resolution, even when audiometric thresholds are normal. When threshold evidence for peripheral involvement is lacking, central and cognitive factors are often cited as underlying performance declines. However, previous work has uncovered widespread loss of cochlear afferent synapses and progressive cochlear nerve degeneration in noise-exposed ears with recovered thresholds and no hair cell loss (Kujawa and Liberman 2009). Here, we characterize age-related cochlear synaptic and neural degeneration in CBA/CaJ mice never exposed to high-level noise. Cochlear hair cell and neuronal function was assessed via distortion product otoacoustic emissions and auditory brainstem responses, respectively. Immunostained cochlear whole mounts and plastic-embedded sections were studied by confocal and conventional light microscopy to quantify hair cells, cochlear neurons, and synaptic structures, i.e., presynaptic ribbons and postsynaptic glutamate receptors. Cochlear synaptic loss progresses from youth (4 weeks) to old age (144 weeks) and is seen throughout the cochlea long before age-related changes in thresholds or hair cell counts. Cochlear nerve loss parallels the synaptic loss, after a delay of several months. Key functional clues to the synaptopathy are available in the neural response; these can be accessed noninvasively, enhancing the possibilities for translation to human clinical characterization.	t	\N
23966965	Williams syndrome (WS), a genetic, neurodevelopmental disorder, is of keen interest to music cognition researchers because of its characteristic auditory sensitivities and emotional responsiveness to music. However, actual musical perception and production abilities are more variable. We examined musicality in WS through the lens of amusia and explored how their musical perception abilities related to their auditory sensitivities, musical production skills, and emotional responsiveness to music. In our sample of 73 adolescents and adults with WS, 11% met criteria for amusia, which is higher than the 4% prevalence rate reported in the typically developing (TD) population. Amusia was not related to auditory sensitivities but was related to musical training. Performance on the amusia measure strongly predicted musical skill but not emotional responsiveness to music, which was better predicted by general auditory sensitivities. This study represents the first time amusia has been examined in a population with a known neurodevelopmental genetic disorder with a range of cognitive abilities. Results have implications for the relationships across different levels of auditory processing, musical skill development, and emotional responsiveness to music, as well as the understanding of gene-brain-behavior relationships in individuals with WS and TD individuals with and without amusia.	t	\N
23967947	The purpose of this study was to analyze vowel articulation across various speaking tasks in a group of 20 early Parkinson's disease (PD) individuals prior to pharmacotherapy. Vowels were extracted from sustained phonation, sentence repetition, reading passage, and monologue. Acoustic analysis was based upon measures of the first (F1) and second (F2) formant of the vowels /a/, /i/, and /u/, vowel space area (VSA), F2i/F2u and vowel articulation index (VAI). Parkinsonian speakers manifested abnormalities in vowel articulation across F2u, VSA, F2i/F2u, and VAI in all speaking tasks except sustained phonation, compared to 15 age-matched healthy control participants. Findings suggest that sustained phonation is an inappropriate task to investigate vowel articulation in early PD. In contrast, monologue was the most sensitive in differentiating between controls and PD patients, with classification accuracy up to 80%. Measurements of vowel articulation were able to capture even minor abnormalities in speech of PD patients with no perceptible dysarthria. In conclusion, impaired vowel articulation may be considered as a possible early marker of PD. A certain type of speaking task can exert significant influence on vowel articulation. Specifically, complex tasks such as monologue are more likely to elicit articulatory deficits in parkinsonian speech, compared to other speaking tasks.	t	\N
23977030	Mechanisms of propofol-induced loss of consciousness remain poorly understood. Recent fMRI studies have shown decreases in functional connectivity during unconsciousness induced by this anesthetic agent. Functional connectivity does not provide information of directional changes in the dynamics observed during unconsciousness. The aim of the present study was to investigate, in healthy humans during an auditory task, the changes in effective connectivity resulting from propofol induced loss of consciousness. We used Dynamic Causal Modeling for fMRI (fMRI-DCM) to assess how causal connectivity is influenced by the anesthetic agent in the auditory system. Our results suggest that the dynamic observed in the auditory system during unconsciousness induced by propofol, can result in a mixture of two effects: a local inhibitory connectivity increase and a decrease in the effective connectivity in sensory cortices.	t	\N
23978930	Faces presented upside-down are harder to recognize than presented right-side up, an effect known as the face inversion effect. With inversion the perceptual processing of the spatial relationship among facial parts is disrupted. Previous literature indicates a face inversion effect in chimpanzees toward familiar and conspecific faces. Although these results are not inconsistent with findings from humans they have some controversy in their methodology. Here, we employed a delayed matching-to-sample task to test captive chimpanzees on discriminating chimpanzee and human faces. Their performances were deteriorated by inversion. More importantly, the discrimination deterioration was systematically different between the two age groups of chimpanzee participants, i.e. young chimpanzees showed a stronger inversion effect for chimpanzee than for human faces, while old chimpanzees showed a stronger inversion effect for human than for chimpanzee faces. We conclude that the face inversion effect in chimpanzees is modulated by the level of expertise of face processing.	t	\N
23980148	Learning, the foundation of adaptive and intelligent behavior, is based on plastic changes in neural assemblies, reflected by the modulation of electric brain responses. In infancy, auditory learning implicates the formation and strengthening of neural long-term memory traces, improving discrimination skills, in particular those forming the prerequisites for speech perception and understanding. Although previous behavioral observations show that newborns react differentially to unfamiliar sounds vs. familiar sound material that they were exposed to as fetuses, the neural basis of fetal learning has not thus far been investigated. Here we demonstrate direct neural correlates of human fetal learning of speech-like auditory stimuli. We presented variants of words to fetuses; unlike infants with no exposure to these stimuli, the exposed fetuses showed enhanced brain activity (mismatch responses) in response to pitch changes for the trained variants after birth. Furthermore, a significant correlation existed between the amount of prenatal exposure and brain activity, with greater activity being associated with a higher amount of prenatal speech exposure. Moreover, the learning effect was generalized to other types of similar speech sounds not included in the training material. Consequently, our results indicate neural commitment specifically tuned to the speech features heard before birth and their memory representations.	t	\N
23992232	The detection of auditory stimuli that deviate from a simple or complex auditory regularity is reflected by the mismatch negativity component of the human auditory evoked potential. Moreover, simple deviants of an oddball paradigm modulate the preceding middle-latency response of the auditory evoked potential. For the frequency oddball paradigms it has been shown that the Nb wave, at approximately 40 ms from stimulus onset, is enhanced in response to deviant compared with standard stimuli. In this study we tested whether the detection of auditory deviants in a (frequency-location) feature-conjunction paradigm is reflected by modulations of the Na, Pa or Nb wave of healthy human participants. In addition, a frequency oddball paradigm was applied to directly contrast the results of a simple and a complex invariance. Feature-conjunction deviants did not elicit any modulations of the tested middle-latency waves. Deviants of the frequency oddball paradigm, by contrast, elicited an enhancement of the Nb wave, confirming the outcome of precedent studies. In both conditions a significant mismatch negativity component was elicited, which showed larger amplitudes and shorter latencies in the oddball condition than in the feature-conjunction condition. These findings corroborate the idea that simple auditory regularities are encoded upstream of those of more complex auditory features and are in line with the idea of a hierarchically working auditory novelty system.	t	\N
24003981	This study explores if increasing number of repetitions might improve the precision of the acceptable noise level (ANL) test. We measured twelve ANL repetitions, i.e. four complete ANL tests (4 × 3 repetitions), at one session using diotic presentation and a non-semantic ANL version. Thirty-two normal-hearing adults. Small order and fatigue effects were seen. We used the coefficient of repeatability (CR) to assess the repeatability; CRs ranged between 3.9 and 7.6 dB for the four ANL tests. Using the twelve ANL repetitions we removed the variability of the ANL across subjects by normalizing the data to the individual mean ANL for the twelve repetitions. The mean normalized ANL across the subjects rapidly approached the ANL normalized to the individual mean for the 12 repetitions (0 dB), and after three repetitions the SD seemed to be stable at about 3 dB. The findings suggest that both order and fatigue affect the ANL. The findings also suggest that it may be more accurate to speak of an acceptable noise range than ANL. These findings have large implications for how we understand acceptable noise and it would explain a large part of the variability seen among normal-hearing and perhaps hearing-impaired subjects.	t	\N
24005271	The objective of the present study was to evaluate the hearing function in the airport technical personnel and estimate the effectiveness of multicomponent anti-noise hearing protectors used by the specialists engaged in the aircraft maintenance. The tonal threshold audiometry was carried out before and after a shiftwork. The extra-aural effect of noise was assessed from the characteristics of cardiac rhythm variability. The study included two groups of subjects: in one of them (n=8) they used ordinary flight headsets (control) in the other the protection was ensured with the help of multi-insert hearing protectors (n=16). The initial hearing thresholds were found to be increased up to 70 and 60 dB at the frequencies of 4 and 8 kHz respectively. The regression analysis revealed the relationship between these parameters and the duration of aerodrome work experience. Temporary threshold shifts were observed only in the control group. An increase in the tone of the sympathetic nervous system was observed in the control subjects but was absent in the study group. It is concluded that the multi-component hearing protectors employed in the present study are highly efficacious anti-noise devices. The mechanisms of noise-induced hearing loss are discussed.	t	\N
24007920	Many previous studies have shown that the human language processor is capable of rapidly integrating information from different sources during reading or listening. Yet, little is known about how this ability develops from child to adulthood. To gain insight into how children (in comparison to adults) handle different kinds of linguistic information during on-line language comprehension, the current study investigates a well-known morphological phenomenon that is subject to both structural and semantic constraints, the plurals-in-compounds effect, i.e. the dislike of plural (specifically regular plural) modifiers inside compounds (e.g. rats eater). We examined 96 seven-to-twelve-year-old children and a control group of 32 adults measuring their eye-gaze changes in response to compound-internal plural and singular forms. Our results indicate that children rely more upon structural properties of language (in the present case, morphological cues) early in development and that the ability to efficiently integrate information from multiple sources takes time for children to reach adult-like levels.	t	\N
24012681	It was recently shown that brain activity can be represented as a stimulation-specific vector field. Since the vector field of brain activity is specifically transformed by sensory input, we suggested that a tensor field that transforms brain activity reflects sensory input. We calculated the tensor fields that transform brain activity between visual baseline and auditory word processing in PET data and between environmental sounds and auditory word processing in fMRI data. In the first comparison, significant clusters formed a distributed network over the brain cortex. In the second comparison, clusters were more localised in the temporo-frontal network of speech processing. Our study therefore demonstrated that tensor fields reflect the sensory input that specifically transforms brain activity.	t	\N
24016155	For studying multistable auditory perception, we propose a paradigm that evokes integrated or segregated perception of a sound sequence, and permits decomposition of the segregated grouping into foreground and background sounds. The paradigm combines 3-tone pitch patterns with alternating timbres, resulting in a repeating 6-tone structure that can be perceived as rising based on temporal proximity, or as falling based on timbre similarity. Listeners continuously report their percept while EEG is recorded. Results show an ERP modulation starting at ∼70 ms after sound onset that can be explained by whether a sound belongs to perceived foreground or background, with no additional effect of integrated versus segregated grouping. Auditory grouping as indexed by the mismatch negativity did not correspond with reported sound grouping. The paradigm offers a new possibility for investigating effects of conscious perceptual organization on sound processing.	t	\N
24018572	The purpose of this study was to determine the effects of hearing instruments set to Desired Sensation Level version 5 (DSL v5) hearing instrument prescription algorithm targets and equipped with directional microphones and digital noise reduction (DNR) on children's sentence recognition in noise performance and loudness perception in a classroom environment. Ten children (ages 8-17 years) with stable, congenital sensorineural hearing losses participated in the study. Participants were fitted bilaterally with behind-the-ear hearing instruments set to DSL v5 prescriptive targets. Sentence recognition in noise was evaluated using the Bamford-Kowal-Bench Speech in Noise Test (Niquette et al., 2003). Loudness perception was evaluated using a modified version of the Contour Test of Loudness Perception (Cox, Alexander, Taylor, & Gray, 1997). Children's sentence recognition in noise performance was significantly better when using directional microphones alone or in combination with DNR than when using omnidirectional microphones alone or in combination with DNR. Children's loudness ratings for sounds above 72 dB SPL were lowest when fitted with the DSL v5 Noise prescription combined with directional microphones. DNR use showed no effect on loudness ratings. Use of the DSL v5 Noise prescription with a directional microphone improved sentence recognition in noise performance and reduced loudness perception ratings for loud sounds relative to a typical clinical reference fitting with the DSL v5 Quiet prescription with no digital signal processing features enabled. Potential clinical strategies are discussed.	t	\N
24021849	Dehaene et al. (2003) showed an absence of conscious, but not masked, conflict effects when patients with schizophrenia performed a number-categorisation priming task. We aimed to replicate these influential results using a different word-categorisation priming task. Counter to Dehaene et al.'s findings, 21 patients and 20 healthy controls showed similar congruence effects for both masked and visible primes. Within patients, a reduced congruence effect for visible primes associated with longer duration of illness and more severe behavioural disorganisation. Patients, unlike controls, were no slower to respond to targets that followed visible compared to masked primes. Conscious conflict effects on priming tasks are not universally reduced in schizophrenia but may associate with chronicity and behavioural disorganisation. That patients were no slower when the preceding primes were clearly visible accords with evidence elsewhere that information processing in schizophrenia is driven more by immediate conscious experience and constrained less by prior events.	t	\N
24024543	Emotion in audio-voice signals, as synthesized by text-to-speech (TTS) technologies, was investigated to formulate a theory of expression for user interface design. Emotional parameters were specified with markup tags, and the resulting audio was further modulated with post-processing techniques. Software was then developed to link a selected TTS synthesizer with an automatic speech recognition (ASR) engine, producing a chatbot that could speak and listen. Using these two artificial voice subsystems, investigators explored both artistic and psychological implications of artificial speech emotion. Goals of the investigation were interdisciplinary, with interest in musical composition, augmentative and alternative communication (AAC), commercial voice announcement applications, human-computer interaction (HCI), and artificial intelligence (AI). The work-in-progress points towards an emerging interdisciplinary ontology for artificial voices. As one study output, HCI tools are proposed for future collaboration.	t	\N
24028995	Recent theory of physiology of language suggests a dual stream dorsal/ventral organization of speech perception. Using intra-cerebral Event-related potentials (ERPs) during pre-surgical assessment of twelve drug-resistant epileptic patients, we aimed to single out electrophysiological patterns during both lexical-semantic and phonological monitoring tasks involving ventral and dorsal regions respectively. Phonological information processing predominantly occurred in the left supra-marginal gyrus (dorsal stream) and lexico-semantic information occurred in anterior/middle temporal and fusiform gyri (ventral stream). Similar latencies were identified in response to phonological and lexico-semantic tasks, suggesting parallel processing. Typical ERP components were strongly left lateralized since no evoked responses were recorded in homologous right structures. Finally, ERP patterns suggested the inferior frontal gyrus as the likely final common pathway of both dorsal and ventral streams. These results brought out detailed evidence of the spatial-temporal information processing in the dual pathways involved in speech perception.	t	\N
24032320	The purpose was the develop a questionnaire to identify the specific listening difficulties of second language (L2) learners. Based on previous research, a questionnaire containing 31 items was developed and administered to 1,056 college freshmen. The whole sample was split randomly into two subsamples, each containing 528 cases. Exploratory factor analysis was performed to analyse the first subsample, and six factors were extracted, explaining a total of 57.1% of variance. To test the factor model, confirmatory factor analysis was conducted with the second subsample. Various fit indices were examined. The best fitting model for the data was a 23-item, six-factor model representing text, input channel and surroundings, relevance, listener, speaker, and task. Apart from the listener factor, all components are external ones and deemed to be uncontrollable by listeners. L2 learners must take an active role in listening practice to overcome L2 listening difficulties.	t	\N
24034879	This paper presents an analytic procedure to assist safety practitioners in evaluating the audibility of an existing auditory warning system in their workplaces. Two alarm location models are described: (a) a model with an unknown signal sound level, and (b) a model with a known signal sound level. A heuristic algorithm to determine a minimum number of alarm devices and their locations so that the warning signals can be clearly heard by workers is also proposed. The algorithm considers the ambient noise level, noise levels generated by individual machines, locations where workers are likely to be present, and noise levels at worker locations. From the numerical examples and the computation experiment, both the optimization and heuristic approaches yield solutions that satisfy the 15-dBA constraints. The heuristic approach is efficient in solving large alarm location problems due its capability to find near-optimal solutions within reasonable computation time.	t	\N
24035819	Given recent interest in syllabic rates (∼2-5 Hz) for speech processing, we review the perception of "fluctuation" range (∼1-10 Hz) modulations during listening to speech and technical auditory stimuli (AM and FM tones and noises, and ripple sounds). We find evidence that the temporal modulation transfer function (TMTF) of human auditory perception is not simply low-pass in nature, but rather exhibits a peak in sensitivity in the syllabic range (∼2-5 Hz). We also address human and animal neurophysiological evidence, and argue that this bandpass tuning arises at the thalamocortical level and is more associated with non-primary regions than primary regions of cortex. The bandpass rather than low-pass TMTF has implications for modeling auditory central physiology and speech processing: this implicates temporal contrast rather than simple temporal integration, with contrast enhancement for dynamic stimuli in the fluctuation range. This article is part of a Special Issue entitled "Communication Sounds and the Brain: New Directions and Perspectives".	t	\N
24042339	Behavioral improvement within the first hour of training is commonly explained as procedural learning (i.e., strategy changes resulting from task familiarization). However, it may additionally reflect a rapid adjustment of the perceptual and/or attentional system in a goal-directed task. In support of this latter hypothesis, we show feature-specific gains in performance for groups of participants briefly trained to use either a spectral or spatial difference between 2 vowels presented simultaneously during a vowel identification task. In both groups, the neuromagnetic activity measured during the vowel identification task following training revealed source activity in auditory cortices, prefrontal, inferior parietal, and motor areas. More importantly, the contrast between the 2 groups revealed a striking double dissociation in which listeners trained on spectral or spatial cues showed higher source activity in ventral ("what") and dorsal ("where") brain areas, respectively. These feature-specific effects indicate that brief training can implicitly bias top-down processing to a trained acoustic cue and induce a rapid recalibration of the ventral and dorsal auditory streams during speech segregation and identification.	t	\N
24047945	Tests of auditory perception, such as those used in the assessment of central auditory processing disorders ([C]APDs), represent a domain in audiological assessment where measurement of this theoretical construct is often confounded by nonauditory abilities due to methodological shortcomings. These confounds include the effects of cognitive variables such as memory and attention and suboptimal testing paradigms, including the use of verbal reproduction as a form of response selection. We argue that these factors need to be controlled more carefully and/or modified so that their impact on tests of auditory and visual perception is only minimal. To advocate for a stronger theoretical framework than currently exists and to suggest better methodological strategies to improve assessment of auditory processing disorders (APDs). Emphasis is placed on adaptive forced-choice psychophysical methods and the use of matched tasks in multiple sensory modalities to achieve these goals. Together, this approach has potential to improve the construct validity of the diagnosis, enhance and develop theory, and evolve into a preferred method of testing. Examination of methods commonly used in studies of APDs. Where possible, currently used methodology is compared to contemporary psychophysical methods that emphasize computer-controlled forced-choice paradigms. In many cases, the procedures used in studies of APD introduce confounding factors that could be minimized if computer-controlled forced-choice psychophysical methods were utilized. Ambiguities of interpretation, indeterminate diagnoses, and unwanted confounds can be avoided by minimizing memory and attentional demands on the input end and precluding the use of response-selection strategies that use complex motor processes on the output end. Advocated are the use of computer-controlled forced-choice psychophysical paradigms in combination with matched tasks in multiple sensory modalities to enhance the prospect of obtaining a valid diagnosis.	t	\N
24048514	We studied autistics by quantitative EEG spectral and coherence analysis during three experimental conditions: basal, watching a cartoon with audio (V-A), and with muted audio band (VwA). Significant reductions were found for the absolute power spectral density (PSD) in the central region for delta and theta, and in the posterior region for sigma and beta bands, lateralized to the right hemisphere. When comparing VwA versus the V-A in the midline regions, we found significant decrements of absolute PSD for delta, theta and alpha, and increments for the beta and gamma bands. In autistics, VwA versus V-A tended to show lower coherence values in the right hemisphere. An impairment of visual and auditory sensory integration in autistics might explain our results.	t	\N
24059595	The frequency-following response (FFR) is the compound phase-locked brainstem response to periodic components of sound stimuli, and is closely related to pitch perception. Its weak amplitude often prevents its measurement with a high signal-to-noise ratio (SNR). Recording of FFR using multichannel EEG is possible but expensive and it involves the manual screening of raw data. We describe a new method to extract FFR features by prescreening the raw data using automatic monitoring of sound pressure in the ear canal. Removal of stimulus artifacts, noise reduction, and data selection were systematically studied. The reliability of our new method was tested by comparing FFRs tracking accuracy and pitch perception in fifteen individuals with normal hearing. The extracted FFRs tracking accuracy was significantly correlated with behavioral measures of pitch perception, indicating that FFR could be used to represent individual differences in pitch perception ability among a population with similar hearing characteristics. The designed system could extract FFR signals more accurately with high SNR after signal prescreen and noise reduction.	t	\N
24060845	The present study investigated whether and how beat gesture (small baton-like hand movements used to emphasize information in speech) influences semantic processing as well as its interaction with pitch accent during speech comprehension. Event-related potentials were recorded as participants watched videos of a person gesturing and speaking simultaneously. The critical words in the spoken sentences were accompanied by a beat gesture, a control hand movement, or no hand movement, and were expressed either with or without pitch accent. We found that both beat gesture and control hand movement induced smaller negativities in the N400 time window than when no hand movement was presented. The reduced N400s indicate that both beat gesture and control movement facilitated the semantic integration of the critical word into the sentence context. In addition, the words accompanied by beat gesture elicited smaller negativities in the N400 time window than those accompanied by control hand movement over right posterior electrodes, suggesting that beat gesture has a unique role for enhancing semantic processing during speech comprehension. Finally, no interaction was observed between beat gesture and pitch accent, indicating that they affect semantic processing independently.	t	\N
24060967	Some dialogues are perceived as running more smoothly than others. To some extent that impression could be related to how well speakers adapt their prosody to each other. Adaptation in prosody can be signaled by the use of pitch accents that indicate how utterances are structurally related to those of the interlocutor (prosodic function) or by copying the interlocutor's prosodic features (prosodic form). The same acoustic features, such as pitch, are involved in both ways of adaptation. Further, function and form may require a different prosody for successful adaptation in certain discourse contexts. In this study we investigate to what extent interlocutors are perceived as good adapters, depending on whether the prosody of both speakers is functionally coherent or similar in form. This is done in two perception tests using prosodically manipulated dialogues. Results show that coherent functional prosody can be a cue for speaker adaptation and that this cue is more powerful than similarity in prosodic form.	t	\N
24066179	We tested non-musicians and musicians in an auditory psychophysical experiment to assess the effects of timbre manipulation on pitch-interval discrimination. Both groups were asked to indicate the larger of two presented intervals, comprised of four sequentially presented pitches; the second or fourth stimulus within a trial was either a sinusoidal (or "pure"), flute, piano, or synthetic voice tone, while the remaining three stimuli were all pure tones. The interval-discrimination tasks were administered parametrically to assess performance across varying pitch distances between intervals ("interval-differences"). Irrespective of timbre, musicians displayed a steady improvement across interval-differences, while non-musicians only demonstrated enhanced interval discrimination at an interval-difference of 100 cents (one semitone in Western music). Surprisingly, the best discrimination performance across both groups was observed with pure-tone intervals, followed by intervals containing a piano tone. More specifically, we observed that: 1) timbre changes within a trial affect interval discrimination; and 2) the broad spectral characteristics of an instrumental timbre may influence perceived pitch or interval magnitude and make interval discrimination more difficult.	t	\N
24073580	To establish music reference values for normal-hearing (NH) person in China, in order to give convenience in clinical application. The NH participant group included 39 subjects, of which 21 females and 18 males. Musical Sounds in Cochlear Implants test battery was used to assess the music perception ability for normal-hearing participants. The median pitch difference NH participants could discriminate for string was 2.5 semitones and 0.5 for flute. The average scores of rhythm discrimination, melody discrimination, chord discrimination, instrument identification and instrument number detection test were 86.1% (SD = 11.2), 76.5% (SD = 11.1), 75.6% (SD = 11.4), 89.9% (SD = 13.0) and 74.1% (SD = 20.7), respectively. The MuSIC test could be a test for music perception ability for China users and in further study the material that more suited for our culture should be added into it.	t	\N
24073684	Work on the perception of urban soundscapes has generated a number of perceptual models which are proposed as tools to test and evaluate soundscape interventions. However, despite the excessive sound levels and noise within hospital environments, perceptual models have not been developed for these spaces. To address this, a two-stage approach was developed by the authors to create such a model. First, semantics were obtained from listening evaluations which captured the feelings of individuals from hearing hospital sounds. Then, 30 participants rated a range of sound clips representative of a ward soundscape based on these semantics. Principal component analysis extracted a two-dimensional space representing an emotional-cognitive response. The framework enables soundscape interventions to be tested which may improve the perception of these hospital environments.	t	\N
24080979	Cochlear implantation (CI) is currently the main device option for children with auditory neuropathy spectrum disorder (ANSD) who receive minimal benefit from conventional amplification. This study examines potential prognostic factors associated with post-CI speech performance in this population. Retrospective chart review. Academic center. ANSD patients without inner ear abnormalities implanted with unilateral or bilateral CI between 1998 and 2010. CI and speech perception testing. Post-CI speech perception testing at 50 dBHL. "Good" performers were defined as patients with greater than 70% speech perception and "poor" performers less than 70%. Medical comorbidity, educational information, and social history were gathered. Twenty-seven patients met inclusion criteria. Mean age at diagnosis, first CI, and second CI in good performers were 2.5 ± 3.4, 3.4 ± 3.6, and 3.8 ± 1.6 years, respectively, compared with 9.7 ± 7.8, 14.8 ± 12.9, and 8.9 ± 3.5 in poor performers. Mean speech perception after first and second implantation for good performers trended at 85% and 90%, respectively, compared with 36% and 73% in poor performers. Better pre-CI PTA correlated with better post-CI speech perception. Patients with bilateral CI demonstrated better speech perception outcomes compared with unilateral CI use. Poor performers had later age of implantation, lower socioeconomic status, and lack of family support compared with good performers. ANSD patients who do not benefit from conventional amplification do well when implanted at a young age with proper access to education and habilitation training. Medical, social, and economic information may be helpful in predicting positive outcomes.	t	\N
24083896	In this cross-language study, six Italian and six French voice experts evaluated perceptually the speech of 27 Italian and 40 French patients with dysphonia to determine if there were differences based on native language. French and Italian voice specialists agreed substantially in their evaluations of the overall grade of dysphonia and moderately concerning roughness and breathiness. No statistically significant effects were found related to the language of the speakers with the exception of breathiness, a finding that was interpreted as being due to different voice pathologies in the patient groups. It was concluded that the perception of the overall grade of dysphonia and breathiness is not language-dependent, whereas the significant difference in the perception of roughness may be related to a perception/adaption process.	t	\N
24089502	Linguistic content can be conveyed both in speech and in writing. But how similar is the neural processing when the same real-life information is presented in spoken and written form? Using functional magnetic resonance imaging, we recorded neural responses from human subjects who either listened to a 7 min spoken narrative or read a time-locked presentation of its transcript. Next, within each brain area, we directly compared the response time courses elicited by the written and spoken narrative. Early visual areas responded selectively to the written version, and early auditory areas to the spoken version of the narrative. In addition, many higher-order parietal and frontal areas demonstrated strong selectivity, responding far more reliably to either the spoken or written form of the narrative. By contrast, the response time courses along the superior temporal gyrus and inferior frontal gyrus were remarkably similar for spoken and written narratives, indicating strong modality-invariance of linguistic processing in these circuits. These results suggest that our ability to extract the same information from spoken and written forms arises from a mixture of selective neural processes in early (perceptual) and high-order (control) areas, and modality-invariant responses in linguistic and extra-linguistic areas.	t	\N
24091182	The auditory nervous system is highly nonlinear. Some nonlinear responses arise through active processes in the cochlea, while others may arise in neural populations of the cochlear nucleus, inferior colliculus and higher auditory areas. In humans, auditory brainstem recordings reveal nonlinear population responses to combinations of pure tones, and to musical intervals composed of complex tones. Yet the biophysical origin of central auditory nonlinearities, their signal processing properties, and their relationship to auditory perception remain largely unknown. Both stimulus components and nonlinear resonances are well represented in auditory brainstem nuclei due to neural phase-locking. Recently mode-locking, a generalization of phase-locking that implies an intrinsically nonlinear processing of sound, has been observed in mammalian auditory brainstem nuclei. Here we show that a canonical model of mode-locked neural oscillation predicts the complex nonlinear population responses to musical intervals that have been observed in the human brainstem. The model makes predictions about auditory signal processing and perception that are different from traditional delay-based models, and may provide insight into the nature of auditory population responses. We anticipate that the application of dynamical systems analysis will provide the starting point for generic models of auditory population dynamics, and lead to a deeper understanding of nonlinear auditory signal processing possibly arising in excitatory-inhibitory networks of the central auditory nervous system. This approach has the potential to link neural dynamics with the perception of pitch, music, and speech, and lead to dynamical models of auditory system development.	t	\N
24094802	Although reduced stress is thought to be one of the most deviant speech dimensions in hypokinetic dysarthria associated with Parkinson's disease (PD), the mechanisms of stress production in PD have not been thoroughly explored by objective methods. The aim of the present study was to quantify the effect of PD on prosodic characteristics and to describe contrastive stress patterns in parkinsonian speech. The ability of 20 male speakers with early PD and 16 age- and gender-matched healthy controls (HCs) to signal contrastive stress was investigated. Each participant was instructed to unnaturally emphasize five key words while reading a short block of text. Acoustic analyses were based on the measurement of pitch, intensity, and duration. In addition, an innovative measurement termed the stress pattern index (SPI) was designed to mirror the effect of all distinct acoustic cues exploited during stress production. Although PD patients demonstrated a reduced ability to convey contrastive stress, they could still notably increase pitch, intensity, and duration to emphasize a word within a sentence. No differences were revealed between PD and HC stress productions using the measurements of pitch, intensity, duration, and intensity range. However, restricted SPI and pitch range were evident in the PD group. A reduced ability to express stress seems to be the distinctive pattern of hypokinetic dysarthria, even in the early stages of PD. Because PD patients were able to consciously improve their speech performance using multiple acoustic cues, the introduction of speech therapy may be rewarding.	t	\N
24098565	In the present study, we investigated brain morphological signatures of dyslexia by using a voxel-based asymmetry analysis. Dyslexia is a developmental disorder that affects the acquisition of reading and spelling abilities and is associated with a phonological deficit. Speech perception disabilities have been associated with this deficit, particularly when listening conditions are challenging, such as in noisy environments. These deficits are associated with known neurophysiological correlates, such as a reduction in the functional activation or a modification of functional asymmetry in the cortical regions involved in speech processing, such as the bilateral superior temporal areas. These functional deficits have been associated with macroscopic morphological abnormalities, which potentially include a reduction in gray and white matter volumes, combined with modifications of the leftward asymmetry along the perisylvian areas. The purpose of this study was to investigate gray/white matter distribution asymmetries in dyslexic adults using automated image processing derived from the voxel-based morphometry technique. Correlations with speech-in-noise perception abilities were also investigated. The results confirmed the presence of gray matter distribution abnormalities in the superior temporal gyrus (STG) and the superior temporal Sulcus (STS) in individuals with dyslexia. Specifically, the gray matter of adults with dyslexia was symmetrically distributed over one particular region of the STS, the temporal voice area, whereas normal readers showed a clear rightward gray matter asymmetry in this area. We also identified a region in the left posterior STG in which the white matter distribution asymmetry was correlated to speech-in-noise comprehension abilities in dyslexic adults. These results provide further information concerning the morphological alterations observed in dyslexia, revealing the presence of both gray and white matter distribution anomalies and the potential involvement of these defects in speech-in-noise deficits.	t	\N
24098665	Numerous studies have documented the phenomenon of phonetic imitation: the process by which the production patterns of an individual become more similar on some phonetic or acoustic dimension to those of her interlocutor. Though social factors have been suggested as a motivator for imitation, few studies has established a tight connection between language-external factors and a speaker's likelihood to imitate. The present study investigated the phenomenon of phonetic imitation using a within-subject design embedded in an individual-differences framework. Participants were administered a phonetic imitation task, which included two speech production tasks separated by a perceptual learning task, and a battery of measures assessing traits associated with Autism-Spectrum Condition, working memory, and personality. To examine the effects of subjective attitude on phonetic imitation, participants were randomly assigned to four experimental conditions, where the perceived sexual orientation of the narrator (homosexual vs. heterosexual) and the outcome (positive vs. negative) of the story depicted in the exposure materials differed. The extent of phonetic imitation by an individual is significantly modulated by the story outcome, as well as by the participant's subjective attitude toward the model talker, the participant's personality trait of openness and the autistic-like trait associated with attention switching.	t	\N
24099584	The objective of this prospective study was to investigate the relationship between acceptable noise level (ANL), which was measured using Taiwanese and the international speech test signal (ISTS), and real-world hearing-aid success for listeners who were representative of the population commonly seen in clinics. Unaided ANLs were measured pre-hearing-aid fitting. Hearing-aid success was assessed three months post-fitting using the international outcome inventory for hearing aids (IOI-HA) and a hearing-aid use questionnaire. Eighty adults with hearing impairment completed the study. Both Taiwanese and ISTS ANLs were significantly associated with hearing-aid success, with higher ANLs suggesting poorer outcomes. However, the ANL's prediction accuracy for the probability of hearing-aid success was either much lower than that suggested by some literature, or was not much different from that of simply predicting all listeners as successful users. The current study suggested the possibility of using ANL to predict hearing-aid success. However, the usefulness of ANL as a clinical tool is unlikely to be as great as indicated by the literature.	t	\N
24101343	In a series of three experiments, we used an ambiguous plaid motion stimulus to explore the behavioral and electrophysiological effects of prior stimulus exposures and perceptual states on current awareness. The results showed that prior exposure to a stimulus biased toward one percept led to subsequent suppression of that percept. In contrast, in the absence of stimulus bias, prior perceptual experience can have a facilitative influence. The suppressive effects caused by the prior stimulus were found to transfer to an ambiguous plaid test stimulus rotated 180º relative to the adaptation stimulus, but were abolished if (1) the ambiguous test stimulus was only rotated 90º relative to the adaptation stimulus or (2) the adaptation stimulus was heavily biased toward the component grating percept. Event-related potential recordings were consistent with the involvement of visual cortical areas and suggested that the influence of recent stimulus exposure may involve recruitment of additional brain processes beyond those responsible for initial stimulus encoding. In contrast, the effects of prior and current perceptual experience appeared to depend on similar brain processes. Although the data presented here focus on vision, the work is discussed within the context of data from a parallel series of experiments in audition.	t	\N
24110307	The cocktail party problem is a multi-faceted challenge which encompasses various aspects of auditory perception. Its processes underlie the brain's ability to detect, identify and classify sound objects; to robustly represent and maintain speech intelligibility amidst severe distortions; and to guide actions and behaviors in line with complex goals and shifting acoustic soundscapes. Here, we present a perspective that considers the powerful Bayesian inference as a unifying framework to integrate the role of sensory cues as well as stimulus-driven priors and top-down schemas including attention.	t	\N
24110658	Temporal fine structure (TFS) carries important information for the speech perception of hearing-impaired listeners and for the design of novel prosthetic hearing devices. This study assessed the performance of present intelligibility indices for predicting the intelligibility of speech containing different amount of TFS information. Speech intelligibility data was collected from vocoded and wideband Mandarin sentences containing little/partial and intact TFS information, respectively, and was then subjected to the correlation analysis with existing intelligibility indices. It was found that, though performing well in predicting the intelligibility of vocoded or wideband speech separately, present intelligibility indices were not highly correlated with the intelligibility scores when a general function was used to map all intelligibility measures to intelligibility scores. Analysis further showed that the intelligibility prediction power could be significantly improved when multiple condition-dependent functions were used for mapping intelligibility measures to intelligibility scores.	t	\N
24116428	Bilateral cochlear implants (CIs) have provided some success in improving spatial hearing abilities to patients, but with large variability in performance. One reason for the variability is that there may be a mismatch in the place-of-stimulation arising from electrode arrays being inserted at different depths in each cochlea. Goupell et al. [(2013b). J. Acoust. Soc. Am. 133(4), 2272-2287] showed that increasing interaural mismatch led to non-fused auditory images and poor lateralization of interaural time differences in normal hearing subjects listening to a vocoder. However, a greater bandwidth of activation helped mitigate these effects. In the present study, the same experiments were conducted in post-lingually deafened bilateral CI users with deliberate and controlled interaural mismatch of single electrode pairs. Results show that lateralization was still possible with up to 3 mm of interaural mismatch, even when off-center, or multiple, auditory images were perceived. However, mismatched inputs are not ideal since it leads to a distorted auditory spatial map. Comparison of CI and normal hearing listeners showed that the CI data were best modeled by a vocoder using Gaussian-pulsed tones with 1.5 mm bandwidth. These results suggest that interaural matching of electrodes is important for binaural cues to be maximally effective.	t	\N
24116537	Using molecular psychophysics, temporal loudness weights were measured for 2-s, 1-kHz tones with flat, increasing and decreasing time-intensity profiles. While primacy and recency effects were observed for flat profile stimuli, the so-called "level dominance" effect was observed for both increasing and decreasing profile stimuli, fully determining their temporal weights. The weighs obtained for these profiles were basically zero for all but the most intense parts of these sounds. This supports the view that the "level dominance" effect is prominent with intensity-varying sounds and that it persists over time since temporal weights are not affected by the direction of intensity change.	t	\N
24124356	To assess the benefits of cochlear implantation in the elderly. A retrospective analysis of 31 postlingually deafened elderly (≥60 years of age) with unilateral cochlear implants was conducted. Audiological testing included preoperative and postoperative pure-tone audiometry and a monosyllabic word recognition test presented from recorded material in free field. Speech perception tests included Ling's six sound test (sound detection, discrimination, and identification), syllable discrimination, and monosyllabic and multisyllabic word recognition (open set) without lip-reading. Everyday life benefits from cochlear implantation were also evaluated. The mean age at the time of cochlear implantation was 72.4 years old. The mean postimplantation follow-up time was 2.34 years. All patients significantly improved their audiological and speech understanding performances. The preoperative mean pure-tone average threshold for 500 Hz, 1,000 Hz, 2,000 Hz, and 4,000 Hz was 110.17 dB HL. Before cochlear implantation, all patients scored 0% on the monosyllabic word recognition test in free field at 70 dB SPL intensity level. The postoperative pure-tone average was 37.14 dB HL (the best mean threshold was 17.50 dB HL, the worst was 58.75 dB HL). After the surgery, mean monosyllabic word recognition reached 47.25%. Speech perception tests showed statistically significant improvement in speech recognition. The results of this study showed that cochlear implantation is indeed a successful treatment for improving speech recognition and offers a great help in everyday life to deafened elderly patients. Therefore, they can be good candidates for cochlear implantation and their age alone should not be a relevant or excluding factor when choosing candidates for cochlear implantation.	t	\N
24129008	PURPOSE Previous work has shown that monolingual French and English speakers use distinct articulatory settings, the underlying articulatory posture of a language. In the present article, the authors report on an experiment in which they investigated articulatory settings in bilingual speakers. The authors first tested the hypothesis that in order to sound native-like, bilinguals must use distinct, language-specific articulatory settings in monolingual mode. The authors then tested the hypothesis that in bilingual mode, a bilingual individual's articulatory setting is identical to the monolingual-mode setting of 1 of his or her languages. METHOD Eight French-English bilinguals each read 90 English and 90 French sentences, and the authors measured their interspeech posture (ISP) using optical tracking of the lips and jaw and ultrasound imaging of the tongue. RESULTS Results show that bilingual speakers who are perceived as native in both languages exhibit distinct, language-specific ISPs, and those who are not perceived as native in one or more languages do not. In bilingual mode, bilinguals use an ISP that is equivalent to the monolingual-mode ISP of their currently most used language. The most balanced bilingual used a French lip ISP but an English tongue-tip ISP. CONCLUSION Results support the claim that bilinguals who sound native in each of their languages have distinct articulatory settings for each language.	t	\N
24130256	Biological motion research is an increasingly active field, with a great potential to contribute to a wide range of applications, such as behavioral monitoring/motion detection in surveillance situations, intention inference in social interactions, and diagnostic tools in autism research. In recent years, a large amount of motion capture data has become freely available online, potentially providing rich stimulus sets for biological motion research. However, there currently does not exist an easy-to-use tool to extract, present and manipulate motion capture data in the MATLAB environment, which many researchers use to program their experiments. We have developed the Biomotion Toolbox, which allows researchers to import motion capture data in a variety of formats, to display actions using Psychtoolbox 3, and to manipulate action displays in specific ways (e.g., inversion, three-dimensional rotation, spatial scrambling, phase-scrambling, and limited lifetime). The toolbox was designed to allow researchers with a minimal level of MATLAB programming skills to code experiments using biological motion stimuli.	t	\N
24130865	Temporal processing underlies both music and language skills. There is increasing evidence that rhythm abilities track with reading performance and that language disorders such as dyslexia are associated with poor rhythm abilities. However, little is known about how basic time-keeping skills can be shaped by musical training, particularly during critical literacy development years. This study was carried out in collaboration with Harmony Project, a non-profit organization providing free music education to children in the gang reduction zones of Los Angeles. Our findings reveal that elementary school children with just one year of classroom music instruction perform more accurately in a basic finger-tapping task than their untrained peers, providing important evidence that fundamental time-keeping skills may be strengthened by short-term music training. This sets the stage for further examination of how music programs may be used to support the development of basic skills underlying learning and literacy, particularly in at-risk populations which may benefit the most.	t	\N
24131604	The Dichotic Verbal Memory Test (DVMT) is useful in detecting verbal memory deficits and differences in memory function between the brain hemispheres. The purpose of this study was to prepare the Persian version of DVMT, to obtain its results in 18- to 25-yr-old Iranian individuals, and to examine the ear, gender, and serial position effect. The Persian version of DVMT consisted of 18 10-word lists. After preparing the 18 lists, content validity was assessed by a panel of eight experts and the equivalency of the lists was evaluated. Then the words were recorded on CD in a dichotic mode such that 10 words were presented to one ear, with the same words reversed simultaneously presented to the other ear. Thereafter, it was performed on a sample of young, normal, Iranian individuals. Thirty normal individuals (no history of neurological, ontological, or psychological diseases) with ages ranging from 18 to 25 yr were examined for evaluating the equivalency of the lists, and 110 subjects within the same age range participated in the final stage of the study to obtain the normative data on the developed test. There was no significant difference between the mean scores of the 18 developed lists (p > 0.05). The mean content validity index (CVI) score was .96. A significant difference was found between the mean score of the two ears (p < 0.05) and between female and male participants (p < 0.05). The Persian version of DVMT has good content validity and can be used for verbal memory assessment in Iranian young adults.	t	\N
24131606	Current bone anchored hearing solutions (BAHSs) have incorporated automatic adaptive multichannel directional microphones (DMs). Previous fixed single-channel hypercardioid DMs in BAHSs have provided benefit in a diffuse listening environment, but little data are available on the performance of adaptive multichannel DMs in BAHSs for persons with unilateral sensorineural hearing loss (USNHL). The primary goal was to determine if statistically significant differences existed in the mean Reception Threshold for Sentences (RTS in dB) in diffuse uncorrelated restaurant noise between unaided, an omnidirectional microphone (OM), split DM (SDM), and full DM (FDM) in the Oticon Medical Ponto Pro. A second goal was to assess subjective benefit using the Abbreviated Profile of Hearing Aid Benefit (APHAB) comparing the Ponto Pro to the participant's current BAHS, and the Ponto Pro and participant's own BAHS to unaided. The third goal was to compare RTS data of the Ponto Pro to data from an identical study examining Cochlear Americas' Divino. A randomized repeated measures, single blind design was used to measure an RTS for each participant for unaided, OM, SDM, and FDM. Fifteen BAHS users with USNHL were recruited from Washington University in St. Louis and the surrounding area. The Ponto Pro was fit by measuring in-situ bone conduction thresholds and was worn for 4 wk. An RTS was obtained utilizing Hearing in Noise Test (HINT) sentences in uncorrelated restaurant noise from an eight loudspeaker array, and subjective benefit was determined utilizing the APHAB. Analysis of variance (ANOVA) was used to analyze the results of the Ponto Pro HINT and APHAB data, and comparisons between the Ponto Pro and previous Divino data. No statistically significant differences existed in mean RTS between unaided, the Ponto Pro's OM, SDM, or FDM (p = 0.10). The Ponto Pro provided statistically significant benefit for the Background Noise (BN) (p < 0.01) and Reverberation (RV) (p < 0.05) subscales compared to the participant's own BAHS. The Ponto Pro (Ease of Communication [EC] [p < 0.01], BN [p < 0.001], and RV [p < 0.01] subscales) and participant's own BAHS (BN [p < 0.01] and RV [p < 0.01] subscales) overall provided statistically significant benefit compared to unaided. Clinically significant benefit of 5% was present for the Ponto Pro compared to the participant's own BAHS and 10% for the Ponto Pro and the participant's own BAHS compared to unaided. The Ponto Pro's OM (p = 0.05), SDM (p = 0.05), and FDM (p < 0.01) were statistically significantly better than the Divino's OM. No significant differences existed between the Ponto Pro's OM, SDM, and FDM compared to the Divino's DM. No statistically significant differences existed between unaided, OM, SDM, or FDM. Participants preferred the Ponto Pro compared to the participant's own BAHS and the Ponto Pro and participant's own BAHS compared to unaided. The RTS of the Ponto Pro's adaptive multichannel DM was similar to the Divino's fixed hypercardioid DM, but the Ponto Pro's OM was statistically significantly better than the Divino's OM.	t	\N
24136318	Hearing loss resulting from bilateral vestibular schwannomas (VSs) has a significant effect on the quality of life of patients with neurofibromatosis Type 2 (NF2). A national consensus protocol was produced in England as a guide for cochlear implantation (CI) and auditory brainstem implantation (ABI) in these patients. Consensus statement. English NF2 Service. Clinicians from all 4 lead NF2 units in England. A protocol for the assessment, insertion and rehabilitation of CI and ABI in NF2 patients. Patients should undergo more detailed hearing assessment once their maximum aided speech discrimination score falls below 50% in the better hearing ear. Bamford-Kowal-Bench sentence testing scores below 50% should trigger assessment for auditory implantation, as recommended by the National Institute for Clinical Excellence guidelines on CI. Where this occurs in patients with bilateral stable VS or a unilateral stable VS where the contralateral cochlear nerve was lost at previous surgery, CI should be considered. Where VS surgery is planned, CI should be considered where cochlear nerve preservation is thought possible, otherwise an ABI should be considered. Intraoperative testing using electrically evoked auditory brainstem responses or cochlear nerve action potentials may be used to determine whether a CI or ABI is inserted. The NF2 centers in England agreed on this protocol. Multisite, prospective assessments of standardized protocols for auditory implantation in NF2 provide an essential model for evaluating candidacy and outcomes in this challenging patient population.	t	\N
24139706	Language experience can alter perceptual abilities and the neural specialization for phonological contrasts. Here we investigated whether dialectal differences in the lexical use of pitch information lead to differences in functional lateralization for pitch processing. We measured cortical hemodynamic responses to pitch pattern changes in native speakers of Standard (Tokyo) Japanese, which has a lexical pitch accent system, and native speakers of 'accentless' dialects, which do not have any lexical tonal phenomena. While the Standard Japanese speakers showed left-dominant responses in temporal regions to pitch pattern changes within words, the accentless dialects speakers did not show such left-dominance. Pitch pattern changes within harmonic-complex tones also elicited different brain activation patterns between the two groups. These results indicate that the neural processing of pitch information differs depending on the listener's native dialect, and that listeners' linguistic experiences may further affect the processing of pitch changes even for non-linguistic sounds.	t	\N
24141311	Across all sensory modalities, the effect of context-dependent neural adaptation can be observed at every level, from receptors to perception. Nonetheless, it has long been assumed that the processing of interaural time differences, which is the primary cue for sound localization, is nonadaptive, as its outputs are mapped directly onto a hard-wired representation of space. Here we present evidence derived from in vitro and in vivo experiments in gerbils indicating that the coincidence-detector neurons in the medial superior olive modulate their sensitivity to interaural time differences through a rapid, GABA(B) receptor-mediated feedback mechanism. We show that this mechanism provides a gain control in the form of output normalization, which influences the neuronal population code of auditory space. Furthermore, psychophysical tests showed that the paradigm used to evoke neuronal GABA(B) receptor-mediated adaptation causes the perceptual shift in sound localization in humans that was expected on the basis of our physiological results in gerbils.	t	\N
24141681	The signal processing strategy is a parameter that may influence the auditory performance of cochlear implant and is important to optimize this parameter to provide better speech perception, especially in difficult listening situations. To evaluate the individual's auditory performance using two different signal processing strategy. Prospective study with 11 prelingually deafened children with open-set speech recognition. A within-subjects design was used to compare performance with standard HiRes and HiRes 120 in three different moments. During test sessions, subject's performance was evaluated by warble-tone sound-field thresholds, speech perception evaluation, in quiet and in noise. In the silence, children S1, S4, S5, S7 showed better performance with the HiRes 120 strategy and children S2, S9, S11 showed better performance with the HiRes strategy. In the noise was also observed that some children performed better using the HiRes 120 strategy and other with HiRes. Not all children presented the same pattern of response to the different strategies used in this study, which reinforces the need to look at optimizing cochlear implant clinical programming.	t	\N
24148845	Autism spectrum disorders (ASDs) are associated with auditory hyper- or hyposensitivity; atypicalities in central auditory processes, such as speech-processing and selective auditory attention; and neural connectivity deficits. We sought to investigate whether the low-level integrative processes underlying sound localization and spatial discrimination are affected in ASDs. We performed 3 behavioural experiments to probe different connecting neural pathways: 1) horizontal and vertical localization of auditory stimuli in a noisy background, 2) vertical localization of repetitive frequency sweeps and 3) discrimination of horizontally separated sound stimuli with a short onset difference (precedence effect). Ten adult participants with ASDs and 10 healthy control listeners participated in experiments 1 and 3; sample sizes for experiment 2 were 18 adults with ASDs and 19 controls. Horizontal localization was unaffected, but vertical localization performance was significantly worse in participants with ASDs. The temporal window for the precedence effect was shorter in participants with ASDs than in controls. The study was performed with adult participants and hence does not provide insight into the developmental aspects of auditory processing in individuals with ASDs. Changes in low-level auditory processing could underlie degraded performance in vertical localization, which would be in agreement with recently reported changes in the neuroanatomy of the auditory brainstem in individuals with ASDs. The results are further discussed in the context of theories about abnormal brain connectivity in individuals with ASDs.	t	\N
24150886	Facilitatory effects have been noted between tools and the objects that they act upon (their "action recipients") across several paradigms. However, it has not been convincingly established that the motor system is directly involved in the joint visual processing of these object pairings. Here, we used the attentional blink (AB) paradigm to demonstrate privileged access to perceptual awareness for tool-action recipient object pairs and to investigate how motor affordances modulate their joint processing. We demonstrated a reduction in the size of the AB that was greater for congruent tool-action recipient pairings (e.g., hammer-nail) than for incongruent pairings (e.g., scissors-nail). Moreover, the AB was reduced only when action recipients followed their associated tool in the temporal sequence, but not when this order was reversed. Importantly, we also found that the effect was sensitive to manipulations of the motor congruence between the tool and the action recipient. First, we observed a greater reduction in the AB when the tool and action recipient were correctly aligned for action than when the tool was rotated to face away from the action recipient. Second, presenting a different tool as a distractor between the tool and action recipient target objects removed any benefit seen for congruent pairings. This was likely due to interference from the motor properties of the distractor tool that disrupted the motor synergy between the congruent tool and action recipient targets. Overall, these findings demonstrate that the contextual motoric relationship between tools and their action recipients facilitates their visual encoding and access to perceptual awareness.	t	\N
24157488	The aim of this study was to evaluate electrode array position in relation to cochlear anatomy and its influence on hearing performance in cochlear implantees. Twenty-two patients (25 ears) with Med-El cochlear implants were included in this retrospective study. A negative correlation was observed between electrode-modiolus distance (EMD) at the cochlear base and monosyllabic word discrimination 6 months after implantation. We found no correlation between EMD and hearing outcome at 12 months. The insertion depth/cochlear perimeter ratio appeared to negatively influence the EMD at the base. Indeed, deep insertions in small cochleae appeared to yield smaller EMD and better hearing performance. This observation supports the idea of preplanning the surgery by adapting the electrode array to the length of the available scala tympani.	t	\N
24157596	Rhythmic disturbances are a hallmark of motor speech disorders, in which the motor control deficits interfere with the outward flow of speech and by extension speech understanding. As the functions of rhythm are language-specific, breakdowns in rhythm should have language-specific consequences for communication. The goals of this paper are to (i) provide a review of the cognitive-linguistic role of rhythm in speech perception in a general sense and crosslinguistically; (ii) present new results of lexical segmentation challenges posed by different types of dysarthria in American English, and (iii) offer a framework for crosslinguistic considerations for speech rhythm disturbances in the diagnosis and treatment of communication disorders associated with motor speech disorders. This review presents theoretical and empirical reasons for considering speech rhythm as a critical component of communication deficits in motor speech disorders, and addresses the need for crosslinguistic research to explore language-universal versus language-specific aspects of motor speech disorders.	t	\N
24157638	This study sought to determine if a monolingual English listener could rate nasality in English and in Spanish with the same proficiency as a bilingual English-Spanish listener, and to compare nasalance scores with nasality ratings. Speakers for this study were 26 bilingual English-Spanish-speaking children. Speech samples and nasalance scores were obtained simultaneously as each speaker recited one English sentence and one Spanish sentence. A monolingual listener and a bilingual listener rated nasality. For the English sentences, the intrajudge correlation coefficient was r = 0.89 for the monolingual listener and r = 0.89 for the bilingual listener. For the Spanish sentences, the intrajudge correlation coefficient was r = 0.91 for the monolingual listener and r = 0.92 for the bilingual listener. Interjudge agreement was r = 0.86 for rating English sentences and r = 0.78 for rating Spanish sentences. All correlation coefficients were significant (p < 0.001). The correlation coefficients between nasality ratings and nasalance scores were essentially the same for both listeners and both languages. A monolingual and a bilingual judge had high agreement on ratings of nasality for English and Spanish speech. The relationship between nasalance and nasality was not different across languages.	t	\N
24157861	Quite often, magnetoencephalography (MEG) measurements are contaminated by a series of artifacts that degrade the quality of the various source localization methods applied to them. In particular, eye blinking, minor head movement and related activities are a constant source of measurement contamination. In order to solve this problem, trial selection and rejection is applied, a task that is usually performed manually. The present work shows an automatic trial selection and rejection algorithm based on clustering techniques. These techniques employ a measurement of the dissimilarity of the items belonging to a set. This measure, based on the projection of the eigenvector corresponding to the largest eigenvalue of the covariance matrix, is provided and its rationale is explained. Subsequently, covariance matrices belonging to the selected cluster are averaged and used in the well-known Linearly Constrained Minimum Variance (LCMV) Beamformer. The results show a marked improvement of the specificity of the localization algorithm compared to the application of the LCMV without clustering. The method shows a marked reduction in computational cost compared with other data cleaning procedure widely used: Independent Component Analysis (ICA). Thus, we propose clustering techniques to be used in brain localization activity algorithms.	t	\N
24161466	A current view proposes that the right inferior frontal cortex (IFC) is particularly responsible for attentive decoding and cognitive evaluation of emotional cues in human vocalizations. Although some studies seem to support this view, an exhaustive review of all recent imaging studies points to an important functional role of both the right and the left IFC in processing vocal emotions. Second, besides a supposed predominant role of the IFC for an attentive processing and evaluation of emotional voices in IFC, these recent studies also point to a possible role of the IFC in preattentive and implicit processing of vocal emotions. The studies specifically provide evidence that both the right and the left IFC show a similar anterior-to-posterior gradient of functional activity in response to emotional vocalizations. This bilateral IFC gradient depends both on the nature or medium of emotional vocalizations (emotional prosody versus nonverbal expressions) and on the level of attentive processing (explicit versus implicit processing), closely resembling the distribution of terminal regions of distinct auditory pathways, which provide either global or dynamic acoustic information. Here we suggest a functional distribution in which several IFC subregions process different acoustic information conveyed by emotional vocalizations. Although the rostro-ventral IFC might categorize emotional vocalizations, the caudo-dorsal IFC might be specifically sensitive to their temporal features.	t	\N
24163248	Aphasic deficits are usually only interpreted in terms of domain-specific language processes. However, effective human communication and tests that probe this complex cognitive skill are also dependent on domain-general processes. In the clinical context, it is a pragmatic observation that impaired attention and executive functions interfere with the rehabilitation of aphasia. One system that is important in cognitive control is the salience network, which includes dorsal anterior cingulate cortex and adjacent cortex in the superior frontal gyrus (midline frontal cortex). This functional imaging study assessed domain-general activity in the midline frontal cortex, which was remote from the infarct, in relation to performance on a standard test of spoken language in 16 chronic aphasic patients both before and after a rehabilitation programme. During scanning, participants heard simple sentences, with each listening trial followed immediately by a trial in which they repeated back the previous sentence. Listening to sentences in the context of a listen-repeat task was expected to activate regions involved in both language-specific processes (speech perception and comprehension, verbal working memory and pre-articulatory rehearsal) and a number of task-specific processes (including attention to utterances and attempts to overcome pre-response conflict and decision uncertainty during impaired speech perception). To visualize the same system in healthy participants, sentences were presented to them as three-channel noise-vocoded speech, thereby impairing speech perception and assessing whether this evokes domain general cognitive systems. As expected, contrasting the more difficult task of perceiving and preparing to repeat noise-vocoded speech with the same task on clear speech demonstrated increased activity in the midline frontal cortex in the healthy participants. The same region was activated in the aphasic patients as they listened to standard (undistorted) sentences. Using a region of interest defined from the data on the healthy participants, data from the midline frontal cortex was obtained from the patients. Across the group and across different scanning sessions, activity correlated significantly with the patients' communicative abilities. This correlation was not influenced by the sizes of the lesion or the patients' chronological ages. This is the first study that has directly correlated activity in a domain general system, specifically the salience network, with residual language performance in post-stroke aphasia. It provides direct evidence in support of the clinical intuition that domain-general cognitive control is an essential factor contributing to the potential for recovery from aphasic stroke.	t	\N
24183205	To evaluate both monaural and binaural processing skills in a group of children with autism spectrum disorder (ASD) and to determine the degree to which personal frequency modulation (radio transmission) (FM) listening systems could ameliorate their listening difficulties. Auditory temporal processing (amplitude modulation detection), spatial listening (integration of binaural difference cues), and functional hearing (speech perception in background noise) were evaluated in 20 children with ASD. Ten of these subsequently underwent a 6-week device trial in which they wore the FM system for up to 7 hours per day. Auditory temporal processing and spatial listening ability were poorer in subjects with ASD than in matched controls (temporal: P = .014 [95% CI -6.4 to -0.8 dB], spatial: P = .003 [1.0 to 4.4 dB]), and performance on both of these basic processing measures was correlated with speech perception ability (temporal: r = -0.44, P = .022; spatial: r = -0.50, P = .015). The provision of FM listening systems resulted in improved discrimination of speech in noise (P < .001 [11.6% to 21.7%]). Furthermore, both participant and teacher questionnaire data revealed device-related benefits across a range of evaluation categories including Effect of Background Noise (P = .036 [-60.7% to -2.8%]) and Ease of Communication (P = .019 [-40.1% to -5.0%]). Eight of the 10 participants who undertook the 6-week device trial remained consistent FM users at study completion. Sustained use of FM listening devices can enhance speech perception in noise, aid social interaction, and improve educational outcomes in children with ASD.	t	\N
24216384	To investigate automatic event-related potentials (ERPs) to an auditory change in migraine patients. Auditory ERPs were recorded in 22 female patients suffering from menstrually-related migraine and in 20 age-matched control subjects, in three sessions: in the middle of the menstrual cycle, before and during menses. In each session, 200 trains of tone-bursts each including two duration deviants were presented in a passive listening condition. In all sessions, duration deviance elicited a mismatch negativity (MMN) showing no difference between the two groups. However, migraine patients showed an increased N1 orienting component to all incoming stimuli and a prolonged N2b to deviance. They also presented a different modulation of P3a amplitude along the menstrual cycle, which tended to normalise during migraine attacks. None of the studied ERP components showed a default of habituation. This passive paradigm highlighted increased automatic attention orienting to auditory changes but normal auditory sensory processing in migraineurs. Our observations suggest normal auditory processing up to attention triggering but enhanced activation of attention-related frontal networks in migraineurs.	t	\N
24219698	In Williams Syndrome (WS), a known genetic deletion results in atypical brain function with strengths in face and language processing. We examined how genetic influences on brain activity change with development. In three studies, event-related potentials (ERPs) from large samples of children, adolescents, and adults with the full genetic deletion for WS were compared to typically developing controls, and two adults with partial deletions for WS. Studies 1 and 2 identified ERP markers of brain plasticity in WS across development. Study 3 suggested that, in adults with partial deletions for WS, specific genes may be differentially implicated in face and language processing.	t	\N
24231421	A technology of backup alarms based on the use of a broadband signal has recently gained popularity in many countries. In this study, the performance of this broadband technology is compared to that of a conventional tonal alarm and a multi-tone alarm from a worker-safety standpoint. Field measurements of sound pressure level patterns behind heavy vehicles were performed in real work environments and psychoacoustic measurements (sound detection thresholds, equal loudness, perceived urgency and sound localization) were carried out in the laboratory with human subjects. Compared with the conventional tonal alarm, the broadband alarm generates a much more uniform sound field behind vehicles, is easier to localize in space and is judged slighter louder at representative alarm levels. Slight advantages were found with the tonal alarm for sound detection and for perceived urgency at low levels, but these benefits observed in laboratory conditions would not overcome the detrimental effects associated with the large and abrupt variations in sound pressure levels (up to 15-20 dB within short distances) observed in the field behind vehicles for this alarm, which are significantly higher than those obtained with the broadband alarm. Performance with the multi-tone alarm generally fell between that of the tonal and broadband alarms on most measures.	t	\N
24231629	This study aimed to (1) determine the sensitivity of the electrically evoked auditory change complex (eACC) to changes in stimulating electrode position; and (2) investigate the association between results of eACC measures and behavioral electrode discrimination and their association with speech-perception performance in pediatric cochlear implant (CI) users who have auditory neuropathy spectrum disorder (ANSD). Fifteen children with ANSD ranging in age between 5.4 and 18.6 years participated in this study. All subjects used Cochlear Nucleus devices. For each subject, open-set speech-perception ability was assessed using the Phonetically Balanced Kindergarten word lists presented at 60 dB SPL, using monitored live voice in a sound booth. Behavioral and objective measures of electrode discrimination were assessed in a nonclinical test environment. The stimuli used to elicit these measures were 800 msec biphasic pulse trains delivered by a direct interface to the CI. Data were collected from two basic stimulation conditions. In the standard condition, the entire pulse train was delivered to a mid-array electrode (electrode 11 or 12) at the maximum comfortable level (C level). In the change condition, the stimulus was split into two 400 msec pulse train segments presented sequentially on two different electrodes. The stimulation level of the second 400 msec pulse train was loudness balanced to the C level of the mid-array electrode used in the standard condition. The separation between the pair of stimulating electrodes was systematically varied. For behavioral electrode-discrimination measures, each subject was required to determine whether he or she heard one or two sounds for stimuli presented in different stimulation conditions. For the eACC measures, two replicates of 100 artifact-free sweeps were recorded for each stimulation condition. The eACC in response to changes in stimulating electrode position was recorded from all subjects with ANSD using direct electrical stimulation. Electrode-discrimination thresholds determined with the eACC and behavioral measures were consistent. Children with ANSD using CIs who showed poorer speech performance also required larger separations between the stimulating electrode pair to reliably elicit the eACC than subjects with better speech-perception performance. There was a robust correlation between electrode-discrimination capacities and speech-perception performances in subjects tested in this study. The effect of electrode separation on eACC amplitudes was not monotonic. These results demonstrate the feasibility of using eACC to evaluate electrode-discrimination capacities in children with ANSD. These results suggest that the eACC elicited by changes in stimulating electrode position holds great promise as an objective tool for evaluating spectral-pattern detection in such subjects, which may be predictive of their potential speech-perception performance.	t	\N
24232062	Children with auditory neuropathy (AN) have variable hearing on pure tone testing, and the presence of speech and language delays often play a major role in the decision to offer cochlear implantation (CI) in this population. Despite this fact, the speech and language outcomes in this group after CI are not well described. This study compares speech and language outcomes after CI in a subset of the pediatric AN population that does not have a confounding cognitive disorder with those of their peers with cochlear hearing loss (CoHL). Retrospective chart review. Tertiary referral center. Seventeen pediatric patients with AN who received a CI and a group of children with CoHL who received a CI were the subjects of this study. The 2 groups demonstrated similar ages at implant. Children with cognitive delays were excluded from each group. Cochlear implantation. All subjects were evaluated preoperatively and postoperatively with standardized age appropriate speech and language measures, including the Expressive Vocabulary Test (EVT), Peabody Picture Vocabulary Test (PPVT), and Preschool Language Scale (PLS). There was no significant difference between the groups on age of activation of the CI. Children with a diagnosis of AN had a significantly lower unaided pure tone average preoperatively as compared with children with cochlear hearing loss; however, there was no significant difference between the groups on either their preimplantation or postimplantation speech and language scores. Children with a diagnosis of AN without associated cognitive or developmental disorders have speech and language outcomes comparable to other children who received a CI.	t	\N
24236696	Evidence in animals and humans indicates that there are sensitive periods during development, times when experience or stimulation has a greater influence on behavior and brain structure. Sensitive periods are the result of an interaction between maturational processes and experience-dependent plasticity mechanisms. Previous work from our laboratory has shown that adult musicians who begin training before the age of 7 show enhancements in behavior and white matter structure compared with those who begin later. Plastic changes in white matter and gray matter are hypothesized to co-occur; therefore, the current study investigated possible differences in gray matter structure between early-trained (ET; <7) and late-trained (LT; >7) musicians, matched for years of experience. Gray matter structure was assessed using voxel-wise analysis techniques (optimized voxel-based morphometry, traditional voxel-based morphometry, and deformation-based morphometry) and surface-based measures (cortical thickness, surface area and mean curvature). Deformation-based morphometry analyses identified group differences between ET and LT musicians in right ventral premotor cortex (vPMC), which correlated with performance on an auditory motor synchronization task and with age of onset of musical training. In addition, cortical surface area in vPMC was greater for ET musicians. These results are consistent with evidence that premotor cortex shows greatest maturational change between the ages of 6-9 years and that this region is important for integrating auditory and motor information. We propose that the auditory and motor interactions required by musical practice drive plasticity in vPMC and that this plasticity is greatest when maturation is near its peak.	t	\N
24236753	Does temporal regularity facilitate prediction in audition? To test this, we recorded human event-related potentials to frequent standard tones and infrequent pitch deviant tones, pre-attentively delivered within isochronous and anisochronous (20% onset jitter) rapid sequences. Deviant tones were repeated, either with high or low probability. Standard tone repetition sets a first-order prediction, which is violated by deviant tone onset, leading to a first-order prediction error response (Mismatch Negativity). The response to highly probable deviant repetitions is, however, attenuated relative to less probable repetitions, reflecting the formation of higher-order sensory predictions. Results show that temporal regularity is required for higher-order predictions, but does not modulate first-order prediction error responses. Inverse solution analyses (Variable Resolution Electrical Tomography; VARETA) localized the error response attenuation to posterior regions of the left superior temporal gyrus. In a control experiment with a slower stimulus rate, we found no evidence for higher-order predictions, and again no effect of temporal information on first-order prediction error. We conclude that: (i) temporal regularity facilitates the establishing of higher-order sensory predictions, i.e. 'knowing what next', in fast auditory sequences; (ii) first-order prediction error relies predominantly on stimulus feature mismatch, reflecting the adaptive fit of fast deviance detection processes.	t	\N
24244617	Due to their different propagation times, visual and auditory signals from external events arrive at the human sensory receptors with a disparate delay. This delay consistently varies with distance, but, despite such variability, most events are perceived as synchronic. There is, however, contradictory data and claims regarding the existence of compensatory mechanisms for distance in simultaneity judgments. In this paper we have used familiar audiovisual events--a visual walker and footstep sounds--and manipulated the number of depth cues. In a simultaneity judgment task we presented a large range of stimulus onset asynchronies corresponding to distances of up to 35 meters. We found an effect of distance over the simultaneity estimates, with greater distances requiring larger stimulus onset asynchronies, and vision always leading. This effect was stronger when both visual and auditory cues were present but was interestingly not found when depth cues were impoverished. These findings reveal that there should be an internal mechanism to compensate for audiovisual delays, which critically depends on the depth information available.	t	\N
24257810	Remifentanil (Ultiva(®)) is a potent ultra-short acting mu-opioid receptor agonist used for perioperative pain treatment and anaesthesia. So far, it is not known how sensitive the cognitive processing of auditory perception elicited by the mismatch negativity (MMN) paradigm is to opioids. The present exploratory study investigated how the opioid remifentanil modulates different stages of auditory processing as reflected in the MMN(m) and P3a(m). We recorded electroencephalography (EEG) and magnetoencephalography (MEG) during auditory stimulation under remifentanil or placebo infusion in 20 healthy participants. For the MMN, a gender effect was found for tones deviating in frequency (± 10%) from the standard tone. Remifentanil increased the amplitude of the frequency MMN at F3 in females but not in males. No effect of treatment was found for the MMN(m) or the novel P3a(m). These results suggest that while the bottom-up stimulus change detection system for auditory stimuli appears to be relatively insensitive to opioids, the automatic attention switch caused by the change detection seems to be modulated by the opioid system in females. The multiple deviant paradigm including novel sounds is a promising tool for investigating pharmacological manipulation of different stages of auditory processing. Furthermore, combining the two techniques will yield more specific information about the drug effects on MMN(m).	t	\N
24259564	One of the more enduring mysteries of neuroscience is how the visual system constructs robust maps of the world that remain stable in the face of frequent eye movements. Here we show that encoding the position of objects in external space is a relatively slow process, building up over hundreds of milliseconds. We display targets to which human subjects saccade after a variable preview duration. As they saccade, the target is displaced leftwards or rightwards, and subjects report the displacement direction. When subjects saccade to targets without delay, sensitivity is poor; but if the target is viewed for 300-500 ms before saccading, sensitivity is similar to that during fixation with a strong visual mask to dampen transients. These results suggest that the poor displacement thresholds usually observed in the "saccadic suppression of displacement" paradigm are a result of the fact that the target has had insufficient time to be encoded in memory, and not a result of the action of special mechanisms conferring saccadic stability. Under more natural conditions, trans-saccadic displacement detection is as good as in fixation, when the displacement transients are masked.	t	\N
24259673	Viewers can recognize the gist of a scene (i.e., its holistic semantic representation, such as its category) in less time than a single fixation, and backward masking has traditionally been employed as a means to determine that time course. The masks used in those paradigms are often characterized by either specific amplitude spectra only, or amplitude and phase spectra-defined structural properties. However, it remains unclear whether there would be a differential contribution of amplitude only or amplitude + phase defined image statistics to the effective backward masking of rapid scene categorization. The current study addresses this issue. Experiments 1-3 explored amplitude spectra defined contributions to category masking and revealed that the slope of the amplitude spectrum was more important for modulating scene category masking strength than amplitude orientation. Further, the masking effects followed an "amplitude spectrum slope similarity principle" whereby the more similar the amplitude spectrum slope of the mask was to the target's amplitude spectrum slope, the stronger the masking. Experiment 5 showed that, when holding mask amplitude spectrum slope approximately constant, both categorically specific unrecognizable amplitude only and amplitude + phase statistical regularities disrupted rapid scene categorization. Specifically, the masking effects observed in Experiment 5 followed a target-mask categorical dissimilarity principle whereby the more dissimilar the mask category is to the target image category, the stronger the masking. Overall, the results support the notion that amplitude only or amplitude + phase-defined image statistics differentially contribute to the effective backward masking of rapid scene gist recognition.	t	\N
24265213	This study assesses the effects of adding low- or high-frequency information to the band-limited telephone-processed speech on bimodal listeners' telephone speech perception in quiet environments. In the proposed experiments, bimodal users were presented under quiet listening conditions with wideband speech (WB), bandpass-filtered telephone speech (300-3,400 Hz, BP), high-pass filtered speech (f > 300 Hz, HP, i.e., distorted frequency components above 3,400 Hz in telephone speech were restored), and low-pass filtered speech (f < 3,400 Hz, LP, i.e., distorted frequency components below 300 Hz in telephone speech were restored). Results indicated that in quiet environments, for all four types of stimuli, listening with both hearing aid (HA) and cochlear implant (CI) was significantly better than listening with CI alone. For both bimodal and CI-alone modes, there were no statistically significant differences between the LP and BP scores and between the WB and HP scores. However, the HP scores were significantly better than the BP scores. In quiet conditions, both CI alone and bimodal listening achieved the largest benefits when telephone speech was augmented with high rather than low-frequency information. These findings provide support for the design of algorithms that would extend higher frequency information, at least in quiet environments.	t	\N
24268879	Timing performance becomes less precise for longer intervals, which makes it difficult to achieve simultaneity in synchronisation with a rhythm. The metrical structure of music, characterised by hierarchical levels of binary or ternary subdivisions of time, may function to increase precision by providing additional timing information when the subdivisions are explicit. This hypothesis was tested by comparing synchronisation performance across different numbers of metrical levels conveyed by loudness of sounds, such that the slowest level was loudest and the fastest was softest. Fifteen participants moved their hand with one of 9 inter-beat intervals (IBIs) ranging from 524 to 3,125 ms in 4 metrical level (ML) conditions ranging from 1 (one movement for each sound) to 4 (one movement for every 8th sound). The lowest relative variability (SD/IBI<1.5%) was obtained for the 3 longest IBIs (1600-3,125 ms) and MLs 3-4, significantly less than the smallest value (4-5% at 524-1024 ms) for any ML 1 condition in which all sounds are identical. Asynchronies were also more negative with higher ML. In conclusion, metrical subdivision provides information that facilitates temporal performance, which suggests an underlying neural multi-level mechanism capable of integrating information across levels.	t	\N
24270967	An important parameter for characterization of the acoustic quality of closed rooms is reverberation. There is a rising interest in evaluating the ability of cochlear implant (CI) users to understand speech in real-world environments. Whereas the influence of noise on speech perception has been widely investigated, much less is known about the detrimental effect of reverberation. The present study aimed to investigate the influence of reverberation time on the speech perception of CI users and subjects with normal hearing. A reverberated version of the sentences of the Oldenburg sentence test (OLSA) which is a widely used German test to measure speech reception thresholds (SRT) in cochlear implant users was generated using professional audio processing software. The reverberation times used were 0.7, 1.0, 1.5 and 2.0 s. For these four reverberation times and for a non-reverberated control condition, the SRT was measured in eight adult CI users and in eight subjects with normal hearing. To characterize the detrimental effect of reverberation the SRT differences between the reverberated and non-reverberated conditions were calculated. These SRT differences revealed a significant effect of reverberation in CI users with, e.g. a mean SRT increase of 2.9 dB in CI users and 0.9 dB in subjects with normal hearing for a reverberation time of 0.7 s. A strong correlation was found between the SRT increase and the SRT in the non-reverberated condition, highlighting the problems of poor performers in reverberant environments. The results of the current investigation indicated that reverberation results in decreased speech understanding of CI users.	t	\N
24274362	Infant vocalizations and "looming sounds" are classes of environmental stimuli that are critically important to survival but can have dramatically different emotional valences. Here, we simultaneously presented listeners with a stationary infant vocalization and a 3D virtual looming tone for which listeners made auditory time-to-arrival judgments. Negatively valenced infant cries produced more cautious (anticipatory) estimates of auditory arrival time of the tone over a no-vocalization control. Positively valenced laughs had the opposite effect, and across all conditions, men showed smaller anticipatory biases than women. In Experiment 2, vocalization-matched vocoded noise stimuli did not influence concurrent auditory time-to-arrival estimates compared with a control condition. In Experiment 3, listeners estimated the egocentric distance of a looming tone that stopped before arriving. For distant stopping points, women estimated the stopping point as closer when the tone was presented with an infant cry than when it was presented with a laugh. For near stopping points, women showed no differential effect of vocalization type. Men did not show differential effects of vocalization type at either distance. Our results support the idea that both the sex of the listener and the emotional valence of infant vocalizations can influence auditory motion perception and can modulate motor responses to other behaviorally relevant environmental sounds. We also find support for previous work that shows sex differences in emotion processing are diminished under conditions of higher stress.	t	\N
24293020	Multisensory enhancement, as a facilitation phenomenon, is responsible for superior behavioral performance when an individual is responding to cross-modal versus modality-specific stimuli. However, the event-related potential (ERP) counterparts of behavioral multisensory enhancement are not well known. We recorded ERPs and behavioral data from 14 healthy volunteers with three types of target stimuli (modality-specific, bimodal, and trimodal) to examine the spatio-temporal electrophysiological characteristics of multisensory enhancement by comparing behavioral data with ERPs. We found a strong correlation between P3 latency and behavioral performance in terms of reaction time (RT) (R = 0.98, P <0.001), suggesting that P3 latency constitutes a temporal measure of behavioral multisensory enhancement. In addition, a fast RT and short P3 latency were found when comparing the modality-specific visual target with the modality-specific auditory and somatosensory targets. Our results indicate that behavioral multisensory enhancement can be identified by the latency and source distribution of the P3 component. These findings may advance our understanding of the neuronal mechanisms of multisensory enhancement.	t	\N
24306440	Vection is the illusion of self-motion in the absence of real physical movement. The aim of the present study was to analyze how multisensory inputs (visual and auditory) contribute to the perception of vection. Participants were seated in a stationary position in front of a large, curved projection display and were exposed to a virtual scene that constantly rotated around the yaw-axis, simulating a 360° rotation. The virtual scene contained either only visual, only auditory, or a combination of visual and auditory cues. Additionally, simulated rotation speed (90°/s vs. 60°/s) and the number of sound sources (1 vs. 3) were varied for all three stimulus conditions. All participants were exposed to every condition in a randomized order. Data specific to vection latency, vection strength, the severity of motion sickness (MS), and postural steadiness were collected. Results revealed reduced vection onset latencies and increased vection strength when auditory cues were added to the visual stimuli, whereas MS and postural steadiness were not affected by the presence of auditory cues. Half of the participants reported experiencing auditorily induced vection, although the sensation was rather weak and less robust than visually induced vection. Results demonstrate that the combination of visual and auditory cues can enhance the sensation of vection.	t	\N
24313638	This study examined the time-course of reading single words in children and adults using masked repetition priming and the recording of event-related potentials. The N250 and N400 repetition priming effects were used to characterize form- and meaning-level processing, respectively. Children had larger amplitude N250 effects than adults for both shorter and longer duration primes. Children did not differ from adults on the N400 effect. The difference on the N250 suggests that automaticity for form processing is still maturing in children relative to adults, while the lack of differentiation on the N400 effect suggests that meaning processing is relatively mature by late childhood. The overall similarity in the children's repetition priming effects to adults' effects is in line with theories of reading acquisition, according to which children rapidly transition to an orthographic strategy for fast access to semantic information from print.	t	\N
24315729	Goal-directed behavior is well-known to show declines in elderly individuals, possibly because of alterations in dopaminergic neural transmission. The dopaminergic system is modulated by a number of other different factors. One of these factors, which has attracted a considerable amount of interest in neurobiology, but has only rarely been examined with respect to its possible modulatory role for cognitive functions in elderly individuals, is latent Toxoplasma gondii (T. gondii) infection. Latent T. gondii infection may be of relevance to goal-directed behavior as it alters dopaminergic neural transmission. We examine goal-directed behavior in T. gondii IgG positive and negative elderly subjects in auditory distraction paradigm. We apply event-related potentials to examine which cognitive subprocesses are affected by latent T. gondii infection on a neurophysiological level. We show that latent T. gondii infection compromises the management of auditory distraction in elderly by specifically delaying processes of attentional allocation and disengagement. The results show that latent T. gondii infection is neglected but an important neurobiological modulator of cognitive functions in elderly individuals.	t	\N
24320112	There are increasing reports that individual variation in behavioral and neurophysiological measures of infant speech processing predicts later language outcomes, and specifically concurrent or subsequent vocabulary size. If such findings are held up under scrutiny, they could both illuminate theoretical models of language development and contribute to the prediction of communicative disorders. A qualitative, systematic review of this emergent literature illustrated the variety of approaches that have been used and highlighted some conceptual problems regarding the measurements. A quantitative analysis of the same data established that the bivariate relation was significant, with correlations of similar strength to those found for well-established nonlinguistic predictors of language. Further exploration of infant speech perception predictors, particularly from a methodological perspective, is recommended.	t	\N
24321583	The voice is a primary work tool for call center operators, but the main risk factors for voice disorders in this category have not yet been clarified. This study aimed to analyze the vocal behavior in call center operators and search for correlations between the daily voice dose and the self-perceived voice-related handicap. Prospective. Ninety-two call center operators (aged 24-50 years) underwent ambulatory phonation monitoring during a working day and were administered the Voice Handicap Index (VHI) questionnaire and a questionnaire concerning smoking habits, symptoms, and extrawork activities requiring intensive voice use. Mean percentage phonation time (PT) during work was 14.74% and ranged from 4% to 31%. There was a significant difference between the percentage PT in working time and in extrawork time; however, subjects with high percentage PT in working time maintained a high value also in extrawork time. The mean PT was 87.5 ± 35.8 minutes and was not correlated with age, gender, number of work hours, symptoms, extraprofessional voice use, and VHI scores. The mean amplitude was significantly higher in subjects with longer PT and higher pitch (P < 0.001). VHI score (median = 9) was slightly higher than in the general population but not related to the number of work hours, indicating that work time was not a critical factor in causing the perception of voice problems. Our study provides data about the voice behavior of a large cohort of call center operators and demonstrates that the number of work hours and the percentage PT are not statistically related to the perception of voice disturbances in this working category.	t	\N
24323099	In the study of the spatial characteristics of the visual channels, the power spectrum model of visual masking is one of the most widely used. When the task is to detect a signal masked by visual noise, this classical model assumes that the signal and the noise are previously processed by a bank of linear channels and that the power of the signal at threshold is proportional to the power of the noise passing through the visual channel that mediates detection. The model also assumes that this visual channel will have the highest ratio of signal power to noise power at its output. According to this, there are masking conditions where the highest signal-to-noise ratio (SNR) occurs in a channel centered in a spatial frequency different from the spatial frequency of the signal (off-frequency looking). Under these conditions the channel mediating detection could vary with the type of noise used in the masking experiment and this could affect the estimation of the shape and the bandwidth of the visual channels. It is generally believed that notched noise, white noise and double bandpass noise prevent off-frequency looking, and high-pass, low-pass and bandpass noises can promote it independently of the channel's shape. In this study, by means of a procedure that finds the channel that maximizes the SNR at its output, we performed numerical simulations using the power spectrum model to study the characteristics of masking caused by six types of one-dimensional noise (white, high-pass, low-pass, bandpass, notched, and double bandpass) for two types of channel's shape (symmetric and asymmetric). Our simulations confirm that (1) high-pass, low-pass, and bandpass noises do not prevent the off-frequency looking, (2) white noise satisfactorily prevents the off-frequency looking independently of the shape and bandwidth of the visual channel, and interestingly we proved for the first time that (3) notched and double bandpass noises prevent off-frequency looking only when the noise cutoffs around the spatial frequency of the signal match the shape of the visual channel (symmetric or asymmetric) involved in the detection. In order to test the explanatory power of the model with empirical data, we performed six visual masking experiments. We show that this model, with only two free parameters, fits the empirical masking data with high precision. Finally, we provide equations of the power spectrum model for six masking noises used in the simulations and in the experiments.	t	\N
24336606	Sensory-specific cortices appear to be sensitive to information from another modality. Here we investigate whether the human brain automatically extracts the phonological information in visual words in early visual processing. We continuously presented native Chinese speakers peripherally with Chinese homophone characters in an oddball paradigm, while they performed a visual detection task presented in the centre of the visual field. We found the lexical tone phonology embedded in the characters is processed automatically by the brain of native speakers, as revealed by whole-head electrical recordings of the mismatch negativity (MMN). Source solution further revealed the MMN involved the neural activations from the visual cortex to the auditory cortex (130-460 ms). The spatial-temporal dynamics indicate a visual-auditory interaction in the early, automatic processing of phonological information in visual words.	t	\N
24349800	Perceptual learning can be specific to a trained stimulus or optimally generalized to novel stimuli with the breadth of generalization being imperative for how we structure perceptual training programs. Adapting an established auditory interval discrimination paradigm to utilise complex signals, we trained human adults on a standard interval for either 2, 4, or 10 days. We then tested the standard, alternate frequency, interval, and stereo input conditions to evaluate the rapidity of specific learning and breadth of generalization over the time course. In comparison with previous research using simple stimuli, the speed of perceptual learning and breadth of generalization were more rapid and greater in magnitude, including novel generalization to an alternate temporal interval within stimulus type. We also investigated the long term maintenance of learning and found that specific and generalized learning was maintained over 3 and 6 months. We discuss these findings regarding stimulus complexity in perceptual learning and how they can inform the development of effective training protocols.	t	\N
24361298	Prestin, the motor protein of cochlear outer hair cells, was identified 14 years ago. Prestin-based outer hair cell motility is responsible for the exquisite sensitivity and frequency selectivity seen in the mammalian cochlea. Prestin is the 5th member of an eleven-member membrane transporter superfamily of SLC26A proteins. Unlike its paralogs, which are capable of transporting anions across the cell membrane, prestin primarily functions as a motor protein with unique capability of performing direct and reciprocal electromechanical conversion on microsecond time scale. Significant progress in the understanding of its structure and the molecular mechanism has been made in recent years using electrophysiological, biochemical, comparative genomics, structural bioinformatics, molecular dynamics simulation, site-directed mutagenesis and domain-swapping techniques. This article reviews recent advances of the structural and functional properties of prestin with focus on the areas that are critical but still controversial in understanding the molecular mechanism of how prestin works: The structural domains for voltage sensing and interaction with anions and for conformational change. Future research directions and potential application of prestin are also discussed. This article is part of a Special Issue entitled <Annual Reviews 2014>.	t	\N
24365660	SIRT1 is a highly conserved NAD(+)-dependent protein deacetylase known to have protective effects against a variety of age-related diseases. However, there is a lack of information concerning SIRT1 expression in the cochlea and auditory cortex of C57BL/6 mice, a mouse model of age-related hearing loss. Using RT-PCR and immunohistochemistry, we show that SIRT1 is abundantly expressed in the inner hair cells, strial marginal cells, strial intermediate cells, type I and type IV fibrocytes of the spiral ligament and spiral ganglion neurons. In addition, moderate SIRT1 is also detected in the outer hair cells and neurons of the auditory cortex. Associated with elevated hearing thresholds and hair cells loss during aging, there is also a significant reduction of SIRT1 expression in the cochlea and auditory cortex. The expression pattern of SIRT1 in the peripheral and central auditory system suggests that SIRT1 may play an important role in auditory function and therefore may serve as a protective molecule against age-related hearing loss.	t	\N
24383225	In current terminology, auditory neuropathy spectrum disorder (ANSD) is a disease involving the disruption of the temporal coding of acoustic signals in auditory nerve fibres, resulting in the impairment of auditory perceptions that rely on temporal cues. There is debate about almost every aspect of the disorder, including aetiology, lesion sites, and the terminology used to describe it. ANSD is a heterogeneous disease despite similar audiological findings. The absence of an auditory brainstem response (ABR) and the presence of otoacoustic emissions (OAE) suggest an ANSD profile. However, to determine the exact anatomical site of the disorder, more in-depth audiological and electrophysiological tests must be combined with imaging, genetics and neurological examinations. Greater diagnostic specificity is therefore needed to provide these patients with more adequate treatment.	t	\N
24384079	Previous research has suggested that electrically coupled frequency modulation (FM) systems substantially improved speech-recognition performance in noise in individuals with cochlear implants (CIs). However, there is limited evidence to support the use of electromagnetically coupled (neck loop) FM receivers with contemporary CI sound processors containing telecoils. The primary goal of this study was to compare speech-recognition performance in noise and subjective ratings of adolescents and adults using one of three contemporary CI sound processors coupled to electromagnetically and electrically coupled FM receivers from Oticon. A repeated-measures design was used to compare speech-recognition performance in noise and subjective ratings without and with the FM systems across three test sessions (Experiment 1) and to compare performance at different FM-gain settings (Experiment 2). Descriptive statistics were used in Experiment 3 to describe output differences measured through a CI sound processor. Experiment 1 included nine adolescents or adults with unilateral or bilateral Advanced Bionics Harmony (n = 3), Cochlear Nucleus 5 (n = 3), and MED-EL OPUS 2 (n = 3) CI sound processors. In Experiment 2, seven of the original nine participants were tested. In Experiment 3, electroacoustic output was measured from a Nucleus 5 sound processor when coupled to the electromagnetically coupled Oticon Arc neck loop and electrically coupled Oticon R2. In Experiment 1, participants completed a field trial with each FM receiver and three test sessions that included speech-recognition performance in noise and a subjective rating scale. In Experiment 2, participants were tested in three receiver-gain conditions. Results in both experiments were analyzed using repeated-measures analysis of variance. Experiment 3 involved electroacoustic-test measures to determine the monitor-earphone output of the CI alone and CI coupled to the two FM receivers. The results in Experiment 1 suggested that both FM receivers provided significantly better speech-recognition performance in noise than the CI alone; however, the electromagnetically coupled receiver provided significantly better speech-recognition performance in noise and better ratings in some situations than the electrically coupled receiver when set to the same gain. In Experiment 2, the primary analysis suggested significantly better speech-recognition performance in noise for the neck-loop versus electrically coupled receiver, but a second analysis, using the best performance across gain settings for each device, revealed no significant differences between the two FM receivers. Experiment 3 revealed monitor-earphone output differences in the Nucleus 5 sound processor for the two FM receivers when set to the +8 setting used in Experiment 1 but equal output when the electrically coupled device was set to a +16 gain setting and the electromagnetically coupled device was set to the +8 gain setting. Individuals with contemporary sound processors may show more favorable speech-recognition performance in noise electromagnetically coupled FM systems (i.e., Oticon Arc), which is most likely related to the input processing and signal processing pathway within the CI sound processor for direct input versus telecoil input. Further research is warranted to replicate these findings with a larger sample size and to develop and validate a more objective approach to fitting FM systems to CI sound processors.	t	\N
24384084	To validate diagnostic pure-tone audiometry in schools without a sound-treated environment using an audiometer that incorporates insert earphones covered by circumaural earcups and real-time environmental noise monitoring. A within-subject repeated measures design was employed to compare air (250 to 8000 Hz) and bone (250 to 4000 Hz) conduction pure-tone thresholds measured in natural school environments with thresholds measured in a sound-treated booth. 149 children (54% female) with an average age of 6.9 yr (SD = 0.6; range = 5-8). Average difference between the booth and natural environment thresholds was 0.0 dB (SD = 3.6) for air conduction and 0.1 dB (SD = 3.1) for bone conduction. Average absolute difference between the booth and natural environment was 2.1 dB (SD = 2.9) for air conduction and 1.6 dB (SD = 2.7) for bone conduction. Almost all air- (96%) and bone-conduction (97%) threshold comparisons between the natural and booth test environments were within 0 to 5 dB. No statistically significant differences between thresholds recorded in the natural and booth environments for air- and bone-conduction audiometry were found (p > 0.01). Diagnostic air- and bone-conduction audiometry in schools, without a sound-treated room, is possible with sufficient earphone attenuation and real-time monitoring of environmental noise. Audiological diagnosis on-site for school screening may address concerns of false-positive referrals and poor follow-up compliance and allow for direct referral to audiological and/or medical intervention.	t	\N
24386403	The objective was to determine if one of the neural temporal features, neural adaptation, can account for the across-subject variability in behavioral measures of temporal processing and speech perception performance in cochlear implant (CI) recipients. Neural adaptation is the phenomenon in which neural responses are the strongest at the beginning of the stimulus and decline following stimulus repetition (e.g., stimulus trains). It is unclear how this temporal property of neural responses relates to psychophysical measures of temporal processing (e.g., gap detection) or speech perception. The adaptation of the electrical compound action potential (ECAP) was obtained using 1000 pulses per second (pps) biphasic pulse trains presented directly to the electrode. The adaptation of the late auditory evoked potential (LAEP) was obtained using a sequence of 1-kHz tone bursts presented acoustically, through the cochlear implant. Behavioral temporal processing was measured using the Random Gap Detection Test at the most comfortable listening level. Consonant nucleus consonant (CNC) word and AzBio sentences were also tested. The results showed that both ECAP and LAEP display adaptive patterns, with a substantial across-subject variability in the amount of adaptation. No correlations between the amount of neural adaptation and gap detection thresholds (GDTs) or speech perception scores were found. The correlations between the degree of neural adaptation and demographic factors showed that CI users having more LAEP adaptation were likely to be those implanted at a younger age than CI users with less LAEP adaptation. The results suggested that neural adaptation, at least this feature alone, cannot account for the across-subject variability in temporal processing ability in the CI users. However, the finding that the LAEP adaptive pattern was less prominent in the CI group compared to the normal hearing group may suggest the important role of normal adaptation pattern at the cortical level in speech perception.	t	\N
24389260	The left temporal pole (LTP) has been posited to be a heteromodal hub for retrieving proper names for semantically unique entities. Previous investigations have demonstrated that LTP is important for retrieving names for famous faces and unique landmarks. However, whether such a relationship would hold for unique entities apprehended through stimulus modalities other than vision has not been well established, and such evidence is critical to adjudicate claims about the "heteromodal" nature of the LTP. Here, we tested the hypothesis that the LTP would be important for naming famous voices. Individuals with LTP lesions were asked to recognize and name famous persons speaking in audio clips. Relative to neurologically normal and brain damaged comparison participants, patients with LTP lesions were able to recognize famous persons from their voices normally, but were selectively impaired in naming famous persons from their voices. The current results extend previous research and provide further support for the notion that the LTP is a convergence region serving as a heteromodal hub for retrieving the names of semantically unique entities.	t	\N
24391928	The brain is able to realign asynchronous signals that approximately coincide in both space and time. Given that many experience-based links between visual and auditory stimuli are established in the absence of spatiotemporal proximity, we investigated whether or not temporal realignment arises in these conditions. Participants received a 3-min exposure to visual and auditory stimuli that were separated by 706 ms and appeared either from the same (Experiment 1) or from different spatial positions (Experiment 2). A simultaneity judgment task (SJ) was administered right afterwards. Temporal realignment between vision and audition was observed, in both Experiment 1 and 2, when comparing the participants' SJs after this exposure phase with those obtained after a baseline exposure to audiovisual synchrony. However, this effect was present only when the visual stimuli preceded the auditory stimuli during the exposure to asynchrony. A similar pattern of results (temporal realignment after exposure to visual-leading asynchrony but not after exposure to auditory-leading asynchrony) was obtained using temporal order judgments (TOJs) instead of SJs (Experiment 3). Taken together, these results suggest that temporal recalibration still occurs for visual and auditory stimuli that fall clearly outside the so-called temporal window for multisensory integration and appear from different spatial positions. This temporal realignment may be modulated by long-term experience with the kind of asynchrony (vision-leading) that we most frequently encounter in the outside world (e.g., while perceiving distant events).	t	\N
24398259	There is currently no consensus regarding what measures are most valid to demonstrate perceptual processing without awareness. Likewise, whether conscious perception and unconscious processing rely on independent mechanisms or lie on a continuum remains a matter of debate. Here, we addressed these issues by comparing the time courses of subjective reports, objective discrimination performance and response priming during meta-contrast masking, under similar attentional demands. We found these to be strikingly similar, suggesting that conscious perception and unconscious processing cannot be dissociated by their time course. Our results also demonstrate that unconscious processing, indexed by response priming, occurs, and that objective discrimination performance indexes the same conscious processes as subjective visibility reports. Finally, our results underscore the role of attention by showing that how much attention the stimulus receives relative to the mask, rather than whether processing is measured by conscious discrimination or by priming, determines the time course of meta-contrast masking.	t	\N
24402676	While deafness-induced plasticity has been investigated in the visual and auditory domains, not much is known about language processing in audiovisual multimodal environments for patients with restored hearing via cochlear implant (CI) devices. Here, we examined the effect of agreeing or conflicting visual inputs on auditory processing in deaf patients equipped with degraded artificial hearing. Ten post-lingually deafened CI users with good performance, along with matched control subjects, underwent H 2 (15) O-positron emission tomography scans while carrying out a behavioral task requiring the extraction of speech information from unimodal auditory stimuli, bimodal audiovisual congruent stimuli, and incongruent stimuli. Regardless of congruency, the control subjects demonstrated activation of the auditory and visual sensory cortices, as well as the superior temporal sulcus, the classical multisensory integration area, indicating a bottom-up multisensory processing strategy. Compared to CI users, the control subjects exhibited activation of the right ventral premotor-supramarginal pathway. In contrast, CI users activated primarily the visual cortices more in the congruent audiovisual condition than in the null condition. In addition, compared to controls, CI users displayed an activation focus in the right amygdala for congruent audiovisual stimuli. The most notable difference between the two groups was an activation focus in the left inferior frontal gyrus in CI users confronted with incongruent audiovisual stimuli, suggesting top-down cognitive modulation for audiovisual conflict. Correlation analysis revealed that good speech performance was positively correlated with right amygdala activity for the congruent condition, but negatively correlated with bilateral visual cortices regardless of congruency. Taken together these results suggest that for multimodal inputs, cochlear implant users are more vision-reliant when processing congruent stimuli and are disturbed more by visual distractors when confronted with incongruent audiovisual stimuli. To cope with this multimodal conflict, CI users activate the left inferior frontal gyrus to adopt a top-down cognitive modulation pathway, whereas normal hearing individuals primarily adopt a bottom-up strategy.	t	\N
24405904	Although HIV is associated with decreased emotional and cognitive functioning, the mechanisms through which affective changes can alter cognitive processes in HIV-infected individuals are unknown. We aimed to clarify this question through testing the extent to which emotionally negative stimuli prime attention to a subsequent infrequently occurring auditory tone in HIV+ compared to HIV- females. Attention to emotional compared to non-emotional pictures was measured via the LPP ERP. Subsequent attention was indexed through the N1 and late processing negativity ERP. We also assessed mood and cognitive functioning in both groups. In HIV- females, emotionally negative pictures, compared to neutral pictures, resulted in an enhanced LPP to the pictures and an enhanced N1 to subsequent tones. The HIV+ group did not show a difference in the LPP measure between picture categories, and accordingly, did not show a priming effect to the subsequent infrequent tones. The ERP findings, combined with neuropsychological deficits, suggest that HIV+ females show impairments in attention to emotionally-laden stimuli and that this impairment might be related to a loss of affective priming. This study is the first to provide physiological evidence that the LPP, a measure of attention to emotionally-charged visual stimuli, is reduced in HIV-infected individuals. These results set the stage for future work aimed at localizing brain activation to emotional stimuli in HIV+ individuals.	t	\N
24414279	Speech production, both overt and covert, down-regulates the activation of auditory cortex. This is thought to be due to forward prediction of the sensory consequences of speech, contributing to a feedback control mechanism for speech production. Critically, however, these regulatory effects should be specific to speech content to enable accurate speech monitoring. To determine the extent to which such forward prediction is content-specific, we recorded the brain's neuromagnetic responses to heard multisyllabic pseudowords during covert rehearsal in working memory, contrasted with a control task. The cortical auditory processing of target syllables was significantly suppressed during rehearsal compared with control, but only when they matched the rehearsed items. This critical specificity to speech content enables accurate speech monitoring by forward prediction, as proposed by current models of speech production. The one-to-one phonological motor-to-auditory mappings also appear to serve the maintenance of information in phonological working memory. Further findings of right-hemispheric suppression in the case of whole-item matches and left-hemispheric enhancement for last-syllable mismatches suggest that speech production is monitored by 2 auditory-motor circuits operating on different timescales: Finer grain in the left versus coarser grain in the right hemisphere. Taken together, our findings provide hemisphere-specific evidence of the interface between inner and heard speech.	t	\N
24423729	Across the animal kingdom, sensations resulting from an animal's own actions are processed differently from sensations resulting from external sources, with self-generated sensations being suppressed. A forward model has been proposed to explain this process across sensorimotor domains. During vocalization, reduced processing of one's own speech is believed to result from a comparison of speech sounds to corollary discharges of intended speech production generated from efference copies of commands to speak. Until now, anatomical and functional evidence validating this model in humans has been indirect. Using EEG with anatomical MRI to facilitate source localization, we demonstrate that inferior frontal gyrus activity during the 300ms before speaking was associated with suppressed processing of speech sounds in auditory cortex around 100ms after speech onset (N1). These findings indicate that an efference copy from speech areas in prefrontal cortex is transmitted to auditory cortex, where it is used to suppress processing of anticipated speech sounds. About 100ms after N1, a subsequent auditory cortical component (P2) was not suppressed during talking. The combined N1 and P2 effects suggest that although sensory processing is suppressed as reflected in N1, perceptual gaps may be filled as reflected in the lack of P2 suppression, explaining the discrepancy between sensory suppression and preserved sensory experiences. These findings, coupled with the coherence between relevant brain regions before and during speech, provide new mechanistic understanding of the complex interactions between action planning and sensory processing that provide for differentiated tagging and monitoring of one's own speech, processes disrupted in neuropsychiatric disorders.	t	\N
24424293	To assess the role of the efferent auditory system by inhibition of contralateral otoacoustic emission in dyslexic children with auditory processing disorders. The study sample was 34 children: 17 with dyslexia and 17 age-matched controls. Sensitive speech tests (low-pass filtered, time-compressed, distorted and dichotic) were performed to assess coexisting auditory processing disorder. Distortion-product otoacoustic emission (DPOAE) values were measured in basal condition and with contralateral broadband noise signal delivered via an earphone transducer at 60 dB SPL. The lower scores at sensitive speech testing confirmed the association of an auditory processing disorder in the dyslexic children. DPOAE values were significantly attenuated by contralateral inhibition only in the control group (p=0.001; dyslexics, p=0.19); attenuation was not significant at any frequency in the dyslexic group. The differences in DPOAE attenuation between the groups, although not statistically significant, suggest alterations in the auditory efferent system in the dyslexic population. These alterations may affect language perception. If confirmed in further studies with larger samples, these results could provide insight into a possible pathophysiological background of dyslexia.	t	\N
24434131	Cochlear implantation provides children with a significant hearing loss the potential to engage in phonological processing via audition; however these children can still have poor or inadequately detailed mental (phonological) representations of speech and as such phonological awareness and reading difficulties. Heterogeneous participant profiles, particularly varying modes of communication have clouded the research regarding reading outcomes of children using a cochlear implant. The objective of this study was to explore the relationships between word reading and reading comprehension outcomes, and a range of variables of a relatively homogenous group of children using cochlear implants. Forty-seven oral communicating children using a cochlear implant and who had attended auditory-verbal therapy served as participants. They were administered a comprehensive battery of 10 different assessments covering 22 different tasks across the domains of speech perception, speech production, language, phonological processing and reading. Correlation and principal component analyses were used to examine the relationships between outcome areas. Audiologic and demographic variables were not significantly related to reading outcomes, with the exception of family size. Language and word reading were most strongly related to reading comprehension, while phonological awareness and language were most strongly related to word reading. It is proposed that the development of well-specified phonological representations might underlie these relationships. For oral communicating children using a cochlear implant, good reading outcomes are linked to better language and phonological processing abilities.	t	\N
24437764	Preferences of concert hall acoustics are explored with preference mapping. The investigation is performed on previously gathered data from individual vocabulary profiling of nine concert halls and three pieces of symphonic music, namely, excerpts of compositions by Beethoven, Bruckner, and Mozart. Individual preferences are regressed onto a latent three-dimensional sensory space obtained by multiple factor analysis of descriptive sensory data. Overlaying individually estimated preference surfaces onto one another produces preference maps which illustrates both the overall preference of the stimuli as well as differences between individual listeners. A comparison of the maps between music motifs illustrates how each music signal affects the weighting of different acoustical qualities in preference judgments. Differences in preferences between individuals are pronounced in the excerpts of Beethoven and Bruckner, while the responses are more homogeneous for Mozart music motif. Overall, proximity is identified as the main aspect associated with preference, but also loudness, envelopment, and bass are important. A correlation analysis of objective parameters and subjective perceptions substantiates the importance of lateral sound energy for good concert hall acoustics. Particularly, the lateral early energy fraction at high frequencies is found to be associated with the perception of proximity, and hence, also with preference.	t	\N
24437852	The purpose of this study was to determine the overall impact of early and late reflections on the intelligibility of reverberated speech by cochlear implant listeners. Two specific reverberation times were assessed. For each reverberation time, sentences were presented in three different conditions wherein the target signal was filtered through the early, late or entire part of the acoustic impulse response. Results obtained with seven cochlear implant listeners indicated that while early reflections neither enhanced nor reduced overall speech perception performance, late reflections severely reduced speech intelligibility in both reverberant conditions tested.	t	\N
24453339	The temporal envelope of speech is an important cue contributing to speech intelligibility. Theories about the neural foundations of speech perception postulate that the left and right auditory cortices are functionally specialized in analyzing speech envelope information at different time scales: the right hemisphere is thought to be specialized in processing syllable rate modulations, whereas a bilateral or left hemispheric specialization is assumed for phoneme rate modulations. Recently, it has been found that this functional hemispheric asymmetry is different in individuals with language-related disorders such as dyslexia. Most studies were, however, performed in adults and school-aged children, and only a little is known about how neural auditory processing at these specific rates manifests and develops in very young children before reading acquisition. Yet, studying hemispheric specialization for processing syllable and phoneme rate modulations in preliterate children may reveal early neural markers for dyslexia. In the present study, human cortical evoked potentials to syllable and phoneme rate modulations were measured in 5-year-old children at high and low hereditary risk for dyslexia. The results demonstrate a right hemispheric preference for processing syllable rate modulations and a symmetric pattern for phoneme rate modulations, regardless of hereditary risk for dyslexia. These results suggest that, while hemispheric specialization for processing syllable rate modulations seems to be mature in prereading children, hemispheric specialization for phoneme rate modulation processing may still be developing. These findings could have important implications for the development of phonological and reading skills.	t	\N
24456178	Spectral modulation detection (SMD) provides a psychoacoustic estimate of spectral resolution. The SMD threshold for an implanted ear is highly correlated with speech understanding and is thus a non-linguistic, psychoacoustic index of speech understanding. This measure, however, is time and equipment intensive and thus not practical for clinical use. Thus the purpose of the current study was to investigate the efficacy of a quick SMD task with the following three study aims: (1) to investigate the correlation between the long psychoacoustic, and quick SMD tasks, (2) to determine the test/retest variability of the quick SMD task, and (3) to evaluate the relationship between the quick SMD task and speech understanding. This study included a within-subjects, repeated-measures design. Seventy-six adult cochlear implant recipients participated. The results were as follows: (1) there was a significant correlation between the long psychoacoustic, and quick SMD tasks, (2) the test-retest variability of the quick SMD task was highly significant and, (3) there was a significant positive correlation between the quick SMD task and monosyllabic word recognition. The results of this study represent the direct clinical translation of a research-proven task of SMD into a quick, clinically feasible format.	t	\N
24457086	Due to extensive variability in the phonetic realizations of words, there may be few or no proximal spectro-temporal cues that identify a word's onset or even its presence. Dilley and Pitt (2010) showed that the rate of context speech, distal from a to-be-recognized word, can have a sizeable effect on whether or not a word is perceived. This investigation considered whether there is a distinct role for distal rhythm in the disappearing word effect. Listeners heard sentences that had a grammatical interpretation with or without a critical function word (FW) and transcribed what they heard (e.g., are in Jill got quite mad when she heard there are birds can be removed and Jill got quite mad when she heard their birds is still grammatical). Consistent with a perceptual grouping hypothesis, participants were more likely to report critical FWs when distal rhythm (repeating ternary or binary pitch patterns) matched the rhythm in the FW-containing region than when it did not. Notably, effects of distal rhythm and distal rate were additive. Results demonstrate a novel effect of distal rhythm on the amount of lexical material listeners hear, highlighting the importance of distal timing information and providing new constraints for models of spoken word recognition.	t	\N
24464088	Bimodal stimulation, or stimulation of a cochlear implant (CI) together with a contralateral hearing aid (HA), can improve speech perception in noise However, this benefit is variable, and some individuals even experience interference with bimodal stimulation. One contributing factor to this variability may be differences in binaural spectral integration (BSI) due to abnormal auditory experience. CI programming introduces interaural pitch mismatches, in which the frequencies allocated to the electrodes (and contralateral HA) differ from the electrically stimulated cochlear frequencies. Previous studies have shown that some, but not all, CI users adapt pitch perception to reduce this mismatch. The purpose of this study was to determine whether broadened BSI may also reduce the perception of mismatch. Interaural pitch mismatches and dichotic pitch fusion ranges were measured in 21 bimodal CI users. Seventeen subjects with wide fusion ranges also conducted a task to pitch match various fused electrode-tone pairs. All subjects showed abnormally wide dichotic fusion frequency ranges of 1-4 octaves. The fusion range size was weakly correlated with the interaural pitch mismatch, suggesting a link between broad binaural pitch fusion and large interaural pitch mismatch. Dichotic pitch averaging was also observed, in which a new binaural pitch resulted from the fusion of the original monaural pitches, even when the pitches differed by as much as 3-4 octaves. These findings suggest that abnormal BSI, indicated by broadened fusion ranges and spectral averaging between ears, may account for speech perception interference and nonoptimal integration observed with bimodal compared with monaural hearing device use.	t	\N
24465675	Perception of our environment is a multisensory experience; information from different sensory systems like the auditory, visual and tactile is constantly integrated. Complex tasks that require high temporal and spatial precision of multisensory integration put strong demands on the underlying networks but it is largely unknown how task experience shapes multisensory processing. Long-term musical training is an excellent model for brain plasticity because it shapes the human brain at functional and structural levels, affecting a network of brain areas. In the present study we used magnetoencephalography (MEG) to investigate how audio-tactile perception is integrated in the human brain and if musicians show enhancement of the corresponding activation compared to non-musicians. Using a paradigm that allowed the investigation of combined and separate auditory and tactile processing, we found a multisensory incongruency response, generated in frontal, cingulate and cerebellar regions, an auditory mismatch response generated mainly in the auditory cortex and a tactile mismatch response generated in frontal and cerebellar regions. The influence of musical training was seen in the audio-tactile as well as in the auditory condition, indicating enhanced higher-order processing in musicians, while the sources of the tactile MMN were not influenced by long-term musical training. Consistent with the predictive coding model, more basic, bottom-up sensory processing was relatively stable and less affected by expertise, whereas areas for top-down models of multisensory expectancies were modulated by training.	t	\N
24482186	Tinnitus is described as the perception of sound or noise in the absence of real acoustic stimulation. In the current absence of a cure for tinnitus, clinical management typically focuses on reducing the effects of co-morbid symptoms such as distress or hearing loss. Hearing loss is commonly co-morbid with tinnitus and so logic implies that amplification of external sounds by hearing aids will reduce perception of the tinnitus sound and the distress associated with it. To assess the effects of hearing aids specifically in terms of tinnitus benefit in patients with tinnitus and co-existing hearing loss. We searched the Cochrane Ear, Nose and Throat Disorders Group Trials Register; the Cochrane Central Register of Controlled Trials (CENTRAL); PubMed; EMBASE; CINAHL; Web of Science; Cambridge Scientific Abstracts; ICTRP and additional sources for published and unpublished trials. The date of the search was 19 August 2013. Randomised controlled trials and non-randomised controlled trials recruiting adults with subjective tinnitus and some degree of hearing loss, where the intervention involves amplification with hearing aids and this is compared to interventions involving other medical devices, other forms of standard or complementary therapy, or combinations of therapies, no intervention or placebo interventions. Three authors independently screened all selected abstracts. Two authors independently extracted data and assessed those potentially suitable studies for risk of bias. For studies meeting the inclusion criteria, we used the mean difference (MD) to compare hearing aids with other interventions and controls. One randomised controlled trial (91 participants) was included in this review. We judged the trial to have a low risk of bias for method of randomisation and outcome reporting, and an unclear risk of bias for other criteria. No non-randomised controlled trials meeting our inclusion criteria were identified. The included study measured change in tinnitus severity (primary measure of interest) using a tinnitus questionnaire measure, and change in tinnitus loudness (secondary measure of interest) on a visual analogue scale. Other secondary outcome measures of interest, namely change in the psychoacoustic characteristics of tinnitus, change in self reported anxiety, depression and quality of life, and change in neurophysiological measures, were not investigated in this study. The included study compared hearing aid use to sound generator use. The estimated effect on change in tinnitus loudness or severity as measured by the Tinnitus Handicap Inventory score was compatible with benefits for both hearing aids or sound generators but no difference was found between the two alternative treatments (MD -0.90, 95% confidence interval (CI) -7.92 to 6.12) (100-point scale); moderate quality evidence. No negative or adverse events were reported. The current evidence base for hearing aid prescription for tinnitus is limited. To be useful, future studies should make appropriate use of blinding and be consistent in their use of outcome measures. Whilst hearing aids are sometimes prescribed as part of tinnitus management, there is currently no evidence to support or refute their use as a more routine intervention for tinnitus.	t	\N
24486809	Novel stimuli reliably attract attention, suggesting that novelty may disrupt performance when it is task-irrelevant. However, under certain circumstances novel stimuli can also elicit a general alerting response having beneficial effects on performance. In a series of experiments we investigated whether different aspects of novelty--stimulus novelty, contextual novelty, surprise, deviance, and relative complexity--lead to distraction or facilitation. We used a version of the visual oddball paradigm in which participants responded to an occasional auditory target. Participants responded faster to this auditory target when it occurred during the presentation of novel visual stimuli than of standard stimuli, especially at SOAs of 0 and 200 ms (Experiment 1). Facilitation was absent for both infrequent simple deviants and frequent complex images (Experiment 2). However, repeated complex deviant images did facilitate responses to the auditory target at the 200 ms SOA (Experiment 3). These findings suggest that task-irrelevant deviant visual stimuli can facilitate responses to an unrelated auditory target in a short 0-200 millisecond time-window after presentation. This only occurs when the deviant stimuli are complex relative to standard stimuli. We link our findings to the novelty P3, which is generated under the same circumstances, and to the adaptive gain theory of the locus coeruleus-norepinephrine system (Aston-Jones and Cohen, 2005), which may explain the timing of the effects.	t	\N
24496290	Growing evidence suggests that children who are deaf and use cochlear implants (CIs) can communicate effectively using spoken language. Research has reported that age of implantation and length of experience with the CI play an important role in a predicting a child's linguistic development. In recent years, the increase in the number of children receiving bilateral CIs (BiCIs) has led to interest in new variables that may also influence the development of hearing, speech, and language abilities, such as length of bilateral listening experience and the length of time between the implantation of the two CIs. One goal of the present study was to determine how a cohort of children with BiCIs performed on standardized measures of language and nonverbal cognition. This study examined the relationship between performance on language and nonverbal intelligence quotient (IQ) tests and the ages at implantation of the first CI and second CI. This study also examined whether early bilateral activation is related to better language scores. Children with BiCIs (n = 39; ages 4 to 9 years) were tested on two standardized measures, the Test of Language Development and the Leiter International Performance Scale-Revised, to evaluate their expressive/receptive language skills and nonverbal IQ/memory. Hierarchical regression analyses were used to evaluate whether BiCI hearing experience predicts language performance. While large intersubject variability existed, on average, almost all the children with BiCIs scored within or above normal limits on measures of nonverbal cognition. Expressive and receptive language scores were highly variable, less likely to be above the normative mean, and did not correlate with Length of first CI Use, defined as length of auditory experience with one cochlear implant, or Length of second CI Use, defined as length of auditory experience with two cochlear implants. All children in the present study had BiCIs. Most IQ scores were either at or above that found in the general population of typically hearing children. However, there was greater variability in their performance on a standardized test of expressive and receptive language. This cohort of children, who are mainstreamed in schools at age-appropriate grades, whose mothers' education is high, and whose families' socioecononomic status is high, had, as a group, on average, language scores within the same range as the normative sample of hearing children. Further research identifying the predictors that contribute to the high variability in both expressive and receptive language scores in children with BiCIs will provide useful information that can aid in clinical management and decision making.	t	\N
24497938	The vast majority of people experience involuntary musical imagery (INMI) or 'earworms'; perceptions of spontaneous, repetitive musical sound in the absence of an external source. The majority of INMI episodes are not bothersome, while some cause disruption ranging from distraction to anxiety and distress. To date, little is known about how the majority of people react to INMI, in particular whether evaluation of the experience impacts on chosen response behaviours or if attempts at controlling INMI are successful or not. The present study classified 1046 reports of how people react to INMI episodes. Two laboratories in Finland and the UK conducted an identical qualitative analysis protocol on reports of INMI reactions and derived visual descriptive models of the outcomes using grounded theory techniques. Combined analysis carried out across the two studies confirmed that many INMI episodes were considered neutral or pleasant, with passive acceptance and enjoyment being among the most popular response behaviours. A significant number of people, however, reported on attempts to cope with unwanted INMI. The most popular and effective behaviours in response to INMI were seeking out the tune in question, and musical or verbal distraction. The outcomes of this study contribute to our understanding of the aetiology of INMI, in particular within the framework of memory theory, and present testable hypotheses for future research on successful INMI coping strategies.	t	\N
24506533	We examined infants' oscillatory brain activity during a live interaction with an adult who showed them novel objects. Activation in the alpha frequency range was assessed. Nine-month-old infants responded with desynchronization of alpha-band activity when looking at an object together with an adult during a social interaction involving eye contact. When infant and experimenter only looked at the object without engaging in eye contact, no such effect was observed. Results are interpreted in terms of activation of a generic semantic knowledge system induced by eye contact during a social interaction.	t	\N
24518404	It was hypothesized that cochlear implant (CI) subjects would be able to correctly identify 1, 2, and 3 simultaneous pitches through direct electrical stimulation. We further hypothesized that the location on the implant array and the fundamental frequency of the pitches would have an impact on the performance. "They gave me back speech but not music" is a sentence commonly heard by CI subjects. One of the reasons is that in music, multiple streams are frequently played at the same time, which is an essential feature of harmony. Current CI speech processors do not allow CI users to perceive such complex polyphonic sounds. In the present study, the authors assessed the ability of CI subjects to perceive simultaneous modulation frequencies based on direct electrical stimulation. Ten CI subjects were asked to identify 1, 2, and 3 simultaneous pitches applied on different electrodes using sinusoidal amplitude modulation. All stimuli were loudness balanced before the actual identification task. Subjects were able to identify 1, 2, and 3 simultaneous pitches. The further the distance between the 2 electrodes, the better was the performance in the 2-pitch condition. The distance between the modulation frequencies had a significant effect on the performance in the 2-and 3-pitch condition. Subjects are able to identify complex polyphonic stimuli based on the number of active electrodes. The additional polyphonic rate pitch cue improves performance in some conditions.	t	\N
24553776	Sounds emitted by different sources arrive at our ears as a mixture that must be disentangled before meaningful information can be retrieved. It is still a matter of debate whether this decomposition happens automatically or requires the listener's attention. These opposite positions partly stem from different methodological approaches to the problem. We propose an integrative approach that combines the logic of previous measurements targeting either auditory stream segregation (interpreting a mixture as coming from two separate sources) or integration (interpreting a mixture as originating from only one source). By means of combined behavioral and event-related potential (ERP) measures, our paradigm has the potential to measure stream segregation and integration at the same time, providing the opportunity to obtain positive evidence of either one. This reduces the reliance on zero findings (i.e., the occurrence of stream integration in a given condition can be demonstrated directly, rather than indirectly based on the absence of empirical evidence for stream segregation, and vice versa). With this two-way approach, we systematically manipulate attention devoted to the auditory stimuli (by varying their task relevance) and to their underlying structure (by delivering perceptual tasks that require segregated or integrated percepts). ERP results based on the mismatch negativity (MMN) show no evidence for a modulation of stream integration by attention, while stream segregation results were less clear due to overlapping attention-related components in the MMN latency range. We suggest future studies combining the proposed two-way approach with some improvements in the ERP measurement of sequential stream segregation.	t	\N
24556905	The aim of this study was to investigate the efficacy of a direct acoustic cochlear implant (DACI) for speech understanding in noise in patients suffering from severe to profound mixed hearing loss (MHL) due to various etiologies compared to the preoperative best-aided condition. The study was performed at five tertiary referral centers in Europe (Belgium, Germany, Poland and Spain). Nineteen adult subjects with severe to profound MHL due to (advanced) otosclerosis, ear canal fibrosis, chronic otitis media, tympanosclerosis or previous cholesteatoma were implanted with a DACI (Codacs™ Investigational Device) combined with a conventional stapes prosthesis. Unaided and aided speech reception scores in quiet and in noise, preoperative and postoperative air and bone conduction thresholds and aided and unaided sound field thresholds were measured prospectively during the study. Subjective benefit analysis was determined through the Abbreviated Profile of Hearing Aid Benefit questionnaire. Quality of life was measured by the Health Utilities Index. All subjects were fitted preoperatively with hearing aids and/or a bone conduction implant on a headband before DACI implantation. This allows direct comparison between different hearing rehabilitation solutions. The mean speech reception threshold in noise improved significantly by 7.9 dB signal-to-noise ratio (SNR) after activation of the DACI compared to the preoperative best-aided condition. For all 19 subjects, a mean postoperative aided speech reception threshold of 2.6 dB SNR (standard deviation: 8.3 dB) was measured. On average, no significant shift in the bone conduction thresholds was noted 4-5 months after implantation. A mean sound field threshold improvement of 46 and 16 dB was measured compared to the preoperative unaided and best-aided condition, respectively. Speech perception tests in quiet showed a mean improvement of the word recognition scores by 65 and 48% at 65 dB SPL compared to the preoperative unaided and best-aided condition, respectively. In summary, DACI provides an effective improvement of the speech perception in noise compared to the best-aided condition in subjects suffering from severe to profound MHL.	t	\N
24559662	The minimum time interval between two stimuli that can be reliably detected is called the gap detection threshold. The present study examines whether an unconscious state, natural sleep affects the gap detection threshold. Event-related potentials were recorded in 10 young adults while awake and during all-night sleep to provide an objective estimate of this threshold. These subjects were presented with 2, 4, 8 or 16ms gaps occurring in 1.5 duration white noise. During wakefulness, a significant N1 was elicited for the 8 and 16ms gaps. N1 was difficult to observe during stage N2 sleep, even for the longest gap. A large P2 was however elicited and was significant for the 8 and 16ms gaps. Also, a later, very large N350 was elicited by the 16ms gap. An N1 and P2 was significant only for the 16ms gap during REM sleep. ERPs to gaps occurring in noise segments can therefore be successfully elicited during natural sleep. The gap detection threshold is similar in the waking and sleeping states.	t	\N
24561213	The present study investigated the joint impact of target-flanker similarity and of spatial frequency content on the crowding effect in letter identification. We presented spatial frequency filtered letters to neurologically intact non-dyslexic readers while manipulating target-flanker distance, target eccentricity and target-flanker confusability (letter similarity metric based on published letter confusion matrices). The results show that high target-flanker confusability magnifies crowding. They also reveal an intricate pattern of interactions of the spatial frequency content of the stimuli with target eccentricity, flanker distance and similarity. The findings are congruent with the notion that crowding results from the inappropriate pooling of target and flanker features and that this integration is more likely to match a response template at a subsequent decision stage with similar than dissimilar flankers. In addition, the evidence suggests that crowding from similar flankers is biased towards relatively high spatial frequencies and that crowding shifts towards lower spatial frequencies as target eccentricity is increased.	t	\N
24566800	Learning in perceptual tasks is typically highly specific to the trained stimulus parameters. However, can learning be specific to a stimulus parameter that is perceptually indistinguishable from another? We assessed this question using a perceived sound location task in which the perceived sound location was created through either an interaural time difference (ITD) cue or an interaural level difference (ILD) cue. We used the same transient, broadband sound (clicks) for both cues, and after training on one of the cues, listeners switched cue mid-session. This allowed us to assess cue specificity or transfer when the subjectively unnoticed cue switch occurred. One group of listeners improved their ITD performance as a function of training, but deteriorated in performance when switching to ILD in mid training session. Another group of listeners started with ILD training; their improved performance level did not deteriorate as they switched to the ITD cue. This transfer asymmetry was not hypothesized, and we therefore extended our study with a second data collection. Both the training effect and the transfer asymmetry remained after the second data collection. Our results indicate (a) listeners can improve both their ITD and ILD performance for click sounds, extending previous findings on tones; (b) learning can be specific to a stimulus parameter that is indistinguishable from another, as ITD learning did not transfer to ILD performance; but (c) ILD learning can transfer to ITD performance. This transfer asymmetry may have occurred because of how ITD and ILD are coded in early brainstem areas.	t	\N
24568205	The melodic contour of speech forms an important perceptual aspect of tonal and nontonal languages and an important limiting factor on the intelligibility of speech heard through a cochlear implant. Previous work exploring the neural correlates of speech comprehension identified a left-dominant pathway in the temporal lobes supporting the extraction of an intelligible linguistic message, whereas the right anterior temporal lobe showed an overall preference for signals clearly conveying dynamic pitch information [Johnsrude, I. S., Penhune, V. B., & Zatorre, R. J. Functional specificity in the right human auditory cortex for perceiving pitch direction. Brain, 123, 155-163, 2000; Scott, S. K., Blank, C. C., Rosen, S., & Wise, R. J. Identification of a pathway for intelligible speech in the left temporal lobe. Brain, 123, 2400-2406, 2000]. The current study combined modulations of overall intelligibility (through vocoding and spectral inversion) with a manipulation of pitch contour (normal vs. falling) to investigate the processing of spoken sentences in functional MRI. Our overall findings replicate and extend those of Scott et al. [Scott, S. K., Blank, C. C., Rosen, S., & Wise, R. J. Identification of a pathway for intelligible speech in the left temporal lobe. Brain, 123, 2400-2406, 2000], where greater sentence intelligibility was predominately associated with increased activity in the left STS, and the greatest response to normal sentence melody was found in right superior temporal gyrus. These data suggest a spatial distinction between brain areas associated with intelligibility and those involved in the processing of dynamic pitch information in speech. By including a set of complexity-matched unintelligible conditions created by spectral inversion, this is additionally the first study reporting a fully factorial exploration of spectrotemporal complexity and spectral inversion as they relate to the neural processing of speech intelligibility. Perhaps surprisingly, there was little evidence for an interaction between the two factors-we discuss the implications for the processing of sound and speech in the dorsolateral temporal lobes.	t	\N
24576508	We investigated the effects of linguistic experience and language familiarity on the perception of audio-visual (A-V) synchrony in fluent speech. In Experiment 1, we tested a group of monolingual Spanish- and Catalan-learning 8-month-old infants to a video clip of a person speaking Spanish. Following habituation to the audiovisually synchronous video, infants saw and heard desynchronized clips of the same video where the audio stream now preceded the video stream by 366, 500, or 666 ms. In Experiment 2, monolingual Catalan and Spanish infants were tested with a video clip of a person speaking English. Results indicated that in both experiments, infants detected a 666 and a 500 ms asynchrony. That is, their responsiveness to A-V synchrony was the same regardless of their specific linguistic experience or familiarity with the tested language. Compared to previous results from infant studies with isolated audiovisual syllables, these results show that infants are more sensitive to A-V temporal relations inherent in fluent speech. Furthermore, the absence of a language familiarity effect on the detection of A-V speech asynchrony at eight months of age is consistent with the broad perceptual tuning usually observed in infant response to linguistic input at this age.	t	\N
24581119	Music conveys emotion by manipulating musical structures, particularly musical mode- and tempo-impact. The neural correlates of musical mode and tempo perception revealed by electroencephalography (EEG) have not been adequately addressed in the literature. This study used independent component analysis (ICA) to systematically assess spatio-spectral EEG dynamics associated with the changes of musical mode and tempo. Empirical results showed that music with major mode augmented delta-band activity over the right sensorimotor cortex, suppressed theta activity over the superior parietal cortex, and moderately suppressed beta activity over the medial frontal cortex, compared to minor-mode music, whereas fast-tempo music engaged significant alpha suppression over the right sensorimotor cortex. The resultant EEG brain sources were comparable with previous studies obtained by other neuroimaging modalities, such as functional magnetic resonance imaging (fMRI) and positron emission tomography (PET). In conjunction with advanced dry and mobile EEG technology, the EEG results might facilitate the translation from laboratory-oriented research to real-life applications for music therapy, training and entertainment in naturalistic environments.	t	\N
24581190	The use of adequate assessment tools in health care is crucial for the management of care. The lack of specific tools in Portugal for assessing the performance of children who use cochlear implants motivated the translation and adaptation of the EARS (Evaluation of Auditory Responses to Speech) test battery into European Portuguese. This test battery is today one of the most commonly used by (re)habilitation teams of deaf children who use cochlear implants worldwide. The goal to be achieved with the validation of EARS was to provide (re)habilitation teams an instrument that enables: (i) monitoring the progress of individual (re)habilitation, (ii) managing a (re)habilitation program according to objective results, comparable between different (re)habilitation teams, (iii) obtaining data that can be compared with the results of international teams, and (iv) improving engagement and motivation of the family and other professionals from local teams. For the test battery translation and adaptation process, the adopted procedures were the following: (i) translation of the English version into European Portuguese by a professional translator, (ii) revision of the translation performed by an expert panel, including doctors, speech-language pathologists and audiologists, (iii) adaptation of the test stimuli by the team's speechlanguage pathologist, and (iv) further review by the expert panel. For each of the tests that belong to the EARS battery, the introduced adaptations and adjustments are presented, combining the characteristics and objectives of the original tests with the linguistic and cultural specificities of the Portuguese population. The difficulties that have been encountered during the translation and adaptation process and the adopted solutions are discussed. Comparisons are made with other versions of the EARS battery. We defend that the translation and the adaptation process followed for the EARS test battery into European Portuguese was correctly conducted, respecting the characteristics of the original instruments and adapting the test stimuli to the linguistic and cultural reality of the Portuguese population, thus meeting the goals that have been set.	t	\N
24581551	There is evidence that people with psychosis display a "jump-to-conclusions" (JTC) reasoning style, and that this bias may be specific to delusions. A "jump-to-perceptions" (JTP) cognitive bias has also been found and is typically linked to hallucinations. However, there is some evidence for an association between JTP and delusions, and its specificity to hallucinations remains unclear. It has been suggested that these biases are related and products of shared cognitive processes. This study examined the symptom specificity of JTC and JTP, and the relationship between them, in a sample of 98 individuals with delusions divided into 'hallucinators' (n=51) and 'non-hallucinators' (n=47). Biases were assessed using the beads task and visual and auditory perceptual tasks. As predicted, both groups demonstrated a JTC bias, but the 'hallucinators' showed a more pronounced JTP style in both modalities. The presence of JTC and JTP biases did not co-occur: making a decision on the beads task after two or fewer draws was not related to visual JTP, and was associated with a less marked JTP bias in the auditory perceptual task. No differences were found in JTP or JTC between participants with and without a schizophrenia diagnosis. JTP, but not JTC, was associated with the presence of hallucinations. These findings suggest that the JTC and JTP biases show specificity to delusions and hallucinations, respectively, and not to diagnosis. There was no evidence that they are the product of shared cognitive processes, further supporting their specificity.	t	\N
24597272	Listeners find it relatively difficult to recognize words that are similar-sounding to other known words. In contrast, when asked to identify spoken nonwords, listeners perform better when the nonwords are similar to many words in their language. These effects of sound similarity have been assessed in multiple ways, and both sublexical (phonotactic probability) and lexical (neighborhood) effects have been reported, leading to models that incorporate multiple stages of processing. One prediction that can be derived from these models is that there may be differences among individuals in the size of these similarity effects as a function of working memory abilities. This study investigates how item-individual characteristics of nonwords (both phonotactic probability and neighborhood density) interact with listener-individual characteristics (such as cognitive abilities and hearing sensitivity) in the perceptual identification of nonwords. A set of nonwords was used in which neighborhood density and phonotactic probability were not correlated. In our data, neighborhood density affected identification more reliably than did phonotactic probability. The first study, with young adults, showed that higher neighborhood density particularly benefits nonword identification for those with poorer attention-switching control. This suggests that it may be easier to focus attention on a novel item if it activates and receives support from more similar-sounding neighbors. A similar study on nonword identification with older adults showed increased neighborhood density effects for those with poorer hearing, suggesting that activation of long-term linguistic knowledge is particularly important to back up auditory representations that are degraded as a result of hearing loss.	t	\N
24603717	Objective identification and description of mimicked calls is a primary component of any study on avian vocal mimicry but few studies have adopted a quantitative approach. We used spectral feature representations commonly used in human speech analysis in combination with various distance metrics to distinguish between mimicked and non-mimicked calls of the greater racket-tailed drongo, Dicrurus paradiseus and cross-validated the results with human assessment of spectral similarity. We found that the automated method and human subjects performed similarly in terms of the overall number of correct matches of mimicked calls to putative model calls. However, the two methods also misclassified different subsets of calls and we achieved a maximum accuracy of ninety five per cent only when we combined the results of both the methods. This study is the first to use Mel-frequency Cepstral Coefficients and Relative Spectral Amplitude - filtered Linear Predictive Coding coefficients to quantify vocal mimicry. Our findings also suggest that in spite of several advances in automated methods of song analysis, corresponding cross-validation by humans remains essential.	t	\N
24604542	Two experiments were conducted to study effects of modality, temporal position, and their interaction on comparisons of successive stimuli. In Experiment 1, intramodal (tone-tone and line-line) and crossmodal (tone-line and line-tone) stimulus pairs, with two interstimulus intervals (ISIs), 400 and 2,000 ms, were presented. Participants indicated which stimulus was the "stronger." Time-order errors (TOEs) were assessed using the D% measure and were found in all types of pairs. Variation in TOEs across conditions was well accounted for by changes in parameters (stimulus weights, reference levels) in an extended version of Hellström's sensation weighting (SW) model. With an ISI of 2,000 ms, the first stimulus had a lower weight (less impact on the response) than did the second stimulus. More negative TOEs were found with the longer ISI in all pair types except tone-line. In Experiment 2, participants indicated which of two lines was the longer or which of two tones was the louder. An intra- or crossmodal anchor, or no anchor, was interpolated between the stimuli. Anchoring tended to reduce the weight of the first stimulus, suggesting interference with memory, and to yield negative TOEs. Intramodal anchors yielded reduced weights of both stimuli, most dramatically for tones, suggesting an additional effect of stimulus interference. Response times decreased with crossmodal anchors. For line-line pairs, strong negative TOEs were found. In both experiments, the variation in TOE across conditions was well accounted for by the SW model.	t	\N
24606282	An experiment was carried out to determine whether the level of the speech fluency disorder can be estimated by means of automatic acoustic measurements. These measures analyze, for example, the amount of silence in a recording or the number of abrupt spectral changes in a speech signal. All the measures were designed to take into account symptoms of stuttering. In the experiment, 118 audio recordings of read speech by Czech native speakers were employed. The results indicate that the human-made rating of the speech fluency disorder in read speech can be predicted on the basis of automatic measurements. The number of abrupt spectral changes in the speech segments turns out to be the most appropriate measure to describe the overall speech performance. The results also imply that there are measures with good results describing partial symptoms (especially fixed postures without audible airflow).	t	\N
24611258	Previous researchers have found that participants associate higher frequencies with locations that are higher in space and lower frequencies with lower locations, creating a phenomenological-spatial association for the frequency of auditory tones. With such an association, the frequency of an auditory tone could potentially bias movements along multiple axes. This hypothesis was tested. In four experiments, nine frequencies (250-1,250 Hz) were binaurally presented to blindfolded participants (n = 10, 12, 20, & 9; M age = 22 yr.) who indicated the perceived location of the stimuli on a measurement scale oriented in the vertical, the horizontal (Experiment 1), or depth dimension (Experiment 2). In Experiment 3, participants were asked to indicate the perceived location of the frequencies on a two-dimensional vertical board located in front of them. In Experiment 4, participants indicated the perceived location in three-dimensional space. An optoelectronic device recorded at all locations. Analyses of constant error indicated a spatial association in the vertical, horizontal, and depth dimensions when responses were restricted to only one dimension (Experiments 1 & 2). Higher frequencies were perceived to be located higher, farther to the right, and farther away from the body than lower frequencies. However, this spatial association was only exhibited in the vertical dimension when the responses were unconstrained in two dimensions (vertical and horizontal; Experiment 3) and all three dimensions (Experiment 4). Although this spatial association is a robust phenomenon, it appears that the association only biases actions when indicating perceived locations in the vertical dimension during unconstrained responses.	t	\N
24616980	To investigate the changes of electrode impedance, THR/MCL values, and dynamic range (DR) in Combi 40+ cochlear implant after implantation. A respective study was carried out collecting 20 consecutively implanted children's electrode impedances, THR/MCL values, and DR at seven time point during the first three years after implantation. Their variation and correlations were analyzed. Overall, electrode impedances were lowest during the operation, and significantly rise to the highest at the first stimulation, then followed by a gradual decrease. After three months, electrode impedance of apical and medial cochlear segment were basically stable, while that of the basal segment was gradually increased. Dynamic range (DR) of apical and medial group electrode increased early after the operation and showed a stabilization from the second year, whereas that of basal group have a downward trend since the first year. However, the electric charge of each group increased significantly after three months, and then become stable after first year. Otherwise, a stronger negative rectilinear correlation was found between impedance changes with DR than with THR/MCL level. The electrode impedances vary clue to different electrode position. Measuring the electrode impedance can effectively evaluate the working status of Combi C40+ cochlear implant. The dynamic range of the electrode was negatively correlated with the impedances, which made it possible to predict the width of the dynamic range by measuring the impedance 3 or 6 months after operation.	t	\N
24622027	The safety of implanting a titanium microactuator into the lateral wall of cat scala tympani was assessed by comparing preoperative and postoperative auditory brainstem response (ABR) thresholds for 1 to 3 months. The safety of directly stimulating cochlear perilymph with an implantable hearing system requires maintaining preoperative hearing levels. This cat study is an essential step in the development of the next generation of fully implantable hearing devices for humans. Following GLP surgical standards, a 1-mm cochleostomy was drilled into the lateral wall of the scala tympani, and a nonfunctioning titanium anchor/microactuator assembly was inserted in 8 cats. The scala media was damaged in the 1 cat. ABR thresholds with click and 4- and 8-kHz stimuli were measured preoperatively and compared with postoperative thresholds at 1, 2, and 3 months. Nonimplanted ear thresholds were also measured to establish statistical significance for threshold shifts (>28.4 dB). Two audiologists independently interpreted thresholds. Postoperatively, 7 cats implanted in the scala tympani demonstrated no significant ABR threshold shift for click stimulus; one shifted ABR thresholds to 4- and 8-kHz stimuli. The eighth cat, with surgical damage to the scala media, maintained stable click threshold but had a significant shift to 4- and 8-kHz stimuli. This cat study provides no evidence of worsening hearing thresholds after fenestration of the scala tympani and insertion of a titanium anchor/microactuator, provided there is no surgical trauma to the scala media and the implanted device is securely anchored in the cochleostomy. These 2 issues have been resolved in the development of a fully implantable hearing system for humans. The long-term hearing stability (combined with histologic studies) reaffirm that the microactuator is well tolerated by the cat cochlea.	t	\N
24622351	Pure-tone auditory thresholds in children are below the standardized hearing threshold levels described by ). The development of hearing ability in Chinese and German children at different ages is analyzed and compared with data in the literature. Extended high frequencies above 8 kHz are included due to their vulnerability to noise. Ninety-one children in China and 197 children at a typical primary school in Germany underwent audiometric tests at 17 frequencies ranging from 125 Hz to 16 kHz. To analyze effects of age the children were assessed in age groups of 6 to 9, 9 to 12, and 13 to 16 years of age. Children in China and Germany have the poorest hearing sensitivity at low frequencies (below 1 kHz) and the best sensitivity at the extended high frequencies above 8 kHz. The mean deviations (both ears) from the standard thresholds for adults 18 to 25 years of age, averaged over all frequencies, are 6.7 dB for Chinese and 7.1 dB for German children in the youngest age groups. In the Chinese age group of 9- to 12-year olds the averaged threshold is 4.5 dB and for the same German age group 3.6 dB. For the Chinese age group of 13- to 16-year olds the averaged threshold is 3.3 dB. The improvements of the older German age group compared with the younger are significant at p < 0.05 and are nearly twice as high compared with the comparable Chinese age groups. Age-dependent improvements of hearing ability are evident in children and should be considered in evaluating audiometric test results. Audiometric tests in the extended high-frequency range could be used as an early warning system for future degeneration of hearing ability.	t	\N
24623783	The earliest stages of cortical processing of speech sounds take place in the auditory cortex. Transcranial magnetic stimulation (TMS) studies have provided evidence that the human articulatory motor cortex contributes also to speech processing. For example, stimulation of the motor lip representation influences specifically discrimination of lip-articulated speech sounds. However, the timing of the neural mechanisms underlying these articulator-specific motor contributions to speech processing is unknown. Furthermore, it is unclear whether they depend on attention. Here, we used magnetoencephalography and TMS to investigate the effect of attention on specificity and timing of interactions between the auditory and motor cortex during processing of speech sounds. We found that TMS-induced disruption of the motor lip representation modulated specifically the early auditory-cortex responses to lip-articulated speech sounds when they were attended. These articulator-specific modulations were left-lateralized and remarkably early, occurring 60-100 ms after sound onset. When speech sounds were ignored, the effect of this motor disruption on auditory-cortex responses was nonspecific and bilateral, and it started later, 170 ms after sound onset. The findings indicate that articulatory motor cortex can contribute to auditory processing of speech sounds even in the absence of behavioral tasks and when the sounds are not in the focus of attention. Importantly, the findings also show that attention can selectively facilitate the interaction of the auditory cortex with specific articulator representations during speech processing.	t	\N
24627167	In research investigating Stroop or Simon effects, data are typically analyzed at the level of mean response time (RT), with results showing faster responses for compatible than for incompatible trials. However, this analysis provides only limited information as it glosses over the shape of the RT distributions and how they may differ across tasks and experimental conditions. These limitations have encouraged the analysis of RT distributions using delta plots. In the present review, we aim to bring together research on distributional properties of auditory and visual interference effects. Extending previous reviews on distributional properties of the Simon effect, we additionally review studies reporting distributional analyses of Stroop effects. We show that distributional analyses of sequential effects (i.e., taking into account congruency of the previous trial) capture important similarities and differences of interference effects across tasks (Simon, Stroop) as well as across sensory modalities, despite some challenges associated to this approach.	t	\N
24627209	In multisensory research, faster responses are commonly observed when multimodal stimuli are presented, as compared to unimodal target presentations. This so-called redundant-signals effect can be explained by several frameworks, including separate-activation and coactivation models. The redundant-signals effect has been investigated in a large number of studies; however, most of those studies have been limited to the rejection of separate-activation models. Coactivation models have been analyzed in only a few studies, primarily using simple response tasks. Here, we investigated the mechanism of multisensory integration underlying go/no-go and choice responses to redundant auditory-visual stimuli. In the present study, the mean and variance of response times, as well as the accuracy rates of go/no-go and choice responses, were used to test a coactivation model based on the linear superposition of diffusion processes (Schwarz, 1994) within two absorbing barriers. The diffusion superposition model accurately describes the means and variances of response times as well as the proportions of correct responses observed in the two tasks. Linear superposition thus seems to be a general principle in the integration of redundant information provided by different sensory channels, and is not restricted to simple responses. The results connect existing theories of multisensory integration with theories on choice behavior.	t	\N
24632322	Two experiments investigated the cognitive skills that underlie children's susceptibility to semantic and phonological false memories in the Deese/Roediger-McDermott procedure (Deese, 1959; Roediger & McDermott, 1995). In Experiment 1, performance on the Verbal Similarities subtest of the British Ability Scales (BAS) II (Elliott, Smith, & McCulloch, 1997) predicted correct and false recall of semantic lures. In Experiment 2, performance on the Yopp-Singer Test of Phonemic Segmentation (Yopp, 1988) did not predict correct recall, but inversely predicted the false recall of phonological lures. Auditory short-term memory was a negative predictor of false recall in Experiment 1, but not in Experiment 2. The findings are discussed in terms of the formation of gist and verbatim traces as proposed by fuzzy trace theory (Reyna & Brainerd, 1998) and the increasing automaticity of associations as proposed by associative activation theory (Howe, Wimmer, Gagnon, & Plumpton, 2009).	t	\N
24633379	In order to acquire language, children require speech input. The prosody of the speech input plays an important role. In most cultures adults modify their code when communicating with children. Compared to normal speech this code differs especially with regard to prosody. For this review a selective literature search in PubMed and Scopus was performed. Prosodic characteristics are a key feature of spoken language. By analysing prosodic features, children gain knowledge about underlying grammatical structures. Child-directed speech (CDS) is modified in a way that meaningful sequences are highlighted acoustically so that important information can be extracted from the continuous speech flow more easily. CDS is said to enhance the representation of linguistic signs. Taking into consideration what has previously been described in the literature regarding the perception of suprasegmentals, CDS seems to be able to support language acquisition due to the correspondence of prosodic and syntactic units. However, no findings have been reported, stating that the linguistically reduced CDS could hinder first language acquisition.	t	\N
24635583	The attentional blink (AB) refers to the decline in report accuracy of a second target (T2) when presented shortly after a first target (T1) in a rapid serial visual presentation (RSVP) of distractors. It is known that masking T1 increases the magnitude of the AB, and masking a single target (equivalent to T1) in a RSVP stream attenuates the P3 to the target in correct trials. The major purpose of the present study was to clarify how these two effects may be integrated. An intervening distractor was presented at lag 1 (T1+1), at lag 2 (T1+2), or at neither of these two lags (no distractor). T2 was always presented at lag 3, as the last item in the stream. The P3 to T1 was attenuated and the P3 to T2 delayed in the T1+1 condition compared to the two other distractor conditions. These results clearly show that masking T1 attenuates the P3 to T1 and delays the P3 to T2 in the AB. Implications for extant theories of the AB are discussed.	t	\N
24637103	Processing local elements of hierarchical patterns at a superior level and independently from an intact global influence is a well-established characteristic of autistic visual perception. However, whether this confirmed finding has an equivalent in the auditory modality is still unknown. To fill this gap, 18 autistics and 18 typical participants completed a melodic decision task where global and local level information can be congruent or incongruent. While focusing either on the global (melody) or local level (group of notes) of hierarchical auditory stimuli, participants have to decide whether the focused level is rising or falling. Autistics showed intact global processing, a superior performance when processing local elements and a reduced global-to-local interference compared to typical participants. These results are the first to demonstrate that autistic processing of auditory hierarchical stimuli closely parallels processing of visual hierarchical stimuli. When analyzing complex auditory information, autistic participants present a local bias and a more autonomous local processing, but not to the detriment of global processing.	t	\N
24639401	Blind people rely more on vocal cues when they recognize a person's identity than sighted people. Indeed, a number of studies have reported better voice recognition skills in blind than in sighted adults. The present functional magnetic resonance imaging study investigated changes in the functional organization of neural systems involved in voice identity processing following congenital blindness. A group of congenitally blind individuals and matched sighted control participants were tested in a priming paradigm, in which two voice stimuli (S1, S2) were subsequently presented. The prime (S1) and the target (S2) were either from the same speaker (person-congruent voices) or from two different speakers (person-incongruent voices). Participants had to classify the S2 as either a old or a young person. Person-incongruent voices (S2) compared with person-congruent voices elicited an increased activation in the right anterior fusiform gyrus in congenitally blind individuals but not in matched sighted control participants. In contrast, only matched sighted controls showed a higher activation in response to person-incongruent compared with person-congruent voices (S2) in the right posterior superior temporal sulcus. These results provide evidence for crossmodal plastic changes of the person identification system in the brain after visual deprivation.	t	\N
24642285	We used magnetoencephalography (MEG) to examine brain activity related to the maintenance of non-verbal pitch information in auditory short-term memory (ASTM). We focused on brain activity that increased with the number of items effectively held in memory by the participants during the retention interval of an auditory memory task. We used very simple acoustic materials (i.e., pure tones that varied in pitch) that minimized activation from non-ASTM related systems. MEG revealed neural activity in frontal, temporal, and parietal cortices that increased with a greater number of items effectively held in memory by the participants during the maintenance of pitch representations in ASTM. The present results reinforce the functional role of frontal and temporal cortices in the retention of pitch information in ASTM. This is the first MEG study to provide both fine spatial localization and temporal resolution on the neural mechanisms of non-verbal ASTM for pitch in relation to individual differences in the capacity of ASTM. This research contributes to a comprehensive understanding of the mechanisms mediating the representation and maintenance of basic non-verbal auditory features in the human brain.	t	\N
24655365	Previous studies have shown that in semi-spontaneous speech, individuals with Broca's and anomic aphasia produce relatively many direct speech constructions. It has been claimed that in 'healthy' communication direct speech constructions contribute to the liveliness, and indirectly to the comprehensibility, of speech. To examine the effects of the occurrence of direct speech constructions on the perceived liveliness and speech comprehensibility of narratives produced by individuals with and without aphasia. Thirty-seven naive listeners rated 30 speech fragments with and without direct speech from ten speakers with and ten speakers without aphasia. The fragments originated from semi-structured interviews. The raters scored the perceived liveliness and the perceived comprehensibility of these fragments. For both groups of speakers, fragments containing direct speech constructions received higher scores for liveliness than fragments without direct speech constructions. However, no effect of direct speech was found on perceived comprehensibility. This is the first research to demonstrate that communication is perceived as more lively when it contains direct speech than when it does not, but yet is not more comprehensible. Individuals with Broca's and anomic aphasia are known to produce regularly direct speech constructions in elicited narratives. Given that liveliness is known to increase listeners' involvement and to help listeners stay focused, we suggest that this relative increase in direct speech by aphasic speakers may reflect a strategy to increase not only the liveliness of their discourse, but also listener focus and involvement.	t	\N
24658600	Extended stabilization of gaze leads to disappearance of dim visual targets presented peripherally. This phenomenon, known as Troxler fading, is thought to result from neuronal adaptation. Intense targets also disappear intermittently when surrounded by a moving pattern (the "mask"), a phenomenon known as motion-induced blindness (MIB). The similar phenomenology and dynamics of these disappearances may suggest that also MIB is, likewise, solely due to adaptation, which may be amplified by the presence of the mask. Here we directly compared the dependence of both phenomena on target contrast. Observers reported the disappearance and reappearance of a target of varying intensity (contrast levels: 8%-80%). MIB was induced by adding a mask that moved at one of various different speeds. The results revealed a lawful effect of contrast in both MIB and Troxler fading, but with opposite trends. Increasing target contrast increased (doubled) the rate of disappearance events for MIB, but decreased the disappearance rate to half in Troxler fading. The target mean invisible period decreased equally strongly with target contrast in MIB and in Troxler fading. The results suggest that both MIB and Troxler are equally affected by contrast adaptation, but that the rate of MIB is governed by an additional mechanism, possibly involving antagonistic processes between neuronal populations processing target and mask. Our results link MIB to other bi-stable visual phenomena that involve neuronal competition (such as binocular rivalry), which exhibit an analogous dependency on the strength of the competing stimulus components.	t	\N
24658601	The goal of this study was to create and validate a new set of sentence lists that could be used to evaluate the speech-perception abilities of listeners with hearing loss in cases where adult materials are inappropriate due to difficulty level or content. The authors aimed to generate a large number of sentence lists with an equivalent level of difficulty for the evaluation of performance over time and across conditions. The original Pediatric AzBio sentence corpus included 450 sentences recorded from one female talker. All sentences included in the corpus were successfully repeated by kindergarten and first-grade students with normal hearing. The mean intelligibility of each sentence was estimated by processing each sentence through a cochlear implant simulation and calculating the mean percent correct score achieved by 15 normal-hearing listeners. After sorting sentences by mean percent correct scores, 320 sentences were assigned to 16 lists of equivalent difficulty. List equivalency was then validated by presenting all sentence lists, in a novel random order, to adults and children with hearing loss. A final-validation stage examined single-list comparisons from adult and pediatric listeners tested in research or clinical settings. The results of the simulation study allowed for the creation of 16 lists of 20 sentences. The average intelligibility of each list ranged from 78.4 to 78.7%. List equivalency was then validated, when the results of 16 adult cochlear implant users and 9 pediatric hearing aid and cochlear implant users revealed no significant differences across lists. The binomial distribution model was used to account for the inherent variability observed in the lists. This model was also used to generate 95% confidence intervals for one and two list comparisons. A retrospective analysis of 361 instances from 78 adult cochlear implant users and 48 instances from 36 pediatric cochlear implant users revealed that the 95% confidence intervals derived from the model captured 94% of all responses (385 of 409). The cochlear implant simulation was shown to be an effective method for estimating the intelligibility of individual sentences for use in the evaluation of cochlear implant users. Furthermore, the method used for constructing equivalent sentence lists and estimating the inherent variability of the materials has also been validated. Thus, the AzBio Pediatric Sentence Lists are equivalent and appropriate for the assessment of speech-understanding abilities of children with hearing loss as well as adults for whom performance on AzBio sentences is near the floor.	t	\N
24671776	Learning about objects often requires making arbitrary associations among multisensory properties, such as the taste and appearance of a food or the face and voice of a person. However, the multisensory properties of individual objects usually are statistically constrained, such that some properties are more likely to co-occur than others, on the basis of their category. For example, male faces are more likely to co-occur with characteristically male voices than with female voices. Here, we report evidence that these natural multisensory statistics play a critical role in the learning of novel, arbitrary associative pairs. In Experiment 1, we found that learning of pairs consisting of human voices and gender-congruent faces was superior to learning of pairs consisting of human voices and gender-incongruent faces or of pairs consisting of human voices and pictures of inanimate objects (plants and rocks). In Experiment 2, we found that this "categorical congruency" advantage extended to nonhuman stimuli, as well-namely, to pairs of class-congruent animal pictures and vocalizations (e.g., dogs and barks) versus class-incongruent pairs (e.g., dogs and bird chirps). These findings suggest that associating multisensory properties that are statistically consistent with the various objects that we encounter in our daily lives is a privileged form of learning.	t	\N
24686722	PURPOSE Several acoustic cues specify any single phonemic contrast. Nonetheless, adult, native speakers of a language share weighting strategies, showing preferential attention to some properties over others. Cochlear implant (CI) signal processing disrupts the salience of some cues: In general, amplitude structure remains readily available, but spectral structure less so. This study asked how well speech recognition is supported if CI users shift attention to salient cues not weighted strongly by native speakers. METHOD Twenty adults with CIs participated. The /bɑ/-/wɑ/ contrast was used because spectral and amplitude structure varies in correlated fashion for this contrast. Adults with normal hearing weight the spectral cue strongly but the amplitude cue negligibly. Three measurements were made: labeling decisions, spectral and amplitude discrimination, and word recognition. RESULTS Outcomes varied across listeners: Some weighted the spectral cue strongly, some weighted the amplitude cue, and some weighted neither. Spectral discrimination predicted spectral weighting. Spectral weighting explained the most variance in word recognition. Age of onset of hearing loss predicted spectral weighting but not unique variance in word recognition. CONCLUSION The weighting strategies of listeners with normal hearing likely support speech recognition best, so efforts in implant design, fitting, and training should focus on developing those strategies.	t	\N
24687127	The purpose of this research was to compare the perception of Japanese vowel length contrasts by 4 groups of listeners who differed in their familiarity with length contrasts in their first language (L1; i.e., American English, Italian, Japanese, and Thai). Of the 3 nonnative groups, native Thai listeners were expected to outperform American English and Italian listeners, because vowel length is contrastive in their L1. Native Italian listeners were expected to demonstrate a higher level of accuracy for length contrasts than American English listeners, because the former are familiar with consonant (but not vowel) length contrasts (i.e., singleton vs. geminate) in their L1. A 2-alternative forced-choice AXB discrimination test that included 125 trials was administered to all the participants, and the listeners' discrimination accuracy (d') was reported. As expected, Japanese listeners were more accurate than all 3 nonnative groups in their discrimination of Japanese vowel length contrasts. The 3 nonnative groups did not differ from one another in their discrimination accuracy despite varying experience with length contrasts in their L1. Only Thai listeners were more accurate in their length discrimination when the target vowel was long than when it was short. Being familiar with vowel length contrasts in L1 may affect the listeners' cross-language perception, but it does not guarantee that their L1 experience automatically results in efficient processing of length contrasts in unfamiliar languages. The extent of success may be related to how length contrasts are phonetically implemented in listeners' L1.	t	\N
24690415	When musical notes are combined to make a chord, the closeness of fit of the combined spectrum to a single harmonic series (the 'harmonicity' of the chord) predicts the perceived consonance (how pleasant and stable the chord sounds; McDermott, Lehr, & Oxenham, 2010). The distinction between consonance and dissonance is central to Western musical form. Harmonicity is represented in the temporal firing patterns of populations of brainstem neurons. The current study investigates the role of brainstem temporal coding of harmonicity in the perception of consonance. Individual preference for consonant over dissonant chords was measured using a rating scale for pairs of simultaneous notes. In order to investigate the effects of cochlear interactions, notes were presented in two ways: both notes to both ears or each note to different ears. The electrophysiological frequency following response (FFR), reflecting sustained neural activity in the brainstem synchronised to the stimulus, was also measured. When both notes were presented to both ears the perceptual distinction between consonant and dissonant chords was stronger than when the notes were presented to different ears. In the condition in which both notes were presented to the both ears additional low-frequency components, corresponding to difference tones resulting from nonlinear cochlear processing, were observable in the FFR effectively enhancing the neural harmonicity of consonant chords but not dissonant chords. Suppressing the cochlear envelope component of the FFR also suppressed the additional frequency components. This suggests that, in the case of consonant chords, difference tones generated by interactions between notes in the cochlea enhance the perception of consonance. Furthermore, individuals with a greater distinction between consonant and dissonant chords in the FFR to individual harmonics had a stronger preference for consonant over dissonant chords. Overall, the results provide compelling evidence for the role of neural temporal coding in the perception of consonance, and suggest that the representation of harmonicity in phase locked neural firing drives the perception of consonance.	t	\N
24701988	Studies of facial affect recognition by people with traumatic brain injury (TBI) have shown this to be a significant problem. Vocal affect recognition also appears to be challenging for this population, but little is known about the degree to which one modality is impaired compared to the other. This study compared facial and vocal affect recognition of high and low intensity emotion expressions in people with moderate-to-severe TBI. The Diagnostic Analysis of Nonverbal Accuracy-2 (Adult Faces; Voices) was administered to 203 participants with TBI. Adults with TBI identified vocal emotion expressions with greater accuracy than facial emotion expressions. Facial affect recognition impairment was identified in 34% of participants, 22% were classified as having vocal affect recognition impairment and 15% showed impairment in both modalities. Participants were significantly less accurate at identifying low vs high intensity emotion expressions in both modalities. Happy facial expressions were significantly better identified than all other emotions. Errors were distributed across the emotion categories for vocal expressions. The degree of facial affect impairment was significantly greater than vocal affect impairment in this sample of people with moderate-to-severe TBI. Low intensity emotion expressions were particularly problematic and an advantage for positively valenced facial emotion expressions was indicated.	t	\N
24702433	In this paper, the authors analyze the auditory rehabilitation after cochlear implantation in adults with hearing impairment after head trauma, comparing their performance with that of other cochlear implant (CI) adult users who have post-lingual hearing impairment with other etiologies. The participants were divided into two groups: group 1 (N = 14) composed of CI adult users who have acquired severe to profound hearing loss after head trauma; group 2 (N = 231) composed of CI adult users who have severe to profound hearing loss from other etiologies. Performance was assessed using the following tests: tonal audiometry, speech audiometry, consonantal phonemes identification test, 100 words test, 100 words through the telephone test, monosyllables test, numbers test, sentences test, and sentences through the telephone test. Average results from group 1 were lower when compared with those of group 2 in all the tests used. No statistically significant difference was found for most tests. Statistically significant difference was found for consonantal phonemes identification test, 100 words through the telephone test, monosyllables test (when analyzed regarding the phonemes correctly repeated), and sentences through the telephone test. The performance of the group of CI adult users who have acquired hearing impairment after head trauma was globally lower than that observed on the group of hearing impairment with other etiologies. However, the difference was not statistically significant for most tests. Despite this difference in performance, the results from the group of CI adult users who have acquired hearing impairment after head trauma show the effectiveness of auditory rehabilitation through cochlear implantation in these situations.	t	\N
24703829	Exploding head syndrome is characterized by the perception of abrupt, loud noises when going to sleep or waking up. They are usually painless, but associated with fear and distress. In spite of the fact that its characteristic symptomatology was first described approximately 150 y ago, exploding head syndrome has received relatively little empirical and clinical attention. Therefore, a comprehensive review of the scientific literature using Medline, PsycINFO, Google Scholar, and PubMed was undertaken. After first discussing the history, prevalence, and associated features, the available polysomnography data and five main etiological theories for exploding head syndrome are summarized. None of these theories has yet reached dominance in the field. Next, the various methods used to assess and treat exploding head syndrome are discussed, as well as the limited outcome data. Finally, recommendations for future measure construction, treatment options, and differential diagnosis are provided.	t	\N
24707789	Syntactic priming occurs when structural information from one sentence influences processing of a subsequently encountered sentence (Bock, 1986; Ledoux et al., 2007). This article reports 2 eye-tracking experiments investigating the effects of a prime sentence on the processing of a target sentence that shared aspects of syntactic form. The experiments were designed to determine the degree to which lexical overlap between prime and target sentences produced larger effects, comparable to the widely observed "lexical boost" in production experiments (Pickering & Branigan, 1998; Pickering & Ferreira, 2008). The current experiments showed that priming effects during online comprehension were in fact larger when a verb was repeated across the prime and target sentences (see also Tooley et al., 2009). The finding of larger priming effects with lexical repetition supports accounts under which syntactic form representations are connected to individual lexical items (e.g., Tomasello, 2003; Vosse & Kempen, 2000, 2009).	t	\N
24708425	Visual illusions can reveal unconscious representations and processes at work in perception. Here we report a robust illusion that involves the misperception of moving, partially occluded objects. When a dynamically occluded object is seen through 2 misaligned apertures, the object appears misaligned in the direction of the apertures, creating the Aperture Capture Illusion. Specifically, when part of a dynamically occluded object disappears behind an occluding surface and then another part of the object comes into view immediately afterward, the 2 parts appear misaligned in the direction of the offset of the apertures through which they were seen. This illusion can be nulled: Separating the 2 object parts to increase the time interval between their appearance produced the percept of alignment. The ability to null the illusion in this manner demonstrates that dynamically occluded regions of moving objects continue to persist in perceptual awareness but, we argue, are perceived to move at a slower velocity than visible regions. We report 7 experiments establishing the existence of the illusion and ruling out several classes of explanation for it. We interpret the illusion and the ability to nullify it within the context of Palmer, Kellman, and Shipley's (2006) theory of spatiotemporal object formation.	t	\N
24719237	A briefly flashed target stimulus can become "invisible" when immediately followed by a mask-a phenomenon known as backward masking, which constitutes a major tool in the cognitive sciences. One form of backward masking is termed metacontrast masking. It is generally assumed that in metacontrast masking, the mask suppresses activity on which the conscious perception of the target relies. This assumption biases conclusions when masking is used as a tool-for example, to study the independence between perceptual detection and motor reaction. This is because other models can account for reduced perceptual performance without requiring suppression mechanisms. In this study, we used signal detection theory to test the suppression model against an alternative view of metacontrast masking, referred to as the summation model. This model claims that target- and mask-related activations fuse and that the difficulty in detecting the target results from the difficulty to discriminate this fused response from the response produced by the mask alone. Our data support this alternative view. This study is not a thorough investigation of metacontrast masking. Instead, we wanted to point out that when a different model is used to account for the reduced perceptual performance in metacontrast masking, there is no need to postulate a dissociation between perceptual and motor responses to account for the data. Metacontrast masking, as implemented in the Fehrer-Raab situation, therefore is not a valid method to assess perceptual-motor dissociations.	t	\N
24728649	Recent findings have suggested that transient attention can be triggered at two locations simultaneously. However, it is unclear whether doing so reduces the effect of attention at each attended location. In two experiments, we explored the consequences of dividing attention. In the first experiment, we compared the effects of one or two cues against an uncued baseline to determine the consequences of dividing attention in a paradigm with four rapid serial visual presentation (RSVP) streams. The results indicated that two simultaneous cues increase the accuracy of reporting two targets by almost the same amount as a single cue increases the report of a single target. These results suggest that when attention is divided between multiple locations, the attentional benefit at each location is not reduced in proportion to the total number of cues. A consequent prediction of this finding is that the identification of two RSVP targets should be better when they are presented simultaneously rather than sequentially. In a second experiment, we verified this prediction by finding evidence of lag-0 sparing: Two targets presented simultaneously in different locations were reported more easily than two targets separated by 100 ms. These findings argue against a biased-competition theory of attention. We suggest that visual attention, as triggered by a cue or target, is better described by a convergent gradient-field attention model.	t	\N
24728874	In autism spectrum disorders (ASD), social motivation theories suggest that the core social communication problems seen in children with ASD arise from diminished responsiveness to social reward. Although clinical and experimental data support these theories, the extent to which the reward deficit in ASD is unique for social rewards remains unclear. With the present investigation, we aimed to provide insight into the degree to which sociality as well as familiarity of reward incentives impact motivated goal-directed behavior in children with ASD. To do so, we directly compared the influence of familiar versus unfamiliar social reward relative to nonsocial, monetary reward in children with ASD relative to age- and IQ-matched typically developing controls (TDC) using a visual and auditory incentive go/nogo task with reward contingencies for successful response inhibitions. We found that children with ASD responded stronger to visual familiar and unfamiliar social reward as well as to nonsocial, monetary reward than TDC. While the present data are at odds with predictions made by social motivation theories, individual variations beyond clinical diagnosis, such as reward exposure across various social settings, help explain the pattern of results. The findings of this study stress the necessity for additional research on intra-individual as well as environmental factors that contribute to social reward responsiveness in individuals with ASD versus other neuropsychiatric disorders such as ADHD or conduct disorder.	t	\N
24732070	Human subjects performed in several behavioral conditions requiring, or not requiring, selective attention to visual stimuli. Specifically, the attentional task was to recognize strings of digits that had been presented visually. A nonlinear version of the stimulus-frequency otoacoustic emission (SFOAE), called the nSFOAE, was collected during the visual presentation of the digits. The segment of the physiological response discussed here occurred during brief silent periods immediately following the SFOAE-evoking stimuli. For all subjects tested, the physiological-noise magnitudes were substantially weaker (less noisy) during the tasks requiring the most visual attention. Effect sizes for the differences were >2.0. Our interpretation is that cortico-olivo influences adjusted the magnitude of efferent activation during the SFOAE-evoking stimulation depending upon the attention task in effect, and then that magnitude of efferent activation persisted throughout the silent period where it also modulated the physiological noise present. Because the results were highly similar to those obtained when the behavioral conditions involved auditory attention, similar mechanisms appear to operate both across modalities and within modalities. Supplementary measurements revealed that the efferent activation was spectrally global, as it was for auditory attention.	t	\N
24746946	Action representations associated with object use may be incidentally activated during visual object processing, and the time course of such activations may be influenced by lexical-semantic context (e.g., Lee, Middleton, Mirman, Kalénine, & Buxbaum (2012). Journal of Experimental Psychology: Human Perception and Performance, 39(1), 257-270). In this study we used the "visual world" eye-tracking paradigm to examine whether a deficit in producing skilled object-use actions (apraxia) is associated with abnormalities in incidental activation of action information, and assessed the neuroanatomical substrates of any such deficits. Twenty left hemisphere stroke patients, ten of whom were apraxic, performed a task requiring identification of a named object in a visual display containing manipulation-related and unrelated distractor objects. Manipulation relationships among objects were not relevant to the identification task. Objects were cued with neutral ("S/he saw the…."), or action-relevant ("S/he used the….") sentences. Non-apraxic participants looked at use-related non-target objects significantly more than at unrelated non-target objects when cued both by neutral and action-relevant sentences, indicating that action information is incidentally activated. In contrast, apraxic participants showed delayed activation of manipulation-based action information during object identification when cued by neutral sentences. The magnitude of delayed activation in the neutral sentence condition was reliably predicted by lower scores on a test of gesture production to viewed objects, as well as by lesion loci in the inferior parietal and posterior temporal lobes. However, when cued by a sentence containing an action verb, apraxic participants showed fixation patterns that were statistically indistinguishable from non-apraxic controls. In support of grounded theories of cognition, these results suggest that apraxia and temporal-parietal lesions may be associated with abnormalities in incidental activation of action information from objects. Further, they suggest that the previously-observed facilitative role of action verbs in the retrieval of object-related action information extends to participants with apraxia.	t	\N
24746955	Selective attention to phonology, i.e., the ability to attend to sub-syllabic units within spoken words, is a critical precursor to literacy acquisition. Recent functional magnetic resonance imaging evidence has demonstrated that a left-lateralized network of frontal, temporal, and posterior language regions, including the visual word form area, supports this skill. The current event-related potential (ERP) study investigated the temporal dynamics of selective attention to phonology during spoken word perception. We tested the hypothesis that selective attention to phonology dynamically modulates stimulus encoding by recruiting left-lateralized processes specifically while the information critical for performance is unfolding. Selective attention to phonology was captured by manipulating listening goals: skilled adult readers attended to either rhyme or melody within auditory stimulus pairs. Each pair superimposed rhyming and melodic information ensuring identical sensory stimulation. Selective attention to phonology produced distinct early and late topographic ERP effects during stimulus encoding. Data-driven source localization analyses revealed that selective attention to phonology led to significantly greater recruitment of left-lateralized posterior and extensive temporal regions, which was notably concurrent with the rhyme-relevant information within the word. Furthermore, selective attention effects were specific to auditory stimulus encoding and not observed in response to cues, arguing against the notion that they reflect sustained task setting. Collectively, these results demonstrate that selective attention to phonology dynamically engages a left-lateralized network during the critical time-period of perception for achieving phonological analysis goals. These findings suggest a key role for selective attention in on-line phonological computations. Furthermore, these findings motivate future research on the role that neural mechanisms of attention may play in phonological awareness impairments thought to underlie developmental reading disabilities.	t	\N
24751993	Voice pitch is an important information-bearing component of language that is subject to experience dependent plasticity at both early cortical and subcortical stages of processing. We have already demonstrated that pitch onset component (Na) of the cortical pitch response (CPR) is sensitive to flat pitch and its salience … CPR responses from Chinese listeners were elicited by three citation forms varying in pitch acceleration and duration. Results showed that the pitch onset component (Na) was invariant to changes in acceleration. In contrast, Na–Pb and Pb–Nb showed a systematic decrease in the interpeak latency and decrease in amplitude with increase in pitch acceleration that followed the time course of pitch change across the three stimuli. A strong correlation with pitch acceleration was observed for these two components only – a putative index of pitch-relevant neural activity associated with the more rapidly-changing portions of the pitch contour. Pc–Nc marks unambiguously the stimulus offset … and their functional roles as related to sensory and cognitive properties of the stimulus. [Corrected]	t	\N
24754984	To retrospectively analyze the auditory and speech development of young children with bilateral congenital aural atresia after using bone-anchored hearing aid (BAHA) softband. From August 2010 to January 2013, a total of 12 patients aged at a range of 3 months to 6 years with bilateral aural atresia using BAHA softband were divided into under 4-year-old group (n = 7, an average age of 14 months) and over 4-year-old group (n = 5, an average age of 77 months). The air and bone auditory thresholds of the under 4-year-old group were assessed by auditory brainstem response (ABR). The Infant-Toddler Meaningful Auditory Integration Scale (IT-MAIS) was used to evaluate early auditory development.Soundfield pure tone audiometry (PTA) was applied in the over 4-year-old group. The Standard Chinese lexical neighborhood test (LNT) was conducted to evaluate the speech discrimination scores unaided and aided for 6, 12 and 24 months. For the under 4-year-old group, the unaided air and bone conduction hearing thresholds were 70-90(73 ± 12) and 15-25(21 ± 4) dB nHL respectively.IT-MAIS score improved significantly after using softband for 6 months in comparisons with unaided situations. The average hearing threshold of patients in the over 4-year-old group was (63 ± 6) dB HL unaided versus (31 ± 4)dB HL aided. The average decline in pure-tone threshold was (32 ± 3)dB HL with BAHA softbands and LNT demonstrated improvements in the speech discrimination score. The differences of paired comparison were significant (all P < 0.05). The hearing ability of children with bilateral congenital aural atresia improves significantly after using BAHA softband. And its application at a young age can guarantee a relatively normal hearing development and avoid oral communication impairment.	t	\N
24764068	The effects on multisensory integration have rarely been examined in amblyopia. The McGurk effect is a well-established audiovisual illusion that is manifested when an auditory phoneme is presented concurrently with an incongruent visual phoneme. Visually healthy viewers will hear a phoneme that does not match the actual auditory stimulus, having been perceptually influenced by the visual phoneme. This study examines audiovisual integration in adults with amblyopia. Twenty-two subjects with amblyopia and 25 visually healthy controls participated. Participants viewed videos of combinations of visual and auditory phonemes, and were asked to report what they heard. Some videos had congruent video and audio (control), whereas others had incongruent video and audio (McGurk). The McGurk effect is strongest when the visual phoneme dominates over the audio phoneme, resulting in low auditory accuracy on the task. Adults with amblyopia demonstrated a weaker McGurk effect than visually healthy controls (P = 0.01). The difference was greatest when viewing monocularly with the amblyopic eye, and it was also evident when viewing binocularly or monocularly with the fellow eye. No correlations were found between the strength of the McGurk effect and either visual acuity or stereoacuity in subjects with amblyopia. Subjects with amblyopia and controls showed a similar response pattern to different speakers and syllables, and subjects with amblyopia consistently demonstrated a weaker effect than controls. Abnormal visual experience early in life can have negative consequences for audiovisual integration that persists into adulthood in people with amblyopia.	t	\N
24770403	To clarify the responsible gene for a family associated with hearing loss but having no well-known mitochondrial mutations. A Japanese family showing late-onset, progressive, and ski-sloping sensorineural hearing loss. Whole mitochondrial genome sequencing identified the 1673T>C mutation, a novel mitochondrial DNA mutation in the 16S ribosomal RNA gene. Whole mitochondrial genome sequencing is a powerful tool to identify the responsible gene for plausible mitochondrially inherited families. This is additional evidence that mitochondrial gene mutations may cause late-onset, progressive, and ski-sloping sensorineural hearing loss.	t	\N
24778251	Although it is well accepted that the speech motor system (SMS) is activated during speech perception, the functional role of this activation remains unclear. Here we test the hypothesis that the redundant motor activation contributes to categorical speech perception under adverse listening conditions. In this functional magnetic resonance imaging study, participants identified one of four phoneme tokens (/ba/, /ma/, /da/, or /ta/) under one of six signal-to-noise ratio (SNR) levels (-12, -9, -6, -2, 8 dB, and no noise). Univariate and multivariate pattern analyses were used to determine the role of the SMS during perception of noise-impoverished phonemes. Results revealed a negative correlation between neural activity and perceptual accuracy in the left ventral premotor cortex and Broca's area. More importantly, multivoxel patterns of activity in the left ventral premotor cortex and Broca's area exhibited effective phoneme categorization when SNR ≥ -6 dB. This is in sharp contrast with phoneme discriminability in bilateral auditory cortices and sensorimotor interface areas (e.g., left posterior superior temporal gyrus), which was reliable only when the noise was extremely weak (SNR > 8 dB). Our findings provide strong neuroimaging evidence for a greater robustness of the SMS than auditory regions for categorical speech perception in noise. Under adverse listening conditions, better discriminative activity in the SMS may compensate for loss of specificity in the auditory system via sensorimotor integration.	t	\N
24778384	The activation of listener's motor system during speech processing was first demonstrated by the enhancement of electromyographic tongue potentials as evoked by single-pulse transcranial magnetic stimulation (TMS) over tongue motor cortex. This technique is, however, technically challenging and enables only a rather coarse measurement of this motor mirroring. Here, we applied TMS to listeners' tongue motor area in association with ultrasound tissue Doppler imaging to describe fine-grained tongue kinematic synergies evoked by passive listening to speech. Subjects listened to syllables requiring different patterns of dorso-ventral and antero-posterior movements (/ki/, /ko/, /ti/, /to/). Results show that passive listening to speech sounds evokes a pattern of motor synergies mirroring those occurring during speech production. Moreover, mirror motor synergies were more evident in those subjects showing good performances in discriminating speech in noise demonstrating a role of the speech-related mirror system in feed-forward processing the speaker's ongoing motor plan.	t	\N
24779648	This study used event-related potentials (ERPs) to investigate discourse-coherence processing. Because there are scant data on ERP indices of discourse coherence in typical adults, it is important to study a non-clinical population before examining clinical populations. Twelve adults listened to a story with sentences in a coherent versus incoherent order. Sequences of nonsense syllables served as a control. ERPs in the 200-400 ms time window, reflecting phonological and lexical processing, and in the 600-900 ms time window, reflecting later discourse processing for integration, were investigated. Results revealed a right anterior and posterior positivity that was greater for coherent than for incoherent discourse during the 600-900 ms time window. These findings point to an index of discourse coherence and further suggest that ERPs can be used as a clinical tool to study discourse-processing disorders in populations with brain damage, such as aphasia and traumatic brain injury.	t	\N
24793771	Auditory filter theory dictates a physiological compromise between frequency and temporal resolution of cochlear signal processing. We examined neurophysiological correlates of these spectrotemporal tradeoffs in the human auditory system using auditory evoked brain potentials and psychophysical responses. Temporal resolution was assessed using scalp-recorded auditory brainstem responses (ABRs) elicited by paired clicks. The inter-click interval (ICI) between successive pulses was parameterized from 0.7 to 25 ms to map ABR amplitude recovery as a function of stimulus spacing. Behavioral frequency difference limens (FDLs) and auditory filter selectivity (Q10 of psychophysical tuning curves) were obtained to assess relations between behavioral spectral acuity and electrophysiological estimates of temporal resolvability. Neural responses increased monotonically in amplitude with increasing ICI, ranging from total suppression (0.7 ms) to full recovery (25 ms) with a temporal resolution of ∼3-4 ms. ABR temporal thresholds were correlated with behavioral Q10 (frequency selectivity) but not FDLs (frequency discrimination); no correspondence was observed between Q10 and FDLs. Results suggest that finer frequency selectivity, but not discrimination, is associated with poorer temporal resolution. The inverse relation between ABR recovery and perceptual frequency tuning demonstrates a time-frequency tradeoff between the temporal and spectral resolving power of the human auditory system.	t	\N
24795958	This study investigates how speed of motion is processed in language. In three eye-tracking experiments, participants were presented with visual scenes and spoken sentences describing fast or slow events (e.g., The lion ambled/dashed to the balloon). Results showed that looking time to relevant objects in the visual scene was affected by the speed of verb of the sentence, speaking rate, and configuration of a supporting visual scene. The results provide novel evidence for the mental simulation of speed in language and show that internal dynamic simulations can be played out via eye movements toward a static visual scene.	t	\N
24801737	Two experiments used the progressive demasking (PD) task to examine age differences in the ability to inhibit higher frequency competitors during the process of identifying a visually degraded word. In Experiment 1, older adults exhibited a larger inhibitory neighborhood frequency effect (i.e., slower identification of words with many higher frequency competitors) than younger adults, but additional analyses indicated that this difference could be explained by general slowing rather than a deficit in inhibitory abilities. In Experiment 2, a primed version of the PD task was used to promote hypothesis testing by semantically priming the target word (e.g., cry-weep) or a higher frequency competitor of the target (e.g, day-weep) prior to the onset of the demasking sequence. Although older adults were more likely to make identification errors consistent with an inhibitory deficit (e.g., identifying weep as week), these errors were infrequent overall and there was no corresponding evidence of a larger interference effect in the older adults' identification latencies. Taken together, performance in these two tasks provides little evidence of reduced inhibitory functioning in older adults. The implications for the inhibitory deficit hypothesis of cognitive aging and directions for future are discussed.	t	\N
24807850	The Finnish Matrix Test is the first sentence test in noise for the Finnish language. It was developed according to the HearCom standards and provides reliable speech intelligibility measurements with highly comparable results with the other international matrix tests. The aim of the study was to develop an accurate speech intelligibility test in noise for the Finnish language that is comparable across different languages. We chose a matrix sentence test, which comprises a base matrix of 10 names, verbs, numerals, adjectives and nouns. Test lists were formed from this matrix quasi randomly, providing test sentences of the same syntactical structure. The speech material corresponds to everyday spoken language and the phoneme distribution is representative of the Finnish language. The test was optimized by determining the speech recognition thresholds of the individual words and subsequently by applying level corrections of up to ±3 dB. Evaluation measurements were performed to check the equivalence of the different test lists with respect to speech intelligibility and to provide reference values for further clinical applications. After training, the mean speech recognition threshold (SRT) and the slope of the final test lists were -10.1 ± 0.1 dB signal-to-noise-ratio (SNR)and 16.7 ± 1.2%/dB, respectively (measurements at constant level; inter-list variability). The mean SRT and the slope of the test subjects were -10.1 ± 0.7 dB SNR and 17.5 ± 2.2%/dB (measurements at constant level; inter-subject variability). The expected SRT range for normal-hearing young adults for adaptive measurements is -9.7 ± 0.7 dB SNR.	t	\N
24814580	The goal of the present research was to provide direct evidence for the cross-language interaction of phonologies at the sub-lexical level by using the masked onset priming paradigm. More specifically, we investigated whether there is a cross-language masked onset priming effect (MOPE) with L2 (English) primes and L1 (Russian) targets and whether it is modulated by the orthographic similarity of primes and targets. Primes and targets had onsets that overlapped either only phonologically, only orthographically, both phonologically and orthographically, or did not have any overlap. Phonological overlap, but not orthographic overlap, between primes and targets led to faster naming latencies. In contrast, the ERP data provided evidence for effects of both phonological and orthographic overlap. Finally, the time-course of phonological and orthographic processing for our bilinguals mirrored the time-course previously reported for monolinguals in the ERP data. These results provide evidence for shared representations at the sub-lexical level for a bilingual's two languages.	t	\N
24815281	Reduced spectral resolution negatively impacts speech perception, particularly perception of vowels and consonant place. This study assessed impact of number of spectral channels on vowel discrimination by 6-month-old infants with normal hearing by comparing three listening conditions: Unprocessed speech, 32 channels, and 16 channels. Auditory stimuli (/ti/ and /ta/) were spectrally reduced using a noiseband vocoder and presented to infants with normal hearing via visual habituation. Results supported a significant effect of number of channels on vowel discrimination by 6-month-old infants. No differences emerged between unprocessed and 32-channel conditions in which infants looked longer during novel stimulus trials (i.e., discrimination). The 16-channel condition yielded a significantly different pattern: Infants demonstrated no significant difference in looking time to familiar vs novel stimulus trials, suggesting infants cannot discriminate /ti/ and /ta/ with only 16 channels. Results support effects of spectral resolution on vowel discrimination. Relative to published reports, young infants need more spectral detail than older children and adults to perceive spectrally degraded speech. Results have implications for development of perception by infants with hearing loss who receive auditory prostheses.	t	\N
24828221	There are many clinically available tests for the assessment of auditory processing skills in children and adults. However, there is limited data available on the maturational effects on the performance on these tests. The current study investigated maturational effects on auditory processing abilities using three psychophysical measures: temporal modulation transfer function (TMTF), iterated ripple noise (IRN) perception, and spectral ripple discrimination (SRD). A cross-sectional study. Three groups of subjects were tested: 10 adults (18-30 yr), 10 older children (12-18 yr), and 10 young children (8-11 yr) Temporal envelope processing was measured by obtaining thresholds for amplitude modulation detection as a function of modulation frequency (TMTF; 4, 8, 16, 32, 64, and 128 Hz). Temporal fine structure processing was measured using IRN, and spectral processing was measured using SRD. The results showed that young children had significantly higher modulation thresholds at 4 Hz (TMTF) compared to the other two groups and poorer SRD scores compared to adults. The results on IRN did not differ across groups. The results suggest that different aspects of auditory processing mature at different age periods and these maturational effects need to be considered while assessing auditory processing in children.	t	\N
24838018	The multisensory response enhancement (MRE), occurring when the response to a visual target integrated with a spatially congruent sound is stronger than the response to the visual target alone, is believed to be mediated by the superior colliculus (SC) (Stein & Meredith, 1993). Here, we used a focused attention paradigm to show that the spatial congruency effect occurs with red (SC-effective) but not blue (SC-ineffective) visual stimuli, when presented with spatially congruent sounds. To isolate the chromatic component of SC-ineffective targets and to demonstrate the selectivity of the spatial congruency effect we used the random luminance modulation technique (Experiment 1) and the tritanopic technique (Experiment 2). Our results indicate that the spatial congruency effect does not require the distribution of attention over different sensory modalities and provide correlational evidence that the SC mediates the effect.	t	\N
24842067	Auditory feedback plays an important role in children's speech development by providing the child with information about speech outcomes that is used to learn and fine-tune speech motor plans. The use of auditory feedback in speech motor learning has been extensively studied in adults by examining oral motor responses to manipulations of auditory feedback during speech production. Children are also capable of adapting speech motor patterns to perceived changes in auditory feedback; however, it is not known whether their capacity for motor learning is limited by immature auditory-perceptual abilities. Here, the link between speech perceptual ability and the capacity for motor learning was explored in two groups of 5- to 7-year-old children who underwent a period of auditory perceptual training followed by tests of speech motor adaptation to altered auditory feedback. One group received perceptual training on a speech acoustic property relevant to the motor task while a control group received perceptual training on an irrelevant speech contrast. Learned perceptual improvements led to an enhancement in speech motor adaptation (proportional to the perceptual change) only for the experimental group. The results indicate that children's ability to perceive relevant speech acoustic properties has a direct influence on their capacity for sensory-based speech motor adaptation.	t	\N
24850915	Many everyday skilled actions depend on moving in time with signals that are embedded in complex auditory streams (e.g. musical performance, dancing or simply holding a conversation). Such behaviour is apparently effortless; however, it is not known how humans combine auditory signals to support movement production and coordination. Here, we test how participants synchronize their movements when there are potentially conflicting auditory targets to guide their actions. Participants tapped their fingers in time with two simultaneously presented metronomes of equal tempo, but differing in phase and temporal regularity. Synchronization therefore depended on integrating the two timing cues into a single-event estimate or treating the cues as independent and thereby selecting one signal over the other. We show that a Bayesian inference process explains the situations in which participants choose to integrate or separate signals, and predicts motor timing errors. Simulations of this causal inference process demonstrate that this model provides a better description of the data than other plausible models. Our findings suggest that humans exploit a Bayesian inference process to control movement timing in situations where the origin of auditory signals needs to be resolved.	t	\N
24851348	Learning and discovery seem often to begin with noting patterns. Human infants are skilled at pattern detection, even patterns only definable at an abstract level, which is key to their acquisition of complex knowledge systems such as language and music. However, research examining infants' abstract rule learning has generated inconsistent results. We propose that apparent domain differences in infants' abstract rule learning may be the result of extraneous stimulus variation and discrepancies in the methodologies employed across studies probing this skill. We discuss how a behavioral methodology indexing infants' online learning would be valuable in furthering understanding of infants' (as well as adults') abstract rule learning and its neurophysiological concomitants. We outline current research aimed at developing such an index, and we propose future research, pairing such techniques with neurophysiological methods, aimed at shining more light on human skill at discovering structure.	t	\N
24856208	Human early visual cortex was traditionally thought to process simple visual features such as orientation, contrast, and spatial frequency via feedforward input from the lateral geniculate nucleus (e.g., [1]). However, the role of nonretinal influence on early visual cortex is so far insufficiently investigated despite much evidence that feedback connections greatly outnumber feedforward connections [2-5]. Here, we explored in five fMRI experiments how information originating from audition and imagery affects the brain activity patterns in early visual cortex in the absence of any feedforward visual stimulation. We show that category-specific information from both complex natural sounds and imagery can be read out from early visual cortex activity in blindfolded participants. The coding of nonretinal information in the activity patterns of early visual cortex is common across actual auditory perception and imagery and may be mediated by higher-level multisensory areas. Furthermore, this coding is robust to mild manipulations of attention and working memory but affected by orthogonal, cognitively demanding visuospatial processing. Crucially, the information fed down to early visual cortex is category specific and generalizes to sound exemplars of the same category, providing evidence for abstract information feedback rather than precise pictorial feedback. Our results suggest that early visual cortex receives nonretinal input from other brain areas when it is generated by auditory perception and/or imagery, and this input carries common abstract information. Our findings are compatible with feedback of predictive information to the earliest visual input level (e.g., [6]), in line with predictive coding models [7-10].	t	\N
24858108	Rapidly approaching (looming) sounds are ecologically salient stimuli that are perceived as nearer than they are due to overestimation of their loudness change and underestimation of their distance (Neuhoff, 1998; Seifritz et al., 2002). Despite evidence for crossmodal influence by looming sounds onto visual areas (Romei, Murray, Cappe, & Thut, 2009, 2013; Tyll et al., 2013), it is unknown whether such sounds bias visual percepts in similar ways. Nearer objects appear to be larger and brighter than distant objects. If looming sounds impact visual processing, then visual stimuli paired with looming sounds should be perceived as brighter and larger, even when the visual stimuli do not provide motion cues, i.e. are static. In Experiment 1 we found that static visual objects paired with looming tones (but not static or receding tones) were perceived as larger and brighter than their actual physical properties, as if they appear closer to the observer. In a second experiment, we replicate and extend the findings of Experiment 1. Crucially, we did not find evidence of such bias by looming sounds when visual processing was disrupted via masking or when catch trials were presented, ruling out simple response bias. Finally, in a third experiment we found that looming tones do not bias visual stimulus characteristics that do not carry visual depth information such as shape, providing further evidence that they specifically impact in-depth visual processing. We conclude that looming sounds impact visual perception through a mechanism transferring in-depth sound motion information onto the relevant in-depth visual dimensions (such as size and luminance but not shape) in a crossmodal remapping of information for a genuine, evolutionary advantage in stimulus detection.	t	\N
24869445	Two studies examined adult cochlear implant (CI) users' ability to match auditory rhythms occurring in music to visual rhythms occurring in dance (Cha Cha, Slow Swing, Tango and Jive). In Experiment 1, adults CI users (n = 10) and hearing controls matched a music excerpt to choreographed dance sequences presented as silent videos. In Experiment 2, participants matched a silent video of a dance sequence to music excerpts. CI users were successful in detecting timing congruencies across music and dance at well above-chance levels suggesting that they were able to process distinctive auditory and visual rhythm patterns that characterized each style. However, they were better able to detect cross-modal timing congruencies when the reference was an auditory rhythm than when the reference was a visual rhythm. Learning strategies that encourage cross-modal learning of musical rhythms may have applications in developing novel rehabilitative strategies to enhance music perception and appreciation outcomes of child implant users.	t	\N
24872325	In the present study we investigate the rules governing the perception of audiovisual synchrony within spatio-temporally cluttered visual environments. Participants viewed a ring of 19 discs modulating in luminance while hearing an amplitude modulating tone. Each disc modulated with a unique temporal phase (40 ms intervals), with only one synchronized to the tone. Participants searched for the synchronised disc whose spatial location varied randomly across trials. Square-wave modulation facilitated search: the synchronized disc was frequently chosen, with tight response distributions centred near zero-phase lag. In the sinusoidal condition responses were equally distributed over the 19 discs regardless of phase. To investigate whether subjective synchrony in the square-wave condition was limited by spatial or temporal factors we repeated the experiment with either reduced spatial density (9 discs) or temporal density (80 ms phase intervals). Reduced temporal density greatly facilitated synchrony perception but left the synchrony bandwidth unchanged, while no influence of spatial density was found. We conclude that audio-visual synchrony is not strongly constrained by the spatial or temporal density of the visual display, but by a temporal window within which audio-visual events are perceived as synchronous, with a full bandwidth of ~185 ms.	t	\N
24878593	One of the main issues within the field of social robotics is to endow robots with the ability to direct attention to people with whom they are interacting. Different approaches follow bio-inspired mechanisms, merging audio and visual cues to localize a person using multiple sensors. However, most of these fusion mechanisms have been used in fixed systems, such as those used in video-conference rooms, and thus, they may incur difficulties when constrained to the sensors with which a robot can be equipped. Besides, within the scope of interactive autonomous robots, there is a lack in terms of evaluating the benefits of audio-visual attention mechanisms, compared to only audio or visual approaches, in real scenarios. Most of the tests conducted have been within controlled environments, at short distances and/or with off-line performance measurements. With the goal of demonstrating the benefit of fusing sensory information with a Bayes inference for interactive robotics, this paper presents a system for localizing a person by processing visual and audio data. Moreover, the performance of this system is evaluated and compared via considering the technical limitations of unimodal systems. The experiments show the promise of the proposed approach for the proactive detection and tracking of speakers in a human-robot interactive framework.	t	\N
24880672	To assess treatment outcomes via acoustic voice laboratory measurements before and after intervention in patients with common voice problems and Determine if outcome sensitivity of certain voice laboratory measures varies with disorder type. Retrospective and single-blinded. In this study, 40 patients with a single voice disorder diagnosis of either benign vocal fold lesions (lesions), primary muscle tension dysphonia (MTD-1), vocal fold atrophy (atrophy) or unilateral vocal fold paralysis (UVFP) underwent baseline testing, a single intervention-type (phonosurgery/voice therapy), and follow-up testing at uniform time points. Ten patients per diagnosis group were analyzed before and after treatment. Time- and frequency-based acoustic measures taken from vowels and sentences as well as patient-perceptual analysis (Voice Handicap Index-10) were reviewed. Statistically significant improvements were observed for three of four groups. Patients with muscle tension dysphonia displayed an improvement in Cepstral Spectral Index of Dysphonia speech (CSID) (P < 0.05). Patients with lesions had improved Voice Handicap Index-10 (P < 0.05), cepstral peak prominence (CPP) vowel standard deviation (P < 0.05), and CPP speech (P < 0.05). Patients with atrophy did not demonstrate significant improvement in any measure. Patients with unilateral vocal fold paralysis showed an improvement in CSID speech (P < 0.05) and CPP speech (P < 0.05). In addition, strong effect sizes were observed for many of the acoustic parameters studied. For all groups except atrophy, treatment was successful in improving patient perception of voice handicap and/or some acoustic voice parameters. A disorder-specific response to frequency-based acoustic measures was found.	t	\N
24887110	Age-related hearing loss (presbycusis) is a common human disorder, affecting one in three Americans aged 60 and over. Previous studies have shown that presbyacusis is associated with a loss of non-sensory cells in the cochlear lateral wall. Sox10 is a transcription factor crucial to the development and maintenance of neural crest-derived cells including some non-sensory cell types in the cochlea. Mutations of the Sox10 gene are known to cause various combinations of hearing loss and pigmentation defects in humans. This study investigated the potential relationship between Sox10 gene expression and pathological changes in the cochlear lateral wall of aged CBA/CaJ mice and human temporal bones from older donors. Cochlear tissues prepared from young adult (1-3 month-old) and aged (2-2.5 year-old) mice, and human temporal bone donors were examined using quantitative immunohistochemical analysis and transmission electron microscopy. Cells expressing Sox10 were present in the stria vascularis, outer sulcus and spiral prominence in mouse and human cochleas. The Sox10(+) cell types included marginal and intermediate cells and outer sulcus cells, including those that border the scala media and those extending into root processes (root cells) in the spiral ligament. Quantitative analysis of immunostaining revealed a significant decrease in the number of Sox10(+) marginal cells and outer sulcus cells in aged mice. Electron microscopic evaluation revealed degenerative alterations in the surviving Sox10(+) cells in aged mice. Strial marginal cells in human cochleas from donors aged 87 and older showed only weak immunostaining for Sox10. Decreases in Sox10 expression levels and a loss of Sox10(+) cells in both mouse and human aged ears suggests an important role of Sox10 in the maintenance of structural and functional integrity of the lateral wall. A loss of Sox10(+) cells may also be associated with a decline in the repair capabilities of non-sensory cells in the aged ear.	t	\N
24893743	RTs in conversation, with average gaps of 200 msec and often less, beat standard RTs, despite the complexity of response and the lag in speech production (600 msec or more). This can only be achieved by anticipation of timing and content of turns in conversation, about which little is known. Using EEG and an experimental task with conversational stimuli, we show that estimation of turn durations are based on anticipating the way the turn would be completed. We found a neuronal correlate of turn-end anticipation localized in ACC and inferior parietal lobule, namely a beta-frequency desynchronization as early as 1250 msec, before the end of the turn. We suggest that anticipation of the other's utterance leads to accurately timed transitions in everyday conversations.	t	\N
24897885	Twenty-four Japanese undergraduate pairs (12 male and 12 female pairs) participated as witnesses to a simulated criminal event. Although the witness pairs watched the same video together, through wireless headphones they experienced two different auditory versions with four differing items without being aware of the discrepancies. After the presentation, the witnesses were led to discuss six items, including two critical ones they had heard differently and another four they had heard in common. Witness memory performance was assessed individually with multiple-choice questionnaires in three sessions: before the discussion, after the discussion, and 1 week later. The results showed that participants tended to conform to their co-witness more often on the discussed items than on the not-discussed items. Source monitoring analyses on the four critical items revealed that even those participants who conformed were mostly cognizant of the source of their information just after the discussion, but they were prone to source-monitoring errors a week later.	t	\N
24904404	Auditory perception and auditory imagery have been shown to activate overlapping brain regions. We hypothesized that these phenomena also share a common underlying neural representation. To assess this, we used electrocorticography intracranial recordings from epileptic patients performing an out loud or a silent reading task. In these tasks, short stories scrolled across a video screen in two conditions: subjects read the same stories both aloud (overt) and silently (covert). In a control condition the subject remained in a resting state. We first built a high gamma (70-150 Hz) neural decoding model to reconstruct spectrotemporal auditory features of self-generated overt speech. We then evaluated whether this same model could reconstruct auditory speech features in the covert speech condition. Two speech models were tested: a spectrogram and a modulation-based feature space. For the overt condition, reconstruction accuracy was evaluated as the correlation between original and predicted speech features, and was significant in each subject (p < 10(-5); paired two-sample t-test). For the covert speech condition, dynamic time warping was first used to realign the covert speech reconstruction with the corresponding original speech from the overt condition. Reconstruction accuracy was then evaluated as the correlation between original and reconstructed speech features. Covert reconstruction accuracy was compared to the accuracy obtained from reconstructions in the baseline control condition. Reconstruction accuracy for the covert condition was significantly better than for the control condition (p < 0.005; paired two-sample t-test). The superior temporal gyrus, pre- and post-central gyrus provided the highest reconstruction information. The relationship between overt and covert speech reconstruction depended on anatomy. These results provide evidence that auditory representations of covert speech can be reconstructed from models that are built from an overt speech data set, supporting a partially shared neural substrate.	t	\N
24908166	Aging is associated with declines in auditory processing including speech comprehension abilities. Here, we evaluated both brainstem and cortical speech-evoked brain responses to elucidate how aging impacts the neural transcription and transfer of speech information between functional levels of the auditory nervous system. Behaviorally, older adults showed slower, more variable speech classification performance than younger listeners, which coincided with reduced brainstem amplitude and increased, but delayed, cortical speech-evoked responses. Mild age-related hearing loss showed differential correspondence with neurophysiological responses showing negative (brainstem) and positive (cortical) correlations with brain activity. Spontaneous brain activity, that is, "neural noise," did not differ between older and younger adults. Yet, mutual information and correlations computed between brainstem and cortex revealed higher redundancy (i.e., lower interdependence) in speech information transferred along the auditory pathway implying less neural flexibility in older adults. Results are consistent with the notion that weakened speech encoding in brainstem is overcompensated by increased cortical dysinhibition in the aging brain. Findings suggest aging negatively impacts speech listening abilities by distorting the hierarchy of speech representations, reducing neural flexibility through increased neural redundancy, and ultimately impairing the acoustic-phonetic mapping necessary for robust speech understanding.	t	\N
24931795	Alpha oscillations are a prominent electrophysiological signal measured across a wide range of species and cortical and subcortical sites. Alpha oscillations have been viewed for a long time as an "idling" rhythm, purely reflecting inactive sites. Despite earlier evidence from neurophysiology, awareness that alpha oscillations can substantially influence perception and behavior has grown only recently in cognitive neuroscience. Evidence for an active role of alpha for perception comes mainly from several visual, near-threshold experiments. In the current review, we extend this view by summarizing studies showing how alpha-defined brain states relate to illusory perception, i.e. cases of perceptual reports that are not "objectively" verifiable by distinct stimuli or stimulus features. These studies demonstrate that ongoing or prestimulus alpha oscillations substantially influence the perception of auditory, visual or multisensory illusions.	t	\N
24935820	Action perception and recognition are core abilities fundamental for human social interaction. A parieto-frontal network (the mirror neuron system) matches visually presented biological motion information onto observers' motor representations. This process of matching the actions of others onto our own sensorimotor repertoire is thought to be important for action recognition, providing a non-mediated "motor perception" based on a bidirectional flow of information along the mirror parieto-frontal circuits. State-of-the-art machine learning strategies for hand action identification have shown better performances when sensorimotor data, as opposed to visual information only, are available during learning. As speech is a particular type of action (with acoustic targets), it is expected to activate a mirror neuron mechanism. Indeed, in speech perception, motor centers have been shown to be causally involved in the discrimination of speech sounds. In this paper, we review recent neurophysiological and machine learning-based studies showing (a) the specific contribution of the motor system to speech perception and (b) that automatic phone recognition is significantly improved when motor data are used during training of classifiers (as opposed to learning from purely auditory data).	t	\N
24936610	All languages employ certain phonetic contrasts when distinguishing words. Infant speech perception is rapidly attuned to these contrasts before many words are learned, thus phonetic attunement is thought to proceed independently of lexical and referential knowledge. Here, evidence to the contrary is provided. Ninety-eight 9-month-old English-learning infants were trained to perceive a non-native Cantonese tone contrast.Two object–tone audiovisual pairings were consistently presented, which highlighted the target contrast (Object A with Tone X; Object B with Tone Y). Tone discrimination was then assessed. Results showed improved tone discrimination if object–tone pairings were perceived as being referential word labels, although this effect was modulated by vocabulary size. Results suggest how lexical and referential knowledge could play a role in phonetic attunement.	t	\N
24945666	Naturalistic stimuli, such as normal speech and narratives, are opening up intriguing prospects in neuroscience, especially when merging neuroimaging with machine learning methodology. Here we propose a task-optimized spatial filtering strategy for uncovering individual magnetoencephalographic (MEG) responses to audiobook stories. Ten subjects listened to 1-h-long recording once, as well as to 48 repetitions of a 1-min-long speech passage. Employing response replicability as statistical validity and utilizing unsupervised learning methods, we trained spatial filters that were able to generalize over datasets of an individual. For this blind-signal-separation (BSS) task, we derived a version of multi-set similarity-constrained canonical correlation analysis (SimCCA) that theoretically provides maximal signal-to-noise ratio (SNR) in this setting. Irrespective of significant noise in unaveraged MEG traces, the method successfully uncovered feasible time courses up to ~120 Hz, with the most prominent signals below 20 Hz. Individual trial-to-trial correlations of such time courses reached the level of 0.55 (median 0.33 in the group) at ~0.5 Hz, with considerable variation between subjects. By this filtering, the SNR increased up to 20 times. In comparison, independent component analysis (ICA) or principal component analysis (PCA) did not improve SNR notably. The validity of the extracted brain signals was further assessed by inspecting their associations with the stimulus, as well as by mapping the contributing cortical signal sources. The results indicate that the proposed methodology effectively reduces noise in MEG recordings to that extent that brain responses can be seen to nonrecurring audiobook stories. The study paves the way for applications aiming at accurately modeling the stimulus-response-relationship by tackling the response variability, as well as for real-time monitoring of brain signals of individuals in naturalistic experimental conditions.	t	\N
24946201	To examine active inhibition of irrelevant stimuli and evaluate its neural basis using functional near infrared spectroscopy in patients with attention deficits after traumatic brain injury (TBI). Case control study. Ten patients with TBI and 10 healthy control subjects participated in this study. The Paced Auditory Serial Addition Test (PASAT) was performed with (distracting PASAT) and without (PASAT) distracting Japanese kana phonetic characters presented between each number. A block design was used. Subjects alternately performed each task three times. Healthy controls performed better than patients with TBI on both the tasks. When performing the PASAT, healthy controls showed significant activity in every region of interest except the right lateral prefrontal cortex (PFC), but patients with TBI showed significant activity only in the left anterior PFC and left lateral PFC. When performing the distracting PASAT, the right lateral PFC was active in healthy controls, but not in patients with TBI. These results confirm that patients with moderate-to-severe TBI were affected by distractors that influenced order processing. It is suggested that the working memory of patients with TBI was affected by distracting stimuli, whereas that of healthy individuals was not.	t	\N
24963624	Recent experimental evidence suggests that the perception of temporal intervals is influenced by the temporal context in which they are presented. A longstanding example is the time-order-error, wherein the perception of two intervals relative to one another is influenced by the order in which they are presented. Here, we test whether the perception of temporal intervals in an absolute judgment task is influenced by the preceding temporal context. Human subjects participated in a temporal bisection task with no anchor durations (partition method). Intervals were demarcated by a Gaussian blob (visual condition) or burst of white noise (auditory condition) that persisted for one of seven logarithmically spaced sub-second intervals. Crucially, the order in which stimuli were presented was first-order counterbalanced, allowing us to measure the carryover effect of every successive combination of intervals. The results demonstrated a number of distinct findings. First, the perception of each interval was biased by the prior response, such that each interval was judged similarly to the preceding trial. Second, the perception of each interval was also influenced by the prior interval, such that perceived duration shifted away from the preceding interval. Additionally, the effect of decision bias was larger for visual intervals, whereas auditory intervals engendered greater perceptual carryover. We quantified these effects by designing a biologically-inspired computational model that measures noisy representations of time against an adaptive memory prior while simultaneously accounting for uncertainty, consistent with a Bayesian heuristic. We found that our model could account for all of the effects observed in human data. Additionally, our model could only accommodate both carryover effects when uncertainty and memory were calculated separately, suggesting separate neural representations for each. These findings demonstrate that time is susceptible to similar carryover effects as other basic stimulus attributes, and that the brain rapidly adapts to temporal context.	t	\N
24975235	Matrix sentence tests use words from a fixed word matrix to compose syntactically equivalent, but semantically unpredictable sentences. These tests are suitable for monitoring performance of cochlear implant (CI) users by repeated speech intelligibility testing. This study evaluates the Dutch matrix sentence test in CI users in quiet and in noise. It then investigates the possibility to improve the test-retest reliability for CI users by selecting subsets of sentences. Repeated speech intelligibility testing was performed in quiet and in noise. The effect of sentence selection on the test-retest reliability was predicted by computer simulations and experimentally evaluated using a cross-over design. Fifteen post-lingually deafened CI users, of which eleven participated in the cross-over study. The test-retest reliability equaled 2.3 dB in quiet and 1.3 dB in noise. The simulations predicted an improvement in test-retest reliability, especially in quiet. The cross-over study did not confirm the predictions. The results of the study suggest that the homogeneity of the sentences is not the prime component underlying the test-retest reliability. The Dutch matrix speech material and the selected subsets of sentences were equally suitable for speech intelligibility testing in CI users.	t	\N
24993019	Conversation scenes are a typical example in which classical models of visual attention dramatically fail to predict eye positions. Indeed, these models rarely consider faces as particular gaze attractors and never take into account the important auditory information that always accompanies dynamic social scenes. We recorded the eye movements of participants viewing dynamic conversations taking place in various contexts. Conversations were seen either with their original soundtracks or with unrelated soundtracks (unrelated speech and abrupt or continuous natural sounds). First, we analyze how auditory conditions influence the eye movement parameters of participants. Then, we model the probability distribution of eye positions across each video frame with a statistical method (Expectation-Maximization), allowing the relative contribution of different visual features such as static low-level visual saliency (based on luminance contrast), dynamic low level visual saliency (based on motion amplitude), faces, and center bias to be quantified. Through experimental and modeling results, we show that regardless of the auditory condition, participants look more at faces, and especially at talking faces. Hearing the original soundtrack makes participants follow the speech turn-taking more closely. However, we do not find any difference between the different types of unrelated soundtracks. These eyetracking results are confirmed by our model that shows that faces, and particularly talking faces, are the features that best explain the gazes recorded, especially in the original soundtrack condition. Low-level saliency is not a relevant feature to explain eye positions made on social scenes, even dynamic ones. Finally, we propose groundwork for an audiovisual saliency model.	t	\N
24993189	Multiple sound reflections from room materials and a listener's head induce slight spectral modifications of sounds. This coloration depends on the listener and source positions, and on the room itself. This study investigated whether coloration could help segregate competing sources. Obligatory streaming was evaluated for diotic speech-shaped noises using a rhythmic discrimination task. Thresholds for detecting anisochrony were always significantly higher when stimuli differed in spectrum. The tested differences corresponded to three spatial configurations involving different levels of head and room coloration. These results suggest that, despite the generally deleterious effects of reverberation on speech intelligibility, coloration could favor source segregation.	t	\N
24993221	Prosodic rhythm in speech [the alternation of "Strong" (S) and "weak" (w) syllables] is cued, among others, by slow rates of amplitude modulation (AM) within the speech envelope. However, it is unclear exactly which envelope modulation rates and statistics are the most important for the rhythm percept. Here, the hypothesis that the phase relationship between "Stress" rate (∼2 Hz) and "Syllable" rate (∼4 Hz) AMs provides a perceptual cue for speech rhythm is tested. In a rhythm judgment task, adult listeners identified AM tone-vocoded nursery rhyme sentences that carried either trochaic (S-w) or iambic patterning (w-S). Manipulation of listeners' rhythm perception was attempted by parametrically phase-shifting the Stress AM and Syllable AM in the vocoder. It was expected that a 1π radian phase-shift (half a cycle) would reverse the perceived rhythm pattern (i.e., trochaic → iambic) whereas a 2π radian shift (full cycle) would retain the perceived rhythm pattern (i.e., trochaic → trochaic). The results confirmed these predictions. Listeners judgments of rhythm systematically followed Stress-Syllable AM phase-shifts, but were unaffected by phase-shifts between the Syllable AM and the Sub-beat AM (∼14 Hz) in a control condition. It is concluded that the Stress-Syllable AM phase relationship is an envelope-based modulation statistic that supports speech rhythm perception.	t	\N
24993222	Perceptual compensation for coarticulation (PCCA) refers to listener responses consistent with perceptual reduction of the acoustic effects of the coarticulatory context on a target sound. The robustness of PCCA across individuals and across tasks have not been studied together previously. This study reports the results of two experiments designed to determine the robustness of perceptual compensation for vocalic influence on sibilant perception across tasks and the stability of such compensatory response within an individual. Identification and discrimination data, collected in the laboratory and on Amazon's Mechanical Turk, showed that individuals are moderately stable in their PCCA responses across tasks and the level of stability is consistent across both the lab-based and the internet-based cohorts, although some differences are observed.	t	\N
24995901	Residual inhibition (RI) is the temporary inhibition of tinnitus by use of masking stimuli when the device is turned off. The main aim of this study was to evaluate the effects of RI induced by auditory electrical stimulation (AES) in the primary auditory pathways using early auditory-evoked potentials (AEPs) in subjective idiopathic tinnitus (SIT) subjects. A randomized placebo-controlled study was conducted on forty-four tinnitus subjects. All enrolled subjects based on the responses to AES, were divided into two groups of RI and Non-RI (NRI). The results of the electrocochleography (ECochG), auditory brain stem response (ABR) and brain stem transmission time (BTT) were determined and compared pre- and post-AES in the studied groups. The mean differences in the compound action potential (CAP) amplitudes and III/V and I/V amplitude ratios were significantly different between the RI, NRI and PES controls. BTT was significantly decreased associated with RI. The observed changes in AEP associated with RI suggested some peripheral and central auditory alterations. Synchronized discharges of the auditory nerve fibers and inhibition of the abnormal activity of the cochlear nerve by AES may play important roles associated with RI. Further comprehensive studies are required to determine the mechanisms of RI more precisely.	t	\N
25000379	The purpose of this study was to determine the influence of dialect upon the perception of dysarthric speech. Speakers and listeners were self-identifying as either Caucasian American or African American. Three speakers were Caucasian American, three were African American. Four speakers had experienced a CVA and were dysarthric. Listeners were age matched and were equally divided for gender. Readers recorded 14 word sentences from the Assessment of Intelligibility of Dysarthric Speech. Listeners provided ratings of intelligibility, comprehensibility, and acceptability. Own-race biases were found for all measures; however, significant findings were found for intelligibility and comprehensibility in that the Caucasian Americans provided significantly higher scores for Caucasian American speakers. Clinical implications are discussed.	t	\N
25003309	The idea that the human mind can be divided into distinct (but interacting) functional modules is an important presupposition in many theories of cognition. While previous research on modularity predominantly studied input domains (e.g., vision) or central processes, the present study focused on cognitive representations of output domains. Specifically, we asked to what extent output domain representations are encapsulated (i.e., immune to influence from other domains, representing a key feature of modularity) by studying determinants of interference between simultaneous action demands (oculomotor and vocal responses). To examine the degree of encapsulation, we compared single- vs. dual-response performance triggered by single stimuli. Experiment 1 addressed the role of stimulus modality under dimensionally overlapping response requirements (stimuli and responses were spatial and compatible throughout). In Experiment 2, we manipulated the presence of dimensional overlap across responses. Substantial performance costs associated with dual-response (vs. single-response) demands were observed across response modalities, conditions, and experiments. Dimensional overlap combined with shared spatial codes across responses enabled response-code priming (i.e., beneficial crosstalk between output domains). Overall, the results are at odds with the idea of strong encapsulation of output system representations and show how processing content determines the extent of interdependency between output domains in cognition.	t	\N
25018691	In schizophrenia, evoked 40-Hz auditory steady-state responses (ASSRs) are impaired, which reflects the sensory deficits in this disorder, and baseline spontaneous oscillatory activity also appears to be abnormal. It has been debated whether the evoked ASSR impairments are due to the possible increase in baseline power. GABAergic interneuron-specific NMDA receptor (NMDAR) hypofunction mutant mice mimic some behavioral and pathophysiological aspects of schizophrenia. To determine the presence and extent of sensory deficits in these mutant mice, we recorded spontaneous local field potential (LFP) activity and its click-train evoked ASSRs from primary auditory cortex of awake, head-restrained mice. Baseline spontaneous LFP power in the pre-stimulus period before application of the first click trains was augmented at a wide range of frequencies. However, when repetitive ASSR stimuli were presented every 20 s, averaged spontaneous LFP power amplitudes during the inter-ASSR stimulus intervals in the mutant mice became indistinguishable from the levels of control mice. Nonetheless, the evoked 40-Hz ASSR power and their phase locking to click trains were robustly impaired in the mutants, although the evoked 20-Hz ASSRs were also somewhat diminished. These results suggested that NMDAR hypofunction in cortical GABAergic neurons confers two brain state-dependent LFP abnormalities in the auditory cortex; (1) a broadband increase in spontaneous LFP power in the absence of external inputs, and (2) a robust deficit in the evoked ASSR power and its phase-locking despite of normal baseline LFP power magnitude during the repetitive auditory stimuli. The "paradoxically" high spontaneous LFP activity of the primary auditory cortex in the absence of external stimuli may possibly contribute to the emergence of schizophrenia-related aberrant auditory perception.	t	\N
25036146	The objective of this study was to examine how the level of current required for cochlear implant listeners to detect single-channel electrical pulse trains relates to loudness perception on the same channel. The working hypothesis was that channels with relatively high thresholds, when measured with a focused current pattern, interface poorly to the auditory nerve. For such channels, a smaller dynamic range between perceptual threshold and the most comfortable loudness would result, in part, from a greater sensitivity to changes in electrical field spread compared to low-threshold channels. The narrower range of comfortable listening levels may have important implications for speech perception. Data were collected from eight, adult cochlear implant listeners implanted with the HiRes90k cochlear implant (Advanced Bionics Corp.). The partial tripolar (pTP) electrode configuration, consisting of one intracochlear active electrode, two flanking electrodes carrying a fraction (σ) of the return current, and an extracochlear ground, was used for stimulation. Single-channel detection thresholds and most comfortable listening levels were acquired using the most focused pTP configuration possible (σ ≥ 0.8) to identify three channels for further testing-those with the highest, median, and lowest thresholds-for each subject. Threshold, equal-loudness contours (at 50% of the monopolar dynamic range), and loudness growth functions were measured for each of these three test channels using various pTP fractions. For all test channels, thresholds increased as the electrode configuration became more focused. The rate of increase with the focusing parameter σ was greatest for the high-threshold channel compared to the median- and low-threshold channels. The 50% equal-loudness contours exhibited similar rates of increase in level across test channels and subjects. Additionally, test channels with the highest thresholds had the narrowest dynamic ranges (for σ ≥ 0.5) and steepest growth of loudness functions for all electrode configurations. Together with previous studies using focused stimulation, the results suggest that auditory responses to electrical stimuli at both threshold and suprathreshold current levels are not uniform across the electrode array of individual cochlear implant listeners. Specifically, the steeper growth of loudness and thus smaller dynamic ranges observed for high-threshold channels are consistent with a degraded electrode-neuron interface, which could stem from lower numbers of functioning auditory neurons or a relatively large distance between the neurons and electrodes. These findings may have potential implications for how stimulation levels are set during the clinical mapping procedure, particularly for speech-processing strategies that use focused electrical fields.	t	\N
25041936	In vision an extensive literature supports the existence of competitive dual-processing systems of category learning that are grounded in neuroscience and are partially-dissociable. The reflective system is prefrontally-mediated and uses working memory and executive attention to develop and test rules for classifying in an explicit fashion. The reflexive system is striatally-mediated and operates by implicitly associating perception with actions that lead to reinforcement. Although categorization is fundamental to auditory processing, little is known about the learning systems that mediate auditory categorization and even less is known about the effects of individual difference in the relative efficiency of the two learning systems. Previous studies have shown that individuals with elevated depressive symptoms show deficits in reflective processing. We exploit this finding to test critical predictions of the dual-learning systems model in audition. Specifically, we examine the extent to which the two systems are dissociable and competitive. We predicted that elevated depressive symptoms would lead to reflective-optimal learning deficits but reflexive-optimal learning advantages. Because natural speech category learning is reflexive in nature, we made the prediction that elevated depressive symptoms would lead to superior speech learning. In support of our predictions, individuals with elevated depressive symptoms showed a deficit in reflective-optimal auditory category learning, but an advantage in reflexive-optimal auditory category learning. In addition, individuals with elevated depressive symptoms showed an advantage in learning a non-native speech category structure. Computational modeling suggested that the elevated depressive symptom advantage was due to faster, more accurate, and more frequent use of reflexive category learning strategies in individuals with elevated depressive symptoms. The implications of this work for dual-process approach to auditory learning and depression are discussed.	t	\N
25059267	Chronic tinnitus affects 5-15% of the general population; in 1% of individuals with tinnitus this condition severely affects their quality of life. Pharmacological treatment is one of the options for the management of tinnitus patients, but their efficacy remains controversial. AIM. To evaluate the level of evidence to support the use of different drugs in reducing the severity of tinnitus. The pharmacological groups that have been investigated for the treatment of tinnitus include anesthetics, anticonvulsants, antidepressants, antihistamines, benzodiazepines, diuretics, corticosteroids, and of other substances. Intravenous lidocaine seems to be effective, but the short duration of the effect and the adverse reactions prevent its use. Compared with placebo, carbamazepine and gabapentine have not demonstrated effectiveness although they may be effective in some patients with auditory nerve vascular compression or myoclonus. Tricyclic antidepressants are no more effective than placebo at reducing tinnitus severity although they may improve comorbid depression. There is insufficient evidence to evaluate the effectiveness of selective serotonin reuptake inhibitors and benzodiazepines. Acamprosate may decrease the severity of tinnitus, but the level of evidence is low. There are no consistent results in the studies with intratympanic gentamicin or steroids in tinnitus associated with Meniere's disease. The use of pharmacotherapy in reducing the severity of tinnitus is not well supported by prospective, randomized, placebo-controlled clinical trials. Various drugs have been shown to be effective in some studies, but the clinical evidence is limited. Large randomized clinical trials are needed.	t	\N
25085738	We establish a new dissociation between the roles of working memory (WM) cognitive control and visual maintenance in selective attention as measured by the efficiency of distractor rejection. The extent to which focused selective attention can prevent distraction has been shown to critically depend on the level and type of load involved in the task. High perceptual load that consumes perceptual capacity leads to reduced distractor processing, whereas high WM load that reduces WM ability to exert priority-based executive cognitive control over the task results in increased distractor processing (e.g., Lavie, Trends in Cognitive Sciences, 9(2), 75-82, 2005). WM also serves to maintain task-relevant visual representations, and such visual maintenance is known to recruit the same sensory cortices as those involved in perception (e.g., Pasternak & Greenlee, Nature Reviews Neuroscience, 6(2), 97-107, 2005). These findings led us to hypothesize that loading WM with visual maintenance would reduce visual capacity involved in perception, thus resulting in reduced distractor processing-similar to perceptual load and opposite to WM cognitive control load. Distractor processing was assessed in a response competition task, presented during the memory interval (or during encoding; Experiment 1a) of a WM task. Loading visual maintenance or encoding by increased set size for a memory sample of shapes, colors, and locations led to reduced distractor response competition effects. In contrast, loading WM cognitive control with verbal rehearsal of a random letter set led to increased distractor effects. These findings confirm load theory predictions and provide a novel functional distinction between the roles of WM maintenance and cognitive control in selective attention.	t	\N
25089573	One of the leading theories for dyslexia suggests that it is the result of a difficulty in auditory temporal processing (ATP). This theory, as well as others, is supported by studies showing group differences and correlation between phonological awareness and ATP. However, these studies do not provide causal relationship. In the current study the authors aimed to test causal relationship between ATP and phonological awareness by comparing the performance of dyslexic and normal reader students in phonological awareness tasks before and after a short-term (5-day) training in either temporal processing (dichotic temporal order judgment; TOJ), nontemporal processing (intensity discrimination), or no training. TOJ training resulted in significant reduction of TOJ threshold and increase in phonological awareness tasks' scores. Intensity discrimination training resulted in a decrease of intensity discrimination threshold, but with no change in phonological awareness tasks. Those who had no training, had no change in TOJ and intensity discrimination thresholds, as well as in the phonological awareness tasks. These results show that (a) a short-term training in temporal processing with no other perceptual cues for adult dyslexic and normal readers can be efficient in improving their phonological awareness; and (b) phonological awareness (dis) ability has causal relationship to ATP.	t	\N
25096110	Three-month-olds discriminate resolved harmonic complexes on the basis of missing fundamental (MF) pitch. In view of reported difficulty in discriminating unresolved complexes at 7 months and striking changes in the organization of the auditory system during early infancy, infants' ability to discriminate unresolved complexes is of some interest. This study investigated the ability of 3-month-olds, 7-month-olds, and adults to discriminate the pitch of unresolved harmonic complexes using an observer-based method. Stimuli were MF complexes bandpass filtered with a -12 dB/octave slope, combined in random phase, presented at 70 dB sound pressure level (SPL) for 650 ms with a 50 ms rise/fall with a pink noise at 65 dB SPL. The conditions were (1) "LOW" unresolved harmonics (2500-4500 Hz) based on MFs of 160 and 200 Hz and (2) "HIGH" unresolved harmonics (4000-6000 Hz) based on MFs of 190 and 200 Hz. To demonstrate MF discrimination, participants had to ignore spectral changes in complexes with the same fundamental and respond only when the fundamental changed. Nearly all infants tested categorized complexes by MF pitch suggesting discrimination of pitch extracted from unresolved harmonics by 3 months. Adults also categorized the complexes by MF pitch, although musically trained adults were more successful than musically untrained adults.	t	\N
25096125	This paper investigated how auditory and vibrotactile feedback information is integrated within the context of violin quality evaluation. Fifteen violinists evaluated three violins on four criteria-"Rich Sound," "Loud and Powerful," "Alive and Responsive," and "Pleasure"-during a perceptual experiment. Violinists first evaluated the violins one at a time under three experimental conditions: (1) playing, (2) listening to it (played by a professional player) in an active way by fingering the score on an isolated neck, (3) same as (2) with vibrotactile feedback provided at the isolated neck. Violinists were then asked to evaluate the violins through pairwise comparisons under condition (3): Each violin was paired with itself while the level of vibrations of the isolated neck was either the original one or divided by two. The first part of the experiment demonstrated that Loud and Powerful judgments were affected by the presence of vibrations given that violins were rated louder in condition (3) than in (2). In the second part, violins were rated more positively with original vibration level at the isolated neck than with half the level, for all criteria but Alive and Responsive. Consistently with sensory interaction, the magnitude of the enhancement remained relatively constant across violins.	t	\N
25111522	Assess surgical complications, postoperative residual hearing, and speech perception outcomes of placement of a middle ear implant on the round window in conductive and mixed hearing loss cases. Single-subject, repeated-measures design where each subject served as his or her own control. Tertiary referral medical systems. Eighteen subjects with either conductive or mixed hearing loss who could not benefit from conventional amplification were enrolled in a clinical trial investigating vibratory stimulation of the round window. The floating mass transducer (FMT) was positioned in the round window niche. Unaided residual hearing, and aided sound field thresholds and speech perception abilities were evaluated preoperatively, and at 1, 3, 6, and 10 months post-activation of the external speech processor. Six subjects experienced complications that either required further medical management or resolved on their own. There was no difference in residual bone conduction thresholds or unaided word discrimination over time. All subjects experienced a significant improvement in aided speech perception abilities as compared to preoperative performance. Subjects with conductive and mixed hearing loss with placement of the FMT in the round window niche experienced improved sound field thresholds and speech perception, without compromising residual hearing thresholds. Vibratory stimulation of the round window via a middle ear implant may be an appropriate treatment option for patients with conductive and mixed hearing loss. Additional research is needed on the preferred placement of the FMT, improvement of functional gain, and methods to limit postoperative complications and need for revision surgery.	t	\N
25118919	To correlate the findings of an open-field audiometry with the thresholds of steady-state auditory-evoked potentials (SSAEPs) found in infants of up to 6 months of age with sensorineural hearing loss. This study included 19 infants with sensorineural hearing loss (8 males and 11 females), with minimum age of 2 months and maximum age of 6 months. The SSAEPs were assessed at 500 and 2000 Hz, and the audiometry was performed in open field through observation of behavioral responses to sound stimuli, at the same frequencies. We observed a significant correlation between the findings of both tests conducted at 500 and 2000 Hz, with p-values of 0.002 and 0.013, respectively. There was no statistical difference between ears (p=0.532) and genders (p=0.615). We conclude that there was a significant correlation between the SSAEP thresholds and the findings of the open-field audiometry. Therefore, we can affirm that the SSAEPs are a viable examination, able to predict the degree and configuration of hearing loss in infants of up to 6 months of age, and that they can be included in the clinical routine of hearing assessments conducted in children.	t	\N
25133472	Hearing preservation (HP) surgery was initiated more than 10 years ago for combined electric and acoustic stimulation (EAS). Preserved residual low-frequency hearing has been demonstrated to improve speech perception in noise as well as music appreciation in EAS users up to 2 years. Multiple study groups aimed to evaluate initial loss of residual hearing (RH) as a consequence of HP surgery. However, after 1 year and 2 years of follow-up, further decline was reported. This study aimed to determine RH, speech perception, and the subjective benefits of EAS 10 years after HP surgery. Nine postlingual EAS partially deaf patients who underwent HP surgery at Antwerp University Hospital were included in this study (11 implanted ears). Hearing preservation (0% = loss of hearing; >0%-25% = minimal HP; >25%-75% = partial HP; >75% = complete HP), speech perception and subjective benefits were evaluated preoperatively; at 3, 6, 12, 18, and 24 months postoperatively; and annually thereafter. Complete HP was obtained in three of 11 ears; partial HP in five of 11 ears; and minimal HP in two of 11 ears, measured during their most recent follow-up. One subject lost his RH completely across time. The mean rate of HP was 48% (ranging from 6 months up to 10 years postoperatively). Speech perception analysis up to 10 years showed a continuous statistically significant improvement. The maximum subjective benefit was reached 3 months after implantation and subsequently remained statistically significant unchanged for the next 10 years. Long-term HP in EAS users after HP surgery is feasible, although a small continuous decline of HP rate of 3% per year was observed (measured from first fitting up to 6 years postoperative). Nevertheless, a continuous improvement was found in the speech perception results of the EAS users across 10 years. Moreover, the positive subjective benefit, assessed 3 months postoperative, remained stable up to 10 years.	t	\N
25143548	When a high harmonic is removed from a cosine-phase harmonic complex, we hear a sine tone pop out of the perception; the sine tone has the pitch of the high harmonic, while the tone complex has the pitch of its fundamental frequency, f0. This phenomenon is commonly referred to as Duifhuis Pitch (DP). This paper describes, for the first time, the cortical representation of DP observed with magnetoencephalography. In experiment 1, conditions that produce the perception of a DP were observed to elicit a classic onset response in auditory cortex (P1m, N1m, P2m), and an increment in the sustained field (SF) established in response to the tone complex. Experiment 2 examined the effect of the phase spectrum of the complex tone on the DP activity: Schroeder-phase negative waves elicited a transient DP complex with a similar shape to that observed with cosine-phase waves but with much longer latencies. Following the transient DP activity, the responses of the negative and positive Schroeder-phase waves converged, and the increment in the SF slowly died away. In the absence of DP, the two Schroeder-phase conditions with low peak factors both produced larger SFs than cosine-phase waves with large peak factors. A model of the auditory periphery that includes coupling between adjacent frequency channels is used to explain the early neuromagnetic activity observed in auditory cortex.	t	\N
25153664	This study was designed to evaluate the binaural effects from bimodal hearing according to the aided hearing threshold in the nonimplanted ear. Subjects included 17 individuals who continued to use a hearing aid (HA) in the nonimplanted ear for more than 6 months postoperatively. Speech perception and sound localization were tested with unilateral cochlear implantation (CI) and bimodal hearing with and without background noise. Materials were presented at an average of 70 dB sound pressure level from a front loudspeaker in a quiet condition and then with background noise at a signal-to-noise ratio of +10 dB HL. Speech perception scores were based on percent-correct performance of repeating a spoken word under each condition. Sound localization scores were obtained by averaging the sum of angle differences between the active loudspeaker and the loudspeaker indicated by the subject. Speech perception scores (mean ± SD) of unilateral CI and bimodal hearing were 63.3% ± 17.7% and 73.1% ± 18.5% under the quiet condition (p = 0.029) and 65.5% ± 21.9% and 70.9% ± 23.6% under the noisy condition (p = 0.01), respectively. Angle differences (mean ± SD) of unilateral CI and bimodal hearing were 72.8 ± 27.4 degrees and 84.1 ± 29.9 degrees under the quiet condition (p = 0.052) and 79.3 ± 26.9 degrees and 77.3 ± 22.0 degrees under the noisy condition (p = 0.906), respectively. Patients were divided into two groups according to their aided hearing thresholds: Group 1 (aided hearing threshold ≤50 dB HL; n = 8) and Group 2 (aided hearing threshold >50 dB HL; n = 9). The speech perception scores of bimodal hearing in each group were 85.3% ± 13.3% and 60.8% ± 17.5% (p = 0.023) under the quiet condition and 82.7% ± 9.0% and 59.4% ± 26.8% under the noisy condition (p = 0.052), respectively. For sound localization, the angle differences of bimodal hearing in each group were 54 ± 28.6 degrees and 83.9 ± 11.9 degrees under the quiet condition (p = 0.042) and 63.0 ± 23.5 degrees and 89 ± 22.6 degrees under the noisy condition (p = 0.049), respectively. Based on the relationship between the aided hearing level and bimodal hearing performance, this current study suggests that bimodal benefits for sound localization and speech perception in noise are significant but only when sound detection is adequate for the hearing aids. Therefore, bimodal hearing could be applied to selective patients with favorable aided hearing levels.	t	\N
25153739	Unconscious priming is sensitive to contextual factors. The present study examined this adaptive process using masked arrow primes (< or >). Some targets required specific "fixed" left/right responses (< or >) and others required "free" left/right responses (<>). Different groups (n = 30 each) received response-congruent primes (SOA = 75 msec.) on 0.2, 0.5, or 0.8 of the fixed-response trials. Fixed responses were facilitated by congruent primes and free responses were faster when congruent with the prime. Critically, these masked priming effects emerged only in the 0.8 group. The pattern of extant prime-proportion effects in this paradigm best supports an adaptive associative-strength account rather than memory-recruitment or response-bias-suppression accounts.	t	\N
25155249	Computational modeling and eye-tracking were used to investigate how phonological and semantic information interact to influence the time course of spoken word recognition. We extended our recent models (Chen & Mirman, 2012; Mirman, Britt, & Chen, 2013) to account for new evidence that competition among phonological neighbors influences activation of semantically related concepts during spoken word recognition (Apfelbaum, Blumstein, & McMurray, 2011). The model made a novel prediction: Semantic input modulates the effect of phonological neighbors on target word processing, producing an approximately inverted-U-shaped pattern with a high phonological density advantage at an intermediate level of semantic input-in contrast to the typical disadvantage for high phonological density words in spoken word recognition. This prediction was confirmed with a new analysis of the Apfelbaum et al. data and in a visual world paradigm experiment with preview duration serving as a manipulation of strength of semantic input. These results are consistent with our previous claim that strongly active neighbors produce net inhibitory effects and weakly active neighbors produce net facilitative effects.	t	\N
25156097	The primary objective of the study was to investigate the feasibility, reliability, and validity of the Dutch digits in noise (DIN) test for measuring speech recognition in hearing aid and cochlear implant users and compare results to the standard sentences-in-noise (SIN) test. The relation between speech reception thresholds for DIN test and SIN test was analysed to determine the validity of the DIN test. As linguistic skills were expected to make different contributions in these tests, their influence was analysed. Participants were 12 normal-hearing listeners, 24 hearing aid users, and 24 cochlear implant users. The DIN test was feasible for more participants than the SIN test. Intraclass correlation coefficients showed high reliability. The standard error of measurement was smaller for the DIN test than for the SIN test. DIN test and SIN test were highly correlated (r = 0.95 and r = 0.56 for NH+ HA and CI users respectively). In the regression analysis no significant contribution of basic linguistic skills or personal factors was found. In the assessment of speech recognition in noise of aided hearing-impaired listeners with hearing aids or cochlear implants, the DIN test is a feasible, reliable and valid test.	t	\N
25156757	In four experiments in which participants searched for multiple target digits we hypothesized that search should be fastest when the targets are arranged closely together on the number line without any intervening distractor digits, i.e., the targets form a contiguous and coherent group. In Experiment 1 search performance was better for targets defined by numerical magnitude than parity (i.e., evenness); this result supports our hypothesis but could also be due to the linear separability of targets from distractors or the numerical distance between them. Experiment 2 controlled for target-distractor linear separability and numerical distance, yielding faster search when targets were surrounded by distractors on the number line than when they surrounded distractors. This result is consistent with target contiguity and coherence but also with grouping by similarity of target shapes. Experiment 3 controlled for all three alternative explanations (linear separability, numerical distance, and shape similarity) and search performance was better for contiguous targets than separated targets. In Experiment 4 search performance was better for a coherent target group than one with intervening distractors. Of the possibilities we considered, only the hypothesis based on the contiguity and coherence of the target group on the number line can account for the results from all four experiments.	t	\N
25158373	To evaluate cochlear functioning in patients (18-45 years old) with varying stages of chronic kidney disease (CKD). Using purposive sampling, 50 participants, 10 in each of the 5 stages of CKD, were selected and underwent pure tone audiometric testing and distortion product otoacoustic emissions (DPOAEs). Significant differences (p < 0.05) were found between pure tone audiometry and DPOAEs in detecting early cochlear dysfunction in the high-frequency range in stages 3 (6,000/5,000 Hz; p = 0.00), 4 (6,000/5,000 Hz; p < 0.03) and 5 (4,000/3,333 Hz; p < 0.01, 8,000/6,667 Hz: p < 0.05) with DPOAEs being more sensitive in identifying early cochlear dysfunction. Patients in stages 1 and 2 presented with normal puretone thresholds and DPOAEs, suggesting that cochlear functioning in these patients was normal. Early cochlear dysfunction, thereby indicating a subclinical hearing loss, was identified in stages 3, 4 and 5 by DPOAE testing. In addition, blood test results, drug intake and concomitant conditions were recorded and analysed which suggested a relationship between reduced cochlear functioning and increased electrolyte levels, treatment regimens and concomitant conditions. Participants in the later stages of CKD presented with early cochlear dysfunction, presenting with subclinical hearing loss. It was postulated that this subclinical hearing loss resulted from a combination of electrolytic, urea and creatinine imbalances, together with concomitant medical conditions and ototoxic drug intake. It was concluded that audiological monitoring be included in the management of patients with CKD and that DPOAEs be introduced as part of the test battery to monitor cochlear function in patients with varying degrees of CKD.	t	\N
25170790	Absolute pitch (AP) is the rare ability to identify or produce different pitches without using reference tones. At least two sequential processing stages are assumed to contribute to this phenomenon. The first recruits a pitch memory mechanism at an early stage of auditory processing, whereas the second is driven by a later cognitive mechanism (pitch labeling). Several investigations have used active tasks, but it is unclear how these two mechanisms contribute to AP during passive listening. The present work investigated the temporal dynamics of tone processing in AP and non-AP (NAP) participants by using EEG. We applied a passive oddball paradigm with between- and within-tone category manipulations and analyzed the MMN reflecting the early stage of auditory processing and the P3a response reflecting the later cognitive mechanism during the second processing stage. Results did not reveal between-group differences in MMN waveforms. By contrast, the P3a response was specifically associated with AP and sensitive to the processing of different pitch types. Specifically, AP participants exhibited smaller P3a amplitudes, especially in between-tone category conditions, and P3a responses correlated significantly with the age of commencement of musical training, suggesting an influence of early musical exposure on AP. Our results reinforce the current opinion that the representation of pitches at the processing level of the auditory-related cortex is comparable among AP and NAP participants, whereas the later processing stage is critical for AP. Results are interpreted as reflecting cognitive facilitation in AP participants, possibly driven by the availability of multiple codes for tones.	t	\N
25173195	Speaker's voice occupies a central role as the cornerstone of auditory social interaction. Here, we review the evidence suggesting that speaker's voice constitutes an integral context cue in auditory memory. Investigation into the nature of voice representation as a memory cue is essential to understanding auditory memory and the neural correlates which underlie it. Evidence from behavioral and electrophysiological studies suggest that while specific voice reinstatement (i.e., same speaker) often appears to facilitate word memory even without attention to voice at study, the presence of a partial benefit of similar voices between study and test is less clear. In terms of explicit memory experiments utilizing unfamiliar voices, encoding methods appear to play a pivotal role. Voice congruency effects have been found when voice is specifically attended at study (i.e., when relatively shallow, perceptual encoding takes place). These behavioral findings coincide with neural indices of memory performance such as the parietal old/new recollection effect and the late right frontal effect. The former distinguishes between correctly identified old words and correctly identified new words, and reflects voice congruency only when voice is attended at study. Characterization of the latter likely depends upon voice memory, rather than word memory. There is also evidence to suggest that voice effects can be found in implicit memory paradigms. However, the presence of voice effects appears to depend greatly on the task employed. Using a word identification task, perceptual similarity between study and test conditions is, like for explicit memory tests, crucial. In addition, the type of noise employed appears to have a differential effect. While voice effects have been observed when white noise is used at both study and test, using multi-talker babble does not confer the same results. In terms of neuroimaging research modulations, characterization of an implicit memory effect reflective of voice congruency is currently lacking.	t	\N
25175541	Development of a sense of self is a fundamental process needed for human social interaction. Although functional neuroimaging studies have revealed the importance of medial prefrontal cortex (mPFC) in self-referencing, how this function develops in infancy remains poorly understood. To determine the cerebral basis underlying processing of self-related stimuli, we used behavioral measures and functional multi-channel near-infrared spectroscopy (fNIRS) to measure prefrontal cortical responses in 6-month-old infants hearing their own names. We also investigated the influence of a mother's voice on name perception in infants - an ability that plays a crucial role in the recognition of social signals. Experiment 1 measured the behavioral preferences of infants for their own names and for other names, spoken either by their mothers or by strangers. Results showed that infants significantly preferred their own name to other names, regardless of speaker type. Experiment 2 examined hemodynamic responses to the same four conditions in the prefrontal cortex. Compared with other names, hearing their own names, especially when spoken by their mother, elicited greater activity in the infant's dorsal mPFC. Furthermore, the magnitude of the cerebral response correlated with the degree of behavioral preference only when involving their mother's voice. These findings suggest that, particularly in the context of their mothers' voice, the dorsal mPFC of infants is already sensitive to social signals related to self at 6 months. At the same time, familiarity and affection related processing are also discussed as possible factors modulating dorsal mPFC activation at this age.	t	\N
25178986	Working memory (WM) is a latent cognitive structure that serves to store and manipulate a limited amount of information over a short time period. How information is maintained in WM remains a debated issue: it is unclear whether stimuli from different sensory domains are maintained under distinct mechanisms or maintained under the same mechanism. Previous neuroimaging research on this issue to date has focused on individual brain regions and has not provided a comprehensive view of the functional networks underlying multi-domain WM. To study the functional networks involved in visual and auditory WM, we applied constrained principal component analysis (CPCA) to a functional magnetic resonance imaging (fMRI) dataset acquired when participants performed a change-detection task requiring them to remember only visual, only auditory, or both visual and auditory stimuli. Analysis revealed evidence of both [1] domain-specific networks responsive to either visual or auditory WM (but not both), and [2] domain-general networks responsive to both visual and auditory WM. The domain-specific networks showed load-dependent activations during only encoding, whereas a domain-general network was sensitive to WM load across encoding, maintenance, and retrieval. The latter domain-general network likely reflected attentional processes involved in WM encoding, retrieval, and possibly maintenance as well. These results do not support the domain-specific account of WM maintenance but instead favor the domain-general theory that items from different sensory domains are maintained under the same mechanism.	t	\N
25178987	Current views suggest that autism spectrum disorders (ASDs) are characterised by enhanced low-level auditory discrimination abilities. Little is known, however, about whether enhanced abilities are universal in ASD and how they relate to symptomatology. We tested auditory discrimination for intensity, frequency and duration in 21 adults with ASD and 21 IQ and age-matched controls. Contrary to predictions, there were significant deficits in ASD on all acoustic parameters. The findings suggest that low-level auditory discrimination ability varies widely within ASD and this variability relates to IQ level, and influences the severity of restricted and repetitive behaviours (RRBs). We suggest that it is essential to further our understanding of the potential contributing role of sensory perception ability on the emergence of RRBs.	t	\N
25186122	Sensory over-responsivity (SOR) refers to an exaggerated, intense, or prolonged behavioral response to ordinary sensory stimuli. The relationship of SOR to psychiatric disorders remains poorly understood. The current study examined the SOR construct within typically developing children with clinically significant anxiety, including the prevalence and course of SOR symptoms and relationship between SOR symptoms, demographic factors, and psychopathology. Children presenting at an anxiety specialty clinic (n = 88) completed a psychiatric diagnostic assessment, which included parent-report measures of SOR, anxiety, obsessive-compulsive disorder (OCD), and global behavior and child-report measures of anxiety, depression, and OCD. Sensory over-responsivity symptoms were very common: 93.2% were bothered by at least 1 tactile or auditory sensation, and the mean number of bothersome sensations was 9.2 (SD = 7.4). SOR symptoms were reported to be "moderately bothersome" on average and to onset at an early age. Sensory Over-Responsivity Inventory (SensOR) scores did not differ by psychiatric disorder diagnosis, but SensOR scores significantly correlated with measures of OCD and depression. Higher SensOR scores were associated with greater global impairment. A high rate of SOR symptom occurrence was observed in this sample of children seeking anxiety treatment, suggesting that SOR may not be entirely independent of anxiety and may be closely associated with OCD. Future research on the validity and nosology of SOR using psychiatric samples is warranted.	t	\N
25190408	This study examines how visual speech information affects native judgments of the intelligibility of speech sounds produced by non-native (L2) speakers. Native Canadian English perceivers as judges perceived three English phonemic contrasts (/b-v, θ-s, l-ɹ/) produced by native Japanese speakers as well as native Canadian English speakers as controls. These stimuli were presented under audio-visual (AV, with speaker voice and face), audio-only (AO), and visual-only (VO) conditions. The results showed that, across conditions, the overall intelligibility of Japanese productions of the native (Japanese)-like phonemes (/b, s, l/) was significantly higher than the non-Japanese phonemes (/v, θ, ɹ/). In terms of visual effects, the more visually salient non-Japanese phonemes /v, θ/ were perceived as significantly more intelligible when presented in the AV compared to the AO condition, indicating enhanced intelligibility when visual speech information is available. However, the non-Japanese phoneme /ɹ/ was perceived as less intelligible in the AV compared to the AO condition. Further analysis revealed that, unlike the native English productions, the Japanese speakers produced /ɹ/ without visible lip-rounding, indicating that non-native speakers' incorrect articulatory configurations may decrease the degree of intelligibility. These results suggest that visual speech information may either positively or negatively affect L2 speech intelligibility.	t	\N
25190410	Hearing-aid noise reduction should reduce background noise, but not disturb the target speech. This objective is difficult because noise reduction suffers from a trade-off between the amount of noise removed and signal distortion. It is unknown if this important trade-off differs between normal-hearing (NH) and hearing-impaired (HI) listeners. This study separated the negative effect of noise reduction (distortion) from the positive effect (reduction of noise) to allow the measurement of the detection threshold for noise-reduction (NR) distortion. Twelve NH subjects and 12 subjects with mild to moderate sensorineural hearing loss participated in this study. The detection thresholds for distortion were determined using an adaptive procedure with a three-interval, two-alternative forced-choice paradigm. Different levels of distortion were obtained by changing the maximum amount of noise reduction. Participants were also asked to indicate their preferred NR strength. The detection threshold for overall distortion was higher for HI subjects than for NH subjects, suggesting that stronger noise reduction can be applied for HI listeners without affecting the perceived sound quality. However, the preferred NR strength of HI listeners was closer to their individual detection threshold for distortion than in NH listeners. This implies that HI listeners tolerate fewer audible distortions than NH listeners.	t	\N
25190427	The purpose of this study was to evaluate the feasibility and efficiency of an observer-based, two-interval forced-choice infant psychophysical testing procedure. Ten of 11 infants (7-9 months of age) achieved a criterion of 80%-correct detection of a 50-dB sound pressure level noise band in a single testing session. Fewer trials were needed to reach criterion using the two-interval procedure than previously reported for the single-interval observer-based psychophysical procedure [Olsho, Koch, Halpin, and Carter (1987). Devel. Psychol. 23, 627-640]. These results provide preliminary evidence that the two-interval procedure is feasible and efficient while controlling for observer and listener response bias.	t	\N
25191774	This study aims to establish whether mothers of children with specific language impairments (SLI) have reduced emotional competence and whether individual dimensions of maternal emotional competence are related to emotional and behavioral problems in children. The clinical sample comprised 97 preschool children (23 girls) with SLI from, while the peer sample comprised 60 (34 girls) developmentally normal preschool children. The emotional competence of mothers was evaluated using the Emotional Competence Questionnaire (ESCQ-45). Emotional and behavioral difficulties in children were assessed by mothers, speech therapists, and teachers, using Achenbach's CBCL and CTRF scales. A lower emotional competence was found in mothers of children with specific language impairments. Mothers in clinical and peer samples differed in their ability to express emotions, while there was no statistically significant difference in their ability to recognize and manage emotions. Poor emotional regulation in mothers was linked to increased emotional reactivity, anxiety, and depressive manifestations in children with SLI, as well as to their speech comprehension. Emotional expression in mothers seems to be important for psychopathology in children with SLI, and their expressive and receptive speech. Our findings suggest that, in addition to the rehabilitation of children with SLI, clinical practice should implement preventative work with parents to enhance their emotional competence.	t	\N
25196948	Research on how lexical tone is neuroanatomically represented in the human brain is central to our understanding of cortical regions subserving language. Past studies have exclusively focused on tone perception of the spoken language, and little is known as to the lexical tone processing in reading visual words and its associated brain mechanisms. In this study, we performed two experiments to identify neural substrates in Chinese tone reading. First, we used a tone judgment paradigm to investigate tone processing of visually presented Chinese characters. We found that, relative to baseline, tone perception of printed Chinese characters were mediated by strong brain activation in bilateral frontal regions, left inferior parietal lobule, left posterior middle/medial temporal gyrus, left inferior temporal region, bilateral visual systems, and cerebellum. Surprisingly, no activation was found in superior temporal regions, brain sites well known for speech tone processing. In activation likelihood estimation (ALE) meta-analysis to combine results of relevant published studies, we attempted to elucidate whether the left temporal cortex activities identified in Experiment one is consistent with those found in previous studies of auditory lexical tone perception. ALE results showed that only the left superior temporal gyrus and putamen were critical in auditory lexical tone processing. These findings suggest that activation in the superior temporal cortex associated with lexical tone perception is modality-dependent.	t	\N
25198514	A growing body of evidence suggests that conscious visual awareness is not a prerequisite for human fear learning. For instance, humans can learn to be fearful of subliminal fear relevant images--images depicting stimuli thought to have been fear relevant in our evolutionary context, such as snakes, spiders, and angry human faces. Such stimuli could have a privileged status in relation to manipulations used to suppress usually salient images from awareness, possibly due to the existence of a designated sub-cortical 'fear module'. Here we assess this proposition, and find it wanting. We use binocular masking to suppress awareness of images of snakes and wallabies (particularly cute, non-threatening marsupials). We find that subliminal presentations of both classes of image can induce differential fear conditioning. These data show that learning, as indexed by fear conditioning, is neither contingent on conscious visual awareness nor on subliminal conditional stimuli being fear relevant.	t	\N
25200176	Following prolonged exposure to asynchronous multisensory signals, the brain adapts to reduce the perceived asynchrony. Here, in three separate experiments, participants performed a synchrony judgment task on audiovisual, audiotactile or visuotactile stimuli and we used inter-trial analyses to examine whether temporal recalibration occurs rapidly on the basis of a single asynchronous trial. Even though all combinations used the same subjects, task and design, temporal recalibration occurred for audiovisual stimuli (i.e., the point of subjective simultaneity depended on the preceding trial's modality order), but none occurred when the same auditory or visual event was combined with a tactile event. Contrary to findings from prolonged adaptation studies showing recalibration for all three combinations, we show that rapid, inter-trial recalibration is unique to audiovisual stimuli. We conclude that recalibration occurs at two different timescales for audiovisual stimuli (fast and slow), but only on a slow timescale for audiotactile and visuotactile stimuli.	t	\N
25211190	Children with a history of amblyopia, even if resolved, exhibit impaired visual-auditory integration and perceive speech differently. To determine whether a history of amblyopia is associated with abnormal visual-auditory speech integration. Retrospective observational study at an academic pediatric ophthalmologic clinic with an average of 4 years of follow-up. Participants were at least 3 years of age and without any history of neurologic or hearing disorders. Of 39 children originally in our study, 6 refused to participate. The remaining 33 participants completed the study. Twenty-four participants (mean [SD] age, 7.0 [1.5] years) had a history of amblyopia in 1 eye, with a visual acuity of at least 20/20 in the nonamblyopic eye. Nine controls (mean [SD] age, 8.0 [3.4] years) were recruited from referrals for visually insignificant etiologies or through preschool-screening eye examinations; all had 20/20 in both eyes. Participants were presented with a video demonstrating the McGurk effect (ie, a stimulus presenting an audio track playing the sound /pa/ and a separate video track of a person articulating /ka/). Normal visual-auditory integration produces the perception of hearing a fusion sound /ta/. Participants were asked to report which sound was perceived, /ka/, /pa/, or /ta/. Prevalence of perception of the fusion /ta/ sound. Prior to the study, amblyopic children were hypothesized to less frequently perceive /ta/. The McGurk effect was perceived by 11 of the 24 participants with amblyopia (45.8%) and all 9 controls (100%) (adjusted odds ratio, 22.3 [95% CI, 1.2-426.0]; P = .005). The McGurk effect was perceived by 100% of participants with amblyopia that was resolved by 5 years of age and by 100% of participants whose onset at amblyopia developed at or after 5 years of age. However, only 18.8% of participants with amblyopia that was unresolved by 5 years of age (n = 16) perceived the McGurk effect (adjusted odds ratio, 27.0 [95% CI, 1.1-654.0]; P = .02). This pilot study suggests that children with a history of amblyopia have impaired visual-auditory speech perception. Early childhood appears to serve as an approximate time point for the development of successful visual-auditory fusion, by which time amblyopia must have either resolved or begun. Interventions to resolve amblyopia may not only influence visual acuity but may also influence the perception of sound.	t	\N
25217343	Attentional control theory suggests that heightened anxiety, whether due to trait or state factors, causes an increased vulnerability to distraction even when the distracters are emotionally neutral. Recent passive oddball studies appear to support this theory in relation to the distraction caused by emotionally neutral sounds. However such studies have manipulated emotional state via the content of task stimuli, thus potentially confounding changes in emotion with differences in task demands. To identify the effect of anxiety on the distraction caused by emotionally neutral sounds, 50 participants completed a passive oddball task requiring emotionally neutral sounds to be ignored. Crucially, state anxiety was manipulated independent of the task stimuli (via unrelated audiovisual stimuli) thus removing confounds relating to task demands. Neither state or trait anxiety was found to influence the susceptibility to distraction by emotionally neutral sounds. These findings contribute to the ongoing debate concerning the impact of emotion on attention.	t	\N
25219227	To enhance speech recognition, as well as Mandarin tone recognition in noice, we proposed a speech coding strategy called zero-crossing of fine structure in low frequency (LFFS) for cochlear implant based on low frequency non-uniform sampling (LFFS for short). In the range of frequency perceived boundary of human ear, we used zero-crossing time of the fine structure to generate the stimulus pulse sequences based on the frequency selection rule. Acoustic simulation results showed that although on quiet background the performance of LFFS was similar to continuous interleaved sampling (CIS), on the noise background the performance of LFFS in Chinese tones, words and sentences were significantly better than CIS. In addition to this, we also got better Mandarin recognition factors distribution by using the improved index distribution model. LFFS contains more tonal information which was able to effectively improve Mandarin recognition of the cochlear implant.	t	\N
25231617	Whenever the visual stream is abruptly disturbed by eye movements, blinks, masks, or flashes of light, the visual system needs to retrieve the new locations of current targets and to reconstruct the timing of events to straddle the interruption. This process may introduce position and timing errors. We here report that very similar errors are seen in human subjects across three different paradigms when disturbances are caused by either eye movements, as is well known, or, as we now show, masking. We suggest that the characteristic effects of eye movements on position and time, spatial and temporal compression and saccadic suppression of displacement, are consequences of the interruption and the subsequent reconnection and are seen also when visual input is masked without any eye movements. Our data show that compression and suppression effects are not solely a product of ocular motor activity but instead can be properties of a correspondence process that links the targets of interest across interruptions in visual input, no matter what their source.	t	\N
25234404	In this study, we investigated the role of interactive auditory feedback in modulating the inadvertent forward drift experienced while attempting to walk in place with closed eyes following a few minutes of treadmill walking. Simulations of footstep sounds upon surface materials such as concrete and snow were provided by means of a system composed of headphones and shoes augmented with sensors. In a control condition, participants could hear their actual footstep sounds. Results showed an overall enhancement of the forward drift after treadmill walking independent of the sound perceived, while the strength of the aftereffect, measured as the proportional increase (posttest/pretest) in forward drift, was higher under the influence of snow compared to both concrete and actual sound. In addition, a higher knee angle flexion was found during the snow sound condition both before and after treadmill walking. Behavioral results confirmed those of a perceptual questionnaire, which showed that the snow sound was effective in producing strong pseudo-haptic illusions. Our results provide evidence that the walking in place aftereffect results from a recalibration of haptic, visuo-motor but also sound-motor control systems. Self-motion perception is multimodal.	t	\N
25234884	The medial olivocochlear reflex (MOCR) modulates cochlear amplifier gain and is thought to facilitate the detection of signals in noise. High-resolution distortion product otoacoustic emissions (DPOAEs) were recorded in teens, young, middle-aged, and elderly adults at moderate levels using primary tones swept from 0.5 to 4 kHz with and without a contralateral acoustic stimulus (CAS) to elicit medial efferent activation. Aging effects on magnitude and phase of the 2f1-f2 DPOAE and on its components were examined, as was the link between speech-in-noise performance and MOCR strength. Results revealed a mild aging effect on the MOCR through middle age for frequencies below 1.5 kHz. Additionally, positive correlations were observed between strength of the MOCR and performance on select measures of speech perception parsed into features. The elderly group showed unexpected results including relatively large effects of CAS on DPOAE, and CAS-induced increases in DPOAE fine structure as well as increases in the amplitude and phase accumulation of DPOAE reflection components. Contamination of MOCR estimates by middle ear muscle contractions cannot be ruled out in the oldest subjects. The findings reiterate that DPOAE components should be unmixed when measuring medial efferent effects to better consider and understand these potential confounds.	t	\N
25234912	With a cochlear implant, when stimulation from multiple channels is interleaved, the perceived loudness is greater than the loudness associated with any of the individual channels presented in isolation. This phenomenon is known as loudness summation. This study examined if loudness summation with monopolar and tripolar stimulation were equivalent at two loudnesses and two spacing configurations. Results suggest that loudness summation is similar for monopolar and tripolar modes. However, larger summation differences were observed for softer sounds and louder sounds with a larger spatial separation. The results are consistent with the idea that loudness summation is dependent on channel interaction and have implications for implementing current-focused processing strategies.	t	\N
25238461	Language comprehension is more than a process of decoding the literal meaning of a speaker's utterance. Instead, by making the assumption that speakers choose their words to be informative in context, listeners routinely make pragmatic inferences that go beyond the linguistic data. If language learners make these same assumptions, they should be able to infer word meanings in otherwise ambiguous situations. We use probabilistic tools to formalize these kinds of informativeness inferences-extending a model of pragmatic language comprehension to the acquisition setting-and present four experiments whose data suggest that preschool children can use informativeness to infer word meanings and that adult judgments track quantitatively with informativeness.	t	\N
25243992	Less proficient basic auditory processing has been previously connected to dyslexia. However, it is unclear whether a low proficiency level is a correlate of having a familial risk for reading problems, or whether it causes dyslexia. In this study, children's processing of amplitude rise time (ART), intensity and frequency differences was measured with event-related potentials (ERPs). ERP components of interest are components reflective of auditory change detection; the mismatch negativity (MMN) and late discriminative negativity (LDN). All groups had an MMN to changes in ART and frequency, but not to intensity. Our results indicate that fluent readers at risk for dyslexia, poor readers at risk for dyslexia and fluent reading controls have an LDN to changes in ART and frequency, though the scalp activation of frequency processing was different for familial risk children. On intensity, only controls showed an LDN. Contrary to previous findings, our results suggest that neither ART nor frequency processing is related to reading fluency. Furthermore, our results imply that diminished sensitivity to changes in intensity and differential lateralization of frequency processing should be regarded as correlates of being at familial risk for dyslexia, that do not directly relate to reading fluency.	t	\N
25246562	Temporal cues are important for discerning word boundaries and syllable segments in speech; their perception facilitates language acquisition and development. Beat synchronization and neural encoding of speech reflect precision in processing temporal cues and have been linked to reading skills. In poor readers, diminished neural precision may contribute to rhythmic and phonological deficits. Here we establish links between beat synchronization and speech processing in children who have not yet begun to read: preschoolers who can entrain to an external beat have more faithful neural encoding of temporal modulations in speech and score higher on tests of early language skills. In summary, we propose precise neural encoding of temporal modulations as a key mechanism underlying reading acquisition. Because beat synchronization abilities emerge at an early age, these findings may inform strategies for early detection of and intervention for language-based learning disabilities.	t	\N
25248621	We used a haptic enumeration task to investigate whether enumeration can be facilitated by perceptual grouping in the haptic modality. Eight participants were asked to count tangible dots as quickly and accurately as possible, while moving their finger pad over a tactile display. In Experiment 1, we manipulated the number and organization of the dots, while keeping the total exploration area constant. The dots were either evenly distributed on a horizontal line (baseline condition) or organized into groups based on either proximity (dots placed in closer proximity to each other) or configural cues (dots placed in a geometric configuration). In Experiment 2, we varied the distance between the subsets of dots. We hypothesized that when subsets of dots can be grouped together, the enumeration time will be shorter and accuracy will be higher than in the baseline condition. The results of both experiments showed faster enumeration for the configural condition than for the baseline condition, indicating that configural grouping also facilitates haptic enumeration. In Experiment 2, faster enumeration was also observed for the proximity condition than for the baseline condition. Thus, perceptual grouping speeds up haptic enumeration by both configural and proximity cues, suggesting that similar mechanisms underlie perceptual grouping in both visual and haptic enumeration.	t	\N
25251488	A continuing debate in language acquisition research is whether there are critical periods (CPs) in development during which the system is most responsive to environmental input. Recent advances in neurobiology provide a mechanistic explanation of CPs, with the balance between excitatory and inhibitory processes establishing the onset and molecular brakes establishing the offset of windows of plasticity. In this article, we review the literature on human speech perception development within the context of this CP model, highlighting research that reveals the interplay of maturational and experiential influences at key junctures in development and presenting paradigmatic examples testing CP models in human subjects. We conclude with a discussion of how a mechanistic understanding of CP processes changes the nature of the debate: The question no longer is, "Are there CPs?" but rather what processes open them, keep them open, close them, and allow them to be reopened.	t	\N
25255646	The aim of this study was to investigate distinctive change in the hearing impaired elderly listeners, especially about speech recognition. Subjects were 525 patients (235 males, 290 females), from 60 to 98 years of age who had visited the Hearing Aid Clinic of Otorhinolaryngology, National Center for Geriatrics and Gerontology Hospital, between June 2001 and December 2012. Pure-tone air conduction threshold determination was administered to each subject. The speech audiometry materials used to define speech discrimination ability were Japanese monosyllabic word lists, 67S word lists. Sex- and age-specific trends in maximum discrimination score (MDS) and rollover index (RI) were shown in this study. RI was computed by the formula (PBmax-PBmin)/PBmax. PBmax has almost the same meaning as MDS in Japan. PBmin represented the lowest PB score above the test intensity level of PBmax. Mean MDSs were 80.8% in their sixties, 75.3% in their seventies, 60.7% in their eighties, and 45.5% in their nineties. The rate of decrease in mean MDS per decade accelerated in the older generation. Mean RIs were 0.18 in patients in their sixties, 0.24 in their seventies, 0.30 in their eighties, and 0.30 in their nineties. It increased until the age of eighties. In the model 1, multiple logistic analyses were performed to examine the effect of age (in 10-year increments), sex and mean hearing levels in pure tone average of values at 500, 1000, 2000, 4000 Hz (in 10 dB increments) on the MDS < 60%. Significant associations were observed between MDS < 60% and age (odds ratio, 3.03; 95% confidence interval, 2.38 to 3.85), and mean hearing levels in pure tone average of values at 500, 1000, 2000, 4000 Hz (2.33; 2.03 to 2.68). Sex was not associated with MDS < 60%. In the model 2, multiple logistic analyses adjusted for age and sex were performed to examine the effect of hearing level at test frequencies of 125, 250, 500, 1000, 2000, 4000, and 8000 Hz (in 10 dB increments)on the MDS < 60%. Significant associations were observed between MDS < 60% and hearing level at 250 Hz (2.04; 1.44 to 2.89), and 2000 Hz (1.46; 1.16 to 1.83), and 4000 Hz(1.24; 1.02 to 1.50). These outcomes suggest that the important point in qualifying the fitting of hearing aid is to present the list words at an adequate sound pressure level for evaluation of speech understanding especially in the hearing impaired elderly listeners.	t	\N
25276437	The present study investigated 24 individuals suffering from chronic tinnitus (TI) and 24 nonaffected controls (CO). We recorded resting-state EEG and collected psychometric data to obtain information about how chronic tinnitus experience affects the cognitive and emotional state of TI. The study was meant to disentangle TI with high distress from those who suffer less from persistent tinnitus based on both neurophysiological and behavioral data. A principal component analysis of psychometric data uncovers two distinct independent dimensions characterizing the individual tinnitus experience. These independent states are distress and presence, the latter is described as the perceived intensity of sound experience that increases with tinnitus duration devoid of any considerable emotional burden. Neuroplastic changes correlate with the two independent components. TI with high distress display increased EEG activity in the oscillatory range around 25 Hz (upper β-band) that agglomerates over frontal recording sites. TI with high presence show enhanced EEG signal strength in the δ-, α-, and lower γ-bands (30-40 Hz) over bilateral temporal and left perisylvian electrodes. Based on these differential patterns we suggest that the two dimensions, namely, distress and presence, should be considered as independent dimensions of chronic subjective tinnitus.	t	\N
25299833	The present study aimed to determine the extent of hearing preservation retrospectively after atraumatic cochlear implant (CI) surgery using a specialized surgical technique and specially designed flexible electrode to minimize cochlear trauma. Retrospective study. Academic tertiary care center. A consecutive series of 34 patients who had some preoperative residual hearing were included in this study. Patients underwent CI surgery with a flexible 28-mm electrode using a round window insertion technique. All patients had at least 6 months of postoperative follow-up including audiometric testing and speech perception determined using the Freiburg monosyllable word test and the Oldenburger Sentence Test in noise. Audiometric testing served as a proxy for the evaluation of cochlear trauma and hearing preservation. Hearing was preserved to within 20 dB of preoperative low-frequency pure-tone audiometry (PTA) in 40.7% of patients. Hearing was preserved to within 20 dB of preoperative high-frequency PTA in 35.7% of patients. Overall, a deterioration in hearing thresholds was observed between preoperative assessment and first fitting. Speech perception improved significantly over time after surgery. Using appropriate surgical techniques, and electrodes specially designed to minimize cochlear trauma, hearing preservation can be achieved.	t	\N
25301014	To isolate the neural mechanisms associated with recognizing objects from those processing basic visual properties, control stimuli are required that contain the same perceptual properties as the objects but are unrecognizable. We demonstrate that conventional methods for generating control stimuli (phase scrambling, box scrambling, texture scrambling) yield poor controls because they dramatically distort the basic visual properties (e.g., spatial frequency, perceptual organization) to which even the earliest stages of visual processing are sensitive. We developed a new scrambling method, using a diffeomorphic transformation that preserves the basic perceptual properties of the image while removing meaning. We acquired perceptual ratings to determine the least amount of scrambling necessary to remove recognition. We hypothesized that our "diffeomorphic" images would produce neural activity at the earliest stages of the visual system that more closely matched activity in response to intact images relative to the other scrambling methods. To test this hypothesis, we used the HMAX computational model of object recognition and compared the simulated neural activity at the earliest stages of the visual system (layers S1, C1, and S2) between a set of 149 images scrambled using each distortion method to their intact version. We found that scrambled "diffeomorphed" images were indistinguishable to intact images in each layer of the model, but all of the other distortion methods yielded quite different patterns. Our results indicate that "diffeomorphed" images serve as more appropriate control stimuli in neuroimaging studies that aim to disentangle the representations of perceptual and semantic object properties.	t	\N
25315376	Under normal conditions, human speech is remarkably robust to degradation by noise and other distortions. However, people with hearing loss, including those with cochlear implants, often experience great difficulty in understanding speech in noisy environments. Recent work with normal-hearing listeners has shown that the amplitude fluctuations inherent in noise contribute strongly to the masking of speech. In contrast, this study shows that speech perception via a cochlear implant is unaffected by the inherent temporal fluctuations of noise. This qualitative difference between acoustic and electric auditory perception does not seem to be due to differences in underlying temporal acuity but can instead be explained by the poorer spectral resolution of cochlear implants, relative to the normally functioning ear, which leads to an effective smoothing of the inherent temporal-envelope fluctuations of noise. The outcome suggests an unexpected trade-off between the detrimental effects of poorer spectral resolution and the beneficial effects of a smoother noise temporal envelope. This trade-off provides an explanation for the long-standing puzzle of why strong correlations between speech understanding and spectral resolution have remained elusive. The results also provide a potential explanation for why cochlear-implant users and hearing-impaired listeners exhibit reduced or absent masking release when large and relatively slow temporal fluctuations are introduced in noise maskers. The multitone maskers used here may provide an effective new diagnostic tool for assessing functional hearing loss and reduced spectral resolution.	t	\N
25322936	Social learning in infancy is known to be facilitated by multimodal (e.g., visual, tactile, and verbal) cues provided by caregivers. In parallel with infants' development, recent research has revealed that maternal neural activity is altered through interaction with infants, for instance, to be sensitive to infant-directed speech (IDS). The present study investigated the effect of mother- infant multimodal interaction on maternal neural activity. Event-related potentials (ERPs) of mothers were compared to non-mothers during perception of tactile-related words primed by tactile cues. Only mothers showed ERP modulation when tactile cues were incongruent with the subsequent words, and only when the words were delivered with IDS prosody. Furthermore, the frequency of mothers' use of those words was correlated with the magnitude of ERP differentiation between congruent and incongruent stimuli presentations. These results suggest that mother-infant daily interactions enhance multimodal integration of the maternal brain in parenting contexts.	t	\N
25324090	Brief exposure to time-compressed speech yields both learning and generalization. Whether such learning continues over the course of multi-session training and if so whether it is more or less specific than exposure-induced learning is not clear, because the outcomes of intensive practice with time-compressed speech have rarely been reported. The goal here was to determine whether prolonged training on time-compressed speech yields additional learning and generalization beyond that induced by brief exposure. Listeners practiced the semantic verification of time-compressed sentences for one or three training sessions. Identification of trained and untrained tokens was subsequently compared between listeners who trained for one or three sessions, listeners who were briefly exposed to 20 time-compressed sentences and naive listeners. Trained listeners outperformed the other groups of listeners on the trained condition, but only the group that was trained for three sessions outperformed the other groups when tested with untrained tokens. These findings suggest that although learning of distorted speech can occur rapidly, more stable learning and generalization might be achieved with longer, multi-session practice. It is suggested that the findings are consistent with the framework proposed by the Reverse Hierarchy Theory of perceptual learning.	t	\N
25324114	This paper presents an efficient method to compute the numerical solutions of transmission-line (TL) cochlear models, and its application on the model of Verhulst et al. The stability region of the model is extended by adopting a variable step numerical method to solve the system of ordinary differential equations that describes it, and by adopting an adaptive scheme to take in account variations in the system status within each numerical step. The presented method leads to improve simulations numerical accuracy and large computational savings, leading to employ TL models for more extensive simulations than currently possible.	t	\N
25325841	To determine factors related to high levels of speech recognition in patients with the auditory brainstem implant (ABI). Retrospective case review. International multicenter data from hospitals and tertiary referral facilities. Patients with neurofibromatosis type 2 (NF2) and bilateral vestibular schwannomas. ABIs were placed after the removal of vestibular schwannomas. Demographic and surgical data were collected from 26 patients with ABIs who achieved scores of better than 30% correct identification of sentences presented in quiet listening conditions and without lipreading cues. Scores better than 30% speech recognition of standard sentence test materials (HINT or equivalent) in quiet listening conditions were obtained in 26 of the 84 NF2 patients (31%). ABI speech recognition was correlated with surgical position, length of deafness, the number of distinct pitch electrodes, perceptual levels, and ABI stimulation rate, but not correlated with tumor size, tumor stage, the number of electrodes used, or electrophysiological recordings. This paper presents the consensus opinion from a meeting of surgeons to compare outcomes across ABI surgical centers. The consensus opinion was that brainstem trauma is a primary factor in the variability of outcomes in NF2 patients. The significant co-factors in outcomes implied that ABI surgery should be accomplished with great care to minimize physical and venous trauma to the brainstem. It is clear that high levels of speech recognition, including high levels of open-set speech recognition, are possible with the ABI even in patients with NF2 and large tumors.	t	\N
25329080	How is semantic memory influenced by individual differences under conditions of distraction? This question was addressed by observing how participants recalled visual target words--drawn from a single category--while ignoring spoken distractor words that were members of either the same or a different (single) category. Working memory capacity (WMC) was related to disruption only with synchronous, not asynchronous, presentation, and distraction was greater when the words were presented synchronously. Subsequent experiments found greater negative priming of distractors among individuals with higher WMC, but this may be dependent on targets and distractors being comparable category exemplars. With less dominant category members as distractors, target recall was impaired--relative to control--only among individuals with low WMC. The results highlight the role of cognitive control resources in target-distractor selection and the individual-specific cost implications of such cognitive control.	t	\N
25333740	Words are made up of speech sounds. Almost all accounts of child speech development assume that children learn the pronunciation of first language (L1) speech sounds by imitation, most claiming that the child performs some kind of auditory matching to the elements of ambient speech. However, there is evidence to support an alternative account and we investigate the non-imitative child behavior and well-attested caregiver behavior that this account posits using Elija, a computational model of an infant. Through unsupervised active learning, Elija began by discovering motor patterns, which produced sounds. In separate interaction experiments, native speakers of English, French and German then played the role of his caregiver. In their first interactions with Elija, they were allowed to respond to his sounds if they felt this was natural. We analyzed the interactions through phonemic transcriptions of the caregivers' utterances and found that they interpreted his output within the framework of their native languages. Their form of response was almost always a reformulation of Elija's utterance into well-formed sounds of L1. Elija retained those motor patterns to which a caregiver responded and formed associations between his motor pattern and the response it provoked. Thus in a second phase of interaction, he was able to parse input utterances in terms of the caregiver responses he had heard previously, and respond using his associated motor patterns. This capacity enabled the caregivers to teach Elija to pronounce some simple words in their native languages, by his serial imitation of the words' component speech sounds. Overall, our results demonstrate that the natural responses and behaviors of human subjects to infant-like vocalizations can take a computational model from a biologically plausible initial state through to word pronunciation. This provides support for an alternative to current auditory matching hypotheses for how children learn to pronounce.	t	\N
25345762	Sudden sensorineural hearing loss (SSNHL) is strictly related to inner ear vascular injuries and recently to some atherosclerotic risk factors. The pathogenic role of inflammatory molecules in atherosclerosis is well established. However, there is little knowledge about the potential role of inflammatory cytokines and adhesion molecules on SSNHL etiology. The aim of this study was to evaluate the role of proinflammatory genetic polymorphisms of the MCP-1 (CCL2), E-selectin, and interleukin (IL)-6 gene in SSNHL patients. We evaluated the frequency and distribution of selected single nucleotide polymorphisms of the MCP-1 (CCL2), E-selectin, and IL-6 gene in 87 SSNHL patients and 107 healthy controls. Our results did not show significant difference between the compared groups for MCP-1 and E-selectin genes, whereas a significant difference was reported for the IL-6 gene (P < .0001). The main finding of our study is that the 174G/G polymorphism (with a wider distribution of wt/wt genotype in SSNHL patients than in the healthy controls) of the IL-6 gene is significantly associated with the risk of SSNHL, which is consistent with a previous finding on serum levels of IL-6 in SSNHL. It is possible that the variant acts as a triggering agent of different lipidemia-related phenotypes. Both the -174G/G polymorphism and elevated IL-6 levels in SSNHL patients could suggest that IL-6 plays a role in the inner ear involvement by atherosclerotic inflammatory events.	t	\N
25349105	To assess spoken language comprehension in non-speaking children with severe cerebral palsy (CP) and to explore possible associations with motor type and disability. Eighty-seven non-speaking children (44 males, 43 females, mean age 6y 8mo, SD 2y 1mo) with spastic (54%) or dyskinetic (46%) CP (Gross Motor Function Classification System [GMFCS] levels IV [39%] and V [61%]) underwent spoken language comprehension assessment with the computer-based instrument for low motor language testing (C-BiLLT), a new and validated diagnostic instrument. A multiple linear regression model was used to investigate which variables explained the variation in C-BiLLT scores. Associations between spoken language comprehension abilities (expressed in z-score or age-equivalent score) and motor type of CP, GMFCS and Manual Ability Classification System (MACS) levels, gestational age, and epilepsy were analysed with Fisher's exact test. A p-value <0.05 was considered statistically significant. Chronological age, motor type, and GMFCS classification explained 33% (R=0.577, R(2) =0.33) of the variance in spoken language comprehension. Of the children aged younger than 6 years 6 months, 52.4% of the children with dyskinetic CP attained comprehension scores within the average range (z-score ≥-1.6) as opposed to none of the children with spastic CP. Of the children aged older than 6 years 6 months, 32% of the children with dyskinetic CP reached the highest achievable age-equivalent score compared to 4% of the children with spastic CP. No significant difference in disability was found between CP-related variables (MACS levels, gestational age, epilepsy), with the exception of GMFCS which showed a significant difference in children aged younger than 6 years 6 months (p=0.043). Despite communication disabilities in children with severe CP, particularly in dyskinetic CP, spoken language comprehension may show no or only moderate delay. These findings emphasize the importance of introducing alternative and/or augmentative communication devices from early childhood.	t	\N
25350759	We examined neural indices of pre-attentive phonological and attentional auditory discrimination in children with developmental language disorder (DLD, n = 23) and typically developing (n = 16) peers from a geographically isolated Russian-speaking population with an elevated prevalence of DLD. Pre-attentive phonological MMN components were robust and did not differ in two groups. Children with DLD showed attenuated P3 and atypically distributed P2 components in the attentional auditory discrimination task; P2 and P3 amplitudes were linked to working memory capacity, development of complex syntax, and vocabulary. The results corroborate findings of reduced processing capacity in DLD and support a multifactorial view of the disorder.	t	\N
25372405	A major cue to the location of a sound source is the interaural time difference (ITD)-the difference in sound arrival time at the two ears. The neural representation of this auditory cue is unresolved. The classic model of ITD coding, dominant for a half-century, posits that the distribution of best ITDs (the ITD evoking a neuron's maximal response) is unimodal and largely within the range of ITDs permitted by head-size. This is often interpreted as a place code for source location. An alternative model, based on neurophysiology in small mammals, posits a bimodal distribution of best ITDs with exquisite sensitivity to ITDs generated by means of relative firing rates between the distributions. Recently, an optimal-coding model was proposed, unifying the disparate features of these two models under the framework of efficient coding by neural populations. The optimal-coding model predicts that distributions of best ITDs depend on head size and sound frequency: for high frequencies and large heads it resembles the classic model, for low frequencies and small head sizes it resembles the bimodal model. The optimal-coding model makes key, yet unobserved, predictions: for many species, including humans, both forms of neural representation are employed, depending on sound frequency. Furthermore, novel representations are predicted for intermediate frequencies. Here, we examine these predictions in neurophysiological data from five mammalian species: macaque, guinea pig, cat, gerbil and kangaroo rat. We present the first evidence supporting these untested predictions, and demonstrate that different representations appear to be employed at different sound frequencies in the same species.	t	\N
25375171	Gender dysphoria (also known as "transsexualism") is characterized as a discrepancy between anatomical sex and gender identity. Research points towards neurobiological influences. Due to the sexually dimorphic characteristics of the human voice, voice gender perception provides a biologically relevant function, e.g. in the context of mating selection. There is evidence for a better recognition of voices of the opposite sex and a differentiation of the sexes in its underlying functional cerebral correlates, namely the prefrontal and middle temporal areas. This fMRI study investigated the neural correlates of voice gender perception in 32 male-to-female gender dysphoric individuals (MtFs) compared to 20 non-gender dysphoric men and 19 non-gender dysphoric women. Participants indicated the sex of 240 voice stimuli modified in semitone steps in the direction to the other gender. Compared to men and women, MtFs showed differences in a neural network including the medial prefrontal gyrus, the insula, and the precuneus when responding to male vs. female voices. With increased voice morphing men recruited more prefrontal areas compared to women and MtFs, while MtFs revealed a pattern more similar to women. On a behavioral and neuronal level, our results support the feeling of MtFs reporting they cannot identify with their assigned sex.	t	\N
25379450	Autism Spectrum Disorder (ASD) is characterized by impairments in language and social-emotional cognition. Yet, findings of emotion recognition from affective prosody in individuals with ASD are inconsistent. This study investigated emotion recognition and neural processing of affective prosody in high-functioning adults with ASD relative to neurotypical (NT) adults. Individuals with ASD showed mostly typical brain activation of the fronto-temporal and subcortical brain regions in response to affective prosody. Yet, the ASD group showed a trend towards increased activation of the right caudate during processing of affective prosody and rated the emotional intensity lower than NT individuals. This is likely associated with increased attentional task demands in this group, which might contribute to social-emotional impairments.	t	\N
25382428	To verify the receiver in the ear and receiver in the aid adaptations by measuring in situ the speech perception and users' level of satisfaction. The study was approved by the research ethics committee (Process: 027/2011). Twenty subjects older than 18 years with audiological diagnosis of mild and moderate bilateral descending sensorineural hearing loss were evaluated. The subjects were divided into two groups, where G1 (group 1) was fitted with open-fit hearing aids with the built-in receiver unit (receiver in the ear) and G2 (group 2) was fitted with open-fit hearing aids with RITE. A probe microphone measurement was performed to check the gain and output provided by the amplification and for assessment of speech perception with Hearing in Noise Test with and without hearing aids. After a period of six weeks of use without interruption, the subjects returned for follow-up and answered the Satisfaction with Amplification in Daily Life questionnaire, and were again subjected to Hearing in Noise Test. Both groups presented better test results for speech recognition in the presence of noise. Groups 1 and 2 were satisfied with the use of hearing aids and improved speech recognition in silent and noisy situations with hearing aids.	t	\N
25382429	There is still no consensus in the literature as to the best acoustic stimulus for capturing vestibular evoked myogenic potential (VEMP). Low-frequency tone bursts are generally more effective than high-frequency, but recent studies still use clicks. Reproducibility is an important analytical parameter to observe the reliability of responses. To determine the reproducibility of p13 and n23 latency and amplitude of the VEMP for stimuli with different tone-burst frequencies, and to define the best test frequency. Cross-sectional cohort study. VEMP was captured in 156 ears, on the sternocleidomastoid muscle, using 100 tone-burst stimuli at frequencies of 250, 500, 1000, and 2000Hz, and sound intensity of 95dB nHL. Responses were replicated, that is, recorded three times on each side. No significant difference was observed for p13 and n23 latencies of the VEMP, captured at three moments with tone-burst stimuli at 250, 500, and 1000Hz. Only the frequency of 2000Hz showed a difference between captures of this potential (p<0.001). p13 and n23 amplitude analysis was also similar in the test-retest for all frequencies analyzed. p13 and n23 latencies and amplitudes of VEMP for tone-burst stimuli at frequencies of 250, 500, and 1000Hz are reproducible.	t	\N
25424987	The closure positive shift (CPS) event related potential (ERP) is commonly used as a neural measure of phrase boundary perception in speech. The present study investigated whether the CPS was elicited by acoustic cues at phrase boundaries in English. ERPs were recorded when participants listened passively to sentences with either early or late phrase boundaries. The closure positive shift (CPS) ERP was elicited at both early and late phrase boundaries. However, the latency, amplitude, and scalp distribution of these passive CPS ERPs in English sentences differed to active CPS ERPs measured in non-English sentences in previous studies. These results show that acoustic cues at the phrase boundaries in English are sufficient to elicit the CPS, and suggest that different processes might be involved in the generation of the CPS in active and passive conditions.	t	\N
25438583	The present study assessed brain activity changes related to perception of consonant and dissonant chords by musicians and non-musicians. Perception of dissonant chords in non-musicians was accompanied by increase of lower theta activity over right anterior regions, while consonant chords induced greater theta activity over left anterior regions; this pattern of asymmetrical activation was not observed in musicians. ERP analysis revealed that musicians had greater amplitude of early components (P100, N200) than non-musicians irrespective of chord type. The obtained results reflect more efficient musical harmony processing and, possibly, less emotional perception of chords in musicians.	t	\N
25445998	The localization of neuronal generators during an ERP study, using a high-density electroencephalogram (HD-EEG) equipment was made on three Evoked Related Potential (ERP) components, i.e., the Mismatch Negativity (MMN), the P300 and the N400. Furthermore, the ERP characteristics, their field distribution and the area of their maximum field intensity were extracted and compared between young and elderly, as well as between females and males. A two tone oddball experiment was conducted, involving 27 young adults and 18 elderly, healthy and right handed, and HD-EEG data were acquired. These data were then subjected to auditory ERPs extraction and thorough statistical analysis. The derived experimental results revealed significant age-related differences to both the latencies and the amplitudes of the MMN and the P300 and the topographic distribution of the HD-EEG amplitudes. Additionally, a shift in the maximum intensities from frontal to temporal lobe with aging appeared in the case of the P300, whereas no effect was observed for the MMN component. No statistical significant differences (p>0.05) due to age was found in N400 characteristics. Finally, gender-related differences were significant in the response time of the subjects, finding males response faster. The level and the location of the maximum intensity of sources also differed between genders, especially in young subjects. These findings justify the enhanced potential of HD-EEG data to accurately reflect the age and gender dependencies at the three components of simple auditory ERPs and pave the way for the investigation of neurodegenerative pathologies, such as the Alzheimer's disease.	t	\N
25446245	Perceptual grouping is the process of organizing sounds into perceptually meaningful elements. Psychological studies have found that tones presented as a regular frequency or temporal pattern are grouped according to gestalt principles, such as similarity, proximity, and good continuity. Predictive coding theory suggests that this process helps create an internal model for the prediction of sounds in a tone sequence and that an omission-related brain response reflects the violation of this prediction. However, it remains unclear which brain areas are related to this process, especially in paying attention to the stimuli. To clarify this uncertainty, the present study investigated the neural correlates of perceptual grouping effects. Using magnetoencephalography (MEG), we recorded the evoked response fields (ERFs) of amateur musicians and nonmusicians to sound omissions in tone sequences with a regular or random pattern of three different frequencies during an omission detection task. Omissions in the regular sequences were detected faster and evoked greater activity in the left Heschl's gyrus (HG), right postcentral gyrus, and bilateral superior temporal gyrus (STG) than did omissions in the irregular sequences. Additionally, an interaction between musical experience and regularity was found in the left HG/STG. Tone-evoked responses did not show this difference, indicating that the expertise effect did not reflect the superior tone processing acquired by amateur musicians due to musical training. These results suggest that perceptual grouping based on repetition of a pattern of frequencies affects the processing of omissions in tone sequences and induces more activation of the bilateral auditory cortex by violating internal models. The interaction in the left HG/STG may suggest different styles of processing for musicians and nonmusicians, although this difference was not reflected at the behavioral level.	t	\N
25447067	The effects of ear of presentation and competing speech on N400s to spoken words in context were examined in a dichotic sentence priming paradigm. Auditory sentence contexts with a strong or weak semantic bias were presented in isolation to the right or left ear, or with a competing signal presented in the other ear at a SNR of -12 dB. Target words were congruent or incongruent with the sentence meaning. Competing speech attenuated N400s to both congruent and incongruent targets, suggesting that the demand imposed by a competing signal disrupts the engagement of semantic comprehension processes. Bias strength affected N400 amplitudes differentially depending upon ear of presentation: weak contexts presented to the le/RH produced a more negative N400 response to targets than strong contexts, whereas no significant effect of bias strength was observed for sentences presented to the re/LH. The results are consistent with a model of semantic processing in which the RH relies on integrative processing strategies in the interpretation of sentence-level meaning.	t	\N
25448166	In this article, we review the PET neuroimaging literature, which indicates peculiarities of brain networks involved in speech restoration after cochlear implantation. We consider data on implanted patients during stimulation as well as during resting state, which indicates basic long-term reorganisation of brain functional architecture. On the basis of our analysis of neuroimaging literature and considering our own studies, we indicate that auditory recovery in deaf patients after cochlear implantation partly relies on visual cues. The brain develops mechanisms of audio-visual integration as a strategy to achieve high levels of speech recognition. It turns out that this neuroimaging evidence is in line with behavioural findings of better audiovisual integration in these patients. Thus, strong visually and audio-visually based rehabilitation during the first months after cochlear implantation would significantly improve and fasten the functional recovery of speech intelligibility and other auditory functions in these patients. We provide perspectives for further neuroimaging studies in cochlear implanted patients, which would help understand brain organisation to restore auditory cognitive processing in the implanted patients and would potentially suggest novel approaches for their rehabilitation. This article is part of a Special Issue entitled <Lasker Award>.	t	\N
25459663	In his pioneering research on the neural mechanisms of filial imprinting, Gabriel Horn has gone a long way to fulfilling Karl Lashley's dream of finding the 'engram' or memory trace in the brain. Here we review recent research into the engram(s) of song learning in songbirds, particularly zebra finches. When juvenile songbirds learn their songs from a tutor, they form and alter a central representation of the tutor song, known as the 'template'. Secondary auditory regions in the caudal medial pallium are likely to contain the neural substrate for the representation of tutor song, but the roles of the different regions remain to be elucidated. Female zebra finches do not sing, but nevertheless form an auditory memory of their father's song, for which the neural substrate is located in the caudomedial pallium. In males that are learning their songs, there is continual interaction between the secondary auditory regions and sensorimotor regions, similar to the interaction between Broca's and Wernicke's areas in human infants acquiring speech and language.	t	\N
25460240	Visual attention and perceptual grouping both help us from being overloaded by the vast amount of information, and attentional search is delayed when a target overlaps with a snake-like collinear distractor (Jingling & Tseng, 2013). We assessed whether awareness of the collinear distractor is required for this modulation. We first identified that visible long (=9 elements), but not short (=3 elements) collinear distractor slowed observers' detection of an overlapping target. Then we masked part of a long distractor (=9 elements) with continuous flashing color patches (=6 elements) so that the combined dichoptic percept to observers' awareness was a short collinear distractor (=3 elements). We found that the invisible collinear parts, like visible ones, can form a continuous contour to impair search, suggesting that conscious awareness is not a pre-requisite for contour integration and its interaction with selective attention.	t	\N
25461917	Recent studies have demonstrated the positive effects of musical training on the perception of vocally expressed emotion. This study investigated the effects of musical training on event-related potential (ERP) correlates of emotional prosody processing. Fourteen musicians and fourteen control subjects listened to 228 sentences with neutral semantic content, differing in prosody (one third with neutral, one third with happy and one third with angry intonation), with intelligible semantic content (semantic content condition--SCC) and unintelligible semantic content (pure prosody condition--PPC). Reduced P50 amplitude was found in musicians. A difference between SCC and PPC conditions was found in P50 and N100 amplitude in non-musicians only, and in P200 amplitude in musicians only. Furthermore, musicians were more accurate in recognizing angry prosody in PPC sentences. These findings suggest that auditory expertise characterizing extensive musical training may impact different stages of vocal emotional processing.	t	\N
25463460	Gamma-aminobutyric acid (GABA) is the main inhibitory neurotransmitter in the central auditory system. Altered GABAergic neurotransmission has been found in both the inferior colliculus and the auditory cortex in animal models of presbycusis. Edited magnetic resonance spectroscopy (MRS), using the MEGA-PRESS sequence, is the most widely used technique for detecting GABA in the human brain. However, to date there has been a paucity of studies exploring changes to the GABA concentrations in the auditory region of patients with presbycusis. In this study, sixteen patients with presbycusis (5 males/11 females, mean age 63.1 ± 2.6 years) and twenty healthy controls (6 males/14 females, mean age 62.5 ± 2.3 years) underwent audiological and MRS examinations. Pure tone audiometry from 0.125 to 8 kHz and tympanometry were used to assess the hearing abilities of all subjects. The pure tone average (PTA; the average of hearing thresholds at 0.5, 1, 2 and 4 kHz) was calculated. The MEGA-PRESS sequence was used to measure GABA+ concentrations in 4 × 3 × 3 cm(3) volumes centered on the left and right Heschl's gyri. GABA+ concentrations were significantly lower in the presbycusis group compared to the control group (left auditory regions: p = 0.002, right auditory regions: p = 0.008). Significant negative correlations were observed between PTA and GABA+ concentrations in the presbycusis group (r = -0.57, p = 0.02), while a similar trend was found in the control group (r = -0.40, p = 0.08). These results are consistent with a hypothesis of dysfunctional GABAergic neurotransmission in the central auditory system in presbycusis and suggest a potential treatment target for presbycusis.	t	\N
25463556	Previous research has revealed that the inhibition of return (IOR) effect emerges when individuals respond to a target at the same location as their own previous response or the previous response of a co-actor. The latter social IOR effect is thought to occur because the observation of co-actor's response evokes a representation of that action in the observer and that the observation-evoked response code subsequently activates the inhibitory mechanisms underlying IOR. The present study was conducted to determine if knowledge of the co-actor's response alone is sufficient to evoke social IOR. Pairs of participants completed responses to targets that appeared at different button locations. Button contact generated location-contingent auditory stimuli (high and low tones in Experiment 1 and colour words in Experiment 2). In the Full condition, the observer saw the response and heard the auditory stimuli. In the Auditory Only condition, the observer did not see the co-actor's response, but heard the auditory stimuli generated via button contact to indicate response endpoint. It was found that, although significant individual and social IOR effects emerged in the Full conditions, there were no social IOR effects in the Auditory Only conditions. These findings suggest that knowledge of the co-actor's response alone via auditory information is not sufficient to activate the inhibitory processes leading to IOR. The activation of the mechanisms that lead to social IOR seems to be dependent on processing channels that code the spatial characteristics of action.	t	\N
25468188	Schizophrenia and bipolar disorder share common etiological factors and pathophysiological pathways and have overlapping clinical features. Only few studies have directly compared early auditory information processing in the two disorders. The objective of this study was to investigate the M100 and M200 auditory responses in patients with schizophrenia and bipolar disorder and compare them with healthy controls using magnetoencephalography (MEG). Whole-head MEG data were acquired during an auditory oddball paradigm in 24 schizophrenia patients, 26 bipolar I disorder patients, and 31 healthy controls. The strengths and latencies of M100 and M200 in both hemispheres and the dipole source localizations were investigated from the standard stimuli. The M100 and M200 dipolar sources were localized to the left and right posterior portion of the superior temporal gyrus (STG) in all the subjects. An asymmetric pattern of M100 and M200 auditory response with more anterior sources in the right STG was observed in the healthy controls. However, both the schizophrenia and bipolar disorder patients showed a symmetric M100 and M200 source pattern. When compared with the healthy control group, both patient groups showed significantly reduced M100 and M200 source strength in both hemispheres. Our study suggests that early auditory information processing deficits may be similar in schizophrenia and bipolar disorder and may be related to abnormalities of the STG.	t	\N
25470620	The aim was to investigate whether frequency compression (FC) hearing aids provide additional benefit to that conferred by conventional amplification. Participants wore the same hearing aid with FC enabled and disabled for six weeks (ABA design) in each condition. Speech recognition tests (in both quiet and in noise) were administered alongside two questionnaires. Performance was compared across the two signal processing conditions and at different time points. Twelve experienced hearing-aid users (aged 65-84 years old) with moderate-to-severe high-frequency hearing loss participated in the study. FC resulted in statistically significantly higher mean scores in all of the administered speech tests. Improvements over time were limited to high frequency phoneme perception. No effect of FC on self-report outcomes was observed. FC may lead to significant improvements in speech perception outcomes in both quiet and noise for many individuals. No participant was significantly disadvantaged by the use of FC.	t	\N
25480080	Although there is an emerging consensus that both musical and linguistic pitch processing can be problematic for individuals with a developmental disorder termed congenital amusia, the nature of such a pitch-processing deficit, especially that demonstrated in a speech setting, remains unclear. Therefore, this study tested the performance of native Mandarin speakers, both with and without amusia, on discrimination and imitation tasks for Cantonese level tones, aiming to shed light on this issue. Results suggest that the impact of the phonological deficit, coupled with that of the domain-general pitch deficit, could provide a more comprehensive interpretation of Mandarin amusics' speech impairment. Specifically, when there was a high demand for pitch sensitivity, as in fine-grained pitch discriminations, the operation of the pitch-processing deficit played the more predominant role in modulating amusics' speech performance. But when the demand was low, as in discriminating naturally produced Cantonese level tones, the impact of the phonological deficit was more pronounced compared to that of the pitch-processing deficit. However, despite their perceptual deficits, Mandarin amusics' imitation abilities were comparable to controls'. Such selective impairment in tonal perception suggests that the phonological deficit more severely implicates amusics' input pathways.	t	\N
25481793	Auditory categorization is a vital skill for perceiving the acoustic environment. Categorization depends on the discriminability of the sensory input as well as on the ability of the listener to adaptively make use of the relevant features of the sound. Previous studies on categorization have focused either on speech sounds when studying discriminability or on visual stimuli when assessing optimal cue utilization. Here, by contrast, we examined neural sensitivity to stimulus discriminability and optimal cue utilization when categorizing novel, non-speech auditory stimuli not affected by long-term familiarity. In a functional magnetic resonance imaging (fMRI) experiment, listeners categorized sounds from two category distributions, differing along two acoustic dimensions: spectral shape and duration. By introducing spectral degradation after the first half of the experiment, we manipulated both stimulus discriminability and the relative informativeness of acoustic cues. Degradation caused an overall decrease in discriminability based on spectral shape, and therefore enhanced the informativeness of duration. A relative increase in duration-cue utilization was accompanied by increased activity in left parietal cortex. Further, discriminability modulated right planum temporale activity to a higher degree when stimuli were spectrally degraded than when they were not. These findings provide support for separable contributions of parietal and posterior temporal areas to perceptual categorization. The parietal cortex seems to support the selective utilization of informative stimulus cues, while the posterior superior temporal cortex as a primarily auditory brain area supports discriminability particularly under acoustic degradation.	t	\N
25485662	In recent years, a number of models of orthographic coding have been proposed in which the orthographic code consists of a set of units representing bigrams (open-bigram models). Three masked priming experiments were undertaken in an attempt to evaluate this idea: a conventional masked priming experiment, a sandwich priming experiment (Lupker & Davis, 2009) and an experiment involving a masked prime same-different task (Norris & Kinoshita, 2008). Three prime types were used, first-letter superset primes (e.g., wjudge-JUDGE), last-letter superset primes (e.g., judgew-JUDGE) and standard substitution-letter primes (e.g., juwge-JUDGE). In none of the experiments was there any evidence that the superset primes were more effective primes, the prediction made by open-bigram models. In fact, in the second and third experiments, first-letter superset primes were significantly worse primes than the other two prime types. These results provide no evidence for the existence of open-bigram units. They also suggest that prime-target mismatches at the first position produce orthographic codes that are less similar than mismatches at other positions. Implications for models of orthographic coding are discussed.	t	\N
25486807	The spatial specificity of auditory aftereffect was studied after a short-time adaptation (5 s) to the broadband noise (20-20000 Hz). Adapting stimuli were sequences of noise impulses with the constant amplitude, test stimuli--with the constant and changing amplitude: an increase of amplitude of impulses in sequence was perceived by listeners as approach of the sound source, while a decrease of amplitude--as its withdrawal. The experiments were performed in an anechoic chamber. The auditory aftereffect was estimated under the following conditions: the adapting and test stimuli were presented from the loudspeaker located at a distance of 1.1 m from the listeners (the subjectively near spatial domain) or 4.5 m from the listeners (the subjectively near spatial domain) or 4.5 m from the listeners (the subjectively far spatial domain); the adapting and test stimuli were presented from different distances. The obtained data showed that perception of the imitated movement of the sound source in both spatial domains had the common characteristic peculiarities that manifested themselves both under control conditions without adaptation and after adaptation to noise. In the absence of adaptation for both distances, an asymmetry of psychophysical curves was observed: the listeners estimated the test stimuli more often as approaching. The overestimation by listeners of test stimuli as the approaching ones was more pronounced at their presentation from the distance of 1.1 m, i. e., from the subjectively near spatial domain. After adaptation to noise the aftereffects showed spatial specificity in both spatial domains: they were observed only at the spatial coincidence of adapting and test stimuli and were absent at their separation. The aftereffects observed in two spatial domains were similar in direction and value: the listeners estimated the test stimuli more often as withdrawing as compared to control. The result of such aftereffect was restoration of the symmetry of psychometric curves and of the equiprobable estimation of direction of movement of test signals.	t	\N
25487588	A pilot study found the masking level difference (MLD) normal value of Chinese aged from 10 to 40, served to clinical practice. Seventy-three normal persons with hearing threshold < 20 dB and no disorder of otology were divided into three groups: classified 10-20 age as A group, 21-30 age as B group, 31-40 age as C group. MLD of age, gender and race were determined by the audiometer with dual channel by playing audio test disc. Same tests were compared with English language. The mean value of MLD in Chinese was (11.1 ± 3.4) dB and the 95% normal reference values were between 4.4 to 17.8 dB . There was no difference in term of gender. The value of MLD increased with age. In the term of language, foreign showed high value of MLD than that of Chinese. The measuring of MLD depends on age and language but not depends on gender. The normal threshold of MLD depends on age and language should be measured when normal values were applied to assess MLD in clinical practice.	t	\N
25495831	Individuals with autism spectrum disorders (ASD) show enhanced perceptual and memory abilities in the domain of pitch, but also perceptual deficits in other auditory domains. The present study investigated their skills with respect to "echoic memory," a form of short-term sensory memory intimately tied to auditory perception, using a developmental perspective. We tested 23 high-functioning participants with ASD and 26 typically developing (TD) participants, distributed in two age groups (children vs. young adults; mean ages: ∼11 and ∼21 years). By means of an adaptive psychophysical procedure, we measured the longest period for which periodic (i.e., repeated) noise could be reliably discriminated from nonperiodic (i.e., plain random) noise. On each experimental trial, a single noise sample was presented to the participant, who had to classify this sound as periodic or nonperiodic. The TD adults performed, on average, much better than the other three groups, who performed similarly overall. As a function of practice, the measured thresholds improved for the TD participants, but did not change for the ASD participants. Thresholds were not correlated to performance in a test assessing verbal memory. The variance of the participants' response biases was larger among the ASD participants than among the TD participants. The results mainly suggest that echoic memory takes a long time to fully develop in TD humans, and that this development stops prematurely in persons with ASD.	t	\N
25497692	Listeners vary substantially in their ability to recognize speech in noisy environments. Here we examined the role of genetic variation on individual differences in speech recognition in various noise backgrounds. Background noise typically varies in the levels of energetic masking (EM) and informational masking (IM) imposed on target speech. Relative to EM, release from IM is hypothesized to place greater demand on executive function to selectively attend to target speech while ignoring competing noises. Recent evidence suggests that the long allele variant in exon III of the DRD4 gene, primarily expressed in the prefrontal cortex, may be associated with enhanced selective attention to goal-relevant high-priority information even in the face of interference. We investigated the extent to which this polymorphism is associated with speech recognition in IM and EM conditions. In an unscreened adult sample (Experiment 1) and a larger screened replication sample (Experiment 2), we demonstrate that individuals with the DRD4 long variant show better recognition performance in noise conditions involving significant IM, but not in EM conditions. In Experiment 2, we also obtained neuropsychological measures to assess the underlying mechanisms. Mediation analysis revealed that this listening condition-specific advantage was mediated by enhanced executive attention/working memory capacity in individuals with the long allele variant. These findings suggest that DRD4 may contribute specifically to individual differences in speech recognition ability in noise conditions that place demands on executive function.	t	\N
25500178	The challenge in getting a decent signal to the brain for users of cochlear implants (CIs) is described. A breakthrough occurred in 1989 that later enabled most users to understand conversational speech with their restored hearing alone. Subsequent developments included stimulation in addition to that provided with a unilateral CI, either with electrical stimulation on both sides or with acoustic stimulation in combination with a unilateral CI, the latter for persons with residual hearing at low frequencies in either or both ears. Both types of adjunctive stimulation produced further improvements in performance for substantial fractions of patients. Today, the CI and related hearing prostheses are the standard of care for profoundly deaf persons and ever-increasing indications are now allowing persons with less severe losses to benefit from these marvelous technologies. The steps in achieving the present levels of performance are traced, and some possibilities for further improvements are mentioned. This article is part of a Special Issue entitled <Lasker Award>.	t	\N
25502451	A systematic review of the literature to evaluate the clinical outcome of cochlear implantation for patients with single-sided deafness (SSD) or asymmetrical hearing loss (AHL). We searched the PubMed, Embase, Cochrane Library, and CINAHL databases from their inception up to December 10, 2013 for SSD or AHL and cochlear implantation or their synonyms. In total, 781 articles were retrieved, of which 15 satisfied the eligibility criteria. Our outcomes of interest were speech perception in noise, sound localization, quality of life (QoL), and tinnitus. Critical appraisal showed that six studies reported on less than five patients or that they carried a low directness of evidence or a high risk of bias. Therefore, we extracted the data of nine studies (n = 112). Patient numbers, age, duration of deafness, classification of deafness, pure tone audiometry, follow-up duration, and outcome measurements were extracted from all nine articles. Because of large heterogeneity between studies, we were not able to pool data in a meta-analysis. We therefore summarized the results of the studies specified per outcome. There are no high-level-of-evidence studies concerning cochlear implantation in patients with SSD or AHL. Current literature suggests important benefits of cochlear implantation regarding sound localization, QoL, and tinnitus. Varying results were reported for speech perception in noise, possibly caused by the large clinical heterogeneity between studies. Larger and high-quality studies are certainly warranted.	t	\N
25512041	The issue of whether human perception of speech and song recruits integrated or dissociated neural systems is contentious. This issue is difficult to address directly since these stimulus classes differ in their physical attributes. We therefore used a compelling illusion (Deutsch et al. 2011) in which acoustically identical auditory stimuli are perceived as either speech or song. Deutsch's illusion was used in a functional MRI experiment to provide a direct, within-subject investigation of the brain regions involved in the perceptual transformation from speech into song, independent of the physical characteristics of the presented stimuli. An overall differential effect resulting from the perception of song compared with that of speech was revealed in right midposterior superior temporal sulcus/right middle temporal gyrus. A left frontotemporal network, previously implicated in higher-level cognitive analyses of music and speech, was found to co-vary with a behavioural measure of the subjective vividness of the illusion, and this effect was driven by the illusory transformation. These findings provide evidence that illusory song perception is instantiated by a network of brain regions that are predominantly shared with the speech perception network.	t	\N
25536235	A large number of neuroimaging studies have investigated imagined sensory processing and motor behaviours. These studies have reported neural activation patterns for imagined processes that resemble those of real sensory and motor events. The widespread use of such methods has raised questions about the extent to which imagined sensorimotor events mimic their overt counterparts, including their ability to elicit sensorimotor interactions. Direct behavioural evidence of imagery-induced multisensory interactions has been found recently in tasks involving auditory and visual processing. An influence of sensory imagery on the control of motor action, however, has not been investigated previously. Here, we show that both real and imagined moving sounds induce involuntary ocular movement in a nonvisual tracking task. The present data build on the results of previous studies of sensory imagery by showing that such conditions activate sensory neural areas. Moreover, we show an engagement of functional sensorimotor networks for imagined stimuli in a manner similar to the processing of real auditory stimuli.	t	\N
25536844	A central problem in recent research on speech production concerns the question to what extent speakers adapt their linguistic expressions to the needs of their addressees. It is claimed that speakers sometimes leak information about objects that are only visible for them and not for their listeners. Previous research only takes the occurrence of adjectives as evidence for the leakage of privileged information. The present study hypothesizes that leaked information is also encoded in the prosody of those adjectives. A production experiment elicited adjectives that leak information and adjectives that do not leak information. An acoustic analysis and prominence rating task showed that adjectives that leak information were uttered with a higher pitch and perceived as more prominent compared to adjectives that do not leak information. Furthermore, a guessing task suggested that the adjectives' prosody relates to how listeners infer possible privileged information.	t	\N
25545633	Past experience of everyday life activities, which forms the basis of our knowledge about the world, greatly affects how we understand stories. Yet, little is known about how this influence is instantiated in the human brain. Here, we used functional magnetic resonance imaging to investigate how past experience facilitates functional connectivity during the comprehension of stories rich in perceptual and motor details. We found that comprehenders' past experience with the scenes and actions described in the narratives selectively modulated functional connectivity between lower- and higher-level areas within the neural systems for visual and motor processing, respectively. These intramodal interactions may play an important role in integrating personal knowledge about a narrated situation with an evolving discourse representation. This study provides empirical evidence consistent with the idea that regions related to visual and motor processing are involved in the reenactment of experience as proposed by theories of embodied cognition.	t	\N
25549509	The motor system is engaged when we perceive movement in the environment, even when we have no sensorimotor experience of that movement. It has been suggested that this ability relies on internal models that comprise specific exteroceptive representations, such as audition and vision. It has been shown that, for human movements, the quality of perception depends on the closeness between the perceived movement and the perceiver's own capability of reproducing it. Thus, if we are able to reproduce a movement, we also have the interoceptive motor memories that enable us to run internal models and perceive the same movements more accurately when merely observed. In a behavioral study we investigated if participants would be able to distinguish between self-produced and other-produced movement sounds from a previously recorded hurdling performance. We also analyzed if participants' discriminative ability would vary as a function of specific sound features, examining rhythmic step structure and amplitude range. The results reveal that participants were able to distinguish between their own and others' movement sounds. However, changing either rhythmic step structure or amplitude range of the sounds did not influence this self-other discrimination. We suggest that identification of one's own movement sounds is holistically achieved as an auditory gestalt.	t	\N
25567437	To investigate the functional evaluation of auditory cortex by functional magnetic resonance imaging (fMRI) before cochlear implantation (CI). Twenty-one children with severe to profound sensorineural hearing loss (SNHL) (7 cases with normal inner ear structure, 3 cases with large vestibular aqueduct syndrome, 1 case with cochlear nerve hypoplasia) and 7 children with normal hearing were examined by fMRI under sedation. The subjects received acoustic stimuli at four random frequencies (500, 1 000, 2 000, 4 000 Hz) during fMRI examination. Tones were interleaved with silence in a block-periodic fMRI design with 16-second on-off intervals.Each stimulus repeated twice and the total acquisition time was 4 minutes and 28 seconds. Activation was identified in the auditory cortex of the 28 subjects in response to monaural and binaural acoustic stimulation, which was shown in the transverse temporal gyrus (Heschl gyrus), the thalamus and both.Of the 21 patients with SNHL, contralateral dominance was shown in six cases, ipsilateral dominance in five cases and bilateral equilibrium in 10 cases when one ear was exposed to the stimuli. Of the seven children with normal hearing, contralateral dominance was shown in three cases, ipsilateral dominance in two cases and bilateral equilibrium in two cases.No significant difference of the activation intensity in auditory cortex was shown between children with SNHL and normal hearing (P > 0.05). Good hearing recovery was obtained in the 21 patients after CI. No significant difference in the level of hearing and speech rehabilitation was shown between children with LVAS and with normal inner ear structure.Listening behavior response threshold was 55 dBHL in the case with cochlear nerve hypoplasia 10 months after CI. fMRI would be a feasible means of evaluating the function of the auditory cortex, which can be used for assessing the function of the entire auditory system prior to CI combined with ABR and imaging.	t	\N
25575604	Cochlear implant users show a profile of residual, yet poorly understood, musical abilities. An ability that has received little to no attention in this population is entrainment to a musical beat. We show for the first time that a heterogeneous group of cochlear implant users is able to find the beat and move their bodies in time to Latin Merengue music, especially when the music is presented in unpitched drum tones. These findings not only reveal a hidden capacity for feeling musical rhythm through the body in the deaf and hearing impaired population, but illuminate promising avenues for designing early childhood musical training that can engage implanted children in social musical activities with benefits potentially extending to non-musical domains.	t	\N
25596462	Emotional verbal messages are typically encountered in meaningful contexts, for instance, during face-to-face communication in social situations. Yet, they are often investigated by confronting single participants with isolated words on a computer screen, thus potentially lacking ecological validity. In the present study we recorded event-related brain potentials (ERPs) during emotional word processing in communicative situations provided by videos of a speaker, assuming that emotion effects should be augmented by the presence of a speaker addressing the listener. Indeed, compared to non-communicative situations or isolated word processing, emotion effects were more pronounced, started earlier and lasted longer in communicative situations. Furthermore, while the brain responded most strongly to negative words when presented in isolation, a positivity bias with more pronounced emotion effects for positive words was observed in communicative situations. These findings demonstrate that communicative situations--in which verbal emotions are typically encountered--strongly enhance emotion effects, underlining the importance of social and meaningful contexts in processing emotional and verbal messages.	t	\N
25603582	Learning and transfer were investigated for a categorical structure that could be mapped without loss of information from 1 sensory modality to another. The category space was composed of 3 nonoverlapping, linearly separable categories whose members were structured, temporal events. Each stimulus was composed of a sequence of on-off events that varied in duration and number of subevents (complexity). Categories were learned visually, haptically, or auditorily and transferred to the same or another modality. Despite the.isomorphism across modalities, significant differences appeared in both learning and transfer. The visual modality showed an early learning advantage, with information on the transfer test preserved best when encoded visually during learning, worst when encoded haptically, with auditory encoding intermediate. False recognition rates were elevated when categories were learned haptically and transferred to another modality. In general, classification accuracy was highest for the category prototype, with false recognition of the category prototype higher in the cross-modality conditions. The results are discussed in terms of current theories in modality transfer including the difficulties inherent when calculation of similarity must be considered in a cross-modal situation.	t	\N
25611595	Most general theories on serial order working memory (WM) assume the existence of position markers that are bound to the to-be-remembered items to keep track of the serial order. So far, the exact cognitive/neural characteristics of these markers have remained largely underspecified, while direct empirical evidence for their existence is mostly lacking. In the current study we demonstrate that retrieval from verbal serial order WM can be facilitated or hindered by spatial cuing: begin elements of a verbal WM sequence are retrieved faster after cuing the left side of space, while end elements are retrieved faster after cuing the right side of space. In direct complement to our previous work--where we showed the reversed impact of WM retrieval on spatial processing--we argue that the current findings provide us with a crucial piece of evidence suggesting a direct and functional involvement of space in verbal serial order WM. We outline the idea that serial order in verbal WM is coded within a spatial coordinate system with spatial attention being involved when searching through WM, and we discuss how this account can explain several hallmark observations related to serial order WM.	t	\N
25620313	Despite a robust hearing conservation program, military personnel continue to be at high risk for noise induced hearing loss (NIHL). For more than a decade, a number of laboratories have investigated the use of antioxidants as a safe and effective adjunct to hearing conservation programs. Of the antioxidants that have been investigated, N-acetylcysteine (NAC) has consistently reduced permanent NIHL in the laboratory, but its clinical efficacy is still controversial. This study provides a prospective, randomized, double-blinded, placebo-controlled clinical trial investigating the safety profile and the efficacy of NAC to prevent hearing loss in a military population after weapons training. Of the 566 total study subjects, 277 received NAC while 289 were given placebo. The null hypothesis for the rate of STS was not rejected based on the measured results. While no significant differences were found for the primary outcome, rate of threshold shifts, the right ear threshold shift rate difference did approach significance (p = 0.0562). No significant difference was found in the second primary outcome, percentage of subjects experiencing an adverse event between placebo and NAC groups (26.7% and 27.4%, respectively, p = 0.4465). Results for the secondary outcome, STS rate in the trigger hand ear, did show a significant difference (34.98% for placebo-treated, 27.14% for NAC-treated, p-value = 0.0288). Additionally, post-hoc analysis showed significant differences in threshold shift rates when handedness was taken into account. While the secondary outcomes and post-hoc analysis suggest that NAC treatment is superior to the placebo, the present study design failed to confirm this. The lack of significant differences in overall hearing loss between the treatment and placebo groups may be due to a number of factors, including suboptimal dosing, premature post-exposure audiograms, or differences in risk between ears or subjects. Based on secondary outcomes and post hoc analyses however, further studies seem warranted and are needed to clarify dose response and the factors that may have played a role in the observed results.	t	\N
25620314	This study explored the role of formant transitions and F0-contour continuity in binding together speech sounds into a coherent stream. Listening to a repeating recorded word produces verbal transformations to different forms; stream segregation contributes to this effect and so it can be used to measure changes in perceptual coherence. In experiment 1, monosyllables with strong formant transitions between the initial consonant and following vowel were monotonized; each monosyllable was paired with a weak-transitions counterpart. Further stimuli were derived by replacing the consonant-vowel transitions with samples from adjacent steady portions. Each stimulus was concatenated into a 3-min-long sequence. Listeners only reported more forms in the transitions-removed condition for strong-transitions words, for which formant-frequency discontinuities were substantial. In experiment 2, the F0 contour of all-voiced monosyllables was shaped to follow a rising or falling pattern, spanning one octave. Consecutive tokens either had the same contour, giving an abrupt F0 change between each token, or alternated, giving a continuous contour. Discontinuous sequences caused more transformations and forms, and shorter times to the first transformation. Overall, these findings support the notion that continuity cues provided by formant transitions and the F0 contour play an important role in maintaining the perceptual coherence of speech.	t	\N
25631452	Schizophrenia patients with auditory verbal hallucinations (AVH) have reduced structural integrity in the left arcuate fasciculus (AFL) compared to healthy controls. However, it is neither known whether these changes are specific to AVH, as opposed to hallucinations or schizophrenia per se, nor how radial and/or axial diffusivity are altered. This study aimed to test the hypothesis that reductions to the structural integrity of the AFL are specifically associated with AVH in schizophrenia. Diffusion tensor imaging scans and clinical data were obtained from the Australian Schizophrenia Research Bank for 39 schizophrenia patients with lifetime AVH (18 current, 21 remitted), 74 schizophrenia patients with no lifetime AVH (40 with lifetime hallucinations in other modalities, 34 no lifetime hallucinations) and 40 healthy controls. Fractional anisotropy was significantly reduced in the AFL of patients with lifetime AVH compared to both healthy controls (Cohen's d=1.24) and patients without lifetime AVH (d=.72), including compared to the specific subsets of patients without AVH who either had hallucinations in other modalities (d=.69) or no history of any hallucinations (d=.73). Radial, but not axial, diffusivity was significantly increased in patients with lifetime AVH compared to both healthy controls (d=.89) and patients without lifetime AVH (d=.39). Evidence was found for a non-linear relation between fractional anisotropy in the AFL and state AVH. Reduced integrity of the AFL is specifically associated with AVH, as opposed to schizophrenia in general or hallucinations in other modalities. Increased radial diffusivity suggests dysmyelination or demyelination of the AFL may play a role in AVH.	t	\N
25632078	Despite recent progress in our understanding of sensorimotor integration in speech learning, a comprehensive framework to investigate its neural basis is lacking at behaviorally relevant timescales. Structural and functional imaging studies in humans have helped us identify brain networks that support speech but fail to capture the precise spatiotemporal coordination within the networks that takes place during speech learning. Here we use neuronal oscillations to investigate interactions within speech motor networks in a paradigm of speech motor adaptation under altered feedback with continuous recording of EEG in which subjects adapted to the real-time auditory perturbation of a target vowel sound. As subjects adapted to the task, concurrent changes were observed in the theta-gamma phase coherence during speech planning at several distinct scalp regions that is consistent with the establishment of a feedforward map. In particular, there was an increase in coherence over the central region and a decrease over the fronto-temporal regions, revealing a redistribution of coherence over an interacting network of brain regions that could be a general feature of error-based motor learning in general. Our findings have implications for understanding the neural basis of speech motor learning and could elucidate how transient breakdown of neuronal communication within speech networks relates to speech disorders.	t	\N
25644083	French-learning infants have language-specific difficulties in processing lexical stress due to the lack of lexical stress in French. These difficulties in discriminating between words with stress-initial (trochaic) and stress-final (iambic) patterns emerge by 10months of age in the easier context of low variability (using a single item pronounced with a trochaic pattern vs. an iambic pattern) as well as in the more challenging context of high segmental variability (using lists of segmentally different trochaic and iambic items). These findings raise the question of stress pattern perception in simultaneous bilinguals learning French and a second language using stress at the lexical level. Bijeljac-Babic, Serres, Höhle, and Nazzi (2012) established that at 10 months of age, in the simpler context of low variability, such bilinguals have better stress discrimination abilities than French-learning monolinguals. The current study explored whether this advantage extends to the more challenging context of high segmental variability. Results first establish stress pattern discrimination in a group of bilingual 10-month-olds learning French and one language with (variable) lexical stress, but not in French-learning 10-month-old monolinguals. Second, discrimination in bilinguals appeared not to be affected by the language balance of the infants, suggesting that sensitivity to stress patterns might be maintained in these bilingual infants provided that they hear at least 30% of a language with lexical stress.	t	\N
25653380	In the diverse mechanosensory systems that animals evolved, the waveform of stimuli can be encoded by phase locking in spike trains of primary afferents. Coding of the fine structure of sounds via phase locking is thought to be critical for hearing. The upper frequency limit of phase locking varies across species and is unknown in humans. We applied a method developed previously, which is based on neural adaptation evoked by forward masking, to analyze mass potentials recorded on the cochlea and auditory nerve in the cat. The method allows us to separate neural phase locking from receptor potentials. We find that the frequency limit of neural phase locking obtained from mass potentials was very similar to that reported for individual auditory nerve fibers. The results suggest that this is a promising approach to examine neural phase locking in humans with normal or impaired hearing or in other species for which direct recordings from primary afferents are not feasible.	t	\N
25656319	Models of speech production explain event-related suppression of the auditory cortical response as reflecting a comparison between auditory predictions and feedback. The present MEG study was designed to test two predictions from this framework: (1) whether the reduced auditory response varies as a function of the mismatch between prediction and feedback; (2) whether individual variation in this response is predictive of speech-motor adaptation. Participants alternated between online imitation and listening tasks. In the imitation task, participants began each trial producing the same vowel (/e/) and subsequently listened to and imitated auditorily-presented vowels varying in acoustic distance from /e/. Results replicated suppression, with a smaller M100 during speaking than listening. Although we did not find unequivocal support for the first prediction, participants with less M100 suppression were better at the imitation task. These results are consistent with the enhancement of M100 serving as an error signal to drive subsequent speech-motor adaptation.	t	\N
25660196	Under certain conditions, sighted and blind humans can use echoes to discern characteristics of otherwise silent objects. Previous research concluded that robust horizontal-plane object localisation ability, without using head movement, depends on information above 2 kHz. While a strong interaural level difference (ILD) cue is available, it was not clear if listeners were using that or the monaural level cue that necessarily accompanies ILD. In this experiment, 13 sighted and normal-hearing listeners were asked to identify the right-vs.-left position of an object in virtual auditory space. Sounds were manipulated to remove binaural cues (binaural vs. diotic presentation) and prevent the use of monaural level cues (using level roving). With low- (<2 kHz) and high- (>2 kHz) frequency bands of noise, performance with binaural presentation and level rove exceeded that expected from use of monaural level cues and that with diotic presentation. It is argued that a high-frequency binaural cue (most likely ILD), and not a monaural level cue, is crucial for robust object localisation without head movement.	t	\N
25660759	Impairments in neural function are common when oxygen supply to the brain is reduced. This study examined neurocognitive processes that are vulnerable to oxygen deprivation. We induced moderate-to-severe hypoxia in healthy adults, thereby inducing impairments caused by low brain oxygen availability. 22 healthy adults participated in this matched-pairs study with a single-blind, randomised design. Baseline neurocognitive function was examined during a familiarisation trial and participants were assigned to hypoxia (10% O2) or sham (21% O2) groups. Neurocognitive performance was assessed via computerised test battery after 50 min of breathing a gas mixture that reduced arterial oxygen saturation by 20% (p<0.01). Hypoxia severely reduced performance across all neurocognitive domain scores; with significant drops in neurocognitive index (-20%), composite memory (-30%), verbal memory (-34%), visual memory (-23%), processing speed (-36%), executive function (-20%), psychomotor speed (-24%), reaction time (-10%), complex attention (-19%) and cognitive flexibility (-18%; all p<0.05). Practice effects were blocked by hypoxia but occurred in sham for information processing speed (+30%), executive function (+14%), psychomotor speed (+18%), reaction time (+5%), cognitive flexibility (+14%), and overall cognitive functioning (+9%; all p<0.05). Neuropsychological performance decrements caused by acute experimental hypoxia are comparable to cognitive domains impaired with high altitude exposure and mild traumatic brain injury.	t	\N
25676810	Neurologically healthy individuals use sensory feedback to alter future movements by updating internal models of the effector system and environment. For example, when visual feedback about limb movements or auditory feedback about speech movements is experimentally perturbed, the planning of subsequent movements is adjusted - i.e., sensorimotor adaptation occurs. A separate line of studies has demonstrated that experimentally delaying the sensory consequences of limb movements causes the sensory input to be attributed to external sources rather than to one's own actions. Yet similar feedback delays have remarkably little effect on visuo-motor adaptation (although the rate of learning varies, the amount of adaptation is only moderately affected with delays of 100-200ms, and adaptation still occurs even with a delay as long as 5000ms). Thus, limb motor learning remains largely intact even in conditions where error assignment favors external factors. Here, we show a fundamentally different result for sensorimotor control of speech articulation: auditory-motor adaptation to formant-shifted feedback is completely eliminated with delays of 100ms or more. Thus, for speech motor learning, real-time auditory feedback is critical. This novel finding informs theoretical models of human motor control in general and speech motor control in particular, and it has direct implications for the application of motor learning principles in the habilitation and rehabilitation of individuals with various sensorimotor speech disorders.	t	\N
25678274	While there is growing understanding of visual selective attention in children, some aspects such as selection in the presence of distractors are not well understood. Adult studies suggest that when presented with a visual search task, an enhanced negativity is seen beginning around 200 ms (the N2pc) that reflects selection of a target item among distractors. However, it is not known if similar selective attention-related activity is seen in children during visual search. This study was designed to investigate the presence of the N2pc in children. Nineteen children (ages 9-12 years) and 21 adults (ages 18-22 years) completed a visual search task in which they were asked to attend to a fixation surrounded by both a target and a distractor stimulus. Three types of displays were analyzed at parietal electrodes P7 and P8; lateral target/lateral distractor, lateral target/midline distractor, and midline target/lateral distractor. Both adults and children showed a significant increased negativity contralateral compared to ipsilateral to the target (reflected in the N2pc) in both displays with a lateral target while no such effect was seen in displays with a midline target. This suggests that children also utilized additional resources to select a target item when distractors are present. These findings demonstrate that the N2pc can be used as a marker of attentional object selection in children.	t	\N
25681737	Aging usually decreases the ability to understand language under difficult listening conditions. However, aging is also associated with increased between-subject variability. Here, we studied potential sources of inter-individual differences and investigated spoken language understanding of younger and older adults (age ranges 21-35 and 57-74 years, respectively) in a simulated "cocktail-party" scenario. A naturalistic "stock-price monitoring" task was employed in which prices of listed companies were simultaneously recited by four speakers at different locations in space. The participants responded when prices of a target company exceeded specific values, while ignoring all other companies. According to their individual performance levels three subgroups of participants were composed, consisting of 12 high-performing and 12 low-performing older adults, and 12 young adults matching the high-performing older group. The analysis of the event-related brain potentials indicated that all older adults showed delayed attentional control (indicated by a later P2) and reduced speech processing (indicated by a reduced N400), relative to the younger adults. High-performing older adults differed in increased allocation of attention and inhibitory control (indicated by a stronger P2-N2 complex) from their low-performing counterparts. The results are consistent with the idea of an adjustment of mental resources that could help compensating potential deficiencies in peripheral and central auditory processing.	t	\N
25704552	Grapheme-color synesthetes perceive achromatic graphemes to be inherently colored. In this study grapheme-color synesthetes and non-synesthetes discriminated (1) the color of visual targets presented along with aurally presented digit primes, and (2) the identity of aurally presented digit targets presented with visual color primes. Reaction times to visual color targets were longer when the color of the target was incongruent with the synesthetic percept reported for the prime. Likewise, discriminating aurally presented digit targets took longer when the color of the prime was incongruent with the synesthetic percept for the target. These priming effects were absent in non-synesthetes. We conclude that binding between digits and colors in grapheme-color synesthetes can occur bidirectionally across senses. The results are in line with the idea that synesthesia is the result of linking inducing stimuli (e.g. digits) to synesthetic percepts (colors) at an abstract - supra-modal - conceptual level of processing.	t	\N
25705651	Noise is a major cause of health disorders in workers and has unique importance in the auditory analysis of people exposed to it. The purpose of this study is to evaluate the arithmetic mean of the auditory thresholds at frequencies of 3, 4, and 6 kHz of workers from five professional categories exposed to occupational noise. We propose a retrospective cross-sectional cohort study to analyze 2.140 audiograms from seven companies having five sectors of activity: one footwear company, one beverage company, two ceramics companies, two metallurgical companies, and two transport companies. When we compared two categories, we noticed a significant difference only for cargo carriers in comparison to the remaining categories. In all activity sectors, the left ear presented the worst values, except for the footwear professionals (P > 0.05). We observed an association between the noise exposure time and the reduction of audiometric values for both ears. Significant differences existed for cargo carriers in relation to other groups. This evidence may be attributed to different forms of exposure. A slow and progressive deterioration appeared as the exposure time increased.	t	\N
25705995	To investigate audiological and quality of life outcomes for a new active transcutaneous device, called the bone conduction implant (BCI), where the transducer is implanted under intact skin. A clinical study with sound field audiometry and questionnaires at six-month follow-up was conducted with a bone-anchored hearing aid on a softband as reference device. Six patients (age 18-67 years) with mild-to-moderate conductive or mixed hearing loss. The surgical procedure was found uneventful with no adverse events. The first hypothesis that BCI had a statistically significant improvement over the unaided condition was proven by a pure-tone-average improvement of 31.0 dB, a speech recognition threshold improvement in quiet (27.0 dB), and a speech recognition score improvement in noise (51.2 %). At speech levels, the signal-to-noise ratio threshold for BCI was - 5.5 dB. All BCI results were better than, or similar to the reference device results, and the APHAB and GBI questionnaires scores showed statistically significant improvements versus the unaided situation, supporting the second and third hypotheses. The BCI provides significant hearing rehabilitation for patients with mild-to-moderate conductive or mixed hearing impairments, and can be easily and safely implanted under intact skin.	t	\N
25707867	The magnitude of spatial distance between sound stimuli is critically important for their preattentive discrimination, yet the effect of stimulus context on auditory motion processing is not clear. This study investigated the effects of acoustical change and stimulus context on preattentive spatial change detection. Auditory event-related potentials (ERPs) were recorded for stationary midline noises and two patterns of sound motion produced by linear or abrupt changes of interaural time differences. Each of the three types of stimuli was used as standard or deviant in different blocks. Context effects on mismatch negativity (MMN) elicited by stationary and moving sound stimuli were investigated by reversing the role of standard and deviant stimuli, while the acoustical stimulus parameters were kept the same. That is, MMN amplitudes were calculated by subtracting ERPs to identical stimuli presented as standard in one block and deviant in another block. In contrast, effects of acoustical change on MMN amplitudes were calculated by subtracting ERPs of standards and deviants presented within the same block. Preattentive discrimination of moving and stationary sounds indexed by MMN was strongly dependent on the stimulus context. Higher MMNs were produced in oddball configurations where deviance represented increments of the sound velocity, as compared to configurations with velocity decrements. The effect of standard-deviant reversal was more pronounced with the abrupt sound displacement than with gradual sound motion.	t	\N
25722025	Most high-level auditory functions require one to detect the onset and offset of sound sequences as well as registering the rate at which sounds are presented within the sound trains. By recording event-related brain potentials to onsets and offsets of tone trains as well as to changes in the presentation rate, we tested whether these fundamental auditory capabilities are functional at birth. Each of these events elicited significant event-related potential components in sleeping healthy neonates. The data thus demonstrate that the newborn brain is sensitive to these acoustic features suggesting that infants are geared towards the temporal aspects of segregating sound sources, speech and music perception already at birth.	t	\N
25726273	Auditory processing is remarkably fast and sensitive to the precise temporal structure of acoustic signals over a range of scales, from submillisecond phenomena such as localization to the construction of elementary auditory attributes at tens of milliseconds to basic properties of speech and music at hundreds of milliseconds. In light of the rapid (and often transitory) nature of auditory phenomena, in order to investigate the neurocomputational basis of auditory perception and cognition, a technique with high temporal resolution is appropriate. Here we briefly outline the utility of magnetoencephalography (MEG) for the study of the neural basis of audition. The basics of MEG are outlined in brief, and some of the most-used neural responses are described. We discuss the classic transient evoked fields (e.g., M100), responses elicited by change in a stimulus (e.g., pitch-onset response), the auditory steady-state response, and neural oscillations (e.g., theta-phase tracking). Because of the high temporal resolution and the good spatial resolution of MEG, paired with the convenient location of human auditory cortex for MEG-based recording, electromagnetic recording of this type is well suited to investigate various aspects from audition, from crafted laboratory experiments on pitch perception or scene analysis to naturalistic speech and music tasks.	t	\N
25730639	In serial recall tasks, presenting items in alternating female and male voices impairs performance relative to the single-voice presentation. This phenomenon, termed the talker-variability effect (TVE), was recently reexamined by Hughes, Marsh, and Jones (2009, 2011), who used the effect as confirmatory evidence for their perceptual-gestural account of serial recall performance. Despite the authors' claim of generalisability, the serial recall paradigm employed did not reflect the standard parameters more generally adopted in verbal short-term memory research. Specifically, the presentation rate of the stimuli was almost 3 times that typically used. We sought to determine if the TVE, as observed by Hughes et al., was generalisable to the standard serial recall task by directly comparing recall performance in talker-variable conditions at fast and slow stimulus presentation rates. Experiment 1 employed a systematic replication of the foundational study undertaken by Hughes et al. (2009). Utilising a novel stimulus set, Experiment 2 provided a subsequent test of the generalisability of the TVE, examining the influence of item properties. Both experiments showed a robust TVE at the atypical fast presentation rate; however, for the slower item presentation, the TVE was unreliable. Furthermore, error analysis suggests that item recall also contributes to the TVE, contrary to the current explanation proposed by Hughes et al. (2009, 2011). The challenge of the present data to the perceptual-gestural account of the TVE is explored. Alternative accounts that focus on the resource cost of categorical speech perception in the context of talker variability are posited.	t	\N
25732931	Pacific bluefin tuna (Thunnus orientalis) is a highly migratory, commercially valuable species potentially vulnerable to acoustic noise generated from human activities which could impact behavior and fitness. Although significant efforts have been made to understand hearing abilities of fishes, the large size and need to continuously swim for respiration have hindered investigations with tuna and other large pelagic species. In this study, Pacific bluefin tuna were trained to respond to a pure tone sound stimulus ranging 325-800 Hz and their hearing abilities quantified using a staircase psychophysical technique. Hearing was most sensitive from 400 to 500 Hz in terms of particle motion (radial acceleration -88 dB re 1 m s(-2); vertical acceleration -86 dB re 1 m s(-2)) and sound pressure (83 dB re 1 μPa). Compared to yellowfin tuna (Thunnus albacares) and kawakawa (Euthynnus affinis), Pacific bluefin tuna has a similar bandwidth of hearing and best frequency, but greater sensitivity overall. Careful calibration of the sound stimulus and experimental tank environment, as well as the adoption of behavioral methodology, demonstrates an experimental approach highly effective for the study of large fish species in the laboratory.	t	\N
25740512	In normal listeners, the tonal rules of music guide musical expectancy. In a minority of individuals, known as amusics, the processing of tonality is disordered, which results in severe musical deficits. It has been shown that the tonal rules of music are neurally encoded, but not consciously available in amusics. Previous neurophysiological studies have not explicitly controlled the level of attention in tasks where participants ignored the tonal structure of the stimuli. Here, we test whether access to tonal knowledge can be demonstrated in congenital amusia when attention is controlled. Electric brain responses were recorded while asking participants to detect an individually adjusted near-threshold click in a melody. In half the melodies, a note was inserted that violated the tonal rules of music. In a second task, participants were presented with the same melodies but were required to detect the tonal deviation. Both tasks required sustained attention, thus conscious access to the rules of tonality was manipulated. In the click-detection task, the pitch deviants evoked an early right anterior negativity (ERAN) in both groups. In the pitch-detection task, the pitch deviants evoked an ERAN and P600 in controls but not in amusics. These results indicate that pitch regularities are represented in the cortex of amusics, but are not consciously available. Moreover, performing a pitch-judgment task eliminated the ERAN in amusics, suggesting that attending to pitch information interferes with perception of pitch. We propose that an impaired top-down frontotemporal projection is responsible for this disorder.	t	\N
25740534	When two musical notes with simple frequency ratios are played simultaneously, the resulting musical chord is pleasing and evokes a sense of resolution or "consonance". Complex frequency ratios, on the other hand, evoke feelings of tension or "dissonance". Consonance and dissonance form the basis of harmony, a central component of Western music. In earlier work, we provided evidence that consonance perception is based on neural temporal coding in the brainstem (Bones et al., 2014). Here, we show that for listeners with clinically normal hearing, aging is associated with a decline in both the perceptual distinction and the distinctiveness of the neural representations of different categories of two-note chords. Compared with younger listeners, older listeners rated consonant chords as less pleasant and dissonant chords as more pleasant. Older listeners also had less distinct neural representations of consonant and dissonant chords as measured using a Neural Consonance Index derived from the electrophysiological "frequency-following response." The results withstood a control for the effect of age on general affect, suggesting that different mechanisms are responsible for the perceived pleasantness of musical chords and affective voices and that, for listeners with clinically normal hearing, age-related differences in consonance perception are likely to be related to differences in neural temporal coding.	t	\N
25744662	Hearing loss is one of the most common symptoms of mitochondrial disorders. However, audiological phenotypes associated with different molecular defects in mtDNA are not yet well characterized. A large cohort of 1499 nonconsanguineous patients aged 5-40 years with hearing loss of unknown etiology was screened for mutations in mtDNA. For further analysis, patients harboring m.1555A>G and m.3243A>G were selected. Hearing status of the patients was assessed by pure tone audiometry. Patterns of audiograms (hearing threshold levels at each examined frequency) were statistically compared among the carriers of the m.1555A>G and the m.3243A>G mutations. We identified 20 patients positive for m.1555A>G mutation and 16 patients positive for m.3243A>G change. The frequency of the above transitions was calculated in our cohort as 1.33% and 1.06%, respectively. Seventeen affected family members carrying the mutations were included into the study. Typical shape of the audiograms in patients with m.1555A>G mutation presented a ski-slope pattern, whereas the audiometric curves among the m.3243A>G individuals had a pantonal shape (a flat curve) with slight downward sloping at the higher frequencies. The differences were statistically significant. The onset of hearing loss was noted earlier among m.1555A>G than m.3243A>G patients (12.5 and 26 years, respectively). Aminoglycoside administration was declared in both groups in 11 and 4 cases respectively, and caused abrupt hearing deterioration in all cases. A pattern of audiogram in patients with mitochondrial deafness may suggest a localization of mtDNA mutation. The pathogenesis of the audiometric differences needs further study.	t	\N
25768088	Cartilage conduction (CC) is a new form of sound transmission which is induced by a transducer being placed on the aural cartilage. Although the conventional forms of sound transmission to the cochlea are classified into air or bone conduction (AC or BC), previous study demonstrates that CC is not classified into AC or BC (Laryngoscope 124: 1214-1219). Next interesting issue is whether CC is a hybrid of AC and BC. Seven volunteers with normal hearing participated in this experiment. The threshold-shifts by water injection in the ear canal were measured. AC, BC, and CC thresholds at 0.5-4 kHz were measured in the 0%-, 40%-, and 80%-water injection conditions. In addition, CC thresholds were also measured for the 20%-, 60%-, 100%-, and overflowing-water injection conditions. The contributions of the vibrations of the cartilaginous portion were evaluated by the threshold-shifts. For AC and BC, the threshold-shifts by the water injection were 22.6-53.3 dB and within 14.9 dB at the frequency of 0.5-4 kHz, respectively. For CC, when the water was filled within the bony portion, the thresholds were elevated to the same degree as AC. When the water was additionally injected to reach the cartilaginous portion, the thresholds at 0.5 and 1 kHz dramatically decreased by 27.4 and 27.5 dB, respectively. In addition, despite blocking AC by the injected water, the CC thresholds in force level were remarkably lower than those for BC. The vibration of the cartilaginous portion contributes to the sound transmission, particularly in the low frequency range. Although the airborne sound is radiated into the ear canal in both BC and CC, the mechanism underlying its generation is different between them. CC generates airborne sound in the canal more efficiently than BC. The current findings suggest that CC is not a hybrid of AC and BC.	t	\N
25784489	Electrical stimulation of upper limb nerves evokes a train of high-frequency wavelets (high-frequency oscillations, HFOs) on the human scalp. These HFOs are related to the influence of arousal-promoting structures on somatosensory input processing, and are generated in the primary somatosensory cortex (post-synaptic HFOs) and the terminal tracts of thalamocortical radiations (pre-synaptic HFOs). We previously reported that HFOs do not undergo habituation to repeated stimulations; here, we verified whether HFOs could be modulated by external sensitizing stimuli. We recorded somatosensory evoked potentials (SSEPs) in 15 healthy volunteers before and after sensitization training with an auditory stimulus. Pre-synaptic HFO amplitudes, reflecting somatosensory thalamic/thalamocortical activity, significantly increased after the sensitizing acoustic stimulation, whereas both the low-frequency N20 SSEP component and post-synaptic HFOs were unaffected. Cross-talk between subcortical arousal-related structures is a probable mechanism for the pre-synaptic HFO effect observed in this study. We propose that part of the ascending somatosensory input encoded in HFOs is specifically able to convey sensitized inputs. This preferential involvement in sensitization mechanisms suggests that HFOs play a critical role in the detection of potentially relevant stimuli, and act at very early stages of somatosensory input processing.	t	\N
25813739	Perception of simultaneity and temporal order is studied with simultaneity judgment (SJ) and temporal-order judgment (TOJ) tasks. In the former, observers report whether presentation of two stimuli was subjectively simultaneous; in the latter, they report which stimulus was subjectively presented first. SJ and TOJ tasks typically give discrepant results, which has prompted the view that performance is mediated by different processes in each task. We looked at these discrepancies from a model that yields psychometric functions whose parameters characterize the timing, decisional, and response processes involved in SJ and TOJ tasks. We analyzed 12 data sets from published studies in which both tasks had been used in within-subjects designs, all of which had reported differences in performance across tasks. Fitting the model jointly to data from both tasks, we tested the hypothesis that common timing processes sustain simultaneity and temporal-order judgments, with differences in performance arising from task-dependent decisional and response processes. The results supported this hypothesis, also showing that model psychometric functions account for aspects of SJ and TOJ data that classical analyses overlook. Implications for research on perception of simultaneity and temporal order are discussed.	t	\N
25813742	Previous studies have shown that humans are sensitive to statistical patterns indicating the likely locations, identities, and timings of visual targets. Here we tested whether participants can also use this kind of information to ameliorate the attentional blink (AB)—a reduction in accuracy for the second of two targets (T1, T2) presented at brief intertarget intervals (lags). In particular, we asked whether participants can use patterns arising from differential distributions of intertarget lags across trials to predict the arrival of T2. We tested this by comparing the ABs in an aging versus a nonaging distribution of trials, where aging refers to the increased likelihood of T2, given that it has not yet occurred, when lags occur with equal frequencies. Experiments 1 and 2 showed that the aging condition yielded greater T2 accuracy at longer lags than did the nonaging condition. In Experiment 3, we used a more sensitive response time measure to show faster T2 discrimination at shorter lags in the nonaging condition. These results demonstrate that participants can predict the likely onset of T2 by using statistical patterns present in the AB task, and that they can use this ability to more effectively direct limited processing resources.	t	\N
25827259	Infants' attention is captured by the redundancy of amodal stimulation in multimodal objects and events. Evidence from this study demonstrates that intersensory redundancy can facilitate discrimination of rhythm changes presented in the visual modality alone in visually impaired infants, suggesting that multisensory rehabilitation strategies could prove helpful in this population.	t	\N
25862624	Animal models of tinnitus complement human findings and potentially deepen our insight into the neural substrates of tinnitus. The fact that animal data are largely based on recordings from the auditory system, in particular from subcortical structures, makes comparison with human electrophysiological data from predominantly cortical areas difficult. Electro/magnetoencephalography and imaging data extend beyond the auditory cortex. The most challenging link to be made is the one between the macroscopic data in humans and the microscopic (single neuron action potentials) and mesoscopic (local field potentials) results obtained in animal models. Since invasive recordings in humans are rare, a bridge needs to be built on the basis of changes in brain rhythms in animals with putative tinnitus.	t	\N
25878278	There is substantial evidence that sensory deprivation leads to important cross-modal brain reorganization that is paralleled by enhanced perceptual abilities. However, it remains unclear how widespread these enhancements are, and whether they are intercorrelated or arise at the expense of other perceptual abilities. One specific area where such a trade-off might arise is that of spatial hearing, where blind individuals have been shown to possess superior monaural localization abilities in the horizontal plane, but inferior localization abilities in the vertical plane. While both of these tasks likely involve the use of monaural cues due to the absence of any relevant binaural signal, there is currently no proper explanation for this discrepancy, nor has any study investigated both sets of abilities in the same sample of blind individuals. Here, we assess whether the enhancements observed in the horizontal plane are related to the deficits observed in the vertical plane by testing sound localization in both planes in groups of blind and sighted persons. Our results show that the blind individuals who displayed the highest accuracy at localizing sounds monaurally in the horizontal plane are also the ones who exhibited the greater deficit when localizing in the vertical plane. These findings appear to argue against the idea of generalized perceptual enhancements in the early blind, and instead suggest the possibility of a trade-off in the localization proficiency between the two auditory spatial planes, such that learning to use monaural cues for the horizontal plane comes at the expense of using those cues to localize in the vertical plane.	t	\N
25880903	Stuttering is a neurodevelopmental disorder that affects the timing and rhythmic flow of speech production. When speech is synchronized with an external rhythmic pacing signal (e.g., a metronome), even severe stuttering can be markedly alleviated, suggesting that people who stutter may have difficulty generating an internal rhythm to pace their speech. To investigate this possibility, children who stutter and typically-developing children (n=17 per group, aged 6-11 years) were compared in terms of their auditory rhythm discrimination abilities of simple and complex rhythms. Children who stutter showed worse rhythm discrimination than typically-developing children. These findings provide the first evidence of impaired rhythm perception in children who stutter, supporting the conclusion that developmental stuttering may be associated with a deficit in rhythm processing.	t	\N
25896774	In this report, we used filtered noise bands to constrain listeners' access to interaural level differences (ILDs) and interaural time differences (ITDs) in a sound source localization task. The samples of interest were listeners with single-sided deafness (SSD) who had been fit with a cochlear implant in the deafened ear (SSD-CI). The comparison samples included listeners with normal hearing and bimodal hearing, i.e., with a cochlear implant in 1 ear and low-frequency acoustic hearing in the other ear. The results indicated that (i) sound source localization was better in the SSD-CI condition than in the SSD condition, (ii) SSD-CI patients rely on ILD cues for sound source localization, (iii) SSD-CI patients show functional localization abilities within 1-3 months after device activation and (iv) SSD-CI patients show better sound source localization than bimodal CI patients but, on average, poorer localization than normal-hearing listeners. One SSD-CI patient showed a level of localization within normal limits. We provide an account for the relative localization abilities of the groups by reference to the differences in access to ILD cues.	t	\N
25912703	In this review we analyse cochlear implantation in terms of the fundamental aspects of the functioning of the auditory system. Concepts concerning neuronal plasticity applied to electrical stimulation in perinatal and adult deep hypoacusis are reviewed, and the latest scientific bases that justify early implantation following screening for congenital deafness are discussed. Finally, this review aims to serve as an example of the importance of fostering the sub-specialty of neurotology in our milieu, with the aim of bridging some of the gaps between specialties and thus improving both the knowledge in the field of research on auditory pathologies and in the screening of patients. The objectives of this review, targeted above all towards specialists in the field of otorhinolaryngology, are to analyse some significant neurological foundations in order to reach a better understanding of the clinical events that condition the indications and the rehabilitation of patients with cochlear implants, as well as to use this means to foster the growth of the sub-specialty of neurotology.	t	\N
25914938	The study examined the effect of structural composition, position of occlusion, and education background on response time and accuracy rate of adult participants in recognition of occluded Chinese characters used in graphic design. Based on 18 Chinese characters selected from the top 4,000 most commonly used characters, a recognition experiment was conducted. Results indicated that, for the characters with two different composition structures, the right position was the best position of occlusion because the recognition of the radical or feature of a character would be least affected, leading to a shorter response time and more accurate reading comprehension. Educational background did not have a significant influence on response time and recognition accuracy.	t	\N
25914939	This study examined the effects of audio-visual aids on anxiety, comprehension test scores, and retention in reading and listening to short stories in English as a Foreign Language (EFL) classrooms. Reading and listening tests, general and test anxiety, and retention were measured in English-major college students in an experimental group with audio-visual aids (n=83) and a control group without audio-visual aids (n=94) with similar general English proficiency. Lower reading test anxiety, unchanged reading comprehension scores, and better reading short-term and long-term retention after four weeks were evident in the audiovisual group relative to the control group. In addition, lower listening test anxiety, higher listening comprehension scores, and unchanged short-term and long-term retention were found in the audiovisual group relative to the control group after the intervention. Audio-visual aids may help to reduce EFL learners' listening test anxiety and enhance their listening comprehension scores without facilitating retention of such materials. Although audio-visual aids did not increase reading comprehension scores, they helped reduce EFL learners' reading test anxiety and facilitated retention of reading materials.	t	\N
25948273	The human brain has evolved to operate effectively in highly complex acoustic environments, segregating multiple sound sources into perceptually distinct auditory objects. A recent theory seeks to explain this ability by arguing that stream segregation occurs primarily due to the temporal coherence of the neural populations that encode the various features of an individual acoustic source. This theory has received support from both psychoacoustic and functional magnetic resonance imaging (fMRI) studies that use stimuli which model complex acoustic environments. Termed stochastic figure-ground (SFG) stimuli, they are composed of a "figure" and background that overlap in spectrotemporal space, such that the only way to segregate the figure is by computing the coherence of its frequency components over time. Here, we extend these psychoacoustic and fMRI findings by using the greater temporal resolution of electroencephalography to investigate the neural computation of temporal coherence. We present subjects with modified SFG stimuli wherein the temporal coherence of the figure is modulated stochastically over time, which allows us to use linear regression methods to extract a signature of the neural processing of this temporal coherence. We do this under both active and passive listening conditions. Our findings show an early effect of coherence during passive listening, lasting from ∼115 to 185 ms post-stimulus. When subjects are actively listening to the stimuli, these responses are larger and last longer, up to ∼265 ms. These findings provide evidence for early and preattentive neural computations of temporal coherence that are enhanced by active analysis of an auditory scene.	t	\N
25951749	The debate about the causal role of the motor system in speech perception has been reignited by demonstrations that motor processes are engaged during the processing of speech sounds. Here, we evaluate which aspects of auditory speech processing are affected, and which are not, in a stroke patient with dysfunction of the speech motor system. We found that the patient showed a normal phonemic categorical boundary when discriminating two non-words that differ by a minimal pair (e.g., ADA-AGA). However, using the same stimuli, the patient was unable to identify or label the non-word stimuli (using a button-press response). A control task showed that he could identify speech sounds by speaker gender, ruling out a general labelling impairment. These data suggest that while the motor system is not causally involved in perception of the speech signal, it may be used when other cues (e.g., meaning, context) are not available.	t	\N
25970594	Previous studies investigating speech perception in noise have typically been conducted with static masker positions. The aim of this study was to investigate the effect of spatial separation of source and masker (spatial release from masking, SRM) in a moving masker setup and to evaluate the impact of adaptive beamforming in comparison with fixed directional microphones in cochlear implant (CI) users. Speech reception thresholds (SRT) were measured in S0N0 and in a moving masker setup (S0Nmove) in 12 normal hearing participants and 14 CI users (7 subjects bilateral, 7 bimodal with a hearing aid in the contralateral ear). Speech processor settings were a moderately directional microphone, a fixed beamformer, or an adaptive beamformer. The moving noise source was generated by means of wave field synthesis and was smoothly moved in a shape of a half-circle from one ear to the contralateral ear. Noise was presented in either of two conditions: continuous or modulated. SRTs in the S0Nmove setup were significantly improved compared to the S0N0 setup for both the normal hearing control group and the bilateral group in continuous noise, and for the control group in modulated noise. There was no effect of subject group. A significant effect of directional sensitivity was found in the S0Nmove setup. In the bilateral group, the adaptive beamformer achieved lower SRTs than the fixed beamformer setting. Adaptive beamforming improved SRT in both CI user groups substantially by about 3 dB (bimodal group) and 8 dB (bilateral group) depending on masker type. CI users showed SRM that was comparable to normal hearing subjects. In listening situations of everyday life with spatial separation of source and masker, directional microphones significantly improved speech perception with individual improvements of up to 15 dB SNR. Users of bilateral speech processors with both directional microphones obtained the highest benefit.	t	\N
26027328	The parallel psychophysical and MMN study focused at the sensitivity of human hearing system to variations in velocity of sound image movement. The motion of sound stimuli with various velocities in the 450 deg/s to 732 deg/s range in increments of 6 deg/s to the left or to the right from the head midline was simulated by introducing linear changes of interaural delay into dichotic stimuli. The psychophysical experiments were designed according to the 2-alternative forced choice paradigm. The subjects were presented by pairs of moving stimuli and were asked to decide which moved faster. The stimuli created for the present study ensured that the subjects performed the discrimination task without relying on associated cues of sound displacement or duration. The psychophysical measures were compared with electrophysiological indexes of sound processing (auditory evoked responses (ERPs) and mismatch negativity (MMN)). Significant MMN was elicited by the difference of 170 deg/s between the reference and test velocity, which corresponded to the relative velocity increase of 38%. At the same time, the difference thresholds for velocity were much higher and exceeded 50%. The results suggest that MMN magnitude depended on the velocity difference between standard and deviant stimuli and was more sensitive to velocity difference than psychophysical measure.	t	\N
26067532	In peripheral vision, objects that are easily discriminated on their own become less discriminable in the presence of surrounding clutter. This phenomenon is known as crowding.The neural mechanisms underlying crowding are not well understood. Better insight might come from single-neuron recording in nonhuman primates, provided they exhibit crowding; however, previous demonstrations of crowding have been confined to humans. In the present study, we set out to determine whether crowding occurs in rhesus macaque monkeys. We found that animals trained to identify a target letter among flankers displayed three hallmarks of crowding as established in humans. First, at a given eccentricity, increasing the spacing between the target and the flankers improved recognition accuracy. Second, the critical spacing, defined as the minimal spacing at which target discrimination was reliable, was proportional to eccentricity. Third, the critical spacing was largely unaffected by object size. We conclude that monkeys, like humans, experience crowding. These findings open the door to studies of crowding at the neuronal level in the monkey visual system.	t	\N
26075675	Vibroplasty has offered a new modality of hearing rehabilitation in patients with mixed, conductive, and sensorineural hearing loss who cannot wear hearing aids. Potentially, the positioning of the floating mass transducer (FMT) in vibroplasty surgery has a critical effect on hearing outputs. In this study, the impact on hearing outputs and coupling efficiency are evaluated by comparing various vibroplasty applications in the middle ear. No other study to date has examined the coupling efficiency of round window (RW) versus an ossicular vibroplasty application. Prospective cohort study of patients with underlying ear pathologies who were not able to wear hearing aids. This is an ongoing prospective study of 16 patients. All patients had a standard audiological test battery. Direct drive transfer function analysis results were correlated with bone conduction thresholds to assess the efficiency of the FMT coupling. Speech perception in quiet and quality of life measure questionnaires were used to assess outcomes. Nine patients had round window vibroplasty, six patients had stapes vibroplasty, and one patient had traditional incus vibroplasty. Patients with a soft tissue coupler between the FMT and the RW had significantly reduced coupling efficiency. Patients who had direct RW contact had significantly improved coupling efficiency. Patients who underwent stapes or incus vibroplasty had the greatest coupling efficiency. This study demonstrates that attachment to the stapes or incus provides the best coupling when compared to round window vibroplasty. When applicable, stapes or incus coupling should be the first choice when implementing vibroplasty.	t	\N
26184883	Communication by sounds requires that the communication channels (i.e. speech/speakers and other sound sources) had been established. This allows to separate concurrently active sound sources, to track their identity, to assess the type of message arriving from them, and to decide whether and when to react (e.g., reply to the message). We propose that these functions rely on a common generative model of the auditory environment. This model predicts upcoming sounds on the basis of representations describing temporal/sequential regularities. Predictions help to identify the continuation of the previously discovered sound sources to detect the emergence of new sources as well as changes in the behavior of the known ones. It produces auditory event representations which provide a full sensory description of the sounds, including their relation to the auditory context and the current goals of the organism. Event representations can be consciously perceived and serve as objects in various cognitive operations.	t	\N
26222939	To compare children with typical language development (TLD) and evolutional phonological disorder (EPD) regarding the phonemic discrimination and the linguistic performance of language levels (morphological, syntactic, semantic, and perceptual and productive vocabulary). The sample comprised 36 children, aged between 5 years and 7 years and 11 months, with TLD and EPD. Children with EPD were awaiting care in the speech units of two higher education institutions and children with TLD were screened in public schools. For inclusion in the study, the criteria were the following: being authorized by the informed consent and being within the required age group. After the inclusion of children, all subjects underwent the phonemic discrimination test with figures, to the average phrase value test, which assesses the morphosyntactic and semantic/lexical aspects, and the expressive vocabulary test. For statistical evaluation of the influence of variables, Spearman's nonparametric correlation coefficient was used, and for comparison between the groups regarding evaluation performance, Mann-Whitney test was used. Only children with EPD showed influence of phonemic discrimination in other linguistic levels. In addition, a significant difference was observed between the performances of both groups in relation to the phonemic discrimination, vocabulary, and all levels assessed in the questions and description modality. The comparison between both groups showed a statistically significant difference in phonemic discrimination and morphosyntactic and lexical/semantic development, with better performance in the TLD group.	t	\N
26222946	To analyze the intra- and inter-rater agreement for visual analog scale and numerical scale in task of sustained vowel and to determine numerical cutoff points to visual analog scale corresponding to the degrees of the numeric scale. We selected 205 samples of the usual task of the sustained vowel /a/. Six voice specialists rated the overall degree of vocal deviation, first by visual analog scale and, after two days, by the numeric scale. The results obtained by both scales were compared and the intra- and inter-rater agreement, the correlation between the scales, and the estimated cutoff points using the intraclass correlation and concordance Kappa coefficients, the Spearman coefficient, and analysis of variance, and the values of sensitivity and specificity were analyzed. A strong correlation was observed between the scales. The following numerical cutoff values were found for visual analog scale corresponding to the numerical scale: neutral (degree zero) - 0 to 34 mm; mild (degree one) - 34.1 to 51 mm; moderate (degree two) - 51.1 to 63.5 mm; intense (degree three) - 63.6 to 77.5 mm; and extreme (degree four) - above 77.5 mm. The visual analog scale and numerical scale showed a strong correlation, being observed the greater intra- and inter-rater agreement in visual analog scale. Numerical cutoff values for visual analog scale were found. This correlation enables the comparison between the results found in the evaluation of the overall degree of vocal deviation by both scales, which are widely used in research and in the clinical speech therapy routine.	t	\N
26356376	During dental treatments, patients may experience negative emotions associated with the procedure. This study was conducted with the aim of using functional magnetic resonance imaging (fMRI) to visualize cerebral cortical stimulation among dental patients in response to auditory stimuli produced by ultrasonic scaling and power suction equipment. Subjects (n = 7) aged 23-35 years were recruited for this study. All were right-handed and underwent clinical pure-tone audiometry testing to reveal a normal hearing threshold below 20 dB hearing level (HL). As part of the study, subjects initially underwent a dental calculus removal treatment. During the treatment, subjects were exposed to ultrasonic auditory stimuli originating from the scaling handpiece and salivary suction instruments. After dental treatment, subjects were imaged with fMRI while being exposed to recordings of the noise from the same dental instrument so that cerebral cortical stimulation in response to aversive auditory stimulation could be observed. The independent sample confirmatory t-test was used. Subjects also showed stimulation in the amygdala and prefrontal cortex, indicating that the ultrasonic auditory stimuli elicited an unpleasant response in the subjects. Patients experienced unpleasant sensations caused by contact stimuli in the treatment procedure. In addition, this study has demonstrated that aversive auditory stimuli such as sounds from the ultrasonic scaling handpiece also cause aversive emotions. This study was indicated by observed stimulation of the auditory cortex as well as the amygdala, indicating that noise from the ultrasonic scaling handpiece was perceived as an aversive auditory stimulus by the subjects. Subjects can experience unpleasant sensations caused by the sounds from the ultrasonic scaling handpiece based on their auditory stimuli.	t	\N
26380996	The purpose was to evaluate brain plasticity that contributes to speech performance after cochlear implantation (CI) in postlingual elderly (>60 years) patients. Fifteen elderly postlingual deaf patients who underwent preoperative brain fluorodeoxyglucose positron emission tomography (FDG-PET) and were followed-up for more than 1 year after CI were included. The mean age of these patients was 64.6 years (range, 60-80 years). Based on their sentence score at 1 year after CI surgery, the patients were classified into two groups: poor performers (CID score of <80) and good performers (CID score of ≥80). The duration of deafness, age at operation, preoperative residual hearing, and preoperative brain metabolism were analyzed. SPM5 software was used for FDG-PET image preprocessing and statistical analysis. Neither deafness duration nor preoperative residual hearing was associated with speech performance. The age at operation had little association with speech performance. Deaf patients whose brain metabolism was higher in frontotemporal regions became good CI users but those with higher metabolism in visual association areas became poor CI users. No significant cortical area of higher metabolism was associated with the duration of deafness. Overactivation in the visual processing pathway correlated with a poor CI outcome at 1 year. Deaf patients who are going to be poorer performers with CI devices maintain visual information processing during preoperative silent resting periods.	t	\N
26381009	The occurrence of oval window atresia is a rare anomaly with conductive hearing loss. Traditional atresia surgeries involve challenging surgical techniques with risks of irreversible inner ear damage. Recent reports on Bonebridge (Medel, Innsbruck, Austria), a novel implantable bone conduction hearing aid system, assert that the device is safe and effective for conductive hearing loss. We present a case of Bonebridge implantation in an eight-year-old girl with bilateral oval window atresia.	t	\N
26382004	Previewing distracters enhances the efficiency of visual search. Watson and Humphreys (1997) proposed that the preview benefit rests on visual marking, a mechanism which actively encodes distracter locations at preview and inhibits them afterwards at search. As Watson and Humphreys did, we used a letter-color search task to study constraints of visual marking in conjunction search and near-efficient single-feature search with single-colored and homogeneous distracter letters. Search performance was measured for fixed target and distracter features (block design) and for randomly changed features across trials (random design). In single-feature search there was a full preview benefit for both block and random designs. In conjunction search a full preview benefit was obtained only for the block design; randomly changing target and distracter features disrupted the preview benefit. However, the preview benefit was restored when the distracters were organized in spatially coherent blocks. These findings imply that the temporal segregation of old and new items is sufficient for visual marking in near-efficient single-feature search, while in conjunction search it is not. We propose a supplanting grouping principle for the preview benefit: When the new items add a new color, conjunction search is initialized and attentional resources are withdrawn from the marking mechanism. Visual marking can be restored by a second grouping principle that joins with temporal asynchrony. This principle can be either spatial or feature based. In the case of the latter, repetition priming is necessary to establish joint grouping by color and temporal asynchrony.	t	\N
26463676	The effects of inner ear abnormality on audibility have been explored since the early 20th century when sound detection measures were first used to define and quantify 'hearing loss'. The development in the 1970s of objective measures of cochlear hair cell function (cochlear microphonics, otoacoustic emissions, summating potentials) and auditory nerve/brainstem activity (auditory brainstem responses) have made it possible to distinguish both synaptic and auditory nerve disorders from sensory receptor loss. This distinction is critically important when considering aetiology and management. In this review we address the clinical and pathophysiological features of auditory neuropathy that distinguish site(s) of dysfunction. We describe the diagnostic criteria for: (i) presynaptic disorders affecting inner hair cells and ribbon synapses; (ii) postsynaptic disorders affecting unmyelinated auditory nerve dendrites; (iii) postsynaptic disorders affecting auditory ganglion cells and their myelinated axons and dendrites; and (iv) central neural pathway disorders affecting the auditory brainstem. We review data and principles to identify treatment options for affected patients and explore their benefits as a function of site of lesion.	t	\N
22284837	Best practices concerning the audiological management of the child diagnosed with auditory neuropathy spectrum disorder (ANSD) have not been definitively defined nor fully understood. One reason is that previous studies have demonstrated conflicting findings regarding the outcomes of cochlear implantation for children with ANSD. Thus, the question remains whether children with ANSD are able to achieve similar outcomes following cochlear implantation as those children with sensorineural hearing loss (SNHL). To assess speech perception outcomes for children with cochlear implants who have a diagnosis of ANSD as well as their age-matched peers who have sensorineural hearing loss. Retrospective study Thirty-five subject pairs (n = 70) ranging in age at implant activation from to 10 to 121 mo (mean 39.2 mo) were included in this retrospective study. Subjects were matched on variables including age at initial implant activation and months of implant use at postoperative test point. Speech recognition scores for monosyllabic and multisyllabic stimuli were compared across the subject groups. For those not developmentally and/or linguistically ready for completion of open-set speech recognition testing with recorded stimuli, GASP (Glendonald Auditory Screening Procedure) word recognition and/or questionnaire data using either the LittlEARS or Meaningful Auditory Integration Scale were compared across the groups. Statistical analysis using a repeated-measures analysis of variance (ANOVA) evaluated the effects of etiology (ANSD or SNHL) on postoperative outcomes. The results of this study demonstrate that children with ANSD can clearly benefit from cochlear implantation and that their long-term outcomes are similar to matched peers with SNHL on measures of speech recognition. There were no significant differences across the ANSD and SNHL groups on any of the tested measures. Cochlear implantation is a viable treatment option for children with a diagnosis of ANSD who are not making auditory progress with hearing aids that have been fit using the Desired Sensation Level method (DSL v5.0). Expected outcomes of cochlear implantation for children with ANSD, excluding children with cochlear nerve deficiency, are no different than for children with non-ANSD SNHL. These results are important for counseling families on the expected outcomes and realistic expectations following cochlear implantation for children with ANSD who demonstrate no evidence of cochlear nerve deficiency.	t	\N
