20044890	The present study aimed to examine the influence of musical expertise on the metric and semantic aspects of speech processing. In two attentional conditions (metric and semantic tasks), musicians listened to short sentences ending in trisyllabic words that were semantically and/or metrically congruous or incongruous. Both ERPs and behavioral data were analyzed and the results were compared to previous nonmusicians' data. Regarding the processing of meter, results showed that musical expertise influenced the automatic detection of the syllable temporal structure (P200 effect), the integration of metric structure and its influence on word comprehension (N400 effect), as well as the reanalysis of metric violations (P600 and late positivities effects). By contrast, results showed that musical expertise did not influence the semantic level of processing. These results are discussed in terms of transfer of training effects from music to speech processing.	\N	\N
20117764	We present new evidence based on fMRI for the existence and neural architecture of an abstract supramodal language system that can integrate linguistic inputs arising from different modalities such that speech and print each activate a common code. Working with sentence material, our aim was to find out where the putative supramodal system is located and how it responds to comprehension challenges. To probe these questions we examined BOLD activity in experienced readers while they performed a semantic categorization task with matched written or spoken sentences that were either well-formed or contained anomalies of syntactic form or pragmatic content. On whole-brain scans, both anomalies increased net activity over non-anomalous baseline sentences, chiefly at left frontal and temporal regions of heteromodal cortex. The anomaly-sensitive sites correspond approximately to those that previous studies (Michael et al., 2001; Constable et al., 2004) have found to be sensitive to other differences in sentence complexity (object relative minus subject relative). Regions of interest (ROIs) were defined by peak response to anomaly averaging over modality conditions. Each anomaly-sensitive ROI showed the same pattern of response across sentence types in each modality. Voxel-by-voxel exploration over the whole brain based on a cosine similarity measure of common function confirmed the specificity of supramodal zones.	\N	\N
20146608	The neural responses to sensory consequences of a self-produced motor act are suppressed compared with those in response to a similar but externally generated stimulus. Previous studies in the somatosensory and auditory systems have shown that the motor-induced suppression of the sensory mechanisms is sensitive to delays between the motor act and the onset of the stimulus. The present study investigated time-dependent neural processing of auditory feedback in response to self-produced vocalizations. ERPs were recorded in response to normal and pitch-shifted voice auditory feedback during active vocalization and passive listening to the playback of the same vocalizations. The pitch-shifted stimulus was delivered to the subjects' auditory feedback after a randomly chosen time delay between the vocal onset and the stimulus presentation. Results showed that the neural responses to delayed feedback perturbations were significantly larger than those in response to the pitch-shifted stimulus occurring at vocal onset. Active vocalization was shown to enhance neural responsiveness to feedback alterations only for nonzero delays compared with passive listening to the playback. These findings indicated that the neural mechanisms of auditory feedback processing are sensitive to timing between the vocal motor commands and the incoming auditory feedback. Time-dependent neural processing of auditory feedback may be an important feature of the audio-vocal integration system that helps to improve the feedback-based monitoring and control of voice structure through vocal error detection and correction.	\N	\N
20347261	In this study, the subjective and objective voice measures of seven female physical education student teachers during a semester of student teaching were investigated. The participants completed the voice measures at three data collection time points: baseline, middle, and end of the semester. The voice measures included acoustic and aerodynamic data, perceptual rating scales of vocal quality and vocal fatigue, an end-of-semester questionnaire, and the Voice Handicap Index. Results demonstrated that the subjective and objective voice measures changed at the middle and the end of the semester as compared with those at baseline. The change in the voice measures may suggest that the vocal mechanism was adapting to the increased vocal demands of teaching physical education.	\N	\N
20411315	Noisy recordings of dialogue often serve as evidence in criminal proceedings. The present article explores the ability of two types of contextual information, currently present in the legal system, to bias subjective interpretations of such evidence. The present experiments demonstrate that the general context of the legal system and the presence of transcripts of the recorded speech are both able to shift interpretations of degraded, benign recordings toward interpretable and incriminating ones. Furthermore, we demonstrate a curse of knowledge whereby people become miscalibrated to the true quality of degraded recordings when provided transcripts. Current methods of dealing with auditory evidence are insufficient to mollify the effects of biasing information within the criminal justice system.	\N	\N
20433240	On the basis of results from behavioral studies that spatial attention improves the exclusion of external noise in the target region, we predicted that attending to a spatial region would reduce the impact of external noise on the BOLD response in corresponding cortical areas, seen as reduced BOLD responses in conditions with large amounts of external noise but relatively low signal, and increased dynamic range of the BOLD response to variations in signal contrast. We found that, in the presence of external noise, covert attention reduced the trial-by-trial BOLD response by 15.5-18.9% in low signal contrast conditions in V1. It also increased the BOLD dynamic range in V1, V2, V3, V3A/B, and V4 by a factor of at least three. Overall, covert attention reduced the impact of external noise by about 73-85% in these early visual areas. It also increased the contrast gain by a factor of 2.6-3.8.	\N	\N
20576373	Patients with systemic lupus erythematosus (SLE) may develop hearing and balance disorders as a result of the immune-mediated inner ear damage due to vasculitis or ototoxicity of drugs used in SLE treatment. The aim of the study was to evaluate hearing organ disorders in patients with SLE with particular regard to their prevalence and relationship to duration and severity of disease. The severity was assessed from involvement of organs that resulted in poorer SLE outcome, i.e. kidneys and central nervous system (CNS), and from the presence of antibodies associated with unfavourable SLE prognosis. Thirty-five unselected, consecutive patients (33 women, two men, mean age 47.8 years) with SLE diagnosed in accordance with the international diagnostic criteria of the American Rheumatism Association (1982) were enrolled in the study. The control group consisted of 30 otologically healthy persons matched to the SLE group for age and sex. Case history was recorded for all patients from questionnaire data and laryngological examinations were performed, followed by pure-tone, speech and impedance audiometry and auditory brainstem response audiometry (ABR). In the anamnesis 71.4% of patients reported vertigo, 62.9% headaches, 40% tinnitus, 25.7% hyperacusis, 17.1% hearing loss and 2.9% ear fullness. It was found that SLE patients had significantly poorer mean hearing thresholds than the control group for all frequencies, except for 500, 2000 and 4000 Hz. Longer ABR latency averages were observed in the group of SLE patients compared with controls. Ten patients (28.6%) developed high-frequency and symmetric sensorineural hearing loss (SNHL). Significant positive correlation between mean air-conduction hearing thresholds and SLE duration (r = 0.46, p < 0.001) was found. After taking age into consideration, hearing acuity in SLE was related to duration of disease in younger patients. Furthermore, no relation was seen between hearing level and severity of disease. 
Auditory system involvement ought to be considered as one of elements of the clinical picture of systemic lupus erythematosus while determination of its character, original or secondary, requires further research.	\N	\N
20599334	Children who have difficulty perceiving sounds presented at rapid rates during the period of speech sound and language learning may show delayed speech sound and language development because they do not hear speech sounds clearly. Auditory temporal processing (ATP) is the ability to perceive auditory signals of brief duration accurately when presented at rapid rates. ATP can be evaluated by the random gap detection test (RGDT), which detects a brief gap between two stimuli. In this study, we investigated the performance of children with previous language delay (PLD) who currently have disorders in more than one speech sound on the RGDT and RGDT-expanded (RGDT-EXP) tests. Twelve children (8 male, 4 female) with PLD and a complaint of distorted speech sound production were included in the study. They had received language training for at least one year in the past, and their language development is currently normal. They produced one or more speech sounds in a distorted manner. The control group consisted of 10 age-matched, normal-hearing children (5 male, 5 female) with normal phonological and language development and no PLD. The children's language levels were evaluated with the Preschool Language Scale-4 or the Clinical Evaluation of Language Fundamentals, fourth edition (CELF-4), according to the child's age. Speech sound development was assessed with the Speech Sound Development Test (SSDT). The children were given the RGDT and, if necessary, the RGDT-EXP. Each child reported whether he/she heard one or two tones. Responses were given verbally and/or by holding up one or two fingers. In the second test, a speech discrimination test was administered in quiet and in noise. Gap detection thresholds (GDTs) were determined at 500-4000 Hz, and composite GDTs (CGDTs) were calculated for the study and control groups. A GDT/CGDT > 20 ms was considered abnormal, indicating a temporal processing disorder. 
In the study group with PLD, the mean GDTs were all above normal limits; in the control group, the mean GDTs were all within normal limits. The mean GDTs of the study group were significantly higher than those of the control group at all frequencies from 500 to 4000 Hz. In the PLD group, the CGDT (103.53 ± 11.63 ms) was significantly higher than that of the control group (10.35 ± 0.65 ms) (p=0.021). Children with PLD have difficulty perceiving speech sounds at a certain rate, even if they do not have language learning difficulties. Therefore, difficulty in distinguishing speech sounds may cause delay, especially in receptive language development. We believe that training in the perception of speech sounds and language at a certain rate, together with temporally degraded speech programmes, should be incorporated into the training programme and may help to prevent delays.	\N	\N
20617885	The present study investigated the effects of auditory selective attention on the processing of syntactic information in music and speech using event-related potentials. Spoken sentences or musical chord sequences were either presented in isolation, or simultaneously. When presented simultaneously, participants had to focus their attention either on speech, or on music. Final words of sentences and final harmonies of chord sequences were syntactically either correct or incorrect. Irregular chords elicited an early right anterior negativity (ERAN), whose amplitude was decreased when music was simultaneously presented with speech, compared to when only music was presented. However, the amplitude of the ERAN-like waveform elicited when music was ignored did not differ from the conditions in which participants attended the chord sequences. Irregular sentences elicited an early left anterior negativity (ELAN), regardless of whether speech was presented in isolation, was attended, or was to be ignored. These findings suggest that the neural mechanisms underlying the processing of syntactic structure of music and speech operate partially automatically, and, in the case of music, are influenced by different attentional conditions. Moreover, the ERAN was slightly reduced when irregular sentences were presented, but only when music was ignored. Therefore, these findings provide no clear support for an interaction of neural resources for syntactic processing already at these early stages.	\N	\N
20642737	Acoustic and visual interceptive actions were tested in this research by comparing the performance of blind, blindfolded, and sighted individuals. An indirect interception method was employed in which the participant had to roll an intercepting ball towards a moving target on a perpendicular track. The interception task used conditions that varied the speed, rolling distance, and target size/intensity. While performance was highly consistent and accurate for sighted participants in this research, the blind and blindfolded participants demonstrated much more performance variability in response to changes in speed and distance. Manipulation of target size and intensity did not affect judgments; however, performance tended to be more accurate at shorter distances and with faster target speeds. Results from this research are discussed in terms of their implications for tau in acoustic interception, and the use of spatial and temporal cues for guiding interceptive actions.	\N	\N
20644955	Two experiments investigated the effects of interval duration ratio on perception of local timing perturbations, accuracy of rhythm production, and phase correction in musicians listening to or tapping in synchrony with cyclically repeated auditory two-interval rhythms. Ratios ranged from simple (1:2) to complex (7:11, 5:13), and from small (5:13 = 0.38) to large (6:7 = 0.86). Rhythm production and perception exhibited similar ratio-dependent biases: rhythms with small ratios were produced with increased ratios, and timing perturbations in these rhythms tended to be harder to detect when they locally increased the ratio than when they reduced it. The opposite held for rhythms with large ratios. This demonstrates a close relation between rhythm perception and production. Unexpectedly, however, the neutral "attractor" was not the simplest ratio (1:2 = 0.50) but a complex ratio near 4:7 (= 0.57). Phase correction in response to perturbations was generally rapid and did not show the ratio-dependent biases observed in rhythm perception and production. Thus, phase correction operates efficiently and autonomously even in synchronization with rhythms exhibiting complex interval ratios.	\N	\N
20663254	Patients with schizophrenia (SZ) characteristically exhibit supranormal levels of cortical activity to self-induced sensory stimuli, ostensibly because of abnormalities in the neural signals (corollary discharges, CDs) normatively involved in suppressing the sensory consequences of self-generated actions. The nature of these abnormalities is unknown. This study investigated whether SZ patients experience CDs that are abnormally delayed in their arrival at the sensory cortex. Twenty-one patients with SZ and 25 matched control participants underwent electroencephalography (EEG). Participants' level of cortical suppression was calculated as the amplitude of the N1 component evoked by a button press-elicited auditory stimulus, subtracted from the N1 amplitude evoked by the same stimulus presented passively. In the three experimental conditions, the auditory stimulus was delivered 0, 50 or 100 ms subsequent to the button-press. Fifteen SZ patients and 17 healthy controls (HCs) also underwent diffusion tensor imaging (DTI), and the fractional anisotropy (FA) of participants' arcuate fasciculus was used to predict their level of cortical suppression in the three conditions. While the SZ patients exhibited subnormal N1 suppression to undelayed, self-generated auditory stimuli, these deficits were eliminated by imposing a 50-ms, but not a 100-ms, delay between the button-press and the evoked stimulus. Furthermore, the extent to which the 50-ms delay normalized a patient's level of N1 suppression was linearly related to the FA of their arcuate fasciculus. These data suggest that SZ patients experience temporally delayed CDs to self-generated auditory stimuli, putatively because of structural damage to the white-matter (WM) fasciculus connecting the sites of discharge initiation and destination.	\N	\N
20665720	The effect of stimulus modulation rate on the underlying neural activity in human auditory cortex is not clear. Human studies (using both invasive and noninvasive techniques) have demonstrated that at the population level, auditory cortex follows stimulus envelope. Here we examined the effect of stimulus modulation rate by using a rare opportunity to record both spiking activity and local field potentials (LFP) in auditory cortex of patients during repeated presentations of an audio-visual movie clip presented at normal, double, and quadruple speeds. Mean firing rate during evoked activity remained the same across speeds and the temporal response profile of firing rate modulations at increased stimulus speeds was a linearly scaled version of the response during slower speeds. Additionally, stimulus induced power modulation of local field potentials in the high gamma band (64-128 Hz) exhibited similar temporal scaling as the neuronal firing rate modulations. Our data confirm and extend previous studies in humans and anesthetized animals, supporting a model in which both firing rate, and high-gamma LFP power modulations in auditory cortex follow the temporal envelope of the stimulus across different modulation rates.	\N	\N
20666594	Spoken word recognition is achieved via competition between activated lexical candidates that match the incoming speech input. The competition is modulated by prelexical cues that are important for segmenting the auditory speech stream into linguistic units. One such prelexical cue that listeners rely on in spoken word recognition is phonotactics. Phonotactics defines possible combinations of phonemes within syllables or words in a given language. The present study aimed at investigating both temporal and topographical aspects of the neuronal correlates of phonotactic processing by simultaneously applying ERPs and functional near-infrared spectroscopy (fNIRS). Pseudowords, either phonotactically legal or illegal with respect to the participants' native language, were acoustically presented to passively listening adult native German speakers. ERPs showed a larger N400 effect for phonotactically legal compared to illegal pseudowords, suggesting stronger lexical activation mechanisms in phonotactically legal material. fNIRS revealed a left hemispheric network including fronto-temporal regions with greater response to phonotactically legal pseudowords than to illegal pseudowords. This confirms earlier hypotheses on a left hemispheric dominance of phonotactic processing most likely due to the fact that phonotactics is related to phonological processing and represents a segmental feature of language comprehension. These segmental linguistic properties of a stimulus are predominantly processed in the left hemisphere. Thus, our study provides first insights into temporal and topographical characteristics of phonotactic processing mechanisms in a passive listening task. Differential brain responses between known and unknown phonotactic rules thus supply evidence for an implicit use of phonotactic cues to guide lexical activation mechanisms.	\N	\N
20801079	Tinnitus can be considered an auditory phantom percept, in which patients hear an internal sound in the absence of any external sound source, mimicking tonal memory. Tinnitus, however, can be perceived exclusively uni- or bilaterally. The neurophysiological differences between unilateral and bilateral tinnitus were investigated using LORETA source-localized resting state EEG recordings. The difference between unilateral and bilateral tinnitus is reflected by high frequency activity (beta and gamma) in the superior prefrontal gyrus, right parahippocampus, right angular gyrus and right auditory cortex. Unilateral tinnitus is characterized by contralateral beta2 in the superior prefrontal gyrus in comparison to bilateral tinnitus, but gamma in comparison to non-tinnitus subjects. Bilateral tinnitus has delta activity in the ventrolateral prefrontal cortex in comparison to unilateral tinnitus, and bilateral beta1 in comparison to non-tinnitus subjects. Bilateral tinnitus is also characterized by bilateral frontopolar beta1 activity. Unilateral and bilateral tinnitus can be differentiated based on their resting state oscillation patterns: beta3 and gamma-band activity in the superior premotor cortex, parahippocampal area and angular gyrus seem to form the core of a spatial localization network involved in tinnitus. These differences should be taken into account when evaluating functional neuroimaging data relating to tinnitus.	\N	\N
20808225	The purpose of this study was to examine speech recognition through hearing aids for seven telephone listening conditions. Speech recognition scores were measured for 20 participants in six wireless routing transmission conditions and one acoustic telephone condition. In the wireless conditions, the speech signal was delivered to both ears simultaneously (bilateral speech) or to one ear (unilateral speech). The effect of changing the noise level in the nontest ear during unilateral conditions was also examined. Participants were fitted with hearing aids using both nonoccluding and occluding dome ear tips. Participants were seated in a room with background noise present and speech was transmitted to the participants without additional noise. There was no effect of changing the noise level in the nontest ear and no difference between unilateral wireless routing and acoustic telephone listening. For wireless transmission, bilateral presentation resulted in significantly better speech recognition than unilateral presentation. Bilateral wireless conditions allowed for significantly better recognition than the acoustic telephone condition for participants fitted with occluding ear tips only. Routing the signal to both hearing aids resulted in significantly better speech recognition than unilateral signal routing. Wireless signal routing was shown to be beneficial compared with acoustic telephone listening and in some conditions resulted in the best performance of all of the listening conditions evaluated. However, this advantage was only evident when the signal was routed to both ears and when hearing aid wearers were fitted with occluding domes. Therefore, it is expected that the benefits of this new wireless streaming technology over existing telephone coupling methods will be most evident clinically in hearing aid wearers who require more limited venting than is typically used in open canal fittings.	\N	\N
20812786	In contrast to visual object processing, relatively little is known about how the human brain processes everyday real-world sounds, transforming highly complex acoustic signals into representations of meaningful events or auditory objects. We recently reported a fourfold cortical dissociation for representing action (nonvocalization) sounds correctly categorized as having been produced by human, animal, mechanical, or environmental sources. However, it was unclear how consistent those network representations were across individuals, given potential differences between each participant's degree of familiarity with the studied sounds. Moreover, it was unclear what, if any, auditory perceptual attributes might further distinguish the four conceptual sound-source categories, potentially revealing what might drive the cortical network organization for representing acoustic knowledge. Here, we used functional magnetic resonance imaging to test participants before and after extensive listening experience with action sounds, and tested for cortices that might be sensitive to each of three different high-level perceptual attributes relating to how a listener associates or interacts with the sound source. These included the sound's perceived concreteness, effectuality (ability to be affected by the listener), and spatial scale. Despite some variation of networks for environmental sounds, our results verified the stability of a fourfold dissociation of category-specific networks for real-world action sounds both before and after familiarity training. Additionally, we identified cortical regions parametrically modulated by each of the three high-level perceptual sound attributes. We propose that these attributes contribute to the network-level encoding of category-specific acoustic knowledge representations.	\N	\N
20844257	It is important to ensure that hearing aid fitting strategies for infants take into account the infant's developing speech perception system. As a way of exploring this issue, this study examined how 6- and 9-month-olds with normal hearing perceive native-language speech in which the natural spectral shape was altered to emphasize either high-frequency (positive spectral tilt) or low-frequency (negative spectral tilt) information. Discrimination was tested using a visual habituation procedure. Forty-eight 6-month-olds and forty-eight 9-month-olds were presented with a fricative contrast, /f/-/s/, in 1 of 3 conditions: (a) as unmodified speech; (b) with a -6 dB/octave tilt; or (c) with a +6 dB/octave tilt. Six-month-olds showed evidence of discriminating /f/-/s/ in all 3 conditions, but 9-month-olds showed such evidence only in the unmodified condition. The findings suggest that the perceptual reorganization that emerges for consonants at the end of the first year affects 9-month-olds' discrimination of native speech sounds. Perceptual reorganization is usually indexed by a decline in the ability to discriminate nonnative speech sounds. In this study, 6-month-olds demonstrated an acoustic-based sensitivity to both modified and unmodified native speech sounds, but 9-month-olds were most sensitive to the unmodified speech sounds that adhered to the native spectral profile.	\N	\N
20864070	Auditory and visual processes demonstrably enhance each other based on spatial and temporal coincidence. Our recent results on visual search have shown that auditory signals also enhance visual salience of specific objects based on multimodal experience. For example, we tend to see an object (e.g., a cat) and simultaneously hear its characteristic sound (e.g., "meow"), to name an object when we see it, and to vocalize a word when we read it, but we do not tend to see a word (e.g., cat) and simultaneously hear the characteristic sound (e.g., "meow") of the named object. If auditory-visual enhancements occur based on this pattern of experiential associations, playing a characteristic sound (e.g., "meow") should facilitate visual search for the corresponding object (e.g., an image of a cat), hearing a name should facilitate visual search for both the corresponding object and corresponding word, but playing a characteristic sound should not facilitate visual search for the name of the corresponding object. Our present and prior results together confirmed these experiential association predictions. We also recently showed that the underlying object-based auditory-visual interactions occur rapidly (within 220ms) and guide initial saccades towards target objects. If object-based auditory-visual enhancements are automatic and persistent, an interesting application would be to use characteristic sounds to facilitate visual search when targets are rare, such as during baggage screening. Our participants searched for a gun among other objects when a gun was presented on only 10% of the trials. The search time was speeded when a gun sound was played on every trial (primarily on gun-absent trials); importantly, playing gun sounds facilitated both gun-present and gun-absent responses, suggesting that object-based auditory-visual enhancements persistently increase the detectability of guns rather than simply biasing gun-present responses. 
Thus, object-based auditory-visual interactions that derive from experiential associations rapidly and persistently increase visual salience of corresponding objects.	\N	\N
20883507	The prior entry hypothesis of attention holds that attended stimuli are perceived earlier than unattended stimuli. Whereas this speeding of perceptual processing has been repeatedly demonstrated for spatial attention, it has not been reported within the temporal domain. To fill this gap, we tested whether temporal attention accelerates auditory perceptual processing by employing event-related potentials as on-line indicators of perceptual processing. In a modified oddball paradigm, we presented a single tone in each trial, either a frequent standard tone or an infrequent deviant or target tone. Temporal attention to tones was manipulated via constant foreperiods. We observed that the latency of the N2, an event-related potential reflecting perceptual processing, is shortened by temporal attention. This result provides first evidence for the idea that temporal attention accelerates perceptual processing as suggested by the prior entry hypothesis.	\N	\N
20890206	Perception-in-noise deficits have been demonstrated across many populations and listening conditions. Many factors contribute to successful perception of auditory stimuli in noise, including neural encoding in the central auditory system. Physiological measures such as cortical auditory-evoked potentials (CAEPs) can provide a view of neural encoding at the level of the cortex that may inform our understanding of listeners' abilities to perceive signals in the presence of background noise. To understand signal-in-noise neural encoding better, we set out to determine the effect of signal type, noise type, and evoking paradigm on the P1-N1-P2 complex. Tones and speech stimuli were presented to nine individuals in quiet and in three background noise types: continuous speech spectrum noise, interrupted speech spectrum noise, and four-talker babble at a signal-to-noise ratio of -3 dB. In separate sessions, CAEPs were evoked by a passive homogenous paradigm (single repeating stimulus) and an active oddball paradigm. The results for the N1 component indicated significant effects of signal type, noise type, and evoking paradigm. Although components P1 and P2 also had significant main effects of these variables, only P2 demonstrated significant interactions among these variables. Signal type, noise type, and evoking paradigm all must be carefully considered when interpreting signal-in-noise evoked potentials. Furthermore, these data confirm the possible usefulness of CAEPs as an aid to understand perception-in-noise deficits.	\N	\N
20932562	Functional neuroimaging studies of syntactic processing have been interpreted as identifying the neural locations of parsing and interpretive operations. However, current behavioral studies of sentence processing indicate that many operations occur simultaneously with parsing and interpretation. In this review, we point to issues that arise in discriminating the effects of these concurrent processes from those of the parser/interpreter in neural measures and to approaches that may help resolve them.	\N	\N
20934172	This paper presents the results of three studies of intelligibility and quality of speech recorded through a bone conduction microphone (BCM). All speech signals were captured and recorded using a Temco HG-17 BCM. Twelve locations on or close to the skull were selected for the BCM placement. In the first study, listeners evaluated the intelligibility and quality of the bone conducted speech signals presented through traditional earphones. Listeners in the second study evaluated the intelligibility and quality of signals presented through a loudspeaker. In the third study the signals were reproduced through a bone conduction headset; however, signal evaluation was limited to speech intelligibility only. In all three studies, the Forehead and Temple BCM locations yielded the highest intelligibility and quality rating scores. The Collarbone location produced the least intelligible and lowest quality signals across all tested BCM locations.	\N	\N
20950509	To assess the hearing changes associated with sacrificing an intact ossicular chain during cholesteatoma surgery. We reviewed the operation notes of surgical procedures performed by the senior author between October 2000 and April 2006. Thirty-three cases were identified in which cholesteatoma surgery had been performed in the presence of a mobile, intact ossicular chain. One set of case notes was missing; therefore, 32 cases were included in the analysis. The ossicular chain was preserved in 17 cases (14 males and three females) and sacrificed in 15 (eight males and seven females). At the first post-operative assessment, a median air-bone gap deterioration of 3.3 dB was seen in patients in whom the ossicular chain had been sacrificed, while a median air-bone gap improvement of 3.3 dB was seen in those in whom the chain had been preserved. However, multivariable logistic regression analysis suggested that this difference in hearing outcomes was due to pre-operative hearing status, and that preservation of the ossicular chain did not lead to a better outcome. In cholesteatoma surgery, there is at most a marginal benefit in preserving the ossicular chain. In the current study, the better hearing outcomes associated with preservation of the ossicular chain were accounted for by patients' better pre-operative hearing status. This study did not demonstrate a difference in residual disease rate, but was underpowered to do so.	\N	\N
20961518	Researchers often conduct visual world studies to investigate how listeners integrate linguistic information with prior context. Such studies are likely to generate anticipatory baseline effects (ABEs), differences in listeners' expectations about what a speaker might mention that exist before a critical speech stimulus is presented. ABEs show that listeners have attended to and accessed prior contextual information in time to influence the processing of the critical speech stimulus. However, further evidence is required to show that the information actually did influence subsequent processing. ABEs can compromise the validity of inferences about information integration if they are not appropriately controlled. We discuss four solutions: statistical estimation, experimental control, elimination of "on-target" trials, and neutral gaze. An experiment compares the performance of these solutions, and suggests that the elimination of on-target trials introduces bias in the direction of ABEs, due to the statistical phenomenon of regression toward the mean. We conclude that statistical estimation, possibly coupled with experimental control, offers the most valid and least biased solution.	\N	\N
21060141	To investigate the effects of increased syntactic complexity and utterance length demands on speech production and comprehension in individuals with Parkinson's disease (PD) using behavioral and physiological measures. Speech response latency, interarticulatory coordinative consistency, accuracy of speech production, and response latency and accuracy on a receptive language task were analyzed in 16 individuals with PD and 16 matched control participants. Individuals with PD had higher oral motor coordination variability, took a longer time to initiate speech, and made more errors on the speaking task compared with the control group. They also received lower scores on the 2 complex conditions of the receptive language task. Increased length and syntactic complexity negatively affected performance in both groups of speakers. These findings provide a novel window into the speech deficits associated with PD by examining performance on longer, sentence-level utterances in contrast to earlier investigations of single-word or nonword productions. Speech motor control processes and language comprehension were adversely affected in the majority of our participants with mild to moderate PD compared to the control group. Finally, increased syntactic complexity and sentence length affected both the healthy aging and PD groups' speech production performance at the behavioral and kinematic levels.	\N	\N
21067852	Integration of simultaneous auditory and visual information about an event can enhance our ability to detect that event. This is particularly evident in the perception of speech, where the articulatory gestures of the speaker's lips and face can significantly improve the listener's detection and identification of the message, especially when that message is presented in a noisy background. Speech is a particularly important example of multisensory integration because of its behavioural relevance to humans and also because brain regions have been identified that appear to be specifically tuned for auditory speech and lip gestures. Previous research has suggested that speech stimuli may have an advantage over other types of auditory stimuli in terms of audio-visual integration. Here, we used a modified adaptive psychophysical staircase approach to compare the influence of congruent visual stimuli (brief movie clips) on the detection of noise-masked auditory speech and non-speech stimuli. We found that congruent visual stimuli significantly improved detection of an auditory stimulus relative to incongruent visual stimuli. This effect, however, was equally apparent for speech and non-speech stimuli. The findings suggest that speech stimuli are not specifically advantaged by audio-visual integration for detection at threshold when compared with other naturalistic sounds.	\N	\N
21073461	Impaired cognitive control has been implicated as an important developmental pathway to attention deficit/hyperactivity disorder (ADHD). Cognitive control is crucial to suppress interference resulting from conflicting information and can be measured by Stroop-like tasks. This study was conducted to gain insight into conflict processing in children with ADHD. Event-related potentials (ERPs) were recorded in an auditory Stroop task. Twenty-four children with ADHD were compared with 24 control children (aged 8-12 years). No deficit in interference control was found on the auditory Stroop task in children with ADHD. Children with ADHD responded more slowly, less accurately and more variably compared to controls. No differences between the groups occurred in the early conflict-related ERPs. However, the difference between the congruent and the incongruent condition in the 450-550 ms time window was absent in the ADHD group compared to controls. In addition, the conflict sustained potential was found frontally in the ADHD group but parietally in the control group. These ERP findings suggest that children with ADHD evaluate conflict to a lesser extent and differ in the way their brains select appropriate responses during conflict compared with controls.	\N	\N
21144500	Phonetic variation has been considered a barrier that listeners must overcome in speech perception, but has been proved beneficial in category learning. In this paper, I show that listeners use within-speaker variation to accommodate gross categorical variation. Within the perceptual learning paradigm, listeners are exposed to p-initial words in English produced by a native speaker of French. Critically, listeners are trained on these words with either invariant or highly-variable VOTs. While a gross boundary shift is made for participants exposed to the variable VOTs, no such shift is observed after exposure to the invariant stimuli. These data suggest that increasing variation improves the mapping of perceptually mismatched stimuli.	\N	\N
21150681	To evaluate the results of late cochlear implantation in prelingually deaf patients with significant residual hearing loss and to evaluate patient factors relevant to postoperative auditory outcomes in this patient group. Analysis of results of cochlear implantation using postoperative speech perception test scores per each condition. Tertiary referral center. Thirty-two subjects with severe to profound hearing loss that developed before the age of 4. Subjects were implanted at a mean age of 24.8 years (range, 16-44) with Nucleus CI24 (n = 18, 56%), Clarion HiRes 90K (n = 11, 34%), and Medel PULSA (n = 3, 10%) device. Details of etiology, duration of deafness, hearing aid history, hearing thresholds before operation, communication mode, and educational environment were investigated. Speech perception tests were performed preoperatively and 12 months after the operation. Postoperative speech perception test scores between different options within patient group. The results showed significant improvement in open set speech perception (sentence) scores after the implantation (mean scores from 7.0 to 46.7, p < 0.05). Preoperative hearing of better ear and preoperative speech perception scores correlated with postoperative performances (r = -0.70 and r = 0.46, respectively, p < 0.05). Education and communication mode were also closely related to postoperative performances. In the group with poorer performances, preoperative hearing thresholds were significantly worse than those with better performances, and a larger portion of those patients attended special schools and used sign language. We found that residual auditory capacity in the better ear is an important factor in predicting outcomes after cochlear implantation in patients with prelingual hearing loss.	\N	\N
21161816	The present study focuses on language laterality as measured with dichotic listening (DL) to consonant-vowel syllables (CV syllables) in refugees with post-traumatic stress disorder (PTSD). PTSD is associated with impaired callosal transfer and with increased right hemisphere activation and impaired executive skills that could influence the processing of dichotic stimuli. A total of 22 participants with PTSD were compared to 23 participants without a diagnosis of PTSD. All participants had similar experiences of acts of war and political violence. They were tested with dichotic listening to CV syllables with free recall and directed attention following the forced attention paradigm. The PTSD group showed increased right ear advantage due to impaired left ear reporting and also smaller attention modulation compared to the control group, and the performance shared variance with self-report measures of arousal and intrusive memories. The results are discussed towards a model of impaired functionality of the frontal lobe and right hemisphere versus impaired callosal transfer, both yielding predictions for the processing of the left ear input and the ability to attention modulation of the performance.	\N	\N
21167923	Psychoacoustic studies have shown that complex tones containing resolved harmonics evoke stronger pitches than complex tones with only unresolved harmonics. Also, unresolved harmonics presented in alternating sine and cosine (ALT) phase produce a doubling of pitch. We examine here whether the temporal pattern of phase-locked neural activity reflected in the scalp recorded human frequency following response (FFR) preserves information relevant to pitch strength, and to the doubling of pitch for ALT stimuli. Results revealed stronger neural periodicity strength for resolved stimuli, although the effect of resolvability was weak compared to the effect observed behaviorally; autocorrelation functions and FFR spectra suggest a different pattern of phase-locked neural activity for ALT stimuli with resolved and unresolved harmonics consistent with the doubling of pitch observed in our behavioral estimates; and the temporal pattern of neural activity underlying pitch encoding appears to be similar at the auditory nerve (auditory nerve model response) and the rostral brainstem level (FFR). These findings suggest that the phase-locked neural activity reflected in the scalp recorded FFR preserves neural information relevant to pitch that could serve as an electrophysiological correlate of the behavioral pitch measure. The scalp recorded FFR may provide for a non-invasive analytic tool to evaluate neural encoding of complex sounds in humans.	\N	\N
21178803	To determine the response to treatment of pediatric patients diagnosed with autoimmune inner ear disease. Seven children who presented with sensorineural hearing loss and were diagnosed with autoimmune inner ear disease. Diagnosis through blood testing. Treatment with steroids and/or cytotoxic medication. Improvement in pure-tone average and speech discrimination scores on audiometric testing. Six of the 7 children (85.7%) improved with treatment, and the remaining patient had no measurable progression of disease. Children with autoimmune inner ear disease seem to benefit from treatment with steroids and/or cytotoxic medication. Although such medications must be used with caution in the pediatric population, they should not be withheld simply because of young age.	\N	\N
21196054	Humans and other animals can attend to one of multiple sounds and follow it selectively over time. The neural underpinnings of this perceptual feat remain mysterious. Some studies have concluded that sounds are heard as separate streams when they activate well-separated populations of central auditory neurons, and that this process is largely pre-attentive. Here, we argue instead that stream formation depends primarily on temporal coherence between responses that encode various features of a sound source. Furthermore, we postulate that only when attention is directed towards a particular feature (e.g. pitch) do all other temporally coherent features of that source (e.g. timbre and location) become bound together as a stream that is segregated from the incoherent features of other sources.	\N	\N
21198980	Important to Western tonal music is the relationship between pitches both within and between musical chords; melody and harmony are generated by combining pitches selected from the fixed hierarchical scales of music. It is of critical importance that musicians have the ability to detect and discriminate minute deviations in pitch in order to remain in tune with other members of their ensemble. Event-related potentials indicate that cortical mechanisms responsible for detecting mistuning and violations in pitch are more sensitive and accurate in musicians as compared with non-musicians. The aim of the present study was to address whether this superiority is also present at a subcortical stage of pitch processing. Brainstem frequency-following responses were recorded from musicians and non-musicians in response to tuned (i.e. major and minor) and detuned (± 4% difference in frequency) chordal arpeggios differing only in the pitch of their third. Results showed that musicians had faster neural synchronization and stronger brainstem encoding for defining characteristics of musical sequences regardless of whether they were in or out of tune. In contrast, non-musicians had relatively strong representation for major/minor chords but showed diminished responses for detuned chords. The close correspondence between the magnitude of brainstem responses and performance on two behavioral pitch discrimination tasks supports the idea that musicians' enhanced detection of chordal mistuning may be rooted at pre-attentive, sensory stages of processing. Findings suggest that perceptually salient aspects of musical pitch are not only represented at subcortical levels but that these representations are also enhanced by musical experience.	\N	\N
21211833	Metaphonological tasks, such as rhyme judgment, have been the primary tool for the investigation of the effects of orthographic knowledge on spoken language. However, it has been recently argued that the orthography effect in rhyme judgment does not reflect the automatic activation of orthographic codes but rather stems from sophisticated response strategies. Such a claim stands in sharp contrast with recent findings using event-related brain potentials (ERPs) in lexical and semantic tasks, which were taken to suggest that orthographic information occurs early enough to affect the core process of lexical access. Here, we show that the electrophysiological signature of the orthography effect in rhyme judgment is indeed different from the one obtained in online lexical or semantic tasks. That is, we did not find the orthography effect in the 300-350 ms time window which has previously been shown to process lexical information in the lexico-semantic tasks, but the effect appeared within the 175-250 ms and the 375-750 ms time-windows which we interpreted to reflect segmentation and decisional process, respectively. We conclude that the interactions between phonology and orthography are task-specific. Metaphonological tasks appear of limited use for studying the core processes and interactions that underlie lexical access.	\N	\N
21216396	We show that comprehenders' expectations about upcoming discourse coherence relations influence the resolution of local structural ambiguity. We employ cases in which two clauses share both a syntactic relationship and a discourse relationship, and hence in which syntactic and discourse processing might be expected to interact. An off-line sentence-completion study and an on-line self-paced reading study examined readers' expectations for high/low relative-clause attachments following implicit-causality and non-implicit causality verbs (John detests/babysits the children of the musician who…). In the off-line study, the widely reported low-attachment preference for English is observed in the non-implicit causality condition, but this preference gives way to more high attachments in the implicit-causality condition in cases in which (i) the verb's causally implicated referent occupies the high-attachment position and (ii) the relative clause provides an explanation for the event described by the matrix clause (e.g., …who are arrogant and rude). In the on-line study, a similar preference for high attachment emerges in the implicit-causality context (crucially, before the occurrence of any linguistic evidence that the RC does in fact provide an explanation), whereas the low-attachment preference is consistent elsewhere. These findings constitute the first demonstration that expectations about ensuing discourse coherence relationships can elicit full reversals in syntactic attachment preferences, and that these discourse-level expectations can affect on-line disambiguation as rapidly as lexical and morphosyntactic cues.	\N	\N
21236276	Categorical perception (CP) is a mechanism whereby non-identical stimuli that have the same underlying meaning become invariantly represented in the brain. Through behavioral identification and discrimination tasks, CP has been demonstrated to occur broadly across the auditory modality, including in perception of speech (e.g. phonemes) and music (e.g. chords) stimuli. Several functional imaging studies have linked CP of speech with activity in multiple regions of the left superior temporal sulcus (STS). As language processing is generally left-hemisphere dominant and, conversely, fine-grained spectral processing shows a right hemispheric bias, we hypothesized that CP of musical stimuli would be associated with right STS activity. Here, we used functional magnetic resonance imaging (fMRI) to test healthy, musically-trained volunteers as they (a) underwent a musical chord adaptation/habituation paradigm and (b) performed an active discrimination task on within- and between-category chord pairs, as well as an acoustically-matched, more continuously-perceived orthogonal sound set. As predicted, greater right STS activity was linked to categorical processing in both experimental paradigms. The results suggest that the left and right STS are functionally specialized and that the right STS may take on a key role in CP of spectrally complex sounds.	\N	\N
21236499	To evaluate the audiological outcome of children with congenital cytomegalovirus infection. In a prospective study, the hearing of ninety-seven congenitally cytomegalovirus-infected children, born between January 2003 and July 2009, was systematically evaluated until the age of six, applying the Flemish CMV protocol. Depending on the age of the child, the protocol provides hearing evaluation by objective-, play- or conventional audiometry. Symptomatic children with hearing loss at birth were treated with ganciclovir, if parents consented. Seventy children had a pass on initial screening, 27 had unilateral or bilateral hearing loss. Within the normal hearing group, one asymptomatic and two symptomatic children developed late-onset hearing loss. Within the group with hearing loss, 8 children received ganciclovir, while 8 symptomatic and 11 asymptomatic children did not receive ganciclovir. As for the treated group, 37.5% of the children had stable hearing loss, one child had progressive and one child had fluctuating hearing loss. Improvement of hearing threshold occurred in 37.5% of the children. Among the untreated symptomatic children, hearing loss remained stable in 50%, while progression occurred in 37.5%. In the group of asymptomatic children with hearing loss, hearing loss was most commonly stable (72.7%). Within the group of normal hearing ears at birth (n=156), there is a significantly better progression in pure tone average for ears of asymptomatic subjects in comparison to ears of symptomatic subjects (p≤0.0001). As for the group of ears with hearing loss at birth (n=38), analysis shows no evidence for a difference in pure tone average progression between the different groups (p=0.38). Cytomegalovirus infection may cause hearing loss, in both symptomatic and asymptomatic children. Our data show a significant difference, between both groups, in the progression of pure tone average of normal hearing ears at birth, in favor of the asymptomatic children. This is not the case for ears with hearing loss at birth. However, this may be due to the small number of ears in this group. Our data show the tendency that treatment with ganciclovir increases the likelihood of improvement and reduces the likelihood of deterioration of the hearing.	\N	\N
21241284	Several studies have found that women tend to demonstrate stronger preferences for masculine men as short-term partners than as long-term partners, though there is considerable variation among women in the magnitude of this effect. One possible source of this variation is individual differences in the extent to which women perceive masculine men to possess antisocial traits that are less costly in short-term relationships than in long-term relationships. Consistent with this proposal, here we show that the extent to which women report stronger preferences for men with low (i.e., masculine) voice pitch as short-term partners than as long-term partners is associated with the extent to which they attribute physical dominance and low trustworthiness to these masculine voices. Thus, our findings suggest that variation in the extent to which women attribute negative personality characteristics to masculine men predicts individual differences in the magnitude of the effect of relationship context on women's masculinity preferences, highlighting the importance of perceived personality attributions for individual differences in women's judgments of men's vocal attractiveness and, potentially, their mate preferences.	\N	\N
21251921	Recent neuroimaging studies proposed the importance of the anterior auditory pathway for speech comprehension. Its clinical significance is implicated by semantic dementia or pure word deafness. Neurodegenerative or cerebrovascular nature, however, precluded precise localization of the cortex responsible for speech perception. Electrical cortical stimulation could delineate such localization by producing transient, functional impairment. We investigated engagement of the left anterior temporal cortex in speech perception by means of direct electrical cortical stimulation. Subjects were two partial epilepsy patients, who underwent direct cortical stimulation as a part of invasive presurgical evaluations. Stimulus sites were coregistered to presurgical 3D-MRI, and then to MNI standard space for anatomical localization. Separate from the posterior temporal language area, electrical cortical stimulation revealed a well-restricted language area in the anterior part of the superior temporal sulcus and gyrus (aSTS/STG) in both patients. Auditory sentence comprehension was impaired upon electrical stimulation of aSTS/STG. In one patient, additional investigation revealed that the functional impairment was restricted to auditory sentence comprehension with preserved visual sentence comprehension and perception of music and environmental sounds. Both patients reported that they could hear the voice but not understand the sentence well (e.g., heard as a series of meaningless utterances). The standard coordinates of this restricted area at left aSTS/STG corresponded well with the coordinates of speech perception reported in neuroimaging activation studies in healthy subjects. The present combined anatomo-functional case study, for the first time, demonstrated that aSTS/STG in the language dominant hemisphere actively engages in speech perception.	\N	\N
21255123	To advance our understanding of the biological basis of speech-in-noise perception, we investigated the effects of background noise on both subcortical- and cortical-evoked responses, and the relationships between them, in normal hearing young adults. The addition of background noise modulated subcortical and cortical response morphology. In noise, subcortical responses were later, smaller in amplitude and demonstrated decreased neural precision in encoding the speech sound. Cortical responses were also delayed by noise, yet the amplitudes of the major peaks (N1, P2) were affected differently, with N1 increasing and P2 decreasing. Relationships between neural measures and speech-in-noise ability were identified, with earlier subcortical responses, higher subcortical response fidelity and greater cortical N1 response magnitude all relating to better speech-in-noise perception. Furthermore, it was only with the addition of background noise that relationships between subcortical and cortical encoding of speech and the behavioral measures of speech in noise emerged. Results illustrate that human brainstem responses and N1 cortical response amplitude reflect coordinated processes with regards to the perception of speech in noise, thereby acting as a functional index of speech-in-noise perception.	\N	\N
21261450	Many models of spoken word recognition posit that the acoustic stream is parsed into phoneme level units, which in turn activate larger representations [McClelland, J. L., & Elman, J. L. The TRACE model of speech perception. Cognitive Psychology, 18, 1-86, 1986], whereas others suggest that larger units of analysis are activated without the need for segmental mediation [Greenberg, S. A multitier theoretical framework for understanding spoken language. In S. Greenberg & W. A. Ainsworth (Eds.), Listening to speech: An auditory perspective (pp. 411-433). Mahwah, NJ: Erlbaum, 2005; Klatt, D. H. Speech perception: A model of acoustic-phonetic analysis and lexical access. Journal of Phonetics, 7, 279-312, 1979; Massaro, D. W. Preperceptual images, processing time, and perceptual units in auditory perception. Psychological Review, 79, 124-145, 1972]. Identifying segmental effects in the brain's response to speech may speak to this question. For example, if such effects were localized to relatively early processing stages in auditory cortex, this would support a model of speech recognition in which segmental units are explicitly parsed out. In contrast, segmental processes that occur outside auditory cortex may indicate that alternative models should be considered. The current fMRI experiment manipulated the phonotactic frequency (PF) of words that were auditorily presented in short lists while participants performed a pseudoword detection task. PF is thought to modulate networks in which phoneme level units are represented. The present experiment identified activity in the left inferior frontal gyrus that was positively correlated with PF. No effects of PF were found in temporal lobe regions. We propose that the observed phonotactic effects during speech listening reflect the strength of the association between acoustic speech patterns and articulatory speech codes involving phoneme level units. On the basis of existing lesion evidence, we interpret the function of this auditory-motor association as playing a role primarily in production. These findings are consistent with the view that phoneme level units are not necessarily accessed during speech recognition.	\N	\N
21261634	Tones that are self-generated elicit a smaller N1 than externally triggered tones. Typically, however, self-generated tones are also more predictable in time than externally triggered ones. The present study investigated whether the attenuated N1 can be explained by predictability based on the temporal relationship between action and effect. Participants listened to tones that were self-generated by a key-press or preceded by a visual cue. The tones followed the key-presses or cues after a fixed (predictable context) or variable delay (unpredictable context). Tones triggered by a key-press elicited a smaller N1 than tones following a visual cue. This finding suggests that the reduced N1 to self-generated tones is not merely due to the fact that the tone's timing can be predicted based on its temporal relationship to the key-press. Whether a tone was presented in a predictable or an unpredictable context did not affect the N1.	\N	\N
21264707	When two targets are presented in rapid succession at the same spatial location, processing of the first is highly efficient, while processing of the second is often profoundly impaired at brief inter-target intervals (attentional blink; AB). While the AB has been shown to impact many processes, it is still unclear whether this includes the ability to shift spatial attention. The present study examined this question using a more sensitive dependent measure than past studies; namely, response times. It also evaluated whether masking of the cue stimulus modulated the effect of the AB on spatial shifts. The results showed significant cueing effects on T2 response times that were strongly modulated by the AB. This supports suggested links between mechanisms underlying object processing and spatial shifts of attention.	\N	\N
21264734	The fast and accurate enumeration of a small set of objects, called subitizing, is thought to involve a different mechanism from other numerosity judgments, such as those based on estimation. In this report, we examine the subitizing limit using a novel enumeration task that obtained the perceived locations of enumerated objects. Observers were shown brief masked displays (50, 200, and 350 ms) of 2-9 small black discs randomly placed on a gray screen and then asked to place a marker where each disc had been located. The number of these markers provided an estimate of the number of items processed. This "pointing" methodology enabled observers to accurately "enumerate" displays containing up to six items in contrast with the four-item limit typically found when using standard reporting methods (and replicated here in Experiment 2). These results suggest a different account of the limits found in most subitizing and enumeration studies.	\N	\N
21272630	The auditory system can encode interaural delays in highpass-filtered complex sounds by phase locking to their slowly modulating envelopes. Spectrotemporal analysis of interaurally time-delayed highpass waveforms reveals the presence of a concomitant interaural level cue. The current study systematically investigated the contribution of time and concomitant level cues carried by positive and negative envelope slopes of a modified sinusoidally amplitude-modulated (SAM) high-frequency carrier. The waveforms were generated from concatenation of individual modulation cycles whose envelope peaks were extended by the desired interaural delay, allowing independent control of delays in the positive and negative modulation slopes. In experiment 1, thresholds were measured using a 2-interval forced-choice adaptive task for interaural delays in either the positive or negative modulation slopes. In a control condition, thresholds were measured for a standard SAM tone. In experiment 2, decision weights were estimated using a multiple-observation correlational method in a single-interval forced-choice task for interaural delays carried simultaneously by the positive, and independently, negative slopes of the modulation envelope. In experiment 3, decision weights were measured for groups of 3 modulation cycles at the start, middle, and end of the waveform to determine the influence of onset dominance or recency effects. Results were consistent across experiments: thresholds were equal for the positive and negative modulation slopes. Decision weights were positive and equal for the time cue in the positive and negative envelope slopes. Weights were also larger for modulation cycles near the waveform onset. Weights estimated for the concomitant interaural level cue were positive for the positive envelope slope and negative for the negative slope, consistent with exclusive use of time cues.	\N	\N
21272930	Using event-related potentials (ERPs), we investigated the N400 (an ERP component that occurs in response to meaningful stimuli) in children aged 8-10 years old and examined relationships between the N400 and individual differences in listening comprehension, word recognition and non-word decoding. Moreover, we tested the claim that the N400 effect provides a valuable indicator of behavioural vocabulary knowledge. Eighteen children were presented with picture-word pairs that were either 'congruent' (the picture depicted the spoken word) or 'incongruent' (they were unrelated). Three peaks were observed in the ERP waveform triggered to the onset of the picture-word stimuli: an N100 in fronto-central channels, an N200 in central-parietal channels and an N400 in frontal, central and parietal channels. In contrast to the N100 peak, the N200 and N400 peaks were sensitive to semantic incongruency with greater peak amplitudes for incongruent than congruent conditions. The incongruency effects for each peak correlated positively with listening comprehension but when the peak amplitudes were averaged across congruent/incongruent conditions they correlated positively with non-word decoding. These findings provide neurophysiological support for the position that sensitivity to semantic context (reflected in the N400 effect) is crucial for comprehension whereas phonological decoding skill relates to more general processing differences reflected in the ERP waveform. There were no correlations between ERP and behavioural measures of expressive or receptive vocabulary knowledge for the same items, suggesting that the N400 effect may not be a reliable estimate of vocabulary knowledge in children aged 8-10 years.	\N	\N
21275499	Hearing-aid users' problems with their own voice caused by occlusion are well known. Conversely, it remains essentially undocumented whether hearing-aid users expected not to have occlusion-related problems experience own-voice issues. To investigate this topic, a dedicated Own Voice Qualities (OVQ) questionnaire was developed and used in two experiments with stratified samples. In the main experiment, the OVQ was administered to 169 hearing-aid users (most of whom were expected not to have occlusion-related problems) and to a control group of 56 normally-hearing people. In the follow-up experiment, the OVQ was used in a cross-over study where 43 hearing-aid users rated own voice for an open fitting and a small-vent earmould fitting. The results from the main experiment show that hearing-aid users (without occlusion) have more problems than the normal-hearing controls on several dimensions of own voice. The magnitude of these differences was found to be generally larger than the differences observed between the open fitting and the small-vent fitting in the follow-up experiment. This suggests that own voice is a potentially important concern, even for hearing-aid users who are not expected to have occlusion-related problems.	\N	\N
21295773	Does extensive practice reduce or eliminate central interference in dual-task processing? We explored the reorganization of task architecture with practice by combining interference analysis (delays in dual-task experiment) and random-walk models of decision making (measuring the decision and non-decision contributions to RT). The main delay observed in the Psychologically Refractory Period at short stimulus onset asynchrony (SOA) values was largely unaffected by training. However, the range of SOAs over which this interference regime held diminished with learning. This was consistent with an overall shift observed in single-task performance from a highly variable decision time to a reliable (non-decision time) contribution to response time. Executive components involved in coordinating dual-task performance decreased (and became more stable) after extensive practice. The results suggest that extensive practice reduces the duration of central decision stages, but that the qualitative property of central seriality remains a structural invariant.	\N	\N
21299376	To implement a fast method for measuring psychophysical tuning curves (PTCs) for use in clinical applications, such as assessment of frequency selectivity and detection of dead regions in the cochlea. The method is based on that described by Sek et al (2005) and has been implemented in software that can be run on a PC with a good-quality sound card. In addition to the main narrowband noise masker, a lowpass noise masker can be generated to prevent detection of a distortion band corresponding to the simple difference tone. The software includes a routine for measuring the absolute threshold at the signal frequency and includes methods for estimating the frequency at the tip of the PTC. A PTC can typically be determined in about three minutes. A small amount of practice (two to three runs) may be required to achieve stable results. The software implementation allows PTCs to be measured quickly without a requirement for specialised equipment.	\N	\N
21305548	To investigate the level of hearing loss and the configuration of the mean audiometric curve over the course of Menière's disease, correcting the data according to patient age. A retrospective study of 3,963 hearing tests. Descriptive, longitudinal study of pure-tone audiometries of 237 patients at a tertiary hospital who had been diagnosed with definitive Menière's disease according to the American Academy of Otorhinolaryngology criteria. All audiometric results were age-corrected, and patients were followed for 1 to 31 years. In patients who had undergone surgery, only the data collected before the operation were assessed. In patients with unilateral disease, the mean hearing loss was characteristically low frequency, even in very advanced stages of the disease. Hearing loss was accentuated at 5 and 15 years from onset. In bilateral cases, hearing loss was slightly more severe and the average loss produced a flatter audiometric curve than in unilateral cases. In Menière's disease, audiometry results corrected for patient age show an inherent upward-sloping configuration of the mean audiometric curve at all time points during the disease. The hearing pattern differs between unilateral and bilateral disease. The audiometric curve configuration may be an indicator of future bilateral disease.	\N	\N
21305666	Both sighted and blind individuals can readily interpret meaning behind everyday real-world sounds. In sighted listeners, we previously reported that regions along the bilateral posterior superior temporal sulci (pSTS) and middle temporal gyri (pMTG) are preferentially activated when presented with recognizable action sounds. These regions have generally been hypothesized to represent primary loci for complex motion processing, including visual biological motion processing and audio-visual integration. However, it remained unclear whether, or to what degree, life-long visual experience might impact functions related to hearing perception or memory of sound-source actions. Using functional magnetic resonance imaging (fMRI), we compared brain regions activated in congenitally blind versus sighted listeners in response to hearing a wide range of recognizable human-produced action sounds (excluding vocalizations) versus unrecognized, backward-played versions of those sounds. Here, we show that recognized human action sounds commonly evoked activity in both groups along most of the left pSTS/pMTG complex, though with relatively greater activity in the right pSTS/pMTG by the blind group. These results indicate that portions of the postero-lateral temporal cortices contain domain-specific hubs for biological and/or complex motion processing independent of sensory-modality experience. Contrasting the two groups, the sighted listeners preferentially activated bilateral parietal plus medial and lateral frontal networks, whereas the blind listeners preferentially activated left anterior insula plus bilateral anterior calcarine and medial occipital regions, including what would otherwise have been visual-related cortex. These global-level network differences suggest that blind and sighted listeners may preferentially use different memory retrieval strategies when hearing and attempting to recognize action sounds.	\N	\N
21309643	To measure the mental health status of deaf adolescents with cochlear implants (CI). STUDY SAMPLE AND DESIGN: We used the "Strengths and Difficulties Questionnaire" (SDQ) to assess the mental health problems of 32 adolescents with CI (mean age 15.0 years) and 212 normal hearing peers (mean age 15.0 years). Parent and teacher ratings for the CI subjects (ES emotional symptoms, HA inattention-hyperactivity, CP conduct-problems and PBS pro-social behavior) did not differ significantly from the results of normal hearing peers. However, teachers rated significantly more cases as having peer problems (PP) and more cases as having very high (clinical) total difficulty scores (TDS) in the CI group. The SDQ results of the CI users correlated significantly with poor results in auditory performance and special school education. The age at CI implantation was not found to be correlated with emotional, behavioral and social problems. Our findings indicate that the mental health of deaf adolescents with CI is comparable to that of normal hearing peers.	\N	\N
21315158	Music perception generally involves processing the frequency relationships between successive pitches and extraction of the melodic contour. Previous evidence has suggested that the 'ups' and 'downs' of melodic contour are categorically and automatically processed, but knowledge of the brain regions that discriminate different types of contour is limited. Here, we examined melodic contour discrimination using multivariate pattern analysis (MVPA) of fMRI data. Twelve non-musicians were presented with various ascending and descending melodic sequences while being scanned. Whole-brain MVPA was used to identify regions in which the local pattern of activity accurately discriminated between contour categories. We identified three distinct cortical loci: the right superior temporal sulcus (rSTS), the left inferior parietal lobule (lIPL), and the anterior cingulate cortex (ACC). These results complement previous findings of melodic processing within the rSTS, and extend our understanding of the way in which abstract auditory sequences are categorized by the human brain.	\N	\N
21316354	This study examined the electrophysiological correlates of auditory and visual working memory in children with Specific Language Impairments (SLI). Children with SLI and age-matched controls (11;9-14;10) completed visual and auditory working memory tasks while event-related potentials (ERPs) were recorded. In the auditory condition, children with SLI performed similarly to controls when the memory load was kept low (1-back memory load). As expected, when demands for auditory working memory were higher, children with SLI showed decreases in accuracy and attenuated P3b responses. However, children with SLI also evinced difficulties in the visual working memory tasks. In both the low (1-back) and high (2-back) memory load conditions, P3b amplitude was significantly lower for the SLI as compared to CA groups. These data suggest a domain-general working memory deficit in SLI that is manifested across auditory and visual modalities.	\N	\N
21325685	Our auditory system separates and tracks temporally interleaved sound sources by organizing them into distinct auditory streams. This streaming phenomenon is partly determined by physical stimulus properties but additionally depends on the internal state of the listener. As a consequence, streaming perception is often bistable and reversals between one- and two-stream percepts may occur spontaneously or be induced by a change of the stimulus. Here, we used functional MRI to investigate perceptual reversals in streaming based on interaural time differences (ITD) that produce a lateralized stimulus perception. Listeners were continuously presented with two interleaved streams, which slowly moved apart and together again. This paradigm produced longer intervals between reversals than stationary bistable stimuli but preserved temporal independence between perceptual reversals and physical stimulus transitions. Results showed prominent transient activity synchronized with the perceptual reversals in and around the auditory cortex. Sustained activity in the auditory cortex was observed during intervals where the ΔITD could potentially produce streaming, similar to previous studies. A localizer-based analysis additionally revealed transient activity time locked to perceptual reversals in the inferior colliculus. These data suggest that neural activity associated with streaming reversals is not limited to the thalamo-cortical system but already involves early binaural processing in the auditory midbrain.	\N	\N
21327367	Identification of the second of two targets (T1, T2, inserted in a stream of distractors) is impaired when presented within 500 ms after the first (attentional blink, AB). Barring a T1-T2 task-switch, it is thought that T2 must be backward-masked to obtain an AB (Giesbrecht & Di Lollo, Journal of Experimental Psychology: Human Perception and Performance, 24, 1454-1466, 1998). We tested the hypothesis that Giesbrecht & Di Lollo's findings were vitiated by ceiling constraints arising from either response scale (experiment 1) or data limitations (experiment 2). In experiment 1, digit-distractors were replaced with pseudoletters to increase task difficulty, bringing performance below ceiling. An AB occurred without backward masking of T2. In experiment 2, a ceiling-free procedure estimated the number of noise dots needed for 80% T2 identification. An AB was revealed: fewer noise dots were required during the AB period than outside it. Both outcomes confirm that an AB can be obtained without either masking of T2 or task switching.	\N	\N
21335029	Brain-computer interface (BCI) is a developing, novel mode of communication for individuals with severe motor impairments or those who have no other options for communication aside from their brain signals. However, the majority of current BCI systems are based on visual stimuli or visual feedback, which may not be applicable for severe locked-in patients that have lost their eyesight or the ability to control their eye movements. In the present study, we investigated the feasibility of using auditory steady-state responses (ASSRs), elicited by selective attention to a specific sound source, as an electroencephalography (EEG)-based BCI paradigm. In our experiment, two pure tone burst trains with different beat frequencies (37 and 43 Hz) were generated simultaneously from two speakers located at different positions (left and right). Six participants were instructed to close their eyes and concentrate their attention on either auditory stimulus according to the instructions provided randomly through the speakers during the inter-stimulus interval. EEG signals were recorded at multiple electrodes mounted over the temporal, occipital, and parietal cortices. We then extracted feature vectors by combining spectral power densities evaluated at the two beat frequencies. Our experimental results showed high classification accuracies (64.67%, 30 commands/min, information transfer rate (ITR) = 1.89 bits/min; 74.00%, 12 commands/min, ITR = 2.08 bits/min; 82.00%, 6 commands/min, ITR = 1.92 bits/min; 84.33%, 3 commands/min, ITR = 1.12 bits/min; without any artifact rejection, inter-trial interval = 6s), enough to be used for a binary decision. Based on the suggested paradigm, we implemented a first online ASSR-based BCI system that demonstrated the possibility of materializing a totally vision-free BCI system.	\N	\N
21342695	The purpose of this exploratory study was to assess perceptions of quality of life for individuals with hearing impairment who have not consulted for services and their significant others who are in same-sex relationships vs. those who are in different-sex relationships. Data were collected on a total of 20 older couples: 10 in same-sex relationships and 10 in different-sex relationships. In each of the couples, one member self-identified as having hearing impairment. The couples completed an audiologic evaluation, a disease-specific quality of life questionnaire, and a short, structured interview (which served as a general measure of quality of life). No differences between the groups were found on demographic or audiologic variables. Differences between the groups and within the couples were found on the disease-specific and overall quality of life measures. Participants with hearing impairment in different-sex relationships reported significantly more total consequences of hearing impairment than those in the same-sex relationships. Differences were found in the rate of reporting for various contributors to overall quality of life and consequences of hearing impairment on quality of life. There was more congruity between same-sex couples than different-sex couples. There appear to be important differences in perceptions of both disease-specific and overall quality of life based on sexual orientation for older couples who have not consulted for hearing services. These differences can help inform clinical practice with this under-researched population. Readers will be able to: (1) Describe quality of life variables for individuals with hearing problems in same- and different-sex relationships, (2) understand the differences in quality of life variables between same- and different-sex couples, (3) consider the clinical implications of these quality of life variables.	\N	\N
21358012	The aim of the current study is to investigate hearing function in patients with allergic rhinitis. Fifty-eight patients with positive skin prick test (Group 1) (116 ears) and 31 subjects with negative skin prick test (62 ears) as group 2 were included. Pure tone audiometry at 250, 500, 1000, 2000, 4000 and 8000 Hz and immittance measures, including tympanometry and acoustic reflex tests, were performed in both groups. There was a statistically significant difference between the pure-tone thresholds of group 1 and group 2 at 8000 Hz (p < 0.05). Based on our study, the patients with allergic rhinitis had better hearing than the control group at 8000 Hz.	\N	\N
21361412	Perturbation analysis was used to determine the relative contribution of target enhancement and noise cancellation in the identification of a rudimentary sound source in noise. In a two-interval, forced-choice procedure, listeners identified the impact sound produced by the larger of two stretched membranes as target. The noise on each presentation was the impact sound of a variable-sized plate. For four of five listeners, the relative weights on the noise were positive, indicating enhancement, and for the remaining listener, they were negative, indicating cancellation. The results underscore the difficulty with evaluating models of masking solely in terms of measures of performance accuracy.	\N	\N
21361445	Low-frequency masking by intense high-frequency noise bands, referred to as remote masking (RM), was the first evidence to challenge energy-detection models of signal detection. Its underlying mechanisms remain unknown. RM was measured in five normal-hearing young-adults at 250, 350, 500, and 700 Hz using equal-power, spectrally matched random-phase noise (RPN) and low-noise noise (LNN) narrowband maskers. RM was also measured using equal-power, two-tone complex (TC2) and eight-tone complex (TC8). Maskers were centered at 3000 Hz with one or two equivalent rectangular bandwidths (ERBs). Masker levels varied from 80 to 95 dB sound pressure level in 5 dB steps. LNN produced negligible masking for all conditions. An increase in bandwidth in RPN yielded greater masking over a wider frequency region. Masking for TC2 was limited to 350 and 700 Hz for one ERB but shifted to only 700 Hz for two ERBs. A spread of masking to 500 and 700 Hz was observed for TC8 when the bandwidth was increased from one to two ERBs. Results suggest that high-frequency noise bands at high levels could generate significant low-frequency masking. It is possible that listeners experience significant RM due to the amplification of various competing noises that might have significant implications for speech perception in noise.	\N	\N
21361446	The additivity of nonsimultaneous masking was studied using Gaussian-shaped tone pulses (referred to as Gaussians) as masker and target stimuli. Combinations of up to four temporally separated Gaussian maskers with an equivalent rectangular bandwidth of 600 Hz and an equivalent rectangular duration of 1.7 ms were tested. Each masker was level-adjusted to produce approximately 8 dB of masking. Excess masking (exceeding linear additivity) was generally stronger than reported in the literature for longer maskers and comparable target levels. A model incorporating a compressive input/output function, followed by a linear summation stage, underestimated excess masking when using an input/output function derived from literature data for longer maskers and comparable target levels. The data could be predicted with a more compressive input/output function. Stronger compression may be explained by assuming that the Gaussian stimuli were too short to evoke the medial olivocochlear reflex (MOCR), whereas for longer maskers tested previously the MOCR caused reduced compression. Overall, the interpretation of the data suggests strong basilar membrane compression for very short stimuli.	\N	\N
21368435	A human factors experiment employed a hemi-anechoic sound field in which listeners were required to localize a vehicular backup alarm warning signal (both a standard and a frequency-augmented alarm) in 360-degrees azimuth in pink noise of 60 dBA and 90 dBA. Measures of localization performance included: (1) percentage correct localization, (2) percentage of right-left localization errors, (3) percentage of front-rear localization errors, and (4) localization absolute deviation in degrees from the alarm's actual location. In summary, the data demonstrated that, with some exceptions, normal hearing listeners' ability to localize the backup alarm in 360-degrees azimuth did not improve when wearing augmented hearing protectors (including dichotic sound transmission earmuffs, flat attenuation earplugs, and level-dependent earplugs) as compared to when wearing conventional passive earmuffs or earplugs of the foam or flanged types. Exceptions were that in the 90 dBA pink noise, the flat attenuation earplug yielded significantly better accuracy than the polyurethane foam earplug and both the dichotic and the custom-made diotic electronic sound transmission earmuffs. However, the flat attenuation earplug showed no benefit over the standard pre-molded earplug, the arc earplug, and the passive earmuff. Confusions of front-rear alarm directions were most significant in the 90 dBA noise condition, wherein two types of triple-flanged earplugs exhibited significantly fewer front-rear confusions than either of the electronic muffs. On all measures, the diotic sound transmission earmuff resulted in the poorest localization of any of the protectors due to the fact that its single-microphone design did not enable interaural cues to be heard. Localization was consistently more degraded in the 90 dBA pink noise as compared with the relatively quiet condition of the 60 dBA pink noise. A frequency-augmented backup alarm, which incorporated 400 Hz and 4000 Hz components to exploit the benefits of interaural phase and intensity cues respectively, slightly but significantly improved localization compared with the standard, more narrow-bandwidth backup alarm, and these results have implications for the updating of backup alarm standards.	\N	\N
21368442	Numerous studies have shown that the reliability of using laboratory measurements to predict individual or even group hearing protector attenuation for occupationally exposed workers is quite poor. This makes it difficult to properly assign hearing protectors when one wishes to closely match attenuation to actual exposure. An alternative is the use of field-measurement methods, a number of which have been proposed and are beginning to be implemented. We examine one of those methods, namely the field microphone-in-real-ear (F-MIRE) approach in which a dual-element microphone probe is used to measure noise reduction by quickly sampling the difference in noise levels outside and under an earplug, with appropriate adjustments to predict real-ear attenuation at threshold (REAT). We report on experiments that validate the ability of one commercially available F-MIRE device to predict the REAT of an earplug fitted identically for two tests. Results are reported on a representative roll-down foam earplug, stemmed-style pod plug, and pre-molded earplug, demonstrating that the 95% confidence level of the Personal Attenuation Rating (PAR) as a function of the number of fits varies from ± 4.4 dB to ± 6.3 dB, depending on the plug type, which can be reduced to ± 3.1 dB to ± 4.5 dB with a single repeat measurement. The added measurement improves precision substantially. However, the largest portion of the error is due to the user's fitting variability and not the uncertainty of the measurement system. Further, we evaluated the inherent uncertainty of F-MIRE vs. the putative "gold standard" REAT procedures, finding that F-MIRE measurement uncertainty is less than one-half that of REAT at most test frequencies. An American National Standards Institute (ANSI) working group (S12/WG11) is currently involved in developing methods similar to those in this paper so that procedures for evaluating and reporting uncertainty on all types of field attenuation measurement systems can be standardized. We conclude that the hearing conservationist now has available a portable, convenient, quick, and easy-to-use system that can improve training and motivation of employees, assign hearing protection devices based on noise exposures, and address other management and compliance issues.	\N	\N
21380585	With a new metric called phonological Levenshtein distance (PLD20), the present study explores the effects of phonological similarity and word frequency on spoken word recognition, using polysyllabic words that have neither phonological nor orthographic neighbors, as defined by neighborhood density (the N-metric). Inhibitory effects of PLD20 were observed for these lexical hermits: Close-PLD20 words were recognized more slowly than distant PLD20 words, indicating lexical competition. Importantly, these inhibitory effects were found only for low- (not high-) frequency words, in line with previous findings that phonetically related primes inhibit recognition of low-frequency words. These results indicate that the properties of PLD20--a continuous measure of word-form similarity--make it a promising new metric for quantifying phonological distinctiveness in spoken word recognition research.	\N	\N
21382386	Although developmental dyslexia is often referred to as a cross-modal disturbance, tests of different modalities using the same stimuli are lacking. We compared the performance of 23 children with dyslexia and 42 chronologically matched control readers on reading versus repetition tasks and visual versus auditory lexical decision using the same stimuli. With respect to control readers, children with dyslexia were impaired only on stimuli in the visual modality; they had no deficit on the repetition and auditory lexical decision tasks. By applying the rate-amount model (Faust et al., 1999), we showed that performance of children with dyslexia on visual (but not auditory) tasks was associated with that of control readers by a linear relationship (with a 1.78 slope), suggesting that a global factor accounts for visual (but not auditory) task performance. We conclude that the processing of linguistic stimuli in the visual and auditory modalities is carried out by independent processes and that dyslexic children have a selective deficit in the visual modality.	\N	\N
21388613	Perceptual implicit memory is typically most robust when the perceptual processing at encoding matches the perceptual processing required during retrieval. A consistent exception is the robust priming that semantic generation produces on the perceptual identification test (Masson & MacLeod, 2002), a finding which has been attributed to either (1) conceptual influences in this nominally perceptual task, or (2) covert orthographic processing during generative encoding. The present experiments assess these possibilities using both auditory and visual perceptual identification, tests in which participants identify auditory words in noise or rapidly-presented visual words. During the encoding phase of the experiments, participants generated some words and perceived others in an intermixed study list. The perceptual control condition was visual (reading) or auditory (hearing), and varied across participants. The reading and hearing conditions exhibited the expected modality-specificity, producing robust intra-modal priming and non-significant cross-modal priming. Priming in the generate condition depended on the perceptual control condition. With a read control condition, semantic generation produced robust visual priming but no auditory priming. With a hear control condition, the results were reversed: semantic generation produced robust auditory priming but not visual priming. This set of results is not consistent with a straightforward application of either the conceptual-influence or covert-orthography account, and implies that the nature of encoding in the generate condition is influenced by the broader list context.	\N	\N
21389700	Bone conduction (BC) is the way sound energy is transmitted by the skull bones to the cochlea causing a sound perception. Even if the BC sound transmission involves several pathways including sound pressure induced in the ear canal, inertial forces acting on the middle ear ossicles and cochlear fluids, alteration of the cochlear space, and pressure transmission through the 3rd window of the cochlea, the BC sound ultimately produces a wave motion on the basilar membrane similar to that of air-conducted sound. The efficiency of the BC stimulation is largely dependent on the skull bone where the skull acts as a rigid body at low frequencies and incorporates different types of wave transmission at higher frequencies. The interaural stimulation difference is determined by the difference between contralateral and ipsilateral BC sound transmission: the transcranial BC sound transmission. To benefit from binaural processing, the transcranial transmission should be low, while the same should be high when using BC hearing aids for unilateral deaf subjects. By appropriately positioning the stimulation, high or low transcranial transmission can be achieved.	\N	\N
21390207	Human languages evolve continuously, and a puzzling problem is how to reconcile the apparent robustness of most of the deep linguistic structures we use with the evidence that they undergo possibly slow, yet ceaseless, changes. Is the state in which we observe languages today closer to what would be a dynamical attractor with statistically stationary properties or rather closer to a non-steady state slowly evolving in time? Here we address this question in the framework of the emergence of shared linguistic categories in a population of individuals interacting through language games. The observed emerging asymptotic categorization, which has been previously tested--with success--against experimental data from human languages, corresponds to a metastable state where global shifts are always possible but progressively more unlikely and the response properties depend on the age of the system. This aging mechanism exhibits striking quantitative analogies to what is observed in the statistical mechanics of glassy systems. We argue that this can be a general scenario in language dynamics where shared linguistic conventions would not emerge as attractors, but rather as metastable states.	\N	\N
21398015	A meaningful interaction with our environment relies on the ability to focus on relevant sensory input and to ignore irrelevant information, i.e. top-down control and attention processes are employed to select from competing stimuli following internal goals. In this, the demands for the recruitment of top-down control processes depend on the relative perceptual salience of the competing stimuli. In the present functional magnetic resonance imaging (fMRI) study, we investigated the recruitment of top-down control processes in response to varying degrees of control demands in the auditory modality. For this purpose, we tested 20 male and 20 female subjects with a dichotic listening paradigm, in which the relative perceptual salience of two simultaneously presented stimuli was systematically manipulated by varying the inter-aural intensity difference (IID) and asking the subjects to selectively attend to either ear. The analysis showed that the interaction between IID and attentional direction involves two networks in the brain. A fronto-parietal network, including the pre-supplementary motor area, anterior cingulate cortex, inferior frontal junction, insula and inferior parietal lobe, was recruited during cognitively demanding conditions and can thus be seen as a top-down cognitive control network. In contrast, a second network including the superior temporal and the post-central gyri was engaged under conditions with low cognitive control demands. These findings demonstrate how cognitive control is achieved through the interplay of distinct brain networks, with their differential engagement determined as a function of the level of competition between the sensory stimuli.	\N	\N
21417674	This study was designed to evaluate an automated pure-tone audiometric procedure (AMTAS®) for 4-8 year-old children, and a quality assessment method (QUALIND®) that predicts the accuracy of the test. Children were tested with AMTAS and conventional manual air-conduction audiometry. A group of adults was tested for comparison. Eighty-one 4-8 year-old children and 15 adults. Most had normal hearing. For most subjects (93% of adults and 91% of children) differences between AMTAS and manual thresholds were similar to differences that occur when two experienced audiologists test the same subjects. QUALIND detected the inaccurate audiograms with a sensitivity of 71% and a specificity of 91%. When inaccurate audiograms identified by QUALIND are excluded, the accuracy of AMTAS is similar to the accuracy of manual audiometry. AMTAS produces accurate air-conduction audiograms in a high proportion of 4-8 year-old children and adults. QUALIND successfully identified most inaccurate AMTAS audiograms. The method can decrease the cost and increase efficiency and accessibility of hearing testing.	\N	\N
21420739	Perception of environmental sounds from impacted materials (Wood, Metal and Glass) was examined by conducting a categorization experiment. Stimuli consisted of sound continua evoking progressive transitions between material categories. Results highlighted shallower response curves in subjects with schizophrenia than healthy participants, and are discussed in the framework of Signal Detection Theory and in terms of impaired perception of specific timbre features in schizophrenia.	\N	\N
21420989	There is ongoing debate with respect to interpretation of the finding that, in contrast to perceptual size judgments, actions are relatively unaffected by the Müller-Lyer illusion. In normal unrestricted viewing situations observers cannot perform an action directed at an object without simultaneously perceiving the object - this makes it difficult to unequivocally establish whether observed effects are a function of vision for perception, vision for action, a combination of both, or of a single all-purpose visual system. However, there is evidence that observers are capable of performing actions towards objects of which they are not consciously aware, implying that two distinct visual thresholds may exist; one accompanying vision for action and one accompanying vision for perception. To investigate this possibility we created a situation in which visual information was presented below the perception threshold, but above the purported action threshold, allowing examination of action responses independent of contributions from vision for perception. Following a perceptual categorization task, participants performed delayed pointing movements towards briefly exposed masked Müller-Lyer targets of different sizes. When the targets were presented below the perception threshold, participants were unable to discriminate between them, yet their delayed pointing movements were affected by target size (but not the illusion). The results imply that vision for action is functional even after a delay and/or that the pickup of egocentric information is associated with a lower visual threshold than the pickup of allocentric information.	\N	\N
21422306	To evaluate the significance of the Carhart notch (a 2-kHz bone conduction threshold dip [2KBD]) in the diagnosis of stapes fixation by comparing its incidence among ears with various ossicular chain abnormalities. Retrospective study. University hospital. A total of 153 ears among 127 consecutive patients with a congenital ossicular anomaly or otosclerosis. The 2KBD depth was defined as the threshold at 2 kHz minus the mean of thresholds at 1 and 4 kHz. The presence of 2KBD (depth, ≥10 dB), 2KBD depth, relationship between 2KBD depth and air-bone gap, and 2-kHz bone conduction recovery after operation were evaluated in a stapes fixation group (which included cases of otosclerosis and congenital stapes fixation), an incudostapedial joint detachment group, and a malleus or incus fixation group. A 2KBD was present in 32 of 102 stapes fixation ears (31.4%), 5 of 19 incudostapedial joint detachment ears (26.3%), and 6 of 20 malleus or incus fixation ears (30.0%) (12 ears had other diagnoses). The mean (SD) 2KBD depths were 17.3 (5.2) dB in the stapes fixation group, 18.5 (2.2) dB in the incudostapedial joint detachment group, and 16.3 (2.1) dB in the malleus or incus fixation group. No statistically significant differences were noted among these 3 groups. No correlation was noted between 2KBD depth and air-bone gap extent. Recovery of 2-kHz bone conduction threshold in the stapes fixation group was less than that in the other 2 groups. Incidence of 2KBD was similar among the stapes fixation, incudostapedial joint detachment, and malleus or incus fixation groups, implying that 2KBD is not a useful predictor of stapes fixation.	\N	\N
21424256	Assessing intentions, direction, and velocity of others is necessary for most daily tasks, and such information is often made available by both visual and auditory motion cues. Therefore, our great ability to perceive human motion is not surprising. Here, we explore the multisensory integration of cues of biological motion walking speed. After testing for audiovisual asynchronies (visual signals led auditory ones by 30 ms in simultaneity temporal windows of 76.4 ms), in the main experiment, visual, auditory, and bimodal stimuli were compared to a standard audiovisual walker in a velocity discrimination task. Results in variance reduction conformed to optimal integration of congruent bimodal stimuli across all subjects. Interestingly, the perceptual judgements were still close to optimal for stimuli at the smallest level of incongruence. Comparison of slopes allows us to estimate an integration window of about 60 ms, which is smaller than that reported in audiovisual speech.	\N	\N
21431434	Our objective is to determine whether the degree of endolymphatic hydrops as it is detected in vivo in patients with definite Meniere's disease correlates with audiovestibular function. In this prospective study, 37 patients with definite Meniere's disease according to AAO-HNS criteria were included. Intratympanic contrast enhanced temporal bone MRI was performed using a 3D FLAIR protocol. The degree of endolymphatic hydrops in the cochlea and the vestibulum was graded on a Likert scale (0-3). The degree of hydrops was then analyzed with respect to its correlation with audiometric hearing levels, electrocochleographic SP/AP ratios, interaural amplitude ratios of vestibular evoked myogenic potentials and degree of horizontal semicircular canal paresis on caloric irrigation. There was a significant correlation between the degree of hydrops on the one hand and the averaged hearing level at 0.25-1 and 0.5-3 kHz and the vestibular evoked myogenic potential interaural amplitude ratio on the other hand. A trend toward a correlation was noticed between the hydrops and the caloric response; no correlation was noticed between the hydrops and the SP/AP ratio. The degree of endolymphatic hydrops correlates with a progressive loss of auditory and sacculus function in patients with Meniere's disease.	\N	\N
21432625	Emotional inferences from speech require the integration of verbal and vocal emotional expressions. We asked whether this integration is comparable when listeners are exposed to their native language and when they listen to a language learned later in life. To this end, we presented native and non-native listeners with positive, neutral and negative words that were spoken with a happy, neutral or sad tone of voice. In two separate tasks, participants judged word valence and ignored tone of voice or judged emotional tone of voice and ignored word valence. While native listeners outperformed non-native listeners in the word valence task, performance was comparable in the voice task. More importantly, both native and non-native listeners responded faster and more accurately when verbal and vocal emotional expressions were congruent as compared to when they were incongruent. Given that the size of the latter effect did not differ as a function of language proficiency, one can conclude that the integration of verbal and vocal emotional expressions occurs as readily in one's second language as it does in one's native language.	\N	\N
21440971	Congenital amusia manifests as a lifelong difficulty in making sense of musical sound. The extent to which this disorder is accompanied by deficits in visuo-spatial processing is an important question, bearing on the issue of whether pitch processing draws on supramodal spatial representations. The present study assessed different aspects of visuo-spatial processing with a range of tasks (Shepard-Metzler Mental Rotation, Corsi Blocks Task, Visual Patterns Test) in 14 amusics and matched controls. The absence of a group difference on any of these tasks fails to support a previous claim that the disorder is strongly related to deficits in spatial processing. However, a subgroup of amusics, with significantly elevated thresholds on a pitch direction discrimination task relative to the rest of the group, were slower, but equally accurate, at Mental Rotation. This finding is discussed in relation to the nature of supramodal representations of contour and strategies for dynamic mental transformation.	\N	\N
21458056	Speech production can be broadly separated into two distinct components: Phonation and Articulation. These two aspects require the efficient control of several phono-articulatory effectors. Speech is indeed generated by the vibration of the vocal-folds in the larynx (F0) followed by "filtering" by articulators, to select certain resonant frequencies out of that wave (F1, F2, F3, etc.). Recently it has been demonstrated that the motor representation of articulators (lips and tongue) participates in the discrimination of articulatory sounds (lips- and tongue-related speech sounds). Here we investigate whether the results obtained on articulatory sounds discrimination could be extended to phonation by applying a dual-pulse TMS protocol while subjects had to discriminate F0-shifted vocal utterances [a]. Stimulation over the larynx motor representation, compared to the control site (tongue/lips motor cortex), induced a reduction in RT for stimuli including a subtle pitch shift. We demonstrate that vocal pitch discrimination, in analogy with the articulatory component, requires the contribution of the motor system and that this effect is somatotopically organized.	\N	\N
21463563	Bone-conduction thresholds have been used in audiologic assessments of both infants and adults to differentiate between conductive and sensorineural hearing losses. However, air- and bone-conduction thresholds estimated for infants with normal hearing using physiological measures have identified an "air-bone gap" in the low frequencies that does not result from conductive hearing impairment but, rather, from maturational differences in sensitivity. This maturational air-bone gap appears to be present up to at least 2 yr of age. Because most infants older than 6 mo of age are clinically assessed behaviorally, rather than physiologically, it is necessary to determine whether a similar maturational air-bone gap is present for behavioral air- and bone-conduction thresholds. The purpose of this study was to estimate behavioral bone-conduction thresholds for infants using a standard clinical visual reinforcement audiometry (VRA) protocol to determine whether frequency-dependent maturational patterns exist as previously reported for physiological bone-conduction thresholds. Behavioral bone-conduction minimum response levels were estimated at 500, 1000, 2000, and 4000 Hz using VRA for each participant. Young (7-15 mo; N = 17) and older (18-30 mo; N = 20) groups of infants were assessed. All infants were screened and considered to be at low risk for hearing loss. Preliminary "normal levels" were determined by calculating the 90th percentile for responses present as a cumulative percentage. Mean bone-conduction thresholds were compared and analyzed using a mixed-model analysis of variance across frequency and age group. Linear regression analysis was also performed to assess the effect of age on bone-conduction thresholds. Results of this study indicate that, when measured behaviorally, infants under 30 mo of age show frequency-dependent bone-conduction thresholds whereby their responses at 500 and 1000 Hz are significantly better than those at 2000 and 4000 Hz. 
However, thresholds obtained from the younger group of infants (mean age of 10.6 mo) were not significantly different from those obtained from the older group of infants (mean age of 23.0 mo) at any frequency. The findings of the present study are similar to the results obtained from previous physiological studies. Compared to previously documented air-conduction thresholds of infants using similar VRA techniques, a maturational air-bone gap is observed in the low frequencies. Therefore, differences between infant and adult bone-conduction thresholds persist until at least 30 mo of age. As a result, different "normal levels" should be used when assessing bone-conduction hearing sensitivity of infants using behavioral methods.	\N	\N
21476653	Different non-exponential decays such as the concave and the convex double sloped decays in the coupled rooms provide distinct sound qualities. These are commonly considered to occur in the less reverberant sub-room and the more reverberant sub-room, respectively. However, numerical simulations and experiments in this paper show that the demarcation line is not located along the physical boundaries (e.g., the partition and the coupling aperture), but in the more reverberant sub-room. The sound field with the concave double sloped decay penetrates into the auxiliary sub-room to an extent which is influenced by the difference between the two natural reverberations of the sub-rooms. Furthermore the sound energy flows in different regions are investigated, demonstrating how energy feedback leads to the concave double sloped decay.	\N	\N
21476654	Talkers adjust their vocal effort to communicate at different distances, aiming to compensate for the sound propagation losses. The present paper studies the influence of four acoustically different rooms on the speech produced by 13 male talkers addressing a listener at four distances. Talkers raised their vocal intensity by between 1.3 and 2.2 dB per double distance to the listener and lowered it as a linear function of the quantity "room gain" at a rate of -3.6 dB/dB. There were also significant variations in the mean fundamental frequency, both across distance (3.8 Hz per double distance) and among environments (4.3 Hz), and in the long-term standard deviation of the fundamental frequency among rooms (4 Hz). In the most uncomfortable rooms to speak in, talkers prolonged the voiced segments of the speech they produced, either as a side-effect of increased vocal intensity or in order to compensate for a decrease in speech intelligibility.	\N	\N
21476664	The enhancement effect is consistently shown when simultaneously masked stimuli are preceded by the masker alone, with a reduction in the amount of masking relative to when that precursor is absent. One explanation for this effect proposed by Viemeister and Bacon [(1982). J. Acoust. Soc. Am. 71, 1502-1507] is the adaptation of inhibition, which predicts that an enhanced component (the "target") will be effectively more intense within the auditory system than one that has not been enhanced. Forward masking studies have indicated this effect of increased gain; however, other explanations of the enhancement effect have also been suggested. In order to provide an alternative measure of the amount of effective gain for an enhanced target, a subjective binaural centering task was used in which listeners matched the intensities of enhanced and unenhanced 2-kHz tones presented to opposite ears to produce a centered stimulus. The results showed that the enhancement effect produces an effective 4-5 dB increase in the level of the enhanced target. The enhancement effect was also measured using other enhancement paradigms which yielded similar results over a range of levels for the target, supporting an account based on adaptation of inhibition.	\N	\N
21477909	Operatic music involves both singing and acting (as well as rich audiovisual background arising from the orchestra and elaborate scenery and costumes) that multiply the mechanisms by which emotions are induced in listeners. The present study investigated the effects of music, plot, and acting performance on emotions induced by opera. There were three experimental conditions: (1) participants listened to a musically complex and dramatically coherent excerpt from Tosca; (2) they read a summary of the plot and listened to the same musical excerpt again; and (3) they re-listened to music while they watched the subtitled film of this acting performance. In addition, a control condition was included, in which an independent sample of participants successively listened three times to the same musical excerpt. We measured subjective changes using both dimensional, and specific music-induced emotion questionnaires. Cardiovascular, electrodermal, and respiratory responses were also recorded, and the participants kept track of their musical chills. Music listening alone elicited positive emotion and autonomic arousal, seen in faster heart rate, but slower respiration rate and reduced skin conductance. Knowing the (sad) plot while listening to the music a second time reduced positive emotions (peacefulness, joyful activation), and increased negative ones (sadness), while high autonomic arousal was maintained. Watching the acting performance increased emotional arousal and changed its valence again (from less positive/sad to transcendent), in the context of continued high autonomic arousal. The repeated exposure to music did not by itself induce this pattern of modifications. These results indicate that the multiple musical and dramatic means involved in operatic performance specifically contribute to the genesis of music-induced emotions and their physiological correlates.	\N	\N
21491357	Obesity may be associated with increased tissue bulk in the laryngeal airway, neck, and chest wall, and as such may affect vocal function. Eight obese and eight nonobese adults participated in this study; the obese participants underwent bariatric surgical procedures. This mixed-design study included cross-sectional analysis for group differences and longitudinal analysis for multidimensional changes in vocal function from four assessments collected over 6 months. No significant differences were detected between groups from the preoperative assessment. Further, no changes were detected over time for acoustic parameters, maximum phonation time, laryngeal airway resistance, and airflow during a sustained vowel for either group. Only minor differences were detected for strain, pitch, and loudness perceptions of voice over time, but not between groups. Phonation threshold pressure (PTP), at comfortable and high pitches (30% and 80% of the F0 range) changed significantly over time, but not between groups. Examination of individual data revealed a trend for PTP at 30% F0 to decrease as body mass index decreased. PTP may be informative for assessing vocal function in clients who present with obesity and voice symptoms.	\N	\N
21493300	The aim of this prospective study was to audiologically evaluate consecutive glaucoma patients with or without exfoliation. Prospective study. Glaucoma Unit and Audiology Department at a university hospital. Consecutive subjects with exfoliative glaucoma (XFG) or primary open-angle glaucoma (POAG) aged between 50 and 70 years were enrolled. Auditory thresholds at 0.5, 1.0, 2.0, 4.0, and 8.0 kHz were measured bilaterally. Cochlear activity was assessed by recording distortion product otoacoustic emissions (DPOEs). Functional changes in the retrocochlear auditory pathway were evaluated by auditory brainstem responses (ABRs). One hundred and ten patients with XFG and 85 patients with POAG who presented in a glaucoma clinic were investigated. The mean age of study patients was 66.2 ± 5.6 years (range, 50-70 years). The odds of pathologic ABR central transmission time (interpeak latencies I-III, III-V, and I-V and waves I, III, and V) were 4.34 times higher in patients with XFG than in patients with POAG (95% confidence interval [CI], 2.22-8.49; P < .001). This significant association remained after adjusting for sex and age (odds ratio [OR] 4.12; 95% CI, 2.07-8.22; P < .001). Furthermore, the odds of ABR remained significantly higher in patients with XFG than in patients with POAG (OR 4.36; 95% CI, 2.10-9.06; P < .001) after controlling for systemic diseases (arterial hypertension, coronary heart disease, high cholesterol, and stroke). In the first study to compare XFG and POAG monitoring of the peripheral and central auditory pathway, it has been documented that XFG patients show a greater prevalence of retrocochlear pathology.	\N	\N
21493334	To compare noise reduction of commercially available ear-level hearing protection (muffs/inserts) to that of firearm muzzle suppressors. Experimental sound measurements under consistent environmental conditions. None. Muzzle suppressors for 2 pistol and 2 rifle calibers were tested using the Bruel & Kjaer 2209 sound meter and Bruel & Kjaer 4136 microphone calibrated with the Bruel & Kjaer Pistonphone using Military-Standard 1474D placement protocol. Five shots were recorded unsuppressed and 10 shots suppressed under consistent environmental conditions. Sound reduction was then compared with the real-world noise reduction rate of the best available ear-level protectors. All suppressors offered significantly greater noise reduction than ear-level protection, usually greater than 50% better. Noise reduction of all ear-level protectors is unable to reduce the impulse pressure below 140 dB for certain common firearms, an international standard for prevention of sensorineural hearing loss. Modern muzzle-level suppression is vastly superior to ear-level protection and the only available form of suppression capable of making certain sporting arms safe for hearing. The inadequacy of standard hearing protectors with certain common firearms is not recognized by most hearing professionals or their patients and should affect the way hearing professionals counsel patients and the public.	\N	\N
21506894	To assess the effect of the static force of a bone vibrator on the results of bone conduction (BC) threshold measurements and comfort. BC thresholds were determined for 40 participants using the standardized P-3333 headband and a leather adjustable headstrap with variable static forces (2.4, 3.4, 4.4, 5.4 N). Comfort ratings were examined using a five-point Likert scale. Mean BC thresholds were within ± 2 dB across all conditions; differences may be considered small enough to be clinically insignificant. Participants experienced significantly greater discomfort with the P-3333 versus the adjustable headstrap. The mean static force of the P-3333 varied considerably and was higher in situ than the calibration standard of 5.4 N. The results suggest that future revisions of relevant international and national standards should address the use of an adjustable headstrap and a static force less than 5.4 N.	\N	\N
21512424	To evaluate the outcomes of younger (<60 yr) and older (≥60 yr) patients implanted with the Vibrant Soundbridge (VSB). The aim was to determine if there were differences between groups. A retrospective study was used to evaluate all patients who were implanted and fit with a VSB during 2008 and 2009 at the Department of Otorhinolaryngology-Head and Neck Surgery, Medical University Innsbruck. Differences in audiologic, medical, and surgical outcomes between younger and older patients were evaluated. No patients had major complications during or after the surgical procedure. All patients had a good hearing benefit as supported by improvements in hearing thresholds from the preoperative to the postoperative condition in the sound field. There were differences between groups in speech understanding postoperatively; however, the differences were not statistically significant. All patients had, independent of age, good audiologic benefit from VSB use. Based on the low risk of medical or surgical complications, the easy use of the hearing implant, audiologic improvements, and potential social benefits, we think that the VSB should be regularly offered to adults with hearing loss, whether they are young or old.	\N	\N
21517207	The purpose of this study was to examine the effects of expertise on motion anticipation. We conducted 2 experiments in which novices and expert pilots viewed simulated aircraft landing scenes. The scenes were interrupted by the display of a black screen and then started again after a forward or backward shift. The participant's task was to determine whether the moving scene had been shifted forward or backward. A forward misjudgment of the final position of the moving scene was interpreted as a representational momentum (RM) effect. Experiment 1 showed that an RM effect was detected only for experts. The lack of motion anticipation on the part of novices is a surprising result for the RM literature. It could be related to scene unfamiliarity, encoding time, or shift size. Experiment 2 was run with novices only. It was aimed at testing the potential impact of 2 factors on the RM effect: scene encoding time and shift size. As a whole, the results showed that encoding time and shift size are important factors in anticipation processes in realistic dynamic situations.	\N	\N
21524739	Word segmentation from continuous speech is a difficult task that is faced by human infants when they start to learn their native language. Several studies indicate that infants might use several different cues to solve this problem, including intonation, linguistic stress, and transitional probabilities between subsequent speech sounds. In this work, a computational model for word segmentation and learning of primitive lexical items from continuous speech is presented. The model does not utilize any a priori linguistic or phonemic knowledge such as phones, phonemes or articulatory gestures, but computes transitional probabilities between atomic acoustic events in order to detect recurring patterns in speech. Experiments with the model show that word segmentation is possible without any knowledge of linguistically relevant structures, and that the learned ungrounded word models show a relatively high selectivity towards specific words or frequently co-occurring combinations of short words.	\N	\N
21525779	The results reported in this paper indicate that native speakers of Mandarin Chinese rate the perceptual similarities among the lexical tones of Mandarin differently than do native speakers of American English. Mandarin listeners were sensitive to tone contour while English listeners attended to pitch levels. Chinese listeners also rated tones that are neutralized by phonological tone sandhi rules in Mandarin as more similar to each other than did English speakers--indicating a role of phonology in determining perceptual salience. In two further experiments, we found that some of these differences were eliminated when the listening task focused listeners' attention on the auditory properties of the stimuli, but, interestingly, a degree of language specificity remained even in the most purely psychophysical listening tasks with speech stimuli.	\N	\N
21540053	The auditory system faithfully represents sufficient details from sound sources such that downstream cognitive processes are capable of acting upon this information effectively even in the face of signal uncertainty, degradation or interference. This robust sound source representation leads to an invariance in perception vital for animals to interact effectively with their environment. Due to unique nonlinearities in the cochlea, sound representations early in the auditory system exhibit a large amount of variability as a function of stimulus intensity. In other words, changes in stimulus intensity, such as for sound sources at differing distances, create a unique challenge for the auditory system to encode sounds invariantly across the intensity dimension. This challenge and some strategies available to sensory systems to eliminate intensity as an encoding variable are discussed, with a special emphasis upon sound encoding.	\N	\N
21543605	The mammalian auditory system contains descending neural pathways, some of which project onto the cochlea via the medial olivocochlear (MOC) system. The function of this efferent auditory system is not entirely clear. Behavioral studies in animals with olivocochlear (OC) lesions suggest that the MOC serves to facilitate sound localization in noise. In the current work, noise-induced OC activity (the OC reflex) and sound-localization performance in noise were measured in normal-hearing humans. Consistent with earlier studies, both measures were found to vary substantially across individuals. Importantly, significant correlations were observed between OC-reflex strength and the effect of noise on sound-localization performance; the stronger the OC reflex, the less marked the effect of noise. These results suggest that MOC activation by noise helps to counteract the detrimental effects of background noise on neural representations of direction-dependent spectral features, which are especially important for accurate localization in the up/down and front/back dimensions.	\N	\N
21545768	Previous studies have demonstrated a relation between cognitive capacity, in particular working memory, and the ability to understand speech in noise with different types of hearing aid signal processing. The present study investigates the relation between working memory capacity and the speech recognition performance of persons with hearing impairment under both aided and unaided conditions, following a period of familiarization to both fast- and slow-acting compression settings in the participants' own hearing aids. Speech recognition was tested in modulated and steady state noise with fast and slow compression release settings (for aided conditions) with each of two materials. Working memory capacity was also measured. Thirty experienced hearing aid users with a mean age of 70 yr (SD = 7.8) and pure-tone average hearing threshold across the frequencies 0.25, 0.5, 1, 2, 3, 4, and 6 kHz (PTA7) and for both ears of 45.8 dB HL (SD = 6.6) participated. Each had 9 wk of experience with each of fast-acting and slow-acting compression. Speech recognition data were analyzed using repeated measures analysis of variance with the within-subjects factors of material (high constraint, low constraint), noise type (steady state, modulated), and compression (fast, slow), and the between-subjects factor working memory capacity (high, low). With high constraint material, there were three-way interactions including noise type and working memory as well as compression, in aided conditions, and performance level, in unaided conditions, but no effects of either working memory or compression with low constraint material. Investigation of simple main effects showed a significant effect of working memory during speech recognition under conditions of both "high degradation" (modulated noise, fast-acting compression, low signal-to-noise ratio [SNR]) and "low degradation" (steady state noise, slow-acting compression, high SNR). 
The finding of superior performance of persons with high working memory capacity in modulated noise with fast-acting compression agrees with findings of previous studies including a familiarization period of at least 9 wk, in contrast to studies with familiarization of 4 wk or less that have shown that persons with lower cognitive capacity may benefit from slow-acting compression. Working memory is a crucial factor in speech understanding in noise for persons with hearing impairment, irrespective of whether hearing is aided or unaided. Working memory supports speech understanding in noise under conditions of both "high degradation" and "low degradation." A subcomponent view of working memory may contribute to our understanding of these phenomena. The effect of cognition on speech understanding in modulated noise with fast-acting compression may only pertain after a period of 4-9 wk of familiarization; prior to such a period, persons with lower cognitive capacity may benefit more from slow-acting compression.	\N	\N
21547604	Listeners require context to understand the highly reduced words that occur in casual speech. The present study reports four auditory lexical decision experiments in which the role of semantic context in the comprehension of reduced versus unreduced speech was investigated. Experiments 1 and 2 showed semantic priming for combinations of unreduced, but not reduced, primes and low-frequency targets. In Experiment 3, we crossed the reduction of the prime with the reduction of the target. Results showed no semantic priming from reduced primes, regardless of the reduction of the targets. Finally, Experiment 4 showed that reduced and unreduced primes facilitate upcoming low-frequency related words equally if the interstimulus interval is extended. These results suggest that semantically related words need more time to be recognized after reduced primes, but once reduced primes have been fully (semantically) processed, these primes can facilitate the recognition of upcoming words as well as do unreduced primes.	\N	\N
21554838	Tinnitus is a disturbing symptom and is often the main reason for otology referral. It is usually associated with hearing loss of varying aetiology, and is thought to begin in the cochlea, with later abnormal central activity. We hypothesise that tinnitus without hearing loss may be caused by central and subcortical abnormalities and altered outer hair cell function. To compare the auditory brainstem responses, middle latency responses and otoacoustic emissions in normal-hearing individuals with and without tinnitus. The audiological test results of 25 normal hearing subjects with tinnitus (age 18-45 years) were determined, and compared with those of a control group. A statistically significant difference was found between study group tinnitus ears vs control group ears, as regards wave I latency prolongation, shortening of wave V and absolute I-III and I-V interpeak latency, enlargement of wave Na and Pa amplitude, and distortion product and transient evoked otoacoustic emission signal-to-noise ratios. There was no statistically significant difference between unilateral vs bilateral tinnitus ears. The pathogenesis and optimum management of tinnitus are still unclear. It often occurs with primary ear disease, usually associated with hearing loss, but may occur in patients with normal hearing. Observed changes in auditory brainstem and middle latency responses indicate central auditory alterations. Tinnitus involves both peripheral and central activity, and complete audiological and neurophysiological investigation is required. Management should be based on both audiological and neurophysiological findings.	\N	\N
21563460	To understand MP3 player usage and its effects on hearing among middle school students in Xi'an, and to discuss control strategies. A stratified random cluster sampling method was used to survey 1567 middle school students in Xi'an by questionnaire, ear examination and hearing examination; data were analysed with SPSS 13.0 statistical software. 1) The rate of MP3 player ownership among the middle school students was 85.2%. Average daily use time was (1.41 +/- 1.11) h. 2) Pure tone hearing thresholds were significantly higher in the noise group than in the control group (P<0.01), and the detection rate of hearing loss increased with increasing use of MP3 players. 3) The detection rate of symptoms increased with increasing use of MP3 players. MP3 player use can harm hearing in middle school students and can result in neurasthenic syndrome.	\N	\N
21568377	The ability of listeners with bilateral sensorineural hearing loss to localize a speech source in a multitalker mixture was measured. Five simultaneous words spoken by different talkers were presented over loudspeakers in a small room, and listeners localized one target word. Errors were significantly larger in this group compared to a control group with normal hearing. Localization of the target presented alone was not different between groups. The results suggest that hearing loss does not impair spatial hearing per se, but degrades the spatial representation of multiple simultaneous sounds.	\N	\N
21568378	A previous letter by Gee et al. [J. Acoust. Soc. Am. 121, EL1-EL7 (2007)] revealed likely shortcomings in using common, stationary (long-term) spectrum-based measures to quantify the perception of nonlinearly propagated noise. Here, the Glasberg and Moore [J. Audio Eng. Soc. 50, 331-342 (2002)] algorithm for time-varying loudness is investigated. Their short-term loudness, when applied to a shock-containing broadband signal and a phase-randomized signal with equivalent long-term spectrum, does not show a significant difference in loudness between the signals. Further analysis and discussion focus on the possible utility of the instantaneous loudness and the need for additional investigation in this area.	\N	\N
21569784	In recent years it has been shown that a disorder in recognizing familiar people can be observed in patients with lesions affecting the anterior parts of the temporal lobes and that these disorders can be multi-modal, simultaneously affecting the visual, auditory and linguistic channels that allow person identification. Several authors have also shown that patients with right anterior temporal atrophy are more impaired in assessing familiarity and in retrieving person-specific semantic information from faces than from names, whereas the opposite pattern of performance can be observed in patients with left temporal lobe atrophy. Voice recognition disorders have been studied much less, despite their clinical and theoretical importance. The aim of the present review, therefore, was to compare recognition of familiar faces and voices, taking into account not only results obtained in individual patients with right anterior temporal lesions, but also those of group studies of unselected right- and left brain-damaged patients and results of experimental investigations conducted on face and voice recognition in normal subjects. Results of the review showed that: (1) voice recognition disorders are mainly due to right temporal lesions, similarly to face recognition disorders; (2) famous voice recognition disorders can be dissociated from unfamiliar voice discrimination impairments; (3) although face and voice recognition disorders tend to co-occur, they can also dissociate and in these patients there is a prevalent involvement of the right fusiform gyrus when face recognition disorders are in the foreground, and of the right superior temporal gyrus when voice recognition disorders are prominent; (4) normal subjects have greater difficulty evaluating familiarity and drawing semantic information from the voices than from the faces of celebrities. 
These data are at variance with models which assume that familiarity feelings may be generated at the level of person identity nodes (PINs) and that the latter may be considered as modality-free gateways to single semantic systems in which information about people is stored in an amodal format.	\N	\N
21586256	In a number of European countries, a functional self-test to screen for hearing impairment is available via telephone and the Internet. The tests estimate speech-reception thresholds using an adaptive procedure in which digit triplets are presented at varying signal-to-noise ratios. In different languages, the stimuli were created either with or without coarticulation; and some implementations use fresh noise samples, while others do not. The present investigation concerns the influence of coarticulation, prosody, and noise freshness on measured thresholds. We performed a laboratory study using 12 normal-hearing listeners. In a blocked design we compared speech-reception thresholds for conditions with and without fresh noise tokens. In each block we used three types of triplets: with coarticulation and prosody, with neither, and without coarticulation but with prosody. Thirty-six thresholds were recorded per subject, and they were analyzed using analyses of variance. The results showed no significant differences among the three triplet conditions. The freshness of the noise did not affect thresholds when, at least, a fresh noise token was used per threshold estimate (23 presentations). Scores dropped significantly when a whole experimental block was performed with a single noise token.	\N	\N
21601842	Evolution and the brain have done a marvelous job solving many tricky problems in action control, including problems of learning, hierarchical control over serial behavior, continuous recalibration, and fluency in the face of slow feedback. Given that evolution tends to be conservative, it should not be surprising that these solutions are exploited to solve other tricky problems, such as the design of a communication system. We propose that a mechanism of motor control, paired controller/predictor models, has been exploited for language learning, comprehension, and production. Our account addresses the development of grammatical regularities and perspective, as well as how linguistic symbols become meaningful through grounding in perception, action, and emotional systems.	\N	\N
21603614	Recent studies suggest that human auditory perception follows a prolonged developmental trajectory, sometimes continuing well into adolescence. Whereas both sensory and cognitive accounts have been proposed, the development of the ability to base current perceptual decisions on prior information, an ability that strongly benefits adult perception, has not been directly explored. Here we ask whether the auditory frequency discrimination of preschool children also improves when given the opportunity to use previously presented standard stimuli as perceptual anchors, and whether the magnitude of this anchoring effect undergoes developmental changes. Frequency discrimination was tested using two adaptive same/different protocols. In one protocol (with-reference), a 1-kHz standard tone was presented repeatedly across trials. In the other (no-reference), no such repetitions occurred. Verbal memory and early reading skills were also evaluated to determine if the pattern of correlations between frequency discrimination, memory and literacy is similar to that previously reported in older children and adults. Preschool children were significantly more sensitive in the with-reference than in the no-reference condition, but the magnitude of this anchoring effect was smaller than that observed in adults. The pattern of correlations among discrimination thresholds, memory and literacy replicated previous reports in older children. The processes allowing the use of context to form perceptual anchors are already functional among preschool children, albeit to a lesser extent than in adults. Nevertheless, immature anchoring cannot fully account for the poorer frequency discrimination abilities of young children. That anchoring is present among the majority of typically developing preschool children suggests that the anchoring deficits observed among individuals with dyslexia represent a true deficit rather than a developmental delay.	\N	\N
21604886	Inhibitory control functions in old age were investigated with the "masked prime" paradigm in which participants executed speeded manual choice responses to simple visual targets. These were preceded--either immediately or at some earlier time--by a backward-masked prime. Young adults produced positive compatibility effects (PCEs)--faster and more accurate responses for matching than for nonmatching prime-target pairs--when prime and target immediately followed each other, and the reverse effect (negative compatibility effect, NCE) for targets that followed the prime after a short interval. Older adults produced similar PCEs to young adults, indicating intact low-level motor activation, but failed to produce normal NCEs even with longer delays (Experiment 1), increased opportunity for prime processing (Experiment 2), and prolonged learning (Experiment 3). However, a fine-grained analysis of each individual's time course of masked priming effects revealed NCEs in the majority of older adults, of the same magnitude as those of young adults. These were significantly delayed (even more than expected on the basis of general slowing), indicating a disproportionate impairment of low-level inhibitory motor control in old age.	\N	\N
21616987	This project examined receptive vocabulary treatment outcomes in the two languages of a bilingual preschooler with moderate to severe language impairment. A series of single-subject experimental designs was used to compare English-only (EO) and bilingual (BI) approaches to receptive vocabulary treatment. The participant, Nam, was a boy age 3;11 (years;months) who was learning Vietnamese as a first language at home and English in his early childhood education program. Treatment was implemented by an EO interventionist using a computer interface and prerecorded audio files in Vietnamese and English. The dependent measure was the percentage of items that were correctly identified in each language. Combined studies revealed that the BI approach increased Nam's attention to task and was as effective as the EO approach for increasing his receptive vocabulary in English. Nam made vocabulary gains in both treatment conditions; receptive vocabulary gains were evident in both Vietnamese and English. This project showed that it is feasible for an EO clinician to promote gains in both the home and school languages of a BI child through creative collaborations with BI colleagues and the use of technology. Replication with additional participants and treatment activities is needed to make further generalizations.	\N	\N
21623776	Isotretinoin is widely used in the treatment of extensive and nodulocystic acne. The objective of this prospective study was to investigate whether oral isotretinoin could affect the hearing system. Thirty-eight patients with acne vulgaris (76 ears) who were diagnosed and treated at the Department of Dermatology were included in the current study. Study evaluation visits were performed at baseline and at Weeks 1, 2 and 3. Pure-tone averages (PTAs) of air conduction thresholds at 250 Hz (PTA1); 500, 1000, and 2000 Hz (PTA2); 4000, 8000, and 10,000 Hz (PTA3); and 12,500, 16,000, 18,000 and 20,000 Hz (PTA4) for each ear were calculated separately. Assessment of the efficacy was based on the audiometric findings. Compared with pre-treatment evaluation, the PTAs of patients were found to be significantly different at the first week for PTA2 (P = 0.033) and PTA3 (P = 0.001), at the second week for PTA1 (P = 0.036), and at the third week for PTA4 (P = 0.002). Our results suggest that the oral isotretinoin (13-cis retinoic acid), which is a derivative of retinol (vitamin A), improved the hearing level of the patients in all audiometric frequencies in a short-period follow-up.	\N	\N
21624926	Cognitive impairment is a core element shared by a large number of different neurological and neuropsychiatric diseases. Irrespective of their different aetiologies and symptomatologies, most appear to converge at the functional deficiency of the auditory-frontal cortex network of auditory discrimination, which indexes cognitive impairment shared by these abnormalities. This auditory-frontal cortical deficiency, and hence cognitive decline, can now be objectively measured with the mismatch negativity and its magnetic equivalent. The auditory-frontal cortical network involved seems, therefore, to play a pivotal, unifying role in the different abnormalities. It is, however, more likely that the dysfunction that can be detected with the mismatch negativity and its magnetoencephalographic equivalent manifests a more widespread brain disorder, namely, a deficient N-methyl-D-aspartate receptor function, shared by these abnormalities and accounting for most of the cognitive decline.	\N	\N
21625011	The McGurk effect demonstrates the influence of visual cues on auditory perception. Mismatching information from both sensory modalities can fuse to a novel percept that matches neither the auditory nor the visual stimulus. This illusion is reported in 60-80% of trials. We were interested in the impact of ongoing brain oscillations (indexed by fluctuating local excitability and interareal synchronization) on upcoming perception of identical stimuli. The perception of the McGurk effect is preceded by high beta activity in parietal, frontal, and temporal areas. Beta activity is pronounced in the left superior temporal gyrus (lSTG), which is considered as a site of multimodal integration. This area is functionally (de)coupled to distributed frontal and temporal regions in illusion trials. The disposition to fuse multisensory information is enhanced as the lSTG is more strongly coupled to frontoparietal regions. Illusory perception is accompanied by a decrease in poststimulus theta-band activity in the cuneus, precuneus, and left superior frontal gyrus. Event-related activity in the left middle temporal gyrus is pronounced during illusory perception. Thus, the McGurk effect depends on fluctuating brain states suggesting that functional connectedness of left STS at a prestimulus stage is crucial for an audiovisual percept.	\N	\N
21639675	It is well established that in masked priming, a target word (e.g., JUDGE) is primed more effectively by a transposed letter (TL) prime (e.g., jugde) than by an orthographic control prime (e.g., junpe). This is inconsistent with the slot coding schemes used in many models of visual word recognition. Several alternative coding schemes have been proposed in which special bigram detectors for frequently occurring nonadjacent letter combinations are developed as a product of perceptual learning. In order to examine this perceptual learning hypothesis, we asked whether bigram detectors are defined in terms of visuospatial coordinates. Japanese-English bilinguals who were equally familiar with horizontal and vertical text in Japanese demonstrated strong TL priming in both orientations when reading Japanese words, but, when reading English words, the evidence for vertical TL priming was not as strong. However, native English speakers showed a clear TL priming effect with vertically presented English words despite minimal exposure to vertical text, which is not consistent with a perceptual learning account. It is proposed instead that the initial letter array is transformed into an abstract ordinal code (first to last) regardless of orientation and that the speed with which this transformation is carried out depends on the familiarity of the script.	\N	\N
21645986	The etiology of developmental dyslexia remains widely debated. An appealing theory postulates that the reading and spelling problems in individuals with dyslexia originate from reduced sensitivity to slow-rate dynamic auditory cues. This low-level auditory deficit is thought to provoke a cascade of effects, including inaccurate speech perception and eventually unspecified phoneme representations. The present study investigated sensitivity to frequency modulation and amplitude rise time, speech-in-noise perception and phonological awareness in 11-year-old children with dyslexia and matched normal-reading control children. Group comparisons demonstrated that children with dyslexia performed more poorly than normal-reading children on measures of slow-rate dynamic auditory processing, speech-in-noise perception, phonological awareness and literacy abilities. Correlations were found between slow-rate dynamic auditory processing and phonological awareness, and between speech-in-noise perception and reading. Yet, no significant correlation between slow-rate dynamic auditory processing and speech-in-noise perception was obtained. Together, these results indicate that children with dyslexia have difficulties with slow-rate dynamic auditory processing and speech-in-noise perception and that these problems persist until sixth grade.	\N	\N
21647889	To study to what extent it is possible to achieve identical insertion depths and to maintain the same performance after cochlear reimplantation. Outcome research on a retrospective case series in a tertiary university referral center. Data were collected for 12 adults and three children who underwent reimplantation during the last 3 years with a new HiRes90K device with HiFocus 1J electrode owing to failure of the feed-through seal. Multislice computed tomography scans were used to compare positions of the original and newly placed electrode arrays. The speech-perception scores on a consonant-vowel-consonant word test before and after reimplantation were compared. All reimplantations were successfully performed by two experienced cochlear implantation surgeons, and no complications were observed. Postoperative imaging showed that the average displacement of the new implant was only 0.59 mm. Reactivation of the implant gave immediate open set speech understanding in all patients, and speech perception rapidly returned to the previous level obtained with the original implant within weeks; it was even significantly better at the 3-month follow-up. No relation was found between changes in performance and the amount of displacement of the electrode array. After cochlear reimplantation with the same device, electrode-array position can be accurately replicated and speech perception can be regained or even improved within weeks.	\N	\N
21649758	This event-related functional magnetic resonance imaging (fMRI) study was designed to contribute to the present debate on behavioural and functional transfer effects associated with intensive language training. To address this novel issue, we measured professional simultaneous interpreters and control subjects while they performed a non-verbal auditory discrimination task that primarily relies on attention and categorization functions. The fMRI results revealed that the discrimination of the target stimuli was associated with differential blood oxygen level-dependent responses in fronto-parietal regions between the two groups, even though in-scanner behavioural results did not show significant group differences. These findings are in line with previous observations showing the contribution of fronto-parietal regions to auditory attention and categorization functions. Our results imply that language training modulates brain activity in regions involved in the top-down regulation of auditory functions.	\N	\N
21669859	It is generally agreed that considerable amounts of low-level sensory processing of visual stimuli can occur without conscious awareness. On the other hand, the degree of higher level visual processing that occurs in the absence of awareness is as yet unclear. Here, event-related potential (ERP) measures of brain activity were recorded during a sandwich-masking paradigm, a commonly used approach for attenuating conscious awareness of visual stimulus content. In particular, the present study used a combination of ERP activation contrasts to track both early sensory-processing ERP components and face-specific N170 ERP activations, in trials with versus without awareness. The electrophysiological measures revealed that the sandwich masking abolished the early face-specific N170 neural response (peaking at ~170 ms post-stimulus), an effect that paralleled the abolition of awareness of face versus non-face image content. Furthermore, however, the masking appeared to render a strong attenuation of earlier feedforward visual sensory-processing signals. This early attenuation presumably resulted in insufficient information being fed into the higher level visual system pathways specific to object category processing, thus leading to unawareness of the visual object content. These results support a coupling of visual awareness and neural indices of face processing, while also demonstrating an early low-level mechanism of interference in sandwich masking.	\N	\N
21675563	This study examined the effects of visual-verbal load (as measured by a visually presented reading-memory task with three levels) on a visual/auditory stimulus-response task. The three levels of load were defined as follows: "No Load" meant no other stimuli were presented concurrently; "Free Load" meant that a letter (A, B, C, or D) appeared at the same time as the visual or auditory stimulus; and "Force Load" was the same as "Free Load," but the participants were also instructed to count how many times the letter A appeared. The stimulus-response task also had three levels: "irrelevant," "compatible," and "incompatible" spatial conditions. These required different key-pressing responses. The visual stimulus was a red ball presented either to the left or to the right of the display screen, and the auditory stimulus was a tone delivered from a position similar to that of the visual stimulus. Participants also processed an irrelevant stimulus. The results indicated that participants perceived auditory stimuli earlier than visual stimuli and reacted faster under stimulus-response compatible conditions. These results held even under a high visual-verbal load. These findings suggest the following guidelines for systems used in driving: an auditory source, appropriately compatible signal and manual-response positions, and a visually simplified background.	\N	\N
21676085	The phonological deficit theory of dyslexia assumes that degraded speech sound representations might hamper the acquisition of stable letter-speech sound associations necessary for learning to read. However, there is only scarce and mainly indirect evidence for this assumed letter-speech sound association problem. The present study aimed at clarifying the nature and the role of letter-speech sound association problems in dyslexia by analysing event-related potentials (ERP) of 11-year-old dyslexic children to speech sounds in isolation or combined with letters, which were presented either simultaneously with or 200 ms before the speech sounds. Recent studies with normal readers revealed that letters systematically modulated speech sound processing in an early (mismatch negativity or MMN) and late (Late Discriminatory Negativity or LDN) time-window. The amplitude of the MMN and LDN to speech sounds was enhanced when speech sounds were presented with letters. The dyslexic readers in the present study, however, did not exhibit any early influences of letters on speech sounds even after 4 years of reading instruction, indicating no automatic integration of letters and speech sounds. Interestingly, they revealed a systematic late effect of letters on speech sound processing, probably reflecting the mere association of letters and speech sounds. This pattern is strongly divergent from that observed in age-matched normal readers, who showed both early and late effects, but reminiscent of that observed in beginner normal readers in a previous study (Froyen, Bonte, van Atteveldt & Blomert, 2009). The finding that the quality of letter-speech sound processing is directly related to reading fluency urges further research into the role of audiovisual integration in the development of reading failure in dyslexia.	\N	\N
21676091	Early post-natal nutrition influences later development, but there are no studies comparing brain function in healthy infants as a function of dietary intake even though the major infant diets differ significantly in nutrient composition. We studied brain responses (event-related potentials; ERPs) to speech sounds for infants who were fed either breast milk (BF), milk-based formula (MF), or soy formula (SF) during the first 6 months of life. Two syllables presented in an oddball paradigm elicited a late positive wave (P350) from temporal and frontal brain regions involved in language processes. All groups showed significantly greater response amplitudes to the infrequent syllable across sites at 3 months and frontally at 6 months, but significant discrimination at temporal sites was only observed at 6 months in BF infants. Decreases in response amplitudes from 3 to 6 months were greater for the frequently presented syllable, most prominent in BF infants, and greater in females than males. The results indicate greater syllable discrimination in BF than formula-fed infants, but whether this can be attributed to dietary influences alone remains unclear. Feeding method and background factor differences between breastfed and formula-fed infants may also contribute to the observed differences. The general absence of differences between formula-fed groups is notable and suggests that milk-based formula and soy formula equally support brain development and function during the first post-natal 6 months. Finally, the results indicate gender differences in the development of neural and temporal processes involved in sensory discrimination, and suggest that at 6 months these processes are better developed in females.	\N	\N
21676999	The conditions of sound fields used in research, especially testing and fitting of hearing aids, are usually simplified or reduced to fundamental physical fields, such as the free or the diffuse sound field. The concepts of such ideal conditions are easily introduced in theoretical and experimental investigations and in models for directional microphones, for example. When it comes to real-world application of hearing aids, however, the field conditions are more complex with regard to specific stationary and transient properties in room transfer functions and the corresponding impulse responses and binaural parameters. Sound fields can be categorized into outdoor (rural and urban) and indoor environments. Furthermore, sound fields in closed spaces of various sizes and shapes and in situations of transport in vehicles, trains, and aircraft are compared with regard to the binaural signals. In laboratory tests, sources of uncertainty are individual differences in binaural cues and insufficiently controlled sound field conditions. Furthermore, laboratory sound fields do not cover the variety of complex sound environments. Spatial audio formats such as higher-order ambisonics are candidates for sound field references not only in room acoustics and audio engineering but also in audiology.	\N	\N
21678230	Studies of change detection have increased our understanding of attention, perception, and memory. In two innovative experiments we showed that the change detection phenomenon can be used to examine other areas of cognition, specifically the processing of linguistic and indexical information in spoken words. One hypothesis suggests that cognitive resources must be used to process indexical information, whereas an alternative suggests that it is processed more slowly than linguistic information. Participants performed a lexical decision task and were asked whether the voice presenting the stimuli changed. Nonwords varying in their likeness to real words were used in the lexical decision task to encourage participants to vary the amount of cognitive resources/processing time. More cognitive resources/processing time are required to make a lexical decision with word-like nonwords. Participants who heard word-like nonwords were more likely to detect the change when it occurred (Experiment 1) and were more confident that the voice was the same when it did not change (Experiment 2). These results suggest that indexical information is processed more slowly than linguistic information and demonstrate how change detection can provide insight into other areas of cognition.	\N	\N
21681660	Two experiments examined whether perceptual recovery from Korean consonant-cluster simplification is based on language-specific phonological knowledge. In tri-consonantal C1C2C3 sequences such as /lkt/ and /lpt/ in Seoul Korean, either C1 or C2 can be completely deleted. Seoul Koreans monitored for C2 targets (/p/ or /k/, deleted or preserved) in the second word of a two-word phrase with an underlying /l/-C2-/t/ sequence. In Experiment 1 the target-bearing words had contextual lexical-semantic support. Listeners recovered deleted targets as fast and as accurately as preserved targets with both Word and Intonational Phrase (IP) boundaries between the two words. In Experiment 2, contexts were low-pass filtered. Listeners were still able to recover deleted targets as well as preserved targets in IP-boundary contexts, but better with physically-present targets than with deleted targets in Word-boundary contexts. This suggests that the benefit of having target acoustic-phonetic information emerges only when higher-order (contextual and phrase-boundary) information is not available. The strikingly efficient recovery of deleted phonemes with neither acoustic-phonetic cues nor contextual support demonstrates that language-specific phonological knowledge, rather than language-universal perceptual processes which rely on fine-grained phonetic details, is employed when the listener perceives the results of a continuous-speech process in which reduction is phonetically complete.	\N	\N
21682395	The perceived negative influence of standard hearing protectors on communication is a common argument for not wearing them. Thus, "augmented" protectors have been developed to improve speech intelligibility. Nevertheless, their actual benefit remains a point of concern. In this paper, speech perception with active earplugs is compared to standard passive custom-made earplugs. The two types of active protectors included amplify the incoming sound with a fixed level or to a user selected fraction of the maximum safe level. For the latter type, minimal and maximal amplification are selected. To compare speech intelligibility, 20 different speech-in-noise fragments are presented to 60 normal-hearing subjects and speech recognition is scored. The background noise is selected from realistic industrial noise samples with different intensity, frequency, and temporal characteristics. Statistical analyses suggest that the protectors' performance strongly depends on the noise condition. The active protectors with minimal amplification outclass the others for the most difficult and the easiest situations, but they also limit binaural listening. In other conditions, the passive protectors clearly surpass their active counterparts. Subsequently, test fragments are analyzed acoustically to clarify the results. This provides useful information for developing prototypes, but also indicates that tests with human subjects remain essential.	\N	\N
21682407	When a test sound consisting of pure tones with equal intensities is preceded by a precursor sound identical to the test sound except for a reduction in the intensity of one tone, an auditory "enhancement" phenomenon occurs: In the test sound, the tone which was previously softer stands out perceptually. Here, enhancement was investigated using inharmonic sounds made up of five pure tones well resolved in the auditory periphery. It was found that enhancement can be elicited not only by increases in intensity but also by shifts in frequency. In both cases, when the precursor and test sounds are separated by a 500-ms delay, inserting a burst of pink noise during the delay has little effect on enhancement. Presenting the precursor and test sounds to opposite ears rather than to the same ear significantly reduces the enhancement resulting from increases in intensity, but not the enhancement resulting from shifts in frequency. This difference suggests that the mechanisms of enhancement are not identical for the two types of change. For frequency shifts, enhancement may be partly based on the existence of automatic "frequency-shift detectors" [Demany and Ramos, J. Acoust. Soc. Am. 117, 833-841 (2005)].	\N	\N
21688937	Cross-modal temporal recalibration describes a shift in the point of subjective simultaneity (PSS) between 2 events following repeated exposure to asynchronous cross-modal inputs--the adaptors. Previous research suggested that audiovisual recalibration is insensitive to the spatial relationship between the adaptors. Here we show that audiovisual recalibration can be driven by cross-modal spatial grouping. Twelve participants adapted to alternating trains of lights and tones. Spatial position was manipulated, with alternating sequences of a light then a tone, or a tone then a light, presented on either side of fixation (e.g., left tone--left light--right tone--right light, etc.). As the events were evenly spaced in time, in the absence of spatial-based grouping it would be unclear if tones were leading or lagging lights. However, any grouping of spatially colocalized cross-modal events would result in an unambiguous sense of temporal order. We found that adapting to these stimuli caused the PSS between subsequent lights and tones to shift toward the temporal relationship implied by spatial-based grouping. These data therefore show that temporal recalibration is facilitated by spatial grouping.	\N	\N
21689988	Mozart's Sonata for Two Pianos in D major, K.448 (Mozart K.448), has been shown to improve mental function, leading to what is known as the Mozart Effect. Our previous work revealed that epileptiform discharges in children with epilepsy decrease during and right after listening to Mozart K.448. However, the duration of the effect was not studied. In the study described here, we evaluated the long-term effect of Mozart K.448 on epileptiform discharges in children with epilepsy. Eighteen children with epilepsy whose seizures were clinically well controlled with antiepileptic drugs were included. For each child, EEGs had revealed persistent epileptiform discharges for at least 6 months. These patients listened to Mozart K.448 for 8 minutes once a day before bedtime for 6 months. Epileptiform discharges were recorded and compared before and after 1, 2, and 6 months of listening to Mozart K.448. All of the children remained on the same antiepileptic drug over the 6 months. Relationships between number of epileptiform discharges and foci of discharges, intelligence, epilepsy etiology, age, and gender were analyzed. Epileptiform discharges significantly decreased by 53.2±47.4, 64.4±47.1, and 71.6±45.8%, respectively, after listening to Mozart K.448 for 1, 2, and 6 months. All patients except those with occipital discharges showed a significant decrease in epileptiform discharges. Patients with normal intelligence and idiopathic epilepsy had greater decreases than those with mental retardation and symptomatic epilepsy. Age and gender did not affect the results. We conclude that long-term listening to Mozart K.448 may be effective in decreasing epileptiform discharges in children with epilepsy in a chronologically progressive manner.	\N	\N
21700953	We previously reported that fast-moving dot arrays cause orientation-tuned masking of static gratings (D. Apthorp, J. Cass, & D. Alais, 2010), which we attribute to "motion streaks." Using similar "streaky" dot motion, we describe spatial frequency tuning of grating threshold elevations caused by masking (Experiment 1) and adaptation (Experiment 2) to motion. To compare the streaks with psychophysical tunings, we Fourier analyzed time-averaged translating dots, which were bandpass (peaking at ∼2.3 c/deg). Masking, however, was strongest at lower test frequencies (≤1 c/deg) and largely isotropic over orientation, although a small orientation-tuned effect occurred at ∼1.2 c/deg. Results were broadly similar across monoptic and dichoptic conditions. Adaptation to fast motion produced spatially bandpass threshold elevations for parallel test gratings, peaking slightly lower than the peak Fourier frequency, with little elevation below 1 c/deg (unlike the low-pass elevation resulting from masking). Slow adaptation produced little elevation for parallel gratings. For orthogonal test gratings, fast motion adaptation produced low-pass threshold elevations and slow motion produced bandpass elevations, suggesting that separable mechanisms process fast (streaky) and slow motion. The different threshold elevation patterns over spatial frequency for masking and adaptation suggest that the adaptation effects are mainly within-channel suppression, whereas the masking effects may be mainly due to between-channel suppression.	\N	\N
21717096	The ocular vestibular evoked myogenic potential (oVEMP) is a relatively new method used to assess otolith-ocular pathways in humans. When elicited using air-conducted (AC) sound stimulation, the oVEMP is thought to reflect mostly saccular activation. However, it has been recently suggested that utricular afferents may also contribute to the AC evoked oVEMP. While previous frequency tuning studies of the AC evoked oVEMP report predominately high frequency sensitivity (>400 Hz), few have included the lower frequencies (<200 Hz) at which it has been proposed the utricle is most sensitive. In this study, ten normal subjects were stimulated with AC sound delivered unilaterally using headphones over frequencies from 50 to 1,200 Hz at a near constant A-weighted intensity of 120 dB peak sound pressure level. For AC stimulation, the oVEMP demonstrated maximum amplitudes around 600 Hz, with a second, smaller peak occurring around 100 Hz. The AC evoked oVEMP tuning has two peaks, a dominant one consistent with excitation of the saccule and a smaller one consistent with excitation of the utricle.	\N	\N
21724369	Although voice is an important parameter of mate choice, no studies have described the acoustic characteristics of the sexually appealing voice. Two hundred adults (100 men and 100 women) in the age range of 18-24 years were asked to narrate a topic, which was recorded directly onto Computerized Speech Lab (CSL) 6103 hardware. Recorded stimuli were presented to six judges, who were asked to indicate on a five-point rating scale whether the voice was sexually appealing. The voices that were consistently identified as sexually very attractive or unattractive were subjected to cepstral analysis through CSL. The results of perceptual analysis revealed that 28 of the female voice samples and 39 of the male voice samples were rated as sexually attractive. These ratings were consistent within and across the judges. The cepstral analysis was then performed on all the voice samples, and the results of an independent t test revealed higher values of cepstral peak parameter (CPP) in the sexually attractive voices in comparison to the other voice samples in both genders. The obtained results are discussed with respect to the harmonic organization in the voice samples. The results of cepstral analysis in sexually attractive voices revealed higher values of CPP in comparison to the voices rated as sexually not appealing. This could be because of the presence of well-defined harmonic structure evidenced in the sexually appealing voice in comparison to voices rated as very unappealing. Our findings suggest that cepstral analysis is a good indicator of sexually appealing voice.	\N	\N
21728456	Cognitive control resolves conflicts between appropriate and inappropriate response tendencies. Is this achieved by a unitary all-purpose conflict control system, or do independent subsystems deal with different aspects of conflicting information? In a fully factorial hybrid prime-Simon task, participants responded to the identity of targets displayed at different nominally irrelevant screen locations, preceded by nominally irrelevant, consciously or nonconsciously perceived primes. The response required by the target's identity could match or mismatch (a) the target's location, and (b) the prime's identity, resulting in potential conflict (a) across and (b) within stimulus domains. Conflict effects were investigated within and across trials. Results suggest that (i) nonconsciously perceived information elicits within-trial control, but--unlike consciously perceived information--no across-trial behavioral modulation; (ii) separate subsystems deal with conflicts arising from different stimulus domains; and (iii) occasional apparent interactions between domains reflect a particular difficulty in reactivating a just-discarded response (reactivation aversion effect, RAE).	\N	\N
21728464	Increasing perceptual load reduces the processing of visual stimuli outside the focus of attention, but the mechanism underlying these effects remains unclear. Here we tested an account attributing the effects of perceptual load to modulations of visual cortex excitability. In contrast to stimulus competition accounts, which propose that load should affect simultaneous, but not sequential, stimulus presentations, the visual excitability account makes the novel prediction that load should affect detection sensitivity for both simultaneous and sequential presentations. Participants fixated a stimulus stream, responding to targets defined by either a color (low load) or color and orientation conjunctions (high load). Additionally, detection sensitivity was measured for a peripheral critical stimulus (CS) presented occasionally. Increasing load at fixation reduced sensitivity to the peripheral CSs; this effect was similar regardless of whether CSs were presented simultaneously with central stimuli or during the (otherwise empty) interval between them. Controls ruled out explanations of the results in terms of strategic task prioritization. These findings support a cortical excitability account for perceptual load, challenging stimulus competition accounts.	\N	\N
21729437	To describe the effect of age and noise on high frequency hearing thresholds in an Italian population aged 70 years and older, in order to investigate the interaction between presbycusis and noise exposure. We compared 460 subjects: 367 affected by presbycusis alone (204 women and 163 men) and 93 affected by presbycusis and noise exposure (eight women and 85 men). Pure tone average hearing thresholds, for each ear, were compared between groups, and between sexes and ages within groups. A slight threshold difference was found between the two groups at 4 kHz. After adjusting for age and gender, this difference was found to be related only to differing patient age. Men's and women's thresholds differed significantly in both groups, especially at high frequencies, at which threshold deterioration was worse in men than women. The threshold differences between patients with presbycusis with and without noise exposure were limited. Larger studies are needed to assess the relative effects of ageing and noise exposure on hearing thresholds.	\N	\N
21750713	In the present study we investigated the capacity of the memory store underlying the mismatch negativity (MMN) response in musicians and nonmusicians for complex tone patterns. While previous studies have focused either on the kind of information that can be encoded or on the decay of the memory trace over time, we studied capacity in terms of the length of tone sequences, i.e., the number of individual tones that can be fully encoded and maintained. By means of magnetoencephalography (MEG) we recorded MMN responses to deviant tones that could occur at any position of standard tone patterns composed of four, six or eight tones during passive, distracted listening. Whereas there was a reliable MMN response to deviant tones in the four-tone pattern in both musicians and nonmusicians, only some individuals showed MMN responses to the longer patterns. This finding of a reliable capacity of the short-term auditory store underlying the MMN response is in line with estimates of a three to five item capacity of the short-term memory trace from behavioural studies, although pitch and contour complexity covaried with sequence length, which might have led to an understatement of the reported capacity. Whereas there was a tendency for an enhancement of the pattern MMN in musicians compared to nonmusicians, a strong advantage for musicians could be shown in an accompanying behavioural task of detecting the deviants while attending to the stimuli for all pattern lengths, indicating that long-term musical training differentially affects the memory capacity of auditory short-term memory for complex tone patterns with and without attention. Also, a left-hemispheric lateralization of MMN responses in the six-tone pattern suggests that additional networks that help structuring the patterns in the temporal domain might be recruited for demanding auditory processing in the pitch domain.	\N	\N
21755126	To assess the difficulty of paced auditory serial addition test (PASAT) in a population of high intellectual level, under ideal cognitive testing circumstances. One hundred medical students underwent PASAT testing. They had slept well the night before, they had eaten before the assessment, they were not using any drugs that could affect the central nervous system and they did not have depression, anxiety or any chronic disease. The average result from the three-second version of PASAT was 57.5% and, from the two-second version, it was 44.3%. Even under ideal circumstances, PASAT is a very difficult test for the general population. It may not be ideal for neurologists to screen, assess and follow up patients with cognitive function in multiple sclerosis.	\N	\N
21762032	Reaction times for categorization of a probe face according to its sex or fame were contrasted as a function of whether the category of a preceding, sandwich-masked prime face was congruent or incongruent. Prime awareness was measured by the ability to later categorize the primes, and this was close to chance and typically uncorrelated with priming. When prime faces were never presented as visible probes within a test, priming was not reliable; when prime faces were also seen as probes, priming was only reliable if visible and masked presentation of faces were interleaved (not simply if primes had been visible in a previous session). In the latter case, priming was independent of experimentally induced face-response or face-category contingencies, ruling out any simple form of stimulus-response learning. We conclude that the reliable masked congruency priming reflects bindings between stimuli and multiple, abstract classifications that can be generated both overtly and covertly.	\N	\N
21762876	Schizophrenia patients have vocal affect (prosody) deficits that are treatment resistant and associated with negative symptoms and poor outcome. The neural correlates of this dysfunction are unclear. Prior study has suggested that schizophrenia vocal affect perception deficits stem from an inability to use acoustic cues, notably pitch, in decoding emotion. Functional magnetic resonance imaging was performed in 24 schizophrenia patients and 28 healthy control subjects, during the performance of a four-choice (happiness, fear, anger, neutral) vocal affect identification task in which items for each emotion varied parametrically in affective salient acoustic cue levels. We observed that parametric increases in cue levels in schizophrenia failed to produce the same identification rate increases as in control subjects. These deficits correlated with diminished reciprocal activation changes in superior temporal and inferior frontal gyri and reduced temporo-frontal connectivity. Task activation also correlated with independent measures of pitch perception and negative symptom severity. These findings illustrate the interplay between sensory and higher-order cognitive dysfunction in schizophrenia. Sensory contributions to vocal affect deficits also suggest that this neurobehavioral marker could be targeted by pharmacological or behavioral remediation of acoustic feature discrimination.	\N	\N
21765387	To compare hearing results in patients undergoing ossiculoplasty using either partial ossicular replacement prosthesis (PORP) or total ossicular replacement prosthesis (TORP) with Silastic banding and malleus relocation techniques in cases with malleus and stapes both present and mobile. Prospective nonrandomized clinical study. Tertiary referral center. Five hundred eighty-five patients undergoing ossiculoplasty were enrolled in this study from April 1991 to May 2010. Comparative analyses were made between a group of 304 patients who underwent ossiculoplasty with partial prosthesis positioned from the malleus to the stapes head and 281 patients who underwent ossiculoplasty with total prosthesis positioned from the malleus to the stapes footplate. Preoperative and postoperative audiometric evaluation using conventional audiometry, that is, air-bone gap (ABG), bone-conduction thresholds, and air-conduction thresholds were assessed. In the PORP group, the mean postoperative ABG was 13.1 dB compared with 8.9 dB in the TORP group, (95% confidence interval [CI], 2.2-6.2 dB; p ≤ 0.001). Fifty-four percent of patients from the PORP group had a postoperative ABG of 10 dB or less, compared with 68.9% in the TORP group (mean difference, 14.6%; 95% CI, 6%-23%; p < 0.001). The postoperative ABG was closed to within 20 dB in 70.4% of cases in the PORP group compared with 86.9% in the TORP group (mean difference, 14.5%; 95% CI, 10%-23%; p < 0.001). In patients with an absent incus and intact stapes and malleus, ossicular reconstruction with TORP combined with our malleus relocation and Silastic banding technique results in significantly better hearing outcomes compared with reconstructions with PORP.	\N	\N
21767048	The urge to move in response to music, combined with the positive affect associated with the coupling of sensory and motor processes while engaging with music (referred to as sensorimotor coupling) in a seemingly effortless way, is commonly described as the feeling of being in the groove. Here, we systematically explore this compelling phenomenon in a population of young adults. We utilize multiple levels of analysis, comprising phenomenological, behavioral, and computational techniques. Specifically, we show (a) that the concept of the groove is widely appreciated and understood in terms of a pleasurable drive toward action, (b) that a broad range of musical excerpts can be appraised reliably for the degree of perceived groove, (c) that the degree of experienced groove is inversely related to experienced difficulty of bimanual sensorimotor coupling under tapping regimes with varying levels of expressive constraint, (d) that high-groove stimuli elicit spontaneous rhythmic movements, and (e) that quantifiable measures of the quality of sensorimotor coupling predict the degree of experienced groove. Our results complement traditional discourse regarding the groove, which has tended to take the psychological phenomenon for granted and has focused instead on the musical and especially the rhythmic qualities of particular genres of music that lead to the perception of groove. We conclude that groove can be treated as a psychological construct and model system that allows for experimental exploration of the relationship between sensorimotor coupling with music and emotion.	\N	\N
21786896	The form of the psychometric function (PF) for auditory frequency discrimination is of theoretical interest and practical importance. In this study, PFs for pure-tone frequency discrimination were measured for several standard frequencies (200-8000 Hz) and levels [35-85 dB sound pressure level (SPL)] in normal-hearing listeners. The proportion-correct data were fitted using a cumulative-Gaussian function of the sensitivity index, d', computed as a power transformation of the frequency difference, Δf. The exponent of the power function corresponded to the slope of the PF on log(d')-log(Δf) coordinates. The influence of attentional lapses on PF-slope estimates was investigated. When attentional lapses were not taken into account, the estimated PF slopes on log(d')-log(Δf) coordinates were found to be significantly lower than 1, suggesting a nonlinear relationship between d' and Δf. However, when lapse rate was included as a free parameter in the fits, PF slopes were found not to differ significantly from 1, consistent with a linear relationship between d' and Δf. This was the case across the wide ranges of frequencies and levels tested in this study. Therefore, spectral and temporal models of frequency discrimination must account for a linear relationship between d' and Δf across a wide range of frequencies and levels.	\N	\N
21787870	Repetition has been shown to activate the so-called 'dorsal stream', a network of temporo-parieto-frontal areas subserving the mapping of acoustic speech input onto articulatory-motor representations. Among these areas, a region in the posterior Sylvian fissure at the temporo-parietal boundary (also called 'area Spt') has been suggested to play a central role particularly with increasing computational demands on phonological processing. Most of the relevant evidence stems from tasks requiring metalinguistic processing. To date, the relevance of area Spt in natural phonological operations based on implicit linguistic knowledge has not yet been investigated. We examined two types of phonological processes assumed to be lateralized differently, i.e., the processing of syllabic stress versus subsyllabic segmental processing. In two ways, subjects modified an auditorily presented pseudoword before reproducing it overtly: (a) by a prosodic manipulation involving a stress shift across syllable boundaries, (b) by a segmental manipulation involving a vowel substitution. Manipulation per se was expected to engage area Spt. Segmental compared to prosodic processing was expected to reveal predominantly left lateralized activation, while prosodic compared to segmental processing was expected to result in bilateral or right-lateralized activation. Contrary to expectation, activation in area Spt did not vary with increased phonological processing demand. Instead, area Spt was engaged regardless of whether subjects simply repeated a pseudoword or performed a phonological manipulation before reproduction. However, for both segmental and prosodic stimuli, reproduction after manipulation (compared to repetition) activated the left intraparietal sulcus and left inferior frontal cortex. We propose that these parieto-frontal regions are recruited when the task requires phonological manipulation over and above the more automated transfer of auditory into articulatory verbal codes, which appears to involve area Spt. When directly contrasted with prosodic manipulation, segmental manipulation resulted in increased activation predominantly in left inferior frontal areas. This may be due to an increased demand on phonological sequencing operations at the subsyllabic phoneme level. Contrasted with segmental manipulations, prosodic manipulation did not result in increased activation, which may be due to a lower degree of morphosyntactic and to syllable-level processing.	\N	\N
21792976	To evaluate vestibular function in patients with the mitochondrial A3243G mutation. Data from patients with the A3243G mutation attending an academic tertiary referral center were prospectively recorded. The clinical histories of 13 unrelated patients with the mitochondrial A3243G mutation (six mitochondrial encephalomyopathy, lactic acidosis, and stroke-like episodes; and seven maternally inherited diabetes and deafness) were recorded, in particular their history of vestibular symptoms. Vestibular examinations including caloric testing and vestibular evoked myogenic potentials in response to air-conducted sound (ACS-VEMPs) were performed. In seven patients who showed abnormal ACS-VEMP, VEMP in response to galvanic stimuli (galvanic-VEMP) were also recorded. Eleven of the 13 patients had vestibular symptoms. The age of onset of vestibular symptoms was significantly later than the ages of onset of hearing loss and diabetes mellitus (P < .05). Ten of the 13 patients showed abnormal caloric responses, whereas 12 patients showed abnormal ACS-VEMPs on one or both sides. All of the seven patients who underwent galvanic-VEMP testing showed normal responses. The A3243G mutation is associated with vestibular dysfunction involving both the superior and inferior vestibular nerve systems. Furthermore, our results from galvanic-VEMP testing suggest that a labyrinthine lesion is primarily responsible for the symptoms of vestibular dysfunction.	\N	\N
21812557	Several perspectives on speech perception posit a central role for the representation of articulations in speech comprehension, supported by evidence for premotor activation when participants listen to speech. However, no experiments have directly tested whether motor responses mirror the profile of selective auditory cortical responses to native speech sounds or whether motor and auditory areas respond in different ways to sounds. We used fMRI to investigate cortical responses to speech and nonspeech mouth (ingressive click) sounds. Speech sounds activated bilateral superior temporal gyri more than other sounds, a profile not seen in motor and premotor cortices. These results suggest that there are qualitative differences in the ways that temporal and motor areas are activated by speech and click sounds: Anterior temporal lobe areas are sensitive to the acoustic or phonetic properties, whereas motor responses may show more generalized responses to the acoustic stimuli.	\N	\N
21812560	Complex auditory exposures in ambient environments include systems of not only linguistic but also musical sounds. Because musical exposure is often passive, consisting of listening rather than performing, examining listeners without formal musical training allows for the investigation of the effects of passive exposure on our nervous system without active use. Additionally, studying listeners who have exposure to more than one musical system allows for an evaluation of how the brain acquires multiple symbolic and communicative systems. In the present fMRI study, listeners who had been exposed to Western-only (monomusicals) and both Indian and Western musical systems (bimusicals) since childhood and did not have significant formal musical training made tension judgments on Western and Indian music. Significant group by music interactions in temporal and limbic regions were found, with effects predominantly driven by between-music differences in temporal regions in the monomusicals and by between-music differences in limbic regions in the bimusicals. Effective connectivity analysis of this network via structural equation modeling (SEM) showed significant path differences across groups and music conditions, most notably a higher degree of connectivity and larger differentiation between the music conditions within the bimusicals. SEM was also used to examine the relationships among the degree of music exposure, affective responses, and activation in various brain regions. Results revealed a more complex behavioral-neural relationship in the bimusicals, suggesting that affective responses in this group are shaped by multiple behavioral and neural factors. These three lines of evidence suggest a clear differentiation of the effects of the exposure of one versus multiple musical systems.	\N	\N
21812631	To determine administration times for word recognition presented via monitored live voice (MLV) and compact disc (CD) recordings. A quasi-experimental design was used. Fifty-word NU-6 lists were presented in three conditions: (1) MLV, (2) short ISI CD recording, and (3) long ISI CD recording. Listeners with normal hearing (NH) and hearing impairment (HI) participated in this study. Average administration time using MLV was significantly shorter than using recorded word lists for both groups of listeners. MLV presentation to the NH listeners was significantly faster than the MLV presentation to the HI listeners. There were no significant differences between groups in the administration times for any of the recorded lists (long or short ISI). Considerably more variability in administration time was observed for MLV presentation compared to recorded presentations. MLV presentation was about one minute faster than the shortest CD recording of the NU-6 fifty-item word lists, but it was only 49 seconds quicker when administering tests to individuals with hearing loss. Because the majority of our patients are hearing impaired, the difference of 49 seconds is not clinically significant. This difference is even less when 25-item word lists are used.	\N	\N
21812635	The objective of this study was to evaluate hearing loss among workers exposed to styrene, alone or with noise. This cross-sectional study was conducted as part of NoiseChem, a European Commission 5th Framework Programme research project, by occupational health institutes in Finland, Sweden, and Poland. Participants' ages ranged from 18 to 72 years (n = 1620 workers). Participants exposed to styrene, alone or with noise, were from reinforced fiberglass products manufacturing plants (n = 862). Comparison groups comprised noise-exposed workers (n = 400) and controls (n = 358). Current styrene exposures ranged from 0 to 309 mg/m(3), while mean current noise levels ranged from 70 to 84 dB(A). Hearing thresholds of styrene-exposed participants were compared with Annexes A and B from ANSI S3.44, 1996. The audiometric thresholds of styrene-exposed workers were significantly poorer than those in published standards. Age, gender, and styrene exposure met the significance level criterion in the multiple logistic regression for the binary outcome 'hearing loss' (P = 0.0000). Exposure to noise (<85 dB(A) p = 0.0001; ≥85 dB(A) p = 0.0192) interacted significantly with styrene exposure. Occupational exposure to styrene is a risk factor for hearing loss, and styrene-exposed workers should be included in hearing loss prevention programs.	\N	\N
21817926	Recent studies have shown that audiovisual synchrony is recalibrated after exposure to asynchronous auditory and visual signals. This temporal recalibration has been shown only under a dual-task situation for speech signals. Here we examined whether the temporal recalibration occurs for audiovisual speech in a single-task situation using an offline adaptation method. In the experiment, participants were exposed to synchronous or asynchronous audiovisual syllables (either congruent or incongruent) for 3 min. The adaptation phase was followed by test trials, in which participants judged whether the auditory or visual stimulus was presented first. Results showed shifts in the point of subjective simultaneity and the sensitivity. Our results suggest that attention to adaptation stimuli is necessary to induce temporal recalibration for speech.	\N	\N
21824022	Attentional bias to threatening visual stimuli (words or pictures) is commonly present in anxious individuals, but not in non-anxious people. There is evidence that attentional bias to threat can be induced in all individuals when the threat is conveyed not by symbolic stimuli but by cues that predict aversive stimulation (loud noise or electric shock). However, it is not known whether attentional bias in such situations is still influenced by individual differences in anxiety. This question was addressed in two experiments using a spatial cuing task in which visual cues predicted the occurrence of an aversive event consisting of a loud human scream. Speeded attentional engagement to threat cues was positively correlated with trait anxiety in Experiment 1. Experiment 2 showed that speeded attentional engagement was present only in participants selected for high anxiety but not in low-anxious participants. In both experiments, slower disengagement from threat cues was found in all participants, irrespective of their trait anxiety levels.	\N	\N
21826005	Reduced hearing ability has been shown to influence various aspects of daily life, such as communication, psychosocial functioning, and working life. The aim of this study is to examine the association between hearing ability in noise and both sick leave and self-reported work productivity. In addition, the relationship between hearing ability and perceived health-caused limitations at work is examined. Data were collected at the baseline measurement of the Dutch "National Longitudinal Study on Hearing" and at each month during a subsequent period of 3 mo. Hearing ability was determined by means of the National Hearing Test, a speech-in-noise test over the Internet using digit triplets. The sample comprised 748 workers (385 with normal hearing ability and 363 with insufficient or poor hearing ability). Linear regression analyses revealed a significant adverse association between reduced hearing ability and self-reported absolute and differential productivity; for every dB signal-to-noise ratio (dB SNR) poorer hearing ability, self-rated absolute productivity for people experiencing little social support decreased by 0.054 points on a scale from 0 to 10 (b = -0.054; 95% confidence interval [CI] = -0.088 to -0.02). For people with less than three other chronic conditions, self-rated differential productivity also decreased significantly with decreasing hearing ability (no chronic conditions: b = -0.048 points/dB SNR on a scale from -10 to + 10, 95% CI = -0.094 to -0.001; one or two other chronic conditions: b = -0.035 points/dB SNR, 95% CI = -0.067 to -0.002). With adjustment for confounders, poorer hearing ability in noise furthermore significantly increased the odds for experiencing limitations (in the type or amount of work one could do) sometimes (odds ratio = 1.14; 95% CI = 1.07-1.21) and often to very often (odds ratio = 1.24; 95% CI = 1.05-1.45) in comparison with experiencing limitation seldom to never. 
A higher level of need for recovery among people with poorer hearing ability appeared to be one of the factors mediating the higher odds for sick leave of more than 5 days. Reduced hearing ability in noise was significantly associated with a lower self-reported absolute and differential productivity in specific cases. Also, poorer hearing increased the odds for experiencing health-caused limitations in the type or amount of work one can do. The significant relationship between hearing ability and sick leave, which was found when not adjusting for confounders, could partly be explained by a higher need for recovery among people with reduced hearing ability in noise.	\N	\N
21832862	To investigate interactions (if any) in the bone-conduction auditory steady-state response (BC ASSR) between multiple brief tones presented simultaneously. 500-, 1,000-, 2,000-, and 4,000-Hz brief tones, repeated at a rate of 77-101 Hz, were presented using a B-71 vibrator. BC ASSR thresholds and amplitudes at 50 dB nHL were measured in two conditions where the stimulus was either presented alone or together with other stimuli. Significantly larger amplitudes in the single-stimulus condition were found at 50 dB nHL. However, there was no significant threshold difference between single- and multiple-stimulus conditions. The BC ASSR thresholds (means ± SD) at 500, 1,000, 2,000, and 4,000 Hz were 96.7 ± 9.7, 75.3 ± 11.5, 65.6 ± 7.4, and 57.8 ± 7.2 dB re 1 μN ppe, respectively. Interactions occurred in the multiple-stimulus condition at high presentation levels, but not at threshold levels. The results of the present study imply that BC ASSR thresholds to multiple brief-tone stimuli can be assessed at the same time, at least in normal-hearing adults.	\N	\N
21832892	This report focuses on how speech perception, speech production, language, and literacy performance in adolescence are influenced by a common set of predictor variables obtained during elementary school in a large group of teenagers using cochlear implants (CIs). Time-lag analyses incorporating seven common predictor variables associated with the elementary school test period were evaluated. The elementary school-age variables included five contributors across the performance domains: gender, performance intelligence quotient, family size, socioeconomic status, and duration of deafness (operationally defined as the time period between the age of implantation and the onset of deafness). Regression analyses then examined how communication mode in early elementary grades influenced skills exhibited in high school and how this influence was mediated by information capacity of immediate memory. High correlations occurred between outcome measures collected at CI-E session and similar measures collected at CI-HS (values ranging from 0.75 to 0.83), indicating that the relative standing of individuals on these outcomes is highly stable over time. The best performers in elementary grades exhibit the best outcomes in high school, and early difficulties tend to persist throughout the elementary and high school years. The most highly related outcome areas were language and reading/literacy (values ranging from 0.74 to 0.88). These skills seem closely linked, and CI children who demonstrate the best vocabulary and syntax skills in elementary grades achieved the highest literacy performance in high school. Speech perception and speech production skills are also highly correlated with one another (r = 0.69 to 0.87), suggesting that the most direct result of improved auditory input from a CI is the child's ability to produce intelligible speech. 
The lowest correlations are observed between reading/literacy and speech perception (r = 0.30 to 0.54) or speech production (values ranging from 0.31 to 0.58). CI-E verbal rehearsal speed is an independent and powerful predictor of each early performance outcome, accounting for between 13% and 30% of the variance in early outcomes above and beyond that accounted for by gender, family size, socioeconomic status, performance intelligence quotient, duration of deafness, and the CI-E sign enhancement ratio. Group mean scores for language, reading, and social adjustment were generally within an SD of normative samples of typically developing age-mates with normal hearing. Use of sign to enhance spoken communication negatively influenced verbal rehearsal speed, which was a strong predictor of all early outcomes, which in turn strongly influenced later outcomes. These analyses suggest that early communication mode exerts a powerful influence on early outcomes that persist into later years. Speech perception, speech intelligibility, language, literacy, and psychosocial adjustment far exceeded that reported for similar groups before the advent of CI technology.	\N	\N
21835531	Behavioral and neurophysiological transfer effects from music experience to language processing are well-established but it is currently unclear whether or not linguistic expertise (e.g., speaking a tone language) benefits music-related processing and its perception. Here, we compare brainstem responses of English-speaking musicians/non-musicians and native speakers of Mandarin Chinese elicited by tuned and detuned musical chords, to determine if enhancements in subcortical processing translate to improvements in the perceptual discrimination of musical pitch. Relative to non-musicians, both musicians and Chinese had stronger brainstem representation of the defining pitches of musical sequences. In contrast, two behavioral pitch discrimination tasks revealed that neither Chinese nor non-musicians were able to discriminate subtle changes in musical pitch with the same accuracy as musicians. Pooled across all listeners, brainstem magnitudes predicted behavioral pitch discrimination performance but considering each group individually, only musicians showed connections between neural and behavioral measures. No brain-behavior correlations were found for tone language speakers or non-musicians. These findings point to a dissociation between subcortical neurophysiological processing and behavioral measures of pitch perception in Chinese listeners. We infer that sensory-level enhancement of musical pitch information yields cognitive-level perceptual benefits only when that information is behaviorally relevant to the listener.	\N	\N
21840170	This study investigates the effect of consensus training of listeners on intrarater and interrater reliability and agreement of perceptual voice analysis. The use of such training, including a reference voice sample, could be assumed to make the internal standards held in memory common and more robust, which is of great importance to reduce the variability of auditory perceptual ratings. A prospective design with testing before and after training. Thirteen students of audiologopedics served as listening subjects. The ratings were made using a multidimensional protocol with four-point equal-appearing interval scales. The stimuli consisted of text reading by authentic dysphonic patients. The consensus training for each perceptual voice parameter included (1) definition, (2) underlying physiology, (3) presentation of carefully selected sound examples representing the parameter in three different grades followed by group discussions of perceived characteristics, and (4) practical exercises including imitation to make use of the listeners' proprioception. Intrarater reliability and agreement showed a marked improvement for intermittent aphonia but not for vocal fry. Interrater reliability was high for most parameters before training with a slight increase after training. Interrater agreement showed marked increases for most voice quality parameters as a result of the training. The results support the recommendation of specific consensus training, including use of a reference voice sample material, to calibrate, equalize, and stabilize the internal standards held in memory by the listeners.	\N	\N
21842332	The visible movement of a talker's face is an influential component of speech perception. However, the ability of this influence to function when large areas of the face (~50%) are covered by simple substantial occlusions, and so are not visible to the observer, has yet to be fully determined. In Experiment 1, both visual speech identification and the influence of visual speech on identifying congruent and incongruent auditory speech were investigated using displays of a whole (unoccluded) talking face and of the same face occluded vertically so that the entire left or right hemiface was covered. Both the identification of visual speech and its influence on auditory speech perception were identical across all three face displays. Experiment 2 replicated and extended these results, showing that visual and audiovisual speech perception also functioned well with other simple substantial occlusions (horizontal and diagonal). Indeed, displays in which entire upper facial areas were occluded produced performance levels equal to those obtained with unoccluded displays. Occluding entire lower facial areas elicited some impairments in performance, but visual speech perception and visual speech influences on auditory speech perception were still apparent. Finally, implications of these findings for understanding the processes supporting visual and audiovisual speech perception are discussed.	\N	\N
21846981	The Nucleus Straight Research Array (SRA) cochlear implant has a new 25-mm electrode carrier designed to minimize insertion trauma, in particular allowing easy insertion via the round window. The aims of this study were to measure preoperative to postoperative benefit in terms of speech recognition in quiet and in noise in three groups of patients (electrical complement, EC; electrical stimulation, ES; electro-acoustic stimulation, EAS) with varying levels of low-frequency hearing, and to evaluate the preservation of residual hearing after implantation with the SRA cochlear implant. The study design was prospective with sequential enrolment and within-subject comparisons: 23 adult cochlear implant candidates were divided into three groups according to their level of preoperative residual hearing at 500 Hz (EC ≤50 dB; 50 dB < EAS < 80 dB; ES ≥80 dB). Monosyllabic word recognition using the SRA cochlear implant in combination with residual low-frequency hearing was assessed at 4 and 13 months after implantation. Hearing threshold levels were also monitored over time. Subjects across all three groups had significant improvements in speech recognition scores (i.e. >20 percentage points) both for listening in quiet (71% of subjects) and in noise (100% of subjects). The average score at 4 months after operation for words presented in quiet was 61.7%, and in 10 dB SNR noise 46.5%, compared to 34.4 and 10.6% preoperatively (p < 0.001). All subjects retained measurable hearing at 500 Hz in the implanted ear at 4 months after the operation; mean increases were 19, 29 and 1 dB for the EC, EAS and ES groups (n = 21). Across frequencies of 125-1000 Hz, the median increase in thresholds was 15 dB up to 13 months postoperatively (n = 15). Speech recognition performance of subjects with various levels of residual low-frequency hearing was significantly improved with the SRA cochlear implant. 
A high level and rate of hearing preservation was achieved with the SRA implanted using a round window surgical technique. Subjects with preoperative low-frequency hearing levels between 50 and 80 dB HL (EAS group) tended to lose more hearing than those with either better or worse hearing.	\N	\N
21848924	Tinnitus is characterized by an ongoing conscious perception of a sound in the absence of any external sound source. Chronic tinnitus is notoriously characterized by its resistance to treatment. In the present study the objective was to verify whether the neural generators and/or the neural tinnitus network, evaluated through EEG recordings, change over time as previously suggested by MEG. We therefore analyzed the source-localized EEG recordings of a very homogenous group of left-sided narrow-band noise tinnitus patients. Results indicate that the generators involved in tinnitus of recent onset seem to change over time with increased activity in several brain areas [auditory cortex, supplementary motor area and dorsal anterior cingulate cortex (dACC) plus insula], associated with a decrease in connectivity between the different auditory and nonauditory brain structures. An exception to this general connectivity decrease is an increase in gamma-band connectivity between the left primary and secondary auditory cortex and the left insula, and also between the auditory cortices and the right dorsal lateral prefrontal cortex. These networks are both connected to the left parahippocampal area. Thus acute and chronic tinnitus are related to differential activity and connectivity in a network comprising the auditory cortices, insula, dACC and premotor cortex.	\N	\N
21849065	Segregating auditory scenes into distinct objects or streams is one of our brain's greatest perceptual challenges. Streaming has classically been studied with bistable sound stimuli, perceived alternately as a single group or two separate groups. Throughout the last decade different methodologies have yielded inconsistent evidence about the role of auditory cortex in the maintenance of streams. In particular, studies using functional magnetic resonance imaging (fMRI) have been unable to show persistent activity within auditory cortex (AC) that distinguishes between perceptual states. We use bistable stimuli, an explicit perceptual categorization task, and a focused region of interest (ROI) analysis to demonstrate an effect of perceptual state within AC. We find that AC has more activity when listeners perceive the split percept rather than the grouped percept. In addition, within this ROI the pattern of acoustic response across voxels is significantly correlated with the pattern of perceptual modulation. In a whole-brain exploratory test, we corroborate previous work showing an effect of perceptual state in the intraparietal sulcus. Our results show that the maintenance of auditory streams is reflected in AC activity, directly relating sound responses to perception, and that perceptual state is further represented in multiple, higher level cortical regions.	\N	\N
21861386	Listener-specific features of the recognition of different emotional intonations (positive, negative and neutral) of male and female speakers in the presence or absence of background noise were studied in 49 adults aged 20-79 years. In all listeners, noise produced the most pronounced decrease in recognition accuracy for positive emotional intonation ("joy") as compared to other intonations, whereas it did not influence the recognition accuracy of "anger" in 65-79-year-old listeners. Higher emotion recognition rates for noisy signals were observed for emotional intonations expressed by female speakers. Acoustic characteristics of noisy and clear speech signals underlying the perception of emotional prosody were found for adult listeners of different age and gender.	\N	\N
21862447	The distractibility that older adults experience when listening to speech in challenging conditions has been attributed in part to reduced inhibition of irrelevant information within and across sensory systems. Whereas neuroimaging studies have shown that younger adults readily suppress visual cortex activation when listening to auditory stimuli, it is unclear to what extent declining inhibition in older adults results in reduced suppression or compensatory engagement of other sensory cortices. The current functional magnetic resonance imaging study examined the effects of age and stimulus intelligibility in a word listening task. Across all participants, auditory cortex was engaged when listening to words. However, increasing age and declining word intelligibility had independent and spatially similar effects: both were associated with increasing engagement of visual cortex. Visual cortex activation was not explained by age-related differences in vascular reactivity but rather auditory and visual cortices were functionally connected across word listening conditions. The nature of this correlation changed with age: younger adults deactivated visual cortex when activating auditory cortex, middle-aged adults showed no relation, and older adults synchronously activated both cortices. These results suggest that age and stimulus integrity are additive modulators of crossmodal suppression and activation.	\N	\N
21875609	Event-related potential (ERP) evidence indicates that listeners selectively attend to word onsets in continuous speech, but the reason for this preferential processing is unknown. The current study measured ERPs elicited by syllable onsets in an artificial language to test the hypothesis that listeners direct attention to word onsets because their identity is unpredictable. Both before and after recognition training, participants listened to a continuous stream of six nonsense words arranged in pairs, such that the second word in each pair was completely predictable. After training, first words in pairs elicited a larger negativity beginning around 100 ms after onset. This effect was not evident for the completely predictable second words in pairs. These results suggest that listeners are most likely to attend to the segments in speech that they are least able to predict.	\N	\N
21877811	The conventional articulation index (AI) measure cannot be applied in situations where non-linear operations are involved and additive noise is present. This is because the definitions of the target and masker signals become vague following non-linear processing, as both the target and masker signals are affected. The aim of the present work is to modify the basic form of the AI measure to account for non-linear processing. This was done using a new definition of the output or effective SNR obtained following non-linear processing. The proposed output SNR definition for a specific band was designed to handle cases where the non-linear processing affects predominantly the target signal rather than the masker signal. The proposed measure also takes into consideration the fact that the input SNR in a specific band cannot be improved following any form of non-linear processing. Overall, the proposed measure quantifies the proportion of input band SNR preserved or transmitted in each band after non-linear processing. High correlation (r = 0.9) was obtained with the proposed measure when evaluated with intelligibility scores obtained by normal-hearing listeners in 72 noisy conditions involving noise-suppressed speech corrupted in four different real-world maskers.	\N	\N
21884311	Infants attune to their birth language during the second half of infancy. However, internationally adopted children are often uniquely required to attune to their birth language, and then reattune to their adoptive language. Children who were adopted from India into America at ages 6-60 months (N = 8) and had minimal further exposure to their birth languages were compared to age-matched American non-adopted controls. Without training, neither group could discriminate a phonemic contrast that occurs in their birth language but not in English. However, after training on the contrast, the adopted group (N = 8) improved significantly and discriminated the contrast more accurately than their non-adopted peers. While English had explicitly replaced the birth language of the adopted sample, traces of early exposure conferred privileges on subsequent learning. These findings are consistent with behavioral and neurophysiological data from animals that have identified some of the mechanisms underlying such a 'retention without further use' phenomenon.	\N	\N
21895386	An important step in developing a theory of calibration is establishing what it is that participants become calibrated to as a result of feedback. Three experiments used a transfer of calibration paradigm to investigate this issue. In particular, these experiments investigated whether recalibration of perception of length transferred from audition to dynamic (i.e., kinesthetic) touch when objects were grasped at one end (Experiment 1), when objects were grasped at one end and when they were grasped at a different location (i.e., the middle) (Experiment 2), and when false (i.e., inflated) feedback was provided about object length (Experiment 3). In all three experiments, there was a transfer of recalibration of perception of length from audition to dynamic touch when feedback was provided on perception by audition. Such results suggest that calibration is not specific to a particular perceptual modality and are also consistent with previous research that perception of object length by audition and dynamic touch are each constrained by the object's mechanical properties.	\N	\N
21898434	To determine the effect of cochlear implantation (CI) on health-related quality of life (HRQoL), tinnitus, and psychological comorbidity in patients with severe to profound postlingual hearing loss and to analyze the relationship between these parameters. Prospective study. Using six validated questionnaires, we evaluated the pre-CI and post-CI scores of HRQoL, tinnitus, perceived stress, symptoms of depression and anxiety, and coping strategies in 43 patients implanted unilaterally with a multichannel implant for at least 6 months. In addition to improvements in hearing, speech understanding, and disease-specific HRQoL, psychological comorbidity was reduced and coping strategies were improved following CI. In the 39 tinnitus patients, their tinnitus was reduced. We found negative correlations between HRQoL and stress, depression, and anxiety. Pre-CI, tinnitus severity did not correlate with HRQoL and psychological comorbidity. However, patients with a high-level tinnitus had lower HRQoL as well as a higher level of perceived stress and anxiety symptoms than patients with a low-level tinnitus and no/incidental tinnitus before CI. Moreover, patients with severe hearing loss had a higher level of perceived symptoms of stress and depression than patients with profound hearing loss before CI. The present study provides evidence that tinnitus and psychological comorbidity may play an important role in the rehabilitation of CI patients, and that there is a correlation between HRQoL and these parameters. In addition to hearing tests, tinnitus, stress, and psychological comorbidity should be assessed using validated questionnaires before and after CI. This will help to improve the rehabilitation process.	\N	\N
21902007	The purpose of the present study was to see if 7-10-year-old socially anxious children (n = 26) made systematic errors in identifying and sending emotions in facial expressions, paralanguage, and postures as compared with the more random errors of children who were inattentive-hyperactive (n = 21). It was found that socially anxious children made more errors in identifying anger and fear in children's facial expressions and anger in adults' postures and in expressing anger in their own facial expressions than did their inattentive-hyperactive peers. Results suggest that there may be systematic difficulties specifically in visual nonverbal emotion communication that contribute to the personal and social difficulties socially anxious children experience.	\N	\N
21902880	In visual competition, the perception of ambiguous visual patterns changes spontaneously. Although the process causing this perceptual alternation remains unclear, recent evidence suggests various types of non-visual influences in resolving visual ambiguity. In the present study, we investigated cross-modal modulation of a transient stimulus on visual perceptual stability (i.e., alternation frequency). Participants observed an ambiguous visual figure and reported their perceptual alternations. Concurrently, we presented visual and auditory transient events. The results revealed that the auditory as well as visual transient events destabilize the current perception (i.e., they increase alternation frequency) around 0.5-1.5 s after the event. In addition, the magnitudes of auditory and visual effects were comparable and positively correlated within participants. These results suggest that the visual perceptual stability can be under the influence of processes that are shared by different senses.	\N	\N
21903084	Rainstorms, insect swarms, and galloping horses produce "sound textures"--the collective result of many similar acoustic events. Sound textures are distinguished by temporal homogeneity, suggesting they could be recognized with time-averaged statistics. To test this hypothesis, we processed real-world textures with an auditory model containing filters tuned for sound frequencies and their modulations, and measured statistics of the resulting decomposition. We then assessed the realism and recognizability of novel sounds synthesized to have matching statistics. Statistics of individual frequency channels, capturing spectral power and sparsity, generally failed to produce compelling synthetic textures; however, combining them with correlations between channels produced identifiable and natural-sounding textures. Synthesis quality declined if statistics were computed from biologically implausible auditory models. The results suggest that sound texture perception is mediated by relatively simple statistics of early auditory representations, presumably computed by downstream neural populations. The synthesis methodology offers a powerful tool for their further investigation.	\N	\N
21904250	The diversion of attention from a primary goal by irrelevant events is known as attention capture, and is often followed by a directed action. The hypothesis that corticospinal excitability is modulated by attention capture was tested using transcranial magnetic stimulation. Participants watched a video while sounds were intermittently presented. Motor evoked potentials (MEPs) were elicited in each hand using transcranial magnetic stimulation 1 s after sound onset. MEP amplitudes were assessed as a function of hand (dominant, nondominant), sound location (ipsilateral or contralateral to hand location), and sound sample valence (negative, neutral, positive). Results showed that MEP amplitudes increased during sound presentation, but only for the dominant hand. There were no effects of location or emotional valence. The selective modulation of the dominant hand motor cortex may indicate that auditory events can prime the preferred hand for action.	\N	\N
21909974	A key requirement for encoding the auditory environment is the ability to dynamically alter cochlear sensitivity. However, merely attaining a steady state of maximal sensitivity is not a viable solution since the sensory cells and ganglion cells of the cochlea are prone to damage following exposure to loud sound. Most often, such damage is via initial metabolic insult that can lead to cellular death. Thus, establishing the highest sensitivity must be balanced with protection against cellular metabolic damage that can lead to loss of hair cells and ganglion cells, resulting in loss of frequency representation. While feedback mechanisms are known to exist in the cochlea that alter sensitivity, they respond only after stimulus encoding, allowing potentially damaging sounds to impact the inner ear at times coincident with increased sensitivity. Thus, questions remain concerning the endogenous signaling systems involved in dynamic modulation of cochlear sensitivity and protection against metabolic stress. Understanding endogenous signaling systems involved in cochlear protection may lead to new strategies and therapies for prevention of cochlear damage and consequent hearing loss. We have recently discovered a novel cochlear signaling system that is molecularly equivalent to the classic hypothalamic-pituitary-adrenal (HPA) axis. This cochlear HPA-equivalent system functions to balance auditory sensitivity and susceptibility to noise-induced hearing loss, and also protects against cellular metabolic insults resulting from exposures to ototoxic drugs. We review the anatomy, physiology, and cellular signaling of this system, and compare it to similar signaling in other organs/tissues of the body.	\N	\N
21914244	How do children develop the mapping between prosody and other levels of linguistic knowledge? This question has received considerable attention in child language research. In the present study two experiments were conducted to investigate four- to five-year-old Mandarin-speaking children's sensitivity to prosody in ambiguity resolution. Experiment 1 used eye-tracking to assess children's use of stress in resolving structural ambiguities. Experiment 2 took advantage of special properties of Mandarin to investigate whether children can use intonational cues to resolve ambiguities involving speech acts. The results of our experiments show that children's use of prosodic information in ambiguity resolution varies depending on the type of ambiguity involved. Children can use prosodic information more effectively to resolve speech act ambiguities than to resolve structural ambiguities. This finding suggests that the mapping between prosody and semantics/pragmatics in young children is better established than the mapping between prosody and syntax.	\N	\N
21916216	Processing of auditory information in the central nervous system is based on a series of rapidly occurring neural processes that cannot be separately monitored using fMRI alone. Simultaneous recording of auditory evoked potentials (AEPs), characterized by good temporal resolution, and functional magnetic resonance imaging, with excellent spatial resolution, allows higher auditory functions to be studied with precision in both time and space. The aim was to implement the simultaneous AEP-fMRI recording method for investigating information processing at different levels of the central auditory system. Five healthy volunteers, aged 22-35 years, participated in the experiment. The study was performed using a high-field (3T) MR scanner from Siemens and a 64-channel Neuroscan electrophysiological system from Compumedics. Auditory evoked potentials generated by acoustic stimuli (standard and deviant tones) were recorded using a modified oddball procedure. Functional magnetic resonance recordings were performed using a sparse acquisition paradigm. The electrophysiological recordings were analyzed by determining the voltage distributions of the AEPs on the skull and modeling their intracerebral bioelectrical generators (dipoles). fMRI activations were determined on the basis of deviant-to-standard and standard-to-deviant functional contrasts. Results obtained from the electrophysiological studies were integrated with the functional outcomes. The morphology, amplitude, latency and voltage distribution of auditory evoked potentials (P1, N1, P2) to standard stimuli presented during simultaneous AEP-fMRI recordings were very similar to the responses obtained outside the scanner room. Significant fMRI activations to standard stimuli were found mainly in the auditory cortex. Activations in these regions corresponded with N1 wave dipoles modeled on the basis of auditory potentials generated by standard tones. 
Auditory evoked potentials to deviant stimuli were recorded only outside the MRI scanner. However, deviant stimuli induced significant fMRI activations. They were observed mainly in the anterior cingulate gyrus, insula and parietal lobes. These regions of the brain are related to attention and decision-making processes. The results showed that the applied paradigm is suitable for investigating acoustic processing at the level of the auditory cortex. The technique of simultaneous AEP-fMRI recording seems promising for investigating more complex neural processes in the central auditory system with good spatiotemporal resolution.	\N	\N
21917204	Modern health services need efficient tools for measuring outcomes from interventions, that is, tools of proven efficacy which make minimal demands on the time of clinicians in learning to administer tests and in interpreting results. This paper describes an apparatus designed to meet those requirements. The apparatus administers performance tests of spatial listening for children and adults with unilateral and bilateral cochlear implants. The apparatus was designed with guidance from clinicians. It possesses three key attributes: it is simple to use; the results of tests are scored automatically and are compared with reference data; the apparatus generates comprehensive personalized reports for individual participants that can be included in clinical notes. This paper describes the apparatus and reports results of a test measuring spatial release from masking of speech which illustrates the compatibility between the new apparatus and an older apparatus with which the reference data were gathered.	\N	\N
21917210	Bilateral cochlear implants (CIs) have been provided to children who are deaf in both ears with intent to promote binaural hearing. If it is possible to establish binaural hearing with two CIs, these children would be able to make use of interaural level and timing differences to localize sound and to distinguish between sounds separated in space. These skills are central to the ability to attend to one particular sound amidst a number of sound sources. This may be particularly important for children because they are typically learning and interacting in groups. However, the development of binaural processing could be disrupted by effects of bilateral deafness, effects of unilateral CI use, or issues related to the child's age at onset of deafness and age at the time of the first and second cochlear implantation. This research aims to determine whether binaural auditory processing is affected by these variables in an effort to determine the optimal timing for bilateral cochlear implantation in children. It is now clear that the duration of bilateral deafness should be limited in children to restrict reorganization in the auditory thalamo-cortical pathways. It has also been shown that unilateral CI use can halt such reorganization to some extent and promote auditory development. At the same time, however, unilateral input might compromise the development of binaural processing if CIs are provided sequentially. Mismatches in responses from the auditory brainstem and cortex evoked by the first and second CI after a long period of unilateral CI use suggest asymmetry in the bilateral auditory pathways which is significantly more pronounced than in children receiving bilateral implants simultaneously. Moreover, behavioural responses to level and timing differences between implants suggest that these important binaural cues are not being processed normally by children who received a second CI after a long period of unilateral CI use and at older ages. 
In sum, there may be multiple sensitive periods in the developing auditory system, which must be considered when determining the optimal timing for bilateral cochlear implantation.	\N	\N
21918451	To determine whether common approaches to setting stimulus parameters influence the depth of fine structure present in the distortion product otoacoustic emission (DPOAE) response. Because the presence of fine structure has been suggested as a possible source of errors, if one of the common parametric approaches results in reduced fine-structure depth, it may be preferred over other approaches. DPOAE responses were recorded in a group of 21 subjects with normal hearing for 1/3-octave intervals surrounding 3 f2s (1, 2, and 4 kHz) at three L2s (30, 45, and 55 dB SPL). For each f2 and L2 combination, L1 and f2/f1 were set according to three commonly used parametric approaches. These included a simple approach, the approach recommended by Kummer et al., and the approach described by Johnson et al. These three approaches primarily differ in the recommended relationship between L1 and L2. For each parametric approach, DPOAE fine structure was evaluated by varying f2 in small steps. Differences in DPOAE level and DPOAE fine-structure depth across f2, L2, and the various stimulus parameters were evaluated using repeated-measures analysis of variance. As expected, significant variations in DPOAE level were observed across the three parametric approaches. For stimulus levels ≤45 dB SPL, the simple stimuli resulted in lower DPOAE levels than were observed for other approaches. An unexpected finding was that stimulus parameters developed by Johnson et al., which were believed to produce higher DPOAE levels than other approaches, produced the lowest DPOAE levels of the three approaches when f2 = 4 kHz. Significant differences in fine-structure depth were also observed. Greater fine-structure depth was observed with the simple parameters, although this effect was restricted to L2 ≤ 45 dB SPL. When L2 = 55 dB SPL, all three parametric approaches resulted in equivalent fine-structure depth. 
A significant difference in fine-structure depth across the 3 f2s was also observed. The interval surrounding 2 kHz was associated with greater fine-structure depth than the intervals surrounding 1 and 4 kHz. The simple stimulus parameters resulted in more fine structure than the other parametric approaches; however, this effect was restricted to L2 ≤ 45 dB SPL. At the moderate stimulus levels used in most clinical applications of DPOAEs (L2 = 55 dB SPL), all three approaches resulted in similar fine-structure depths. These findings suggest that manipulating stimulus parameters, particularly the L1, L2 relationship, is not an effective technique for reducing fine structure, except at the lowest stimulus levels, and that all the common parameters result in equivalent fine structure for moderate stimulus levels. These results also suggest that the stimulus parameters used in future studies of the clinical implications of fine structure may be relatively unimportant, unless stimulus levels ≤45 dB SPL will be evaluated.	\N	\N
21921852	Bilateral stimulation through cochlear implants induces a brain activity pattern closer to the normal one than unilateral stimulation. Although it has been shown that speech comprehension through bilateral cochlear implants leads to better performances than after unilateral implantation, the existence of neural underpinnings of this improvement remains to be studied. We performed an H2O positron emission tomographic study of word recognition in 5 patients with bilateral cochlear implants and 5 normal-hearing controls. Subjects had to distinguish words from nonwords in binaural and monaural conditions. There was no overactivation in patients for binaural stimulation, with a hypoactivation in the right temporal cortex. For monaural stimulation, patients demonstrated more activation contralaterally to the stimulation side in the posterior temporal cortex and in the cerebellum. Binaural stimulation through cochlear implants is advantageous compared with the monaural at the neurofunctional level because the pattern of brain activity is closer to the normal one.	\N	\N
21924100	To explore the factors that influence the stability of a jury's ratings in a standardized study of perceptual evaluation of voice quality. Voice samples from 300 patients with dysphonia and 100 control subjects with normal voices were recorded and assessed by a jury of 6 experienced listeners from different hospitals. The samples were connected-speech recordings, presented 3 times in random order, and the mean of the 3 evaluations on a visual analogue scale was taken as the final result. The jury was instructed to classify voice samples according to the G (grade), R (rough) and B (breathy) components of the GRBAS scale on a 4-point scale ranging from 0 for normal to 3 for severe dysphonia. The κ value was used to analyze the concordance of the evaluations, and regression analysis was used to examine the effect of the severity of the voice disorder on the stability of perceptual evaluation. Discordance existed both between listeners and within individual listeners. Within-listener concordance for each evaluation parameter ranged from acceptable to very good; it was highest for G (κ value: 0.46 - 0.85), followed by R (κ value: 0.41 - 0.84) and B (κ value: 0.41 - 0.81). Concordance between listeners was worse than concordance within listeners. Excluding one listener whose concordance fell below the requirement, between-listener concordance was highest for G (κ value: 0.43 - 0.96), followed by R (κ value: 0.33 - 0.78) and B (κ value: 0.002 - 0.45). Evaluations of normal voices and severe voice disorders were more stable than evaluations of mild and moderate voice disorders. Discordance between listeners was the main factor influencing the stability of perceptual evaluation. The evaluation parameter and the severity of the voice disorder also influence the stability of the jury's perceptual evaluation.	\N	\N
21925521	The left inferior frontal gyrus (LIFG) exhibits increased responsiveness when people listen to words composed of speech sounds that frequently co-occur in the English language (Vaden, Piquado, & Hickok, 2011), termed high phonotactic frequency (Vitevitch & Luce, 1998). The current experiment aimed to further characterize the relation of phonotactic frequency to LIFG activity by manipulating word intelligibility in participants of varying age. Thirty six native English speakers, 19-79 years old (mean=50.5, sd=21.0) indicated with a button press whether they recognized 120 binaurally presented consonant-vowel-consonant words during a sparse sampling fMRI experiment (TR=8 s). Word intelligibility was manipulated by low-pass filtering (cutoff frequencies of 400 Hz, 1000 Hz, 1600 Hz, and 3150 Hz). Group analyses revealed a significant positive correlation between phonotactic frequency and LIFG activity, which was unaffected by age and hearing thresholds. A region of interest analysis revealed that the relation between phonotactic frequency and LIFG activity was significantly strengthened for the most intelligible words (low-pass cutoff at 3150 Hz). These results suggest that the responsiveness of the left inferior frontal cortex to phonotactic frequency reflects the downstream impact of word recognition rather than support of word recognition, at least when there are no speech production demands.	\N	\N
21932260	Cross-modal processing enables the utilization of information received via different sensory organs to facilitate more complicated human actions. We used functional MRI on early-blind individuals to study the neural processes associated with cross auditory-spatial learning. The auditory signals, converted from echoes of ultrasonic signals emitted from a navigation device, were novel to the participants. The subjects were trained repeatedly for 4 weeks in associating the auditory signals with different distances. Subjects' blood-oxygenation-level-dependent responses were captured at baseline and after training using a sound-to-distance judgment task. Whole-brain analyses indicated that the task used in the study involved auditory discrimination as well as spatial localization. The learning process was shown to be mediated by the inferior parietal cortex and the hippocampus, suggesting the integration and binding of auditory features to distances. The right cuneus was found to possibly serve a general rather than a specific role, forming an occipital-enhanced network for cross auditory-spatial learning. This functional network is likely to be unique to those with early blindness, since the normal-vision counterparts shared activities only in the parietal cortex.	\N	\N
21936759	The present study investigated the relationship between non-verbal behaviours and perceptions of the communication abilities of an individual with anomia secondary to traumatic brain injury (TBI). Thirty-four university students studying Communication Sciences and Disorders were randomly assigned to watch or listen to six short clips of an individual with TBI engaged in conversation. Participants rated the individual on communication parameters from a modified version of the Pragmatic Protocol and four other dependent measures of communicative competence. A significant positive correlation was identified between perceptions of gestures and ratings of overall communicative competence, and between perceptions of hand and arm movements and ratings of overall communicative competence. Participant raters who viewed the individual's movements as inappropriate also rated her overall communication abilities less favourably. This finding highlights individuality in perception of communication competence and the importance of assessing communication partners' perceptions in a client's environment to determine socially relevant treatment goals.	\N	\N
21939965	Listeners rapidly adjust to talkers' pronunciations, accommodating those pronunciations into the relevant phonemic category to improve subsequent perception. Previous work has suggested that such learning is restricted to pronunciations that are representative of how the speaker talks (Kraljic, Samuel, & Brennan, 2008). If an ambiguous pronunciation, for example, can be attributed to an external source (such as a pen in the speaker's mouth), or if it is preceded by normal pronunciations of the same sound, learning is blocked. In three experiments, we explore this blocking effect in more detail. Our aim is to better understand the nature of the representations underlying the perceptual learning process. Experiment 1 replicates the blocking effect. Experiments 2 and 3 demonstrate that it can be eliminated when certain visual information occurs simultaneously with the auditory signal. The pattern of learning and non-learning is best accounted for by the view that speech perception is mediated by episodic representations that include potentially relevant visual information.	\N	\N
21940463	Several studies report that adults and adolescents with reading disabilities also experience difficulties with selective attention. In the present study, event-related brain potentials (ERPs) were used to examine the neural mechanisms of selective attention in kindergarten children at risk for reading disabilities (AR group, n = 8) or on track in early literacy skills (OT group, n = 6) across the first semester of kindergarten. The AR group also received supplemental instruction with the Early Reading Intervention (ERI). Following ERI, the AR group demonstrated improved skills on standardized early literacy measures such that there were no significant differences between the AR and OT groups at posttest or winter follow-up. Analysis of the ERP data revealed that at the start of kindergarten, the AR group displayed reduced effects of attention on sensorineural processing compared to the OT group. Following intervention, this difference between groups disappeared, with the AR group only showing improvements in the effect of attention on sensorineural processing. These data indicate that the neural mechanisms of selective attention are atypical in kindergarten children at risk for reading failure but can be improved by effective reading interventions.	\N	\N
21942418	We report a series of experiments designed to demonstrate that the presentation of a sound can facilitate the identification of a concomitantly presented visual target letter in the backward masking paradigm. Two visual letters, serving as the target and its mask, were presented successively at various interstimulus intervals (ISIs). The results demonstrate that the crossmodal facilitation of participants' visual identification performance elicited by the presentation of a simultaneous sound occurs over a very narrow range of ISIs. This critical time-window lies just beyond the interval needed for participants to differentiate the target and mask as constituting two distinct perceptual events (Experiment 1) and can be dissociated from any facilitation elicited by making the visual target physically brighter (Experiment 2). When the sound is presented at the same time as the mask, a facilitatory, rather than an inhibitory effect on visual target identification performance is still observed (Experiment 3). We further demonstrate that the crossmodal facilitation of the visual target by the sound depends on the establishment of a reliable temporally coincident relationship between the two stimuli (Experiment 4); however, by contrast, spatial coincidence is not necessary (Experiment 5). We suggest that when visual and auditory stimuli are always presented synchronously, a better-consolidated object representation is likely to be constructed (than that resulting from unimodal visual stimulation).	\N	\N
21945200	The human auditory brainstem is known to be exquisitely sensitive to fine-grained spectro-temporal differences between speech sound contrasts, and the ability of the brainstem to discriminate between these contrasts is important for speech perception. Recent work has described a novel method for translating brainstem timing differences in response to speech contrasts into frequency-specific phase differentials. Results from this method have shown that the human brainstem response is surprisingly sensitive to phase differences inherent to the stimuli across a wide extent of the spectrum. Here we use an animal model of the auditory brainstem to examine whether the stimulus-specific phase signatures measured in human brainstem responses represent an epiphenomenon associated with far-field (i.e., scalp-recorded) measurement of neural activity, or alternatively whether these specific activity patterns are also evident in auditory nuclei that contribute to the scalp-recorded response, thereby representing a more fundamental temporal processing phenomenon. Responses in anaesthetized guinea pigs to three minimally-contrasting consonant-vowel stimuli were collected simultaneously from the cortical surface vertex and directly from central nucleus of the inferior colliculus (ICc), measuring volume conducted neural activity and multiunit, near-field activity, respectively. Guinea pig surface responses were similar to human scalp-recorded responses to identical stimuli in gross morphology as well as phase characteristics. Moreover, surface-recorded potentials shared many phase characteristics with near-field ICc activity. Response phase differences were prominent during formant transition periods, reflecting spectro-temporal differences between syllables, and showed more subtle differences during the identical steady state periods. 
ICc encoded stimulus distinctions over a broader frequency range, with differences apparent in the highest frequency ranges analyzed, up to 3000 Hz. Based on the similarity of phase encoding across sites, and the consistency and sensitivity of response phase measured within ICc, results suggest that a general property of the auditory system is a high degree of sensitivity to fine-grained phase information inherent to complex acoustical stimuli. Furthermore, results suggest that temporal encoding in ICc contributes to temporal features measured in speech-evoked scalp-recorded responses.	\N	\N
21949873	The modulation of brain activity as a function of auditory location was investigated using electro-encephalography in combination with standardized low-resolution brain electromagnetic tomography. Auditory stimuli were presented at various positions under anechoic conditions in free-field space, thus providing the complete set of natural spatial cues. Variation of electrical activity in cortical areas depending on sound location was analyzed by contrasts between sound locations at the time of the N1 and P2 responses of the auditory evoked potential. A clear-cut double dissociation with respect to the cortical locations and the points in time was found, indicating spatial processing (1) in the primary auditory cortex and posterodorsal auditory cortical pathway at the time of the N1, and (2) in the anteroventral pathway regions about 100 ms later at the time of the P2. Thus, it seems as if both auditory pathways are involved in spatial analysis but at different points in time. It is possible that the late processing in the anteroventral auditory network reflected the sharing of this region by analysis of object-feature information and spectral localization cues or even the integration of spatial and non-spatial sound features.	\N	\N
21957257	Children use information from both the auditory and visual modalities to aid in understanding speech. A dramatic illustration of this multisensory integration is the McGurk effect, an illusion in which an auditory syllable is perceived differently when it is paired with an incongruent mouth movement. However, there are significant interindividual differences in McGurk perception: some children never perceive the illusion, while others always do. Because converging evidence suggests that the posterior superior temporal sulcus (STS) is a critical site for multisensory integration, we hypothesized that activity within the STS would predict susceptibility to the McGurk effect. To test this idea, we used BOLD fMRI in 17 children aged 6-12 years to measure brain responses to the following three audiovisual stimulus categories: McGurk incongruent, non-McGurk incongruent, and congruent syllables. Two separate analysis approaches, one using independent functional localizers and another using whole-brain voxel-based regression, showed differences in the left STS between perceivers and nonperceivers. The STS of McGurk perceivers responded significantly more than that of nonperceivers to McGurk syllables, but not to other stimuli, and perceivers' hemodynamic responses in the STS were significantly prolonged. In addition to the STS, weaker differences between perceivers and nonperceivers were observed in the fusiform face area and extrastriate visual cortex. These results suggest that the STS is an important source of interindividual variability in children's audiovisual speech perception.	\N	\N
21959609	The literature suggests that contralateral acoustic stimulation (CAS) alters the amplitude of the distortion product otoacoustic emissions (DPOAEs), but it is still unknown whether the DPOAE Input/Output (I/O) functions are also affected. To elucidate this aspect of the DPOAEs, the present study assessed the effects of CAS on DPOAE I/O functions at the frequencies of 2 kHz and 4 kHz, in a sample of term neonatal subjects. Sixty randomly selected neonates were included in the study. The DPOAE I/O functions were obtained at 2 kHz and 4 kHz in the presence of a 60 dB SPL broadband white noise delivered contralaterally through TDH39 headphones. DPOAEs were recorded up to a stimulus level of L2 = 35 dB peSPL. Significant DPOAE amplitude suppression effects were observed at various L2 stimulus levels at both tested frequencies (2 and 4 kHz). In contrast, the corresponding DPOAE slopes showed various alterations that were not statistically significant. The data from the present study show that contralateral acoustic stimulation significantly affects only the amplitude of the DPOAE I/O functions; the slope is affected, but not significantly. This observation can shed light on the nature of CAS, suggesting that the latter is primarily a linear phenomenon without the cochlear compression and non-linear components seen in the healthy cochlea. From the available data it is not possible to infer whether the sample size influenced the results; the study should be repeated with a larger sample and more test frequencies.	\N	\N
21964385	Temporal summation of C-fiber evoked responses generates an increase in action potential discharge in second-order neurons and in perceived pain intensity (wind-up). This may be related to the central serotonergic system, which modulates and partly inhibits sensory input. The aim of the study was to investigate the relationship between wind-up and serotonergic activity using loudness dependence of auditory evoked potentials (LDAEP). Eighteen healthy subjects were compared to 18 patients with major depression, a disease with a putative serotonin deficit. They were examined with quantitative sensory testing (QST) using the protocol of the German Research Network on Neuropathic Pain (DFNS), including the wind-up ratio (WUR), LDAEP, and psychometric measurements. We found a slight positive correlation between WUR and LDAEP in healthy controls and depressed patients combined (r=0.340, p=0.043), indicating that WUR may be modulated by serotonergic activity. It can be concluded that inhibitory control of noxious stimuli is partly associated with central serotonergic function as indicated by LDAEP.	\N	\N
21972849	Filmmakers use continuity editing to engender a sense of situational continuity or discontinuity at editing boundaries. The goal of this study was to assess the impact of continuity editing on how people perceive the structure of events in a narrative film and to identify brain networks that are associated with the processing of different types of continuity editing boundaries. Participants viewed a commercially produced film and segmented it into meaningful events, while brain activity was recorded with functional magnetic resonance imaging (MRI). We identified three degrees of continuity that can occur at editing locations: edits that are continuous in space, time, and action; edits that are discontinuous in space or time but continuous in action; and edits that are discontinuous in action as well as space or time. Discontinuities in action had the biggest impact on behavioral event segmentation, and discontinuities in space and time had minor effects. Edits were associated with large transient increases in early visual areas. Spatial-temporal changes and action changes produced strikingly different patterns of transient change, and they provided evidence that specialized mechanisms in higher order perceptual processing regions are engaged to maintain continuity of action in the face of spatiotemporal discontinuities. These results suggest that commercial film editing is shaped to support the comprehension of meaningful events that bridge breaks in low-level visual continuity, and even breaks in continuity of spatial and temporal location.	\N	\N
21973370	Within an auditory channel, the speech waveform contains both temporal envelope (E(O)) and temporal fine structure (TFS) information. Vocoder processing extracts a modified version of the temporal envelope (E') within each channel and uses it to modulate a channel carrier. The resulting signal, E'(Carr), has reduced information content compared to the original "E(O) + TFS" signal. The dynamic range over which listeners make additional use of E(O) + TFS over E'(Carr) cues was investigated in a competing-speech task. The target-and-background mixture was processed using a 30-channel vocoder. In each channel, E(O) + TFS replaced E'(Carr) at either the peaks or the valleys of the signal. The replacement decision was based on comparing the short-term channel level to a parametrically varied "switching threshold," expressed relative to the long-term channel level. Intelligibility was measured as a function of switching threshold, carrier type, target-to-background ratio, and replacement method. Scores showed a dependence on all four parameters. Derived intensity-importance functions (IIFs) showed that E(O) + TFS information from 8-13 dB below to 10 dB above the channel long-term level was important. When E(O) + TFS information was added at the peaks, IIFs peaked around -2 dB, but when E(O) + TFS information was added at the valleys, the peaks lay around +1 dB.	\N	\N
21973372	Deutsch's octave illusion occurs when two tones that are spaced an octave apart are repeatedly presented in alternation; the sequence is presented to both ears simultaneously but offset by one tone, so that two dichotic chords are repeatedly presented in alternation. The most common illusory percept consists of an intermittent high tone in one ear alternating with an intermittent low tone in the other ear. The aim of this study was to investigate whether, once the illusory percept has emerged, the illusion will persist when the original sequence is followed by another sequence consisting of the repeated presentation of one of the two dichotic chords. Forty naïve subjects were tested with stimuli consisting first of a priming sequence containing dichotic octaves alternating between ears followed immediately by a test sequence consisting of a single dichotic octave presented repeatedly. The durations of the priming and test sequences were manipulated. The findings showed that the illusory percept is maintained after the switch from alternation to repetition and that the relative length of the priming and test sequences has a negligible influence on the persistence of the illusory percept.	\N	\N
21974490	Naive listeners' perceptual assimilations of non-native vowels to first-language (L1) categories can predict difficulties in the acquisition of second-language vowel systems. This study demonstrates that listeners having two slightly different dialects as their L1s can differ in the perception of foreign vowels. Specifically, the study shows that Bohemian Czech and Moravian Czech listeners assimilate Dutch high front vowels differently to L1 categories. Consequently, the listeners are predicted to follow different paths in acquiring these Dutch vowels. These findings underscore the importance of carefully considering the specific dialect background of participants in foreign- and second-language speech perception studies.	\N	\N
21981669	The neural representation of segmental and tonal phonological distinctions has been shown by means of the MMN ERP, yet this is not the case for intonational discourse contrasts. In Catalan, a rising-falling intonational sequence can be perceived as a statement or as a counterexpectational question, depending exclusively on the size of the pitch range interval of the rising movement. We tested here, using the MMN, whether such categorical distinctions elicited distinct neurophysiological patterns of activity, supporting their specific neural representation. From a behavioral identification experiment, we set the boundary between the two categories and defined four stimuli across the continuum. Although the physical distance between each pair of stimuli was kept constant, the central pair represented an across-category contrast, whereas the other pairs represented within-category contrasts. These four auditory stimuli were contrasted by pairs in three different oddball blocks. The mean amplitude of the MMN was larger for the across-category contrast, suggesting that intonational contrasts in the target language can be encoded automatically in the auditory cortex. These results are in line with recent findings in other fields of linguistics, showing that, when a boundary between categories is crossed, the MMN response is not just larger but rather includes a separate subcomponent.	\N	\N
21985220	The high energy demand of the auditory and visual pathways renders these sensory systems prone to diseases that impair mitochondrial function. Primary open-angle glaucoma, a neurodegenerative disease of the optic nerve, has recently been associated with a spectrum of mitochondrial abnormalities. This study sought to investigate auditory processing in individuals with open-angle glaucoma. Twenty-seven subjects with open-angle glaucoma underwent electrophysiologic (auditory brainstem response), auditory temporal processing (amplitude modulation detection), and speech perception (monosyllabic words in quiet and background noise) assessment in each ear. A cohort of age, gender and hearing level matched control subjects was also tested. While the majority of glaucoma subjects in this study demonstrated normal auditory function, there were a significant number (6/27 subjects, 22%) who showed abnormal auditory brainstem responses and impaired auditory perception in one or both ears. The finding that a significant proportion of subjects with open-angle glaucoma presented with auditory dysfunction provides evidence of systemic neuronal susceptibility. Affected individuals may suffer significant communication difficulties in everyday listening situations.	\N	\N
21987910	When the fundamental frequency (f0) is removed from a complex stimulus, the pitch of the f0 is still perceived by the listener. Through the use of the scalp-recorded frequency-following response, this study examined the relative contributions of the f0 and its harmonics in pitch processing by systematically manipulating the speech stimulus to remove component frequencies. Twelve American and 12 Chinese adults were recruited. There were statistically significant effects of pitch strength and frequency error for the experimental-condition factor. There were significantly larger responses to the harmonics-only conditions than those obtained in the f0-only and control conditions. No statistically significant difference was observed between the two groups of participants. These findings indicate that neural responses associated with individual harmonics dominate the pitch processing in the human brainstem, irrespective of whether the listener's native language is nontonal or tonal.	\N	\N
22000998	Recent years have seen a growing debate concerning the function of the cerebellum. Here we used a pitch discrimination task and PET to test for cerebellar involvement in the active control of sensory data acquisition. Specifically, we predicted greater cerebellar activity during active pitch discrimination compared to passive listening, with the greatest activity when pitch discrimination was most difficult. Ten healthy subjects were trained to discriminate deviant tones presented with a slightly higher pitch than a standard tone, using a Go/No Go paradigm. To ensure that discrimination performance was matched across subjects, individual psychometric curves were assessed beforehand using a two-step psychoacoustic procedure. Subjects were scanned while resting in the absence of any sounds, while passively listening to standard tones, and while detecting deviant tones slightly higher in pitch among these standard tones at four different performance levels. Consistent with our predictions, 1) passive listening alone elicited cerebellar activity (lobule IX), 2) cerebellar activity increased during pitch discrimination as compared to passive listening (crus I and II, lobules VI, VIIB, and VIIIB), and 3) this increase was correlated with the difficulty of the discrimination task (lobules V, VI, and IX). These results complement recent findings showing pitch discrimination deficits in cerebellar patients (Parsons et al., 2009) and further support a role for the cerebellum in sensory data acquisition. The data are discussed in the light of anatomical and physiological evidence functionally connecting auditory system and cerebellum.	\N	\N
22002633	Attentional blink (AB) refers to a phenomenon where the correct identification of a first target (i.e., target) impairs the processing of a second target (i.e., probe) nearby in time. In the present study, we investigate the influence of temporal attention on auditory AB by means of scalp-recorded event-related potentials. Participants were instructed to focus their attention on a particular time interval following the target (i.e., short, middle, or long temporal position) in order to detect the occurrence of the probe in a rapid series of distractor sounds. We found a large probe processing deficit when the probe occurred immediately after the target. This AB decreased as the time interval between the target and the probe increased and coincided with the generation of a positive wave at parietal sites (i.e., P3b). The P3b elicited by the probe peaked earlier when the probe occurred at the designated time than when it occurred at another position in time. The results indicate that temporal attention can be deployed to a particular time, which facilitates short-term consolidation of the probe.	\N	\N
22004192	Speech processing requires sensitivity to long-term regularities of the native language yet demands listeners to flexibly adapt to perturbations that arise from talker idiosyncrasies such as nonnative accent. The present experiments investigate whether listeners exhibit dimension-based statistical learning of correlations between acoustic dimensions defining perceptual space for a given speech segment. While engaged in a word recognition task guided by a perceptually unambiguous voice-onset time (VOT) acoustics to signal beer, pier, deer, or tear, listeners were exposed incidentally to an artificial "accent" deviating from English norms in its correlation of the pitch onset of the following vowel (F0) to VOT. Results across four experiments are indicative of rapid, dimension-based statistical learning; reliance on the F0 dimension in word recognition was rapidly down-weighted in response to the perturbation of the correlation between F0 and VOT dimensions. However, listeners did not simply mirror the short-term input statistics. Instead, response patterns were consistent with a lingering influence of sensitivity to the long-term regularities of English. This suggests that the very acoustic dimensions defining perceptual space are not fixed and, rather, are dynamically and rapidly adjusted to the idiosyncrasies of local experience, such as might arise from nonnative-accent, dialect, or dysarthria. The current findings extend demonstrations of "object-based" statistical learning across speech segments to include incidental, online statistical learning of regularities residing within a speech segment.	\N	\N
22005285	Feelings of deliciousness while eating are mainly produced by perceptions of sensory information extracted from the foods themselves, such as taste and olfaction. However, environmental factors might modify the feeling of deliciousness. In the present study, we investigated how the audio-visual environment affects the feeling of deliciousness while eating sweet foods. Electroencephalograms (EEGs) were recorded from the frontal region of the scalp of healthy participants under virtual scenes of a tearoom and of construction work. The participants were asked to rate deliciousness after the recordings. Frequency analyses were performed on the EEGs. While participants were eating, occupancy rates of the beta frequency band differed markedly between tearoom scenes and construction work scenes, whereas those of other frequency bands did not. When no food was eaten, in contrast, occupancy rates did not differ between the two scenes in any frequency band. With regard to deliciousness while eating sweet foods, all participants gave higher scores under the tearoom scenes than under the construction work scenes. Interestingly, there was a positive correlation between occupancy rates of the beta frequency band and deliciousness scores. These findings suggest that comfortable audio-visual environments play an important role in increasing the feeling of deliciousness while eating sweet foods, and that beta frequency rhythms may be involved in producing comprehensive feelings of deliciousness.	\N	\N
22005291	Neural sensitivity to acoustic regularities supports fundamental human behaviors such as hearing in noise and reading. Although the failure to encode acoustic regularities in ongoing speech has been associated with language and literacy deficits, how auditory expertise, such as the expertise that is associated with musical skill, relates to the brainstem processing of speech regularities is unknown. An association between musical skill and neural sensitivity to acoustic regularities would not be surprising given the importance of repetition and regularity in music. Here, we aimed to define relationships between the subcortical processing of speech regularities, music aptitude, and reading abilities in children with and without reading impairment. We hypothesized that, in combination with auditory cognitive abilities, neural sensitivity to regularities in ongoing speech provides a common biological mechanism underlying the development of music and reading abilities. We assessed auditory working memory and attention, music aptitude, reading ability, and neural sensitivity to acoustic regularities in 42 school-aged children with a wide range of reading ability. Neural sensitivity to acoustic regularities was assessed by recording brainstem responses to the same speech sound presented in predictable and variable speech streams. Through correlation analyses and structural equation modeling, we reveal that music aptitude and literacy both relate to the extent of subcortical adaptation to regularities in ongoing speech as well as with auditory working memory and attention. Relationships between music and speech processing are specifically driven by performance on a musical rhythm task, underscoring the importance of rhythmic regularity for both language and music. These data indicate common brain mechanisms underlying reading and music abilities that relate to how the nervous system responds to regularities in auditory input. 
Definition of common biological underpinnings for music and reading supports the usefulness of music for promoting child literacy, with the potential to improve reading remediation.	\N	\N
22005389	Several studies demonstrated that active exploration as compared to passive observation of a variety of objects leads to improved performance concerning these actively studied objects later on. These results may be specifically due to an improvement in perceptual recognition but in principle they may also be due to a speeding up of responses to actively studied objects. Recently, however, it was suggested that the benefit of active exploration on perceptual recognition may be restricted to a specific class of (biologically relevant) stimuli. By employing measures derived from signal detection theory we were able to show in all our three experiments that active exploration of virtual 3D objects leads to improved perceptual sensitivity in a subsequent test phase. The improvement with these objects means that the benefit of active exploration is not restricted to a specific class of biologically relevant stimuli. The results of our second experiment further demonstrate that the benefit of active exploration is even strong enough to fully compensate for the effect of perceptual degradation, thereby emphasizing the major impact of active exploration. In our third experiment, we explored the possibility that effects of active exploration might be due to major changes in attentional strategies rather than to the action-related aspect. Results revealed that an attentional requirement left the active-passive difference by and large intact supporting the view that the advantage of active object exploration lies in the action itself.	\N	\N
22006524	This article provides a demonstration of an analytical technique that can be used to investigate the causes of perceptual phenomena. The technique is based on the concept of the ideal observer, an optimal signal classifier that makes decisions that maximize the probability of a correct response. To demonstrate the technique, an analysis was conducted to investigate the role of the auditory periphery in the production of temporal masking effects. The ideal observer classified output from four models of the periphery. Since the ideal observer is the best of all possible observers, if it demonstrates masking effects, then all other observers must as well. If it does not demonstrate masking effects, then nothing about the periphery requires masking to occur, and therefore masking would occur somewhere else. The ideal observer exhibited several forward masking effects but did not exhibit backward masking, implying that the periphery has a causal role in forward but not backward masking. A general discussion of the strengths of the technique and supplementary equations are also included.	\N	\N
22010902	Language acquisition involves both acquiring a set of words (i.e. the lexicon) and learning the rules that combine them to form sentences (i.e. syntax). Here, we show that consonants are mainly involved in word processing, whereas vowels are favored for extracting and generalizing structural relations. We demonstrate that such a division of labor between consonants and vowels plays a role in language acquisition. In two very similar experimental paradigms, we show that 12-month-old infants rely more on the consonantal tier when identifying words (Experiment 1), but are better at extracting and generalizing repetition-based structures over the vocalic tier (Experiment 2). These results indicate that infants are able to exploit the functional differences between consonants and vowels at an age when they start acquiring the lexicon, and suggest that basic speech categories are assigned to different learning mechanisms that sustain early language acquisition.	\N	\N
22015572	Sparse and clustered-sparse temporal sampling fMRI protocols have been devised to reduce the influence of auditory scanner noise in the context of auditory fMRI studies. Here, we report an improvement of the previously established clustered-sparse acquisition scheme. The standard procedure currently used by many researchers in the field is a scanning protocol that includes relatively long silent pauses between image acquisitions (and therefore, a relatively long repetition time or cluster-onset asynchrony); it is during these pauses that stimuli are presented. This approach makes it unlikely that stimulus-induced BOLD response is obscured by scanner-noise-induced BOLD response. It also allows the BOLD response to drop near baseline; thus, avoiding saturation of BOLD signal and theoretically increasing effect size. A possible drawback of this approach is the limited number of stimulus presentations and image acquisitions that are possible in a given period of time, which could result in an inaccurate estimation of effect size (higher standard error). Since this line of reasoning has not yet been empirically tested, we decided to vary the cluster-onset asynchrony (7.5, 10, 12.5, and 15 s) in the context of a clustered-sparse protocol. In this study sixteen healthy participants listened to spoken sentences. We performed whole-brain fMRI group statistics and region of interest analysis with anatomically defined regions of interest (auditory core and association areas). We discovered that the protocol, which included a short cluster-onset asynchrony (7.5 s), yielded more advantageous results than the other protocols, which involved longer cluster-onset asynchrony. The short cluster-onset asynchrony protocol exhibited a larger number of activated voxels and larger mean effect sizes with lower standard errors. 
Our findings suggest that, contrary to prior experience, a short cluster-onset asynchrony is advantageous because more stimuli can be delivered within any given period of time. Alternatively, a given number of stimuli can be presented in less time, and this broadens the spectrum of possible fMRI applications.	\N	\N
22046436	The present study investigated the minimum amount of auditory stimulation that allows differentiation of spoken voices, instrumental music, and environmental sounds. Three new findings were reported. 1) All stimuli were categorized above chance level with 50 ms-segments. 2) When a peak-level normalization was applied, music and voices started to be accurately categorized with 20 ms-segments. When the root-mean-square (RMS) energy of the stimuli was equalized, voice stimuli were better recognized than music and environmental sounds. 3) Further psychoacoustical analyses suggest that the categorization of extremely brief auditory stimuli depends on the variability of their spectral envelope in the used set. These last two findings challenge the interpretation of the voice superiority effect reported in previously published studies and propose a more parsimonious interpretation in terms of an emerging property of auditory categorization processes.	\N	\N
22047947	Predictive coding theories posit that the perceptual system is structured as a hierarchically organized set of generative models with increasingly general models at higher levels. The difference between model predictions and the actual input (prediction error) drives model selection and adaptation processes minimizing the prediction error. Event-related brain potentials elicited by sensory deviance are thought to reflect the processing of prediction error at an intermediate level in the hierarchy. We review evidence from auditory and visual studies of deviance detection suggesting that the memory representations inferred from these studies meet the criteria set for perceptual object representations. Based on this evidence we then argue that these perceptual object representations are closely related to the generative models assumed by predictive coding theories.	\N	\N
22051554	Relative blindsight is said to occur when different levels of subjective awareness are obtained at equality of objective performance. Using metacontrast masking, Lau and Passingham (2006) reported relative blindsight in normal observers at the shorter of two stimulus-onset asynchronies (SOAs) between target and mask. Experiment 1 replicated the critical asymmetry in subjective awareness at equality of objective performance. We argue that this asymmetry cannot be regarded as evidence for relative blindsight because the observers' responses were based on different attributes of the stimuli (criterion contents) at the two SOAs. With an invariant criterion content (Experiment 2), there was no asymmetry in subjective awareness across the two SOAs even though objective performance was the same. Experiment 3 examined the effect of criterion level on estimates of relative blindsight. Collectively, the present results question whether metacontrast masking is a suitable paradigm for establishing relative blindsight. Implications for theories of consciousness are discussed.	\N	\N
22056506	Behavioral and neurophysiological studies have shown an enhancement of visual perception under crossmodal audiovisual stimulation, both for sensitivity and reaction times, when the stimulation in the two sensory modalities is congruent in space and time. The purpose of the present work is to verify whether congruent visual and acoustic stimulation can improve the detection of visual stimuli in people affected by low vision. Participants were asked to detect the presence of a visual stimulus (yes/no task) either presented in isolation (i.e., unimodal visual stimulation) or simultaneously with auditory stimuli, which could be placed in the same spatial position (i.e., crossmodal congruent conditions) or in different spatial positions (i.e., crossmodal incongruent conditions). The results show for the first time audiovisual integration effects in low vision individuals. In particular, a significant visual detection benefit was observed in the crossmodal congruent condition as compared to the unimodal visual condition. This effect is selective for visual stimulation that occurs in the portion of the visual field that is impaired, and disappears in the region of space in which vision is spared. Surprisingly, there is a marginal crossmodal benefit when the sound is presented 16 degrees away from the visual stimulus. The observed crossmodal effect seems to be determined by the contribution of both senses to a model of optimal combination, in which the most reliable sense provides the highest contribution. These results, indicating a significant beneficial effect of synchronous and spatially congruent sounds in a visual detection task, seem very promising for the development of a rehabilitation approach to low vision diseases based on the principles of multisensory integration.	\N	\N
22070077	To analyze the hearing loss profiles in patients with tinnitus, and thereby provide a clinical foundation for further study of the etiology and examination methods of tinnitus. Ear specialist examination, acoustic impedance test, normal frequency pure tone audiometry and extended high frequency audiometry were applied to 200 patients with chief complaint of subjective tinnitus. Among the 200 tinnitus cases, 123 (61.5%) patients were diagnosed with unilateral tinnitus, 77 (38.5%) patients with bilateral tinnitus and 46 (23.0%) cases with normal hearing. In those patients with unilateral tinnitus, the difference in hearing threshold between the affected side and the contralateral side (0.125-8 kHz) was statistically significant (P < 0.05), but in extended high frequency (> 10 kHz), the difference between the two sides was not statistically significant (P > 0.05). There was a significant difference in hearing threshold between tinnitus patients with normal and abnormal hearing in normal frequency (P < 0.05); meanwhile, the detection rate in the abnormal hearing group was lower than in the normal group. Tinnitus can occur in people with normal hearing. Further study is needed on whether extended high frequency audiometry can offer early evidence of hearing loss in patients early in the course of tinnitus.	\N	\N
22072599	Difficulty understanding speech in background noise, even with amplification to restore audibility, is a common problem for hearing-impaired individuals and is especially frequent in older adults. Despite the debilitating nature of the problem, the cause is not yet completely clear. This review considers the role of spatial processing ability in understanding speech in noise, highlights the potential impact of disordered spatial processing, and attempts to establish if aging leads to reduced spatial processing ability. Evidence supporting and opposing the hypothesis that spatial processing is disordered among the aging population is presented. With a few notable exceptions, spatial processing ability was shown to be reduced in an older population in comparison to young adults, leading to poorer speech understanding in noise. However, it is argued that to conclude aging negatively affects spatial processing ability may be oversimplified or even premature given potentially confounding factors such as cognitive ability and hearing impairment. Further research is required to determine the effect of aging and hearing impairment on spatial processing and to investigate possible remediation options for spatial processing disorder.	\N	\N
22073602	The effectiveness of the bone anchored hearing aid (BAHA) for patients with congenital aural atresia was evaluated in a multicenter clinical study in Japan. Twenty patients (17 bilateral and 3 hemilateral) with congenital auricular atresia were registered for this study and finally, 18 of them (15 bilateral and 3 unilateral) were subjected to further evaluation. The primary endpoint of this study was free sound-field pure-tone audiometry and speech threshold hearing test in quiet and noisy circumstances. The secondary endpoint of this study was patient satisfaction based upon the APHAB (Abbreviated Profile of Hearing Aid Benefit) questionnaire survey. These results were compared between before and 12 weeks after BAHA surgery. Both pure-tone hearing level and speech threshold significantly improved after BAHA surgery. APHAB scores also suggested an improvement in QOL after BAHA usage, except for the scores concerned with the unpleasantness of noisy sound. BAHA is one of the useful options for the treatment of congenital auricular atresia.	\N	\N
22080221	Gender is salient, socially critical information obtained from faces and voices, yet the brain processes underlying gender discrimination have not been well studied. We investigated neural correlates of gender processing of voices in two ERP studies. In the first, ERP differences were seen between female and male voices starting at 87 ms, in both spatial-temporal and peak analyses, particularly the fronto-central N1 and P2. As pitch differences may drive gender differences, the second study used normal, high- and low-pitch voices. The results of these studies suggested that differences in pitch produced early effects (27-63 ms). Gender effects were seen on N1 (120 ms) with implicit pitch processing (study 1), but were not seen with manipulations of pitch (study 2), demonstrating that N1 was modulated by attention. P2 (between 170 and 230 ms) discriminated male from female voices, independent of pitch. Thus, these data show that there are two stages in voice gender processing; a very early pitch or frequency discrimination and a later more accurate determination of gender at the P2 latency.	\N	\N
22087889	The effect of temporal asymmetry on amplitude modulation detection was studied using sawtooth modulators with rising (ramped) or falling (damped) temporal envelopes within each period of modulation. For pure-tone carriers, damped modulation was more detectable than ramped modulation for a 5-kHz carrier (by a threshold difference of 3.2 dB on average) but not for a 1-kHz carrier. The threshold difference obtained at 5 kHz between the ramped and damped modulators was consistent across modulation rates (8-128 Hz). This carrier frequency dependence suggests that the effect of temporal asymmetry on modulation detection originates from envelope-based, within-channel mechanisms.	\N	\N
22087927	Speech-in-noise measurements are important in clinical practice and have been the subject of research for a long time. The results of these measurements are often described in terms of the speech reception threshold (SRT) and SNR loss. Using the basic concepts that underlie several models of speech recognition in steady-state noise, the present study shows that these measures are ill-defined, most importantly because the slope of the speech recognition functions for hearing-impaired listeners always decreases with hearing loss. This slope can be determined from the slope of the normal-hearing speech recognition function when the SRT for the hearing-impaired listener is known. The SII-function (i.e., the speech intelligibility index (SII) against SNR) is important and provides insights into many potential pitfalls when interpreting SRT data. Standardized SNR loss, sSNR loss, is introduced as a universal measure of hearing loss for speech in steady-state noise. Experimental data demonstrate that, unlike the SRT or SNR loss, sSNR loss is invariant to the target point chosen, the scoring method or the type of speech material.	\N	\N
22088028	Monolingual Peruvian Spanish listeners identified natural tokens of the Canadian French (CF) and Canadian English (CE) /ɛ/ and /æ/, produced in five consonantal contexts. The results demonstrate that while the CF vowels were mapped to two different native vowels, /e/ and /a/, in all consonantal contexts, the CE contrast was mapped to the single native vowel /a/ in four out of five contexts. Linear discriminant analysis revealed that acoustic similarity between native and target language vowels was a very good predictor of context-specific perceptual mappings. Predictions are made for Spanish learners of the /ɛ/-/æ/ contrast in CF and CE.	\N	\N
22090001	The primary purpose of this study was to evaluate a group of postlingually deafened adults, whose aided speech recognition exceeded commonly accepted candidacy criteria for implantation. The study aimed to define performance and qualitative outcomes of cochlear implants in these individuals compared with their optimally fitted hearing aid(s). Retrospective case series. Tertiary referral center. All postlingually deafened subjects (N = 27), who were unsuccessful hearing aid users implanted between 2000 and 2010 with a preimplantation Hearing in Noise Test (HINT) score of 60% or more were included. We compared patients' preoperative performance (HINT score) with hearing aids to postoperative performance with the cochlear implant after 12 months of device use. In addition, the Hearing Handicap Inventory questionnaire was used to quantify the hearing-related handicap change perceived after the implantation. The study group demonstrated significant postoperative improvement on all outcome measures; most notably, the mean HINT score improved from 68.4% (standard deviation, 8.3) to 91.9% (standard deviation, 9.7). Additionally, there was a significant improvement in hearing-related handicap perceived by all patients. The envelope of implantation candidacy criteria continues to expand as shown by this study's cohort. Patient satisfaction and speech recognition results are very encouraging in support of treating those who currently perform at a level above the conventional candidacy threshold but struggle with optimally fitted hearing aids.	\N	\N
22099165	Behavioral and electrophysiological measures of target and distractor processing were examined in an auditory selective attention task before and after three weeks of distractor suppression training. Behaviorally, training improved target recognition and led to less conservative and more rapid responding. Training also effectively shortened the temporal distance between distractors and targets needed to achieve a fixed level of target sensitivity. The effects of training on event-related potentials were restricted to the distracting stimulus: earlier N1 latency, enhanced P2 amplitude, and weakened P3 amplitude. Nevertheless, as distractor P2 amplitude increased, so too did target P3 amplitude, connecting experience-dependent changes in distractor processing with greater distinctiveness of targets in working memory. We consider the effects of attention training on the processing priorities, representational noise, and inhibitory processes operating in working memory.	\N	\N
22100742	Tinnitus occurs with or without prior noise exposure (noise-induced tinnitus (NIT) and spontaneous tinnitus (ST)), and is considered a symptom related to permanent hearing impairment (HI) or temporary hearing threshold shift (TTS). To carry out a cross-sectional interview study on TTS, ST and NIT during a standard audiometric screening of 756 7-year-old children in Gothenburg. 41% out of 756 children reported either NIT or ST on several occasions, 17% reported recurrent TTS and 7% failed the audiometry screening. The probability of ST was 27% for children with no HI or TTS (OR=1.23 (95% CI 1.12 to 1.34)) but 63% (OR=1.16 (95% CI 1.02 to 1.33)) if exhibiting both HI and TTS. This study confirms an increased occurrence of spontaneous tinnitus in children with TTS or HI and in children with both TTS and HI, in particular, but also in children with normal hearing. Possibly, tinnitus in young children correlates with stress as in adolescents and adults.	\N	\N
22107443	The objective of this study was to compare two recently proposed methods for fast measurements of psychophysical tuning curves (fast-PTCs) in terms of resulting tuning curve features and training effects. Fast-PTCs with swept-noise (SN) and gated-noise (GN) maskers were measured at signal frequencies of 500, 1000, 2000, and 4000 Hz. The effect of amplitude modulating the signal in the GN condition was evaluated. Two PTC runs were obtained for each condition to assess training effects. Eight normally-hearing young adults participated in the study. The SN and GN methods resulted in similar estimates of frequency selectivity when training effects were considered. Amplitude modulating the tone in the GN method reduced the effect of training. On average, SN-PTCs were most repeatable compared to the two other methods and they were not affected by training. Estimation of the shift in the PTC tip frequency was not affected by the measurement method or training effects. Fast-PTC methods resulted in similar estimates of tuning as compared to published notched-noise data. The SN method and the GN procedure with amplitude modulated signals allowed for time-efficient estimation of frequency selectivity that was unaffected by training.	\N	\N
22115727	We used a qualitative dissociation procedure to assess semantic priming from spatially attended and unattended masked words. Participants categorized target words that were preceded by parafoveal prime words belonging to either the same (20%) or the opposite (80%) category as the target. Using this paradigm, only non-strategic use of the prime would result in facilitation of the target responses in related trials. Primes were immediately masked or masked with a delay, while spatial attention was allocated to the primes' location or away from the primes' location. Immediately masked, strongly related primes facilitated target responses irrespective of spatial attention. Delay-masked, related primes led to reversed (strategic) or facilitatory priming depending on whether they were cued or uncued. These findings demonstrate that perceiving a stimulus with or without awareness depends on both stimulus quality and attention orienting and that non-strategic priming can be observed from clearly visible but spatially unattended words.	\N	\N
22119398	Impairment in long-term memory is one of the most salient alterations in cognitive aging. Findings of age-related deficits in source monitoring and recollection have revealed a selective decline in memory for detailed information. The underlying mechanism of this phenomenon is not well understood. We hypothesized that the influence of task-irrelevant visual stimuli present in our environment interferes with retrieval of detailed memories more for older than younger adults. We compared memory performance on a recall test for visual details when older adult participants' eyes were closed versus performance when their eyes were open and irrelevant visual stimuli were presented. The results showed that the presence of irrelevant visual information diminished long-term memory performance based on an objective measure of recollection for visual details. Comparison of the current results to findings from our earlier study using the same experimental paradigm with younger adults revealed that visual distraction disrupted recollection of relevant details to a greater degree in older than younger adults. This result suggests that visual distraction overwhelms older adults' declining cognitive control resources that are instrumental in the retrieval and selection of mnemonic details. More generally, these findings explicate a mechanistic basis for selective impairment of recollection in normal aging.	\N	\N
22124890	Interaural time differences (ITDs) can be used to localize sounds in the horizontal plane. ITDs can be extracted from either the fine structure of low-frequency sounds or from the envelopes of high-frequency sounds. Studies of the latter have included stimuli with periodic envelopes like amplitude-modulated tones or transposed stimuli, and high-pass filtered Gaussian noises. Here, four experiments are presented investigating the perceptual relevance of ITD cues in synthetic and recorded "rustling" sounds. Both share the broad long-term power spectrum with Gaussian noise but provide more pronounced envelope fluctuations than Gaussian noise, quantified by an increased waveform fourth moment, W. The current data show that the JNDs in ITD for band-pass rustling sounds tended to improve with increasing W and with increasing bandwidth when the sounds were band limited. In contrast, no influence of W on JND was observed for broadband sounds, apparently because of listeners' sensitivity to ITD in low-frequency fine structure, present in the broadband sounds. Second, it is shown that for high-frequency rustling sounds ITD JNDs can be as low as 30 μs. The third result was that the amount of dominance for ITD extraction of low frequencies decreases systematically with increasing amount of envelope fluctuations. Finally, it is shown that despite the exceptionally good envelope ITD sensitivity evident with high-frequency rustling sounds, minimum audible angles of both synthetic and recorded high-frequency rustling sounds in virtual acoustic space are still best when the angular information is mediated by interaural level differences.	\N	\N
22127548	The present acoustic-phonetic study explores whether voicing and devoicing assimilations of French fricatives are equivalent in magnitude and whether they operate similarly (i.e., complete vs. gradient, obligatory vs. optional, regressive vs. progressive). It concurrently assesses the contribution of speakers' articulation rate to the proportion of voicing (i.e., voicing ratios) in /s/ and /z/ embedded in fricative#stop sequences. Data analyses show that voicing and devoicing assimilation are similar in many regards: the absolute amounts of voicing change are equivalent in magnitude (0.77, 0.78) for the two processes, and changes in voicing ratios are accompanied by changes in fricative and preceding vowel durations. These concomitant alterations result in increased acoustic-phonetic similarity between the assimilated and the non-assimilated forms, suggesting that the two processes might be complete. In addition, the two processes operate regressively and across word boundaries. However, the data show that the voicing assimilation of /s/ is not rate dependent, which suggests that it might be obligatory, while the devoicing assimilation of /z/ is rate dependent, which suggests that it might be optional.	\N	\N
22133279	To investigate the relationship between plasma reactive oxygen species (ROS) levels and severity of age-related hearing impairment in humans. We recruited 302 adult subjects aged 40-77 years with normal hearing or symmetrical sensorineural hearing loss. The associations of plasma ROS levels with pure tone average of low frequencies (PTA-low) and pure tone average of high frequencies (PTA-high) were analyzed. Luminol-dependent chemiluminescence signals, which reflect hydrogen peroxide (H2O2), hypochlorite (HOCl/OCl-) and hydroxyl radical (•OH) levels, showed a significant positive association with PTA-low and PTA-high after adjusting for age, gender, central obesity, systemic diseases, and health-related habits (smoking, drinking, antioxidant intake). Lucigenin-dependent chemiluminescence signals, which mainly reflect superoxide anion (O2•-) levels, showed a significant positive association with PTA-low, but not with PTA-high, after adjusting for other variables. We concluded that plasma ROS levels were associated with severity of age-related hearing impairment in humans. Various ROS may affect auditory function differently.	\N	\N
22162387	In typically developing (TD) individuals, behavioral and event-related potential (ERP) studies suggest that audiovisual (AV) integration enables faster and more efficient processing of speech. However, little is known about AV speech processing in individuals with autism spectrum disorders (ASD). This study examined ERP responses to spoken words to elucidate the effects of visual speech (the lip movements accompanying a spoken word) on the range of auditory speech processing stages from sound onset detection to semantic integration. The study also included an AV condition, which paired spoken words with a dynamic scrambled face in order to highlight AV effects specific to visual speech. Fourteen adolescent boys with ASD (15-17 years old) and 14 age- and verbal IQ-matched TD boys participated. The ERP of the TD group showed a pattern and topography of AV interaction effects consistent with activity within the superior temporal plane, with two dissociable effects over frontocentral and centroparietal regions. The posterior effect (200-300 ms interval) was specifically sensitive to lip movements in TD boys, and no AV modulation was observed in this region for the ASD group. Moreover, the magnitude of the posterior AV effect to visual speech correlated inversely with ASD symptomatology. In addition, the ASD boys showed an unexpected effect (P2 time window) over the frontocentral region (pooled electrodes F3, Fz, F4, FC1, FC2, FC3, FC4), which was sensitive to scrambled face stimuli. These results suggest that the neural networks facilitating processing of spoken words by visual speech are altered in individuals with ASD.	\N	\N
22171057	Neural activity in the auditory system decreases with repeated stimulation, matching stimulus probability in multiple timescales. This phenomenon, known as stimulus-specific adaptation, is interpreted as a neural mechanism of regularity encoding aiding auditory object formation. However, despite the overwhelming literature covering recordings from single-cell to scalp auditory-evoked potential (AEP), stimulation timing has received little interest. Here we investigated whether timing predictability enhances the experience-dependent modulation of neural activity associated with stimulus probability encoding. We used human electrophysiological recordings in healthy participants who were exposed to passive listening of sound sequences. Pure tones of different frequencies were delivered in successive trains of a variable number of repetitions, enabling the study of sequential repetition effects in the AEP. In the predictable timing condition, tones were delivered with isochronous interstimulus intervals; in the unpredictable timing condition, interstimulus intervals varied randomly. Our results show that unpredictable stimulus timing abolishes the early part of the repetition positivity, an AEP indexing auditory sensory memory trace formation, while leaving the later part (≈ >200 ms) unaffected. This suggests that timing predictability aids the propagation of repetition effects upstream the auditory pathway, most likely from association auditory cortex (including the planum temporale) toward primary auditory cortex (Heschl's gyrus) and beyond, as judged by the timing of AEP latencies. This outcome calls for attention to stimulation timing in future experiments regarding sensory memory trace formation in AEP measures and stimulus probability encoding in animal models.	\N	\N
22172209	Using online measures of familiar word recognition in the looking-while-listening procedure, this prospective longitudinal study revealed robust links between processing efficiency and vocabulary growth from 18 to 30 months in children classified as typically developing (n = 46) and as "late talkers" (n = 36) at 18 months. Those late talkers who were more efficient in word recognition at 18 months were also more likely to "bloom," showing more accelerated vocabulary growth over the following year, compared with late talkers less efficient in early speech processing. Such findings support the emerging view that early differences in processing efficiency evident in infancy have cascading consequences for later learning and may be continuous with individual differences in language proficiency observed in older children and adults.	\N	\N
22174701	In this review paper aimed at the non-specialist, we explore the use that neuroscientists and musicians have made of perceptual illusions based on ambiguity. The pivotal issue is auditory scene analysis (ASA), or what enables us to make sense of complex acoustic mixtures in order to follow, for instance, a single melody in the midst of an orchestra. In general, ASA uncovers the most likely physical causes that account for the waveform collected at the ears. However, the acoustical problem is ill-posed and it must be solved from noisy sensory input. Recently, the neural mechanisms implicated in the transformation of ambiguous sensory information into coherent auditory scenes have been investigated using so-called bistability illusions (where an unchanging ambiguous stimulus evokes a succession of distinct percepts in the mind of the listener). After reviewing some of those studies, we turn to music, which arguably provides some of the most complex acoustic scenes that a human listener will ever encounter. Interestingly, musicians will not always aim at making each physical source intelligible, but rather express one or more melodic lines with a small or large number of instruments. By means of a few musical illustrations and by using a computational model inspired by neuro-physiological principles, we suggest that this relies on a detailed (if perhaps implicit) knowledge of the rules of ASA and of its inherent ambiguity. We then put forward the opinion that some degree of perceptual ambiguity may participate in our appreciation of music.	\N	\N
22177410	The study of the neural basis of syntactic processing has greatly benefited from neuroimaging techniques. Research on syntactic processing in bilinguals has used a variety of techniques, including mainly functional magnetic resonance imaging (fMRI) and event-related potentials (ERP). This paper reports on a functional near-infrared spectroscopy (fNIRS) study on syntactic processing in highly proficient young adult speakers of Portuguese (mother tongue) (L1) and French (second language) (L2). They made a syntactic judgment of visually presented sentences, which either did or did not contain noun-verb agreement violations. The results showed that syntactic processing in both languages resulted in significant activation in anterior frontal regions of the left hemisphere and in the temporal superior posterior areas of the right hemisphere, with a more prominent activation for L2 in some areas. These findings corroborate previously reported neuroimaging evidence, showing the suitability of fNIRS for the study of syntactic processing in the bilingual brain.	\N	\N
22178743	Rhythm is a phenomenon that fundamentally affects the perception of events unfolding in time. In language, we define 'rhythm' as the temporal structure that underlies the perception and production of utterances, whereas 'meter' is defined as the regular occurrence of beats (i.e. stressed syllables). In stress-timed languages such as German, this regularity functions as a powerful temporal and structural cue in speech comprehension. Recent evidence shows that it also interacts with higher level linguistic faculties such as syntax (Schmidt-Kassow & Kotz, 2009a). The current ERP experiment investigated the impact of metric structure on lexico-semantic processing, comparing the effects of semantic and metric expectancy in regular and irregular metric sentence contexts. We predicted that (1) semantically unexpected words would result in an increased N400 amplitude and (2) metric context modulates the N400 amplitude. Our results confirm these predictions: semantically unexpected words elicit an N400 that is significantly smaller in a metrically regular than a metrically irregular sentence context. The current findings support the idea that metric regularity enhances the prediction of stress locations in a sentence context, which in turn facilitates lexico-semantic integration.	\N	\N
22183282	Different studies have been carried out in order to correlate audiometric thresholds and distortion product otoacoustic emission (DPOAE) measurements. However, high variability and external interference make hearing threshold estimates based on DPOAE rather insensitive. The aim of this study was to check the correspondence between the pure tone thresholds and the cochlear response thresholds obtained by DPOAE input/output (I/O) functions, considering the influence of the following variables: gender, history of acute otitis media, and ear side. Prospective study comprising 69 normal-hearing individuals. Multiple mixed regression models were applied to evaluate the correspondence between the two measurements studied. A statistically significant positive correlation was observed at all the frequencies compared (2000, 3000, 4000, and 6000 Hz). The 1 dB HL resolution pure tone thresholds and the above-mentioned variables had a direct impact on the high correlation between the measures studied, and they also reduced response variability. Nevertheless, response variability was still high, limiting the use of DPOAE I/O functions for hearing threshold estimates. We suggest that these variables should be considered in future studies of pure tone threshold estimation by DPOAE I/O functions.	\N	\N
22197571	This study aimed to assess the effect of musical training on statistical learning of tone sequences using magnetoencephalography (MEG). Specifically, MEG recordings were used to investigate the neural and functional correlates of the pre-attentive ability to detect deviance from a statistically learned tone sequence. The effect of long-term musical training on this ability was investigated by comparing the mismatch negativity (MMN) in musicians and non-musicians. Both groups (musicians and non-musicians) showed an MMN response to the deviants, and this response did not differ between them in either amplitude or latency. Another interesting finding of this study is that both groups showed a significant difference between the standards and the deviants in the P50 response, and this difference was significantly larger in the group of musicians. The larger difference in the group of musicians indicates that intensive, specialized and long-term training can enhance the ability of the auditory cortex to discriminate new auditory events from previously learned ones according to transitional probabilities. A behavioral discrimination task between the standard and the deviant sequences followed the MEG measurement. The behavioral results indicated that the detection of deviance was not explicitly learned by either group, probably due to the lack of attentional resources. These findings provide valuable insights into the functional architecture of statistical learning.	\N	\N
22199192	The effects of type of stimuli (i.e., nonspeech vs. speech), speech (i.e., natural vs. synthetic), gender of speaker and listener, speaker (i.e., self vs. other), and frequency alteration in self-produced speech on the late auditory cortical evoked potential were examined. Young adult men (n = 15) and women (n = 15), all with normal hearing, participated. P1-N1-P2 components were evoked with the following stimuli: 723-Hz tone bursts; naturally produced male and female /a/ tokens; synthetic male and female /a/ tokens; an /a/ token self-produced by each participant; and the same /a/ token produced by the participant but with a shift in frequency. In general, P1-N1-P2 component latencies were significantly shorter when evoked with the tonal stimulus versus speech stimuli and natural versus synthetic speech (p < .05). Women had significantly shorter latencies for only the P2 component (p < .05). For the tonal versus speech stimuli, P1 amplitudes were significantly smaller, and N1 and P2 amplitudes were significantly larger (p < .05). There was no significant effect of gender on the P1, N1, or P2 amplitude (p > .05). These findings are consistent with the notion that spectrotemporal characteristics of nonspeech and speech stimuli affect P1-N1-P2 latency and amplitude components.	\N	\N
22201556	Congenital amusia is a neurodevelopmental disorder that is characterized primarily by difficulties in the pitch domain. The aim of the present study was to investigate the perception of musical timbre in a group of individuals with congenital amusia by probing discrimination and short-term memory for real-world timbral stimuli as well as examining the ability of these individuals to sort instrumental tones according to their timbral similarity. Thirteen amusic individuals were matched with thirteen non-amusic controls on a range of background variables. The discrimination task included stimuli of two different durations and pairings of instrumental tones that reflected varying distances in a perceptual timbre space. Performance in the discrimination task was at ceiling for both groups. In contrast, amusic individuals scored lower than controls on the short-term timbral memory task. Amusic individuals also performed worse than controls on the sorting task, suggesting differences in the higher-order representation of musical timbre. These findings add to the emerging picture of amusia as a disorder that has consequences for the perception and memory of musical timbre, as well as pitch.	\N	\N
22209062	To investigate the relationships between objective measures and the results of subjective assessment of voice quality and speech intelligibility in patients who underwent total laryngectomy and tracheoesophageal (TE) puncture. Retrospective. Twenty patients implanted with a voice prosthesis were studied. After surgery, the entire sample underwent speech rehabilitation. The assessment protocol included maximum phonation time (MPT), number of syllables per deep breath, acoustic analysis of the sustained vowel /a/ and of a bisyllabic word, perceptual evaluation (pleasantness and intelligibility%), and self-assessment. The correlation between pleasantness and intelligibility% was statistically significant. Both of the latter were significantly correlated with the acoustic signal type, the number of formant peaks, and the F2-F1 difference. The intelligibility% and number of formant peaks were significantly correlated with the MPT and number of syllables per deep breath. Moreover, significant correlations were found between the number of formant peaks and both intelligibility% and pleasantness. The higher the number of syllables per deep breath and the longer the MPT, the significantly higher were the number of formant peaks and the intelligibility%. The study failed to show a significant correlation between patients' self-assessment of voice quality and both pleasantness and communication effectiveness. The multidimensional assessment seems to be a reliable tool to evaluate the TE functional outcome. In particular, the results showed that both pleasantness and intelligibility of TE speech are correlated with the availability of expired air and the function of the vocal tract.	\N	\N
22213748	To determine the efficacy of cochlear implantation (CI) in prelingually deafened adolescent children and to evaluate predictive variables for successful outcomes. Retrospective medical record review. Children aged 10 to 17 years with prelingual hearing loss (mean length of deafness, 11.5 years) who received a unilateral CI (mean age at CI, 12.9 years). Unilateral CI. Standard speech perception testing (Consonant-Nucleus-Consonant [CNC] monosyllabic word test and Hearing in Noise [HINT] sentence test) was performed preoperatively, 1 year postoperatively (year 1), and at the last follow-up/end of the study (EOS). There was a highly significant improvement in speech perception scores for both HINT sentence and CNC word testing from the preoperative testing to year 1 (mean change score, 51.10% and 32.23%, respectively; P < .001) and from the preoperative testing to EOS (mean change score, 60.02% and 38.73%, respectively; P < .001), with a significantly greater increase during the first year (P < .001). In addition, there was a highly significant correlation between improvements in performance scores on the CNC word and HINT sentence speech perception tests and both age at CI and length of deafness at the year 1 testing (P ≤.009) but not from the year 1 testing to EOS testing. Adolescents with progressive deafness and those using oral communication before CI performed significantly better than age-matched peers. Adolescents with prelingual deafness undergoing unilateral CI show significant improvement in objective hearing outcome measures. Patients with shorter lengths of deafness and earlier age at CI tend to outperform their peers. In addition, patients with progressive deafness and those using oral communication have significantly better objective outcomes than their peers.	\N	\N
22218296	Visual attention has temporal limitations. In the attentional blink (AB) a stream of stimuli such as letters or digits is presented to a participant on a computer monitor at a rapid rate. Embedded in the stream are two targets that the participant must try to identify. Identification of the second target is severely impaired if it is presented within approximately 500 ms of the first target. This is the 'blink' in visual attention. In this study we examined the role of the magnocellular visual pathway in the AB. This fast conducting pathway has high temporal resolution and contrast sensitivity. It is also insensitive to the direction of chromatic contrast, and this attribute was exploited in order to isolate its contributions to temporal attention. Colour defined, luminance noise masked AB streams were compared to AB streams of varying achromatic contrast. The four observers (2 female and 2 male), aged between 21 and 35 years, had normal visual acuity and colour vision. The colour stimuli produced a similar blink to the moderate contrast achromatic stimuli. This indicates that the magnocellular pathway does not have a privileged role in the attentional blink. We provide an explanation of previous apparently contradictory findings in terms of the role of different types of visual masking in the attentional blink.	\N	\N
22232388	The goal of this study was to compare clinical and research-based cochlear implant (CI) measures using telehealth versus traditional methods. This prospective study used an ABA design (A = laboratory, B = remote site). All measures were made twice per visit for the purpose of assessing within-session variability. Twenty-nine adult and pediatric CI recipients participated. Measures included electrode impedance, electrically evoked compound action potential thresholds, psychophysical thresholds using an adaptive procedure, map thresholds and upper comfort levels, and speech perception. Subjects completed a questionnaire at the end of the study. Results for all electrode-specific measures revealed no statistically significant differences between traditional and remote conditions. Speech perception was significantly poorer in the remote condition, which was likely due to the lack of a sound booth. In general, subjects indicated that they would take advantage of telehealth options at least some of the time, if such options were available. Results from this study demonstrate that telehealth is a viable option for research and clinical measures. Additional studies are needed to investigate ways to improve speech perception at remote locations that lack sound booths and to validate the use of telehealth for pediatric services (e.g., play audiometry), sound-field threshold testing, and troubleshooting equipment.	\N	\N
22232404	Older adults exhibit difficulty understanding speech that has been experimentally degraded. Age-related changes to the speech mechanism lead to natural degradations in signal quality. We tested the hypothesis that older adults with hearing loss would exhibit declines in speech recognition when listening to the speech of older adults, compared with the speech of younger adults, and would report greater amounts of listening effort in this task. Nineteen individuals with age-related hearing loss completed speech recognition and listening effort scaling tasks. Both were conducted in quiet, when listening to high- and low-predictability phrases produced by younger and older speakers, respectively. No significant difference in speech recognition existed when stimuli were derived from younger or older speakers. However, perceived effort was significantly higher when listening to speech from older adults, as compared with younger adults. For older individuals with hearing loss, natural degradations in signal quality may require greater listening effort. However, they do not interfere with speech recognition, at least in quiet. Follow-up investigation of the effect of speaker age on speech recognition and listening effort under more challenging noise conditions appears warranted.	\N	\N
22232413	The ability to detect a tone added to a random masker improves when a preview of the masker is provided. In 2 experiments, the authors explored the role that perceptual organization plays in this release from masking. Detection thresholds were measured in informational masking studies. The maskers were drawn at random prior to each trial. Masker or signal-plus-masker precursors preceded the detection interval, and the time between the precursor and the detection interval was systematically altered. In Experiment 1, the signal frequency was either fixed or random. In Experiment 2, the random masker was composed of harmonics of a common fundamental frequency (F0), and the randomly chosen signal frequency was either harmonically related to, or mistuned from, the masker's F0. For a masker precursor, the release from informational masking withstood longer precursor-detection interval delays (a) when the signal frequency was fixed versus random and (b) when the signal was mistuned relative to a harmonic of the masker's F0. These results suggest that listeners' ability to attend to the signal may contribute to the long-lived release from masking with a masker precursor.	\N	\N
22237163	The aim of this study was to investigate potential effects of age on the ability of normal-hearing (NH) adult listeners to utilize spectral and temporal cues when performing a voice gender identification task. Ten younger and 10 older NH adult listeners were measured on their ability to correctly identify the speaker gender of six different vowel tokens (H-/vowel/-D) when spoken by eight speakers (four male and four female). Spectral (number of channels) and temporal cues (low-pass cut-off frequency for temporal envelope extraction) were systematically manipulated using noiseband vocoding techniques; stimuli contained 1, 4, 8, 16, or 32 spectral channels, while the low-pass cut-off frequency of the temporal envelope filter was 20, 50, 100, 200, or 400 Hz. Furthermore, the fundamental frequencies (F0s) of the vowel tokens were manipulated to create two conditions: "Expanded" (large range of F0 values) and "Compressed" (small range of F0 values). In general, younger listeners performed better than the older listeners but only when stimuli were spectrally degraded. For both the Expanded and Compressed conditions, the overall performance of the younger listeners was better than that of the older listeners, suggesting age-related deficits in both spectral and temporal processing. Furthermore, a significant interaction between age group and temporal envelope cues revealed that older listeners received less benefit from increasing temporal envelope information compared with the benefit observed among younger listeners. In particular, the performance of the younger NH group (collapsed across number of channels), but not the older NH group, improved as the temporal envelope cut-off frequency was increased from 50 to 400 Hz. The results reported here support previous findings of senescent declines in perceiving spectrally reduced speech and temporal amplitude modulation processing. 
These results suggest that when F0 values are similar to one another, younger listeners can use temporal cues alone to glean voice-pitch information but older listeners exhibit a lessened ability to use such cues. Previous studies have demonstrated the importance of temporal envelope cues in periodicity perception (e.g., gender recognition) by cochlear implant listeners. The results of this study suggest that aging affects the use of such cues, and consequently gender recognition might be poorer among older cochlear implant recipients.	\N	\N
22245012	An event-related potential, the Phonological Mapping Negativity (PMN), has been reported to reflect recognition of phonological mismatches in speech stimuli. The purpose of the present study was to explore how the PMN response to the auditory nonsense syllable reflects phonological processing in isolation without the letter prime or lexical/semantic context. Sixty-four nonsense syllable stimuli were composed for each of three stimulus conditions: phonological match (PM), phonological mismatch with similar sound (PMMS), and phonological mismatch with different sound (PMMD), making a total of 192 stimuli. The PMN was measured from fourteen normal-hearing listeners. Electroencephalogram (EEG) activity was recorded while subjects were listening to the stimuli and responding behaviorally. Subjects were asked to determine what vowel-consonant-vowel (VCV) (e.g. /apa/) would be formed from the combination of the preceding vowel (V) (e.g. /a/) and consonant-vowel (CV) (e.g. /pa/), and press a 'correct' or 'incorrect' response button as soon as they decided whether the target VCV stimulus matched their expectation. In this way, along with the PMN, behavioral response accuracy and reaction times were obtained. The results were as follows: (1) PMN amplitude was not different by stimulus condition, (2) PMN amplitude was larger over frontal and central than posterior regions, but not different between the left versus right hemisphere, (3) PMN was detected in the absence of N400, and (4) behavioral responses were more accurate and faster in PMMD than PM and PMMS. Results indicate that the PMN can reflect phonological processing of auditory nonsense syllables in isolation. The scalp distribution of PMN is most dominant in the fronto-central regions without lateralization. Lastly, behavioral response accuracy and reaction times appear to be influenced by the extent of the task difficulty or processing demand rather than by the extent of phonological violation.	\N	\N
22247221	We employed a variant of the mask-onset delay paradigm in order to limit the availability of visual information in central and peripheral vision within individual fixations during scene viewing. Subjects viewed full-color scene photos with instructions to search for a target object (Experiment 1) or to study them for a later memory test (Experiment 2). After a fixed interval following the onset of each eye fixation (50-100 ms), the scene was scrambled either in the central visual field or over the entire display. The intact scene was presented when the subject made an eye movement. Our results reconcile different sets of findings from prior research regarding the masking of central and peripheral visual information at different intervals following fixation onset. In particular, we found that when the entire display was scrambled, both search and memory performance were impaired even at relatively long mask-onset intervals. In contrast, when central vision was scrambled, there were subtle impairments that depended on the viewing task. In the 50-ms mask-onset interval, subjects were selectively impaired at identifying, but not in locating, the search target (Experiment 1), while memory performance (Experiment 2) was unaffected in this condition, and hence, the reliance on central and peripheral visual information depends partly on the viewing task.	\N	\N
22251050	Given that semantic processes mediate early processes in the elicitation of emotions, we expect that already activated emotion-specific information can influence the elicitation of an emotion. In Experiment 1, participants were exposed to masked International Affective Picture System (IAPS) pictures that elicited either disgust or fear. Following the presentation of the primes, other IAPS pictures were presented as targets that elicited either disgust or fear. The participants' task was to classify the target picture as either disgust or fear evoking. In Experiment 2, we substituted the IAPS primes with facial expressions of either disgust or fear. In Experiment 3, we substituted the IAPS primes with the words disgust or fear. In all three experiments, we found that prime-target combinations of the same emotion were responded to faster than prime-target combinations of different emotions. Our findings suggest that the influence of primes on the elicitation of emotion is mediated by activated schemata or appraisal processes.	\N	\N
22253008	Auditory brainstem implants (ABIs) can provide highly beneficial hearing sensations to individuals deafened by bilateral vestibular schwannomas (neurofibromatosis type 2). Relatively little is known about the status of stimulated neurons after long-term ABI use. Direct examination of the cochlear nuclear complex (CN) of one 5-year ABI user indicated no deleterious effect. Recently, we examined the brainstem of a patient who used his ABI daily for 15 years with excellent performance. There was good preservation of CN cell size, morphology, and packing density, a very favorable sign considering that a number of infants are now receiving ABIs.	\N	\N
22264101	Our findings show that all cochlear implanted temporal bones had a varied degree of trauma and inflammatory reaction from cochlear implantation. No definitive relationship was observed from our limited number of specimens between residual spiral ganglion cells (SGCs) in implanted temporal bones and clinical speech performance. We hypothesize that there is a relationship between residual SGCs in cochlear implanted temporal bones and clinical speech performance. Our aim was to examine the histopathology of multi-channel cochlear implant temporal bones and to evaluate the relationship of residual SGC counts to clinical hearing performance. Temporal bones from four cochlear implant patients were examined histologically. Comparisons were made between implanted and nonimplanted temporal bones. Clinical performance data were obtained from patient charts. There were varying amounts of inflammation in the basal turn of the cochleae in all four implanted temporal bones. Trauma to the facial nerve at the facial recess was noticed in one case. Surviving dendrites varied from 5% to 30% among four cases, with no relationship to clinical performance. The speech recognition scores, measured with Central Institute of the Deaf (CID) sentence score, varied among patients from 4% to 89%, while the patient with the highest SGCs had the best clinical outcome.	\N	\N
22280721	A psychophysical pitch function, describing the relation of perceived magnitude of pitch to the frequency of a pure tone, was determined by absolute magnitude estimation. Pitch estimates were made by listeners with relative pitch and by absolute pitch possessors for 27 tones spanning a frequency range of 31.5-12,500 Hz in 1/3 octave steps. Results show that the pitch function, plotted in log-log coordinates, is steeper below 200 Hz than at higher frequencies. It is hypothesized that the pitch function's bend may reflect the diversity of neurophysiological mechanisms of pitch encoding in frequency ranges below and above 200 Hz. The variation of the function's slope implies that pitch distances between tones with the same frequency ratios are perceived as larger below 200 Hz than at higher frequencies. It is argued that this implication may apply only to a purely sensory concept of pitch distance and cannot be extended to the perception of musical intervals, a phenomenon governed by musical cognitive principles. The results also show that pitch functions obtained for listeners with relative and absolute pitch have a similar shape, which means that quantitative pitch relations determined for both groups of listeners do not differ appreciably along the frequency scale.	\N	\N
22289507	In this event-related potential (ERP) study a masking technique that prevents conscious perception of words and non-words through attentional distraction was used to reveal the temporal dynamics of word processing under non-conscious and conscious conditions. In the non-conscious condition, ERP responses differed between masked words and non-words from 112 to 160 ms after stimulus-onset over posterior brain areas. The early onset of the word-non-word differences was compatible with previous studies that reported non-conscious access to orthographic information within this time period. Moreover, source localisations provided evidence for automatic activation of prelexical phonological information, whereas no evidence for non-conscious semantic processing was found. When subjects were informed about the masking technique, lexical differences occurred at later time intervals, suggesting conscious access to additional word related information. These results indicate that early visual word processing does not depend entirely on attentional resources, but that non-conscious processing probably is restricted to rather lower-level linguistic information.	\N	\N
22290344	Synesthesia is a sensory disorder where the stimulation of one sensory modality can lead to a reaction in another which would not usually be expected to respond; for instance, someone might see a color on hearing a word such as a day of the week. Disordered perception of sensory information also appears to contribute to the pathophysiology of irritable bowel syndrome (IBS). The purpose of this exploratory study was to ascertain whether these two conditions might be linked in any way. Two hundred consecutive IBS outpatients were screened for synesthesia and compared with 200 matched healthy volunteers (controls). Positive responders were tested for two types of synesthesia (grapheme-color and music-color/shape) using a questionnaire which was repeated after 3 months to test for reproducibility. Of the 200 IBS outpatients screened, 26 (13%) patients and six (3%) controls claimed to be synesthetic (P < 0.001). Reproducibility was more variable in IBS patients than controls but despite this variability, 15 (7.5%) patients compared with 5 (2.5%) controls had greater than 75% consistency (P = 0.036), and 19 (9.5%) patients and 6 (3%) controls had greater than 50% consistency (P = 0.012). A reproducibility of less than 50% was observed in seven (3.5%) patients and no controls (0%) (P = 0.015), and these individuals were classified as having pseudo-synesthesia. IBS patients clearly differ from controls in terms of claiming to have synesthetic experiences. These results justify additional studies on the relationship between IBS and synesthesia to further understand the neural mechanisms underpinning these two conditions and to establish whether they may be linked.	\N	\N
22292985	The effects of the use of cochlear implant (CI) on speech intelligibility, speaking rate, and vowel formant characteristics and the relationships between speech intelligibility, speaking rate, and vowel formant characteristics for children are clinically important. The purposes of this study were to report on the comparisons for speaking rate and vowel space area, and their relationship with speech intelligibility, between 24 Mandarin-speaking children with CI and 24 age-sex-education level matched normal hearing (NH) controls. Participants were audio recorded as they read a designed Mandarin intelligibility test, repeated prolongation of each of the three point vowels /i/, /a/, and /u/ five times, and repeated each of three sentences carrying one point vowel five times. Compared to the NH group, the CI group exhibited: (1) mild-to-moderate speech intelligibility impairment; (2) significantly reduced speaking rate mainly due to significantly longer inter-word pauses and larger pause proportion; and (3) significantly less vowel reduction in the horizontal dimension in sustained vowel phonation. The limitations of speech intelligibility development in children after cochlear implantation were related to atypical patterns and to a smaller degree in vowel reduction and slower speaking rate resulting from less efficient articulatory movement transition.	\N	\N
22302814	Understanding speech in noisy environments is often taken for granted; however, this task is particularly challenging for people with cochlear hearing loss, even with hearing aids or cochlear implants. A significant limitation to improving auditory prostheses is our lack of understanding of the neural basis for robust speech perception in noise. Perceptual studies suggest the slowly varying component of the acoustic waveform (envelope, ENV) is sufficient for understanding speech in quiet, but the rapidly varying temporal fine structure (TFS) is important in noise. These perceptual findings have important implications for cochlear implants, which currently only provide ENV; however, neural correlates have been difficult to evaluate due to cochlear transformations between acoustic TFS and recovered neural ENV. Here, we demonstrate the relative contributions of neural ENV and TFS by quantitatively linking neural coding, predicted from a computational auditory nerve model, with perception of vocoded speech in noise measured from normal hearing human listeners. Regression models with ENV and TFS coding as independent variables predicted speech identification and phonetic feature reception at both positive and negative signal-to-noise ratios. We found that: (1) neural ENV coding was a primary contributor to speech perception, even in noise; and (2) neural TFS contributed in noise mainly in the presence of neural ENV, but rarely as the primary cue itself. These results suggest that neural TFS has less perceptual salience than previously thought due to cochlear signal processing transformations between TFS and ENV. Because these transformations differ between normal and impaired ears, these findings have important translational implications for auditory prostheses.	\N	\N
22304406	Children ask questions and learn from the responses they receive; however, little is known about how children learn from listening to others ask questions. Five experiments examined preschoolers' (N = 179) ability to solve simple problems using information gathered from listening to question-and-answer exchanges between 2 parties present in the same room. Overall, the ability to efficiently use information gathered from overheard exchanges improved between ages 3 and 5. Critically, however, across ages children solved the majority of problems correctly, suggesting preschoolers are capable of learning from others' questions. Moreover, children learned from others' questions without explicit instruction and when engaged in another activity. Implications for the development of problem-solving skills are discussed.	\N	\N
22317269	The purpose of this study was to assess normal hearing listeners' performance in detecting a stationary backup alarm signal and to quantify the linear distance at detection point. Detection distances for 12 participants with normal hearing were measured while they were fitted with 7 hearing protectors and while they were unoccluded (open ear). A standard (narrowband) backup alarm signal and a broadband (pulsed white noise) backup alarm signal from Brigade were used. The method of limits, with distance as the physical measurement variable and threshold detection as the task, was employed to find at which distance the participant could first detect the backup alarms. A within-subject Analysis of Variance (ANOVA) revealed a significant main effect of the listening conditions on the detection distance in feet. Post hoc analyses indicated that the Bilsom L3HV conventional passive earmuff (at 1132.2 ft detection distance) was significantly poorer compared to all other HPDs and the open ear in detection distance achieved, and that there were no statistically significant differences between the unoccluded ear (1652.3 ft), EB-15-Lo BlastPLG™ (1546.2 ft), EB-15-Hi BlastPLG™ (1543.4 ft), E-A-R/3M Combat Arms™ earplug-nonlinear, level-dependent state (1507.8 ft), E-A-R/3M HiFi™ earplug (1497.7 ft), and Bilsom Impact™ dichotic electronic earmuff (1567.2 ft). In addition, the E-A-R/3M Combat Arms™ earplug-passive steady state resulted in significantly shorter detection distances than only the open ear condition, at 1474.1 ft versus 1652.3 ft for the open ear. ANOVA also revealed a significant main effect of the backup alarm type on detection distance. The means were 1600.9 ft for the standard (narrowband) backup alarm signal, and a significantly closer 1379.4 ft was required for the Brigade broadband backup alarm signal. 
For on-ground workers, it is crucial to detect backup alarm signals as far away as possible rather than at close distances since this will provide them more time to react to approaching vehicles. The results of this study suggest that as the attenuation of the hearing protectors increases, precautions should be considered by safety professionals. This is because, as was the case with the Bilsom passive earmuff and E-A-R/3M Combat Arms™ earplug-passive steady state, high attenuation reduces the detection distance and as a result on-foot workers will have less time to react to any approaching vehicle. The main effects of the type of backup alarm signal demonstrated a statistically significant advantage of the standard backup alarm over the broadband backup alarm on detection distance in feet. The magnitude of the improvement produced by the standard backup alarm was 221.5 feet, a very large margin. For example, with a vehicle backing at 10 mph, the 221.5 ft decrease in detection distance with the Brigade alarm equates to the vehicle arriving 15 seconds sooner at the worker from the point at which its alarm was first heard.	\N	\N
22321294	This study was designed to separately test the effect of the cued/cueless nature of deviant stimuli and that of temporal distance between sound and deviance onsets on the mismatch negativity (MMN) as well as to look for discrepancies between behavioural discrimination performances and MMN amplitude when deviants are cueless. Ten healthy adults passively listened to stimuli that were contrasted by the presence or absence of a frequency sweep starting early or late within the sound. Discrimination performances were collected after the electrophysiological sessions. MMNs were much larger for cued than for cueless deviants. The temporal distance between sound and deviance onsets affected MMNs evoked by both cued and cueless deviants, even to the point of abolishing the MMN when cueless deviance occurred late in the stimulus. Behavioural data were at ceiling levels for all conditions, contrasting with the absence of MMN evoked by cueless deviants with late onset. Two mechanisms contribute to the MMN evoked by cued deviants: the memory comparison process and the adaptation/fresh-afferent one. Within the temporal window of integration, the delay at which each component disappears is different. Comparing waveforms evoked by cued versus cueless deviants provides a fairly simple way of isolating the MMN memory-based component.	\N	\N
22323627	The ability to detect and track relevant acoustic signals embedded in a background of other sounds is crucial for hearing in complex acoustic environments. This ability is exemplified by a perceptual phenomenon known as "rhythmic masking release" (RMR). To demonstrate RMR, a sequence of tones forming a target rhythm is intermingled with physically identical "Distracter" sounds that perceptually mask the rhythm. The rhythm can be "released from masking" by adding "Flanker" tones in adjacent frequency channels that are synchronous with the Distracters. RMR represents a special case of auditory stream segregation, whereby the target rhythm is perceptually segregated from the background of Distracters when they are accompanied by the synchronous Flankers. The neural basis of RMR is unknown. Previous studies suggest the involvement of primary auditory cortex (A1) in the perceptual organization of sound patterns. Here, we recorded neural responses to RMR sequences in A1 of awake monkeys in order to identify neural correlates and potential mechanisms of RMR. We also tested whether two current models of stream segregation, when applied to these responses, could account for the perceptual organization of RMR sequences. Results suggest a key role for suppression of Distracter-evoked responses by the simultaneous Flankers in the perceptual restoration of the target rhythm in RMR. Furthermore, predictions of stream segregation models paralleled the psychoacoustics of RMR in humans. These findings reinforce the view that preattentive or "primitive" aspects of auditory scene analysis may be explained by relatively basic neural mechanisms at the cortical level.	\N	\N
22337498	In this study, the authors (a) investigated whether a group of people with severe aphasia could learn a vocabulary of pantomime gestures through therapy and (b) compared their learning of gestures with their learning of words. The authors also examined whether gesture therapy cued word production and whether naming therapy cued gestures. Fourteen people with severe aphasia received 15 hr of gesture and naming treatments. Evaluations comprised repeated measures of gesture and word production, comparing treated and untreated items. Baseline measures were stable but improved significantly following therapy. Across the group, improvements in naming were greater than improvements in gesture. This trend was evident in most individuals' results, although 3 participants made better progress in gesture. Gains were item specific, and there was no evidence of cross-modality cueing. Items that received gesture therapy did not improve in naming, and items that received naming therapy did not improve in gesture. Results show that people with severe aphasia can respond to gesture and naming therapies. Given the unequal gains, naming may be a more productive therapy target than gesture for many (although not all) individuals with severe aphasia. The communicative benefits of therapy were not examined but are addressed in a follow-up article.	\N	\N
22352496	The standard method for the calibration of audiometric bone vibrators requires the use of an artificial mastoid, a device that converts vibratory energy to an electrical analog. The mechanical input impedance of the device is designed to represent the average mechanical impedance of the human head. For calibration purposes, it is not necessary that the coupling device represent the impedance of the head. It is only necessary that it provides a repeatable measurement of the output of the vibrator that can be related to the normal threshold of hearing at each test frequency. In addition to the mechanical output that serves as the stimulus for the hearing test, bone vibrators produce an acoustic signal that is proportional to the mechanical force delivered to the head. By determining the transfer function relating the acoustic sound pressure to the mechanical force, the acoustic signal can serve as a proxy for the vibratory stimulus. This article describes the design and validation of an acoustic coupler for the calibration of audiometric bone vibrators.	\N	\N
22352502	The reliability of distortion-product otoacoustic emission (DPOAE) measurements and their relation to loudness measurements was examined in 16 normal-hearing subjects and 58 subjects with hearing loss. The level of the distortion product (L(d)) was compared across two sessions and resulted in correlations that exceeded 0.90. The reliability of DPOAEs was less when parameters from nonlinear fits to the input/output (I/O) functions were compared across visits. Next, the relationship between DPOAE I/O parameters and the slope of the low-level portion of the categorical loudness scaling (CLS) function (soft slope) was assessed. Correlations of 0.65, 0.74, and 0.81 at 1, 2, and 4 kHz were observed between CLS soft slope and combined DPOAE parameters. Behavioral threshold had correlations of 0.82, 0.83, and 0.88 at 1, 2, and 4 kHz with CLS soft slope. Combining DPOAEs and behavioral threshold provided little additional information. Lastly, a multivariate approach utilizing the entire DPOAE I/O function was used to predict the CLS rating for each input level (dB SPL). Standard error of the estimate when using this method ranged from 2.4 to 3.0 categorical units (CU), suggesting that DPOAE I/O functions can predict CLS measures within the CU step size used in this study (5).	\N	\N
22352514	Cross-generational and cross-dialectal variation in vowels among speakers of American English was examined in terms of vowel identification by listeners and vowel classification using pattern recognition. Listeners from Western North Carolina and Southeastern Wisconsin identified 12 vowel categories produced by 120 speakers stratified by age (old adults, young adults, and children), gender, and dialect. The vowels /ɝ, o, ʊ, u/ were well identified by both groups of listeners. The majority of confusions were for the front /i, ɪ, e, ɛ, æ/, the low back /ɑ, ɔ/ and the monophthongal North Carolina /aɪ/. For selected vowels, generational differences in acoustic vowel characteristics were perceptually salient, suggesting listeners' responsiveness to sound change. Female exemplars and native-dialect variants produced higher identification rates. Linear discriminant analyses which examined dialect and generational classification accuracy showed that sampling the formant pattern at vowel midpoint only is insufficient to separate the vowels. Two sample points near onset and offset provided enough information for successful classification. The models trained on one dialect classified the vowels from the other dialect with much lower accuracy. The results strongly support the importance of dynamic information in accurate classification of cross-generational and cross-dialectal variations.	\N	\N
22352516	This study examined whether speech-on-speech masking is sensitive to variation in the degree of similarity between the target and the masker speech. Three experiments investigated whether speech-in-speech recognition varies across different background speech languages (English vs Dutch) for both English and Dutch targets, as well as across variation in the semantic content of the background speech (meaningful vs semantically anomalous sentences), and across variation in listener status vis-à-vis the target and masker languages (native, non-native, or unfamiliar). The results showed that the more similar the target speech is to the masker speech (e.g., same vs different language, same vs different levels of semantic content), the greater the interference on speech recognition accuracy. Moreover, the listener's knowledge of the target and the background language modulates the size of the release from masking. These factors had an especially strong effect on masking effectiveness in highly unfavorable listening conditions. Overall this research provided evidence that the degree of target-masker similarity plays a significant role in speech-in-speech recognition. The results also give insight into how listeners assign their resources differently depending on whether they are listening to their first or second language.	\N	\N
22352522	Automatic speech recognition (ASR) refers to the task of extracting a transcription of the linguistic content of an acoustical speech signal automatically. Despite several decades of research in this important area of acoustic signal processing, the accuracy of ASR systems is still far behind human performance, especially in adverse acoustic scenarios. In this context, one of the most challenging situations is the one concerning simultaneous speech in cocktail-party environments. Although source separation methods have already been investigated to deal with this problem, the separation process is not perfect and the resulting artifacts pose an additional problem to ASR performance. In this paper, a specific training to improve the percentage of recognized words in real simultaneous speech cases is proposed. The combination of source separation and this specific training is explored and evaluated under different acoustical conditions, leading to improvements of up to 35% in ASR performance.	\N	\N
22352609	There is substantial performance variability among listeners who transcribe degraded speech. Error patterns from 88 listeners who transcribed dysarthric speech were examined to identify differential use of syllabic strength cues for lexical segmentation. Transcripts from listeners were divided into four groups (ranging from Better- to Poorer- performing). Phrases classified as Higher- and Lower-intelligibility were analyzed separately for each performance group to assess the independent variable of severity. Results revealed that all four listener groups used syllabic strength cues for lexical segmentation of Higher-intelligibility speech, but only the Poorer listeners persisted with this strategy for the Lower-intelligibility phrases. This finding and additional analyses suggest testable hypotheses to address the role of cue-use and performance patterns.	\N	\N
22353676	Age-related declines in auditory and cognitive processing may contribute to the difficulties with listening in noise that are often reported by older adults. Such difficulties are reported even by those who have relatively good audiograms that could be considered "normal" for their age (ISO 7029-2000 [ISO, 2000]). The Speech, Spatial, and Qualities of Hearing Scale (SSQ; Gatehouse and Noble, 2004) is a questionnaire developed to measure a listener's self-reported ability to hear in a variety of everyday situations, such as those that are challenging for older adults, and it can provide insights into the possible contributions of auditory and cognitive factors to their listening difficulties. The SSQ has been shown to be a sensitive and reliable questionnaire to detect benefits associated with the use of different hearing technologies and potentially other forms of intervention. Establishing how age-matched listeners with audiograms "normal" for their age rate the items on the SSQ could enable an extension of its use in audiological assessment and in setting rehabilitative goals. The main purpose of this study was to investigate how younger and older adults who passed audiometric screening and who had thresholds considered to be "normal" for their age responded on the SSQ. It was also of interest to compare these results to those reported previously for older listeners with hearing loss in an attempt to tease out the relative effects of age and hearing loss. The SSQ was administered to 48 younger (mean age = 19 yr; SD = 1.0) and 48 older (mean age = 70 yr, SD = 4.1) adults with clinically normal audiometric thresholds below 4 kHz. The younger adults were recruited through an introductory psychology course, and the older adults were volunteers from the local community. Both age groups completed the SSQ. The differences between the groups were analyzed. 
Correlations were used to compare the pattern of results across items for the two age groups in the present study and to assess the relationship between SSQ scores and objective measures of hearing. Comparisons were also made to published results for older adults with hearing loss. The pattern of reported difficulty across items was similar for both age groups, but younger adults had significantly higher scores than older adults on 42 of the 46 items. On average, younger adults scored 8.8 (SD = 0.6) out of 10 and older adults scored 7.7 (SD = 1.2) out of 10. By comparison, scores of 5.5 (SD = 1.9) have been reported for older adults (mean age = 71 yr, SD = 8.1) with moderate hearing loss (Gatehouse and Noble, 2004). By establishing the best scores that could reasonably be expected from younger and older adults with "normal" hearing thresholds, these results provide clinicians with information that should assist them in setting realistic targets for interventions for adults of different ages.	\N	\N
22355005	To determine (a) the effect of fundamental frequency (f₀) on speech intelligibility, acceptability, and perceived gender in electrolaryngeal (EL) speakers, and (b) the effect of known gender on speech acceptability in EL speakers. A 2-part study was conducted. In Part 1, 34 healthy adults provided speech recordings using electrolarynges set at 75 Hz, 130 Hz, and 175 Hz, and 36 listeners transcribed the recordings. In Part 2, 22 speech samples were presented to 16 listeners. First, listeners identified the gender of each speaker and judged his or her speech acceptability using rating scales. Second, listeners judged the same samples for speech acceptability when gender information was provided. In Part 1, speakers were significantly more intelligible when using 75-Hz devices. In Part 2, the f₀ of the speech signal significantly impacted listeners' accuracy in perceiving the speaker's gender: In gender-incongruent conditions (males using 175-Hz devices, females using 75-Hz devices), listeners were unable to identify female speakers. Speech acceptability judgments were directly related to intelligibility. Finally, listeners differentially penalized female speakers who used 75-Hz devices when gender information was known. Low f₀ facilitated speech intelligibility. However, at low f₀, listeners were unable to identify females as female, and females were differentially penalized for speech acceptability. Results may have implications for rehabilitation.	\N	\N
22355541	The potentiality to find precursors of human language in nonhuman primates is questioned because of differences related to the genetic determinism of human and nonhuman primate acoustic structures. Limiting the debate to production and acoustic plasticity might have led to underestimating parallels between human and nonhuman primates. Adult-young differences concerning vocal usage have been reported in various primate species. A key feature of language is the ability to converse, respecting turn-taking rules. Turn-taking structures some nonhuman primates' adult vocal exchanges, but the development and the cognitive relevancy of this rule have never been investigated in monkeys. Our observations of Campbell's monkeys' spontaneous vocal utterances revealed that juveniles broke the turn-taking rule more often than did experienced adults. Only adults displayed different levels of interest when hearing playbacks of vocal exchanges respecting or not the turn-taking rule. This study strengthens parallels between human conversations and nonhuman primate vocal exchanges.	\N	\N
22361165	Sensory-motor interactions between auditory and articulatory representations in the dorsal auditory processing stream are suggested to contribute to speech perception, especially when bottom-up information alone is insufficient for purely auditory perceptual mechanisms to succeed. Here, we hypothesized that the dorsal stream responds more vigorously to auditory syllables when one is engaged in a phonetic identification/repetition task subsequent to perception compared to passive listening, and that this effect is further augmented when the syllables are embedded in noise. To this end, we recorded magnetoencephalography while twenty subjects listened to speech syllables, with and without noise masking, in four conditions: passive perception; overt repetition; covert repetition; and overt imitation. Compared to passive listening, left-hemispheric N100m equivalent current dipole responses were amplified and shifted posteriorly when perception was followed by a covert repetition task. Cortically constrained minimum-norm estimates showed amplified left supramarginal and angular gyri responses in the covert repetition condition at ~100ms from stimulus onset. Longer-latency responses at ~200ms were amplified in the covert repetition condition in the left angular gyrus and in all three active conditions in the left premotor cortex, with further enhancements when the syllables were embedded in noise. Phonetic categorization accuracy and magnitude of voice pitch change between overt repetition and imitation conditions correlated with left premotor cortex responses at ~100 and ~200ms, respectively. Together, these results suggest that the dorsal stream involvement in speech perception is dependent on perceptual task demands and that phonetic categorization performance is influenced by the left premotor cortex.	\N	\N
22364395	Young infants perceive an object's trajectory as continuous across occlusion provided the temporal or spatial gap in perception is small. In 3 experiments involving 72 participants the authors investigated the effects of different forms of auditory information on 4-month-olds' perception of trajectory continuity. Provision of dynamic auditory information about the object's trajectory enhanced perception of trajectory continuity. However, a smaller positive effect was also obtained when the sound was continuous but provided no information about the object's location. Finally, providing discontinuous auditory information or auditory information that was dislocated relative to vision had negative effects on trajectory perception. These results are discussed relative to the intersensory redundancy hypothesis and emphasize the need to take an intersensory approach to infant perception.	\N	\N
22364434	Vowels with extreme articulatory-acoustic properties act as natural referents. Infant perceptual asymmetries point to an underlying bias favoring these referent vowels. However, as language experience is gathered, distributional frequency of speech sounds could modify this initial bias. The perception of the /i/-/e/ contrast was explored in 144 Catalan- and Spanish-learning infants (2 languages with a different distribution of vowel frequency of occurrence) at 4, 6, and 12 months. The results confirmed an acoustic bias at 4 and 6 months in all infants. However, at 12 months, discrimination was not affected by the acoustic bias but by the frequency of occurrence of the vowel.	\N	\N
22366801	Many studies have shown that the visual cortex of blind humans is activated in non-visual tasks. However, the electrophysiological signals underlying this cross-modal plasticity are largely unknown. Here, we characterize the neuronal population activity in the visual and auditory cortex of congenitally blind humans and sighted controls in a complex cognitive task. We recorded magnetoencephalographic responses from participants performing semantic categorization of meaningful sounds that followed the presentation of a semantically related or unrelated haptic object. Source analysis of the spectrally resolved magnetoencephalography data revealed that: (i) neuronal responses to sounds were stronger and longer lasting in the auditory cortex of blind subjects; (ii) auditory stimulation elicited strong oscillatory responses in the visual cortex of blind subjects that closely resembled responses to visual stimulation in sighted humans; (iii) the signal in the gamma frequency range was modulated by semantic congruency between the sounds and the preceding haptic objects; and (iv) signal power in the gamma range was correlated on a trial-by-trial basis between auditory and visual cortex in blind subjects, and the strength of this correlation was modulated by semantic congruency. Our results suggest that specifically oscillatory activity in the gamma range reflects non-visual processing in the visual cortex of blind individuals. Moreover, our results provide evidence that the deprived visual cortex is functionally integrated into a larger network that serves non-visual functions.	\N	\N
22367585	In recent years, it has become evident that neural responses previously considered to be unisensory can be modulated by sensory input from other modalities. In this regard, visual neural activity elicited to viewing a face is strongly influenced by concurrent incoming auditory information, particularly speech. Here, we applied an additive-factors paradigm aimed at quantifying the impact that auditory speech has on visual event-related potentials (ERPs) elicited to visual speech. These multisensory interactions were measured across parametrically varied stimulus salience, quantified in terms of signal to noise, to provide novel insights into the neural mechanisms of audiovisual speech perception. First, we measured a monotonic increase of the amplitude of the visual P1-N1-P2 ERP complex during a spoken-word recognition task with increases in stimulus salience. ERP component amplitudes varied directly with stimulus salience for visual, audiovisual, and summed unisensory recordings. Second, we measured changes in multisensory gain across salience levels. During audiovisual speech, the P1 and P1-N1 components exhibited less multisensory gain relative to the summed unisensory components with reduced salience, while N1-P2 amplitude exhibited greater multisensory gain as salience was reduced, consistent with the principle of inverse effectiveness. The amplitude interactions were correlated with behavioral measures of multisensory gain across salience levels as measured by response times, suggesting that change in multisensory gain associated with unisensory salience modulations reflects an increased efficiency of visual speech processing.	\N	\N
22371164	The finding that serial recall performance for visually presented items is impaired by concurrently presented task-irrelevant speech or sounds is referred to as the irrelevant-speech/-sound effect (ISE). Substantial evidence has indicated that the impairment of serial rehearsal can result in an ISE, and this may be explained by several models. The present series of experiments has demonstrated an ISE in surprise nonserial recognition tasks in which participants were unaware of the need to maintain a large number of visual items for a later memory test, suggesting that neither the rehearsal nor maintenance of order information is necessary for observing the ISE. This effect was observed for both steady-state and changing-state irrelevant sounds, suggesting that the present results do not derive from a confusion of order information, but instead provide evidence that identity representations can also be impaired by irrelevant sound.	\N	\N
22371616	Auditory streaming and visual plaids have been used extensively to study perceptual organization in each modality. Both stimuli can produce bistable alternations between grouped (one object) and split (two objects) interpretations. They also share two peculiar features: (i) at the onset of stimulus presentation, organization starts with a systematic bias towards the grouped interpretation; (ii) this first percept has 'inertia'; it lasts longer than the subsequent ones. As a result, the probability of forming different objects builds up over time, a landmark of both behavioural and neurophysiological data on auditory streaming. Here we show that first percept bias and inertia are independent. In plaid perception, inertia is due to a depth ordering ambiguity in the transparent (split) interpretation that makes plaid perception tristable rather than bistable: experimental manipulations removing the depth ambiguity suppressed inertia. However, the first percept bias persisted. We attempted a similar manipulation for auditory streaming by introducing level differences between streams, to bias which stream would appear in the perceptual foreground. Here both inertia and first percept bias persisted. We thus argue that the critical common feature of the onset of perceptual organization is the grouping bias, which may be related to the transition from temporally/spatially local to temporally/spatially global computation.	\N	\N
22371621	Auditory stream segregation involves linking temporally separate acoustic events into one or more coherent sequences. For any non-trivial sequence of sounds, many alternative descriptions can be formed, only one or very few of which emerge in awareness at any time. Evidence from studies showing bi-/multistability in auditory streaming suggests that some, perhaps many, of the alternative descriptions are represented in the brain in parallel and that they continuously vie for conscious perception. Here, based on a predictive coding view, we consider the nature of these sound representations and how they compete with each other. Predictive processing helps to maintain perceptual stability by signalling the continuation of previously established patterns as well as the emergence of new sound sources. It also provides a measure of how well each of the competing representations describes the current acoustic scene. This account of auditory stream segregation has been tested on perceptual data obtained in the auditory streaming paradigm.	\N	\N
22379692	The attentional blink (AB) is a well-established paradigm in which identification of a target T2 is reduced shortly after presentation of an earlier target T1. An important question concerns the importance of backward masking during the AB. While task switching has been found to be a strong modulator mediating the AB without any masking of T2, the present study investigated whether spatial switching could similarly produce an AB without masking. Using a spatial AB paradigm in which items appeared at different locations, we found (a) a significant AB without backward masking of T2 but no AB when no distractors followed T2, (b) no evidence for Lag 1 sparing. These findings show that when there is a spatial switch between the targets, presenting the distractor following T2 at the same location as T2 (backward masking) is not a necessary condition for the AB to occur, but T2 has to be followed by surrounding distractors (appearing at different locations than T2). This pattern of data confirms that spatial switching is a robust modulator of the AB, but to a lesser extent than task switching.	\N	\N
22384765	Hemodynamic changes can be noninvasively real-time monitored in stroke patients by means of transcranial Doppler sonography (TCD). The aim of this pilot study was to assess hemodynamic changes in both middle cerebral arteries (MCA) in aphasic stroke patients by means of TCD during verbal stimulation. Eight aphasic patients with stroke in the territory of the left MCA were tested by modified Boston Diagnostic Aphasia Examination (BDAE) within 3 days of stroke onset. Both MCA were monitored simultaneously by means of TCD with 2 MHz probes. Basic MCA mean blood flow velocity (MBFV) values were assessed and monitored during verbal stimulation. Verbal stimulation was performed with 30 photos of objects for daily usage, arranged by function. The same test was performed in 16 right-handed healthy controls. In stroke patients, the mean MBFV were 56 cm/s in the left MCA and 56 cm/s in the right MCA. A mean 30% increase was observed in the left MCA and 22% in the right MCA. In healthy controls, a mean 21.7% increase was observed in the left MCA and 18% in the right MCA. A trend toward higher percentage of MBFV increase was observed in the left MCA during verbal stimulations in aphasic patients as compared to control subjects.	\N	\N
22390292	Human multisensory systems are known to bind inputs from the different sensory modalities into a unified percept, a process that leads to measurable behavioral benefits. This integrative process can be observed through multisensory illusions, including the McGurk effect and the sound-induced flash illusion, both of which demonstrate the ability of one sensory modality to modulate perception in a second modality. Such multisensory integration is highly dependent upon the temporal relationship of the different sensory inputs, with perceptual binding occurring within a limited range of asynchronies known as the temporal binding window (TBW). Previous studies have shown that this window is highly variable across individuals, but it is unclear how these variations in the TBW relate to an individual's ability to integrate multisensory cues. Here we provide evidence linking individual differences in multisensory temporal processes to differences in the individual's audiovisual integration of illusory stimuli. Our data provide strong evidence that the temporal processing of multiple sensory signals and the merging of multiple signals into a single, unified perception are highly related. Specifically, the width of the right side of an individual's TBW, where the auditory stimulus follows the visual, is significantly correlated with the strength of illusory percepts, as indexed via both an increase in the strength of binding synchronous sensory signals and an improvement in correctly dissociating asynchronous signals. These findings are discussed in terms of their possible neurobiological basis, relevance to the development of sensory integration, and possible importance for clinical conditions in which there is growing evidence that multisensory integration is compromised.	\N	\N
22390745	The purpose of this study was to investigate the effect of the Global Voice Therapy Model (GVTM) on acoustic, aerodynamic, and perceptual voice measures of four adults seeking voice therapy for a voice disorder. A speech-language pathologist facilitated speaking voice therapy with the four participants using the GVTM. Participants completed acoustic, aerodynamic, and perceptual voice measures at pre- and post-therapy time points. Differences were seen in the voice measures from pre- to post-therapy. The GVTM was successful in facilitating an improvement in the acoustic, aerodynamic, and perceptual measures of the participants.	\N	\N
22395654	Tactile sensation, which is one of the earliest developing sensory systems, is very important in the perception of an individual's body and the surrounding physical environment, especially in newborns. However, currently, only little is known about the response of a newborn's brain to tactile sensation. The objective of the present study was to determine the response of a newborn's brain to tactile sensation and to compare the brain responses to various sensory stimuli. Ten healthy newborns, 2-9 days after birth, were enrolled. A multichannel near-infrared spectroscopy system was used to measure brain responses. The probe array covered broad cortical areas, including the parietal, temporal, and occipital areas. We measured cortical hemodynamic changes in response to three different types of stimuli: tactile, auditory, and visual. Activated areas were analyzed by t-tests, and the number of activated channels among the three different stimuli was compared by χ²-tests. The results showed that when the brain responded to each type of stimulation, the corresponding primary sensory area was activated, and tactile stimuli induced broader areas of brain activation than the other two types of stimuli (auditory or visual). Thus, broad brain areas, including the temporal and parietal areas, were activated by tactile stimuli in early newborn periods. These results suggest that there are differences in newborns' reactions to various types of sensory stimuli, which may reflect the importance of tactile sensation in the early newborn period.	\N	\N
22403933	Often it is difficult to find a natural explanation as to why a surprising coincidence occurs. In attempting to find one, people may be inclined to accept paranormal explanations. The objective of this study was to investigate whether people with a lower threshold for being surprised by coincidences have a greater propensity to become believers compared to those with a higher threshold. Participants were exposed to artificial coincidences, which were formally defined as less or more probable, and were asked to provide remarkability ratings. Paranormal belief was measured by the Australian Sheep-Goat Scale. An analysis of the remarkability ratings revealed a significant interaction effect between Sheep-Goat score and type of coincidence, suggesting that people with lower thresholds of surprise, when experiencing coincidences, harbor higher paranormal belief than those with a higher threshold. The theoretical aspects of these findings were discussed.	\N	\N
22405960	Transcranial magnetic stimulation (TMS) and neuroimaging studies suggest a functional link between the emotion-related brain areas and the motor system. It is not well understood, however, whether the motor cortex activity is modulated by specific emotions experienced during music listening. In 23 healthy volunteers, we recorded the motor evoked potentials (MEP) following TMS to investigate the corticospinal excitability while subjects listened to music pieces evoking different emotions (happiness, sadness, fear, and displeasure), an emotionally neutral piece, and a control stimulus (musical scale). Quality and intensity of emotions were previously rated in an additional group of 30 healthy subjects. Fear-related music significantly increased the MEP size compared to the neutral piece and the control stimulus. This effect was not seen with music inducing other emotional experiences and was not related to changes in autonomic variables (respiration rate, heart rate). Current data indicate that also in a musical context, the excitability of the corticomotoneuronal system is related to the emotion expressed by the listened piece.	\N	\N
22405961	Conceptual knowledge is classically supposed to be abstract and represented in an amodal unitary system, distinct from the sensory and motor brain systems. A more recent embodiment view of conceptual knowledge, however, proposes that concepts are grounded in distributed modality-specific brain areas which typically process sensory or action-related object information. Recent neuroimaging evidence suggested the significance of left auditory association cortex encompassing posterior superior and middle temporal gyrus in coding conceptual sound features of everyday objects. However, a causal role of this region in processing conceptual sound information has yet to be established. Here we had the unique chance to investigate a patient, JR, with a focal lesion in left posterior superior and middle temporal gyrus. To test the necessity of this region in conceptual and perceptual processing of sound information we administered four different experimental tasks to JR: Visual word recognition, category fluency, sound recognition and voice classification. Compared with a matched control group, patient JR was consistently impaired in conceptual processing of sound-related everyday objects (e.g., "bell"), while performance for non-sound-related everyday objects (e.g., "armchair"), animals, whether they typically produce sounds (e.g., "frog") or not (e.g., "tortoise"), and musical instruments (e.g., "guitar") was intact. An analogous deficit pattern in JR was also obtained for perceptual recognition of the corresponding sounds. Hence, damage to left auditory association cortex specifically impairs perceptual and conceptual processing of sounds from everyday objects. In support of modality-specific theories, these findings strongly evidence the necessity of auditory association cortex in coding sound-related conceptual information.	\N	\N
22410432	This article reports on an investigation of graphophonological processes in deaf readers of French over a 1-year period. Deaf readers are known to have a phonological deficit compared to hearing peers, and conclusions from studies on this question are often conflicting. Among the different types of phonological processing, we can identify graphophonological processes based on correspondences between the oral and the written language. In this investigation, we evaluated graphophonemic and graphosyllabic processes using, in each case, two different tasks varying in their degree of cognitive constraints (CC- vs. CC+). Nineteen 11 year-old deaf students were compared to younger normal readers of the same reading level (RA, n = 17) and to normal readers of the same age (CA, n = 20). Two variables were considered in the analyses: accuracy and response latency. Results show that deaf readers do process written items at the graphophonological level and that graphophonological processes are related to reading ability. Also, results indicate main effects of task (CC- vs. CC+), time (T1 vs. T2), and group. In general, deaf participants' performances are comparable to those of RA and differ from those of CA. Results are discussed within the framework of the study of phonology in deaf readers and its relation to reading acquisition.	\N	\N
22411494	To determine the effects of noise and speech style on word learning in typically developing school-age children. Thirty-one participants ages 9;0 (years;months) to 10;11 attempted to learn 2 sets of 8 novel words and their referents. They heard all of the words 13 times each within meaningful narrative discourse. Signal-to-noise ratio (noise vs. quiet) and speech style (plain vs. clear) were manipulated such that half of the children heard the new words in broadband white noise and half heard them in quiet; within those conditions, each child heard one set of words produced in a plain speech style and another set in a clear speech style. Children who were trained in quiet learned to produce the word forms more accurately than those who were trained in noise. Clear speech resulted in more accurate word form productions than plain speech, whether the children had learned in noise or quiet. Learning from clear speech in noise and plain speech in quiet produced comparable results. Noise limits expressive vocabulary growth in children, reducing the quality of word form representation in the lexicon. Clear speech input can aid expressive vocabulary growth in children, even in noisy environments.	\N	\N
22411713	This article describes the development and evaluation of The University of Western Ontario (UWO) Plurals Test, which is an English language measure of detection of the word-final fricative cue for plurality. Normative data are provided for 26 listeners with normal hearing and 24 listeners with hearing impairment (children and adults), as are evaluations of the acoustical properties of the stimuli, the test's test-retest reliability, and the test's sensitivity to changes in hearing aid signal processing (e.g., nonlinear frequency compression). Results indicate reliable, repeated outcome measurement at the level of the individual. When compared to a global measure of real-world listening preference, the UWO Plurals Test was found to be somewhat sensitive to the effects of changes in hearing aid signal processing. Findings suggest potential use of the UWO Plurals Test to evaluate aided and unaided ability of listeners between the ages of 6 and 81 years to detect the word-final fricatives /s/ and /z/ as they occur in English plural nouns.	\N	\N
22414595	Musicians' skills in auditory processing depend highly on instrument, performance practice, and on level of expertise. Yet it is not known whether the style/genre of music might shape auditory processing in the brains of musicians. Here, we aimed at tackling the role of musical style/genre in modulating neural and behavioral responses to changes in musical features. Using a novel, fast and musical sounding multi-feature paradigm, we measured the mismatch negativity (MMN), a pre-attentive brain response, to six types of musical feature change in musicians playing three distinct styles of music (classical, jazz, rock/pop) and in non-musicians. Jazz and classical musicians scored higher in the musical aptitude test than band musicians and non-musicians, especially with regards to tonal abilities. These results were extended by the MMN findings: jazz musicians had larger MMN-amplitude than all other experimental groups across the six different sound features, indicating a greater overall sensitivity to auditory outliers. In particular, we found enhanced processing of pitch and sliding up to pitches in jazz musicians only. Furthermore, we observed a more frontal MMN to pitch and location compared to the other deviants in jazz musicians and left lateralization of the MMN to timbre in classical musicians. These findings indicate that the characteristics of the style/genre of music played by musicians influence their perceptual skills and the brain processing of sound features embedded in a musical context. Musicians' brains are hence shaped by the type of training, musical style/genre, and listening experiences.	\N	\N
22415447	Although it has been well documented that the spatial inhibitory effect induced by repetition of location (i.e., spatial inhibition of return, or IOR) occurs cross-modally, we do not yet know whether nonspatial (e.g., color-based) repetition-induced inhibition occurs in a cross-modal fashion as well. In the present study, a novel cross-modal paradigm with regard to color-based repetition was adopted. An intervening neutral cue, whose semantic identity was different from those of both the prime and the target, was introduced between the prime and the target in a repetition-priming task. The modalities of the prime, the neutral cue, and the target could be either visual or auditory, and the prime and the target could refer either to the same or to different semantic identities. By adopting this paradigm, we aimed to answer two questions: (1) What are the specific conditions under which cross-modal semantic-based repetition inhibition occurs? (2) Are the representations inhibited in the semantic-based repetition inhibition effect supramodal or modality-specific? Our results suggested that semantic-based repetition inhibition occurs only when the prime and the neutral cue are from the same sensory modality, and it occurs irrespective of whether the modality of the target is cued and irrespective of whether the modality of the target is auditory or visual. Taken together, our results suggest that the occurrence of cross-modal nonspatial repetition inhibition is conditional and that the nonspatial representations inhibited by the repetition inhibition are supramodal.	\N	\N
22419678	Infants must learn to make sense of real-world auditory environments containing simultaneous and overlapping sounds. In adults, event-related potential studies have demonstrated the existence of separate preattentive memory traces for concurrent note sequences and revealed perceptual dominance for encoding of the voice with higher fundamental frequency of 2 simultaneous tones or melodies. Here, we presented 2 simultaneous streams of notes (15 semitones apart) to 7-month-old infants. On 50% of trials, either the higher or the lower note was modified by one semitone, up or down, leaving 50% standard trials. Infants showed mismatch negativity (MMN) to changes in both voices, indicating separate memory traces for each voice. Furthermore, MMN was earlier and larger for the higher voice as in adults. When in the context of a second voice, representation of the lower voice was decreased and that of the higher voice increased compared with when each voice was presented alone. Additionally, correlations between MMN amplitude and amount of weekly music listening suggest that experience affects the development of auditory memory. In sum, the ability to process simultaneous pitches and the dominance of the highest voice emerge early during infancy and are likely important for the perceptual organization of sound in realistic environments.	\N	\N
22423819	Hearing-aid wearers have reported sound source locations as being perceptually internalized (i.e., inside their head). The contribution of hearing-aid design to internalization has, however, received little attention. This experiment compared the sensitivity of hearing-impaired (HI) and normal-hearing listeners to externalization cues when listening with their own ears and simulated behind-the-ear hearing-aids in increasingly complex listening situations and reduced pinna cues. Participants rated the degree of externalization using a multiple-stimulus listening test for mixes of internalized and externalized speech stimuli presented over headphones. The results showed that HI listeners had a contracted perception of externalization correlated with high-frequency hearing loss.	\N	\N
22427328	Imaging studies in blind subjects have consistently shown that sensory and cognitive tasks evoke activity in the occipital cortex, which is normally visual. The precise areas involved and degree of activation are dependent upon the cause and age of onset of blindness. Here, we investigated the cortical language network at rest and during an auditory covert naming task in five bilaterally anophthalmic subjects, who have never received visual input. When listening to auditory definitions and covertly retrieving words, these subjects activated lateral occipital cortex bilaterally in addition to the language areas activated in sighted controls. This activity was significantly greater than that present in a control condition of listening to reversed speech. The lateral occipital cortex was also recruited into a left-lateralized resting-state network that usually comprises anterior and posterior language areas. Levels of activation to the auditory naming and reversed speech conditions did not differ in the calcarine (striate) cortex. This primary 'visual' cortex was not recruited to the left-lateralized resting-state network and showed high interhemispheric correlation of activity at rest, as is typically seen in unimodal cortical areas. In contrast, the interhemispheric correlation of resting activity in extrastriate areas was reduced in anophthalmia to the level of cortical areas that are heteromodal, such as the inferior frontal gyrus. Previous imaging studies in the congenitally blind show that primary visual cortex is activated in higher-order tasks, such as language and memory to a greater extent than during more basic sensory processing, resulting in a reversal of the normal hierarchy of functional organization across 'visual' areas. Our data do not support such a pattern of organization in anophthalmia. Instead, the patterns of activity during task and the functional connectivity at rest are consistent with the known hierarchy of processing in these areas normally seen for vision. The differences in cortical organization between bilateral anophthalmia and other forms of congenital blindness are considered to be due to the total absence of stimulation in 'visual' cortex by light or retinal activity in the former condition, and suggests development of subcortical auditory input to the geniculo-striate pathway.	\N	\N
22431327	Previous neuroimaging studies have shown that the patterns of brain activity during the processing of personally relevant names (e.g., own name, friend's name, partner's name, etc.) and the names of famous people (e.g., celebrities) are different. However, it is not known how the activity in this network is influenced by the modality of the presented stimuli. In this fMRI study, we investigated the pattern of brain activations during the recognition of aurally and visually presented full names of the subject, a significant other, a famous person and unknown individuals. In both modalities, we found that the processing of self-name and the significant other's name was associated with increased activation in the medial prefrontal cortex (MPFC). Acoustic presentations of these names also activated bilateral inferior frontal gyri (IFG). This pattern of results supports the role of MPFC in the processing of personally relevant information, irrespective of its modality.	\N	\N
22432606	Contact quotient (CQ), measured by electroglottogram (EGG), is a ratio which illustrates the duration of vocal fold contact during one vocal fold period. In the present study CQ(EGG) was calculated from a sustained vowel phonation in three different phonation types (breathy, normal, pressed) at three amplitude threshold levels (25%, 35%, 50%). CQ(EGG) values were compared with experts' perceptual evaluation of the firmness of phonation. The contact time of the vocal folds differed significantly between the different phonation types at all threshold levels (P < 0.01). Perceptual evaluation correlated best with CQ(EGG) at threshold levels 25% and 35%. The results of the linear regression model suggested that by using threshold level 25% the effects of F0 and SPL on CQ(EGG) were not significant.	\N	\N
22434397	Female as opposed to male listeners were better able to use a delayed informative cue at the end of a long sentence to report an earlier word which was disrupted by noise. Informative (semantically related) or uninformative (semantically unrelated) word cues were presented 2, 6, or 10 words after a target word whose initial phoneme had been replaced with noise. A total of 84 young adults (45 males) listened to each sentence and then repeated it after its offset. The semantic benefit effect (SBE) was the difference in the accuracy of report of the disrupted target word during informative vs. uninformative sentences. Women had significantly higher SBEs than men even though there were no significant sex differences in terms of number of non-target words reported, the effect of distance between the disrupted target word and the informative cue, or kinds of errors generated. We suggest that the superior ability of women to use delayed semantic information to decode an earlier ambiguous speech signal may be linked to women's tendency to engage the hemispheres more bilaterally than men during word processing. Since the maintenance of semantic context under ambiguous conditions demands more right than left hemispheric resources, this may give women an advantage.	\N	\N
22454230	Vibrant Soundbridge (VSB) has been recommended for both adults and children with all types of hearing loss. The aim of this study was to evaluate the objective and subjective benefits with VSB and the difference in benefits for patients with different types of hearing loss. A retrospective case review was conducted on seven consecutive patients who had received VSB implantations at the National University Hospital of Singapore from March 2006 to November 2009. Patients were divided into the Sensorineural Hearing Loss (SNHL) Group and Conductive Hearing Loss (CHL)/Mixed Hearing Loss (MHL) Group. Surgical complications were evaluated, and objective and subjective results were compared between the two groups. No major complications were observed during the follow-up of up to 4 years. Greater objective and subjective benefits were observed in the CHL/MHL Group. Subjective benefits were consistent with objective improvements. Pre-operative counseling for realistic expectations is important, especially for patients with SNHL.	\N	\N
22459559	This study examined the role of modality in correct recognition and misinformation acceptance in a naturalistic event cognition task that reflected an everyday life sequence of events. Participants heard, observed or acted out a sequence of events and were tested on memory for these events after being presented with an accurate description of the events or a description containing misinformation. The results indicated that recognition of unaltered information was higher in the enactment condition than the auditory or visual conditions and that this effect persisted over time. Misinformation acceptance for the immediate recognition test was lowest in the auditory condition but this advantage disappeared over time. Modality congruence of the auditory condition with the modality in which misinformation was presented and different retrieval processes underlying recognition of altered and unaltered information may explain these findings.	\N	\N
22463939	This case study describes a 45-yr-old female with bilateral, profound sensorineural hearing loss due to Ménière's disease. She received her first cochlear implant in the right ear in 2008 and the second cochlear implant in the left ear in 2010. The case study examines the enhancement to speech recognition, particularly in noise, provided by bilateral cochlear implants. Speech recognition tests were administered prior to obtaining the second implant and at a number of test intervals following activation of the second device. Speech recognition in quiet and noise as well as localization abilities were assessed in several conditions to determine bilateral benefit and performance differences between ears. The results of the speech recognition testing indicated a substantial improvement in the patient's ability to understand speech in noise and her ability to localize sound when using bilateral cochlear implants compared to using a unilateral implant or an implant and a hearing aid. In addition, the patient reported considerable improvement in her ability to communicate in daily life when using bilateral implants versus a unilateral implant. This case suggests that cochlear implantation is a viable option for patients who have lost their hearing to Ménière's disease even when a number of medical treatments and surgical interventions have been performed to control vertigo. In the case presented, bilateral cochlear implantation was necessary for this patient to communicate successfully at home and at work.	\N	\N
22465324	Nonverbal emotional vocalizations are one of the most elementary ways of communicating in humans. We examined the impact of sex differences on neural responses to laughter and crying produced by the same and opposite sex. Thirty subjects (15 women) underwent functional magnetic resonance imaging during a sex identification task for laughter, crying, and neutral voices. The parahippocampal gyrus was involved in both men and women while hearing laughter of the same sex, suggesting greater positive emotional processing and greater attention toward emotional context in response to laughter of the same sex than of the opposite sex. The posterior cingulate was involved in both men and women while hearing crying of the opposite sex, suggesting that empathic processing may occur more in response to crying of the opposite sex than of the same sex. Furthermore, brain responses to crying of the opposite sex seem to reflect upon men's efforts to perform emotional regulation and women's empathic concerns.	\N	\N
22465475	Neglect is a neurological syndrome characterised by a lack of conscious perception of events localised in the contralesional side of space. Here, we consider the possible multisensory nature of this disorder, critically reviewing the literature devoted to multisensory manifestations and processing in neglect. Although its most striking manifestations have been observed in the visual domain, a number of studies demonstrate that neglect can affect virtually any sensory modality, in particular touch and audition. Furthermore, a few recent studies have reported a correlation in severity between visual and non-visual neglect-related deficits evaluated in the same patients, providing some preliminary support for a multisensory conception of neglect. Sensory stimulation and sensorimotor adaptation techniques, aimed at alleviating neglect, have also been shown to affect several sensory modalities, including some that were not directly affected by the intervention. Finally, in some cases neglect can bias multisensory interactions known to occur in healthy individuals, leading to abnormal behaviour or uncovering multisensory compensation mechanisms. This evidence, together with neurophysiological and neuroimaging data revealing the multisensory role played by the areas that are most commonly damaged in neglect patients, seems to speak in favour of neglect as a multisensory disorder. However, since most previous studies were not conducted with the specific purpose of systematically investigating the multisensory nature of neglect, we conclude that more research is needed to appropriately assess this question, and suggest some methodological guidelines that we hope will help clarify this issue. At present, the conception of neglect as a multisensory disorder remains a promising working hypothesis that may help define the pathophysiology of this syndrome.	\N	\N
22476724	That auditory perceptual training may alleviate tinnitus draws on two observations: (1) tinnitus probably arises from altered activity within the central auditory system following hearing loss and (2) sound-based training can change central auditory activity. Training that provides sound enrichment across hearing loss frequencies has therefore been hypothesised to alleviate tinnitus. We tested this prediction with two randomised trials of frequency discrimination training involving a total of 70 participants with chronic subjective tinnitus. Participants trained on either (1) a pure-tone standard at a frequency within their region of normal hearing, (2) a pure-tone standard within the region of hearing loss or (3) a high-pass harmonic complex tone spanning a region of hearing loss. Analysis of the primary outcome measure revealed an overall reduction in self-reported tinnitus handicap after training that was maintained at a 1-month follow-up assessment, but there were no significant differences between groups. Secondary analyses also report the effects of different domains of tinnitus handicap on the psychoacoustical characteristics of the tinnitus percept (sensation level, bandwidth and pitch) and on duration of training. Our overall findings and conclusions cast doubt on the superiority of a purely acoustic mechanism to underpin tinnitus remediation. Rather, the nonspecific patterns of improvement are more suggestive that auditory perceptual training has an impact on a contributory mechanism such as selective attention or emotional state.	\N	\N
22480025	In grammar books, the various functions of and as phrasal coordinator and clausal conjunction are treated as standard knowledge. In addition, studies on the uses of and in everyday talk-in-interaction have described its discourse-organizational functions on a more global level. In the phonetic literature, in turn, a range of phonetic forms of and have been listed. Yet, so far few studies have related the phonetic features of and to its function. This contribution surveys a range of phonetic forms of and in a corpus of private American English telephone conversations. It shows that the use of forms such as [ænd], [εn], or [en], among others, is not random but, in essence, correlates with the syntactic-pragmatic scope of and and the cognitive closeness of the items the and connects. This, in turn, allows the phonetic design of and to contribute to the organization of turn-taking. The findings presented are based on conversation-analytic and interactional-linguistic methodology, which includes quantitative analyses.	\N	\N
22480027	This paper investigates hearers' use of response tokens (back-channels) in maintaining and differentiating their actions. Initial observations suggest that hearers produce a sequence of phonetically similar responses to disengage from the current topic, and dissimilar responses to engage with the current topic. This is studied systematically by combining detailed interactional and phonetic analysis in a collection of naturally-occurring talk in Norwegian. The interactional analysis forms the basis for labeling actions as maintained ('doing the same') and differentiated ('NOT doing the same'), which is then used as a basis for phonetic analysis. The phonetic analysis shows that certain phonetic characteristics, including pitch, loudness, voice quality and articulatory characteristics, are associated with 'doing the same', as different from 'NOT doing the same'. Interactional analysis gives further evidence of how this differentiation is of systematic relevance in the negotiations of a next turn. This paper addresses phonetic variation and variability by focusing on the relationship between sequence and phonetics in the turn-by-turn development of meaning. This has important implications for linguistic/phonetic research, and for the study of back-channels.	\N	\N
22492193	The auditory system codes spatial locations in a way that deviates from the spatial representations found in other modalities. This difference is especially striking in the cortex, where neurons form topographical maps of visual and tactile space but where auditory space is represented through a population rate code. In this hemifield code, sound source location is represented in the activity of two widely tuned opponent populations, one tuned to the right and the other to the left side of auditory space. Scientists are only beginning to uncover how this coding strategy adapts to various spatial processing demands. This review presents the current understanding of auditory spatial processing in the cortex. To this end, the authors consider how various implementations of the hemifield code may exist within the auditory cortex and how these may be modulated by the stimulation and task context. As a result, a coherent set of neural strategies for auditory spatial processing emerges.	\N	\N
22500627	Behavior varies from trial to trial even when the stimulus is maintained as constant as possible. In many models, this variability is attributed to noise in the brain. Here, we propose that there is another major source of variability: suboptimal inference. Importantly, we argue that in most tasks of interest, and particularly complex ones, suboptimal inference is likely to be the dominant component of behavioral variability. This perspective explains a variety of intriguing observations, including why variability appears to be larger on the sensory than on the motor side, and why our sensors are sometimes surprisingly unreliable.	\N	\N
22501070	Judgments of whether a sinusoidal probe is higher or lower in frequency than the closest partial ("target") in a multi-partial complex are improved when the target is pulsed on and off. These experiments explored the contribution of reduction in perceptual confusion and recovery from adaptation to this effect. In experiment 1, all partials except the target were replaced by noise to reduce perceptual confusion. Performance was much better than when the background was composed of multiple partials. When the level of the target was reduced to avoid ceiling effects, no effect of pulsing the target occurred. In experiment 2, the target and background partials were irregularly and independently amplitude modulated. This gave a large effect of pulsing the target, suggesting that if recovery from adaptation contributes to the effect, amplitude fluctuations do not prevent this. In experiment 3, the background was composed of multiple steady partials, but the target was irregularly amplitude modulated. This gave better performance than when the target was unmodulated and a moderate effect of pulsing the target. It is argued that when the target and background are steady tones, pulsing the target may result both in reduction of perceptual confusion and recovery from adaptation.	\N	\N
22501078	Research on children's speech perception and production suggests that consonant voicing and place contrasts may be acquired early in life, at least in word-onset position. However, little is known about the development of the acoustic correlates of later-acquired, word-final coda contrasts. This is of particular interest in languages like English where many grammatical morphemes are realized as codas. This study therefore examined how various non-spectral acoustic cues vary as a function of stop coda voicing (voiced vs. voiceless) and place (alveolar vs. velar) in the spontaneous speech of 6 American-English-speaking mother-child dyads. The results indicate that children as young as 1;6 exhibited many adult-like acoustic cues to voicing and place contrasts, including longer vowels and more frequent use of voice bar with voiced codas, and a greater number of bursts and longer post-release noise for velar codas. However, 1;6-year-olds overall exhibited longer durations and more frequent occurrence of these cues compared to mothers, with decreasing values by 2;6. Thus, English-speaking 1;6-year-olds already exhibit adult-like use of some of the cues to coda voicing and place, though implementation is not yet fully adult-like. Physiological and contextual correlates of these findings are discussed.	\N	\N
22501083	This study tested the hypothesis that the reduction in spatial release from masking (SRM) resulting from sensorineural hearing loss in competing speech mixtures is influenced by the characteristics of the interfering speech. A frontal speech target was presented simultaneously with two intelligible or two time-reversed (unintelligible) speech maskers that were either colocated with the target or were symmetrically separated from the target in the horizontal plane. The difference in SRM between listeners with hearing impairment and listeners with normal hearing was substantially larger for the forward maskers (deficit of 5.8 dB) than for the reversed maskers (deficit of 1.6 dB). This was driven by the fact that all listeners, regardless of hearing abilities, performed similarly (and poorly) in the colocated condition with intelligible maskers. The same conditions were then tested in listeners with normal hearing using headphone stimuli that were degraded by noise vocoding. Reducing the number of available spectral channels systematically reduced the measured SRM, and again, more so for forward (reduction of 3.8 dB) than for reversed speech maskers (reduction of 1.8 dB). The results suggest that non-spatial factors can strongly influence both the magnitude of SRM and the apparent deficit in SRM for listeners with impaired hearing.	\N	\N
22511719	Monkeys can easily form lasting central representations of visual and tactile stimuli, yet they seem unable to do the same with sounds. Humans, by contrast, are highly proficient in auditory long-term memory (LTM). These mnemonic differences within and between species raise the question of whether the human ability is supported in some way by speech and language, e.g., through subvocal reproduction of speech sounds and by covert verbal labeling of environmental stimuli. If so, the explanation could be that storing rapidly fluctuating acoustic signals requires assistance from the motor system, which is uniquely organized to chain-link rapid sequences. To test this hypothesis, we compared the ability of normal participants to recognize lists of stimuli that can be easily reproduced, labeled, or both (pseudowords, nonverbal sounds, and words, respectively) versus their ability to recognize a list of stimuli that can be reproduced or labeled only with great difficulty (reversed words, i.e., words played backward). Recognition scores after 5-min delays filled with articulatory-suppression tasks were relatively high (75-80% correct) for all sound types except reversed words; the latter yielded scores that were not far above chance (58% correct), even though these stimuli were discriminated nearly perfectly when presented as reversed-word pairs at short intrapair intervals. The combined results provide preliminary support for the hypothesis that participation of the oromotor system may be essential for laying down the memory of speech sounds and, indeed, that speech and auditory memory may be so critically dependent on each other that they had to coevolve.	\N	\N
22516238	Over the years, a large body of work on the brain basis of language comprehension has accumulated, paving the way for the formulation of a comprehensive model. The model proposed here describes the functional neuroanatomy of the different processing steps from auditory perception to comprehension as located in different gray matter brain regions. It also specifies the information flow between these regions, taking into account white matter fiber tract connections. Bottom-up, input-driven processes proceeding from the auditory cortex to the anterior superior temporal cortex and from there to the prefrontal cortex, as well as top-down, controlled and predictive processes from the prefrontal cortex back to the temporal cortex are proposed to constitute the cortical language circuit.	\N	\N
22516315	The primary goal of this study was to evaluate a nonlinear dynamic approach to the acoustic analysis of dysphonia associated with vocal fold scar and sulcus vocalis. Case-control study. Acoustic voice samples from scar/sulcus patients and age-/sex-matched controls were analyzed using correlation dimension (D2) and phase plots, time-domain based perturbation indices (jitter, shimmer, signal-to-noise ratio [SNR]), and an auditory-perceptual rating scheme. Signal typing was performed to identify samples with bifurcations and aperiodicity. Type 2 and 3 acoustic signals were highly represented in the scar/sulcus patient group. When data were analyzed irrespective of signal type, all perceptual and acoustic indices successfully distinguished scar/sulcus patients from controls. Removal of type 2 and 3 signals eliminated the previously identified differences between experimental groups for all acoustic indices except D2. The strongest perceptual-acoustic correlation in our data set was observed for SNR and the weakest correlation was observed for D2. These findings suggest that D2 is inferior to time-domain based perturbation measures for the analysis of dysphonia associated with scar/sulcus; however, time-domain based algorithms are inherently susceptible to inflation under highly aperiodic (ie, type 2 and 3) signal conditions. Auditory-perceptual analysis, unhindered by signal aperiodicity, is therefore a robust strategy for distinguishing scar/sulcus patient voices from normal voices. Future acoustic analysis research in this area should consider alternative (e.g., frequency- and quefrency-domain based) measures alongside additional nonlinear approaches.	\N	\N
22522205	There is ample evidence that individuals with dyslexia have a phonological deficit. A growing body of research also suggests that individuals with dyslexia have problems with categorical perception, as evidenced by weaker discrimination of between-category differences and better discrimination of within-category differences compared to average readers. Whether the categorical perception problems of individuals with dyslexia are a result of their reading problems or a cause has yet to be determined. Whether the observed perception deficit relates to a more general auditory deficit or is specific to speech also has yet to be determined. To shed more light on these issues, the categorical perception abilities of children at risk for dyslexia and chronological age controls were investigated before and after the onset of formal reading instruction in a longitudinal study. Both identification and discrimination data were collected using identical paradigms for speech and non-speech stimuli. Results showed the children at risk for dyslexia to shift from an allophonic mode of perception in kindergarten to a phonemic mode of perception in first grade, while the control group showed a phonemic mode already in kindergarten. The children at risk for dyslexia thus showed an allophonic perception deficit in kindergarten, which was later suppressed by phonemic perception as a result of formal reading instruction in first grade; allophonic perception in kindergarten can thus be treated as a clinical marker for the possibility of later reading problems.	\N	\N
22524348	The present manuscript summarizes and discusses the implications of recent neuroimaging studies, which have investigated the relationship between musical expertise and structural, as well as functional, changes in an auditory-related association cortex, namely, the planum temporale (PT). Since the bilateral PT is known to serve as a spectrotemporal processor that supports perception of acoustic modulations in both speech and music, it comes as no surprise that musical expertise corresponds to functional sensitivity and neuroanatomical changes in cortical architecture. In this context, we focus on the following question: To what extent does musical expertise affect the functioning of the left and right plana temporalia? We discuss the relationship between behavioral, hemodynamic, and neuroanatomical data obtained from musicians in light of maturational and developmental issues. In particular, we introduce two studies of our group that show to what extent brains of musicians are more proficient in phonetic task performance.	\N	\N
22524375	Autism spectrum disorder (ASD) is a complex neurodevelopmental condition characterized by atypical social and communication skills, repetitive behaviors, and atypical visual and auditory perception. Studies in vision have reported enhanced detailed ("local") processing but diminished holistic ("global") processing of visual features in ASD. Individuals with ASD also show enhanced processing of simple visual stimuli but diminished processing of complex visual stimuli. Relative to the visual domain, auditory global-local distinctions, and the effects of stimulus complexity on auditory processing in ASD, are less clear. However, one remarkable finding is that many individuals with ASD have enhanced musical abilities, such as superior pitch processing. This review provides a critical evaluation of behavioral and brain imaging studies of auditory processing with respect to current theories in ASD. We have focused on auditory-musical processing in terms of global versus local processing and simple versus complex sound processing. This review contributes to a better understanding of auditory processing differences in ASD. A deeper comprehension of sensory perception in ASD is key to better defining ASD phenotypes and, in turn, may lead to better interventions.	\N	\N
22529921	Overall success of current tinnitus therapies is low, which may be due to the heterogeneity of tinnitus patients. Therefore, subclassification of tinnitus patients is expected to improve therapeutic allocation, which, in turn, is hoped to improve therapeutic success for the individual patient. The present study aims to define factors that differentially influence subjectively perceived tinnitus loudness and tinnitus-related distress. In a questionnaire-based cross-sectional survey, the data of 4705 individuals with tinnitus were analyzed. The self-report questionnaire contained items about subjective tinnitus loudness, type of onset, awareness and localization of the tinnitus, hearing impairment, chronic comorbidities, sleep quality, and psychometrically validated questionnaires addressing tinnitus-related distress, depressivity, anxiety, and somatic symptom severity. In a binary step-wise logistic regression model, we tested the predictive power of these variables on subjective tinnitus loudness and tinnitus-related distress. The present data contribute to the distinction between subjective tinnitus loudness and tinnitus-related distress. Whereas subjective loudness was associated with permanent awareness and binaural localization of the tinnitus, tinnitus-related distress was associated with depressivity, anxiety, and somatic symptom severity. Subjective tinnitus loudness and the potential presence of severe depressivity, anxiety, and somatic symptom severity should be assessed separately from tinnitus-related distress. If loud tinnitus is the major complaint together with mild or moderate tinnitus-related distress, therapies should focus on auditory perception. If levels of depressivity, anxiety or somatic symptom severity are severe, therapies and further diagnosis should focus on these symptoms at first.	\N	\N
22530620	The effortfulness hypothesis implies that difficulty in decoding the surface form, as in the case of age-related sensory limitations or background noise, consumes the attentional resources that are then unavailable for semantic integration in language comprehension. Because ageing is associated with sensory declines, degrading of the surface form by a noisy background can pose an extra challenge for older adults. In two experiments, this hypothesis was tested in a self-paced moving window paradigm in which younger and older readers' online allocation of attentional resources to surface decoding and semantic integration was measured as they read sentences embedded in varying levels of visual noise. When visual noise was moderate (Experiment 1), resource allocation among young adults was unaffected but older adults allocated more resources to decode the surface form at the cost of resources that would otherwise be available for semantic processing; when visual noise was relatively intense (Experiment 2), both younger and older participants allocated more attention to the surface form and less attention to semantic processing. The decrease in attentional allocation to semantic integration resulted in reduced recall of core ideas in both experiments, suggesting that a less organized semantic representation was constructed in noise. The greater vulnerability of older adults at relatively low levels of noise is consistent with the effortfulness hypothesis.	\N	\N
22533977	For pediatric cochlear implant (CI) users, CI processor technology, map characteristics, and fitting strategies are known to have a substantial impact on speech perception scores at young ages. It is unknown whether these benefits continue over time as these children reach adolescence. To document changes in CI technology, map characteristics, and speech perception scores in children between elementary grades and high school, and to describe relations between map characteristics and speech perception scores over time. A longitudinal design with participants 8-9-yr-old at session 1 and 15-18-yr-old at session 2. Participants were 82 adolescents with unilateral CIs, who are a subset of a larger longitudinal study. Mean age at implantation was 3.4 yr (range: 1.7-5.4), and mean duration of device use was 5.5 yr (range: 3.8-7.5) at session 1 and 13.3 yr (range: 10.9-15) at session 2. Speech perception tests at sessions 1 and 2 were the Lexical Neighborhood Test (LNT) presented at 70 dB SPL (LNT-70) and Bamford-Kowal-Bench sentences in quiet (BKB-Q) presented at 70 dB SPL. At session 2, the LNT was also administered at 50 dB SPL (LNT-50), and BKB sentences were administered in noise with a +10 dB SNR (BKB-N). CI processor technology type and CI map characteristics (coding strategy, number of electrodes, threshold levels, and comfort levels) were obtained at both sessions. Electrical dynamic range was computed, and descriptive statistics, correlations, and repeated-measures ANOVAs were employed. Participants achieved significantly higher LNT and BKB scores, at 70 dB SPL, at ages 15-18 than at ages 8-9 yr. Forty-two participants had 1-3 electrodes either activated or deactivated in their map between test sessions, and 40 had no change in number of active electrodes (mean change: -0.5; range: -3 to +2). After conversion from arbitrary clinical map units to charge-per-phase in nanocoulombs (nC), no significant difference was found for T levels across time. 
Average comfort levels (C levels) decreased by 19 nC. Seventy-three participants (89%) upgraded their CI processor technology type. At both sessions, significant correlations were found between electrical dynamic range (EDR) and all speech perception measures except LNT-50 (r range: .31 to .47; p < 0.01). Similarly, significant correlations were also found between C levels and all speech perception measures (r range: .29 to .49; p < 0.01). At session 2, a significant correlation was found between processor technology type and the LNT-50 scores (r = .38; p < 0.01). Significant improvement in speech scores was observed between elementary grades and high school for children who had used a CI since preschool. On average, T levels (nC) and electrode function remained stable for these long-term pediatric users. Analyses of maps did not allow for the determination of the exact cause of C level reductions, though power limitations in new processor systems and changes in perceived loudness over time are possible. Larger EDRs and higher C levels were associated with better speech scores. Newer speech processor technology was associated with better speech scores at a softer level.	\N	\N
22542616	Individuals with schizophrenia (SZ) have deviations in auditory perception perhaps attributable to altered neural oscillatory response properties in thalamo-cortical and/or local cortico-cortical circuits. Previous EEG studies of auditory steady-state responses (aSSRs; a measure of sustained neuronal entrainment to repetitive stimulation) in SZ have indicated attenuated gamma range (≈40 Hz) neural entrainment. Stimuli in most such studies have been relatively brief (500-1000 ms) trains of 1 ms clicks or amplitude modulated pure tones (1000 Hz) with short, fixed interstimulus intervals (200-1000 ms). The current study used extended (1500 ms), more aurally dense broadband stimuli (500-4000 Hz noise; previously demonstrated to elicit larger aSSRs) with longer, variable interstimulus intervals (2700-3300 ms). Dense array EEG (256 sensor) was collected while 17 SZ and 16 healthy subjects passively listened to stimuli modulated at 15 different frequencies spanning beta and gamma ranges (16-44 Hz in 2 Hz steps). Results indicate that SZ have augmented aSSRs that were most extreme in the gamma range. Results also constructively replicate previous findings of attenuated low frequency auditory evoked responses (2-8 Hz) in SZ. These findings (i) highlight differential characteristics of low versus high frequency and induced versus entrained oscillatory auditory responses in both SZ and healthy stimulus processing, (ii) provide support for an NMDA-receptor hypofunction-based pharmacological model of SZ, and (iii) report a novel pattern of aSSR abnormalities suggesting that gamma band neural entrainment deviations among SZ may be more complex than previously supposed, including possibly being substantially influenced by physical stimulus properties.	\N	\N
22546730	Pseudoneglect is a normal left sided spatial bias observed with attempted bisections of horizontal lines and a normal upward bias observed with attempted bisections of vertical lines. Horizontal pseudoneglect has been attributed to right hemispheric dominance for the allocation of attention. The goal of this study was to test the hypothesis that the upward bias in vertical line bisection may also relate to right hemispheric dominance for the allocation of attention and/or action-intention. Twenty right handed healthy adults were asked to bisect vertical lines presented in the midsagittal plane (center space) and in sagittal planes to the left and right of the midsagittal plane (left and right hemispace) when using a pen held in either the right or left hand. Vertical line bisections were biased upward in all three sagittal planes and higher in left than right hemispace. However, bisections made with the left hand were lower than those made with the right hand. Whereas these results suggest a left hemispace-right hemispheric visuospatial attentional upward bias and a relative left hemispheric-right hand upward action-intentional bias, further studies are needed to document this intentional versus attentional bias and to understand the brain mechanisms that produce these biases.	\N	\N
22553024	The temporal context of an acoustic signal can greatly influence its perception. The present study investigated the neural correlates underlying perceptual facilitation by regular temporal contexts in humans. Participants listened to temporally regular (periodic) or temporally irregular (nonperiodic) sequences of tones while performing an intensity discrimination task. Participants performed significantly better on intensity discrimination during periodic than nonperiodic tone sequences. There was greater activation in the putamen for periodic than nonperiodic sequences. Conversely, there was greater activation in bilateral primary and secondary auditory cortices (planum polare and planum temporale) for nonperiodic than periodic sequences. Across individuals, greater putamen activation correlated with lesser auditory cortical activation in both right and left hemispheres. These findings suggest that temporal regularity is detected in the putamen, and that such detection facilitates temporal-lobe cortical processing associated with superior auditory perception. Thus, this study reveals a corticostriatal system associated with contextual facilitation for auditory perception through temporal regularity processing.	\N	\N
22553042	Signal duration is important for identifying sound sources and determining signal meaning. Duration-tuned neurons (DTNs) respond preferentially to a range of stimulus durations and maximally to a best duration (BD). Duration-tuned neurons are found in the auditory midbrain of many vertebrates, although studied most extensively in bats. Studies of DTNs across vertebrates have identified cells with BDs and temporal response bandwidths that mirror the range of species-specific vocalizations. Neural tuning to stimulus duration appears to be universal among hearing vertebrates. Herein, we test the hypothesis that neural mechanisms underlying duration selectivity may be similar across vertebrates. We instantiated theoretical mechanisms of duration tuning in computational models to systematically explore the roles of excitatory and inhibitory receptor strengths, input latencies, and membrane time constant on duration tuning response profiles. We demonstrate that models of duration tuning with similar neural circuitry can be tuned with species-specific parameters to reproduce the responses of in vivo DTNs from the auditory midbrain. To relate and validate model output to in vivo responses, we collected electrophysiological data from the inferior colliculus of the awake big brown bat, Eptesicus fuscus, and present similar in vivo data from the published literature on DTNs in rats, mice, and frogs. Our results support the hypothesis that neural mechanisms of duration tuning may be shared across vertebrates despite species-specific differences in duration selectivity. Finally, we discuss how the underlying mechanisms of duration selectivity relate to other auditory feature detectors arising from the interaction of neural excitation and inhibition.	\N	\N
22555987	Although it is well known that cisplatin is associated with ototoxicity, there is still a lack of knowledge concerning the ototoxicity of cisplatin, especially in Japanese head and neck cancer patients. The objectives of this study were to determine the incidence rate of cisplatin ototoxicity and to determine the threshold dose causing ototoxicity in the Japanese population. Before-and-after study in a tertiary referral hospital. The distortion product otoacoustic emission (DPOAE) was measured 1 week after each administration of cisplatin in 44 Japanese head and neck cancer patients treated at Kyoto University Hospital. We determined the incidence and threshold dose of cisplatin ototoxicity according to DPOAE data. The incidence of ototoxicity detected by DPOAE was 77.3%. The average DPOAE value was significantly lower in patients who received more than 200 mg/m(2) cisplatin than the baseline DPOAE value. The threshold dose for cisplatin ototoxicity was lower in Japanese patients than in European patients. Our data suggest that Japanese patients are more susceptible to cisplatin-induced ototoxicity. This is presumably caused by a genetic difference.	\N	\N
22559374	Previous studies investigating sensitivity to step changes in tempo and prediction of tone onset time have generally utilized isochronous sequences. This study investigates subjects' ability to detect deviations from a gradual change in the tempo of a tone sequence (experiment 1) and their judgment of the perceptually optimal timing of this tone (experiment 2). In experiment 1, inter-onset-intervals within pairs of eight-tone sequences followed a geometric progression to create a gradual tempo change. In one sequence, the final tone was presented either earlier or later than specified by the progression. Subjects performed well at detecting deviations that exaggerated the tempo progression but poorly when it was counteracted. Experiment 2 used similar pairs except that the final tone was always presented earlier in one sequence than the other. Final interval length was adaptively adjusted to subjects' judgments; it was adjudged in best agreement with the progression when its length was roughly half way between the mathematically correct value and the length of the penultimate interval. The data support "multiple-look" and entrainment models of tempo sensitivity and suggest that temporal prediction is based less on the tempo contour of a whole sequence than on the duration of the preceding interval.	\N	\N
22559382	Recent evidence suggests that spectral change, as measured by cochlea-scaled entropy (CSE), predicts speech intelligibility better than the information carried by vowels or consonants in sentences. Motivated by this finding, the present study investigates whether intelligibility indices implemented to include segments marked with significant spectral change better predict speech intelligibility in noise than measures that include all phonetic segments paying no attention to vowels/consonants or spectral change. The prediction of two intelligibility measures [normalized covariance measure (NCM), coherence-based speech intelligibility index (CSII)] is investigated using three sentence-segmentation methods: relative root-mean-square (RMS) levels, CSE, and traditional phonetic segmentation of obstruents and sonorants. While the CSE method makes no distinction between spectral changes occurring within vowels/consonants, the RMS-level segmentation method places more emphasis on the vowel-consonant boundaries wherein the spectral change is often most prominent, and perhaps most robust, in the presence of noise. Higher correlation with intelligibility scores was obtained when including sentence segments containing a large number of consonant-vowel boundaries than when including segments with highest entropy or segments based on obstruent/sonorant classification. These data suggest that in the context of intelligibility measures the type of spectral change captured by the measure is important.	\N	\N
22561890	The basic deficits underlying the severe and persistent reading difficulties in dyslexia are still highly debated. One of the major topics of debate is whether these deficits are language specific, or affect both verbal and non-verbal stimuli. Recently, Ahissar and colleagues proposed the "anchoring-deficit hypothesis" (Ahissar, Lubin, Putter-Katz, & Banai, 2006), which suggests that dyslexics have a general difficulty in automatic extraction of stimulus regularities from auditory inputs. This hypothesis explained a broad range of dyslexics' verbal and non-verbal difficulties. However, it was not directly tested in the context of reading and verbal memory, which poses the main stumbling blocks to dyslexics. Here we assessed the abilities of adult dyslexics to efficiently benefit from ("anchor to") regularities embedded in repeated tones, orally presented syllables, and written words. We also compared dyslexics' performance to that of individuals with attention disorder (ADHD), but no reading disability. We found an anchoring effect in all groups: all gained from stimulus repetition. However, in line with the anchoring-deficit hypothesis, controls and ADHD participants showed a significantly larger anchoring effect in all tasks. This study is the first that directly shows that the same domain-general deficit, poor anchoring, characterizes dyslexics' performance in perceptual, working memory and reading tasks.	\N	\N
22562828	The evidence of a deficit in working memory in specific language impairment (SLI) is of sufficient magnitude to suggest a primary role in developmental language disorder. However, little research has investigated memory in late talkers who recover from their early delay. Drawing on a longitudinal, community sample, this study compared the memory profiles of 3 groups of 5-year-olds: children with SLI who had been identified as late talkers, resolved late talkers (RLTs), and children with typical language development (TLD). Participants were 25 children with SLI, 45 RLTs, and 32 children with TLD. Subtests from the Working Memory Test Battery for Children and the Children's Memory Scale plus recalling sentences and nonword repetition tasks were administered to test the components of Baddeley's working memory model. The SLI group showed significantly poorer performance than the RLT and TLD groups on measures of the phonological loop and episodic buffer. The RLT and TLD groups scored similarly on all memory measures. The results support previous findings that sentence recall and nonword repetition are markers of SLI. Although residual effects of late-talking status may emerge over time, RLTs do not necessarily show memory deficits at 5 years of age despite delayed early vocabulary development.	\N	\N
22564904	This investigation examined the effect of repeated exposure to novel and repeated spoken words in typical environments on the intelligibility of 2 synthesized voices and human recorded speech in preschools. Eighteen preschoolers listened to and repeated single words presented in human-recorded speech, DECtalk Paul, and AT&T Voice Michael during 5 experimental sessions. Stimuli consisted of repeated and novel words presented in each speech output condition during each session. Sessions took place in the presence of typically occurring noise in classroom or home settings. There was a significant main effect for voice as participants accurately identified significantly more words in the human-recorded speech and AT&T Voice than in the DECtalk speech output condition. When averaged across speech output conditions, children increased their accuracy as they participated in additional sessions. There was a statistically significant interaction between session and voice. DECtalk had a slightly larger effect of session than did AT&T Voice and human-recorded speech.	\N	\N
22568633	Elevated levels of hypoxia-inducible factor 1α (HIF-1α) in middle ear effusion may play an important role in the pathogenesis of bone conduction impairment associated with otitis media with effusion (OME). The mechanism may be related to the up-regulation of nitric oxide (NO) expression. This study was undertaken to investigate the role of HIF-1α in the pathogenesis of sensorineural hearing loss associated with OME. One hundred and eight OME patients were divided into two groups: OME without bone conduction impairment (group 1) and OME with bone conduction impairment (group 2). The levels of HIF-1α, NO, and quinolinic acid (QUIN) in the middle ear effusion and serum of these patients were investigated. The relationship between these factors and the bone conduction threshold (BCT) differences were analyzed. The levels of HIF-1α and NO concentrations in the middle ear effusion were found to be significantly higher in group 2 than in group 1 (both p < 0.05). The OME patients' BCT differences at 4000 Hz were correlated with the levels of HIF-1α and the NO concentrations in the middle ear effusion. Furthermore, the HIF-1α levels were correlated with the levels of NO but not with the levels of QUIN in the effusion.	\N	\N
22568993	Both 80 Hz auditory steady state responses (ASSRs) and tone burst auditory brainstem responses (ABRs) have been shown to provide reasonable estimates of the behavioral thresholds. Although ASSRs provide statistically objective estimates that can be easily automated by computers, they present no information for the neurophysiological interpretation of the results. ABRs, on the other hand, do not provide easily automated information and usually need expert interpretation of the recorded waveforms. A recently developed continuous loop averaging deconvolution algorithm offers an alternative solution by acquiring slightly jittered 80 Hz quasi auditory steady state responses (QASSRs), thus enabling the acquisition of both recordings simultaneously. The purpose of this study is to investigate a specially developed 80 Hz QASSR paradigm for simultaneous acquisition for both responses for threshold detection purposes. Sixteen ears from eight adults with normal hearing were tested. Amplitude modulated QASSRs were obtained using slightly jittered temporal sequences of tone bursts presented at a mean rate of 78.125 Hz. Four carrier frequencies (500, 1000, 2000, and 4000 Hz) at several stimulus intensity levels were monaurally presented and QASSRs to 128 sweeps blocks were recorded. The ABRs were extracted using the CLAD algorithm. Wave V was visually identified and analyzed in the time domain as in everyday clinical practice. In addition, statistically objective ƒMP computation method was used to automatically detect ABR threshold as well. The QASSRs were analyzed in the frequency domain and magnitudes, phase delays, and thresholds were obtained. Phasor (polar plot) diagrams were constructed. QASSR and ABR hearing thresholds were obtained and compared with behavioral thresholds. Study reveals that the QASSR method provides accurate objective estimation of the audiometric thresholds from extracted ASSRs and latency/amplitude information from extracted ABRs. 
The largest mean threshold difference for QASSR was within 5 dB for all carrier frequencies including 500 Hz. For auditory threshold estimation in adults with normal hearing, the Hotelling's T-Square test in four dimensions in the frequency domain was more accurate than the ƒMP or visual ABR threshold detection in the time domain. Simultaneously recorded ASSR and ABR from QASSRs provide an accurate and effective method for frequency-specific hearing threshold estimation with neurophysiological information in adults with normal hearing. Further research is required for hearing-impaired adults, newborns, and infants.	\N	\N
22570723	Multisensory learning and resulting neural brain plasticity have recently become a topic of renewed interest in human cognitive neuroscience. Music notation reading is an ideal stimulus to study multisensory learning, as it allows studying the integration of visual, auditory and sensorimotor information processing. The present study aimed at answering whether multisensory learning alters uni-sensory structures, interconnections of uni-sensory structures or specific multisensory areas. In a short-term piano training procedure musically naive subjects were trained to play tone sequences from visually presented patterns in a music notation-like system [Auditory-Visual-Somatosensory group (AVS)], while another group received audio-visual training only that involved viewing the patterns and attentively listening to the recordings of the AVS training sessions [Auditory-Visual group (AV)]. Training-related changes in cortical networks were assessed by pre- and post-training magnetoencephalographic (MEG) recordings of an auditory, a visual and an integrated audio-visual mismatch negativity (MMN). The two groups (AVS and AV) were differently affected by the training. The results suggest that multisensory training alters the function of multisensory structures, and not the uni-sensory ones along with their interconnections, and thus provide an answer to an important question presented by cognitive models of multisensory training.	\N	\N
22571383	Previous studies of source monitoring and auditory hallucinations (AH) have often conflated spatial source (internal-external) with source agency (self-other). Other studies have used suboptimal manipulations of auditory space (e.g., imagine saying vs. saying aloud). We avoided these problems by presenting experimenter-generated stimuli over headphones in the voice of another person so that the location of the voice sounded either internal or external to the participant's head. Participants (N=121) studied 96 words and indicated for each whether it was presented internally or externally (online spatial source monitoring). At test, studied words were presented visually, intermixed randomly with 96 unstudied words. Participants indicated whether each item was old or new (item memory) and whether it was presented internally or externally during study (spatial source memory). Independent measures of memory accuracy and response bias were derived for online source monitoring, item memory and source memory using signal detection theory. Performance on these measures was compared between two groups of 30 participants who scored low or high on a measure of AH proneness. ANOVAs revealed no differences between the high- and low-AH groups in online spatial source monitoring, item memory, or spatial source memory. We found no evidence that proneness to AH in a sample of healthy volunteers was related to any of the measures of spatial source monitoring performance. We recommend that the methods introduced be applied to future investigations of spatial source monitoring with patient groups and with individuals at-risk for psychosis.	\N	\N
22584229	Children's language skills develop rapidly with increasing age, and several studies indicate that they use language- and age-specific strategies to understand complex sentences. In the present experiment, functional magnetic resonance imaging (fMRI) and behavioral measures were used to investigate the acquisition of case-marking cues for sentence interpretation in the developing brain of German preschool children with a mean age of 6 years. Short sentences were presented auditorily, consisting of a transitive verb and two case-marked arguments with canonical subject-initial or non canonical object-initial word order. Overall group results revealed mainly left hemispheric activation in the perisylvian cortex with increased activation in the inferior parietal cortex (IPC), and the anterior cingulate cortex (ACC) for object-initial compared to subject-initial sentences. However, single-subject analysis suggested two distinct activation patterns within the group which allowed a classification into two subgroups. One subgroup showed the predicted activation increase in the left inferior frontal gyrus (IFG) for the more difficult object-initial compared to subject-initial sentences, while the other group showed the reverse effect. This activation in the left IFG can be taken to reflect the degree to which adult-like sentence processing strategies, necessary to integrate case-marking information, are applied. Additional behavioral data on language development tests show that these two subgroups differ in their grammatical knowledge. Together with these behavioral findings, the results indicate that the use of a particular processing strategy is not dependent on age as such, but rather on the child's individual grammatical knowledge and the ability to use specific language cues for successful sentence comprehension.	\N	\N
22595658	The aim of this experiment was to examine the preattentive processing of syllables in 9-11-year-old children with dyslexia and matched controls using the Mismatch Negativity (MMN), an auditory Event-Related brain potential (ERP) related to preattentive discrimination. Children were presented with a sequence of syllables that included standards (the syllable "Ba") and deviants in vowel frequency, vowel duration and Voice Onset Time (VOT) that were either close to or far from the standard (Small and Large deviants). No between-group differences were found for frequency deviants. However, whilst normal-reading children showed larger MMNs to Large than to Small deviants in vowel duration and VOT, no such deviance size effect was found in children with dyslexia. These results are taken to indicate that the preattentive processing of vowel duration and VOT is impaired in children with dyslexia, with no impairment in the processing of vowel frequency deviants. By revealing processing deficits of both duration and VOT deviants, these results suggest a strong link between acoustical and phonological processing.	\N	\N
22609772	Older adults often find it more difficult than younger adults to attend to a target talker when there are other people talking. One possible reason for this difficulty is that it may take them longer to perceptually segregate the target speech from competing speech. This study investigated age-related differences in the time it takes to segregate target speech from either a speech spectrum noise masker or a babble masker (many people talking simultaneously). Specifically, we employed five different delays (0.1 s-1.1 s) between masker onset and target speech onset. Four signal-to-masker ratios were employed at each delay to determine the 50% thresholds for word recognition accuracy when target words were masked by either speech spectrum noise or multi-talker babble. Thresholds for word recognition decreased exponentially as a function of the masker-word-onset delay, at the same rate for younger and older adults, when the masker was speech spectrum noise. When the masker was babble, thresholds for younger adults decreased exponentially with delay at the same rate as they did when the masker was speech spectrum noise. The word recognition thresholds for older adults, however, did not appear to change over the range of delays explored in this study. In addition, the average difference between word recognition thresholds for younger and older adults (younger adult thresholds < older adult thresholds) was significantly larger when the masker was babble than when it was noise. These results indicate that older adults are as fast as younger adults at separating speech from a steady-state noise masker, but are not as capable as younger adults of taking advantage of the delayed onset of the speech target when the masker is babble. The potential contributions of age-related sensory and cognitive declines to these stream segregation effects are discussed. 
Finally, we conclude that age-related differences in the timeline for stream segregation contribute to the difficulties older adults experience in listening to speech in a background of babble.	\N	\N
22633004	While perceiving speech, people see mouth shapes that are systematically associated with sounds. In particular, a vertically stretched mouth produces a /woo/ sound, whereas a horizontally stretched mouth produces a /wee/ sound. We demonstrate that hearing these speech sounds alters how we see aspect ratio, a basic visual feature that contributes to perception of 3D space, objects and faces. Hearing a /woo/ sound increases the apparent vertical elongation of a shape, whereas hearing a /wee/ sound increases the apparent horizontal elongation. We further demonstrate that these sounds influence aspect ratio coding. Viewing and adapting to a tall (or flat) shape makes a subsequently presented symmetric shape appear flat (or tall). These aspect ratio aftereffects are enhanced when associated speech sounds are presented during the adaptation period, suggesting that the sounds influence visual population coding of aspect ratio. Taken together, these results extend previous demonstrations that visual information constrains auditory perception by showing the converse - speech sounds influence visual perception of a basic geometric feature.	\N	\N
22641191	The present study uses a systems engineering approach to delineate the relationship between tinnitus and hyperacusis as a result of either hearing loss in the ear or an imbalanced state in the brain. Specifically examined is the input-output function, or loudness growth as a function of intensity in both normal and pathological conditions. Tinnitus reduces the output dynamic range by raising the floor, while hyperacusis reduces the input dynamic range by lowering the ceiling or sound tolerance level. Tinnitus does not necessarily steepen the loudness growth function but hyperacusis always does. An active loudness model that consists of an expansion stage following a compression stage can account for these key properties in tinnitus and hyperacusis loudness functions. The active loudness model suggests that tinnitus is a result of increased central noise, while hyperacusis is due to increased nonlinear gain. The active loudness model also generates specific predictions on loudness growth in tinnitus, hyperacusis, hearing loss or any combinations of the three conditions. These predictions need to be verified by experimental data and have explicit implications for treatment of tinnitus and hyperacusis.	\N	\N
22646514	Sensory consequences of our own actions are perceived differently from the sensory stimuli that are generated externally. The present event-related potential (ERP) study examined the neural responses to self-triggered stimulation relative to externally-triggered stimulation as a function of delays between the motor act and the stimulus onset. While sustaining a vowel phonation, subjects clicked a mouse and heard pitch-shift stimuli (PSS) in voice auditory feedback at delays of either 0 ms (predictable) or 500-1000 ms (unpredictable). The motor effect resulting from the mouse click was corrected in the data analyses. For the externally-triggered condition, PSS were delivered by a computer with a delay of 500-1000 ms after the vocal onset. As compared to unpredictable externally-triggered PSS, P2 responses to predictable self-triggered PSS were significantly suppressed, whereas an enhancement effect for P2 responses was observed when the timing of self-triggered PSS was unpredictable. These findings demonstrate the effect of the temporal predictability of stimulus delivery with respect to the motor act on the neural responses to self-triggered stimulation. Responses to self-triggered stimulation were suppressed or enhanced compared with the externally-triggered stimulation when the timing of stimulus delivery was predictable or unpredictable. Enhancement effect of unpredictable self-triggered stimulation in the present study supports the idea that sensory suppression of self-produced action may be primarily caused by an accurate prediction of stimulus timing, rather than a movement-related non-specific suppression.	\N	\N
22648606	According to many theories of decision making, of which signal detection theory is the most prominent, randomness is the main factor responsible for imperfect performance. These theories imply that correcting for attenuation due to randomness should result in perfect scores as long as the participants use nonextreme decision criteria. On the basis of a recent advance termed potential performance theory (Trafimow & Rice, Psychological Review 115:447-462, 2008), we performed auditory and visual detection experiments and corrected the scores for attenuation. Most participants in both experiments tended to perform at a less-than-perfect level, even after their scores were corrected. The findings demonstrate that at least one systematic factor influences detection that is not included in signal detection theory.	\N	\N
22653919	The authors investigated lengthening effects in child-directed speech (CDS) across the sentence, testing the additive effects on duration of Word Position, Register, Focus, and Sentence Mode (statement/question). Five theater students produced 6 sentences containing 5 monosyllabic words in a simulated dialogue, varying in Register, Focus, and Sentence Mode. The authors segmented a total of 1,800 sentences using forced-alignment tools, and they analyzed the duration of each word. The results show significant effects of Register, Word Position, and their interactions. The simple effect of Register was significant in all 5 word positions, indicating a global elongation effect in CDS. Interestingly, there was no proportional increase of the final word in CDS. In addition, the 3-way interactions Register × Word Position × Focus and Register × Word Position × Sentence Mode were significant, which converge to the conclusion that the utterance-final word in CDS is additively elongated when it is focused and in a statement. Elongation in CDS is a global effect, but the additive effects of duration demonstrated in the authors' data suggest that the effect of enhanced utterance-final lengthening in CDS in naturalistic samples may be a by-product of discourse characteristics of CDS.	\N	\N
22664896	To examine the association between dehiscence length in patients with superior semicircular canal dehiscence syndrome and their clinical findings, including objective audiometric and vestibular testing results. Retrospective study. Tertiary referral center. Patients included in this study were diagnosed with superior semicircular canal dehiscence syndrome and underwent surgical repair of the dehiscence through middle fossa craniotomy. The dehiscence length was measured intraoperatively in all cases. Correlation between dehiscence length with pure-tone average (PTA), average bone-conduction threshold, maximal air-bone gap, cervical vestibular evoked myogenic potential thresholds, and presenting signs and symptoms. The correlation between dehiscence length and maximal air-bone gap was statistically significant on both univariate and multivariate regression analyses. The correlations between dehiscence length and PTA, average bone-conduction threshold, cervical vestibular evoked myogenic potential threshold, and presenting signs and symptoms were not statistically significant. The dehiscence length correlated positively with the maximal air-bone gap in patients with superior semicircular canal dehiscence. The correlation was statistically significant. The dehiscence length did not correlate with the other variables examined in this study.	\N	\N
22666781	The purpose of the study was to acoustically compare the performance of children who do and do not stutter on diadochokinesis tasks in terms of syllable duration, syllable periods, and peak intensity. In this case-control study, acoustical analyses were performed on 26 children who stutter and 20 age-matched normally fluent children (both groups stratified into preschoolers and school-aged children) during a diadochokinesis task: the repetition of articulatory segments through a task testing the ability to alternate movements. Speech fluency was assessed using the Fluency Profile and the Stuttering Severity Instrument. The children who stutter and those who do not did not significantly differ in terms of the acoustic patterns they produced in the diadochokinesis tasks. Significant differences were demonstrated between age groups independent of speech fluency. Overall, the preschoolers performed more poorly. These results indicate that the observed differences are related to speech-motor age development and not to stuttering itself. Acoustic studies demonstrate that speech segment durations are most variable, both within and between subjects, during childhood and then gradually decrease to adult levels by the age of eleven to thirteen years. One possible explanation for the results of the present study is that children who stutter presented higher coefficients of variation to exploit the motor equivalence to achieve accurate sound production (i.e., the absence of speech disruptions).	\N	\N
22672110	Auditory spatial deficits occur frequently after hemispheric damage; a previous case report suggested that the explicit awareness of sound positions, as in sound localisation, can be impaired while the implicit use of auditory cues for the segregation of sound objects in noisy environments remains preserved. By assessing systematically patients with a first hemispheric lesion, we have shown that (1) explicit and/or implicit use can be disturbed; (2) impaired explicit vs. preserved implicit use dissociations occur rather frequently; and (3) different types of sound localisation deficits can be associated with preserved implicit use. Conceptually, the dissociation between the explicit and implicit use may reflect the dual-stream dichotomy of auditory processing. Our results speak in favour of systematic assessments of auditory spatial functions in clinical settings, especially when adaptation to auditory environment is at stake. Further, systematic studies are needed to link deficits of explicit vs. implicit use to disability in everyday activities, to design appropriate rehabilitation strategies, and to ascertain how far the explicit and implicit use of spatial cues can be retrained following brain damage.	\N	\N
22686693	The goal of an action can consist of generating a change in the environment (to produce an effect) or changing one's own situation in the environment (to move to a physical target). To investigate whether the mechanisms of effect-directed and target-directed action control are similar, participants performed continuous reversal movements. They either synchronized movement reversals with regularly presented tones (temporal targets) or produced tones at reversals isochronously (temporal effects). In both goal conditions an irrelevant goal characteristic was integrated into the goal representation (loudness, Experiment 1). When targets and effects were presented within the same reversal movement, similarities were enhanced (Experiment 2). When the task posed spatial demands in addition to temporal demands, target- and effect-directed movement kinematics changed equally with tempo (Experiment 3). Correlations between target-directed and effect-directed movements in temporal variability indicated similar timing mechanisms (Experiments 1 and 2). Only gradual differences between target- and effect-directed movements were observed. We conclude that the same mechanisms of action control, including the anticipation of upcoming events, underlie effect-directed and target-directed movements. Ideomotor theories of action control should incorporate action targets as goals similar to action effects.	\N	\N
22696248	We examined the effects of hedges and the discourse marker like on how people recalled specific details about precise quantities in spontaneous speech. We found that listeners treated hedged information differently from like-marked information, although both are thought to be indicators of uncertainty or vagueness. In addition, hedges had different effects depending on whether speakers were (1) retelling conversations to another person or (2) answering questions about material they had heard. When retelling to another person, listeners were more likely to report information that was either unmarked or marked with a like than hedged information (Experiment 1). Yet when answering questions by themselves, hedges enhanced memory for details, in comparison with likes (Experiment 2). Hedges appear to provide pragmatic cues about what information is reliable enough to repeat in a conversational context. But although hedged information may be left out, it is not forgotten.	\N	\N
22696304	This study determined the effects of phonology and semantics on the distribution of cortical activity to the second of a pair of words in first and second language (mixed pairs). The effects of relative proficiency in the two languages and linguistic setting (monolinguistic or mixed) are reported in a companion paper. Ten early bilinguals and 14 late bilinguals listened to mixed pairs of words in Arabic (L1) and Hebrew (L2) and indicated whether both words in the pair had the same or different meanings. The spatio-temporal distribution of current densities of event-related potentials was estimated for each language and according to semantic and phonologic relationship (same or different) compared with the first word in the pair. During early processing (<300 ms), brain activity in temporal and temporoparietal auditory areas was enhanced by phonologic incongruence between words in the pair and in Wernicke's area by both phonologic and semantic priming. In contrast, brain activities during late processing (>300 ms) were enhanced by semantic incongruence between the two words, particularly in temporal areas and in left hemisphere Broca's and Wernicke's areas. The latter differences were greater when words were in L2. Surprisingly, no significant effects of relative proficiency on processing the second word in the pair were found. These results indicate that the distribution of brain activity to the second of two words presented bilingually is affected differently during early and late processing by both semantic and phonologic priming by, and incongruence with, the immediately preceding word.	\N	\N
22698777	Functional near-infrared spectroscopy (fNIRS), a non-invasive neuroimaging optical technique which measures the cortical concentration changes in oxygenated and deoxygenated hemoglobin (O(2)Hb and HHb, respectively), has been extensively utilized in language studies. Most of these studies investigated the ventrolateral/dorsolateral cortex responses, while few language studies on the frontopolar cortex are reported. The aim of this study was to investigate by fNIRS the frontopolar cortex response to a letter verbal fluency task (VFT) in single healthy subjects to better understand the symmetry/asymmetry of language processing. The O(2)Hb and HHb changes were measured on 33 university students by an 8-channel fNIRS system. A significant increase in O(2)Hb (p<0.001), accompanied by a smaller significant decrease in HHb (p<0.001), was observed in each measurement point. However, the laterality index of 21 out of the 33 subjects evidenced a hemispheric dominance (right 9, left 12). Although these results have confirmed a bilateral activation over the frontopolar cortex upon VFT, no clear pattern of lateralization was found. Considering the importance of establishing a response pattern related to cognitive functions in clinical populations, the fNIRS investigation of the frontopolar cortex (and other areas involved in language) in single subjects and the use of the laterality index are recommended.	\N	\N
22699985	To report and review the clinical experiences of patients who required reimplantation from an ongoing trial of patients with partial deafness who were treated with electroacoustic stimulation (EAS) cochlear implantation. Retrospective case series review. Tertiary referral center. Two patients with partial deafness, 1 child and 1 adult, who required reimplantation because of device failure occurring 12 to 18 months after hearing preservation cochlear implantation with a Med-El Sonata Flex-EAS electrode array. Reimplantation (with full insertion) of a Med-El Sonata Flex-EAS array (child) and the new complete cochlear coverage Med-El Sonata Flex-28 electrode array (adult). Surgical techniques used include round window insertion with slow insertion speed and the use of preoperative systemic steroids and preoperative, perioperative, and postimplantation intratympanic steroids. Preservation of residual hearing. Both patients had complete preservation of residual hearing after reimplantation. The adult patient had stable improvement in hearing from 750 to 2,000 Hz of 5 to 10 dB. Both patients reported increased benefit after reimplantation. We report a case series of successful pediatric and adult EAS reimplantation; in the adult, hearing improvement was observed after reimplantation with a deep insertion electrode. Reimplantation with preservation of residual hearing in patients with EAS is possible with current surgical hearing-preserving techniques and atraumatic electrode arrays of variable length.	\N	\N
22709398	Infants begin to segment novel words from speech by 7.5 months, demonstrating an ability to track, encode and retrieve words in the context of larger units. Although it is presumed that word recognition at this stage is a prerequisite to constructing a vocabulary, the continuity between these stages of development has not yet been empirically demonstrated. The goal of the present study is to investigate whether infant word segmentation skills are indeed related to later lexical development. Two word segmentation tasks, varying in complexity, were administered in infancy and related to childhood outcome measures. Outcome measures consisted of age-normed productive vocabulary percentiles and a measure of cognitive development. Results demonstrated a strong degree of association between infant word segmentation abilities at 7 months and productive vocabulary size at 24 months. In addition, outcome groups, as defined by median vocabulary size and growth trajectories at 24 months, showed distinct word segmentation abilities as infants. These findings provide the first prospective evidence supporting the predictive validity of infant word segmentation tasks and suggest that they are indeed associated with mature word knowledge. A video abstract of this article can be viewed at http://www.youtube.com/watch?v=jxzLi5oLZQ8.	\N	\N
22717191	Patients with schizophrenia consistently demonstrate information processing abnormalities assessed with visual masking (VM) tasks, and these deficits have been linked to clinical and functional severity. It has been suggested that VM impairments may be a vulnerability marker in individuals at risk for developing psychosis. Forward and backward VM performance was assessed in 72 first-episode (FE) psychosis patients, 98 subjects at risk (AR) for psychosis and 98 healthy controls (HC) using two identification tasks (with either a high- or low-energy mask) and a location task. VM was examined for stability in a subgroup (FE, n=15; AR, n=35; HC, n=21) and assessed relative to clinical and functional measures. In the identification tasks, backward VM deficits were observed in both FE and AR relative to HC whereas forward VM deficits were only present in FE patients compared to HC. In the location task, AR subjects demonstrated superior performance in forward VM relative to HC. VM performance was stable over time, and VM deficits were associated with baseline functional measures and predicted future negative symptom severity in AR subjects. Visual information processing deficits, as indexed by backward VM, are present before and after the onset of frank psychosis, and probably represent a stable vulnerability marker that is associated with negative symptoms and functional decline. Additionally, the paradoxically better performance of AR subjects in select forward tasks suggests that early compensatory changes may characterize an emerging psychotic state.	\N	\N
22721630	Vocal expressions commonly elicit activity in superior temporal and inferior frontal cortices, indicating a distributed network to decode vocally expressed emotions. We examined the involvement of this fronto-temporal network for the decoding of angry voices during attention towards (explicit attention) or away from emotional cues in voices (implicit attention) based on a reanalysis of previous data (Frühholz, S., Ceravolo, L., Grandjean, D., 2012. Cerebral Cortex 22, 1107-1117). The general network revealed high interconnectivity of bilateral inferior frontal gyrus (IFG) to different bilateral voice-sensitive regions in mid and posterior superior temporal gyri. Right superior temporal gyrus (STG) regions showed connectivity to the left primary auditory cortex and secondary auditory cortex (AC) as well as to high-level auditory regions. This general network revealed differences in connectivity depending on the attentional focus. Explicit attention to angry voices revealed a specific right-left STG network connecting higher-level AC. During attention to a nonemotional vocal feature we also found a left-right STG network implicitly elicited by angry voices that also included low-level left AC. Furthermore, only during this implicit processing there was widespread interconnectivity between bilateral IFG and bilateral STG. This indicates that while implicit attention to angry voices recruits extended bilateral STG and IFG networks for the sensory and evaluative decoding of voices, explicit attention to angry voices solely involves a network of bilateral STG regions probably for the integrative recognition of emotional cues from voices.	\N	\N
22723356	The integration of facial gestures and vocal signals is an essential process in human communication and relies on an interconnected circuit of brain regions, including language regions in the inferior frontal gyrus (IFG). Studies have determined that ventral prefrontal cortical regions in macaques [e.g., the ventrolateral prefrontal cortex (VLPFC)] share cytoarchitectonic features with cortical areas in the human IFG, suggesting structural homology. Anterograde and retrograde tracing studies show that macaque VLPFC receives afferents from the superior and inferior temporal gyrus, which provide complex auditory and visual information, respectively. Moreover, physiological studies have shown that single neurons in VLPFC integrate species-specific face and vocal stimuli. Although bimodal responses may be found across a wide region of prefrontal cortex, vocalization-responsive cells, which also respond to faces, are mainly found in anterior VLPFC. This suggests that VLPFC may be specialized to process and integrate social communication information, just as the IFG is specialized to process and integrate speech and gestures in the human brain.	\N	\N
22724279	The main goal of this study was to investigate the effects of acoustic characteristics, including timbre and fundamental frequency (F0), on the musical pitch discrimination of cochlear implant users. Eight postlingually deafened cochlear implant users were recruited, along with 8 control subjects with normal hearing. Pitch discrimination tests were carried out using test stimuli from 4 musical instruments plus synthetic complex stimuli. Three reference tones with different F0s were used. The mean difference limens were 1.8 to 10.7 semitones in the just-noticeable difference task and 2.1 to 13.6 semitones in the pitch-direction discrimination task for different timbre and F0 combinations. Three-way analysis of variance showed that the acoustic characteristics of the musical stimuli, such as timbre and F0, significantly influenced pitch discrimination performance. Acoustic characteristics determine the complexity of the electrical stimulation pattern, which directly affects performance in pitch discrimination. A place pattern with a clear and regular low-order harmonic structure is most important for good pitch discrimination. A clear F0-related temporal pattern is also useful when the F0 is low. Pitch perception performance will worsen when there is interference in the high-frequency channels.	\N	\N
22727355	The objective of this study was to evaluate the relationship between developmental delays and speech perception in pre-lingually deafened cochlear implant recipients. This study was a retrospective review of patient charts conducted at a tertiary referral center. Thirty-five pre-lingually deafened children underwent multichannel cochlear implantation and habilitation at the Kyoto University Hospital Department of Otolaryngology-Head and Neck Surgery. A pre-operative cognitive-adaptive developmental quotient was evaluated using the Kyoto scale of psychological development. Post-operative speech performance was evaluated with speech perception tests two years after cochlear implantation. We computed partial correlation coefficients (controlled for age at the time of implantation and the average pre-operative aided hearing level) between the cognitive-adaptive developmental quotient and speech performance. A developmental delay in the cognitive-adaptive area was weakly correlated with speech perception (partial correlation coefficients for consonant-vowel syllables and phrases were 0.38 and 0.36, respectively). A pre-operative developmental delay was only weakly associated with poor post-operative speech perception in pre-lingually deafened cochlear implant recipients.	\N	\N
22728130	The neural processing of auditory information engages pathways that begin initially at the cochlea and that eventually reach forebrain structures. At these higher levels, the computations necessary for extracting auditory source and identity information rely on the neuroanatomical connections between the thalamus and cortex. Here, the general organization of these connections in the medial geniculate body (thalamus) and the auditory cortex is reviewed. In addition, we consider two models organizing the thalamocortical pathways of the non-tonotopic and multimodal auditory nuclei. Overall, the transfer of information to the cortex via the thalamocortical pathways is complemented by the numerous intracortical and corticocortical pathways. Although interrelated, the convergent interactions among thalamocortical, corticocortical, and commissural pathways enable the computations necessary for the emergence of higher auditory perception.	\N	\N
22731996	The influence of top-down cognitive control on 2 putatively distinct forms of distraction was investigated. Attentional capture by a task-irrelevant auditory deviation (e.g., a female-spoken token following a sequence of male-spoken tokens), as indexed by its disruption of a visually presented recall task, was abolished when focal-task engagement was promoted either by increasing the difficulty of encoding the visual to-be-remembered stimuli (by reducing their perceptual discriminability; Experiments 1 and 2) or by providing foreknowledge of an imminent deviation (Experiment 2). In contrast, distraction from continuously changing auditory stimuli ("changing-state effect") was not modulated by task-difficulty or foreknowledge (Experiment 3). We also confirmed that individual differences in working memory capacity, typically associated with maintaining task-engagement in the face of distraction, predict the magnitude of the deviation effect, but not the changing-state effect. This convergence of experimental and psychometric data strongly supports a duplex-mechanism account of auditory distraction: Auditory attentional capture (deviation effect) is open to top-down cognitive control, whereas auditory distraction caused by direct conflict between the sound and focal-task processing (changing-state effect) is relatively immune to such control.	\N	\N
22753470	A visual scene is perceived in terms of visual objects. Similar ideas have been proposed for the analogous case of auditory scene analysis, although their hypothesized neural underpinnings have not yet been established. Here, we address this question by recording from subjects selectively listening to one of two competing speakers, either of different or the same sex, using magnetoencephalography. Individual neural representations are seen for the speech of the two speakers, with each being selectively phase locked to the rhythm of the corresponding speech stream and from which can be exclusively reconstructed the temporal envelope of that speech stream. The neural representation of the attended speech dominates responses (with latency near 100 ms) in posterior auditory cortex. Furthermore, when the intensity of the attended and background speakers is separately varied over an 8-dB range, the neural representation of the attended speech adapts only to the intensity of that speaker but not to the intensity of the background speaker, suggesting an object-level intensity gain control. In summary, these results indicate that concurrent auditory objects, even if spectrotemporally overlapping and not resolvable at the auditory periphery, are neurally encoded individually in auditory cortex and emerge as fundamental representational units for top-down attentional modulation and bottom-up neural adaptation.	\N	\N
22764349	Presenting synchronous auditory and visual stimuli in separate locations creates the illusion that the sound originates from the direction of the visual stimulus. Participants' auditory localization bias, called the ventriloquism effect, has revealed factors affecting the perceptual integration of audio-visual stimuli. However, many studies on audio-visual processes have focused on performance in simplified experimental situations, with a single stimulus in each sensory modality. These results cannot necessarily explain our perceptual behavior in natural scenes, where various signals exist within a single sensory modality. In the present study we report the contributions of a cognitive factor, that is, the audio-visual congruency of speech, although this factor has often been underestimated in previous ventriloquism research. Thus, we investigated the contribution of speech congruency on the ventriloquism effect using a spoken utterance and two videos of a talking face. The salience of facial movements was also manipulated. As a result, when bilateral visual stimuli are presented in synchrony with a single voice, cross-modal speech congruency was found to have a significant impact on the ventriloquism effect. This result also indicated that more salient visual utterances attracted participants' auditory localization. The congruent pairing of audio-visual utterances elicited greater localization bias than did incongruent pairing, whereas previous studies have reported little dependency on the reality of stimuli in ventriloquism. Moreover, audio-visual illusory congruency, owing to the McGurk effect, caused substantial visual interference to auditory localization. This suggests that a greater flexibility in responding to multi-sensory environments exists than has been previously considered.	\N	\N
22768163	Auditory sensory modulation difficulties are common in autism spectrum disorders (ASD) and may stem from a faulty arousal system that compromises the ability to regulate an optimal response. To study neurophysiological correlates of the sensory modulation difficulties, we recorded magnetic field responses to clicks in 14 ASD and 15 typically developing (TD) children. We further analyzed the P100m, which is the most prominent component of the auditory magnetic field response in children and may reflect preattentive arousal processes. The P100m was rightward lateralized in the TD, but not in the ASD children, who showed a tendency toward P100m reduction in the right hemisphere (RH). The atypical P100m lateralization in the ASD subjects was associated with greater severity of sensory abnormalities assessed by Short Sensory Profile, as well as with auditory hypersensitivity during the first two years of life. The absence of right-hemispheric predominance of the P100m and a tendency for its right-hemispheric reduction in the ASD children suggests disturbance of the RH ascending reticular brainstem pathways and/or their thalamic and cortical projections, which in turn may contribute to abnormal arousal and attention. The correlation of sensory abnormalities with atypical, more leftward, P100m lateralization suggests that reduced preattentive processing in the right hemisphere and/or its shift to the left hemisphere may contribute to abnormal sensory behavior in ASD.	\N	\N
22773778	Because acoustic landscapes are complex and rapidly changing, auditory systems have evolved mechanisms that permit rapid detection of novel sounds, sound source segregation, and perceptual restoration of sounds obscured by noise. Perceptual restoration is particularly important in noisy environments because it allows organisms to track sounds over time even when they are masked. The continuity illusion is a striking example of perceptual restoration with sounds perceived as intact even when parts of them have been replaced by gaps and rendered inaudible by being masked by an extraneous sound. The mechanisms of auditory filling-in are complex and are currently not well-understood. The present study used the high temporal resolution of EEG to examine brain activity related to continuity illusion perception. Masking noise loudness was adjusted individually for each subject so that physically identical sounds on some trials elicited a continuity illusion (failure to detect a gap in a sound) and on other trials resulted in correct gap detection. This design ensured that any measurable differences in brain activity would be due to perceptual differences rather than physical differences among stimuli. We found that baseline activity recorded immediately before presentation of the stimulus significantly predicted the occurrence of the continuity illusion in 10 out of 14 participants based on power differences in γ-band EEG (34-80 Hz). Across all participants, power in the β and γ (12- to 80-Hz range) was informative about the subsequent perceptual decision. These data suggest that a subject's baseline brain state influences the strength of continuity illusions.	\N	\N
22774804	The issue investigated in the present research is the nature of the information that is responsible for producing masked priming effects (e.g., semantic information or stimulus-response [S-R] associations) when responding to number stimuli. This issue was addressed by assessing both the magnitude of the category congruence (priming) effect and the nature of the priming distance effect across trials using single-digit primes and targets. Participants made either magnitude (i.e., whether the number presented was larger or smaller than 5) or identification (i.e., press the left button if the number was either a 1, 2, 3, or 4 or the right button if the number was either a 6, 7, 8, or 9) judgments. The results indicated that, regardless of task instruction, there was a clear priming distance effect and a significantly increasing category congruence effect. These results indicated that both semantic activation and S-R associations play important roles in producing masked priming effects.	\N	\N
22776903	Stuttering is generally considered to be a speech disorder that affects ∼1% of the global population. Various forms of speech feedback have been shown to reduce overt stuttered speaking, and in particular, second speech signal through speech feedback has drastically reduced utterances of stuttered speech in adults with persistent stuttering. This study reports data for increased overt fluency of speech in an adult stuttering population, whereby the vocalization of the speaker is captured by a microphone or an accelerometer, signal processed, and returned as mechanical tactile speech feedback to the speaker's skin. A repeated measures analysis of variance was used to show that both the microphone and the accelerometer speaking conditions were significantly more fluent than a control (no feedback) condition, with the microphone-driven tactile feedback reducing instances of stuttering by 71% and the accelerometer-driven tactile feedback reducing instances of stuttering by 80%. It is apparent that self-generated tactile feedback can be used to enhance fluency significantly in those who stutter.	\N	\N
22777734	A growing literature has suggested that processing of visual information presented near the hands is facilitated. In this study, we investigated whether the near-hands superiority effect also occurs with the hands moving. In two experiments, participants performed a cyclical bimanual movement task requiring concurrent visual identification of briefly presented letters. For both the static and dynamic hand conditions, the results showed improved letter recognition performance with the hands closer to the stimuli. The finding that the encoding advantage for near-hand stimuli also occurred with the hands moving suggests that the effect is regulated in real time, in accordance with the concept of a bimodal neural system that dynamically updates hand position in external space.	\N	\N
22788230	Altered auditory feedback can facilitate speech fluency in adults who stutter. However, other findings suggest that adults who stutter show anomalies in 'audiovocal integration', such as longer phonation reaction times to auditory stimuli and less effective pitch tracking. The aim was to study audiovocal integration in adults who stutter using the pitch-shift paradigm. Fourteen adult stuttering participants and 16 normally fluent adults produced the vowel /a/ while monitoring their own voice through earphones. Unanticipated pitch-shifts were applied in the upward or downward direction for 500 ms. Short latency pitch-shift responses (or pitch-shift responses) were elicited in all participants. In stuttering participants, vocal response onset latency was significantly delayed and amplitude tended to be reduced. Atypical audiovocal responses could be associated with stuttering. It is not clear how audiovocal integration influences stuttering, but the atypical responses could signal inadequate activation of internal models.	\N	\N
22796516	In the present study we examined the effect of positional noise on spatial resolution in younger and older observers. We used a yes/no discrimination task in which observers indicated whether the size of two gaps in a Landolt-C-like contour was the same or not. The proportion of trials observers perceived one gap larger was measured when gaps-position was fixed (low positional noise) and random (high positional noise). Specifically, we compared, across conditions and groups, the values of threshold, lower and upper asymptote of the psychometric function. In the younger group, noise does not prevent detection of gap-size difference although sensitivity is lower, as revealed by higher threshold and lower upper asymptote, i.e., the proportion of responses "I see a larger gap" at the largest gap-size difference (asymptotic performance). In the older group detection is prevented, as revealed by threshold, lower and upper asymptote data. This may be because, at stimulus onset, high positional noise has associated coarse filter analysers averaging across the two gaps, which cannot be switched off.	\N	\N
22799761	Little is known about how sex influences functional brain maturation. The current study investigated sex differences in the maturation of event-related potential (ERP) amplitudes during an auditory oddball task (N = 170; age = 6-17 years). Performance improved with age. N200 amplitude declined with age: parietal sites showed earlier development than temporal and frontal locations. Girls showed greater bilateral frontal P300 amplitude development, approaching the higher values observed in boys during childhood. After controlling for age, right frontal P300 amplitude was associated with reaction time in girls. The findings demonstrate sex differences in ERP maturation in line with behavioral and neuroimaging studies.	\N	\N
22802637	Unlike nonhuman primates, songbirds learn to vocalize very much like human infants acquire spoken language. In humans, Broca's area in the frontal lobe and Wernicke's area in the temporal lobe are crucially involved in speech production and perception, respectively. Songbirds have analogous brain regions that show a similar neural dissociation between vocal production and auditory perception and memory. In both humans and songbirds, there is evidence for lateralization of neural responsiveness in these brain regions. Human infants already show left-sided dominance in their brain activation when exposed to speech. Moreover, a memory-specific left-sided dominance in Wernicke's area for speech perception has been demonstrated in 2.5-mo-old babies. It is possible that auditory-vocal learning is associated with hemispheric dominance and that this association arose in songbirds and humans through convergent evolution. Therefore, we investigated whether there is similar song memory-related lateralization in the songbird brain. We exposed male zebra finches to tutor or unfamiliar song. We found left-sided dominance of neuronal activation in a Broca-like brain region (HVC, a letter-based name) of juvenile and adult zebra finch males, independent of the song stimulus presented. In addition, juvenile males showed left-sided dominance for tutor song but not for unfamiliar song in a Wernicke-like brain region (the caudomedial nidopallium). Thus, left-sided dominance in the caudomedial nidopallium was specific for the song-learning phase and was memory-related. These findings demonstrate a remarkable neural parallel between birdsong and human spoken language, and they have important consequences for our understanding of the evolution of auditory-vocal learning and its neural mechanisms.	\N	\N
22805019	To establish the method of conducting electrical evoked middle latency response (EMLR) monitoring in cochlear implantation operation and further to assess the neural response of auditory pathway under electrical stimulation. Twenty cases of cochlear implantation subjects were investigated in this study. Fourteen cases were pre-lingual deaf and 6 were post-lingual deaf. The surface recording electrodes were placed on the patients under general anesthesia, with language processor connected to the triggering port of the auditory evoked potential device. After the electrode was implanted, the electrode No.3 was selected to conduct. The electrically evoked auditory nerve compound active potentials (ECAP) were firstly tested in all patients, thereafter the EABR mode was selected, and the stimulation parameters were changed to EMLR mode with monopole biphasic, alternation stimulation, pulse width from 50 to 100 µs, the stimulation intensity decreased or increased from 20 CL above the strength of the ECAP threshold to the reaction threshold with a step of 5CL. To evaluate the correlation between the ECAP thresholds and EMLR, another 6 cases of normal hearing healthy subjects were recruited to record their short-sound evoked auditory middle-latency response (AMLR), as the control of morphology and latency of MLR by electrical stimulation. The typical AMLR waveforms could be recorded by the composition of five waves in the 6 cases of normal hearing healthy subjects, with an average response threshold of (12.5±8.6) dBnHL, close to the behavioral audiometric threshold (10.8±7.3) dBHL. The EMLR waveforms could be recorded in 20 patients, which was similar to the AMLR waveforms. However, the wave latency and wave interval shortened. There were lower volatility and longer latency in pre-lingual deaf than post-lingual deaf. The EMLR threshold (140.55±9.92) CL was significantly lower than the ECAP threshold (160.75±13.34) CL (t=10.467, P<0.01), a positive correlation between the thresholds was detected (r=0.763, P<0.01). We successfully established the method of EMLR monitoring in cochlear implantation surgery. The EMLR threshold is lower than the ECAP threshold but it is close to the behavioral audiometric threshold; EMLR can provide neural response information closer to the auditory center, and can serve as an effective objective method to evaluate the effect of hearing rehabilitation.	\N	\N
22829158	The aim of this study was to describe the outcome and possible complications of subtotal petrosectomy (SP) for Vibrant Soundbridge (VSB) device surgery in a tertiary referral center. A secondary objective was the evaluation of hearing results in a subgroup of subjects who received the VSB device. Between 2009 and early 2011, 22 adult subjects with chronic otitis media (COM) underwent a SP, blind sac closure of the external auditory canal and abdominal fat obliteration to facilitate the application of an active middle ear implant (AMEI) in a staged procedure. Indications consisted of mixed hearing loss after previous tympanomastoplasty and failure of hearing rehabilitation with a hearing aid or bone conduction device in COM. Pre- and postoperative pure-tone audiograms were analyzed in respect to deterioration of inner ear function, unaided and aided (hearing aid, bone-anchored hearing aid and VSB) speech audiograms were compared to verify improvements in communications skills and functional gains. Incidence and type of complications were reviewed. No significant change was observed regarding mean bone conduction thresholds after the first stage procedure. Some minor wound healing problems were noted. Speech perception using the VSB (n = 16) showed a mean aided speech discrimination at 65-dB SPL of 75 % [standard deviation (SD) 28.7], at 80-dB SPL of 90 % (SD 25.1). Our results suggest that for selected patients with open mastoid cavities and chronic middle ear disease, SP with abdominal fat obliteration is an effective and safe technique to facilitate safe AMEI placement.	\N	\N
22832675	Auditory-perceptual evaluation of dysphonia may be influenced by the type of speech/voice task used to render judgements during the clinical evaluation, i.e., sustained vowels versus continuous speech. This study explored (a) differences in listener dysphonia severity ratings on the basis of speech/voice tasks, (b) the influence of speech/voice task on dysphonia severity ratings of stimuli that combined sustained vowels and continuous speech, and (c) the differences in inter-rater reliability of dysphonia severity ratings between both speech tasks. Five experienced listeners rated overall dysphonia severity in sustained vowels, continuous speech and concatenated speech samples elicited by 39 subjects with various voice disorders and degrees of hoarseness. Data confirmed that sustained vowels are rated significantly more dysphonic than continuous speech. Furthermore, dysphonia severity in concatenated speech samples is least determined by the sustained vowel. Finally, no significant difference was found in inter-rater reliability between dysphonia severity ratings of sustained vowels versus continuous speech. Based upon the results, both types of speech/voice tasks (i.e., sustained vowel and continuous speech) should be elicited and judged by clinicians in the auditory-perceptual rating of dysphonia severity.	\N	\N
22844984	A fundamental issue in the design and the interpretation of experimental studies of perception relates to the question of whether the participants in these experiments could perform the perceptual task assigned to them using another feature, or cue, than that intended by the experimenter. An approach frequently used by auditory- and visual-perception researchers to guard against this possibility involves applying random variations to the stimuli across presentations or trials so as to make the "unwanted" cue unreliable for the participants. However, the theoretical basis of this widespread practice is not well developed. In this article, we describe a 2-channel model based on general principles of psychophysical signal detection theory, which can be used to assess the respective contributions of the unwanted cue and of the primary cue to performance or thresholds measured in perceptual discrimination experiments involving stimulus randomization. Example applications of the model to the analysis of results obtained in representative studies from the auditory- and visual-perception literature are provided. In several cases, the results of the model-based analyses indicate that the effectiveness of the randomization procedure was less than originally assumed by the authors of these studies. These findings underscore the importance of quantifying the potential influence of unwanted cues on the results of psychophysical experiments, even when stimulus randomization is used.	\N	\N
22846767	Previous studies have demonstrated that human evaluation of subjective loudness and acoustic comfort depends on a series of factors in a particular situation rather than only on sound pressure levels. In the present study, a large-scale subjective survey has been undertaken on underground shopping streets in Harbin, China, to determine how individual sound sources influence subjective loudness and acoustic comfort evaluation. Based on the analysis of case study results, it has been shown that all individual sound sources can increase subjective loudness to a certain degree. However, their levels of influence on acoustic comfort are different. Background music and the public address system can increase acoustic comfort, with a mean difference of 0.18 to 0.32 and 0.21 to 0.27, respectively, where a five-point bipolar category scale is used. Music from shops and vendor shouts can decrease acoustic comfort, with a mean difference of -0.11 to -0.38 and -0.39 to -0.62, respectively. The feasibility of improving acoustic comfort by changing certain sound sources is thus demonstrated.	\N	\N
22866682	Two experiments tested the effects of preview sentences and headings on the quality of college students' outlines of informational texts. Experiment 1 found that performance was much better in the preview sentences condition than in a no-signals condition for both printed text and text-to-speech (TTS) audio rendering of the printed text. In contrast, performance in the headings condition was good for the printed text but poor for the auditory presentation because the TTS software failed to communicate nonverbal information carried by the visual headings. Experiment 2 compared outlining performance for five headings conditions during TTS presentation. Using a theoretical framework, "signaling available, relevant, accessible" (SARA) information, to provide an analysis of the information content of headings in the printed text, the manipulation of the headings systematically restored information that was omitted by the TTS application in Experiment 1. The result was that outlining performance improved to levels similar to the visual headings condition of Experiment 1. It is argued that SARA is a useful framework for guiding future development of TTS software for a wide variety of text signaling devices, not just headings.	\N	\N
22891070	Perceptual training with spectrally degraded environmental sounds results in improved environmental sound identification, with benefits shown to extend to untrained speech perception as well. The present study extended those findings to examine longer-term training effects as well as effects of mere repeated exposure to sounds over time. Participants received two pretests (1 week apart) prior to a week-long environmental sound training regimen, which was followed by two posttest sessions, separated by another week without training. Spectrally degraded stimuli, processed with a four-channel vocoder, consisted of a 160-item environmental sound test, word and sentence tests, and a battery of basic auditory abilities and cognitive tests. Results indicated significant improvements in all speech and environmental sound scores between the initial pretest and the last posttest with performance increments following both exposure and training. For environmental sounds (the stimulus class that was trained), the magnitude of positive change that accompanied training was much greater than that due to exposure alone, with improvement for untrained sounds roughly comparable to the speech benefit from exposure. Additional tests of auditory and cognitive abilities showed that speech and environmental sound performance were differentially correlated with tests of spectral and temporal-fine-structure processing, whereas working memory and executive function were correlated with speech, but not environmental sound perception. These findings indicate generalizability of environmental sound training and provide a basis for implementing environmental sound training programs for cochlear implant (CI) patients.	\N	\N
22892280	Three studies investigated developmental changes in facial expression processing between 3 years of age and adulthood. For adults and older children, the addition of sunglasses to upright faces caused an equivalent decrement in performance to face inversion. However, younger children showed better classification of expressions of faces wearing sunglasses than children who saw the same faces un-occluded. When the mouth area was occluded with a mask, children under nine years showed no impairment in expression classification, relative to un-occluded faces. An early selective focus of attention on the eyes may be optimal for socialization, but militate against accurate expression classification. The data support a model in which a threshold level of attentional control must be reached before children can develop adult-like configural processing skills and be flexible in their use of face-processing strategies.	\N	\N
22892586	We introduce a new version of the perceptual retouch model. This model was used for explaining properties of temporal interaction of successive objects in reaching conscious representation. The new model incorporates two interactive binding operations - binding features for objects and binding the bound feature-objects with a large scale oscillatory system that corresponds to perceptual consciousness. Here, the typical result of masking experiments - second object advantage in conscious perception - is achieved by applying the effects of a common synchronizing oscillator with a delay. This delayed modulation of each of the feature-binding first-order oscillators that represent emerging and decaying neural activities of each of the objects guarantees that the oscillating synchrony of the feature-neurons of the following object is higher than the synchrony of the feature-neurons of the first presented object. Thus we model the fact that the following object dominates the preceding object in conscious perception. We also show the capacity of the model to simulate illusory misbinding of features from different objects. The third qualitative effect, the relative release of the first object from backward masking is achieved by priming the non-specific oscillatory modulation ahead in time.	\N	\N
22894217	Green [J. Acoust. Soc. Am. 87, 2662-2674 (1990)] suggested an efficient, maximum-likelihood-based approach for adaptively estimating thresholds. Such procedures determine the signal strength on each trial by first identifying the most likely psychometric functions among the pre-proposed alternatives based on responses from previous trials, and then finding the signal strength at the "sweet point" on that most likely function. The sweet point is the point on the psychometric function that is associated with the minimum expected variance. Here, that procedure is extended to reduce poor estimates that result from lapses in attention. The sweet points for the threshold, slope, and lapse parameters of a transformed logistic psychometric function are derived. In addition, alternative stimulus placement algorithms are considered. The result is a relatively fast and robust estimation of a three-parameter psychometric function.	\N	\N
22894232	Speaker variability and noise are two common sources of acoustic variability. The goal of this study was to examine whether these two sources of acoustic variability affected native and non-native perception of Mandarin fricatives to different degrees. Multispeaker Mandarin fricative stimuli were presented to 40 native and 52 non-native listeners in two presentation formats (blocked by speaker and mixed across speakers). The stimuli were also mixed with speech-shaped noise to create five levels of signal-to-noise ratios. The results showed that noise affected non-native identification disproportionately. By contrast, the effect of speaker variability was comparable between the native and non-native listeners. Confusion patterns were interpreted with reference to the results of acoustic analysis, suggesting native and non-native listeners used distinct acoustic cues for fricative identification. It was concluded that not all sources of acoustic variability are treated equally by native and non-native listeners. Whereas noise compromised non-native fricative perception disproportionately, speaker variability did not pose a special challenge to the non-native listeners.	\N	\N
22895701	Critical periods in language acquisition have been discussed primarily with reference to studies of people who are deaf or bilingual. Here, we provide evidence on the opening of sensitivity to the linguistic environment by studying the response to a change of phoneme at a native and nonnative phonetic boundary in full-term and preterm human infants using event-related potentials. Full-term infants show a decline in their discrimination of nonnative phonetic contrasts between 9 and 12 months of age. Because the womb is a high-frequency filter, many phonemes are strongly degraded in utero. Preterm infants thus benefit from earlier and richer exposure to broadcast speech. We find that preterms do not take advantage of this enriched linguistic environment: the decrease in amplitude of the mismatch response to a nonnative change of phoneme at the end of the first year of life was dependent on maturational age and not on the duration of exposure to broadcast speech. The shaping of phonological representations by the environment is thus strongly constrained by brain maturation factors.	\N	\N
22897876	Neurophysiological studies of infant speech suggest that mismatch responses (MMRs) have predictive value for later language. Their value, however, is diminished because unexplained differences in the MMR patterns are seen across studies. The current study aimed to identify the functional nature of infant MMRs by recording event-related-potentials (ERPs) to an infrequent English vowel change in internal or final positions of a sequence of ten vowels in six-month-old monolingually and bilingually exposed infants. Increased negativity of the MMR (infrequent minus frequent) was found in final compared to internal positions and correlated with an index of increased attention to the final position. This pattern helps explain the overall greater negativity to the speech sounds in the bilingually exposed female infants. These findings substantially advance our understanding of neural indices of speech perception development and show promise for furthering our understanding of bilingual language development.	\N	\N
22907183	Although central nervous system abnormalities are incidentally detected in preoperative brain magnetic resonance imaging (MRI) studies in pediatric cochlear implant (CI) candidates, the clinical significance of the abnormalities remains unclear. We aimed to assess post-implantation auditory and speech performance in patients with brain lesions seen on MRI. Pediatric CI recipients (n = 177) who underwent preoperative MRI scans of the brain between January 2002 and June 2009 were included in this study. Patients with brain lesions on MRI were reviewed and categorized into the following groups: brain parenchymal lesions (focal vs. diffuse), ventriculomegaly, and extra-axial lesion. The main communication mode as well as progress in auditory perception and speech production were evaluated preoperatively and at 3, 6, 12, and 24 months postoperatively. Performance in patients with brain lesions was compared with the age- and sex-matched control group. Various brain lesions were found in 27 out of 177 patients. Children with brain lesions who received CIs showed gradual progress in auditory and speech outcomes for 2 years, though performance was reduced compared with the control group. In addition, there was a significant difference in the main communication mode between the two groups at 2 years following cochlear implantation. This difference was especially significant in patients with diffuse brain parenchymal lesions after further stratification of the brain lesion group. Preoperative brain MRI may have a role in improving the prediction of adverse outcomes in pediatric CI recipients. In particular, children with diffuse brain parenchymal lesions should be counseled regarding the poor prognosis preoperatively, and followed up with special attention.	\N	\N
22922236	Cognitive models propose that auditory verbal hallucinations arise through inner speech misidentification. However, such models cannot explain why the voices in hallucinations often have identities different from the hearer. This study investigated whether a general voice identity recognition difficulty might be present in schizophrenia and related to auditory verbal hallucinations. Twenty-five schizophrenia patients and 13 healthy controls were tested on recognition of famous voices. Signal detection theory was used to calculate perceptual sensitivity and response criterion measures. Schizophrenia patients obtained fewer hits and had lower perceptual sensitivity to detect famous voices than healthy controls did. There were no differences between groups in false alarm rate or response criterion. A symptom-based analysis demonstrated that especially those patients with auditory verbal hallucinations performed poorly in the task. The results indicate that patients with hallucinations are impaired at voice identity recognition because of decreased sensitivity, which may result in inner speech misidentification.	\N	\N
22922606	To evaluate, with a long-term follow-up, the speech perception and language development in children with cytomegalovirus (CMV)-related deafness after cochlear implantation. A retrospective study on CMV-related profound deafness and cochlear implantation was performed from 1995 to 2010. Six children with an average follow-up of 10 years were included in this research. Medical history, imaging, cognitive delay, speech perception and production data were reviewed. Two of the 6 patients developed a functional language with the use of phrases and word sequences based on morphological and syntactic rules; the others demonstrated the development of a preverbal or transitional language with the use of single words only. Patients with CMV-related deafness benefit from cochlear implantation; however, the expectations of the parents must be evaluated in a series of counseling efforts prior to the surgery.	\N	\N
22923209	Autism spectrum disorder is typically associated with social deficits and is often specifically linked to difficulty with processing faces and other socially relevant stimuli. Emerging research has suggested that children with autism might also have deficits in basic perceptual abilities including multisensory processing (e.g., simultaneously processing visual and auditory inputs). The current study examined the relationship between multisensory temporal processing (assessed via a simultaneity judgment task wherein participants were to report whether a visual stimulus and an auditory stimulus occurred at the same time or at different times) and self-reported symptoms of autism (assessed via the Autism Spectrum Quotient questionnaire). Data from over 100 healthy adults revealed a relationship between these two factors as multisensory timing perception correlated with symptoms of autism. Specifically, a stronger bias to perceive auditory stimuli occurring before visual stimuli as simultaneous was associated with greater levels of autistic symptoms. Additional data and analyses confirm that this relationship is specific to multisensory processing and symptoms of autism. These results provide insight into the nature of multisensory processing while also revealing a continuum over which perceptual abilities correlate with symptoms of autism and that this continuum is not just specific to clinical populations but is present within the general population.	\N	\N
22925516	We live in a world rich in sensory information, and consequently the brain is challenged with deciphering which cues from the various sensory modalities belong together. Determinations regarding the relatedness of sensory information appear to be based, at least in part, on the spatial and temporal relationships between the stimuli. Stimuli that are presented in close spatial and temporal correspondence are more likely to be associated with one another and thus 'bound' into a single perceptual entity. While there is a robust literature delineating behavioral changes in perception induced by multisensory stimuli, maturational changes in multisensory processing, particularly in the temporal realm, are poorly understood. The current study examines the developmental progression of multisensory temporal function by analyzing responses on an audiovisual simultaneity judgment task in 6- to 23-year-old participants. The overarching hypothesis for the study was that multisensory temporal function will mature with increasing age, with the developmental trajectory for this change being the primary point of inquiry. Results indeed reveal an age-dependent decrease in the size of the 'multisensory temporal binding window', the temporal interval within which multisensory stimuli are likely to be perceptually bound, with changes occurring over a surprisingly protracted time course that extends into adolescence.	\N	\N
22926436	The results of two experiments are presented which explore the effect of distractor items on face and voice recognition. Following from the suggestion that voice processing is relatively weak compared to face processing, it was anticipated that voice recognition would be more affected by the presentation of distractor items between study and test compared to face recognition. Using a sequential matching task with a fixed interval between study and test that either incorporated distractor items or did not, the results supported our prediction. Face recognition remained strong irrespective of the number of distractor items between study and test. In contrast, voice recognition was significantly impaired by the presence of distractor items regardless of their number (Experiment 1). This pattern remained whether distractor items were highly similar to the targets or not (Experiment 2). These results offer support for the proposal that voice processing is a relatively vulnerable method of identification.	\N	\N
22946856	Motivational interviewing (MI) is a directive, client-centered therapeutic method employed in the treatment of substance abuse, with strong evidence of effectiveness. To date, the sole mechanism of action in MI with any consistent empirical support is "change talk" (CT), which is generally defined as client within-session speech in support of a behavior change. "Sustain talk" (ST) incorporates speech in support of the status quo. MI maintains that during treatment, clients essentially talk themselves into change. Multiple studies have now supported this theory, linking within-session speech to substance use outcomes. Although a causal chain has been established linking therapist behavior, client CT, and substance use outcome, the neural substrate of CT has been largely uncharted. We addressed this gap by measuring neural responses to clients' own CT using magnetoencephalography (MEG), a noninvasive neuroimaging technique with excellent spatial and temporal resolution. Following a recorded MI session, MEG was used to measure brain activity while participants heard multiple repetitions of their CT and ST utterances from that session, intermingled and presented in a random order. Results suggest that CT processing occurs in a right-hemisphere network that includes the inferior frontal gyrus, insula, and superior temporal cortex. These results support a representation of CT at the neural level, consistent with the role of these structures in self-perception. This suggests that during treatment sessions, clinicians who are able to evoke this special kind of language are tapping into neural circuitry that may be essential to behavior change.	\N	\N
22951258	Comprehension of spoken narratives requires coordination of multiple language skills. As such, for normal children narrative skills develop well into the school years and, during this period, are particularly vulnerable in the face of brain injury or developmental disorder. For these reasons, we sought to determine the developmental trajectory of narrative processing using longitudinal fMRI scanning. Thirty healthy children between the ages of 5 and 18, enrolled at ages 5, 6, or 7, were examined annually for up to 10 years. At each fMRI session, children were presented with a set of five 30-s-long stories containing 9, 10, or 11 sentences designed to be understood by a 5-year-old child. fMRI data analysis was conducted based on a hierarchical linear model (HLM) that was modified to investigate developmental changes while accounting for missing data and controlling for factors such as age, linguistic performance and IQ. Performance testing conducted after each scan indicated comprehension performance well above chance (p<0.002). There was a linear increase with increasing age in bilateral superior temporal cortical activation (BAs 21 and 22) linked to narrative processing. Conversely, age-related decreases in cortical activation were observed in bilateral occipital regions, cingulate and cuneus, possibly reflecting changes in the default mode networks. The dynamic changes observed in this longitudinal fMRI study support the increasing role of bilateral BAs 21 and 22 in narrative comprehension, involving non-domain-specific integration in order to achieve final story interpretation. The presence of a continued linear development of this area throughout childhood and teenage years with no apparent plateau indicates that full maturation of narrative processing skills has not yet occurred and that it may be delayed to early adulthood.	\N	\N
22957659	The Adaptive Tests of Temporal Resolution (ATTR©) software provides within-channel (WC) and across-channel (AC) adaptive measures of temporal resolution that are feasible for clinical applications. The purpose of the present study was to obtain normative values for young adults on two of the ATTR tests: the narrow-band noise within-channel (NBN-WC) test and the narrow-band noise across-channel (NBN-AC) test, at different stimulus intensities. Gap detection thresholds were measured at five sensation levels. A Latin square design was used to control for practice effects. The NBN-WC group and the NBN-AC group each consisted of 25 young adults with normal hearing. Gap detection thresholds for both conditions decreased with increasing stimulus intensity, and stimulus intensities above 20 dB SL were not associated with large improvements in performance. Variability was larger in the NBN-AC condition. Values obtained for the NBN-WC condition were very similar to previously reported ATTR results despite equipment and design differences. Results provide normative values for NBN-WC and NBN-AC performance on the ATTR and suggest that the ATTR is a robust test for clinical use.	\N	\N
22963230	Properties of auditory working memory for sounds that lack strong semantic associations and are not readily verbalized or sung are poorly understood. We investigated auditory working memory capacity for lists containing 2-6 easily discriminable abstract sounds synthesized within a constrained timbral space, at delays of 1-6 s (Experiment 1), and the effect of greater perceptual variability among list items on capacity estimates at delays of 1-6 s (Experiment 2). Working memory capacity estimates of 1-2 items were found in all conditions and increased significantly as the perceptual variability among the list items increased. Nonetheless, the capacity estimates were smaller than the commonly observed average working memory capacity limit of 3-5 items. Decay profiles in both experiments were comparable with those previously reported in the verbal and auditory working memory literature. The results help define boundary conditions on capacity estimates for nonverbalizable timbres that lack strong long-term memory associations.	\N	\N
22978899	A common complaint of the hearing impaired is the inability to understand speech in noisy environments even with their hearing assistive devices. Only a few single-channel algorithms have significantly improved speech intelligibility in noise for hearing-impaired listeners. The current study introduces a cochlear noise reduction algorithm. It is based on a cochlear representation of acoustic signals and real-time derivation of a binary speech mask. The contribution of the algorithm for enhancing word recognition in noise was evaluated on a group of 42 normal-hearing subjects, 35 hearing-aid users, 8 cochlear implant recipients, and 14 participants with bimodal devices. Recognition scores of Hebrew monosyllabic words embedded in Gaussian noise at several signal-to-noise ratios (SNRs) were obtained with processed and unprocessed signals. The algorithm was not effective among the normal-hearing participants. However, it yielded a significant improvement in some of the hearing-impaired subjects under different listening conditions. Its most impressive benefit appeared among cochlear implant recipients. More than 20% improvement in recognition score of noisy words was obtained by 12, 16, and 26 hearing-impaired participants at SNRs of 30, 24, and 18 dB, respectively. The algorithm has the potential to improve speech intelligibility in background noise, yet further research is required to improve its performance.	\N	\N
22978901	The auditory octave illusion arises when dichotically presented tones, one octave apart, alternate rapidly between the ears. Most subjects perceive an illusory sequence of monaural tones: A high tone in the right ear (RE) alternates with a low tone, incorrectly localized to the left ear (LE). Behavioral studies suggest that the perceived pitch follows the RE input, and the perceived location the higher-frequency sound. To explore the link between the perceived pitches and brain-level interactions of dichotic tones, magnetoencephalographic responses were recorded to 4 binaural combinations of 2-min long continuous 400- and 800-Hz tones and to 4 monaural tones. Responses to LE and RE inputs were distinguished by frequency-tagging the ear-specific stimuli at different modulation frequencies. During dichotic presentation, ipsilateral LE tones elicited weaker and ipsilateral RE tones stronger responses than when both ears received the same tone. During the most paradoxical stimulus (high tone to LE and low tone to RE, perceived as a low tone in LE during the illusion), the contralateral responses to LE tones were also diminished. The results demonstrate modified binaural interaction of dichotic tones one octave apart, suggesting that this interaction contributes to pitch perception during the octave illusion.	\N	\N
22981882	Dyslexia is heritable and associated with auditory processing deficits. We investigate whether temporal auditory processing is compromised in young children at risk for dyslexia and whether it is associated with later language and reading skills. We recorded EEG from 17-month-old children with or without familial risk for dyslexia to investigate whether their auditory system was able to detect a temporal change in a tone pattern. The children were followed longitudinally and performed an intelligence and language development test at ages 4 and 4.5 years. Literacy-related skills were measured at the beginning of second grade, and word and pseudo-word reading fluency were measured at the end of second grade. The EEG responses showed that control children could detect the temporal change as indicated by a mismatch response (MMR). The MMR was not observed in at-risk children. Furthermore, the fronto-central MMR amplitude correlated with preliterate language comprehension and with later word reading fluency, but not with phonological awareness. We conclude that temporal auditory processing differentiates young children at risk for dyslexia from controls and is a precursor of preliterate language comprehension and reading fluency.	\N	\N
22982103	Production of actions is highly dependent on concurrent sensory information. In speech production, for example, movement of the articulators is guided by both auditory and somatosensory input. It has been demonstrated in non-human primates that self-produced vocalizations and those of others are differentially processed in the temporal cortex. The aim of the current study was to investigate how auditory and motor responses differ for self-produced and externally produced speech. Using functional neuroimaging, subjects were asked to produce sentences aloud, to silently mouth while listening to a different speaker producing the same sentence, to passively listen to sentences being read aloud, or to read sentences silently. We show that separate regions of the superior temporal cortex display distinct response profiles to speaking aloud, mouthing while listening, and passive listening. Responses in anterior superior temporal cortices in both hemispheres are greater for passive listening compared with both mouthing while listening and speaking aloud. This is the first demonstration that articulation, whether or not it has auditory consequences, modulates responses of the dorsolateral temporal cortex. In contrast, posterior regions of the superior temporal cortex are recruited during both articulation conditions. In dorsal regions of the posterior superior temporal gyrus, responses to mouthing and reading aloud were equivalent, and in more ventral posterior superior temporal sulcus, responses were greater for reading aloud compared with mouthing while listening. These data demonstrate an anterior-posterior division of superior temporal regions where anterior fields are suppressed during motor output, potentially for the purpose of enhanced detection of the speech of others. We suggest posterior fields are engaged in auditory processing for the guidance of articulation by auditory information.	\N	\N
22984436	Amplitude modulation can serve as a cue for segregating streams of sounds from different sources. Here we evaluate stream segregation in humans using ABA- sequences of sinusoidally amplitude modulated (SAM) tones. A and B represent SAM tones with the same carrier frequency (1000, 4000 Hz) and modulation depth (30, 100%). The modulation frequency of the A signals (f(modA)) was 30, 100 or 300 Hz, respectively. The modulation frequency of the B signals was up to four octaves higher (Δf(mod)). Three different ABA- tone patterns varying in tone duration and stimulus onset asynchrony were presented to evaluate the effect of forward suppression. Subjects indicated their 1- or 2-stream percept on a touch screen at the end of each ABA- sequence (presentation time 5 or 15 s). Tone pattern, f(modA), Δf(mod), carrier frequency, modulation depth and presentation time significantly affected the percentage of a 2-stream percept. The human psychophysical results are compared to responses of avian forebrain neurons evoked by different ABA- SAM tone conditions [1] that were broadly overlapping those of the present study. The neurons also showed significant effects of tone pattern and Δf(mod) that were comparable to effects observed in the present psychophysical study. Depending on the carrier frequency, modulation frequency, modulation depth and the width of the auditory filters, SAM tones may provide mainly temporal cues (sidebands fall within the range of the filter), spectral cues (sidebands fall outside the range of the filter) or possibly both. A computational model based on excitation pattern differences was used to predict the 50% threshold of 2-stream responses. In conditions for which the model predicts a considerably larger 50% threshold of 2-stream responses (i.e., larger Δf(mod) at threshold) than was observed, it is unlikely that spectral cues can provide an explanation of stream segregation by SAM.	\N	\N
22992710	Auditory perception of vowels in background noise is enhanced when combined with visually perceived speech features. The objective of this study was to investigate whether the influence of visual cues on vowel perception extends to incongruent vowels, in a manner similar to the McGurk effect observed with consonants. Identification of Dutch front vowels /i, y, e, Y/ that share all features other than height and lip-rounding was measured for congruent and incongruent audiovisual conditions. The audio channel was systematically degraded by adding noise, increasing the reliance on visual cues. The height feature was more robustly carried over through the auditory channel and the lip-rounding feature through the visual channel. Hence, congruent audiovisual presentation enhanced identification, while incongruent presentation led to perceptual fusions and thus decreased identification. Visual cues influence the identification of congruent as well as incongruent audiovisual vowels. Incongruent visual information results in perceptual fusions, demonstrating that the McGurk effect can be instigated by long phonemes such as vowels. This result extends to the incongruent presentation of the visually less reliably perceived height. The findings stress the importance of audiovisual congruency in communication devices, such as cochlear implants and videoconferencing tools, where the auditory signal could be degraded.	\N	\N
22993261	Temporal selection poses unique challenges to the perceptual system. Selection is needed to protect goal-relevant stimuli from interference from new sensory input. In addition, contextual information that occurs at the same time as goal-relevant stimuli may be critical for learning. Using fMRI, we characterized how visual cortical regions respond to the temporal selection of auditory and visual stimuli. Critically, we focused on brain regions that are not involved in processing the target itself. Participants pressed a button when they heard a prespecified target tone and did not respond to other tones. Although more attention was directed to auditory input when the target tone was selected, activity in primary visual cortex increased more after target tones than after distractor tones. In contrast to spatial attention, this effect was larger in V1 than in V2 and V3. It was present in regions not typically involved in representing the target stimulus. Additional experiments demonstrated that these effects were not due to multimodal processing, rare targets, or motor responses to the targets. Thus temporal selection of behaviorally relevant stimuli enhances, rather than reduces, activity in perceptual regions involved in processing other information.	\N	\N
22995182	Auditory feedback plays an important role in monitoring vocal output and determining when adjustments are necessary. In this study a group of untrained singers participated in a frequency altered feedback experiment to examine if accuracy at matching a note could predict the degree of compensation to auditory feedback that was shifted in frequency. Participants were presented with a target note and instructed to match the note in pitch and duration. Following the onset of the participants' vocalizations their vocal pitch was shifted down one semi-tone at a random time during their utterance. This altered auditory feedback was instantaneously presented back to them through headphones. Results indicated that note matching accuracy did not correlate with compensation magnitude; however, a significant correlation was found between baseline variability and compensation magnitude. These results suggest that individuals with a more stable baseline fundamental frequency rely more on feedforward control mechanisms than individuals with more variable vocal production. This increased weighting of feedforward control means they are less sensitive to mismatches between their intended vocal production and auditory feedback.	\N	\N
23000118	Users of a cochlear implant together with a hearing aid in the non-implanted ear currently use devices that were developed separately and are often fitted separately. This results in very different growth of loudness with level in the two ears, potentially leading to decreased wearing comfort and suboptimal perception of interaural level differences. A loudness equalisation strategy, named 'SCORE bimodal', is proposed. It equalises loudness growth for the two modalities using existing models of loudness for acoustic and electric stimulation, and is suitable for implementation in wearable devices. Loudness balancing experiments were performed with six bimodal listeners to validate the strategy. In a first set of experiments, the function of each loudness model used was validated by balancing the loudness of four harmonic complexes of different bandwidths, ranging from 200 Hz to 1000 Hz, separately for each ear. Both the electric and acoustic loudness models predicted the data well. In a second set of experiments, binaural balancing was done for the same stimuli. It was found that SCORE significantly improved binaural balance.	\N	\N
23000801	Cochlear implants (CIs) help many deaf children achieve near-normal speech and language (S/L) milestones. Nevertheless, high levels of unexplained variability in S/L outcomes are limiting factors in improving the effectiveness of CIs in deaf children. The objective of this study was to longitudinally assess the role of verbal short-term memory (STM) and working memory (WM) capacity as a progress-limiting source of variability in S/L outcomes after CI in children. Longitudinal study of 66 children with CIs for prelingual severe-to-profound hearing loss. Outcome measures included performance on digit span forward (DSF), digit span backward (DSB), and four conventional S/L measures that examined spoken-word recognition (Phonetically Balanced Kindergarten word test), receptive vocabulary (Peabody Picture Vocabulary Test), sentence-recognition skills (Hearing in Noise Test), and receptive and expressive language functioning (Clinical Evaluation of Language Fundamentals Fourth Edition Core Language Score; CELF). Growth curves for DSF and DSB in the CI sample over time were comparable in slope, but consistently lagged in magnitude relative to norms for normal-hearing peers of the same age. For DSF and DSB, 50.5% and 44.0%, respectively, of the CI sample scored more than 1 SD below the normative mean for raw scores across all ages. The first (baseline) DSF score significantly predicted all endpoint scores for the four S/L measures, and DSF slope (growth) over time predicted CELF scores. DSF baseline and slope accounted for an additional 13 to 31% of variance in S/L scores after controlling for conventional predictor variables such as chronological age at time of testing, age at time of implantation, communication mode (auditory-oral communication versus total communication), and maternal education. Only DSB baseline scores predicted endpoint language scores on Peabody Picture Vocabulary Test and CELF. DSB slopes were not significantly related to any endpoint S/L measures. DSB baseline scores and slopes taken together accounted for an additional 4 to 19% of variance in S/L endpoint measures after controlling for the conventional predictor variables. Verbal STM/WM scores, process measures of information capacity, develop at an average rate in the years after cochlear implantation, but were found to consistently lag in absolute magnitude behind those reported for normal-hearing peers. Baseline verbal STM/WM predicted long-term endpoint S/L outcomes, but verbal STM slopes predicted only endpoint language outcomes. Verbal STM/WM processing skills reflect important underlying core elementary neurocognitive functions and represent potential intervention targets for improving endpoint S/L outcomes in pediatric CI users.	\N	\N
23014760	In this paper, we present a Bayesian framework for the active multimodal perception of 3-D structure and motion. The design of this framework finds its inspiration in the role of the dorsal perceptual pathway of the human brain. Its composing models build upon a common egocentric spatial configuration that is naturally fitting for the integration of readings from multiple sensors using a Bayesian approach. In the process, we will contribute with efficient and robust probabilistic solutions for cyclopean geometry-based stereovision and auditory perception based only on binaural cues, modeled using a consistent formalization that allows their hierarchical use as building blocks for the multimodal sensor fusion framework. We will explicitly or implicitly address the most important challenges of sensor fusion using this framework, for vision, audition, and vestibular sensing. Moreover, interaction and navigation require maximal awareness of spatial surroundings, which, in turn, is obtained through active attentional and behavioral exploration of the environment. The computational models described in this paper will support the construction of a simultaneously flexible and powerful robotic implementation of multimodal active perception to be used in real-world applications, such as human-machine interaction or mobile robot navigation.	\N	\N
23015425	Depth-electrode recordings from the auditory cortex of humans undergoing presurgical evaluation for epilepsy allow the recording of ensemble responses to pitch in the form of local field potentials. These recordings allow another test of the hypothesis that there is a specialized neural ensemble for pitch within auditory cortex. Moreover, the technique allows recordings from multiple sites with millisecond temporal resolution to allow modeling of the effective connectivity between these sites. Here we argue that this takes the form of a hierarchical network of pitch-sensitive regions. Activity can be understood as reflecting predictive coding, in which perceptual predictions and error messages are continuously exchanged between a higher pitch center and lower-level auditory cortex.	\N	\N
23025156	Change blindness is the failure of observers to notice otherwise obvious changes to a visual scene when those changes are masked in some way (e.g., by blotches or a blanking of the screen). Typically, change blindness is taken as evidence that our representation of the visual world is capacity limited. The locus of this capacity limit is thought to be visual short-term memory (vSTM). The capacity of vSTM is usually estimated with a high-threshold model which assumes that each element in the stimulus array is either fully encoded or not encoded at all, and, furthermore, that false alarms can arise only by guessing, not by noise. Low-threshold models, by contrast, suggest that false alarms can arise by noise at the level of detection/discrimination and/or decision. In this study, we use a well-controlled stimulus display in which a single element changes over a blanking of the screen and contrast predictions from a popular high-threshold model of vSTM with the predictions of a low-threshold model (specifically, the sample-size model) of visual search and vSTM. The data were better predicted by the low-threshold model.	\N	\N
23025164	Cross-sensory correspondences automatically intrude on performance in elaborate laboratory tasks (see Spence 2011 Attention, Perception, & Psychophysics 73 971-995, for a review). Outside such tasks, might they be responsible for some popular misconceptions about natural phenomena? Four simple demonstrations reveal how the correspondences between surface-lightness and weight, and between surface-lightness and auditory pitch, generate misconceptions about the weight and movement of objects and the vocalisations of animals. Specifically, people expect darker objects to be heavier than lighter-coloured objects, to free-fall more quickly, to roll across a table more slowly, and to make lower-pitched vocalisations when they come to life.	\N	\N
23028516	We physically interact with external stimuli when they occur within a limited space immediately surrounding the body, i.e., Peripersonal Space (PPS). In the primate brain, specific fronto-parietal areas are responsible for the multisensory representation of PPS, by integrating tactile, visual and auditory information occurring on and near the body. Dynamic stimuli are particularly relevant for PPS representation, as they might refer to potential harms approaching the body. However, behavioural tasks for studying PPS representation with moving stimuli are lacking. Here we propose a new dynamic audio-tactile interaction task in order to assess the extension of PPS in a more functionally and ecologically valid condition. Participants vocally responded to a tactile stimulus administered at the hand at different delays from the onset of task-irrelevant dynamic sounds which gave the impression of a sound source either approaching or receding from the subject's hand. Results showed that a moving auditory stimulus speeded up the processing of a tactile stimulus at the hand as long as it was perceived at a limited distance from the hand, that is, within the boundaries of PPS representation. The audio-tactile interaction effect was stronger when sounds were approaching compared to when sounds were receding. This study provides a new method to dynamically assess PPS representation: the function describing the relationship between tactile processing and the position of sounds in space can be used to estimate the location of PPS boundaries, along a spatial continuum between far and near space, in a valuable and ecologically significant way.	\N	\N
23029113	An auditory neuron can preserve the temporal fine structure of a low-frequency tone by phase-locking its response to the stimulus. Apart from sound localization, however, much about the role of this temporal information for signal processing in the brain remains unknown. Through psychoacoustic studies we provide direct evidence that humans employ temporal fine structure to discriminate between frequencies. To this end we construct tones that are based on a single frequency but in which, through the concatenation of wavelets, the phase changes randomly every few cycles. We then test the frequency discrimination of these phase-changing tones, of control tones without phase changes, and of short tones that consist of a single wavelet. For carrier frequencies below a few kilohertz we find that phase changes systematically worsen frequency discrimination. No such effect appears for higher carrier frequencies at which temporal information is not available in the central auditory system.	\N	\N
23029492	Findings on song perception and song production have increasingly suggested that common but partially distinct neural networks exist for processing lyrics and melody. However, the neural substrates of song recognition remain to be investigated. The purpose of this study was to examine the neural substrates involved in accessing the "song lexicon" as corresponding to a representational system that might provide links between the musical and phonological lexicons using positron emission tomography (PET). We exposed participants to auditory stimuli consisting of familiar and unfamiliar songs presented in three ways: sung lyrics (song), sung lyrics on a single pitch (lyrics), and the sung syllable 'la' on original pitches (melody). The auditory stimuli were designed to have equivalent familiarity to participants, and they were recorded at exactly the same tempo. Eleven right-handed nonmusicians participated in four conditions: three familiarity decision tasks using song, lyrics, and melody and a sound type decision task (control) that was designed to engage perceptual and prelexical processing but not lexical processing. The contrasts (familiarity decision tasks versus control) showed no common areas of activation between lyrics and melody. This result indicates that essentially separate neural networks exist in semantic memory for the verbal and melodic processing of familiar songs. Verbal lexical processing recruited the left fusiform gyrus and the left inferior occipital gyrus, whereas melodic lexical processing engaged the right middle temporal sulcus and the bilateral temporo-occipital cortices. Moreover, we found that song specifically activated the left posterior inferior temporal cortex, which may serve as an interface between verbal and musical representations in order to facilitate song recognition.	\N	\N
23033450	This study aimed to compare sound production errors arising due to phonological processing impairment with errors arising due to motor speech impairment. Two speakers with similar clinical profiles who produced similar consonant cluster simplification errors were examined using a repetition task. We compared both overall accuracy and acoustic details of hundreds of productions with target consonant clusters to tokens with singletons. Changes in accuracy over the course of the study were also compared. In target words with consonant cluster simplification, the individual whose errors reflected phonological impairment produced articulatory timing consistent with singleton onsets. These productions improved when resyllabification was possible, but error rates were not affected by exposure. In contrast, the individual with motoric-based errors produced simplifications that contained the articulatory timing associated with clusters. Accuracy was not affected by the ability to resyllabify, but it did significantly improve following repeated production. Our findings reveal clear differences between errors arising in phonological processing and in motor planning that reflect the underlying systems. The changes over the course of the study suggest that error types with different sources are responsive to different intervention strategies.	\N	\N
23036182	The present event-related potential (ERP) study examined the developmental mechanisms of auditory-vocal integration in normally developing children. Neurophysiological responses to altered auditory feedback were recorded to determine whether they are affected by age and sex. Forty-two children were pairwise matched for sex and were divided into a group of younger (10-12 years) and a group of older (13-15 years) children. Twenty healthy young adults (20-25 years) also participated in the experiment. ERPs were recorded from the participants who heard their voice pitch feedback unexpectedly shifted -50, -100, or -200 cents during sustained vocalization. P1 amplitudes became smaller as subjects increased in age from childhood to adulthood, and males produced larger N1 amplitudes than females. An age-related decrease in the P1-N1 latencies was also found: latencies were shorter in young adults than in school children. A complex age-by-sex interaction was found for the P2 component, where an age-related increase in P2 amplitudes existed only in girls, and boys produced longer P2 latencies than girls but only in the older children. These findings demonstrate that neurophysiological responses to pitch errors in voice auditory feedback depend on age and sex in normally developing children. The present study provides evidence that there is a sex-specific development of the neural mechanisms involved in auditory-vocal integration.	\N	\N
23047260	To determine the effect of oral steroid treatment on hearing in unilateral Ménière's disease and endolymphatic hydrops patients. Retrospective chart review. Tertiary referral center. All patients presenting during the 2010 calendar year with confirmed unilateral Ménière's disease or endolymphatic hydrops. Those with a first visit and second visit audiogram (n = 58) were included in the analysis of oral steroid treatment effect. Steroid treatment for hearing loss. Change in hearing, as defined by change in affected ear threshold values or speech discrimination score from pretreatment visit to posttreatment visit. Hearing (threshold, speech discrimination score) in patients' affected ear did not significantly change from first visit to second visit after treatment with steroids relative to patients who did not receive steroid treatment. The results of this and other studies would indicate that a Ménière's disease or endolymphatic hydrops patient is unlikely to experience an improvement in hearing from a short course of oral steroid. Clinically observed temporary improvement did not sustain over several months. Further work to elucidate the mechanisms underlying hearing loss in hydrops, perhaps focusing on the dendrite damage noted in animal models of hydrops, is warranted.	\N	\N
23056592	Time-compressed speech, a form of rapidly presented speech, is harder to comprehend than natural speech, especially for non-native speakers. Although it is possible to adapt to time-compressed speech after a brief exposure, it is not known whether additional perceptual learning occurs with further practice. Here, we ask whether multiday training on time-compressed speech yields more learning than that observed during the initial adaptation phase and whether the pattern of generalization following successful learning is different than that observed with initial adaptation only. Two groups of non-native Hebrew speakers were tested on five different conditions of time-compressed speech identification in two assessments conducted 10-14 days apart. Between those assessments, one group of listeners received five practice sessions on one of the time-compressed conditions. Between the two assessments, trained listeners improved significantly more than untrained listeners on the trained condition. Furthermore, the trained group generalized its learning to two untrained conditions in which different talkers presented the trained speech materials. In addition, when the performance of the non-native speakers was compared to that of a group of naïve native Hebrew speakers, performance of the trained group was equivalent to that of the native speakers on all conditions on which learning occurred, whereas performance of the untrained non-native listeners was substantially poorer. Multiday training on time-compressed speech results in significantly more perceptual learning than brief adaptation. Compared to previous studies of adaptation, the training induced learning is more stimulus specific. Taken together, the perceptual learning of time-compressed speech appears to progress from an initial, rapid adaptation phase to a subsequent prolonged and more stimulus specific phase. 
These findings are consistent with the predictions of the Reverse Hierarchy Theory of perceptual learning and suggest constraints on the use of perceptual-learning regimens during second language acquisition.	\N	\N
23059750	The ability to identify stop consonants from brief onset spectra was compared between a group of Chinese children with phonological dyslexia (the PD group, with a mean age of 10 years 4 months) and a group of chronological age-matched control children. The linguistic context, which included vowels and speakers, and durations of stop onset spectra were varied. Children with PD showed lower identification accuracy and exhibited a smaller vowel context effect for some stop-vowel combinations compared with the chronological age-matched control group. Further analyses revealed that the PD group had more variable response patterns, and their responses were less consistent with the acoustic characteristics of stop onset spectra. The results suggest that Chinese children with PD do not show greater sensitivity to allophonic acoustic variability compared with control children and exhibit a generally less robust response pattern to phonetic categories.	\N	\N
23085111	Recent electrophysiological studies have reported short latency modulations in cortical regions for multisensory stimuli, thereby suggesting a subcortical, possibly thalamic origin of these modulations. Concurrently, there is an ongoing debate, whether multisensory interplay reflects automatic, bottom-up driven processes or relies on top-down influences. Here, we dissociated the effects of task set and stimulus configurations on BOLD-signals in the human thalamus with event-related functional magnetic resonance imaging (fMRI). We orthogonally manipulated temporal and spatial congruency of audio-visual stimulus configurations, while subjects judged either their temporal or spatial congruency. Voxel-based fMRI results revealed increased fMRI-signals for the temporal versus spatial task in posterior and central thalamus, respectively. A more sensitive region of interest (ROI)-analysis confirmed that the posterior thalamic nuclei showed a preference for the temporal task and central thalamic nuclei for the spatial task. Moreover, the ROI-analysis also revealed enhanced fMRI-signals for spatially incongruent stimuli in the central thalamus. Together, our results demonstrate that both audio-visual stimulus configurations and task-related processing of spatial or temporal stimulus features selectively modulate thalamic processing and thus are in a position to influence cortical processing at an early stage.	\N	\N
23088507	Sensitivity to frequency ratios is essential for the perceptual processing of complex sounds and the appreciation of music. This study assessed the effect of ratio simplicity on ratio discrimination for pure tones presented either simultaneously or sequentially. Each stimulus consisted of four 100-ms pure tones, equally spaced in terms of frequency ratio and presented at a low intensity to limit interactions in the auditory periphery. Listeners had to discriminate between a reference frequency ratio of 0.97 octave (about 1.96:1) and target frequency ratios, which were larger than the reference. In the simultaneous condition, the obtained psychometric functions were nonmonotonic: as the target frequency ratio increased from 0.98 octave to 1.04 octaves, discrimination performance initially increased, then decreased, and then increased again; performance was better when the target was exactly one octave (2:1) than when the target was slightly larger. In the sequential condition, by contrast, the psychometric functions were monotonic and there was no effect of frequency ratio simplicity. A control experiment verified that the non-monotonicity observed in the simultaneous condition did not originate from peripheral interactions between the tones. Our results indicate that simultaneous octaves are recognized as "special" frequency intervals by a mechanism that is insensitive to the sign (positive or negative) of deviations from the octave, whereas this is apparently not the case for sequential octaves.	\N	\N
23094319	The paper reports on a perception experiment in German that investigated the neuro-cognitive processing of information structural concepts and their prosodic marking using event-related brain potentials (ERPs). Experimental conditions controlled the information status (given vs. new) of referring and non-referring target expressions (nouns vs. adjectives) and were elicited via context sentences, which, unlike most previous ERP studies in the field, did not trigger an explicit focus expectation. Target utterances displayed prosodic realizations of the critical words which differed in accent position and accent type. Electrophysiological results showed an effect of information status, maximally distributed over posterior sites, displaying a biphasic N400-Late Positivity pattern for new information. We claim that this pattern reflects increased processing demands associated with new information, with the N400 indicating enhanced costs from linking information with the previous discourse and the Late Positivity indicating the listener's effort to update his/her discourse model. The prosodic manipulation registered more pronounced effects over anterior regions and revealed an enhanced negativity followed by a Late Positivity for deaccentuation, probably also reflecting costs from discourse linking and updating respectively. The data further lend indirect support to the idea that givenness applies not only to referents but also to non-referential expressions ('lexical givenness').	\N	\N
23095266	A retrospective review was performed of patients treated for middle ear cholesteatoma with bone defects of the skull base via a combined transmastoid/middle fossa approach at the University of Tsukuba Hospital from 2006 through 2011 to determine the safety and effectiveness of a combined transmastoid/middle fossa approach for the treatment of cholesteatoma involving the middle cranial fossa. The bone defects of the skull base were reconstructed with a galeal flap pedicled with a parietal branch of the superficial temporal artery and an autologous bone flap. The clinical and radiological data were analyzed. This series included 8 patients (6 men and 2 women) with a mean age of 46.3 years (range 10-67 years). One of the patients preoperatively exhibited meningoencephalocele of the middle fossa skull base, and in the remaining 7 patients, petrous bone involvement such as involvement of the supralabyrinthine cells was observed. The cholesteatoma lesion was totally removed and inner ear function preserved in all the patients. Cerebrospinal fluid leakage was observed in 1 patient during and after the surgery. Neither meningitis nor recurrence was observed in any patient during the follow-up periods (mean 29.4 months, range 6-64 months). The combined transmastoid/middle fossa approach allowed complete removal of cholesteatoma with middle cranial fossa involvement while preserving hearing and preventing postoperative cerebrospinal fluid leakage and meningitis.	\N	\N
23102977	For children, learning often occurs in the presence of background noise. As such, there is growing desire to improve a child's access to a target signal in noise. Given adult musicians' perceptual and neural speech-in-noise enhancements, we asked whether similar effects are present in musically-trained children. We assessed the perception and subcortical processing of speech in noise and related cognitive abilities in musician and nonmusician children that were matched for a variety of overarching factors. Outcomes reveal that musicians' advantages for processing speech in noise are present during pivotal developmental years. Supported by correlations between auditory working memory and attention and auditory brainstem response properties, we propose that musicians' perceptual and neural enhancements are driven in a top-down manner by strengthened cognitive abilities with training. Our results may be considered by professionals involved in the remediation of language-based learning deficits, which are often characterized by poor speech perception in noise.	\N	\N
23103362	Adolescence is a time of great change in the brain in terms of structure and function. It is possible to track the development of neural function across adolescence using auditory event-related potentials (ERPs). We measured passive auditory ERPs to pure tones and consonant-vowel (CV) syllables in 90 children and adolescents aged 10-18 years, as well as 10 adults. With one exception, the pattern of results was the same for tones and speech: Across adolescence, the P1 ERP peak decreased in size and latency, the N1 increased in size and decreased in latency, the P2 remained constant in size, and the N2 decreased in size but remained stable across adolescence. The exception was P2 latency, which increased for speech but remained stable for tones. Interesting step-like changes were observed for N1 latency for both tones and speech stimuli in 15- to 16-year-olds. These may stem from rapid hormonal changes that affect neurotransmitter activity of the ERP-generating neurons.	\N	\N
23103517	The present study builds on our previous study within the framework of Wyer and Collins's comprehension-elaboration theory of humor processing. In this study, an attempt is made to segregate the neural substrates of incongruity detection and incongruity resolution during the comprehension of verbal jokes. Although a number of fMRI studies have investigated the incongruity-resolution process, the differential neurological substrates of comprehension are still not fully understood. The present study utilized an event-related fMRI design incorporating three conditions (unfunny, nonsensical and funny) to examine distinct brain regions associated with the detection and resolution of incongruities. Stimuli in the unfunny condition contained no incongruities; stimuli in the nonsensical condition contained irresolvable incongruities; and stimuli in the funny condition contained resolvable incongruities. The results showed that the detection of incongruities was associated with greater activation in the right middle temporal gyrus and right medial frontal gyrus, and the resolution of incongruities with greater activation in the left superior frontal gyrus and left inferior parietal lobule. Further analysis based on participants' rating scores provided converging results. Our findings suggest a three-stage neural circuit model of verbal humor processing: incongruity detection and incongruity resolution during humor comprehension and inducement of the feeling of amusement during humor elaboration.	\N	\N
23106730	The acquisition of the function of case-marking is a key step in the development of sentence processing for German-speaking children since case-marking reveals the relations between sentential arguments. In this study, we investigated the development of the processing of case-marking and argument structures in children at 3, 4;6 and 6 years of age, as well as its processing in adults. Using EEG, we measured event-related potentials (ERPs) in response to object-initial compared to subject-initial German sentences including transitive verbs and case-marked noun phrases referring to animate arguments. We also tested children's behavioral competence in a sentence-picture matching task. Word order and case-marking were manipulated in German main clauses. Adults' behavioral performance was close to perfect and their ERPs revealed a negativity for the processing of the topicalized accusative marked noun phrase (NP1) and no effect for the second NP (NP2) in the object-initial structure. Children's behavioral data showed a significant above-chance outcome in the subject-initial condition for all age groups, but not for the object-initial condition. In contrast to adults, the ERPs of 3-year-olds showed a positivity at NP1, indicating difficulties in processing the non-canonical object-initial structures. Children at the age of 4;6 did not differ in the processing patterns of object-initial vs. subject-initial sentences at NP1 but showed a slight positivity at NP2. This positivity at NP2, which implies syntactic integration difficulties, is more pronounced in 6-year-olds but is absent in adults. At NP1, however, 6-year-olds show the same negativity as adults. In sum, the behavioral and electrophysiological findings demonstrate that children in each age group use different strategies, which are indicative of their developmental stage. 
While 3-year-olds merely detect differences in the two sentence structures without being able to use this information for sentence comprehension, 4;6-year-olds proceed to use mainly a word-order strategy, processing NP1 in both conditions in the same manner, which leads to processing difficulties upon detecting case-marking cues at NP2. At the age of 6, children are able to use case-marking cues for comprehension but still show enhanced effort for correct thematic-role assignment.	\N	\N
23106737	Integrating the multisensory features of talking faces is critical to learning and extracting coherent meaning from social signals. While we know much about the development of these capacities at the behavioral level, we know very little about the underlying neural processes. One prominent behavioral milestone of these capacities is the perceptual narrowing of face-voice matching, whereby young infants match faces and voices across species, but older infants do not. In the present study, we provide neurophysiological evidence for developmental decline in cross-species face-voice matching. We measured event-related brain potentials (ERPs) while 4- and 8-month-old infants watched and listened to congruent and incongruent audio-visual presentations of monkey vocalizations and humans mimicking monkey vocalizations. The ERP results indicated that younger infants distinguished between the congruent and the incongruent faces and voices regardless of species, whereas in older infants, the sensitivity to multisensory congruency was limited to the human face and voice. Furthermore, with development, visual and frontal brain processes and their functional connectivity became more sensitive to the congruence of human faces and voices relative to monkey faces and voices. Our data show the neural correlates of perceptual narrowing in face-voice matching and support the notion that postnatal experience with species identity is associated with neural changes in multisensory processing (Lewkowicz & Ghazanfar, 2009).	\N	\N
23110123	The common marmoset (Callithrix jacchus) is a small New World primate that has increasingly been used as a non-human model in the fields of sensory, motor, and cognitive neuroscience. However, little knowledge exists regarding behavioral methods in this species. Developing an understanding of the neural basis of perception and cognition in an animal model requires measurement of both brain activity and behavior. Here we describe an operant conditioning behavioral training method developed to allow controlled psychoacoustic measurements in marmosets. We demonstrate that marmosets can be trained to consistently perform a Go/No-Go auditory task in which a subject licks at a feeding tube when it detects a sound. Correct responses result in delivery of a food reward. Crucially, this operant conditioning task generates little body movement and is well suited for pairing behavior with single-unit electrophysiology. Successful implementation of an operant conditioning behavior opens the door to a wide range of new studies in the field of auditory neuroscience using the marmoset as a model system.	\N	\N
23110674	Evaluation of pure-tone audiometry (PTA) in hearing screening of a population with mild to profound intellectual disability (ID). PTA was performed at six frequencies at the screening level 20 dB HL. Referral criteria were threshold levels ≥ 25 dB HL at two or more frequencies for one ear or both. 1478 participants aged 7-91 years were included. 1470 (99.5%) people cooperated in screening of which 1325 (90%) could be tested on both ears at all six frequencies. A majority, 987 (66.8%), performed ordinary PTA, 234 (15.8%) conditioned play audiometry, and 249 (16.9%) behavioural observation audiometry. Six hundred and sixty-nine (45%) passed and 809 (55%) failed according to referral criteria. Of those failing, 441 (54.5%) accepted referral to clinical evaluation. PTA with slight modifications is applicable for screening of a population with mild to profound intellectual disability. The most challenging and time-consuming activity is to introduce the test procedure in a way that reduces anxiety and establishes trust.	\N	\N
23116815	The supratemporal plane contains several functionally heterogeneous subregions that respond strongly to speech. Much of the prior work on the issue of speech processing in the supratemporal plane has focused on neural responses to single speech vs. non-speech sounds rather than focusing on higher-level computations that are required to process more complex auditory sequences. Here we examined how information is integrated over time for speech and non-speech sounds by quantifying the BOLD fMRI response to stochastic (non-deterministic) sequences of speech and non-speech naturalistic sounds that varied in their statistical structure (from random to highly structured sequences) during passive listening. Behaviorally, the participants were accurate in segmenting speech and non-speech sequences, though they were more accurate for speech. Several supratemporal regions showed increased activation magnitude for speech sequences (preference), but, importantly, this did not predict sensitivity to statistical structure: (i) several areas showing a speech preference were sensitive to statistical structure in both speech and non-speech sequences, and (ii) several regions that responded to both speech and non-speech sounds showed distinct responses to statistical structure in speech and non-speech sequences. While the behavioral findings highlight the tight relation between statistical structure and segmentation processes, the neuroimaging results suggest that the supratemporal plane mediates complex statistical processing for both speech and non-speech sequences and emphasize the importance of studying the neurocomputations associated with auditory sequence processing. These findings identify new partitions of functionally distinct areas in the supratemporal plane that cannot be evoked by single stimuli. The findings demonstrate the importance of going beyond input preference to examine the neural computations implemented in the superior temporal plane.	\N	\N
23117057	Introducing coherent masker envelope modulation to frequency regions neighboring the signal frequency can reduce detection thresholds for a pure-tone signal. Verhey and Ernst (2009) reported that irregular masker modulation conferred greater benefit than regular modulation when the masker was broadband, but that there was no difference when the masker was narrowband. The present study evaluated two possible explanations for this result: one based on modulation adaptation and the other based on the introduction of relatively long-duration modulation minima in the irregular masker modulation condition. The first experiment replicated the results of Verhey and Ernst (2009), but also included conditions in which a 12.5-ms signal was presented in a 12.5-ms modulation minimum, which was exempted from envelope jitter. The second experiment used a continuous masker and suspended jitter during epochs associated with either a 12.5- or 87.5-ms signal. No benefit of masker envelope irregularity before or after the signal was observed in either experiment. These findings are inconsistent with an explanation based on modulation adaptation, implicating instead the introduction of relatively long-duration modulation minima in the large masking release obtained for a long-duration signal in an irregularly modulated masker.	\N	\N
23123219	The current study was undertaken to investigate changes in forward masking patterns using on-frequency and off-frequency maskers of 7 and 10 kHz probes in the Sprague-Dawley rat. Off-frequency forward masking growth functions have been shown in humans to be non-linear, while on-frequency functions behave linearly. The non-linear nature of the off-frequency functions is attributable to active processing from the outer hair cells, and was therefore expected to be sensitive to noise-induced cochlear damage. For the study, nine Sprague-Dawley rats' auditory brainstem responses (ABRs) were recorded with and without forward maskers. Forward masker-induced changes in latency and amplitude of the initial positive peak of the rats' auditory brainstem responses were assessed with both off-frequency and on-frequency maskers. The rats were then exposed to a noise designed to induce 20-40 dB of permanent threshold shift. Twenty-one days after the noise exposure, the forward masking growth functions were measured to assess noise-induced changes in the off-frequency and on-frequency forward masking patterns. Pre-exposure results showed compressive non-linear masking effects of the off-frequency conditions on both latency and amplitude of the auditory brainstem response. The noise rendered the off-frequency forward masking patterns more linear, consistent with human behavioral findings. On- and off-frequency forward masking growth functions were calculated, and they displayed patterns consistent with human behavioral functions, both prior to noise and after the noise exposure.	\N	\N
23132604	Common-coding theory posits that (1) perceiving an action activates the same representations of motor plans that are activated by actually performing that action, and (2) because of individual differences in the ways that actions are performed, observing recordings of one's own previous behavior activates motor plans to an even greater degree than does observing someone else's behavior. We hypothesized that if observing oneself activates motor plans to a greater degree than does observing others, and if these activated plans contribute to perception, then people should be able to lipread silent video clips of their own previous utterances more accurately than they can lipread video clips of other talkers. As predicted, two groups of participants were able to lipread video clips of themselves, recorded more than two weeks earlier, significantly more accurately than video clips of others. These results suggest that visual input activates speech motor activity that links to word representations in the mental lexicon.	\N	\N
23135616	The objective was to develop and evaluate a new sentence test, the Sentence Test with Adaptive Randomized Roving levels, intended to emulate everyday listening experience, using both normal-hearing (NH) and cochlear implant (CI) groups, examining practicality, learning, test-retest variability, and interlist variability. In experiment 1, each of 25 NH adults was tested using five lists, each comprising 30 sentences. One male and one female speaker each spoke 15 sentences. Ten sentences were presented at each of three presentation levels: 50, 65, and 80 dB SPL. The relative level of a speech-shaped noise was varied adaptively to estimate the speech reception threshold (SRT). Counterbalance for list order was achieved by staggering the allocation of lists to participants. To allow assessment of learning effects, no practice was given. The variability of mean SRTs across lists was small, but correction factors were derived for each list so that, after correction, all lists gave the same mean SRT. Test-retest variability was estimated by examining the corrected SRTs for each subject's five lists. In experiment 2, 25 CI users each received one test list after a small amount of practice. Experiment 3 examined the effect of speech rate using time-compressed speech, for age-matched NH and CI users. The mean SRT for the NH participants was approximately -6 dB and was similar for the male and female speakers. There was a small but significant improvement in SRTs between the first and later lists administered, but no further improvement for subsequent lists. On the basis of the variability of the corrected SRTs within each participant, a 2.2 dB difference in SRT is meaningful for comparisons using one test list per condition, for a single participant. The percentage of key words correct varied with presentation level over a 13% range, being best at 65 dB SPL. Only 40% of the CI group achieved an SRT lower than 20 dB for both speakers. 
There was large individual variability in the SRTs, and SRTs were higher for the female than for the male speaker. For the CI participants, the percentage of key words correct varied markedly with level, from 19% at the lowest level to 57% at the medium level. Time compression had a small effect for NH participants but a very large effect for CI participants. The Sentence Test with Adaptive Randomized Roving levels seems practical to administer and is reasonably sensitive. For NH participants, a 2.2 dB difference in SRT is meaningful for a single list per condition and a single participant. Although learning effects were small for NH participants, it seems prudent to provide some practice sentences when testing hearing-impaired or CI participants. The very large effect of time compression for the CI group has implications for live voice testing of children, because speech rate is only poorly controlled in such testing.	\N	\N
23144191	This is the first study on adults' physiological reactivity to infant cry sounds and the association with intended harsh parenting using salivary α-amylase (sAA) as a novel and noninvasive marker of autonomic nervous system activity. The sample consisted of 184 adult twin pairs. In an experimental design, cry sounds were presented and adults' perception and their intended caregiving responses were measured. Saliva samples were collected after each cry sound. For the majority of the sample, a decrease in sAA across the cry paradigm was observed. However, adults who indicated that they would respond in a harsh way to the crying infant were significantly less likely to show a decrease in sAA. Consistent with previous studies on physiological hyperreactivity in abusive parents, these findings suggest that failure to habituate to repeated infant crying may be one of the mediating mechanisms through which excessive, inconsolable, and high-pitched infant crying triggers less optimal caregiving.	\N	\N
23145600	Subjective evaluation of acoustics was studied by recording nine concert halls with a simulated symphony orchestra on a seat 12 m from the orchestra. The recorded music was spatially reproduced for subjective listening tests and individual vocabulary profiling. In addition, the preferences of the assessors and objective parameters were gathered. The results show that concert halls were discriminated using perceptual characteristics, such as Envelopment/Loudness, Reverberance, Bassiness, Proximity, Definition, and Clarity. With these perceptual dimensions the preference ratings can be explained. Seventeen assessors were divided into two groups based on their preferences. The first group preferred concert halls with relatively intimate sound, in which it is quite easy to hear individual instruments and melody lines. In contrast, the second group preferred a louder and more reverberant sound with good envelopment and strong bass. Even though all halls were recorded exactly at the same distance, the preference is best explained with subjective Proximity and with Bassiness, Envelopment, and Loudness to some extent. Neither the preferences nor the subjective ratings could be fully explained by objective parameters (ISO3382-1:2009), although some correlations were found.	\N	\N
23151776	To examine and compare the family environment of preschool- and school-age children with cochlear implants and assess its influence on children's executive function and spoken language skills. Retrospective between-subjects design. Outpatient research laboratory. Prelingually deaf children with cochlear implants and no additional disabilities and their families. Cochlear implantation and speech-language therapy. Parents completed the Family Environment Scale and the Behavior Rating Inventory of Executive Function (or the preschool version). Children were tested using the Peabody Picture Vocabulary Test-4 and either the Preschool Language Scales-4 or the Clinical Evaluation of Language Fundamentals-4. The family environments of children with cochlear implants differed from normative data obtained from hearing children, but average scores were within 1 standard deviation of norms on all subscales. Families of school-age children reported higher levels of control than those of preschool-age children. Preschool-age children had fewer problems with emotional control when families reported higher levels of support and lower levels of conflict. School-age children had fewer problems with inhibition but more problems with shifting of attention when families reported lower levels of conflict. School-age children's receptive vocabularies were enhanced by families with lower levels of control and higher levels of organization. Family environment and its relation to language skills and executive function development differed across the age groups in this sample of children with cochlear implants. Because family dynamics is one developmental/environmental factor that can be altered with therapy and education, the present results have important clinical implications for family-based interventions for deaf children with cochlear implants.	\N	\N
23151778	To present the preliminary results of a new malleus replacement prosthesis combined with a total ossicular prosthesis in middle ear reconstruction in patients missing the malleus and stapes. Prospective experimental and nonrandomized clinical study. Tertiary referral center. An original titanium malleus replacement prosthesis (MRP) was designed to be inserted into the external auditory canal and to replace a missing malleus for various middle ear pathologies. The MRP was tested experimentally and clinically. The vibratory properties of the new prosthesis were measured using laser Doppler vibrometry. Ninety patients with missing malleus and stapes, undergoing 92 ossicular reconstructions, were enrolled in this study from September 1994 to March 2012. Comparative analyses were made between a group of 34 cases of ossicular reconstructions with total prosthesis (TORP) positioned from the tympanic membrane to the stapes footplate (TM-to-footplate assembly) and a group of 58 cases of ossicular reconstructions with TORP positioned from a newly designed malleus replacement prosthesis (MRP) to the stapes footplate (MRP-to-footplate assembly). Preoperative and postoperative audiometric evaluations used conventional audiometry, that is, air-bone gap (ABG), bone-conduction thresholds (BC), and air-conduction thresholds (AC). Experimentally, the vibratory properties of the MRP are promising and remain very good even when the MRP is cemented into the bony canal wall, mimicking its complete osseous integration, if this were to occur. 
This finding supports the short-term clinical results: in the TM-to-footplate group, the 3-month postoperative mean ABG was 23.3 dB compared with 12.5 dB in the MRP-to-footplate group (difference, 10.8; 95% confidence interval, 4.0-17.6); 37.0% of patients from the TM-to-footplate group had a postoperative ABG of 10 dB or less, and 48.1% of patients had a postoperative ABG of 20 dB or less, as compared with 58.1% and 79.1%, respectively, in the MRP-to-footplate group. The average gain in AC was 11.0 dB in the TM-to-footplate group as compared with 21.3 dB in the MRP-to-footplate group (difference, -10.3; 95% confidence interval, -18.2 to -2.4). The results of this study indicate that superior postoperative hearing thresholds could be achieved using an MRP-to-footplate assembly, compared with a TM-to-footplate assembly, in patients with an absent malleus undergoing ossiculoplasty. The postoperative AC thresholds, after 3 months and 1 year, are significantly lower in patients treated with the MRP-to-footplate assembly.	\N	\N
23156899	The study presented in this paper aimed to investigate the pattern of semantic priming effects, under masked and unmasked conditions, in the lexical decision task, manipulating type of semantic relation and associative strength. Three different kinds of word relations were examined in two experiments: only-semantically related words [e.g., codo (elbow)-rodilla (knee)] and semantic/associative related words with strong [e.g., mesa (table)-silla (chair)] and weak association strength [e.g., sapo (toad)-rana (frog)]. In Experiment 1 a masked priming procedure was used with a prime duration of 56 ms, and in Experiment 2, the prime was presented unmasked for 150 ms. The results showed that there were masked priming effects with strong associates, but no evidence of these effects was found with weak associates or only-semantic related word pairs. When the prime was presented unmasked, the three types of relations produced significant priming effects and they were not influenced by association strength.	\N	\N
23165224	This study describes a vocoder-based frequency-lowering system that enhances spectral cues for nonsonorant consonants differing in place of articulation. The goal of this study was to evaluate the efficacy of this system for speech recognition by hearing-impaired listeners. Experiment 1 evaluated fricative consonant recognition in quiet. Eight fricatives in /VCV/ context were used. Experiment 2 evaluated consonant recognition in quiet with 22 consonants. Six listeners with steeply sloping high-frequency sensorineural hearing loss participated in experiment 1. The same six listeners and three additional listeners with flat/mid-frequency sensorineural hearing loss participated in experiment 2. Two processing conditions (frequency lowering and conventional amplification) were tested in each experiment. Insertion gains based on the NAL-RP formula were provided up to 8000 Hz for each processing condition. In addition, speech stimuli were low-pass (LP) filtered at 1000, 1500, and 2000 Hz to evaluate the effect of lack of high-frequency speech information on consonant perception with and without frequency lowering. For these LP speech conditions, amplification was provided up to the cutoff frequencies. Overall percent correct and percent information transmission were calculated for each processing and speech condition. The frequency-lowering system provided significant benefit for the perception of fricative consonants and perception of the place-of-articulation feature for hearing-impaired listeners without affecting their perception of sonorant consonants and other consonant features (i.e., voicing and nasality). The improvement of fricative consonant perception was observed for both wideband and LP speech conditions for the steeply sloping hearing-loss listeners. 
The results indicate that individuals with unaidable hearing loss above 1000 to 2000 Hz would receive significant benefit with the system compared with conventional amplification for the perception of fricative consonants, and more importantly, significant benefit for the perception of place of articulation.	\N	\N
23165382	To objectively determine changes in sensorineural hearing in children with mucopolysaccharidosis (MPS) by comparing audiological data before and after hematopoietic stem cell transplantation (HSCT). Retrospective medical chart analysis. Tertiary referral hospital. Thirty pediatric patients with the diagnosis of MPS who underwent HSCT and had audiological data before and after HSCT. Data were extracted from medical charts for patients seen at our institution from January 1, 1999, to December 1, 2009. Hearing was assessed using behavioral audiometry testing and auditory brainstem responses (ABR) before and after HSCT. Patient demographics, diagnosis, and age at HSCT were also evaluated. Thirty patients with MPS were included. Four (13%) had MPS type 3a, 2 (7%) had MPS type 2, and 24 (80%) had MPS type 1. The average age at HSCT was 19 months (range, 5-44 months). Hearing improvement was evaluated by audiogram (20 patients), ABR (8 patients), and qualitative measures (30 patients). On average, patients did not show improvement on audiogram (P = .28; paired t test). The ABR click threshold improved 19 dB on average (P < .001). Qualitatively, 3 patients had normal hearing before and after HSCT. Of the remaining 27 patients, 20 (67%) showed improvement in sensorineural hearing (P < .001). Five (17%) had hearing loss and did not improve. Two (7%) had worsening hearing. Hematopoietic stem cell transplantation at the age of 25 months or younger was significantly correlated with hearing improvement (P = .03). Hematopoietic stem cell transplantation may provide improvement in MPS-associated sensorineural hearing loss. Hearing improvement is more likely to occur in patients who undergo transplantation at 25 months or younger.	\N	\N
23166292	Functional magnetic resonance imaging (fMRI) was employed to identify neural regions engaged during the encoding of contextual features belonging to different modalities. Subjects studied objects that were presented to the left or right of fixation. Each object was paired with its name, spoken in either a male or a female voice. The test requirement was to discriminate studied from unstudied pictures and, for each picture judged old, to retrieve its study location and the gender of the voice that spoke its name. Study trials associated with accurate rather than inaccurate location memory demonstrated enhanced activity in the fusiform and parahippocampal cortex and the hippocampus and reduced activity (a negative subsequent memory effect) in the medial occipital cortex. Successful encoding of voice information was associated with enhanced study activity in the right middle superior temporal sulcus and activity reduction in the right superior frontal cortex. These findings support the proposal that encoding of a contextual feature is associated with enhanced activity in regions engaged during its online processing. In addition, they indicate that negative subsequent memory effects can also demonstrate feature-selectivity. Relative to other classes of study trials, trials for which both contextual features were later retrieved demonstrated enhanced activity in the lateral occipital complex and reduced activity in the temporo-parietal junction. These findings suggest that multifeatural encoding was facilitated when the study item was processed efficiently and study processing was not interrupted by redirection of attention toward extraneous events.	\N	\N
23167712	Despite its fundamental relevance for representing the emotional world surrounding us, human affective neuroscience research has widely neglected the auditory system, at least in comparison to the visual domain. Here, we have investigated the spatiotemporal dynamics of human affective auditory processing using time-sensitive whole-head magnetoencephalography. A novel and highly challenging affective associative learning procedure, 'MultiCS conditioning', involving multiple conditioned stimuli (CS) per affective category, was adopted to test whether previous findings from intramodal conditioning of multiple click-tones with an equal number of auditory emotional scenes (Bröckelmann et al., 2011 J. Neurosci., 31, 7801) would generalise to crossmodal conditioning of multiple click-tones with an electric shock as single aversive somatosensory unconditioned stimulus (UCS). Event-related magnetic fields were recorded in response to 40 click-tones before and after four contingent pairings of 20 CS with a shock and the other half remaining unpaired. In line with previous findings from intramodal MultiCS conditioning we found an affect-specific modulation of the auditory N1m component 100-150 ms post-stimulus within a distributed frontal-temporal-parietal neural network. Increased activation for shock-associated tones was lateralised to right-hemispheric regions, whereas unpaired safety-signalling tones were preferentially processed in the left hemisphere. Participants did not show explicit awareness of the contingent CS-UCS relationship, yet behavioural conditioning effects were indicated on an indirect measure of stimulus valence. Our findings imply converging evidence for a rapid and highly differentiating affect-specific modulation of the auditory N1m after intramodal as well as crossmodal MultiCS conditioning and a correspondence of the modulating impact of emotional attention on early affective processing in vision and audition.	\N	\N
23168357	We investigated gender differences in the identification of personally familiar voices in a gender-balanced sample of 40 listeners. From various types of utterances, listeners had to identify by name 20 speakers (10 female) among a set of 70 possible classmates who were all 12th grade pupils from the same local secondary school. Mean identification rates were 67% from sentences, and around 35% for an isolated /Hello/ or a VCV syllable. Even from non-verbal harrumphs, speakers were identified with an accuracy of 18%, i.e. highly above chance levels. Substantial individual differences were observed between listeners. Importantly, superior overall performance of female listeners was qualified by an interaction between voice gender and listener gender. Male listeners exhibited an own-gender bias (i.e. better identification for male than female voices), whereas female listeners identified voices of both genders at similar levels. Individual own-gender identification biases were correlated with differences in reported contact to a speaker's voice and voice distinctiveness. Overall, the present study establishes a number of factors that account for substantial individual differences in personal voice identification.	\N	\N
23169196	Patients with single-sided deafness (SSD), where one ear has an unaidable hearing loss and the other ear has normal or aidable hearing, often complain of difficulties understanding speech and localizing sound sources, and report a higher self-perceived hearing disability. Patients with SSD may benefit from using contralateral routing of signal (CROS) or bilateral contralateral routing of the signal (BiCROS) amplification. Dissatisfaction with previously available (Bi)CROS devices has been reported, such as interfering transmissions, low-fidelity sound quality, poor "user-friendly" set-up, and a bulky and cosmetically cumbersome appearance. Recent advances in hearing aid technology have improved (Bi)CROS hearing aids; however, these devices have not been experimentally evaluated. We hypothesized that newer technology with reports of improved digital signal processing, wireless transmission, and physical design would be as good as, or better than, our participants' previous-generation BiCROS systems. A within-subjects, pretest-posttest design was executed. Thirty-nine veterans (one female, 38 males; mean age = 74 yr, range = 49-85 yr) from the Audiology Section of the Bay Pines Veterans Affairs Healthcare System participated. All participants were previously experienced BiCROS hearing aid users with varying degrees of sensorineural hearing impairment in their better ear. Participants were provided at least 4 wk of consistent use with the new BiCROS. DATA COLLECTION AND ANALYSES: Participants completed three research visits. At Visit 1, with their previous BiCROS, and at Visit 3, with their new BiCROS, the following objective and subjective measures were obtained: (1) soundfield speech-in-noise testing using the Words-In-Noise (WIN) test; (2) speech, spatial, and qualities of the hearing scale (SSQ) questionnaire; (3) selected questions from the MarkeTrak questionnaire; and (4) three open-ended questions. 
Data were analyzed using parametric and nonparametric statistics. Overall, the objective (WIN) and subjective (SSQ, MarkeTrak, and open-ended questions) measures indicated that the new BiCROS provided better outcomes than the previous BiCROS system. In addition, an overlap of favorable results was seen across measures. Of the 39 participants, 95% reported improvements with the new BiCROS and chose to utilize the device regularly. The favorable objective and subjective outcomes indicate that the new BiCROS system is as good as, or better than, what was previously utilized by our sample of veterans.	\N	\N
23174416	Speech contains a variety of acoustic cues to auditory and phonetic contrasts that are exploited by the listener in decoding the acoustic signal. In three experiments, we tried to elucidate whether listeners rely on formant peak frequencies or whole spectrum attributes in vowel discrimination. We created two vowel continua in which the acoustic distance in formant frequencies was constant but the continua differed in spectral moments (i.e., the whole spectrum modeled as a probability density function). In Experiment 1, we measured reaction times and response accuracy while listeners performed a go/no-go discrimination task. The results indicated that the performance of the listeners was based on the spectral moments (especially the first and second moments), and not on formant peaks. Behavioral results in Experiment 2 showed that, when the stimuli were presented in noise eliminating differences in spectral moments between the two continua, listeners employed formant peak frequencies. In Experiment 3, using the same listeners and stimuli as in Experiment 1, we measured an automatic brain potential, the mismatch negativity (MMN), when listeners did not attend to the auditory stimuli. Results showed that the MMN reflects sensitivity only to the formant structure of the vowels. We suggest that the auditory cortex automatically and pre-attentively encodes formant peak frequencies, whereas attention can be deployed for processing additional spectral information, such as spectral moments, to enhance vowel discrimination.	\N	\N
23178211	Blind people may compensate for their visual loss by the increased use of auditory spatial information, thus showing normal or even supra-normal ability to localize sources of sound. However, the problem of how blind persons develop and maintain an internal concept of the topography of the auditory space in the absence of calibration by visual information is still unsolved. The present study demonstrated a substantial superiority of blind subjects in perception of auditory motion: The minimum audible movement angle of blind subjects (mean 3°) was about half the value found in matched sighted controls, whereas no such advantage was demonstrable for localization of stationary sound. There were no significant differences between early or congenitally blind subjects and late blind subjects, suggesting that long-term visual deprivation per se, independently of the point in time of its onset, was relevant for the superiority in auditory motion perception. The results were compatible with the hypothesis that in the absence of visual input the calibration of the auditory space is performed by audiomotor feedback, that is, by the evaluation of systematic changes of auditory spatial cues resulting from head and body movements. It is reasonable to assume that with blindness the neuronal circuits specifically concerned with the analysis of auditory motion are more intensely trained than in sighted people. It seems possible that the higher demand of motion analysis associated with blindness is related to processes of reorganization in the brain, as have been previously reported to occur also in areas known to be involved in auditory and/or visual motion analysis in sighted persons.	\N	\N
23231122	Measures of spectral ripple resolution have become widely used psychophysical tools for assessing spectral resolution in cochlear-implant (CI) listeners. The objective of this study was to compare spectral ripple discrimination and detection in the same group of CI listeners. Ripple detection thresholds were measured over a range of ripple frequencies and were compared to spectral ripple discrimination thresholds previously obtained from the same CI listeners. The data showed that performance on the two measures was correlated, but that individual subjects' thresholds (at a constant spectral modulation depth) for the two tasks were not equivalent. In addition, spectral ripple detection was often found to be possible at higher rates than expected based on the available spectral cues, making it likely that temporal-envelope cues played a role at higher ripple rates. Finally, spectral ripple detection thresholds were compared to previously obtained speech-perception measures. Results confirmed earlier reports of a robust relationship between detection of widely spaced ripples and measures of speech recognition. In contrast, intensity difference limens for broadband noise did not correlate with spectral ripple detection measures, suggesting a dissociation between the ability to detect small changes in intensity across frequency and across time.	\N	\N
23231815	Normal temporal processing is important for the perception of speech in quiet and in difficult listening situations. Temporal resolution is commonly measured using a behavioral gap detection task, where the patient or subject must participate in the evaluation process. This is difficult to achieve with subjects who cannot reliably complete a behavioral test. However, recent research has investigated the use of evoked potential measures to evaluate gap detection. The purpose of the current study was to record N1-P2 responses to gaps in broadband noise in normal hearing young adults. Comparisons were made of the N1 and P2 latencies, amplitudes, and morphology to different length gaps in noise in an effort to quantify the changing responses of the brain to these stimuli. It was the goal of this study to show that electrophysiological recordings can be used to evaluate temporal resolution and measure the influence of short and long gaps on the N1-P2 waveform. This study used a repeated-measures design. All subjects completed a behavioral gap detection procedure to establish their behavioral gap detection threshold (BGDT). N1-P2 waveforms were recorded to the gap in a broadband noise. Gap durations were 20 msec, 2 msec above their BGDT, and 2 msec. These durations were chosen to represent a suprathreshold gap, a near-threshold gap, and a subthreshold gap. Fifteen normal-hearing young adult females were evaluated. Subjects were recruited from the local university community. Latencies and amplitudes for N1 and P2 were compared across gap durations for all subjects using a repeated-measures analysis of variance. A qualitative description of responses was also included. Most subjects did not display an N1-P2 response to a 2 msec gap, but all subjects had clear evoked potential responses to the 20 msec and 2+ msec gaps. Decreasing gap duration toward threshold resulted in decreasing waveform amplitude. 
However, N1 and P2 latencies remained stable as gap duration changed. N1-P2 waveforms can be elicited by gaps in noise in young normal-hearing adults. The responses are present as low as 2 msec above behavioral gap detection thresholds (BGDT). Gaps that are below BGDT do not generally evoke an electrophysiological response. These findings indicate that when a waveform is present, the gap duration is likely above their BGDT. Waveform amplitude is also a good index of gap detection, since amplitude decreases with decreasing gap duration. Future studies in this area will focus on various age groups and individuals with auditory disorders.	\N	\N
23231816	Most cochlear implant (CI) users describe music as a noise-like and unpleasant sound. Using behavioral tests, most prior studies have shown that perception of pitch-based melody and timbre is poor in CI users. This article will focus on cortical encoding of timbre changes in CI users, which may allow us to find solutions to further improve CI benefits. Furthermore, the value of using objective measures to reveal neural encoding of timbre changes may be reflected in this study. A case-control study of the mismatch negativity (MMN) using electrophysiological technique was conducted. To derive MMNs, three randomly arranged oddball paradigms consisting of standard/deviant instrumental pairs: saxophone/piano, cello/trombone, and flute/French horn, respectively, were presented. Ten CI users and ten normal-hearing (NH) listeners participated in this study. After filtering, epoching, and baseline correction, independent component analysis (ICA) was performed to remove artifacts. The averaged waveforms in response to the standard stimuli (STANDARD waveform) and the deviant stimuli (DEVIANT waveform) in each condition were separately derived. The responses from nine electrodes in the fronto-central area were averaged to form one waveform. The STANDARD waveform was subtracted from the DEVIANT waveform to derive the difference waveform, for which the MMN was judged to be present or absent. The measures used to evaluate the MMN included the MMN peak latency and amplitude as well as MMN duration. The MMN, which reflects the ability to automatically detect acoustic changes, was present in all NH listeners but only approximately half of CI users. In CI users with present MMNs, the MMN peak amplitude and duration were significantly smaller and shorter compared to those in NH listeners. Our electrophysiological results were consistent with prior behavioral results that CI users' performance in timbre perception was significantly poorer than that in NH listeners. 
Our results may suggest that timbre information is poorly registered in the auditory cortex of CI users and the capability of automatic detection of timbre changes is degraded in CI users. Although there are some limitations of the MMN in CI users, along with other objective auditory evoked potential tools, the MMN may be a useful objective tool to indicate the extent of sound registration in auditory cortex in the future efforts of improving CI design and speech strategy.	\N	\N
23237416	Children with phonological impairment (PI) often have difficulties perceiving insufficiencies in their own speech. The use of recordings has been suggested as a way of directing the child's attention toward his/her own speech, despite a lack of evidence that children actually recognize their recorded voice as their own. We present two studies of children's self-voice identification, one exploring developmental aspects, and one exploring potential effects of having a PI. The results indicate that children from 4 to 8 years recognize their recorded voice well (around 80% accuracy), regardless of whether they have a PI or not. A subtle change in this ability from 4 to 8 years is observed that could be linked to a development in short-term memory. Clinically, one can indeed expect an advantage of using recordings in therapy; this could constitute an intermediate step toward the more challenging task of online self-monitoring.	\N	\N
23238175	To investigate the relationship between the threshold and the interaural amplitude difference ratio (IADR) in cervical vestibular evoked myogenic potential (cVEMP) testing and pursue the clinical significance of the parameters. cVEMP responses were recorded while the SCM contraction was controlled using a pressure cuff. The intensity of the sound stimulation was decreased from 95 dB nHL in 5 dB steps until no responses were evoked. Thresholds, interaural threshold difference (ITD), amplitudes, and interaural amplitude difference ratio at the stimulation of 95 dB nHL were calculated and the relationship between them was examined. All subjects showed cVEMP responses bilaterally. Measured thresholds were 76 dB nHL overall, and most (92%) ears showed an ITD of 0 or 5 dB. The amplitudes of cVEMP responses showed a positive correlation with the sound intensities, and more specifically with the sound intensity above each threshold value. There was no significant difference in IADR values by the ITD. Based on our study, the ITD is less than 10 dB in most normal subjects and estimation of threshold should be added to cVEMP testing for probing vestibular asymmetry. Getting a threshold might be helpful in determining whether the abnormal interaural amplitude difference ratio is related to the abnormal ITD.	\N	\N
23241212	Coloured-hearing (CH) synesthesia is a perceptual phenomenon in which an acoustic stimulus (the inducer) initiates a concurrent colour perception (the concurrent). Individuals with CH synesthesia "see" colours when hearing tones, words, or music; this specific phenomenon suggests a close relationship between auditory and visual representations. To date, it is still unknown whether the perception of colours is associated with a modulation of brain functions in the inducing brain area, namely in the auditory-related cortex and associated brain areas. In addition, there is an on-going debate as to whether attention to the inducer is necessarily required for eliciting a visual concurrent, or whether the latter can emerge in a pre-attentive fashion. By using the EEG technique in the context of a pre-attentive mismatch negativity (MMN) paradigm, we show that the binding of tones and colours in CH synesthetes is associated with increased MMN amplitudes in response to deviant tones supposed to induce novel concurrent colour perceptions. Most notably, the increased MMN amplitudes we revealed in the CH synesthetes were associated with stronger intracerebral current densities originating from the auditory cortex, parietal cortex, and ventral visual areas. The automatic binding of tones and colours in CH synesthetes is accompanied by an early pre-attentive process recruiting the auditory cortex, inferior and superior parietal lobules, as well as ventral occipital areas.	\N	\N
23249352	Psychophysical experiments show that auditory change detection can be disturbed in situations in which listeners have to monitor complex auditory input. We made use of this change deafness effect to segregate the neural correlates of physical change in auditory input from brain responses related to conscious change perception in an fMRI experiment. Participants listened to two successively presented complex auditory scenes, which consisted of six auditory streams, and had to decide whether scenes were identical or whether the frequency of one stream was changed between presentations. Our results show that physical changes in auditory input, independent of successful change detection, are represented at the level of auditory cortex. Activations related to conscious change perception, independent of physical change, were found in the insula and the ACC. Moreover, our data provide evidence for significant effective connectivity between auditory cortex and the insula in the case of correctly detected auditory changes, but not for missed changes. This underlines the importance of the insula/anterior cingulate network for conscious change detection.	\N	\N
23258317	Steady state responses (SSRs), between 75 and 110 Hz, evoked by auditory amplitude modulated single or multiple tone stimuli, may be used to estimate objective hearing threshold. The aim of this study was to compare SSRs and click-evoked auditory brainstem responses (ABRs) in both ears of 20 adults (10 males and 10 females, aged between 24 and 36 years) with normal hearing threshold. Mean ABR threshold was found at 21.25 (± 5.9) dB nHL. Mean SSR threshold was found at 15.6 (± 9.6) dB nHL after a single frequency stimulus (1 kHz); at 10.5 (± 18.2) dB nHL and at 7.1 (± 12.4) dB nHL after bifrequency stimulation (0.5 and 2 kHz). SSR thresholds after multifrequency stimulation (0.5, 1, 2 and 4 kHz) were found, respectively, at 12.1 (± 12.9) dB nHL, 12.2 (± 12.8) dB nHL, 12.3 (± 8.3) dB nHL and 18.9 (± 17.2) dB nHL. Mean duration of the recording session was 6 min in the case of ABRs, while it was 25 min in the single frequency condition and 29 min in the multifrequency condition in the case of SSRs. SSRs can be used for frequency-specific objective audiometry. The multifrequency stimulation greatly reduces the whole testing time.	\N	\N
23258616	Modern digital hearing aids have provided improved fidelity over those of earlier decades for speech. The same, however, cannot be said for music. Most modern hearing aids have a limitation of their "front end," which comprises the analog-to-digital (A/D) converter. For a number of reasons, the spectral nature of music as an input to a hearing aid is beyond the optimal operating conditions of the "front end" components. Amplified music tends to be of rather poor fidelity. Once the music signal is distorted, no amount of software manipulation that occurs later in the circuitry can improve things. The solution is not a software issue. Some characteristics of music that make it difficult to be transduced without significant distortion include an increased sound level relative to that of speech, and the crest factor, the difference in dB between the instantaneous peak of a signal and its RMS value. Clinical strategies and technical innovations have helped to improve the fidelity of amplified music and these include a reduction of the level of the input that is presented to the A/D converter.	\N	\N
23263015	Anesthesiology requires performing visually oriented procedures while monitoring auditory information about a patient's vital signs. A concern in operating room environments is the amount of competing information and the effects that divided attention has on patient monitoring, such as detecting auditory changes in arterial oxygen saturation via pulse oximetry. The authors measured the impact of visual attentional load and auditory background noise on the ability of anesthesia residents to monitor the pulse oximeter auditory display in a laboratory setting. Accuracies and response times were recorded reflecting anesthesiologists' abilities to detect changes in oxygen saturation across three levels of visual attention in quiet and with noise. Results show that visual attentional load substantially affects the ability to detect changes in oxygen saturation concentrations conveyed by auditory cues signaling 99 and 98% saturation. These effects are compounded by auditory noise, up to a 17% decline in performance. These deficits are seen in the ability to accurately detect a change in oxygen saturation and in speed of response. Most anesthesia accidents are initiated by small errors that cascade into serious events. Lack of monitor vigilance and inattention are two of the more commonly cited factors. Reducing such errors is thus a priority for improving patient safety. Specifically, efforts to reduce distractors and decrease background noise should be considered during induction and emergence, periods of especially high risk, when anesthesiologists have to attend to many tasks and are thus susceptible to error.	\N	\N
23268357	Fine structure in the frequency response of distortion product otoacoustic emissions (DPOAEs) can severely limit the usefulness of DPOAEs in estimating auditory thresholds. Here, fine structure is removed by extracting the primary-source DPOAE component using the onset-decomposition technique (Vetešník et al., 2009) and auditory threshold estimates are compared to those obtained from DPOAEs in response to conventional, continuous two-tone stimulation. Auditory thresholds are predicted using the estimated distortion product thresholds (EDPTs), obtained from linear regression of input-output (I/O) functions of DPOAE pressure amplitude versus second-tone stimulus level (Boege and Janssen, 2002). The accuracy of the auditory-threshold predictions is derived by comparison with measured auditory thresholds. The parameters of the two primary stimulus tones of frequency f(1) and f(2) and levels of L(1) and L(2) are chosen as: f(2)/f(1) = 1.2 with 1.5 ≤ f(2) ≤ 2.5 kHz, and L(1) = 0.4L(2) + 39 dB SPL, with 25 ≤ L(2) ≤ 65 dB SPL. Data are from 12 normal-hearing subjects with profound DPOAE fine structure. 255 DPOAE I/O functions were measured for each of the two DPOAE paradigms. An EDPT value was accepted as reliable if: 1) the squared correlation coefficient, r(2) ≥ 0.8, 2) the regression slope, s(I/O) ≥ 0.2 μPa/dB, and 3) the standard deviation of the EDPT, σ(EDPT) ≤ 10 dB. The proportion of rejected I/O functions was 8% for onset-decomposition DPOAEs, and 25% for continuous-tone DPOAEs. Removal of data points from the saturation region of the DPOAE I/O function by an automated algorithm reduced the rejection rate, to zero for onset-decomposition DPOAEs, but to only 13% for continuous-tone DPOAEs. In the absence of saturated DPOAE responses, auditory thresholds were predicted with standard deviation of only 4 dB for onset-decomposition DPOAEs, but 12 dB for continuous-tone DPOAEs. In summary, by extracting the primary-source component of the DPOAE by the method of onset-decomposition it is possible to predict human auditory threshold with hitherto unattainable accuracy.	\N	\N
23268783	According to predictive coding models of sensory processing, stimulus expectations have a profound effect on sensory cortical responses. This was supported by experimental results, showing that fMRI repetition suppression (fMRI RS) for face stimuli is strongly modulated by the probability of stimulus repetitions throughout the visual cortical processing hierarchy. To test whether processing of voices is also affected by stimulus expectations, here we investigated the effect of repetition probability on fMRI RS in voice-selective cortical areas. Changing ('alt') and identical ('rep') voice stimulus pairs were presented to the listeners in blocks, with a varying probability of alt and rep trials across blocks. We found auditory fMRI RS in the nonprimary voice-selective cortical regions, including the bilateral posterior STS, the right anterior STG and the right IFC, as well as in the IPL. Importantly, fMRI RS effects in all of these areas were strongly modulated by the probability of stimulus repetition: auditory fMRI RS was reduced or not present in blocks with low repetition probability. Our results revealed that auditory fMRI RS in higher-level voice-selective cortical regions is modulated by repetition probabilities and thus suggest that in audition, similarly to the visual modality, processing of sensory information is shaped by stimulus expectation processes.	\N	\N
23274182	Speech recognition is improved when complementary visual information is available, especially under noisy acoustic conditions. Functional neuroimaging studies have suggested that the superior temporal sulcus (STS) plays an important role for this improvement. The spectrotemporal dynamics underlying audiovisual speech processing in the STS, and how these dynamics are affected by auditory noise, are not well understood. Using electroencephalography, we investigated how auditory noise affects audiovisual speech processing in event-related potentials (ERPs) and oscillatory activity. Spoken syllables were presented in audiovisual (AV) and auditory only (A) trials at three different auditory noise levels (no, low, and high). Responses to A stimuli were subtracted from responses to AV stimuli, separately for each noise level, and these responses were subjected to the statistical analysis. Central ERPs differed between the no noise and the two noise conditions from 130 to 150 ms and 170 to 210 ms after auditory stimulus onset. Source localization using the local autoregressive average procedure revealed an involvement of the lateral temporal lobe, encompassing the superior and middle temporal gyrus. Neuronal activity in the beta-band (16 to 32 Hz) was suppressed at central channels around 100 to 400 ms after auditory stimulus onset in the averaged AV minus A signal over the three noise levels. This suppression was smaller in the high noise compared to the no noise and low noise condition, possibly reflecting disturbed recognition or altered processing of multisensory speech stimuli. Source analysis of the beta-band effect using linear beamforming demonstrated an involvement of the STS. Our study shows that auditory noise alters audiovisual speech processing in ERPs localized to lateral temporal lobe and provides evidence that beta-band activity in the STS plays a role for audiovisual speech processing under regular and noisy acoustic conditions.	\N	\N
23275424	Fitting a cochlear implant (CI) for optimal speech perception does not necessarily optimize listening effort. This study aimed to show that listening effort may change between CI processing conditions for which speech intelligibility remains constant. Nineteen normal-hearing participants listened to CI simulations with varying numbers of spectral channels. A dual-task paradigm combining an intelligibility task with either a linguistic or nonlinguistic visual response-time (RT) task measured intelligibility and listening effort. The simultaneously performed tasks compete for limited cognitive resources; changes in effort associated with the intelligibility task are reflected in changes in RT on the visual task. A separate self-report scale provided a subjective measure of listening effort. All measures showed significant improvements with increasing spectral resolution up to 6 channels. However, only the RT measure of listening effort continued improving up to 8 channels. The effects were stronger for RTs recorded during listening than for RTs recorded between listening. The results suggest that listening effort decreases with increased spectral resolution. Moreover, these improvements are best reflected in objective measures of listening effort, such as RTs on a secondary task, rather than intelligibility scores or subjective effort measures.	\N	\N
23290461	The white matter bundles that underlie comprehension and production of language have been investigated for a number of years. Several studies have examined which fiber bundles (or tracts) are involved in auditory language processing, and which kind of language information is transmitted by which fiber tract. However, there is much debate about exactly which fiber tracts are involved, their precise course in the brain, how they should be named, and which functions they fulfill. Therefore, the present article reviews the available language-related literature, and educes a neurocognitive model of the pathways for auditory language processing. Besides providing an overview of the current methods used for relating fiber anatomy to function, this article details the precise anatomy of the fiber tracts and their roles in phonological, semantic and syntactic processing, articulation, and repetition.	\N	\N
23297922	Previous studies have suggested that cochlear implant users may have particular difficulties exploiting opportunities to glimpse clear segments of a target speech signal in the presence of a fluctuating masker. Although it has been proposed that this difficulty is associated with a deficit in linking the glimpsed segments across time, the details of this mechanism are yet to be explained. The present study introduces a method called Zebra-speech developed to investigate the relative contribution of simultaneous and sequential segregation mechanisms in concurrent speech perception, using a noise-band vocoder to simulate cochlear implants. One experiment showed that the saliency of the difference between the target and the masker is a key factor for Zebra-speech perception, as it is for sequential segregation. Furthermore, forward masking played little or no role, confirming that intelligibility was not limited by energetic masking but by across-time linkage abilities. In another experiment, a binaural cue was used to distinguish the target and the masker. It showed that the relative contribution of simultaneous and sequential segregation depended on the spectral resolution, with listeners relying more on sequential segregation when the spectral resolution was reduced. The potential of Zebra-speech as a segregation enhancement strategy for cochlear implants is discussed.	\N	\N
23298012	The present study explored the acoustic characteristics of prosodic cues that indicate a speaker's reluctance when giving permission or agreement using a single word ("okay"). Eight speakers (four male, four female) produced the recorded materials that were subsequently validated through a listening experiment using 12 normal-hearing listeners. Acoustic analyses revealed that significantly longer word duration was the cue used most consistently across speakers to communicate reluctance. Voice quality, fundamental voice frequency, and intensity cues also differed significantly between the two prosodic conditions, but the manner in which these cues were applied varied greatly across speakers.	\N	\N
23301004	For humans and animals, the ability to discriminate speech and conspecific vocalizations is an important physiological assignment of the auditory system. To reveal the underlying neural mechanism, many electrophysiological studies have investigated the neural responses of the auditory cortex to conspecific vocalizations in monkeys. The data suggest that vocalizations may be hierarchically processed along an anterior/ventral stream from the primary auditory cortex (A1) to the ventral prefrontal cortex. To date, the organization of vocalization processing has not been well investigated in the auditory cortex of other mammals. In this study, we examined the spike activities of single neurons in two early auditory cortical regions with different anteroposterior locations: anterior auditory field (AAF) and posterior auditory field (PAF) in awake cats, as the animals were passively listening to forward and backward conspecific calls (meows) and human vowels. We found that the neural response patterns in PAF were more complex and had longer latency than those in AAF. The selectivity for different vocalizations based on the mean firing rate was low in both AAF and PAF, and not significantly different between them; however, more vocalization information was transmitted when the temporal response profiles were considered, and the maximum transmitted information by PAF neurons was higher than that by AAF neurons. Discrimination accuracy based on the activities of an ensemble of PAF neurons was also better than that of AAF neurons. Our results suggest that AAF and PAF are similar with regard to which vocalizations they represent but differ in the way they represent these vocalizations, and there may be a complex processing stream between them.	\N	\N
23306571	The phenomenon of Late-Onset Unilateral Auditory Deprivation was first reported in 1984. However, a high number of unilateral hearing aid fittings are still carried out in cases of bilateral hearing loss, justified by non-auditory factors such as cost, vanity, misinformation and public health policies. The aim was to carry out a behavioral and electrophysiological assessment of the auditory performance of adults using unilateral amplification compared with individuals exposed to bilateral symmetric auditory stimulation. Thirty-five adults, all with symmetric bilateral sensorineural hearing loss, who were regular users of a unilateral hearing aid, users of bilateral hearing aids, or non-users of hearing aids, were assessed on behavioral and electrophysiological tests. Variance analysis revealed that in the unilaterally fitted group, P300 latency was significantly greater in ears with auditory deprivation compared with those fitted with the hearing aid (p < 0.05). This same group also had poorer performance on the Sentence Recognition Test in Noise conducted in free field. These results corroborate findings in the literature showing that unilateral auditory deprivation can lead to physiological and perceptual changes.	\N	\N
23307427	Using a tactile variant of the negative-priming paradigm, we analyzed the influence of Gestalt grouping on the ability of participants to ignore distracting tactile information. The distance between participants' hands, to which the target and distractor stimuli were simultaneously delivered, was varied (near/touching hands vs. hands far apart). In addition, the influence of touching hands was controlled, as participants wore gloves and their hands were blocked from vision by a cover. The magnitude of the tactile negative-priming effect was modulated by the interaction between hand separation and whether or not gloves were worn. When the hands were touching, negative priming emerged only while wearing gloves that prevented direct skin-to-skin contact. In contrast, when the separation between the participants' hands was larger, negative priming emerged only when gloves were not worn. This pattern of results is interpreted in terms of the competing influences of two interacting Gestalt principles--namely, connectedness and proximity--on the processing of tactile distractors.	\N	\N
23316925	The event-related potential (ERP) correlates of sound detection are attenuated when eliciting sounds coincide with our own actions. The role of attention in this effect was investigated in two experiments by presenting tones separated by random intervals. In the homogeneous condition of Experiments 1 and 2, the same tone was repeated, whereas in the mixed condition of Experiment 1, tones with five different frequencies were presented. Participants performed a time-interval production task by marking intervals with keypresses in Experiment 1, and tried to produce keypress-tone coincidences in Experiment 2. Although the auditory ERPs were attenuated for coincidences, no modulation by the multiplicity of tone frequencies in Experiment 1, or by the task-relevancy of tones and coincidences in Experiment 2, was found. This suggests that coincidence-related ERP attenuation cannot be fully explained by voluntary attentional mechanisms.	\N	\N
23321588	The aim was to assess the auditory performance of Digisonic(®) cochlear implant users with electric stimulation (ES) and electro-acoustic stimulation (EAS) with special attention to the processing of low-frequency temporal fine structure. Six patients implanted with a Digisonic(®) SP implant and showing low-frequency residual hearing were fitted with the Zebra(®) speech processor providing both electric and acoustic stimulation. Assessment consisted of monosyllabic speech identification tests in quiet and in noise at different presentation levels, and a pitch discrimination task using harmonic and disharmonic intonating complex sounds (Vaerenberg et al., 2011). These tests investigate place and time coding through pitch discrimination. All tasks were performed with ES only and with EAS. Speech results in noise showed significant improvement with EAS when compared to ES. Whereas EAS did not yield better results in the harmonic intonation test, the improvements in the disharmonic intonation test were remarkable, suggesting better coding of pitch cues requiring phase locking. These results suggest that patients with residual hearing in the low-frequency range still have good phase-locking capacities, allowing them to process fine temporal information. ES relies mainly on place coding but provides poor low-frequency temporal coding, whereas EAS also provides temporal coding in the low-frequency range. Patients with residual phase-locking capacities can make use of these cues.	\N	\N
23327452	In order to provide effective intervention for children with specific language impairment (SLI), it is crucial that there is an understanding of the underlying deficit in SLI. This study utilized a battery of phonological processing tasks to compare the phonological processing skills of children with SLI to typically-developing peers matched for age or language. The children with SLI had significantly poorer performance than age-matched peers on measures of phonological representations, phonological awareness, rapid automatized naming, phonological short-term memory, and one measure of working memory. Of particular significance, the SLI group also demonstrated significantly weaker performance than language-matched peers on one measure of phonological representations, and one measure of working memory. The findings provide some support for a phonological processing account of SLI and highlight the utility of using tasks that draw on a comprehensive model of speech processing to profile and consider children's phonological processing skills in detail.	\N	\N
23331545	The auditory N1 event-related potential has previously been observed to be attenuated for tones that are triggered by human actions. This attenuation is thought to be generated by motor prediction mechanisms and is considered to be important for agency attribution. The present study was designed to rigorously test the notion of action prediction-based sensory attenuation. Participants performed one of four voluntary actions on each trial, with each button associated with either predictable or unpredictable action effects. In addition, actions with each hand could result in action effects that were either congruent or incongruent with hand-specific prediction. We observed no significant differences in N1 amplitude between predictable and unpredictable tones. When contrasting action effects that were congruent or incongruent with hand-specific prediction, we observed significant attenuation for prediction-congruent compared to prediction-incongruent action-effects. These novel findings suggest that accurate action-effect prediction drives sensory attenuation of auditory stimuli. These findings have important implications for understanding the mechanisms of action-effect prediction and sensory attenuation, and may have clinical implications for studies investigating action awareness and agency in schizophrenia.	\N	\N
23334356	Several studies have shown that the ability to identify the timbre of musical instruments is reduced in cochlear implant (CI) users compared with normal-hearing (NH) listeners. However, most of these studies have focused on tasks that require specific musical knowledge. In contrast, the present study investigates the perception of timbre by CI subjects using a multidimensional scaling (MDS) paradigm. The main objective was to investigate whether CI subjects use the same cues as NH listeners do to differentiate the timbre of musical instruments. Three groups of 10 NH subjects and one group of 10 CI subjects were asked to make dissimilarity judgments between pairs of instrumental sounds. The stimuli were 16 synthetic instrument tones spanning a wide range of instrument families. All sounds had the same fundamental frequency (261 Hz) and were balanced in loudness and in perceived duration before the experiment. One group of NH subjects listened to unprocessed stimuli. The other two groups of NH subjects listened to the same stimuli passed through a four-channel or an eight-channel noise vocoder, designed to simulate the signal processing performed by a real CI. Subjects were presented with all possible combinations of pairs of instruments and had to estimate, for each pair, the amount of dissimilarity between the two sounds. These estimates were used to construct dissimilarity matrices, which were further analyzed using an MDS model. The model output gave, for each subject group, an optimal graphical representation of the perceptual distances between stimuli (the so-called "timbre space"). For all groups, the first two dimensions of the timbre space were strikingly similar and correlated strongly with the logarithm of the attack time and with the center of gravity of the spectral envelope, respectively. The acoustic correlate of the third dimension differed across groups but only accounted for a small proportion of the variance explained by the MDS solution. Surprisingly, CI subjects and NH subjects listening to noise-vocoded simulations gave relatively more weight to the spectral envelope dimension and less weight to the attack-time dimension when making their judgments than NH subjects listening to unprocessed stimuli. One possible reason for the relatively higher salience of spectral envelope cues in real and simulated CIs may be that the degradation of local fine spectral details produced a more stable spectral envelope across the stimulus duration. The internal representation of musical timbre for isolated musical instrument sounds was found to be similar in NH and in CI listeners. This suggests that training procedures designed to improve timbre recognition in CIs will indeed train CI subjects to use the same cues as NH listeners. Furthermore, NH subjects listening to noise-vocoded sounds appear to be a good model of CI timbre perception as they show the same first two perceptual dimensions as CI subjects do and also exhibit a similar change in perceptual weights applied to these two dimensions. This last finding validates the use of simulations to evaluate and compare training procedures to improve timbre perception in CIs.	\N	\N
23336003	A large-scale subjective survey was conducted in six shopping malls in Harbin City, China, to determine the influence of social and behavioural characteristics of users on their evaluation of subjective loudness and acoustic comfort. The analysis of social characteristics shows that evaluation of subjective loudness is influenced by income and occupation, with correlation coefficients or contingency coefficients of 0.10 to 0.40 (p<0.05 or p<0.01). Meanwhile, evaluation of acoustic comfort is influenced by income, education level, and occupation, with correlation coefficients or contingency coefficients of 0.10 to 0.60 (p<0.05 or p<0.01). The effect of gender and age on evaluation of subjective loudness and acoustic comfort is statistically insignificant. The effects of occupation are mainly caused by the differences in income and education level, in which the effects of income are greater than those of education level. In terms of behavioural characteristics, evaluation of subjective loudness is influenced by the reason for visit, frequency of visit, and length of stay, with correlation coefficients or contingency coefficients of 0.10 to 0.40 (p<0.05 or p<0.01). Evaluation of acoustic comfort is influenced by the reason for visit to the site, the frequency of visit, length of stay, and also season of visit, with correlation coefficients of 0.10 to 0.30 (p<0.05 or p<0.01). In particular, users who are waiting for someone show lower evaluation of acoustic comfort, whereas users who go to shopping malls more than once a month show higher evaluation of acoustic comfort. By contrast, the influences of the period of visit and of the accompanying persons are found to be insignificant.	\N	\N
23339556	This study explored the developmental trends and phonetic category formation in bilingual children and adults. Participants included 30 fluent Spanish-English bilingual children, aged 8-11, and bilingual adults, aged 18-40. All completed gating tasks that incorporated code-mixed Spanish-English stimuli. There were significant differences in performance according to phonotactic construction of the stimuli, with fastest word recognition for words with voiceless initial consonants. Analysis of developmental trends revealed significant differences in children's performance by grade level and faster recognition of English voiceless initial consonants than of Spanish voiceless initial consonants. Differences in voice onset time between English and Spanish may have contributed to quicker recognition of English voiceless consonants than Spanish voiceless consonants. It is also possible that increased exposure to both spoken and written English may account for faster recognition of English voiceless words than Spanish voiceless words. In conclusion, multiple factors may influence perception of a second language.	\N	\N
23340379	Since deafness is the most common sensorineural disorder in humans, better understanding of the underlying causes is necessary to improve counseling and rehabilitation. A Dutch family with autosomal dominantly inherited sensorineural hearing loss was clinically and genetically assessed. The MYO6 gene was selected to be sequenced because of similarities with other, previously described DFNA22 phenotypes and a pathogenic c.3610C > T (p.R1204W) mutation was found to co-segregate with the disease. This missense mutation results in a flat configured audiogram with a mild hearing loss, which becomes severe to profound and gently to steeply downsloping later in life. The age-related typical audiograms (ARTA) constructed for this family resemble presbyacusis. Speech audiometry and results of loudness scaling support the hypothesis that the phenotype of this specific MYO6 mutation mimics presbyacusis.	\N	\N
23341954	Two experiments investigated deaf individuals' ability to discriminate between same-sex talkers based on vibrotactile stimulation alone. Nineteen participants made same/different judgments on pairs of utterances presented to the lower back through voice coils embedded in a conforming chair. Discrimination of stimuli matched for F0, duration, and perceived magnitude was successful for pairs of spoken sentences in Experiment 1 (median percent correct = 83%) and pairs of vowel utterances in Experiment 2 (median percent correct = 75%). Greater difference in spectral tilt between "different" pairs strongly predicted their discriminability in both experiments. The current findings support the hypothesis that discrimination of complex vibrotactile stimuli involves the cortical integration of spectral information filtered through frequency-tuned skin receptors.	\N	\N
23351131	The suppression of the auditory N1 event-related potential (ERP) to self-initiated sounds has become a popular tool to tap into sensory-specific forward modeling. It is assumed that processing in the auditory cortex is attenuated due to a match between sensory stimulation and a specific sensory prediction afforded by a forward model of the motor command. The present study shows that N1 suppression was dramatically increased with long (≈ 3 s) stimulus onset asynchronies (SOA), whereas P2 suppression was equal in all SOA conditions (0.8, 1.6, 3.2 s). Thus, the P2 was found to be more sensitive to self-initiation effects than the N1 with short SOAs. Moreover, only the unspecific but not the sensory-specific N1 components were suppressed for self-initiated sounds, suggesting that N1-suppression effects mainly reflect an attenuated orienting response. We argue that the N1-suppression effect is a rather indirect measure of sensory-specific forward models.	\N	\N
23351849	This work investigates the nature of the comprehension impairment in Wernicke's aphasia (WA), by examining the relationship between deficits in auditory processing of fundamental, non-verbal acoustic stimuli and auditory comprehension. WA, a condition resulting in severely disrupted auditory comprehension, primarily occurs following a cerebrovascular accident (CVA) to the left temporo-parietal cortex. Whilst damage to posterior superior temporal areas is associated with auditory linguistic comprehension impairments, functional-imaging indicates that these areas may not be specific to speech processing but part of a network for generic auditory analysis. We examined analysis of basic acoustic stimuli in WA participants (n = 10) using auditory stimuli reflective of theories of cortical auditory processing and of speech cues. Auditory spectral, temporal and spectro-temporal analysis was assessed using pure-tone frequency discrimination, frequency modulation (FM) detection and the detection of dynamic modulation (DM) in "moving ripple" stimuli. All tasks used criterion-free, adaptive measures of threshold to ensure reliable results at the individual level. Participants with WA showed normal frequency discrimination but significant impairments in FM and DM detection, relative to age- and hearing-matched controls at the group level (n = 10). At the individual level, there was considerable variation in performance, and thresholds for both FM and DM detection correlated significantly with auditory comprehension abilities in the WA participants. These results demonstrate the co-occurrence of a deficit in fundamental auditory processing of temporal and spectro-temporal non-verbal stimuli in WA, which may have a causal contribution to the auditory language comprehension impairment. Results are discussed in the context of traditional neuropsychology and current models of cortical auditory processing.	\N	\N
23354172	Visual speech inputs can enhance auditory speech information, particularly in noisy or degraded conditions. The natural statistics of audiovisual speech highlight the temporal correspondence between visual and auditory prosody, with lip, jaw, cheek and head movements conveying information about the speech envelope. Low-frequency spatial and temporal modulations in the 2-7 Hz range are of particular importance. Dyslexic individuals have specific problems in perceiving speech envelope cues. In the current study, we used an audiovisual noise-vocoded speech task to investigate the contribution of low-frequency visual information to intelligibility of 4-channel and 16-channel noise vocoded speech in participants with and without dyslexia. For the 4-channel speech, noise vocoding preserves amplitude information that is entirely congruent with dynamic visual information. All participants were significantly more accurate with 4-channel speech when visual information was present, even when this information was purely spatio-temporal (pixelated stimuli changing in luminance). Possible underlying mechanisms are discussed.	\N	\N
23357092	Here we present two experiments investigating the implicit orienting of attention over time by entrainment to an auditory rhythmic stimulus. In the first experiment, participants carried out detection and discrimination tasks with auditory and visual targets while listening to an isochronous auditory sequence, which acted as the entraining stimulus. For the second experiment, we used musical extracts as the entraining stimulus, and tested the resulting strength of entrainment with a visual discrimination task. Both experiments used reaction times as a dependent variable. By manipulating the appearance of targets across four selected metrical positions of the auditory entraining stimulus we were able to observe how entraining to a rhythm modulates behavioural responses. That our results were independent of modality gives a new insight into cross-modal interactions between auditory and visual modalities in the context of dynamic attending to auditory temporal structure.	\N	\N
23362674	It has been shown that humans are able to recognise their own movement. While visual cues have been amply studied, the contribution of auditory cues is not clear. Our aim was to investigate the role of temporal auditory cues in the identification of one's own or others' performance in a complex movement--a golf swing. We investigated whether golfers are able to discriminate between the sounds associated with their own swings and other golfers' swings, by using the relative timing and the overall duration of the movement. The sounds produced by the participants performing 65 m shots were recorded and used to create the stimuli. The experimental conditions were: participants' own swing sounds, and the sounds of other golfers with both equal relative timing and equal overall duration, equal relative timing but different overall duration, different relative timing but equal overall duration, or both different relative timing and different overall duration. The task of the participants was to say whether each sound corresponded or did not correspond to their own swing. Results show that golfers are able to recognise their own movements, but they also recognise as their own the sounds produced by other athletes whose swings have both equal relative timing and equal overall duration.	\N	\N
23363116	Detection thresholds for 100 ms of either 5- or 20-Hz frequency modulation (FM) were measured at various temporal positions within a 600-ms, 4-kHz pure-tone carrier. The results indicated that the temporal position of the signal relative to the fringe influences detection thresholds, including an effect that is reminiscent of auditory backward recognition masking. A task involving frequency increments, rather than sinusoidal FM, yielded similar results. Additional manipulation of total carrier duration indicated that FM detection thresholds improve as the duration of the forward fringe increases, while a backward fringe only degrades performance in the absence of any forward fringe. The results suggest that listeners are insensitive to subtle frequency changes that occur at the onset of a longer stimulus and that the interaction between the opposing effects of the forward and backward fringes is not additive.	\N	\N
23363122	The vocal tract length of a speaker is the primary determinant of the range of formant frequencies (FFs) produced by that speaker. Listeners have demonstrated sensitivity to the average FFs produced by voices, for example, in estimating the relative heights of two speakers based on their speech. However, it is not known whether they can learn to identify voices based on the acoustic characteristic associated with the average FFs produced by a voice (this characteristic will be referred to as FF-scaling). To investigate this, a series of vowels corresponding to voices that differed in their average f0 and/or FF-scaling were synthesized. Listeners (n = 71) were trained to identify these voices using a training procedure where, for each trial, they heard the vowels representing a voice and then had to identify the stimulus voice from among a series of candidate voices that differed in terms of their FF-scaling and/or their f0. Results indicate that listeners can identify voices on the basis of FF-scaling quite accurately and consistently after only a short training session and that, although f0 weakly influences these estimates, they are most strongly determined by the stimulus FFs.	\N	\N
23363188	Good localization accuracy depends on an auditory spatial map that provides consistent binaural information across frequency and level. This study investigated whether mapping bilateral cochlear implants (CIs) independently contributes to distorted perceptual spatial maps. In a meta-analysis, interaural level differences necessary to perceptually center sound images were calculated for 127 pitch-matched pairs of electrodes; many needed large current adjustments to be perceptually centered. In a separate experiment, lateralization was also found to be inconsistent across levels. These findings suggest that auditory spatial maps are distorted in the mapping process, which likely reduces localization accuracy and target-noise separation in bilateral CIs.	\N	\N
23363191	Listeners presented with noise were asked to press a key whenever they heard the vowels [a] or [i:]. The noise had a random spectrum, with levels in 60 frequency bins changing every 0.5 s. Reverse correlation was used to average the spectrum of the noise prior to each key press, thus estimating the features of the vowels for which the participants were listening. The formant frequencies of these reverse-correlated vowels were similar to those of their respective whispered vowels. The success of this response-triggered technique suggests that it may prove useful for estimating other internal representations, including perceptual phenomena like tinnitus.	\N	\N
23363193	Speech understanding difficulties for older adults (OAs) are well documented. Very little is known about whether age-related changes affect their speech production as well. Intelligibility of conversational and clear speech sentences produced by five OA talkers was examined. The results of the sentence-in-noise listening tests revealed that OAs enhanced their intelligibility for young adult (YA) listeners through clear speech modifications. Importantly, though, OAs were less effective at improving their speech to benefit listeners compared to YA talkers [reported in Smiljanic and Bradlow, J. Acoust. Soc. Am. 118(3), 1677-1688 (2005)]. The results suggest that auditory and cognitive changes across the lifespan can affect OAs' speech patterns and intelligibility.	\N	\N
23363194	Stilp and Kluender [(2010). Proc. Natl. Acad. Sci. U.S.A. 107(27), 12387-12392] reported measures of sensory change over time (cochlea-scaled spectral entropy, CSE) reliably predicted sentence intelligibility for normal-hearing listeners. Here, implications for listeners with atypical hearing were explored using noise-vocoded speech. CSE was parameterized as Euclidean distances between biologically scaled spectra [measured before sentences were noise vocoded (CSE)] or between channel amplitude profiles in simulated cochlear-implant processing [measured after vocoding (CSE(CI))]. Sentence intelligibility worsened with greater amounts of information replaced by noise; patterns of performance did not differ between CSE and CSE(CI). Results demonstrate the importance of information-bearing change for speech perception in simulated electric hearing.	\N	\N
23366748	Emotional arousal, or affective patterns, can be probed using observable bioelectric signals, in particular using the fluctuations of electroencephalographic potentials from the human scalp. Hearing impairment related to increased threshold of audio tone detection may cause the loss of intelligibility of speech resulting in an innate automatic emotional response. An adaptive support vector machine can be trained to identify a subject's unique affective response based upon an audiogram hearing test. This paper presents the efficacy of our model, initial SVM classification data, and discusses potential application.	\N	\N
23398728	Superior semi-circular canal dehiscence (SSCD) is a known cause of hearing loss. This study quantifies hearing loss in SSCD ears in a frequency-specific fashion. A meta-analysis of English language literature pertaining to SSCD was performed, with extraction and evaluation of available human audiometric data. Our own institution's case series of SSCD patients was also similarly analysed. Hearing loss in SSCD ears was compared to same patient control ears and to age-matched normative audiometric data. Ears with SSCD had statistically significant worse hearing as compared to both normative data and to own normal ear controls at 2000 Hz and below. The effect appears to diminish with increasing frequency. The presence of statistically significant conductive hearing loss in the low frequencies was confirmed for SSCD ears. SSCD may also predispose ears to high frequency sensorineural hearing loss.	\N	\N
23404529	The purpose of the study was to investigate the potential clinical advantages of incorporating a contralateral routing of signals (CROS) microphone in unilateral cochlear implant (CI) users. A prospective study was undertaken on a group of 21 postlingually deafened adults who were all implanted with the same multichannel CI system. Performance with a unilateral CI was compared with performance using both a unilateral implant and a CROS microphone system worn on the opposite side (CI-CROS). Speech understanding using the AzBio sentences was evaluated in quiet, with speech presented at 0° and 270° azimuth in the horizontal plane. Speech understanding in noise was evaluated with speech at 0°, and noise at 0°, 90°, and 270°. A significant gain in speech understanding using CI-CROS compared to the unilateral CI alone was found in quiet when speech was presented at 270° (average improvement of 8.8%, P < .01). Participants also demonstrated statistically significant improvement using CI-CROS compared with the unilateral CI alone when noise was presented at 90° and speech at 0° (average improvement of 6.7%, P < .01). Adding a contralateral microphone to a unilateral CI resulted in a significant improvement in speech understanding in different conditions. This method could provide a greater cost/benefit ratio than bilateral CIs and be a potential improvement for those who are not candidates for bilateral CIs.	\N	\N
23418635	Cochlear implantation (CI) has been shown in long-term prospective trials to significantly reduce incapacitating tinnitus in single-sided deafness (SSD). Discussion arises whether electrical stimulation near the round window (RW) is also able to reduce tinnitus. The aim was to assess whether electrical stimulation of the basal first 4 intracochlear electrodes of a CI could sufficiently reduce tinnitus and to compare these results with stimulation with all CI electrodes. Seven patients who met the criteria of severe tinnitus due to SSD were implanted with a Med-El Sonata Ti100 with a FlexSoft™ or Flex24™ electrode. After 4 weeks only the basal electrode pair (E12) nearest to the RW was activated. Each week the following pair was activated until the 4th pair. Thereafter all electrodes were activated. Tinnitus was assessed before CI surgery and before each electrode pair was activated. When all electrodes were fitted, evaluation was done after 1, 3 and 6 months. Tinnitus was assessed with a Visual Analogue Scale (VAS) for loudness, psychoacoustic tinnitus loudness comparison at 1 kHz and the Tinnitus Questionnaire (TQ) for the effect on quality of life. To evaluate the natural evolution, a tightly matched control group with severe tinnitus due to SSD was followed prospectively. All the tinnitus outcome measures remained unchanged with 1, 2, 3 or 4 activated electrode pairs. With complete CI activation, the tinnitus decreased significantly, comparable with earlier reports. Pre-implantation the tinnitus loudness was 8.2/10 on the VAS and was reduced to 4.1/10 6 months postimplantation. Psychometrically the loudness level went from 21.7 dB SL (SD: 16.02) to 7.5 dB SL (SD: 5.24) and the TQ from 60/84 to 39/84. The non-implanted group had no decrease of the tinnitus; the average VAS remained stable at 8.9/10 throughout the follow-up period of 6 months. 
With the current stimulation parameters, electrical stimulation in the first 8-10 mm of the basal part of the scala tympani is insufficient to reduce tinnitus. However, stimulation over the complete CI length yields immediate tinnitus reduction, confirming earlier results.	\N	\N
23421638	BACKGROUND/STUDY CONTEXT: The perception of naturalistic events depends on the ability to integrate perceptual information from multiple sensory systems. Currently, little is known about how multisensory integration is affected by normal aging. The authors conducted two experiments to investigate audiovisual temporal processing in younger (18-29 years) and older (70+ years) adults. In both experiments, participants were presented with a brief visual stimulus and a brief auditory stimulus separated by various temporal offsets, and participants judged which stimulus was presented first. In Experiment 1, the auditory and visual stimuli were presented from the same perceived location, whereas in Experiment 2 they were presented from different locations. The authors found no effect of stimulus location, and no evidence of age-related declines in performance in either experiment. Older adults appear to retain the ability to discriminate the temporal order of audiovisual stimuli and can perform similarly to younger adults.	\N	\N
23422927	Prosody includes suprasegmental components of speech, such as intonation and rate, which add meaning beyond the words being spoken. Sensitivity to pragmatic prosody could improve communication within conversations. These studies investigated adults' and preschoolers' sensitivity to pragmatic prosody. Experiment 1 demonstrated that adults and children comprehend pragmatic prosody; they selected fast actions when descriptions were spoken fast versus when descriptions were spoken slowly. Experiment 2 demonstrated that adults and children spontaneously produce pragmatic prosody (their descriptions of fast actions were faster than their descriptions of slow actions) even when it was not necessary for the task. These studies conclude that children, like adults, are capable of using and producing pragmatic prosody; however, children are less sensitive than adults to subtle prosodic distinctions.	\N	\N
23426091	To maintain optimal understanding, persons with sensorineural hearing loss (SNHL) often report a need for increased attention, concentration, and "listening effort" compared with persons without hearing loss. It is generally assumed that this increased effort is related to subjective reports of mental fatigue in persons with hearing loss. Although the benefits of hearing aids for improving intelligibility are well documented, their impact on listening effort and mental fatigue are less clear. This study used subjective and objective measures to examine the effects of hearing aid use and advanced hearing aid features on listening effort and mental fatigue in adults with SNHL. Sixteen adults (aged 47-69 years) with mild to severe sloping SNHL participated. A dual-task paradigm assessed word recognition, word recall, and visual reaction times (RTs) to objectively quantify listening effort and fatigue. Mental fatigue was operationally defined as a decrement in performance over the duration of the experiment (approximately 1 hr). Participants were fitted with study hearing aids and tested unaided and in two aided conditions (omnidirectional and with directional processing and digital noise reduction active). Subjective ratings of listening effort experienced during the day and ratings of fatigue and attentiveness immediately before and after the dual-task were also obtained. Word recall was better and dual-task RTs were significantly faster in the aided compared with unaided conditions, suggesting a decrease in listening effort when listening aided. Word recognition and recall in unaided and aided conditions remained relatively stable over the duration of the dual-task, suggesting these processes were resistant to mental fatigue. In contrast, dual-task RTs systematically increased over the duration of the speech task when listening unaided, consistent with development of mental fatigue. 
However, dual-task RTs remained stable over time in both aided conditions suggesting that hearing aid use reduced susceptibility to mental fatigue. Subjective ratings of fatigue and attentiveness also increased significantly after completion of the dual-task; however, no differences between unaided and aided subjective ratings were observed. Correlation analyses between subjective and objective measures of listening effort and mental fatigue showed no strong or consistent relationship. Likewise, subject variables such as age and degree of hearing loss showed no strong or consistent relationship to either subjective or objective measures of listening effort or mental fatigue. Results from subjective and select objective measures suggest sustained speech-processing demands can lead to mental fatigue in persons with hearing loss. It is important to note that the use of clinically fit hearing aids may reduce listening effort and susceptibility to mental fatigue associated with sustained speech-processing demands. The present study design did not reveal additional benefits, in terms of reduced listening effort or fatigue, from use of directional processing and digital noise-reduction algorithms. However, experimental design limitations suggest further work in this area is needed. Finally, subjective and objective measures of listening effort and mental fatigue due to sustained speech-processing demands, were not strongly associated, suggesting that these measures may assess different aspects of listening effort and mental fatigue.	\N	\N
23438484	Two experiments tested the effect of exposure to masked phobic stimuli at a very brief stimulus onset asynchrony on reducing the subjective experience of fear caused by in vivo exposure to a feared object. In the main experiment, 35 spider-fearful and 35 non-fearful participants were identified with a questionnaire and a behavioural avoidance test (BAT) with a live tarantula. One week later, they were individually administered one of two continuous series of masked images: spiders or flowers. They engaged in the BAT again immediately thereafter. They provided ratings of subjective fear at the end of each BAT (pre- and post-manipulation). Very brief exposure to images of spiders reduced the fearful group's and not the non-fearful group's experience of fear at the end of the BAT. This effect was replicated with another sample of 26 spider-fearful participants from the same population. Theoretical implications are discussed.	\N	\N
23442566	Performance in tone perception and production are correlated in prelingually deafened pediatric cochlear implant (CI) users across individuals. Demographic variables, such as age at implantation, contribute to the performance variability. Poor representation of pitch information in CI devices hinders pitch perception and affects perception of lexical tones in cochlear implant users who speak tonal languages. One hundred ten Mandarin-speaking, prelingually deafened CI subjects and 125 typically developing, normal-hearing subjects were recruited from Beijing, China. Lexical tone perception was measured using a computerized tone contrast test. Tone production was judged by native Mandarin-speaking adult listeners as well as analyzed acoustically and with an artificial neural network. A general linear model analysis was performed to determine factors that accounted for performance variability. CI subjects scored ≈ 67% correct on the lexical tone perception task. The degree of differentiation of tones produced by the CI group was significantly lower than the control group as revealed by acoustic analysis. Tone production performance assessed by the neural network was highly correlated with that evaluated by human listeners. There was a moderate correlation between the overall tone perception and production performance across CI subjects. Duration of implant use and age at implantation jointly explained ≈ 29% of the variance in the tone perception performance. Age at implantation was the only significant predictor for tone production performance in the CI subjects. Tone production performance in pediatric CI users is dependent on accurate perception. Early implantation predicts a better outcome in lexical tone perception and production.	\N	\N
23446715	It is generally agreed that the auditory perception skills of children with developmental language disorders are more limited than those of typically developing children. It is not easy to determine exactly how the capacity to discriminate and the capacity to pronounce phonemes influence each other in children with language disorders. For most authors, the inability to discriminate certain phonemes accurately causes a developmental delay in pronunciation, whereas others claim the influence is mutual. The aim of this study is to determine in which consonants perceptive difficulty is more likely to occur and in which cases there is a greater probability of difficulty when it comes to articulating them. The sample used in the study consisted of 86 children with a mean age of 4 years and 7 months. The phonological processes involved in simplifying speech were identified. Their errors were used as the basis on which to construct and apply a specific speech perception test. The relationship between the articulatory and perceptive skills of children with substitutive processes were analysed by means of two comparisons: first, in all the processes detected taken as a whole and, second, in the three most frequent substitutive processes. These analyses were carried out to determine whether the nature of the consonant implied a greater probability of perceptive difficulty. The findings provide information about a relation between the articulatory and perceptive skills, and about whether the nature of the consonant determines a higher probability of perceptive or articulatory difficulties. These results can be of value in the assessment, design and effectiveness of speech therapy programmes.	\N	\N
23448103	The aim was to evaluate the auditory brainstem response (ABR) amplitudes evoked by tone pip and narrowband chirp (NB CE-Chirp) stimuli when testing post-screening newborns and to determine the difference in estimated hearing level correction values. Tests were performed with tone pips and NB CE-Chirps at 4 kHz or 1 kHz. The response amplitude, response quality (Fmp), and residual noise were compared for both stimuli. Participants were thirty babies (42 ears) who passed our ABR discharge criterion at 4 kHz following referral from their newborn hearing screen. Overall, NB CE-Chirp responses were 64% larger than the tone pip responses, closer to those evoked by clicks. Fmp was significantly higher for NB CE-Chirps. It is anticipated that there could be significant reductions in test time for the same signal to noise ratio by using NB CE-Chirps when testing newborns. This effect may vary in practice and is likely to be most beneficial for babies with low amplitude ABR responses. We propose that the ABR nHL threshold to eHL correction for NB CE-Chirps should be approximately 5 dB less than the corrections for tone pips at 4 and 1 kHz.	\N	\N
23453221	This study examined the ability of prelingually deaf children with bilateral implants to identify emotion (i.e. happiness or sadness) in speech and music. Participants in Experiment 1 were 14 prelingually deaf children from 5-7 years of age who had bilateral implants and 18 normally hearing children from 4-6 years of age. They judged whether linguistically neutral utterances produced by a man and woman sounded happy or sad. Participants in Experiment 2 were 14 bilateral implant users from 4-6 years of age and the same normally hearing children as in Experiment 1. They judged whether synthesized piano excerpts sounded happy or sad. Child implant users' accuracy of identifying happiness and sadness in speech was well above chance levels but significantly below the accuracy achieved by children with normal hearing. Similarly, their accuracy of identifying happiness and sadness in music was well above chance levels but significantly below that of children with normal hearing, who performed at ceiling. For the 12 implant users who participated in both experiments, performance on the speech task correlated significantly with performance on the music task and implant experience was correlated with performance on both tasks. Child implant users' accurate identification of emotion in speech exceeded performance in previous studies, which may be attributable to fewer response alternatives and the use of child-directed speech. Moreover, child implant users' successful identification of emotion in music indicates that the relevant cues are accessible at a relatively young age.	\N	\N
23458475	The purpose of this study was to measure real-ear aided and saturated responses of SpeechEasy™ devices and compare responses while devices delivered altered auditory feedback (AAF) and non-altered feedback (NAF). A repeated measures quasi-experimental design was employed. Ten people fitted with completely-in-the-canal or open fit behind-the-ear devices participated. Probe microphone measures were obtained with speech, and 17 chirp stimuli presented at 75 dB and 85 dB SPL, respectively. Measurements were compared with devices delivering AAF (i.e. delayed and frequency shifted) versus NAF. Maximum outputs were approximately 100-105 dB SPL in the 2000-4000 Hz range. Statistically significant differences in device SPL output as a function of device setting (AAF vs. NAF) were found for seven chirp stimuli (p <.05) when levels were sampled at points that were not temporally aligned with the output chirps but not for speech stimulus (p = .17). Device output varied across individuals and with open fit devices dominated by ear canal resonance effects. Real-ear aided responses were equivalent with speech input when devices delivered AAF and NAF. Real-ear saturated responses were not, however, comparable between AAF and NAF settings and may be underestimated if AAF delay is not accounted for.	\N	\N
23461765	Advantages associated with the left ear (right brain hemisphere) have been reported in some studies. Of these, some have specifically suggested that the left ear has a more heightened ability to detect emotional tones. Meanwhile others have pointed to factors such as age and gender as potentially leading to manifestations of human laterality. This study investigates which brain hemisphere is more involved in emotional processing of auditory information in Arab participants. We aimed to replicate the previous studies because no single study has been done in the Arabic region previously. Additionally, people in this region prefer to use the right side of their body, e.g., hand, ear, foot, etc., for most daily tasks. To acquire data a dichotic listening task (DLT) was administered to 28 male and 23 female (Edinburgh, UK) university students aged 19 to 38; 13 were left-handed and 38 were right-handed. The results showed a significant left ear advantage in the auditory processing of emotional information. There was a significant negative correlation between ear preference and handedness. Left ear advantage related only to handedness. Thus right-handed participants were more likely than left-handers to have a left ear advantage. The relationship between ear preference and gender was non-significant. The conclusion that might be drawn from this study is that the left ear (right hemisphere) is more involved in emotional processing than the right ear (left hemisphere), especially for right-handed people.	\N	\N
23462430	We set out to determine whether extra-striate ventral stream function was compromised in amblyopia and to compare any observed deficit with previous data on comparable dorsal stream function. We devised a multi-element orientation task where orientation coherence sensitivity could be measured in a comparable way to motion coherence. The use of spatial frequency narrowband elements allowed for accurate correction of any upstream contrast sensitivity influence and ensured that the orientation bandwidth of our elements did not covary with the measured coherence. Using a standard equivalent noise analysis, we varied both the local orientation bandwidth of individual elements as well as the global orientation bandwidth of the element array to obtain estimates of both local and global internal noise and efficiency. The results show that for this ventral stream task there is only a subtle amblyopic deficit in processing global orientation relative to control observers. This deficit is present for both amblyopic and fixing eyes, and appears to reflect poorer efficiency in processing local orientation, suggesting a subtle deficit at the input stage to extra-striate cortex where orientation coherence is processed.	\N	\N
23463992	A decision weight analysis is used to investigate transition bandwidths [Berg (2007). J. Acoust. Soc. Am. 121, 3639-2645]. The psychophysical task is similar to a standard profile analysis experiment except that the spacing of the tones comprising the stimuli is linear and very narrow (e.g., 20 Hz). An increment in the level of the central tone constitutes the signal. Pitch cues and single channel energy cues are degraded with randomization procedures. Thresholds increase as the number of tones comprising the stimulus (n) increases up to a transition bandwidth and then decrease or stay constant with further increases in n. It is proposed that the transition bandwidth reflects a discrete change in the underlying process, with a temporal process (e.g., envelope processor) dominating for stimulus bandwidths less than the transition bandwidth and a process of spectral profile analysis at wider bandwidths. Estimates of decision weights support the proposal.	\N	\N
23464020	Just noticeable differences in interaural correlation (ρ-jnds) from diffuse sound field reference correlations are obtained. In a three-interval, three-alternative forced-choice procedure, ρ-jnds are measured for positive and negative deviations from nine narrowband reference conditions. Stimuli are noise bursts 1 equivalent rectangular bandwidth wide, with center frequencies between 165 and 1500 Hz. The frequency-dependent reference correlation (ρref) is determined by the simulated interaural correlation under ideal diffuse sound field conditions. Results show that the interaural correlation at threshold for deviation toward the positive correlation range follows the reference curve in a nonlinear fashion. For deviation toward the negative correlation range, the interaural correlation at threshold is farther from the reference curve and does not markedly resemble its trend. The results indicate that the previously found asymmetry for correlation discrimination from uncorrelated broadband stimuli to the positive and negative correlation range becomes less pronounced for narrowband stimuli. For positive deviation, the highest jnds are found for the region where the reference curve reaches its global minimum in ρref; despite that, the interaural correlation at threshold for positive deviation exhibits its lowest value at that point.	\N	\N
23464027	Acousticians generally assess the acoustic qualities of a concert hall or any other room using impulse response-based measures such as the reverberation time, clarity index, and others. These parameters are used to predict perceptual attributes related to the acoustic qualities of the room. Various studies show that these physical measures are not able to predict the related perceptual attributes sufficiently well under all circumstances. In particular, it has been shown that physical measures are dependent on the state of occupation, are prone to exaggerated spatial fluctuation, and suffer from lacking discrimination regarding the kind of acoustic stimulus being presented. Accordingly, this paper proposes a method for the derivation of signal-based measures aiming at predicting aspects of room acoustic perception from content specific signal representations produced by a binaural, nonlinear model of the human auditory system. Listening tests were performed to test the proposed auditory parameters for both speech and music. The results look promising; the parameters correlate with their corresponding perceptual attributes in most cases.	\N	\N
23464028	Several lines of evidence indicate that auditory temporal resolution improves over childhood, whereas other data implicate the development of processing efficiency. The present study used the masking period pattern paradigm to examine the maturation of temporal processing in normal-hearing children (4.8 to 10.7 yrs) compared to adults. Thresholds for a brief tone were measured at 6 temporal positions relative to the period of a 5-Hz quasi-square-wave masker envelope, with a 20-dB modulation depth, as well as in 2 steady maskers. The signal was a pure tone at either 1000 or 6500 Hz, and the masker was a band of noise, either spectrally wide or narrow (21.3 and 1.4 equivalent rectangular bandwidths, respectively). Masker modulation improved thresholds more for wide than narrow bandwidths, and adults tended to receive more benefit from modulation than young children. Fits to data for the wide maskers indicated a change in window symmetry with development, reflecting relatively greater backward masking for the youngest listeners. Data for children >6.5 yrs of age appeared more adult-like for the 6500- than the 1000-Hz signal. Differences in temporal window asymmetry with listener age cannot be entirely explained as a consequence of a higher criterion for detection in children, a form of inefficiency.	\N	\N
23464037	In spoken word identification and memory tasks, stimulus variability from numerous sources impairs performance. In the current study, the influence of foreign-accent variability on spoken word identification was evaluated in two experiments. Experiment 1 used a between-subjects design to test word identification in noise in single-talker and two multiple-talker conditions: multiple talkers with the same accent and multiple talkers with different accents. Identification performance was highest in the single-talker condition, but there was no difference between the single-accent and multiple-accent conditions. Experiment 2 further explored word recognition for multiple talkers in single-accent versus multiple-accent conditions using a mixed design. A detriment to word recognition was observed in the multiple-accent condition compared to the single-accent condition, but the effect differed across the language backgrounds tested. These results demonstrate that the processing of foreign-accent variation may influence word recognition in ways similar to other sources of variability (e.g., speaking rate or style) in that the inclusion of multiple foreign accents can result in a small but significant performance decrement beyond the multiple-talker effect.	\N	\N
23466938	Change deafness describes the failure to perceive even intense changes within complex auditory input, if the listener does not attend to the changing sound. Remarkably, previous psychophysical data provide evidence that this effect occurs independently of successful stimulus encoding, indicating that undetected changes are processed to some extent in auditory cortex. Here we investigated cortical representations of detected and undetected auditory changes using electroencephalographic (EEG) recordings and a change deafness paradigm. We applied a one-shot change detection task, in which participants listened successively to three complex auditory scenes, each of them consisting of six simultaneously presented auditory streams. Listeners had to decide whether all scenes were identical or whether the pitch of one stream was changed between the last two presentations. Our data show significantly increased middle-latency Nb responses for both detected and undetected changes as compared to no-change trials. In contrast, only successfully detected changes were associated with a later mismatch response in auditory cortex, followed by increased N2, P3a and P3b responses, originating from hierarchically higher non-sensory brain regions. These results strengthen the view that undetected changes are successfully encoded at sensory level in auditory cortex, but fail to trigger later change-related cortical responses that lead to conscious perception of change.	\N	\N
23467170	Cochlear implant (CI) users typically have excellent speech recognition in quiet but struggle with understanding speech in noise. It is thought that broad current spread from stimulating electrodes causes adjacent electrodes to activate overlapping populations of neurons which results in interactions across adjacent channels. Current focusing has been studied as a way to reduce spread of excitation, and therefore, reduce channel interactions. In particular, partial tripolar stimulation has been shown to reduce spread of excitation relative to monopolar stimulation. However, the crucial question is whether this benefit translates to improvements in speech perception. In this study, we compared speech perception in noise with experimental monopolar and partial tripolar speech processing strategies. The two strategies were matched in terms of number of active electrodes, microphone, filterbanks, stimulation rate and loudness (although both strategies used a lower stimulation rate than typical clinical strategies). The results of this study showed a significant improvement in speech perception in noise with partial tripolar stimulation. All subjects benefited from the current focused speech processing strategy. There was a mean improvement in speech recognition threshold of 2.7 dB in a digits in noise task and a mean improvement of 3 dB in a sentences in noise task with partial tripolar stimulation relative to monopolar stimulation. Although the experimental monopolar strategy was worse than the clinical, presumably due to different microphones, frequency allocations and stimulation rates, the experimental partial-tripolar strategy, which had the same changes, showed no acute deficit relative to the clinical.	\N	\N
23489145	Under adverse listening conditions, speech comprehension profits from the expectancies that listeners derive from the semantic context. However, the neurocognitive mechanisms of this semantic benefit are unclear: How are expectancies formed from context and adjusted as a sentence unfolds over time under various degrees of acoustic degradation? In an EEG study, we modified auditory signal degradation by applying noise-vocoding (severely degraded: four-band, moderately degraded: eight-band, and clear speech). Orthogonal to that, we manipulated the extent of expectancy: strong or weak semantic context (±con) and context-based typicality of the sentence-last word (high or low: ±typ). This allowed calculation of two distinct effects of expectancy on the N400 component of the evoked potential. The sentence-final N400 effect was taken as an index of the neural effort of automatic word-into-context integration; it varied in peak amplitude and latency with signal degradation and was not reliably observed in response to severely degraded speech. Under clear speech conditions in a strong context, typical and untypical sentence completions seemed to fulfill the neural prediction, as indicated by N400 reductions. In response to moderately degraded signal quality, however, the formed expectancies appeared more specific: Only typical (+con +typ), but not the less typical (+con -typ) context-word combinations led to a decrease in the N400 amplitude. The results show that adverse listening "narrows," rather than broadens, the expectancies about the perceived speech signal: limiting the perceptual evidence forces the neural system to rely on signal-driven expectancies, rather than more abstract expectancies, while a sentence unfolds over time.	\N	\N
23495123	We assessed the automaticity of spatial-numerical and spatial-musical associations by testing their intentionality and load sensitivity in a dual-task paradigm. In separate sessions, 16 healthy adults performed magnitude and pitch comparisons on sung numbers with variable pitch. Stimuli and response alternatives were identical, but the relevant stimulus attribute (pitch or number) differed between tasks. Concomitant tasks required retention of either color or location information. Results show that spatial associations of both magnitude and pitch are load sensitive and that the spatial association for pitch is more powerful than that for magnitude. These findings argue against the automaticity of spatial mappings in either stimulus dimension.	\N	\N
23503620	Abnormal auditory adaptation is a standard clinical tool for diagnosing auditory nerve disorders due to acoustic neuromas. In the present study we investigated auditory adaptation in auditory neuropathy owing to disordered function of inner hair cell ribbon synapses (temperature-sensitive auditory neuropathy) or auditory nerve fibres. Subjects were tested when afebrile for (i) psychophysical loudness adaptation to comfortably-loud sustained tones; and (ii) physiological adaptation of auditory brainstem responses to clicks as a function of their position in brief 20-click stimulus trains (#1, 2, 3 … 20). Results were compared with normal hearing listeners and other forms of hearing impairment. Subjects with ribbon synapse disorder had abnormally increased magnitude of loudness adaptation to both low (250 Hz) and high (8000 Hz) frequency tones. Subjects with auditory nerve disorders had normal loudness adaptation to low frequency tones; all but one had abnormal adaptation to high frequency tones. Adaptation was both more rapid and of greater magnitude in ribbon synapse than in auditory nerve disorders. Auditory brainstem response measures of adaptation in ribbon synapse disorder showed Wave V to the first click in the train to be abnormal both in latency and amplitude, and these abnormalities increased in magnitude or Wave V was absent to subsequent clicks. In contrast, auditory brainstem responses in four of the five subjects with neural disorders were absent to every click in the train. The fifth subject had normal latency and abnormally reduced amplitude of Wave V to the first click and abnormal or absent responses to subsequent clicks. Thus, dysfunction of both synaptic transmission and auditory neural function can be associated with abnormal loudness adaptation and the magnitude of the adaptation is significantly greater with ribbon synapse than neural disorders.	\N	\N
23506662	Various dimensions of auditory processing, especially the perception of speech in the presence of background competition, have been shown to deteriorate with age. A persistent problem in the assessment of these age-related changes has been the high prevalence of age-related high-frequency hearing loss in elderly persons. Some investigators have suggested that a more fruitful approach to the study of age-related decline might be to study middle-aged, rather than elderly, persons, where confounding high-frequency hearing loss is less prevalent. The purpose of this study was to determine whether an increase in the left-ear disadvantage (LED) in dichotic listening could be demonstrated in a group of middle-aged persons. The N400 component of the auditory event-related potential (AERP) was utilized to evaluate interaural asymmetry in a quasi-dichotic competing speech task. Electrophysiological responses were obtained on a word-pair semantic categorization task presented through a front loudspeaker while the listener ignored competing speech presented through either left (competition left [CL]) or right (competition right [CR]) loudspeakers. Participants were 20 young (18-24 yr) and 20 middle-aged (44-57 yr) females with normal hearing sensitivity. Individual, as well as grand-averaged, AERP waveforms and scalp topographies were analyzed for the word pairs. Peak amplitude and latency measures of the N400 component were subjected to a mixed design analysis of variance (ANOVA). No significant interaural asymmetry was found in the AERP waveform for the reference word condition in either age group. In response to the second word of the pair, however, middle-aged females showed significantly greater N400 negativity in the CR condition than in the CL condition. No significant laterality effect was found in the young females. The study of young versus middle-aged participants may be an effective way of avoiding the confound of high-frequency hearing loss in elderly persons when studying age effects on auditory processing.	\N	\N
23507387	It was the aim of this study to delineate the areas along the right superior temporal sulcus (STS) for processing of faces, voices, and face-voice integration using established functional magnetic resonance imaging (fMRI) localizers and to assess their structural connectivity profile with diffusion tensor imaging (DTI). We combined this approach with an fMRI adaptation design during which the participants judged emotions in facial expressions and prosody and demonstrated response habituation in the orbitofrontal cortex (OFC) which occurred irrespective of the sensory modality. These functional data were in line with DTI findings showing separable fiber projections of the three different STS modules converging in the OFC which run through the external capsule for the voice area, through the dorsal superior longitudinal fasciculus (SLF) for the face area and through the ventral SLF for the audiovisual integration area. The OFC was structurally connected with the supplementary motor area (SMA) and activation in these two areas was correlated with faster stimulus evaluation during repetition priming. Based on these structural and functional properties, we propose that the OFC is part of the extended system for perception of emotional information in faces and voices and constitutes a neural interface linking sensory areas with brain regions implicated in generation of behavioral responses.	\N	\N
23518401	This study investigated the development of children's skills in identifying ecologically relevant sound objects within naturalistic listening environments, using a non-linguistic analog of the classic 'cocktail-party' situation. Children aged 7-12.5 years completed a closed-set identification task in which brief, commonly encountered environmental sounds were presented at varying signal-to-noise ratios. To simulate the complexity of real-world acoustic environments, target sounds were embedded in either a single, stereophonically presented scene, or in one of two different scenes, with each scene presented to a single ear. Each target sound was either congruent or incongruent with the auditory context. Identification accuracy improved with increasing age, particularly in trials with low signal-to-noise ratios. Performance was most accurate when target sounds were incongruent with the background scene, and when sounds were presented in a single background scene. The presence of two backgrounds disproportionately disrupted children's performance relative to that of previously tested adults, and reduced children's sensitivity to contextual cues. Successful identification of familiar sounds in complex auditory contexts is the outcome of a protracted learning process, with children reaching adult levels of performance after a decade or more of experience.	\N	\N
23523270	The ability to perceive and produce speech undergoes important changes in late adulthood. The goal of the present study was to characterize functional and structural age-related differences in the cortical network that supports speech perception and production, using magnetic resonance imaging, as well as the relationship between functional and structural age-related changes occurring in this network. We asked young and older adults to observe videos of a speaker producing single words (perception), and to observe and repeat the words produced (production). Results show a widespread bilateral network of brain activation for Perception and Production that was not correlated with age. In addition, several regions did show age-related change (auditory cortex, planum temporale, superior temporal sulcus, premotor cortices, SMA-proper). Examination of the relationship between brain signal and regional and global gray matter volume and cortical thickness revealed a complex set of relationships between structure and function, with some regions showing a relationship between structure and function and some not. The present results provide novel findings about the neurobiology of aging and verbal communication.	\N	\N
23534128	The aim of this study was to establish the expression rate of autoimmunity in patients with sudden sensorineural hearing loss and to determine whether a positive marker is associated with a higher rate of hearing recovery after steroid treatment. A prospective study was performed on 137 patients who experienced sudden sensorineural hearing loss and underwent immunoserologic investigations. Autoantibodies evaluated on the day of admission included anti-double-stranded DNA, rheumatoid factor, antiphospholipid immunoglobulins G and M, antinuclear antibody, and complements C3 and C4. Of 137 patients, 75 were male and 62 were female (mean age, 45.1 years). Hearing loss was found on the left side in 61 patients and on the right side in 76 patients. Elevation of at least 1 autoantibody or abnormal complement levels were found in 80 patients (58%), and abnormalities of 2 or more antibodies were found in 28 (20%). There were no statistically significant correlations between autoantibody abnormalities and age, initial hearing level, or positive treatment response. There is no clear evidence of a correlation between autoimmunity and hearing improvement in patients with autoantibody abnormalities. A high (but not significant) expression rate of autoantibody abnormality and complement level was seen in patients with sudden sensorineural hearing loss.	\N	\N
23539259	Cochlear implant systems that combine electric and acoustic stimulation in the same ear are now commercially available and the number of patients using these devices is steadily increasing. In particular, electric-acoustic stimulation is an option for patients with severe, high frequency sensorineural hearing impairment. There have been a range of approaches to combining electric stimulation and acoustic hearing in the same ear. To develop a better understanding of fitting practices for devices that combine electric and acoustic stimulation, we conducted a systematic review addressing three clinical questions: what is the range of acoustic hearing in the implanted ear that can be effectively preserved for an electric-acoustic fitting?; what benefits are provided by combining acoustic stimulation with electric stimulation?; and what clinical fitting practices have been developed for devices that combine electric and acoustic stimulation? A search of the literature was conducted and 27 articles that met the strict evaluation criteria adopted for the review were identified for detailed analysis. The range of auditory thresholds in the implanted ear that can be successfully used for an electric-acoustic application is quite broad. The effectiveness of combined electric and acoustic stimulation as compared with electric stimulation alone was consistently demonstrated, highlighting the potential value of preservation and utilization of low frequency hearing in the implanted ear. However, clinical procedures for best fitting of electric-acoustic devices were varied. This clearly identified a need for further investigation of fitting procedures aimed at maximizing outcomes for recipients of electric-acoustic devices.	\N	\N
23540912	The present study investigated phonological encoding skills in children who stutter (CWS) and those who do not (CNS). Participants were 9 CWS (M=11.8 years, SD=1.5) and 9 age and sex matched CNS (M=11.8 years, SD=1.5). Participants monitored target phonemes located at syllable onsets and offsets of bisyllabic words. Performance in the phoneme monitoring task was compared to an auditory tone monitoring task. Repeated measures analysis of the response time data revealed significant Group×Task×Position interaction with the CWS becoming progressively slower than the CNS in monitoring subsequent phonemes located within the bisyllabic words; differences were not observed in the auditory tone monitoring task. Repeated measures analysis of the error data indicated that the groups were comparable in the percent errors in phoneme vs. tone monitoring. The CWS group was also significantly slower in a picture naming task compared to the CNS. Present findings suggest that CWS experience temporal asynchronies in one or more processes leading up to phoneme monitoring. The findings are interpreted within the scope of contemporary theories of stuttering. At the end of this activity the reader will be able to: (a) discuss the literature on phonological encoding skills in children who stutter, (b) identify theories of phonological encoding in stuttering, (c) define the process of phonological encoding and its implications for fluent speech, (d) suggest future areas of research in the investigation of phonological encoding abilities in children who stutter.	\N	\N
23544047	Because classical music has greatly affected our life and culture over its long history, it has attracted extensive attention from researchers seeking to understand the laws behind it. Based on statistical physics, here we use a different method to investigate classical music, namely, by analyzing cumulative distribution functions (CDFs) and autocorrelation functions of pitch fluctuations in compositions. We analyze 1,876 compositions of five representative classical music composers across 164 years, from Bach, to Mozart, to Beethoven, to Mendelssohn, and to Chopin. We report that the biggest pitch fluctuations of a composer gradually increase as time evolves from the time of Bach to that of Mendelssohn and Chopin. In particular, for the compositions of a composer, the positive and negative tails of a CDF of pitch fluctuations are distributed not only in power laws (with the scale-free property), but also in symmetry (namely, the probability of a treble following a bass and that of a bass following a treble are basically the same for each composer). The power-law exponent decreases as time elapses. Further, we also calculate the autocorrelation function of the pitch fluctuation. The autocorrelation function shows a power-law distribution for each composer. Notably, the power-law exponents vary with the composers, indicating their different levels of long-range correlation of notes. This work not only suggests a way to understand and develop music from a viewpoint of statistical physics, but also enriches the realm of traditional statistical physics by analyzing music.	\N	\N
23544676	Sound sequences, such as music, are usually organized perceptually into concurrent "streams." The mechanisms underlying this "auditory streaming" phenomenon are not completely known. The present study sought to test the hypothesis that synchrony limits listeners' ability to separate sound streams. To test this hypothesis, both perceptual-organization judgments and performance measures were used. In Experiment 1, listeners indicated whether they perceived sequences of alternating or synchronous tones as a single stream or as two streams. In Experiments 2 and 3, listeners detected rare changes in the intensity of "target" tones at one frequency in the presence of synchronous or asynchronous random-intensity "distractor" tones at another frequency. The results of these experiments showed that, for large frequency separations between the tones, the probability of perceiving two streams was lower on average for synchronous than for alternating tones, and that sensitivity to intensity changes in the target sequence was greater for asynchronous than for synchronous distractors. Overall, these results are consistent with the hypothesis that synchrony limits listeners' ability to form separate streams and/or to attend selectively to certain sounds in the presence of other sounds, even when the target and distractor sounds are well separated from each other in frequency.	\N	\N
23547103	Intertrial repetition priming plays a striking role in visual search. For instance, when searching for a target with a unique color, performance is substantially better when the specific color of the target repeats on successive trials (Maljkovic & Nakayama, 1994). Recent research has relied on objective measures of performance to show that priming improves the perceptual quality of the repeated target. Here, we examined the relation between priming and conscious perception of the target by adding a subjective measure of perception. We used backward masking to create liminal perception, that is, different levels of subjectively conscious perception of the target using exactly the same stimulus conditions. The displays in either probe trials (in which priming benefits are measured, experiment 1) or in prime trials (in which memory traces are laid down, experiment 2) were masked. The results showed that intertrial priming improves full access to awareness of the repeated target but only for targets that already achieved partial access to awareness. In addition, they show that full awareness of the target is necessary in both the prime and probe trials for intertrial priming effects to emerge. Implications for the role of implicit short-term memory in visual search are discussed.	\N	\N
23547105	When two objects are flashed at one location in close temporal proximity in the visual periphery, an intriguing illusion occurs whereby a single flash presented concurrently at another location appears to flash twice (the visual double-flash illusion: Chatterjee et al., 2011, Wilson & Singer, 1981). Here, for the first time, we investigate the time course of the effect, and directly compare it to the time course of the auditory (sound-induced flash illusion) effect, for both fission (single test flash, double inducer) and fusion (double test flash, single inducer) conditions, across stimulus onset asynchronies (SOAs) of 30 to 250 ms. In addition, using a novel audiovisual stimulus, we directly compare the cue strength of the two modalities, and whether they are additive in effect. The results show that the time course of fission and fusion is different for visual inducers, but not for auditory inducers. In audiovisual conditions, in situations of uncertainty, observers tended to follow the more reliable (auditory) cue. There was little evidence for a superadditive effect of auditory and visual cues; rather, observers tended to follow one or the other modality. The results suggest that the visually induced flash illusion and the auditory-induced effect may both stem from perceptual uncertainty, with the difference in time courses attributable to the lower temporal resolution of vision compared to audition.	\N	\N
23555217	The processing characteristics of neurons in the central auditory system are directly shaped by and reflect the statistics of natural acoustic environments, but the principles that govern the relationship between natural sound ensembles and observed responses in neurophysiological studies remain unclear. In particular, accumulating evidence suggests the presence of a code based on sustained neural firing rates, where central auditory neurons exhibit strong, persistent responses to their preferred stimuli. Such a strategy can indicate the presence of ongoing sounds, is involved in parsing complex auditory scenes, and may play a role in matching neural dynamics to varying time scales in acoustic signals. In this paper, we describe a computational framework for exploring the influence of a code based on sustained firing rates on the shape of the spectro-temporal receptive field (STRF), a linear kernel that maps a spectro-temporal acoustic stimulus to the instantaneous firing rate of a central auditory neuron. We demonstrate the emergence of richly structured STRFs that capture the structure of natural sounds over a wide range of timescales, and show how the emergent ensembles resemble those commonly reported in physiological studies. Furthermore, we compare ensembles that optimize a sustained firing code with one that optimizes a sparse code, another widely considered coding strategy, and suggest how the resulting population responses are not mutually exclusive. Finally, we demonstrate how the emergent ensembles contour the high-energy spectro-temporal modulations of natural sounds, forming a discriminative representation that captures the full range of modulation statistics that characterize natural sound ensembles. These findings have direct implications for our understanding of how sensory systems encode the informative components of natural stimuli and potentially facilitate multi-sensory integration.	\N	\N
23556554	Auditory and visual digit span tests were administered to a group of absolute pitch (AP) possessors, and a group of AP nonpossessors matched for age, and for age of onset and duration of musical training. All subjects were speakers of English. The AP possessors substantially and significantly outperformed the nonpossessors on the auditory test, while the two groups did not differ significantly on the visual test. It is conjectured that a large auditory memory span, including memory for speech sounds, facilitates the development of associations between pitches and their verbal labels early in life, so promoting the acquisition of AP.	\N	\N
23556594	Thresholds for sinusoids interaurally in phase (S0) and antiphase (Sπ) were measured in the presence of a diotic notched-noise masker (N0) as a function of notch width. The signal frequency was 250, 500, 1000, or 2000 Hz. For all signal frequencies, the difference between N0S0 and N0Sπ thresholds (binaural masking-level difference, BMLD) decreased continuously as the notch width increased. Model simulations showed that this result cannot be accounted for by a model that only processes the output of the auditory filter centered at the signal frequency, even if the nonlinear behavior of the monaural frequency selectivity or interaural differences in the filter shape are considered. The data were predicted well if a detrimental across-channel process was included, either by an addition of portions of the output of adjacent filters to the output of the on-frequency filter or by a notch-width dependent adverse shift in interaural phase in the binaural stage. The strength of this detrimental across-channel process tends to decrease with increasing signal frequencies.	\N	\N
23573184	Stress is prevalent in human life and threatens both physical and mental health; stress coping is thus of adaptive value for an individual's survival and well-being. Although there has been extensive research on how the neural and physiological systems respond to stressful stimulation, relatively little is known about how the brain dynamically copes with stress evoked by this stimulation. Here we investigated how stress is relieved by a popular coping behavior, namely, gum chewing. In an fMRI study, we used loud noise as an acute stressor and asked participants to rate their feeling of stress in gum-chewing and no-chewing conditions. The participants generally felt more stressed when hearing noise, but less so when they were simultaneously chewing gum. The bilateral superior temporal sulcus (STS) and the left anterior insula (AI) were activated by noise, and their activations showed a positive correlation with the self-reported feeling of stress. Critically, gum chewing significantly reduced the noise-induced activation in these areas. Psychophysiological interaction (PPI) analysis showed that the functional connectivity between the left AI and the dorsal anterior cingulate cortex (dACC) was increased by noise to a lesser extent when the participants were chewing gum than when not chewing gum. Dynamic causality modeling (DCM) demonstrated that gum chewing inhibited the connectivity from the STS to the left AI. These findings demonstrate that gum chewing relieves stress by attenuating the sensory processing of external stressors and by inhibiting the propagation of stress-related information in the brain stress network.	\N	\N
23575462	It would be clinically valuable if an electrophysiological validation of hearing aid effectiveness in conveying speech information could be performed when a device is first provided to the individual after electroacoustic verification. This study evaluated envelope following responses (EFRs) elicited by English vowels in a steady state context and in natural sentences. It was the purpose of this study to determine whether EFRs could be detected rapidly enough to be clinically useful. EFRs were elicited using 5 vowels spanning the English vowel space, /i/, /ε/, /æ/, /(Equation is included in full-text article.)/, and /u/. These were presented either as concatenated steady state vowels (total duration 10.04 seconds) or in three 5-word sentences (total duration 11.77 seconds), where each vowel appeared once per sentence. Single-channel electroencephalogram was recorded from vertex (Cz) to the nape of the neck for 190 and 160 repetitions of the steady state vowels and sentences, respectively. The stimuli were presented at 70 dBA SPL. The fundamental frequency (f0) track from the stimuli was used with a Fourier analyzer to estimate the EFRs to each vowel. Noise amplitudes were also calculated at neighboring frequencies. Fifteen normal-hearing subjects who were 20 to 34 years of age participated in the experiment. In the analysis of steady state vowels, the mean response amplitude of /i/ was statistically the largest at 173 nV. The other 4 steady state vowels did not differ in mean response amplitude, which varied between 73 and 106 nV. In the analysis of vowels from the 3 sentences, the largest response amplitudes tended to be for /u/. Mean amplitudes for /u/ were 164, 111, and 140 nV for the words "booed," "food," and "Sue," respectively. The vowel /u/ produced statistically larger responses than /i/, /ε/, and /(Equation is included in full-text article.)/ when grouped across words, whereas other vowels did not differ. Mean response amplitudes for the other vowel categories in the sentences varied between 82 and 105 nV. All subjects showed significant EFRs in response to the words "Bee's" and "booed," but only 9 subjects showed significant EFRs for "pet," "bed," and "Bob." The authors were readily able to detect significant EFRs elicited by vowels in a steady state context and from 3 natural sentences. These results are promising as an early step in developing a clinical tool for validating that vowel stimuli are at least partially encoded at the level of the auditory brainstem. Future research will require evaluation of the technique with aided listeners, where the natural sentences are expected to be treated as typical speech by hearing aid signal-processing algorithms.	\N	\N
23576809	Major depression goes along with affective and social-cognitive deficits. Most research on affective deficits in depression has, however, only focused on unimodal emotion processing, whereas in daily life, emotional perception is often highly dependent on the evaluation of multimodal inputs. We thus investigated emotional audiovisual integration in patients with depression and healthy subjects. Subjects rated the expression of happy, neutral and fearful faces while concurrently being exposed to emotional or neutral sounds. Results demonstrated group differences in left inferior frontal gyrus and inferior parietal cortex when comparing incongruent to congruent happy facial conditions, mainly due to a failure of patients to deactivate these regions in response to congruent stimulus pairs. Moreover, healthy subjects decreased activation in right posterior superior temporal gyrus/sulcus and midcingulate cortex when an emotional stimulus was paired with a neutral rather than another emotional one. In contrast, patients did not show such deactivation when neutral stimuli were integrated. These results demonstrate aberrant neural response in audiovisual processing in depression, indicated by failure to deactivate regions involved in inhibition and salience processing when congruent and neutral audiovisual stimuli pairs are integrated, providing a possible mechanism of constant arousal and readiness to act in this patient group.	\N	\N
23591684	Migraine attacks consist of head pain and hypersensitivities to somatosensory, visual, auditory, and olfactory stimuli. Investigating how the migraine brain simultaneously processes and responds to multiple incoming stimuli may yield insights into migraine pathophysiology and migraine symptoms. The presence and intensity of hypersensitivity to one stimulus type are positively associated with the presence and intensity of hypersensitivities to other stimuli and to headache intensity. Furthermore, exposure to visual, auditory, and olfactory stimuli can trigger migraine attacks. These relationships suggest a role for multisensory integration in migraine. Multisensory integration of somatosensory, visual, auditory, and olfactory stimuli by the migraine brain may be an important concept for understanding migraine.	\N	\N
23593198	It has been proposed that the loudness dependence of auditory evoked potentials (LDAEP) would be a reliable indicator of central serotonin system activity in humans. Serotonin levels and turnover are also increased by brain-derived neurotrophic factor (BDNF). The aim of the present study was to determine whether there is an association between genetic polymorphisms of BDNF and the LDAEP in healthy Korean young adults. The cohort comprised 211 mentally and physically healthy subjects, all of whom were nonsmokers (111 males, 100 females; age: 20-32 years). To avoid hormonal effects, the LDAEP was measured during days 2-5 after the beginning of menstruation for female subjects. In addition, BDNF polymorphisms (rs6265, rs2030324, and rs1491850) were genotyped. The strength of the LDAEP differed significantly among the BDNF genotype groups. Furthermore, the distribution of genotypic frequencies differed significantly between subjects with high and low LDAEPs. In particular, subjects with the Val/Met (A/G) genotype for rs6265, the T/T genotype for rs2030324, or the C/C genotype for rs1491850 had a higher LDAEP, indicating lower central serotonergic activity. A low LDAEP was more prevalent than a high LDAEP among those with the C-T haplotype (C genotype for rs2030324 and T genotype for rs1491850). Our results concur with previous findings on BDNF polymorphisms and serotonergic drug responses in psychiatric disorder patients. The present results suggest the possibility that BDNF polymorphisms and LDAEP patterns can predict altered serotonergic activity.	\N	\N
23603423	Language is more than a source of information for accessing higher-order conceptual knowledge. Indeed, language may determine how people perceive and interpret visual stimuli. Visual processing in linguistic contexts, for instance, mirrors language processing and happens incrementally, rather than through variously-oriented fixations over a particular scene. The consequences of this atypical visual processing are yet to be determined. Here, we investigated the integration of visual and linguistic input during a reasoning task. Participants listened to sentences containing conjunctions or disjunctions (Nancy examined an ant and/or a cloud) and looked at visual scenes containing two pictures that either matched or mismatched the nouns. Degree of match between nouns and pictures (referential anchoring) and between their expected and actual spatial positions (spatial anchoring) affected fixations as well as judgments. We conclude that language induces incremental processing of visual scenes, which in turn becomes susceptible to reasoning errors during the language-meaning verification process.	\N	\N
23613083	The lack of fine structure information in conventional cochlear implant (CI) encoding strategies presumably contributes to the generally poor music perception with CIs. To improve CI users' music perception, a harmonic-single-sideband-encoder (HSSE) strategy was developed, which explicitly tracks the harmonics of a single musical source and transforms them into modulators conveying both amplitude and temporal fine structure cues to electrodes. To investigate its effectiveness, vocoder simulations of HSSE and the conventional continuous-interleaved-sampling (CIS) strategy were implemented. Using these vocoders, five normal-hearing subjects' melody and timbre recognition performance was evaluated: a significant benefit of HSSE to both melody (p < 0.002) and timbre (p < 0.026) recognition was found. Additionally, HSSE was acutely tested in eight CI subjects. On timbre recognition, a significant advantage of HSSE over the subjects' clinical strategy was demonstrated: the largest improvement was 35% and the mean 17% (p < 0.013). On melody recognition, two subjects showed 20% improvement with HSSE; however, the mean improvement of 7% across subjects was not significant (p > 0.090). To quantify the temporal cues delivered to the auditory nerve, the neural spike patterns evoked by HSSE and CIS for one melody stimulus were simulated using an auditory nerve model. Quantitative analysis demonstrated that HSSE can convey temporal pitch cues better than CIS. The results suggest that HSSE is a promising strategy to enhance music perception with CIs.	\N	\N
23615803	Although the ferret has become an important model species for studying both fundamental and clinical aspects of spatial hearing, previous behavioral work has focused on studies of sound localization and spatial release from masking in the free field. This makes it difficult to tease apart the role played by different spatial cues. In humans and other species, interaural time differences (ITDs) and interaural level differences (ILDs) play a critical role in sound localization in the azimuthal plane and also facilitate sound source separation in noisy environments. In this study, we used a range of broadband noise stimuli presented via customized earphones to measure ITD and ILD sensitivity in the ferret. Our behavioral data show that ferrets are extremely sensitive to changes in either binaural cue, with levels of performance approximating that found in humans. The measured thresholds were relatively stable despite extensive and prolonged (>16 weeks) testing on ITD and ILD tasks with broadband stimuli. For both cues, sensitivity was reduced at shorter durations. In addition, subtle effects of changing the stimulus envelope were observed on ITD, but not ILD, thresholds. Sensitivity to these cues also differed in other ways. Whereas ILD sensitivity was unaffected by changes in average binaural level or interaural correlation, the same manipulations produced much larger effects on ITD sensitivity, with thresholds declining when either of these parameters was reduced. The binaural sensitivity measured in this study can largely account for the ability of ferrets to localize broadband stimuli in the azimuthal plane. Our results are also broadly consistent with data from humans and confirm the ferret as an excellent experimental model for studying spatial hearing.	\N	\N
23616552	The combined use of multisensory signals is often beneficial. Based on neuronal recordings in the superior colliculus of cats, three basic rules were formulated to describe the effectiveness of multisensory signals: the enhancement of neuronal responses to multisensory compared with unisensory signals is largest when signals occur at the same location ("spatial rule"), when signals are presented at the same time ("temporal rule"), and when signals are rather weak ("principle of inverse effectiveness"). These rules are also considered with respect to multisensory benefits as observed with behavioral measures, but do they capture these benefits best? To uncover the principles that rule benefits in multisensory behavior, we here investigated the classical redundant signal effect (RSE; i.e., the speedup of response times in multisensory compared with unisensory conditions) in humans. Based on theoretical considerations using probability summation, we derived two alternative principles to explain the effect. First, the "principle of congruent effectiveness" states that the benefit in multisensory behavior (here the speedup of response times) is largest when behavioral performance in corresponding unisensory conditions is similar. Second, the "variability rule" states that the benefit is largest when performance in corresponding unisensory conditions is unreliable. We then tested these predictions in two experiments, in which we manipulated the relative onset and the physical strength of distinct audiovisual signals. Our results, which are based on a systematic analysis of response time distributions, show that the RSE follows these principles very well, thereby providing compelling evidence in favor of probability summation as the underlying combination rule.	\N	\N
23627836	Theories of auditory attention suggest that humans decompose complex auditory input into individual auditory objects, which then compete for attention to dominate auditory perception. Since emotional significance of external stimuli has been argued to provide cues for sensory prioritization and allocation of attention, emotionally salient auditory objects can receive attention to dominate auditory perception. On the basis of the function of audition as an alarm system that informs the organism about its immediate surroundings, and on empirical evidence that emotion can modulate auditory perception, we argue that auditory stimuli with greater emotional saliency would dominate perception in multisource environments. To test our hypothesis, we employed a change detection task in which participants were asked to indicate whether multisource auditory scenes were identical or different. Participants were better at detecting changes in the presence of an emotionally negative environment compared to a neutral environment. Further, we found that participants were better at detecting changes of emotionally negative targets compared to neutral targets. Our results demonstrate that detecting changes in auditory scenes is influenced by emotion. The findings are discussed in the light of the theories of auditory attention, emotional modulation of attention, and the adaptive function of emotion for perception.	\N	\N
23632885	It has been suggested that an auditory phantom percept is the result of multiple, parallel but overlapping networks. One of those networks encodes tinnitus loudness and is electrophysiologically separable from a nonspecific distress network. The present study investigates how these networks anatomically overlap, what networks are involved, and how and when these networks interact. Electroencephalography data of 317 tinnitus patients and 256 healthy subjects were analyzed, using independent component analysis. Results demonstrate that tinnitus is characterized by at least 2 major brain networks, each consisting of multiple independent components. One network reflects tinnitus distress, while another network reflects the loudness of the tinnitus. The component coherence analysis shows that the independent components that make up the distress and loudness networks communicate within their respective network at several discrete frequencies in parallel. The distress and loudness networks do not intercommunicate for patients without distress, but do when patients are distressed by their tinnitus. The obtained data demonstrate that the components that build up these 2 separable networks communicate at discrete frequencies within the network, and only between the distress and loudness networks in those patients in whom the symptoms are also clinically linked.	\N	\N
23632973	The purpose of this study was to determine how the bandwidth of the hearing aid (HA) fitting affects bimodal speech recognition of listeners with a cochlear implant (CI) in one ear and severe-to-profound hearing loss in the unimplanted ear (but with residual hearing sufficient for wideband amplification using National Acoustic Laboratories Revised, Profound [NAL-RP] prescriptive guidelines; unaided thresholds no poorer than 95 dB HL through 2000 Hz). Recognition of sentence material in quiet and in noise was measured with the CI alone and with CI plus HA as the amplification provided by the HA in the high and mid-frequency regions was systematically reduced from the wideband condition (NAL-RP prescription). Modified bandwidths included upper frequency cutoffs of 2000, 1000, or 500 Hz. On average, significant bimodal benefit was obtained when the HA provided amplification at all frequencies with aidable residual hearing. Limiting the HA bandwidth to only low-frequency amplification (below 1000 Hz) did not yield significant improvements in performance over listening with the CI alone. These data suggest the importance of providing amplification across as wide a frequency region as permitted by audiometric thresholds in the HA used by bimodal users.	\N	\N
23639338	To investigate auditory perception, speech production, and language ability of prelingually deaf toddlers with cerebral palsy (CP) who were implanted within a sensitive period and who received proper speech therapy. Comparison of their outcomes with age- and sex-matched CI recipients without additional disabilities was also performed. We retrospectively reviewed a cohort of pediatric CI in Samsung Medical Center. Eight CP subjects who received CI before 3 years of age and age-sex matched control recipients who had no additional disabilities except idiopathic sensorineural hearing loss (SNHL) were included for the analysis. Preoperative evaluation included the Categories of Auditory Performance (CAP) score, Korean Version of the Ling's Stage (K-Ling), Sequenced Language Scale for Infants (SELSI), Bayley Scales of Infant Development II assessment, Social Maturity Scale test, and grading of CP severity using severity level and Gross Motor Function Classification System for CP (GMFCS). To measure the outcome, the CAP scores, K-Ling, and SELSI were performed at 3, 6, 12, and 24 months after implantation. Four CP children with outstanding performances showed comparable achievement with matched control recipients. These patients had less severe motor disabilities (mild-moderate severity, GMFCS level 1-3), better social quotient, and better cognitive abilities. Although the others showed poor language abilities and hardly produced meaningful speech, their CAP scores reached 1 or 2 in 24 months after implantation. Deaf children with CP could have various ranges of benefits up to the levels of normal peers whose only disability was hearing loss, when CI was performed within a critical period. In particular, children with mild or moderate CP had a favorable outcome after CI, equivalent to that of normal peers.	\N	\N
23653412	It has previously been shown that the perceived roughness of a surface touched by one digit is influenced by the roughness of a different surface touched simultaneously by another digit on the same hand. The present study was designed to examine whether this is the case when surfaces of varying roughness are touched using digits on separate hands. Participants touched pairs of sandpaper surfaces, in sequence, using the same digit, and identified which of the two was rougher. Roughness discrimination was measured in the presence of distractor surfaces touched simultaneously with the target surface, but using a different digit either on the same or on the other hand. The overall perception of roughness of the attended surfaces was better on the left than on the right hand. Perceived roughness also varied systematically with the roughness of the distractor surfaces. Attended surfaces were more likely to be perceived as smoother when they were paired with smooth rather than rough distractors. Likewise, attended surfaces tended to be perceived as rougher with rough distractors. This pattern of results occurred whether the attended and distractor digits were on the same hand or different hands. These data confirm that it is difficult to restrict tactile attention for roughness to a single digit and show that this difficulty extends to restricting attention to a single hand. Furthermore, the effect of a stimulus at an unattended body location was not simply to impair perception in general, but to bias it in the roughness direction of the distractor surface.	\N	\N
23654389	Auditory filter bandwidths are measured for a temporal process using an amplitude-modulation detection task. The signal is a 200 Hz wide, sinusoidally amplitude-modulated band of noise centered within an unmodulated notched-noise masker. A modulation rate of 10 Hz is used to avoid possible information loss at more central processing levels for high modulation rates. Threshold functions are obtained for 10-14 notch widths for each of four different center frequencies (0.6, 1, 2, and 4 kHz) to determine the maximum notch width at which the masker has an effect. The ratio of center frequency to maximum notch width is ~2 at all center frequencies. It is proposed that the bandwidths observed in temporal tasks, which are consistently greater than expected from the viewpoint of critical band theory, be characterized as "temporal critical bands." This proposal does not oppose, but provides a complement to the traditional critical band obtained in tasks involving spectral discrimination.	\N	\N
23654392	Measurement of sensitivity to differences in the rate of change of auditory signal parameters is complicated by confounds among duration, extent, and velocity of the changing signal. Dooley and Moore [(1988) J. Acoust. Soc. Am. 84(4), 1332-1337] proposed a method for measuring sensitivity to rate of change using a duration discrimination task. They reported improved duration discrimination when an additional intensity or frequency change cue was present. The current experiments were an attempt to use this method to measure sensitivity to the rate of change in intensity and spatial position. Experiment 1 investigated whether duration discrimination was enhanced when additional cues of rate of intensity change, rate of spatial position change, or both were provided. Experiment 2 determined whether participant listening experience or the testing environment influenced duration discrimination task performance. Experiment 3 assessed whether duration discrimination could be used to measure sensitivity to rates of changes in intensity and spatial position for stimuli with lower rates of change, as well as emphasizing the constancy of the velocity cue. Results of these experiments showed that duration discrimination was impaired rather than enhanced by the additional velocity cues. The findings are discussed in terms of the demands of listening to concurrent changes along multiple auditory dimensions.	\N	\N
23654413	Inharmonicity of piano tones is an essential property of their timbre that strongly influences the tuning, leading to the so-called octave stretching. It is proposed in this paper to jointly model the inharmonicity and tuning of pianos on the whole compass. While using a small number of parameters, these models are able to reflect both the specificities of instrument design and tuner's practice. An estimation algorithm is derived that can run either on a set of isolated note recordings or on chord recordings, assuming that the played notes are known. It is applied to extract parameters highlighting some tuner's choices on different piano types and to propose tuning curves for out-of-tune pianos or piano synthesizers.	\N	\N
23656101	In this study, two methods are proposed to modify the normalized covariance metric (NCM) measure to reduce the effects of gain-induced nonlinear distortions introduced by most noise-suppression algorithms. Considering that the gain-induced distortions behave differently dependent on the signal-to-noise ratio between the noise-reduced speech and the noise, the first approach introduces a penalty factor involving this ratio in the modified NCM measure. The second approach deemphasizes segments marked with amplification distortions that contribute less to intelligibility via adaptive thresholding. Significantly higher correlations with intelligibility scores were obtained from the modified NCM measures compared with the original NCM measures.	\N	\N
23656102	A reference-free speech quality measure is proposed and assessed for hearing aid applications. The proposed speech quality metric is validated with subjective ratings obtained from hearing impaired listeners under a number of noisy and reverberant conditions. In addition, a comparison is drawn between the proposed measure and a state-of-the-art electroacoustic measure that relies on a clean reference signal. The results showed that the reference-free measure had a lower correlation with the subjective ratings of hearing aid speech quality in comparison to the correlations achieved by the measure utilizing a reference signal. Nevertheless, advantages of the reference-free approach are discussed.	\N	\N
23658664	In reverberant rooms with multiple people talking, spatial separation between speech sources improves recognition of attended speech, even though both the head-shadowing and interaural-interaction unmasking cues are limited by numerous reflections. It is the perceptual integration between the direct wave and its reflections that bridges the direct-reflection temporal gaps and results in the spatial unmasking under reverberant conditions. This study further investigated (1) the temporal dynamic of the direct-reflection-integration-based spatial unmasking as a function of the reflection delay, and (2) whether this temporal dynamic is correlated with the listeners' auditory ability to temporally retain raw acoustic signals (i.e., the fast decaying primitive auditory memory, PAM). The results showed that recognition of the target speech against the speech-masker background is a descending exponential function of the delay of the simulated target reflection. In addition, the temporal extent of PAM is frequency dependent and markedly longer than that for perceptual fusion. More importantly, the temporal dynamic of the speech-recognition function is significantly correlated with the temporal extent of the PAM of low-frequency raw signals. Thus, we propose that a chain process, which links the earlier-stage PAM with the later-stage correlation computation, perceptual integration, and attention facilitation, plays a role in spatially unmasking target speech under reverberant conditions.	\N	\N
23665378	The integration of auditory feedback with vocal motor output is important for the control of voice fundamental frequency (F0). We used a pitch-shift paradigm where subjects respond to an alteration, or shift, of voice pitch auditory feedback with a reflexive change in F0. We presented varying magnitudes of pitch shifted auditory feedback to subjects during vocalization and passive listening and measured event related potentials (ERPs) to the feedback shifts. Shifts were delivered at +100 and +400 cents (200 ms duration). The ERP data were modeled with dynamic causal modeling (DCM) techniques where the effective connectivity between the superior temporal gyrus (STG), inferior frontal gyrus and premotor areas were tested. We compared three main factors: the effect of intrinsic STG connectivity, STG modulation across hemispheres and the specific effect of hemisphere. A Bayesian model selection procedure was used to make inference about model families. Results suggest that both intrinsic STG and left to right STG connections are important in the identification of self-voice error and sensory motor integration. We identified differences in left-to-right STG connections between 100 cent and 400 cent shift conditions suggesting that self- and non-self-voice error are processed differently in the left and right hemisphere. These results also highlight the potential of DCM modeling of ERP responses to characterize specific network properties of forward models of voice control.	\N	\N
23667666	Audition--what listeners hear--is generally studied in terms of the physical properties of sound stimuli and physiological properties of the auditory system. Based on recent work in vision, we here consider an alternative perspective that sensory percepts are based on past experience. In this framework, basic auditory qualities (e.g., loudness and pitch) are based on the frequency of occurrence of stimulus patterns in natural acoustic stimuli. To explore this concept of audition, we examined five well-documented psychophysical functions. The frequency of occurrence of acoustic patterns in a database of natural sound stimuli (speech) predicts some qualitative aspects of these functions, but with substantial quantitative discrepancies. This approach may offer a rationale for auditory phenomena that are difficult to explain in terms of the physical attributes of the stimuli as such.	\N	\N
23678126	For effective interactions with our dynamic environment, it is critical for the brain to integrate motion information from the visual and auditory senses. Combining fMRI and psychophysics, this study investigated how the human brain integrates auditory and visual motion into benefits in motion discrimination. Subjects discriminated the motion direction of audiovisual stimuli that contained directional motion signal in the auditory, visual, audiovisual, or no modality at two levels of signal reliability. Therefore, this 2 × 2 × 2 factorial design manipulated: (1) auditory motion information (signal vs noise), (2) visual motion information (signal vs noise), and (3) reliability of motion signal (intact vs degraded). Behaviorally, subjects benefited significantly from audiovisual integration primarily for degraded auditory and visual motion signals while obtaining near ceiling performance for "unisensory" signals when these were reliable and intact. At the neural level, we show audiovisual motion integration bilaterally in the visual motion areas hMT+/V5+ and implicate the posterior superior temporal gyrus/planum temporale in auditory motion processing. Moreover, we show that the putamen integrates audiovisual signals into more accurate motion discrimination responses. Our results suggest audiovisual integration processes at both the sensory and response selection levels. In all of these regions, the operational profile of audiovisual integration followed the principle of inverse effectiveness, in which audiovisual response suppression for intact stimuli turns into response enhancements for degraded stimuli. This response profile parallels behavioral indices of audiovisual integration, in which subjects benefit significantly from audiovisual integration only for the degraded conditions.	\N	\N
23684863	In ordinary conversations, literal meanings of an utterance are often quite different from implicated meanings and the inference about implicated meanings is essentially required for successful comprehension of the speaker's utterances. Inference of finding implicated meanings is based on the listener's assumption that the conversational partner says only relevant matters according to the maxim of relevance in Grice's theory of conversational implicature. To investigate the neural correlates of comprehending implicated meanings under the maxim of relevance, a total of 23 participants underwent an fMRI task with a series of conversational pairs, each consisting of a question and an answer. The experimental paradigm was composed of three conditions: explicit answers, moderately implicit answers, and highly implicit answers. Participants were asked to decide whether the answer to the Yes/No question meant 'Yes' or 'No'. Longer reaction time was required for the highly implicit answers than for the moderately implicit answers without affecting the accuracy. The fMRI results show that the left anterior temporal lobe, left angular gyrus, and left posterior middle temporal gyrus had stronger activation in both moderately and highly implicit conditions than in the explicit condition. Comprehension of highly implicit answers had increased activations in additional regions including the left inferior frontal gyrus, left medial prefrontal cortex, left posterior cingulate cortex and right anterior temporal lobe. The activation results indicate involvement of these regions in the inference process to build coherence between literally irrelevant but pragmatically associated utterances under the maxim of relevance. In particular, the left anterior temporal lobe showed high sensitivity to the level of implicitness and showed increased activation for highly versus moderately implicit conditions, which implies its central role in inference such as semantic integration. 
The right hemisphere activation, uniquely found in the anterior temporal lobe for highly implicit utterances, suggests its competence for integrating distant concepts in implied utterances under the relevance principle.	\N	\N
23691185	Listening to and understanding people in a "cocktail-party situation" is a remarkable feature of the human auditory system. Here we investigated the neural correlates of the ability to localize a particular sound among others in an acoustically cluttered environment with healthy subjects. In a sound localization task, five different natural sounds were presented from five virtual spatial locations during functional magnetic resonance imaging (fMRI). Activity related to auditory stream segregation was revealed in posterior superior temporal gyrus bilaterally, anterior insula, supplementary motor area, and frontoparietal network. Moreover, the results indicated critical roles of left planum temporale in extracting the sound of interest among acoustical distracters and the precuneus in orienting spatial attention to the target sound. We hypothesized that the left-sided lateralization of the planum temporale activation is related to the higher specialization of the left hemisphere for analysis of spectrotemporal sound features. Furthermore, the precuneus - a brain area known to be involved in the computation of spatial coordinates across diverse frames of reference for reaching to objects - seems to be also a crucial area for accurately determining locations of auditory targets in an acoustically complex scene of multiple sound sources. The precuneus thus may not only be involved in visuo-motor processes, but may also subserve related functions in the auditory modality.	\N	\N
23694737	Multiple auditory steady-state responses (ASSRs) to air- and bone-conduction stimuli were recorded in young children with otitis media with effusion (OME). After treatment for OME, differences between pre-treatment bone-conduction ASSR levels and post-treatment conditioned orientation reflex (COR) or air-conduction ASSR levels were examined, and compared with ASSR-estimated air-bone gap (ABG) before treatment. Navigator Pro with Master was used to assess the threshold of air- and bone-conduction ASSR in both ears at 500 Hz, 1000 Hz, 2000 Hz and 4000 Hz. For bone-conduction ASSR, RadioEar B-71 bone-vibrator placed on the mastoid was used with white-noise masking on the contralateral ear. After ventilation tube placement, the thresholds of COR got closer to those of pre-treatment bone-conduction ASSR in young children with OME. Moreover, post-treatment air-conduction ASSR thresholds also got closer to those of pre-treatment bone-conduction ASSR. The differences between pre-treatment bone-conduction ASSR thresholds and post-treatment COR or air-conduction ASSR thresholds became much smaller than ASSR-estimated ABG before treatment. These findings suggest that bone-conduction ASSR can assess the normal or near normal cochlear sensitivity in young children with conductive hearing loss. It is also suggested that ASSR-estimated ABG can be used clinically to predict their accurate ABG.	\N	\N
23694738	The aim of our study is to investigate the relationship between the complaint of speech understanding in noisy environments and the findings of contralateral suppression of transient evoked otoacoustic emissions and speech recognition in noise test methods in individuals with normal hearing. Sixty-nine subjects between 18 and 53 years of age with normal hearing participated in the present study. The subjects were assigned to one of two groups: reported difficulty understanding speech in noise or no reported difficulty understanding speech in noise. After hearing and immittancemetric evaluation, contralateral suppression of transient evoked otoacoustic emissions and speech recognition in noise tests were administered to both groups. Suppression was calculated in half-octave frequency bands centered at 1.0, 1.5, 2.0, 3.0 and 4.0kHz. We found that the speech recognition in noise scores and contralateral suppression values were lower in subjects with the complaint of speech understanding in noise than in those without such complaints. We concluded that the complaint of speech understanding in noise may be related to dysfunction of the medial efferent system, and thus of the central auditory nervous system.	\N	\N
23700960	We investigated the effects of focusing attention towards auditory or somatosensory stimuli on perceptual sensitivity and response bias using a signal detection task. Participants (N = 44) performed an unspeeded detection task in which weak (individually calibrated) somatosensory or auditory stimuli were delivered. The focus of attention was manipulated by the presentation of a visual cue at the start of each trial. The visual cue consisted of the word "warmth" or the word "tone". This word cue was predictive of the corresponding target on two-thirds of the trials. As hypothesised, the results showed that cueing attention to a specific sensory modality resulted in a higher perceptual sensitivity for validly cued targets than for invalidly cued targets, as well as in a more liberal response criterion for reporting stimuli in the valid modality than in the invalid modality. The value of this experimental paradigm for investigating excessive attentional focus or hypervigilance in various non-clinical and clinical populations is discussed.	\N	\N
23705807	Cochlear implantation (CI) is a standard treatment for severe-profound sensorineural hearing loss (SNHL). However, consensus has yet to be reached on its effectiveness for hearing loss caused by auditory neuropathy spectrum disorder (ANSD). This review aims to summarize and synthesize current evidence of the effectiveness of CI in improving speech recognition in children with ANSD. Systematic review. A total of 27 studies from an initial selection of 237. All selected studies were observational in design, including case studies, cohort studies, and comparisons between children with ANSD and SNHL. Most children with ANSD achieved open-set speech recognition with their CI. Speech recognition ability was found to be equivalent in CI users (who previously performed poorly with hearing aids) and hearing-aid users. Outcomes following CI generally appeared similar in children with ANSD and SNHL. Assessment of study quality, however, suggested substantial methodological concerns, particularly in relation to issues of bias and confounding, limiting the robustness of any conclusions around effectiveness. Currently available evidence is compatible with favourable outcomes from CI in children with ANSD. However, this evidence is weak. Stronger evidence is needed to support cost-effective clinical policy and practice in this area.	\N	\N
23708733	Dyslexic and non-dyslexic readers engaged in a short training aimed at learning eight basic letter-speech sound correspondences within an artificial orthography. We examined whether a letter-speech sound binding deficit is behaviorally detectable within the initial steps of learning a novel script. Both letter knowledge and word reading ability within the artificial script were assessed. An additional goal was to investigate the influence of instructional approach on the initial learning of letter-speech sound correspondences. We assigned children from both groups to one of three different training conditions: (a) explicit instruction, (b) implicit associative learning within a computer game environment, or (c) a combination of (a) and (b) in which explicit instruction is followed by implicit learning. Our results indicated that dyslexics were outperformed by the controls on a time-pressured binding task and a word reading task within the artificial orthography, providing empirical support for the view that a letter-speech sound binding deficit is a key factor in dyslexia. A combination of explicit instruction and implicit techniques proved to be a more powerful tool in the initial teaching of letter-sound correspondences than implicit training alone.	\N	\N
23714710	To investigate safety and efficacy of a new transcutaneous bone conduction hearing implant, over a 3-month follow-up period. Prospective, single-subject repeated-measures design in which each subject serves as his/her own control. Departments of Otolaryngology at 4 hospitals in Germany and Austria. Subjects were 12 German-speaking adults who suffered from conductive or mixed hearing loss. The upper bone conduction threshold limit was set to 45 dB HL at frequencies between 500 Hz and 4 kHz. Implantation of a transcutaneous bone conduction hearing implant. Subjects' speech perception (word recognition scores and SRT 50%) and audiometric thresholds (air conduction, bone conduction and sound field at frequencies 500 Hz to 8 kHz) were assessed preoperatively, 1 month postoperatively and 3 months postoperatively. The subjects were monitored for adverse events and given a questionnaire to assess their satisfaction levels. Speech perception as measured by word recognition scores and SRT 50% improved on average about 78.8% and 25 dB HL, respectively, 3 months after implantation. Aided thresholds also improved postoperatively at all tested frequencies and continued to improve from 1 to 3 months postoperatively. Air conduction and bone conduction thresholds showed no significant changes, confirming that subjects' residual unaided hearing was not deteriorated by the treatment. Only minor adverse events were reported and resolved by the end of the study. The new transcutaneous bone conduction implant was demonstrated to be safe and effective in adults up to 3 months of device use.	\N	\N
23715097	In this study, we used magnetoencephalography and a mismatch paradigm to investigate speech processing in stroke patients with auditory comprehension deficits and age-matched control subjects. We probed connectivity within and between the two temporal lobes in response to phonemic (different word) and acoustic (same word) oddballs using dynamic causal modelling. We found stronger modulation of self-connections as a function of phonemic differences for control subjects versus aphasics in left primary auditory cortex and bilateral superior temporal gyrus. The patients showed stronger modulation of connections from right primary auditory cortex to right superior temporal gyrus (feed-forward) and from left primary auditory cortex to right primary auditory cortex (interhemispheric). This differential connectivity can be explained on the basis of a predictive coding theory which suggests increased prediction error and decreased sensitivity to phonemic boundaries in the aphasics' speech network in both hemispheres. Within the aphasics, we also found behavioural correlates with connection strengths: a negative correlation between phonemic perception and an inter-hemispheric connection (left superior temporal gyrus to right superior temporal gyrus), and positive correlation between semantic performance and a feedback connection (right superior temporal gyrus to right primary auditory cortex). Our results suggest that aphasics with impaired speech comprehension have less veridical speech representations in both temporal lobes, and rely more on the right hemisphere auditory regions, particularly right superior temporal gyrus, for processing speech. Despite this presumed compensatory shift in network connectivity, the patients remain significantly impaired.	\N	\N
23716019	Our environment is richly structured, with objects producing correlated information within and across sensory modalities. A prominent challenge faced by our perceptual system is to learn such regularities. Here, we examined statistical learning and addressed learners' ability to track transitional probabilities between elements in the auditory and visual modalities. Specifically, we investigated whether cross-modal information affects statistical learning within a single modality. Participants were familiarized with a statistically structured modality (e.g., either audition or vision) accompanied by different types of cues in a second modality (e.g., vision or audition). The results revealed that statistical learning within either modality is affected by cross-modal information, with learning being enhanced or reduced according to the type of cue provided in the second modality.	\N	\N
23716122	Cross-orientation masking (XOM) occurs when the detection of a test grating is masked by a superimposed grating at an orthogonal orientation, and is thought to reveal the suppressive effects mediating contrast normalization. Medina and Mullen (2009) reported that XOM was greater for chromatic than achromatic stimuli at equivalent spatial and temporal frequencies. Here we address whether the greater suppression found in binocular color vision originates from a monocular or interocular site, or both. We measure monocular and dichoptic masking functions for red-green color contrast and achromatic contrast at three different spatial frequencies (0.375, 0.75, and 1.5 cpd, 2 Hz). We fit these functions with a modified two-stage masking model (Meese & Baker, 2009) to extract the monocular and interocular weights of suppression. We find that the weight of monocular suppression is significantly higher for color than achromatic contrast, whereas dichoptic suppression is similar for both. These effects are invariant across spatial frequency. We then apply the model to the binocular masking data using the measured values of the monocular and interocular sources of suppression and show that these are sufficient to account for color binocular masking. We conclude that the greater strength of chromatic XOM has a monocular origin that transfers through to the binocular site.	\N	\N
23716218	High-frequency pure tones (>6 kHz), which alone do not produce salient melodic pitch information, provide melodic pitch information when they form part of a harmonic complex tone with a lower fundamental frequency (F0). We explored this phenomenon in normal-hearing listeners by measuring F0 difference limens (F0DLs) for harmonic complex tones and pure-tone frequency difference limens (FDLs) for each of the tones within the harmonic complexes. Two spectral regions were tested. The low- and high-frequency band-pass regions comprised harmonics 6-11 of a 280- or 1,400-Hz F0, respectively; thus, for the high-frequency region, audible frequencies present were all above 7 kHz. Frequency discrimination of inharmonic log-spaced tone complexes was also tested in control conditions. All tones were presented in a background of noise to limit the detection of distortion products. As found in previous studies, F0DLs in the low region were typically no better than the FDL for each of the constituent pure tones. In contrast, F0DLs for the high-region complex were considerably better than the FDLs found for most of the constituent (high-frequency) pure tones. The data were compared with models of optimal spectral integration of information, to assess the relative influence of peripheral and more central noise in limiting performance. The results demonstrate a dissociation in the way pitch information is integrated at low and high frequencies and provide new challenges and constraints in the search for the underlying neural mechanisms of pitch.	\N	\N
23716223	We report a series of psychophysics experiments that investigated listeners' sensitivity to changes in complex acoustic scenes. Specifically, we sought to test the hypothesis that change detection is supported by sensitivity to change-related transients (an abrupt change in stimulus power within a certain frequency band, associated with the appearance or disappearance of a scene element). This hypothesis, in the context of natural scenes, is commonly dismissed on account that the elements of the scene may themselves be characterized by on-going energy fluctuations that would mask any genuine change-related transients. We created artificial 'scenes' populated by multiple pure-tone components. Tones were modulated (by a square wave at a distinct rate) so as to mimic the fluctuation properties of complex sounds. "Change" was defined as the appearance or disappearance of one such element. Importantly, such scenes lack semantic attributes, which may have been a limiting factor in interpreting previous auditory change-detection studies, thus allowing us to probe the low-level, pre-semantic, processes involved in auditory change perception. In Experiment 1 we measured listeners' ability to detect item appearance and disappearance in conditions where change-related transients are masked by a silent gap. In Experiment 2, we investigated the effect of an acoustic distractor - a brief signal that occurs at the time of change, but does not mask any scene components. The data show that gaps adversely affected the processing of item appearance but not disappearance. However, distractors reduced both appearance and disappearance detection. Together our results suggest a role for sensitivity to transients in the process of auditory change detection, similar to what has been demonstrated for visual change detection.	\N	\N
23716230	Advances in the design of cochlear implants (CIs), as well as improved CI surgical techniques, have led to an increase in the number of patients who retain some residual low-frequency acoustic hearing in the implanted ear. Many of these patients also possess some hearing in the unimplanted ear. Although their low-frequency audiometric configurations will likely be asymmetrical across ears, they may nevertheless be able to process interaural time differences (ITDs) which might aid them in localizing sound sources and achieving a spatial release from masking. We recently published research (Brown and Yost 2011) showing how sensitivity to ITD differences was affected when the stimulus bandwidths were varied between the ears, to simulate asymmetrical hearing loss in the low-frequency region. We showed that ITD discrimination thresholds decreased as the bandwidth of the noise presented to one ear increased beyond that presented to the other ear. In the current experiment, we expand upon those conditions to further explore ITD processing in the presence of interaural spectral differences. ITD sensitivity was measured when a fixed band of noise was presented to one ear and the center frequency of a spectral band of the same width was moved upward in frequency in the other ear. The data suggest that listeners have difficulty attending to ITD differences in one spectral region when there are other spectral regions that contain conflicting or inconsistent spatial information, which is likely to be the case for many CI patients who possess bilateral residual hearing.	\N	\N
23716240	Jørgensen and Dau (J Acoust Soc Am 130:1475-1487, 2011) proposed the speech-based envelope power spectrum model (sEPSM) in an attempt to overcome the limitations of the classical speech transmission index (STI) and speech intelligibility index (SII) in conditions with nonlinearly processed speech. Instead of considering the reduction of the temporal modulation energy as the intelligibility metric, as assumed in the STI, the sEPSM applies the signal-to-noise ratio in the envelope domain (SNRenv). This metric was shown to be the key for predicting the intelligibility of reverberant speech as well as noisy speech processed by spectral subtraction. The key role of the SNRenv metric is further supported here by the ability of a short-term version of the sEPSM to predict speech masking release for different speech materials and modulated interferers. However, the sEPSM cannot account for speech subjected to phase jitter, a condition in which the spectral structure of the speech signal is strongly affected, while the broadband temporal envelope is kept largely intact. In contrast, the effects of this distortion can be predicted successfully by the spectro-temporal modulation index (STMI) (Elhilali et al., Speech Commun 41:331-348, 2003), which assumes an explicit analysis of the spectral "ripple" structure of the speech signal. However, since the STMI applies the same decision metric as the STI, it fails to account for spectral subtraction. The results from this study suggest that the SNRenv might reflect a powerful decision metric, while some explicit across-frequency analysis seems crucial in some conditions. How such across-frequency analysis is "realized" in the auditory system remains unresolved.	\N	\N
23716257	We recently showed that listeners with normal hearing thresholds vary in their ability to direct spatial attention and that ability is related to the fidelity of temporal coding in the brainstem. Here, we recruited additional middle-aged listeners and extended our analysis of the brainstem response, measured using the frequency-following response (FFR). We found that even though age does not predict overall selective attention ability, middle-aged listeners are more susceptible to the detrimental effects of reverberant energy than young adults. We separated the overall FFR into orthogonal envelope and carrier components and used an existing model to predict which auditory channels drive each component. We find that responses in mid- to high-frequency auditory channels dominate envelope FFR, while lower-frequency channels dominate the carrier FFR. Importantly, we find that which component of the FFR predicts selective attention performance changes with age. We suggest that early aging degrades peripheral temporal coding in mid-to-high frequencies, interfering with the coding of envelope interaural time differences. We argue that, compared to young adults, middle-aged listeners, who do not have strong temporal envelope coding, have more trouble following a conversation in a reverberant room because they are forced to rely on fragile carrier ITDs that are susceptible to the degrading effects of reverberation.	\N	\N
23716261	Humans and other animals can attend to one of multiple sounds, and follow it selectively over time. The neural underpinnings of this perceptual feat remain mysterious. Some studies have concluded that sounds are heard as separate streams when they activate well-separated populations of central auditory neurons, and that this process is largely pre-attentive. Here, we propose instead that stream formation depends primarily on temporal coherence between responses that encode various features of a sound source. Furthermore, we postulate that only when attention is directed toward a particular feature (e.g., pitch or location) do all other temporally coherent features of that source (e.g., timbre and location) become bound together as a stream that is segregated from the incoherent features of other sources. Experimental neurophysiological evidence in support of this hypothesis will be presented. The focus, however, will be on a computational realization of this idea and a discussion of the insights learned from simulations to disentangle complex sound sources such as speech and music. The model consists of a representational stage of early and cortical auditory processing that creates a multidimensional depiction of various sound attributes such as pitch, location, and spectral resolution. The following stage computes a coherence matrix that summarizes the pair-wise correlations between all channels making up the cortical representation. Finally, the perceived segregated streams are extracted by decomposing the coherence matrix into its uncorrelated components. Questions raised by the model are discussed, especially on the role of attention in streaming and the search for further neural correlates of streaming percepts.	\N	\N
23720086	The CHRNA4 gene is known to be associated with individual differences in attention. However, its associations with other cognitive functions remain to be elucidated. In the present study, we investigated the effects of genetic variations in CHRNA4 on rapid scene categorization by 100 healthy human participants. In Experiment 1, we also conducted the Attention Network Test (ANT) in order to examine whether the genetic effects could be accounted for by attention. CHRNA4 was genotyped as carrying the TT, CT, or CC allele. The scene categorization task required participants to judge whether the category of a scene image (natural or man-made) was consistent with a cue word displayed at the response phase. The target-mask stimulus onset asynchrony (SOA) ranged from 13 to 93 ms. In comparison with CC-allele carriers, CT- and TT-allele carriers responded more accurately at the long SOA (93 ms) only during natural-scene categorization. In contrast, we observed no consistent association between CHRNA4 and the ANT, and no intertask correlation between scene categorization and the ANT. To validate our natural-scene categorization results, Experiment 2, carried out with an independent sample of 100 participants and a different stimulus set, successfully replicated the association between CHRNA4 genotypes and natural-scene categorization accuracy at long SOAs (67 and 93 ms). Our findings demonstrate, for the first time, that genetic variations in CHRNA4 can moderately contribute to individual differences in natural-scene categorization performance.	\N	\N
23727710	Successful interactions between people are dependent on rapid recognition of social cues. We investigated whether head direction--a powerful social signal--is processed in the absence of conscious awareness. We used continuous flash interocular suppression to render stimuli invisible and compared the reaction time for face detection when faces were turned towards the viewer and turned slightly away. We found that faces turned towards the viewer break through suppression faster than faces that are turned away, regardless of eye direction. Our results suggest that detection of a face with attention directed at the viewer occurs even in the absence of awareness of that face. While previous work has demonstrated that stimuli that signal threat are processed without awareness, our data suggest that the social relevance of a face, defined more broadly, is evaluated in the absence of awareness.	\N	\N
23734220	To understand why human sensitivity for complex objects is so low, we study how word identification combines eye and ear or parts of a word (features, letters, syllables). Our observers identify printed and spoken words presented concurrently or separately. When researchers measure threshold (energy of the faintest visible or audible signal) they may report either sensitivity (one over the human threshold) or efficiency (ratio of the best possible threshold to the human threshold). When the best possible algorithm identifies an object (like a word) in noise, its threshold is independent of how many parts the object has. But, with human observers, efficiency depends on the task. In some tasks, human observers combine parts efficiently, needing hardly more energy to identify an object with more parts. In other tasks, they combine inefficiently, needing energy nearly proportional to the number of parts, over a 60∶1 range. Whether presented to eye or ear, efficiency for detecting a short sinusoid (tone or grating) with few features is a substantial 20%, while efficiency for identifying a word with many features is merely 1%. Why? We show that the low human sensitivity for words is a cost of combining their many parts. We report a dichotomy between inefficient combining of adjacent features and efficient combining across senses. Joining our results with a survey of the cue-combination literature reveals that cues combine efficiently only if they are perceived as aspects of the same object. Observers give different names to adjacent letters in a word, and combine them inefficiently. Observers give the same name to a word's image and sound, and combine them efficiently. The brain's machinery optimally combines only cues that are perceived as originating from the same object. Presumably such cues each find their own way through the brain to arrive at the same object representation.	\N	\N
23742322	This study tested the hypothesis that the reduced spatial release from speech-on-speech masking typically observed in listeners with sensorineural hearing loss results from increased energetic masking. Target sentences were presented simultaneously with a speech masker, and the spectral overlap between the pair (and hence the energetic masking) was systematically varied. The results are consistent with increased energetic masking in listeners with hearing loss that limits performance when listening in speech mixtures. However, listeners with hearing loss did not exhibit reduced spatial release from masking when stimuli were filtered into narrow bands.	\N	\N
23742375	This study investigates the effectiveness of three high variability training paradigms in training 42 speakers of American English to correctly perceive and produce Spanish intervocalic /d, ɾ, r/. Since Spanish spirantization and English flapping both affect /d/ intervocalically, the acquisition of the /d/-/ɾ/ contrast proves difficult for English learners of Spanish. The acquisition of the trill /r/ is also problematic because it is a new phoneme for English learners and is articulatorily difficult to produce. Past research reported that high-variability perceptual training improves both perception and production [Bradlow et al., J. Acoust. Soc. Am. 101, 2299-2310 (1997); Wang et al., J. Acoust. Soc. Am. 113, 1033-1043 (2003)] and that production training improves both as well [Hirata, Comp. Assisted Lang. Learning 17, 357-376 (2004)]. However, trainees were able to listen to stimuli during production training, making it unclear whether production training alone transfers to perception. This study systematically controls both training modalities so they can be directly compared and introduces a third training methodology that includes both perception and production. All three training paradigms proved effective. While perception and production trainees primarily made gains in perception, combination trainees made gains in production. The effectiveness of each training modality depended on the nature of the contrast being trained and the modality of the test.	\N	\N
23751862	We used magnetoencephalography (MEG) to localize brain activity related to the retention of tones differing in pitch. Participants retained one or two simultaneously presented tones. After a two second interval a test tone was presented and the task was to determine if that tone was in memory. We focused on brain activity during the retention interval that increased as the number of sounds retained in auditory short-term memory (ASTM) increased. Source analyses revealed that the superior temporal gyrus in both hemispheres is involved in ASTM. In the right hemisphere, the inferior temporal gyrus, the inferior frontal gyrus, and parietal structures also play a role. Our method provides good spatial and temporal resolution for investigating neuronal correlates of ASTM and, as it is the first MEG study using a memory load manipulation without using sequences of tones, it allowed us to isolate brain regions that most likely reflect the simple retention of tones.	\N	\N
23751864	Neurobiological correlates of adaptation to spectrally degraded speech were investigated with fMRI before and after exposure to a portable real-time speech processor that implements an acoustic simulation model of a cochlear implant (CI). The speech processor, in conjunction with isolating insert earphones and a microphone to capture environment sounds, was worn by participants over a two week chronic exposure period. fMRI and behavioral speech comprehension testing were conducted before and after this two week period. After using the simulator each day for 2h, participants significantly improved in word and sentence recognition scores. fMRI shows that these improvements came accompanied by changes in patterns of neuronal activation. In particular, we found additional recruitment of visual, motor, and working memory areas after the perceptual training period. These findings suggest that the human brain is able to adapt in a short period of time to a degraded auditory signal under a natural learning environment, and gives insight on how a CI might interact with the central nervous system. This paradigm can be furthered to investigate neural correlates of new rehabilitation, training, and signal processing strategies non-invasively in normal hearing listeners to improve CI patient outcomes.	\N	\N
23757047	Temporal orienting--that is, selective attention to instants in time--has been shown to modulate performance in terms of faster responses in a variety of paradigms. Electrophysiological recordings have shown that temporal orienting modulates neural processing at early, probably perceptual, and late, probably decision- or response-related, stages. Recently, it was shown that the effect of temporal orienting on early auditory brain potentials is independent of the effect of the physical sound feature intensity. This indicates that temporal orienting might not affect stimulus processing by increasing the sensory gain of attended stimuli. In the present study, we investigated whether the independence of temporal-orienting and sound-intensity effects could be replicated behaviorally. Sequences were presented that were either rhythmic, most likely creating temporal expectations, or arrhythmic, presumably not creating such expectations. As hypothesized, the main effects of temporal expectation and sound intensity on reaction times were independent (Experiment 1). The exact pattern of results was replicated with a slightly altered paradigm (Experiment 2) and with a different kind of task (Experiment 3). In sum, these results corroborate the notion that the effect of temporal orienting might not rely on the same processes as the effect of sound intensity does.	\N	\N
23760984	The neural mechanisms of pitch coding have been debated for more than a century. The two main mechanisms are coding based on the profiles of neural firing rates across auditory nerve fibers with different characteristic frequencies (place-rate coding), and coding based on the phase-locked temporal pattern of neural firing (temporal coding). Phase locking precision can be partly assessed by recording the frequency-following response (FFR), a scalp-recorded electrophysiological response that reflects synchronous activity in subcortical neurons. Although features of the FFR have been widely used as indices of pitch coding acuity, only a handful of studies have directly investigated the relation between the FFR and behavioral pitch judgments. Furthermore, the contribution of degraded neural synchrony (as indexed by the FFR) to the pitch perception impairments of older listeners and those with hearing loss is not well known. Here, the relation between the FFR and pure-tone frequency discrimination was investigated in listeners with a wide range of ages and absolute thresholds, to assess the respective contributions of subcortical neural synchrony and other age-related and hearing loss-related mechanisms to frequency discrimination performance. FFR measures of neural synchrony and absolute thresholds independently contributed to frequency discrimination performance. Age alone, i.e., once the effect of subcortical neural synchrony measures or absolute thresholds had been partialed out, did not contribute to frequency discrimination. Overall, the results suggest that frequency discrimination of pure tones may depend both on phase locking precision and on separate mechanisms affected in hearing loss.	\N	\N
23761928	In vision, humans use summary statistics (e.g., the average facial expression of a crowd) to efficiently perceive the gist of groups of features. Here, we present direct evidence that ensemble coding is also important for auditory processing. We found that listeners could accurately estimate the mean frequency of a set of logarithmically spaced pure tones presented in a temporal sequence (Experiment 1). Their performance was severely reduced when only a subset of tones from a given sequence was presented (Experiment 2), which demonstrates that ensemble coding is based on a substantial number of the tones in a sequence. This precise ensemble coding occurred despite very limited representation of individual tones from the sequence: Listeners were poor at identifying specific individual member tones (Experiment 3) and at determining their positions in the sequence (Experiment 4). Together, these results indicate that summary statistical coding is not limited to visual processing and is an important auditory mechanism for extracting ensemble frequency information from sequences of sounds.	\N	\N
23769004	A 3-year longitudinal study was conducted to investigate changes in vocal quality as a result of singing training at a tertiary level conservatorium in Australia. Singers performed a messa di voce (MDV) at intervals of 6 months over the 3-year period of training. The study investigated the evolving acoustic features of the singers' voices exhibited during the MDV, including sound pressure level (SPL), short-term energy ratio (STER), duration, and vibrato parameters of the fundamental frequency (F0), SPL, and STER. The maximum SPL exhibited a marginal systematic increase over the training period, but the maximum STER did not systematically change. F0 vibrato extent increased significantly, whereas the extent of SPL and STER vibrato did not change significantly.	\N	\N
23772828	Noise-induced hearing loss (NIHL) is the most reported occupational health disease in the Netherlands. The internet-based speech-in-noise test Earcheck (Albrecht et al, 2005; Leensen et al, 2011b) is designed to detect beginning NIHL and can be a valuable tool in occupational hearing health surveillance. The aim of this study is to investigate the validity of Earcheck compared to regular screening audiometry. Subjects performed online Earcheck tests at home. The results are compared to a pure-tone screening audiogram obtained during regular occupational health examination. A subgroup performed the measurements twice to assess test-retest reliability. Two hundred and forty-nine male construction employees who recently had a periodic occupational health examination participated. An average learning effect of -1.6 dB was found, which decreased with increasing test number. The test-retest variability was 1.6 dB. Sensitivity to detect beginning NIHL was 68%, with a specificity of 71%. Although sensitivity and specificity values are only moderate, the broad internet application still promises a valuable addition to current practice. The relatively high learning effect indicates that more reliable results can be obtained after a longer test session. When this is put into practice some improvement in sensitivity and specificity may be expected as well.	\N	\N
23774181	A linguistic construction is typically viewed as encoding the pairing of syntactic form and semantic information that is independent of the meaning of constituent words. Here with the event-related potentials (ERPs) we demonstrate that such a construction can also encode pragmatic constraints (event likelihood) that immediately influence online sentence comprehension and the associated neural activity. The lian…dou…construction in Chinese (similar to even in English) normally describes an event of low expectedness (a semantic constraint); it also introduces a pragmatic scale implying that any event with a higher likelihood than the event described must occur (pragmatic inference). By embedding a highly likely event (a rich man buying a house) or an underspecified event (a man buying a house) in the construction, we created an incongruent condition and an underspecified condition and compared both with a control condition in which an event of low expectedness (a poor man buying a house) was described. ERPs on the main verb phrases showed an N400 with a maximum in the right hemisphere followed by a late negativity with an anterior maximum for both the incongruent and underspecified conditions, with a larger N400 effect for the former than for the latter. ERPs on the sentence-final phrases showed a sustained negativity for the incongruent, but not for the underspecified condition. The N400 effect may reflect the increased difficulty in unifying the current event into the lian…dou… construction. The late negativity may reflect a second-pass revision according to the likelihood scale to satisfy the pragmatic constraints of the construction.	\N	\N
23786393	The purpose of this study was to test the ability to discriminate low-frequency pure-tone stimuli for ears with and without contralateral dead regions, in subjects with bilateral high-frequency hearing loss; we examined associations between hearing loss characteristics and frequency discrimination of low-frequency stimuli in subjects with high-frequency hearing loss. Cochlear dead regions were diagnosed using the TEN-HL test. A frequency discrimination test utilizing an adaptive three-alternative forced choice method provided difference limens for reference frequencies 0.25 kHz and 0.5 kHz. Among 105 subjects with bilateral high-frequency hearing loss, unilateral dead regions were found in 15 subjects. These, and an additional 15 matched control subjects without dead regions, were included in the study. Ears with dead regions performed best at the frequency discrimination test. Ears with a contralateral dead region performed significantly better than ears without a contralateral dead region at 0.5 kHz, the reference frequency closest to the mean audiogram cut-off, while the opposite result was obtained at 0.25 kHz. Results may be seen as a sign of a contralateral effect of unilateral dead regions on the discrimination of stimuli with frequencies well below the audiogram cut-off in adult subjects with bilateral high-frequency hearing loss.	\N	\N
23786439	Most theories of human language production assume that generating a sentence involves several stages, including an initial stage where the prelinguistic message is determined and a subsequent stage of grammatical encoding. However, it is contentious whether grammatical encoding involves separate stages of grammatical-function assignment and linearization. To address this question, we examined the mapping between the message level and grammatical encoding in two structural priming experiments in which German speakers choose between three different structures expressing ditransitive events. Although speakers showed a tendency to repeat the order of constituents (noun phrase-prepositional phrase, NP-PP, vs. NP-NP), they were additionally primed to repeat the order of thematic roles when constituent structure was constant (NPRECIPIENT-NPTHEME vs. NPTHEME-NPRECIPIENT). Experiment 2 found that the latter effect could not be due to persistence of the order of phrases referring to animate and inanimate entities. These results suggest a direct mapping of thematic roles to word order, consistent with a model in which the message is mapped onto syntactic structure in a single stage.	\N	\N
23787044	Coordinating movements to music is often considered a uniquely human skill. A new study dispels this notion by showing that male Australian lyrebirds also perform 'dance' moves which are predictably matched with specific songs in their display routines.	\N	\N
23789391	Auditory evoked response and mismatch negativity potential have been studied using the reversed odd-ball paradigm of standard and deviant stimulus presentation. In the experiments, three types of spatial sound stimuli (stationary and moving either gradually or abruptly from the head midline) were presented in three configurations. Each configuration employed one stimulus type as standard and the other two types as deviants. It was demonstrated that the configuration reversals influenced significantly the evoked response and mismatch negativity. The results obtained are discussed as the possible evidence of the categorical perception of auditory motion revealed at the earlier stages of sound processing in the hearing system.	\N	\N
23789637	Pickering & Garrod (P&G) put forward the interesting idea that language production relies on forward modeling operating at multiple processing levels. The evidence currently available to substantiate this idea mostly concerns sensorimotor processes and not more abstract linguistic levels (e.g., syntax, semantics, phonology). The predictions that follow from the claim seem too general, in their current form, to guide specific empirical tests.	\N	\N
23789872	We welcome the proposal to use forward models to understand predictive processes in language processing. However, Pickering & Garrod (P&G) miss the opportunity to provide a strong framework for future work. Forward models need to be pursued in the context of learning. This naturally leads to questions about what prediction error these models aim to minimize.	\N	\N
23790043	Although the target article emphasizes the important role of prediction in language use, prediction may well also play a key role in the initial formation of linguistic representations, that is, in language development. We outline the role of prediction in three relevant language-learning domains: transitional probabilities, statistical preemption, and construction learning.	\N	\N
23792078	Abundant evidence from both field and lab studies has established that conspecific vocalizations (CVs) are of critical ecological significance for a wide variety of species, including humans, non-human primates, rodents, and other mammals and birds. Correspondingly, a number of experiments have demonstrated behavioral processing advantages for CVs, such as in discrimination and memory tasks. Further, a wide range of experiments have described brain regions in many species that appear to be specialized for processing CVs. For example, several neural regions have been described in both mammals and birds wherein greater neural responses are elicited by CVs than by comparison stimuli such as heterospecific vocalizations, nonvocal complex sounds, and artificial stimuli. These observations raise the question of whether these regions reflect domain-specific neural mechanisms dedicated to processing CVs, or alternatively, if these regions reflect domain-general neural mechanisms for representing complex sounds of learned significance. Inasmuch as CVs can be viewed as complex combinations of basic spectrotemporal features, the plausibility of the latter position is supported by a large body of literature describing modulated cortical and subcortical representation of a variety of acoustic features that have been experimentally associated with stimuli of natural behavioral significance (such as food rewards). Herein, we review a relatively small body of existing literature describing the roles of experience, learning, and memory in the emergence of species-typical neural representations of CVs and auditory system plasticity. In both songbirds and mammals, manipulations of auditory experience as well as specific learning paradigms are shown to modulate neural responses evoked by CVs, either in terms of overall firing rate or temporal firing patterns. In some cases, CV-sensitive neural regions gradually acquire representation of non-CV stimuli with which subjects have training and experience. These results parallel literature in humans describing modulation of responses in face-sensitive neural regions through learning and experience. Thus, although many questions remain, the available evidence is consistent with the notion that CVs may acquire distinct neural representation through domain-general mechanisms for representing complex auditory objects that are of learned importance to the animal. This article is part of a Special Issue entitled "Communication Sounds and the Brain: New Directions and Perspectives".	\N	\N
23792769	This event-related potential (ERP) study examines the time course of context-dependent talker normalization in spoken word identification. We found three ERP components, the N1 (100-220 ms), the N400 (250-500 ms) and the Late Positive Component (500-800 ms), which are conjectured to involve (a) auditory processing, (b) talker normalization and lexical retrieval, and (c) decisional process/lexical selection respectively. Talker normalization likely occurs in the time window of the N400 and overlaps with the lexical retrieval process. Compared with the nonspeech context, the speech contexts, no matter whether they have semantic content or not, enable listeners to tune to a talker's pitch range. In this way, speech contexts induce more efficient talker normalization during the activation of potential lexical candidates and lead to more accurate selection of the intended word in spoken word identification.	\N	\N
23809517	Cochlear implantation has significant effects on language abilities and reading skills. The current study compared the reading performance of children with cochlear implants with that of typically developing children in second and third grades. This descriptive-analytic study was performed including 24 children with cochlear implants and 24 typically developing peers. The grade range of the participants was second and third grades. All students were selected from Tehran city elementary schools. The reading performance of children was assessed by the "Nama" reading test. The results showed that the mean reading scores of typically developing children were significantly greater than those of the children with cochlear implants (P < 0.01) and there was a significant relationship between reading skills and age of surgery (P < 0.05). Also, there was a significant relationship between reading skills and the period of cochlear implantation (P < 0.05). Children with cochlear implants showed a weak performance in reading skills in comparison to typically developing children due to lower accessibility to phonological information. However, this limitation can be compensated for partly by early surgery. Parents should refer their deaf children for cochlear implantation before the age of language learning.	\N	\N
23815480	The origin of the Western preference for consonance remains unresolved, with some suggesting that the preference is innate. In Experiments 1 and 2 of the present study, 6-month-old infants heard six different consonant/dissonant pairs of stimuli, including those tested in previous research. In contrast to the findings of others, infants in the present study failed to listen longer to consonant stimuli. After 3 minutes of exposure to consonant or dissonant stimuli in Experiment 3, 6-month-old infants listened longer to the familiar stimulus, whether consonant or dissonant. Our findings are inconsistent with innate preferences for consonant stimuli. Instead, the effect of short-term exposure is consistent with the view that familiarity underlies the origin of the Western preference for consonant intervals.	\N	\N
23819616	In an earlier study (Beach et al, 2012), detailed noise exposure measurements were obtained through individual dosimetry. In this further analysis of the data we ask the question "Can the effort required to converse in noise be used to estimate the experienced A-weighted noise level?" The noise levels experienced during specific activities were obtained from the analysis of dosimetry results from personal noise exposure meters worn by study participants. The measured noise levels from particular events were compared to a subjectively judged 'loudness rating' reported by the person wearing the dosimeter during the measured event. Participants were volunteers (females = 20, males = 22) between 18 and 35 years (average age = 26.8) who were willing to wear dosimeters and keep a simple activity log. The relation between the objectively measured and the subjectively judged levels was consistent for the group over a large number of events. The subjective loudness rating index was shown to be a convenient tool that can be utilized for the retrospective estimation of noise levels from individual activities.	\N	\N
23824440	In this study, the authors examined the effects of aging and residual hearing on the identification of acoustically similar and dissimilar vowels in adults with postlingual deafness who use hearing aids (HAs) and/or cochlear implants (CIs). The authors used two groups of acoustically similar and dissimilar vowels to assess vowel identification. Also, the Consonant-Nucleus-Consonant Word Recognition Test (Peterson & Lehiste, 1962) and sentences from the Hearing in Noise Test (Nilsson, Soli, & Sullivan, 1994) were administered. Forty CI recipients with postlingual deafness (ages 31-81 years) participated in the study. Acoustically similar vowels were more difficult to identify than acoustically dissimilar vowels. With increasing age, performance deteriorated when identifying acoustically similar vowels. Vowel identification was also affected by the use of a contralateral HA and the degree of residual hearing prior to implantation. Moderate correlations were found between speech perception and vowel identification performance. Identification performance was affected by the acoustic similarity of the vowels. Older adults experienced more difficulty identifying acoustically similar confusable vowels than did younger adults. The findings might lend support to the ease of language understanding model (Ronnberg, Rudner, Foo, & Lunner, 2008), which proposes that the quality and perceptual robustness of acoustic input affects speech perception.	\N	\N
23833281	The relations of phonological, orthographic, and morphological awareness and vocabulary to word reading and spelling were examined for 304 first-grade children who were receiving differentiated instruction in a Response to Intervention (RtI) model of instruction. First-grade children were assessed on their phonological, orthographic, and morphological awareness; expressive vocabulary; word reading; and spelling. Year-end word reading and spelling were outcome variables, and phonological, orthographic, and morphological awareness; expressive vocabulary; and RtI status (Tiers 1, 2, & 3) were predictor variables. The 3 linguistic awareness skills were unique predictors of word reading, and phonological and orthographic awareness were unique predictors of spelling. The contributions that these linguistic awareness skills and vocabulary made to word reading and spelling did not differ by children's RtI tier status. These results, in conjunction with previous studies, suggest that even beginning readers and spellers draw on multiple linguistic awareness skills for their word reading and spelling regardless of their level of literacy abilities. Educational implications are discussed.	\N	\N
23833989	To investigate hidden hearing loss in tinnitus patients with normal audiograms by means of auditory brainstem response (ABR) and explore the origin of tinnitus. Pure tone thresholds, ABR thresholds, amplitude of wave I and wave V of ABR were analyzed in 40 tinnitus patients and 15 controls. There was no significant difference in pure tone thresholds and ABR thresholds between the tinnitus patients and controls, while a reduced amplitude of wave I and normal amplitude of wave V of ABR in the tinnitus patients became evident. Tinnitus patients with normal audiograms have hidden hearing loss at the level of the primary auditory nerve, and the generation of tinnitus is likely attributable to a homeostatic response of neurons in the brainstem.	\N	\N
23846434	CHENFIT-AMP is a novel nonlinear strategy that combines the fitting (gain prescription) and amplification (gain implementation) procedures for cochlear hearing loss. The fitting part of CHENFIT-AMP prescribes gain for outer hair cell (OHC) and inner hair cell (IHC) loss, respectively. The gain for OHC loss varies with the cochlear gain decided by the value of OHC loss and the input level. The gain for IHC loss varies with the value of IHC loss only and will be limited to a constant if there is a "dead region." The amplification part of CHENFIT-AMP is responsible for estimating the input level and cochlear gain based on Chen's loudness model. CHENFIT-AMP is evaluated with four typical audiograms and nine individual audiograms. A widely used nonlinear fitting procedure, NAL-NL2, is evaluated to compare prescription results with CHENFIT-AMP; a standard nonlinear amplification algorithm, multichannel compression (MCC), with the parameters provided by NAL-NL2, is also evaluated to compare amplification results with CHENFIT-AMP. For long-term average speech spectrum (LTASS) inputs, CHENFIT-AMP generally prescribes similar gain as NAL-NL2 for the typical audiograms; however, gain prescribed by CHENFIT-AMP is more individualized than NAL-NL2 for the individual audiograms, especially when the audiograms have big deviations in the slope. For LTASS-shaped noise input, the gain implemented by MCC with parameters provided by NAL-NL2 cannot completely realize the gain prescribed by NAL-NL2. For speech sentence inputs, average ratings by subjects indicated that amplification by CHENFIT-AMP was preferred and led to a louder perception than that by MCC with parameters from NAL-NL2.	\N	\N
23846719	Inner speech is one of the most common, but least investigated, mental activities humans perform. It is an internal copy of one's external voice and so is similar to a well-established component of motor control: corollary discharge. Corollary discharge is a prediction of the sound of one's voice generated by the motor system. This prediction is normally used to filter self-caused sounds from perception, which segregates them from externally caused sounds and prevents the sensory confusion that would otherwise result. The similarity between inner speech and corollary discharge motivates the theory, tested here, that corollary discharge provides the sensory content of inner speech. The results reported here show that inner speech attenuates the impact of external sounds. This attenuation was measured using a context effect (an influence of contextual speech sounds on the perception of subsequent speech sounds), which weakens in the presence of speech imagery that matches the context sound. Results from a control experiment demonstrated this weakening in external speech as well. Such sensory attenuation is a hallmark of corollary discharge.	\N	\N
23847464	Evaluating series of complex sounds like those in speech and music requires sequential comparisons to extract task-relevant relations between subsequent sounds. With the present functional magnetic resonance imaging (fMRI) study, we investigated whether sequential comparison of a specific acoustic feature within pairs of tones leads to a change in lateralized processing in the auditory cortex (AC) of humans. For this we used the active categorization of the direction (up vs. down) of slow frequency modulated (FM) tones. Several studies suggest that this task is mainly processed in the right AC. These studies, however, tested only the categorization of the FM direction of each individual tone. In the present study we ask the question whether the right lateralized processing changes when, in addition, the FM direction is compared within pairs of successive tones. For this we use an experimental approach involving contralateral noise presentation in order to explore the contributions made by the left and right AC in the completion of the auditory task. This method has already been applied to confirm the right-lateralized processing of the FM direction of individual tones. In the present study, the subjects were required to perform, in addition, a sequential comparison of the FM direction in pairs of tones. The results suggest a division of labor between the two hemispheres such that the FM direction of each individual tone is mainly processed in the right AC whereas the sequential comparison of this feature between tones in a pair is probably performed in the left AC.	\N	\N
23850664	Over the last four decades, a range of different neuroimaging tools have been used to study human auditory attention, spanning from classic event-related potential studies using electroencephalography to modern multimodal imaging approaches (e.g., combining anatomical information based on magnetic resonance imaging with magneto- and electroencephalography). This review begins by exploring the different strengths and limitations inherent to different neuroimaging methods, and then outlines some common behavioral paradigms that have been adopted to study auditory attention. We argue that in order to design a neuroimaging experiment that produces interpretable, unambiguous results, the experimenter must not only have a deep appreciation of the imaging technique employed, but also a sophisticated understanding of perception and behavior. Only with the proper caveats in mind can one begin to infer how the cortex supports a human in solving the "cocktail party" problem. This article is part of a Special Issue entitled Human Auditory Neuroimaging.	\N	\N
23855264	Accurate tuning is an important aspect of singing in harmony in the context of a choir or vocal ensemble. Tuning and 'pitch drift' are concerning factors in performance for even the most accomplished professional choirs when singing a cappella (unaccompanied). In less experienced choirs tuning often lacks precision, typically because individual singers have not developed appropriate listening skills. In order to investigate accuracy of tuning in ensemble singing situations, a chorally appropriate reference is required against which frequency measurements can be made. Since most basic choral singing involves chords in four parts, a four-part reference template is used in which the fundamental frequencies of the notes in each chord can be accurately set. This template can now be used in experiments where three of the reference parts are tuned in any musical temperament (tuning system), in this case equal and just temperaments, and played over headphones to a singer to allow her/his tuning strategy to be investigated. This paper describes a practical implementation of a four-part choral synthesis system in Pure Data (Pd) and its use in an investigation of tuning of notes by individual singers using an exercise originally written to explore pitch drift in a cappella choral singing.	\N	\N
23859060	To examine the relationship between portable digital audio player listening behaviours and (1) measured sound pressure levels, (2) audiometric measures, (3) self-reported hearing loss symptoms. A questionnaire to evaluate listening behaviours, including self-reported hearing loss symptoms and listening duration/volume settings. Multivariate regression analysis was used to determine the relationship between these variables, audiometric evaluation, calculated exposure levels, Lex(8hr), and measured sound pressure levels, Leq(32sec). This study included 103 male and 134 female subjects aged 10 to 17 years. Calculated Lex(8hr) and measured Leq(32sec) levels increased with age and self-reported usage time. Audiometric thresholds averaged over 4 and 8 kHz were higher when usage exceeded five years as compared to less than one year. Higher measured sound pressure levels were associated with worse audiometric thresholds at (0.5, 1, 2 kHz, averaged) and 4 kHz. Self-reported hearing loss symptoms were reported by 33% to 50% of subjects. In this cohort sample, our results support a statistical association between hearing acuity and (1) Self-reported weekly usage in hours; (2) Tightness of fit; (3) Years of usage; and (4) Measured sound pressure levels. Generalizing these results beyond the current sample would require additional research.	\N	\N
23862816	Tone-in-noise detection has been studied for decades; however, it is not completely understood what cue or cues are used by listeners for this task. Model predictions based on energy in the critical band are generally more successful than those based on temporal cues, except when the energy cue is not available. Nevertheless, neither energy nor temporal cues can explain the predictable variance for all listeners. In this study, it was hypothesized that better predictions of listeners' detection performance could be obtained using a nonlinear combination of energy and temporal cues, even when the energy cue was not available. The combination of different cues was achieved using the logarithmic likelihood-ratio test (LRT), an optimal detector in signal detection theory. A nonlinear LRT-based combination of cues was proposed, given that the cues have Gaussian distributions and the covariance matrices of cue values from noise-alone and tone-plus-noise conditions are different. Predictions of listeners' detection performance for three different sets of reproducible noises were computed with the proposed model. Results showed that predictions for hit rates approached the predictable variance for all three datasets, even when an energy cue was not available.	\N	\N
23862818	A number of precedence-effect models have been developed to simulate the robust localization performance of humans in reverberant conditions. Although they are able to reduce reverberant information for many conditions, they tend to fail for ongoing stimuli with truncated on/offsets, a condition human listeners master when localizing a sound source in the presence of a reflection, according to a study by Dizon and Colburn [J. Acoust. Soc. Am. 119, 2947-2964 (2006)]. This paper presents a solution for this condition by using an autocorrelation mechanism to estimate the delay and amplitude ratio between the leading and lagging signals. An inverse filter is then used to eliminate the lag signal, before it is localized with a standard localization algorithm. The current algorithm can operate on top of a basic model of the auditory periphery (gammatone filter bank, half-wave rectification) to simulate psychoacoustic data by Braasch et al. [Acoust. Sci. Tech. 24, 293-303 (2003)] and Dizon and Colburn. The model performs robustly with these on/offset truncated and interaural level difference based stimuli and is able to demonstrate the Haas effect.	\N	\N
23862821	In noise repetition-detection tasks, listeners have to distinguish trials of continuously running noise from trials in which noise tokens are repeated in a cyclic manner. Recently, it has been shown that using the exact same noise token across several trials ("reference noise") facilitates the detection of repetitions for this token [Agus et al. (2010). Neuron 66, 610-618]. This was attributed to perceptual learning. Here, the nature of the learning was investigated. In experiment 1, reference noise tokens were embedded in trials with or without cyclic presentation. Naïve listeners reported repetitions in both cases, thus responding to the reference noise even in the absence of an actual repetition. Experiment 2, with the same listeners, showed a similar pattern of results even after the design of the experiment was made explicit, ruling out a misunderstanding of the task. Finally, in experiment 3, listeners reported repetitions in trials containing the reference noise, even before ever hearing it presented cyclically. The results show that listeners were able to learn and recognize noise tokens in the absence of an immediate repetition. Moreover, the learning mandatorily interfered with listeners' ability to detect repetitions. It is concluded that salient perceptual changes accompany the learning of noise.	\N	\N
23862833	Effective communication between staff members is key to patient safety in hospitals. A variety of patient care activities including admittance, evaluation, and treatment rely on oral communication. Surprisingly, published information on speech intelligibility in hospitals is extremely limited. In this study, speech intelligibility measurements and occupant evaluations were conducted in 20 units of five different U.S. hospitals. A variety of unit types and locations were studied. Results show that overall, no unit had "good" intelligibility based on the speech intelligibility index (SII > 0.75) and several locations were found to have "poor" intelligibility (SII < 0.45). Further, occupied spaces were found to have 10%-15% lower SII than unoccupied spaces on average. Additionally, staff perception of communication problems at nurse stations was significantly correlated with SII ratings. In a targeted second phase, a unit treated with sound absorption had higher SII ratings for a larger percentage of time as compared to an identical untreated unit. Taken as a whole, the study provides an extensive baseline evaluation of speech intelligibility across a variety of hospitals and unit types, offers some evidence of the positive impact of absorption on intelligibility, and identifies areas for future research.	\N	\N
23862901	Categorical perception experiments were performed on an English /b-p/ voice onset time (VOT) continuum with native (American English) and non-native (Korean) listeners to examine whether and how phonetic categorization is modulated by prosodic boundary and language experience. Results demonstrated perceptual shifting according to prosodic boundary strength: A longer VOT was required to identify a sound as /p/ after an intonational phrase than a word boundary, regardless of the listeners' language experience. This suggests that segmental perception is modulated by the listeners' computation of an abstract prosodic structure reflected in phonetic cues of phrase-final lengthening and domain-initial strengthening, which are common across languages.	\N	\N
23862903	A previous report [Margolis and Stiepan (2012). "Acoustic method for calibration of audiometric bone vibrators," J. Acoust. Soc. Am. 131, 1221-1225] described a reliable, inexpensive, acoustic method for calibration of audiometric bone vibrators. As a follow up to that report harmonic distortion measurements were made with the standard electromechanical method and the acoustic method using five Radioear B71 vibrators and one Radioear B81 prototype vibrator. Lower distortion was seen for measurements made with the acoustic method compared to the electromechanical method and for the Radioear B81 vibrator compared to the Radioear B71 vibrator.	\N	\N
23865332	Masking therapy can help patients become accustomed to tinnitus. The therapy is safe and easy to implement, and it has become a widely used tinnitus treatment. According to surveys of tinnitus sounds, a cicada-like sound is among the most commonly reported. However, we have found no published work on how to synthesize cicada sounds or use them to ameliorate tinnitus. Inspired by human acoustics theory, we propose a method for synthesizing medical masking sounds, and demonstrate how it achieves diversity by illustrating the synthesis of various cicada sounds. In addition, the energy attenuation problem in the spectrum shifting process has been successfully solved. Simulation results indicate that the proposed method achieves decent results and would have practical value for future applications.	\N	\N
23867553	We study the developmental trajectory of morphology and function of the superior temporal cortex (STC) in children (8-9 years), adolescents (14-15 years) and young adults. We analyze cortical surface landmarks and functional MRI (fMRI) responses to voices, other natural categories and tones and examine how hemispheric asymmetry and inter-subject variability change across age. Our results show stable morphological asymmetries across age groups, including a larger left planum temporale and a deeper right superior temporal sulcus. fMRI analyses show that a rightward lateralization for voice-selective responses is present in all groups but decreases with age. Furthermore, STC responses to voices change from being less selective and more spatially diffuse in children to highly selective and focal in adults. Interestingly, the analysis of morphological landmarks reveals that inter-subject variability increases during development in the right--but not in the left--STC. Similarly, inter-subject variability of cortically-realigned functional responses to voices, other categories and tones increases with age in the right STC. Our findings reveal asymmetric developmental changes in brain regions crucial for auditory and voice perception. The age-related increase of inter-subject variability in right STC suggests that anatomy and function of this region are shaped by unique individual developmental experiences.	\N	\N
23882002	To determine the relative importance of acoustic parameters (fundamental frequency [F0], formant frequencies [FFs], aperiodicity, and spectrum level [SL]) on voice gender perception, the authors used a novel parameter-morphing approach that, unlike spectral envelope shifting, allows the application of nonuniform scale factors to transform formants and more direct comparison of parameter impact. In each of 2 experiments, 16 listeners with normal hearing (8 female, 8 male) classified voice gender for morphs between female and male speakers, using syllable tokens from 2 male-female speaker pairs. Morphs varied single acoustic parameters (Experiment 1) or selected combinations (Experiment 2), keeping residual parameters androgynous, as determined in a baseline experiment. The strongest cue related to gender perception was F0, followed by FF and SL. Aperiodicity did not systematically influence gender perception. Morphing F0 and FF in conjunction produced convincing changes in perceived gender, changes that were equivalent to those for Full morphs interpolating all parameters. Despite the importance of F0, morphing FF and SL in combination produced effective changes in voice gender perception. The most important single parameters for gender perception are, in order, F0, FF, and SL. At the same time, F0 and vocal tract resonances have a comparable impact on voice gender perception.	\N	\N
23883861	The observation that near-threshold low-contrast visual distractors can equally influence perceptual state and goal-directed motor responses was recently taken as an argument against a sharp separation between a conscious vision for perception and an unconscious vision for action. However, data supporting the dual visual system theory have principally involved high-contrast stimuli. In the present study, we assessed the effect of varying the contrast of a near-threshold visual distractor while keeping its visibility constant with backward noise masks. Eight participants performed fast manual reaching movements toward a highly visible target while subsequently reporting the presence/absence of a near-threshold distractor appearing at the opposite location with respect to the body midline. For all distractor contrasts, hand trajectory deviations toward the distractor were observed when the distractor was present and detected. When the distractor remained undetected deviations also occurred, but for higher contrasts. The subliminal motor effect traditionally observed in visual masking studies may therefore primarily depend on the luminance contrast of the interfering stimuli. These results suggest that dissociations between perceptual and motor responses can be explained by a single-signal model involving differential thresholds for perception and action that are specifically modulated as a function of both the requirements of the task and the contrast level of the stimuli. Such modulation is compatible with neurophysiological accounts of visual masking in which feedforward activation to--and feedback activation from--higher visual areas are correlated with the actual presence of the stimulation and its conscious perception, respectively.	\N	\N
23902521	This study investigated the effect of electrode configuration, stimulus rate, and EEG rejection level on the efficiency of ABR testing in babies. ABR to click stimuli at 40 dB nHL were simultaneously recorded from two electrode configurations, ipsilateral mastoid to high forehead (Mi-Fh) and nape to high forehead (N-Fh), with two EEG rejection levels (± 5 μV and ± 10 μV). Stimulus rates were between 39.1 and 69.1 per second. Efficiency was measured by confidence in the ABR for a given test time. Participants were thirty babies who had passed a targeted newborn hearing screen with ABR thresholds ≤ 40 dB nHL. The N-Fh configuration, as expected, gave on average a larger response amplitude compared to the Mi-Fh configuration but was only marginally significantly better in terms of test efficiency. There was no significant effect of stimulus rate on test efficiency between 39.1/s and 59.1/s. The lower ± 5 μV EEG rejection level was more test efficient. This study provides some evidence that, for ABR threshold testing in babies, alternatives of ipsilateral mastoid or nape electrode and a range of stimulus rates have little or no effect on test efficiency. The results support the use of low EEG rejection limits.	\N	\N
23915050	In everyday listening situations, we need to constantly switch between alternative sound sources and engage attention according to cues that match our goals and expectations. The exact neuronal bases of these processes are poorly understood. We investigated oscillatory brain networks controlling auditory attention using cortically constrained fMRI-weighted magnetoencephalography/EEG source estimates. During consecutive trials, participants were instructed to shift attention based on a cue, presented in the ear where a target was likely to follow. To promote audiospatial attention effects, the targets were embedded in streams of dichotically presented standard tones. Occasionally, an unexpected novel sound occurred opposite to the cued ear to trigger involuntary orienting. According to our cortical power correlation analyses, increased frontoparietal/temporal 30-100 Hz gamma activity at 200-1400 msec after cued orienting predicted fast and accurate discrimination of subsequent targets. This sustained correlation effect, possibly reflecting voluntary engagement of attention after the initial cue-driven orienting, spread from the TPJ, anterior insula, and inferior frontal cortices to the right FEFs. Engagement of attention to one ear resulted in a significantly stronger increase of 7.5-15 Hz alpha in the ipsilateral than contralateral parieto-occipital cortices 200-600 msec after the cue onset, possibly reflecting cross-modal modulation of the dorsal visual pathway during audiospatial attention. Comparisons of cortical power patterns also revealed significant increases of sustained right medial frontal cortex theta power, right dorsolateral pFC and anterior insula/inferior frontal cortex beta power, and medial parietal cortex and posterior cingulate cortex gamma activity after cued versus novelty-triggered orienting (600-1400 msec). Our results reveal sustained oscillatory patterns associated with voluntary engagement of auditory spatial attention, with the frontoparietal and temporal gamma increases being best predictors of subsequent behavioral performance.	\N	\N
23920156	The current study investigated the role of resumption in the interpretation of object relative clauses (RCs) in Persian-speaking children. Sixty-four (N=64) children aged 3;2-6;0 (M=4;8) completed a referent selection task that tested their comprehension of subject RCs, gapped object RCs, and object RCs containing either a resumptive pronoun or an object clitic. The results showed that the presence of a resumptive element (pronoun or clitic) had a facilitative effect on children's processing of object RCs. In both cases object RCs with resumptive elements were interpreted more accurately than gapped subject and object RCs, suggesting that resumptive elements ease processing burden in syntactically complex contexts because they provide local cues to thematic role assignment.	\N	\N
23926291	Previously, Gygi and Shafiro (2011) found that when environmental sounds are semantically incongruent with the background scene (e.g., horse galloping in a restaurant), they can be identified more accurately by young normal-hearing listeners (YNH) than sounds congruent with the scene (e.g., horse galloping at a racetrack). This study investigated how age and high-frequency audibility affect this Incongruency Advantage (IA) effect. In Experiments 1a and 1b, elderly listeners ( N = 18 for 1a; N = 10 for 1b) with age-appropriate hearing (EAH) were tested on target sounds and auditory scenes in 5 sound-to-scene ratios (So/Sc) between -3 and -18 dB. Experiment 2 tested 11 YNH on the same sound-scene pairings lowpass-filtered at 4 kHz (YNH-4k). The EAH and YNH-4k groups exhibited an almost identical pattern of significant IA effects, but both were at approximately 3.9 dB higher So/Sc than the previously tested YNH listeners. However, the psychometric functions revealed a shallower slope for EAH listeners compared with YNH listeners for the congruent stimuli only, suggesting a greater difficulty for the EAH listeners in attending to sounds expected to occur in a scene. These findings indicate that semantic relationships between environmental sounds in soundscapes are mediated by both audibility and cognitive factors and suggest a method for dissociating these factors.	\N	\N
23927115	Laback et al. [(2011). J. Acoust. Soc. Am. 129, 888-897] investigated the additivity of nonsimultaneous masking using short Gaussian-shaped tones as maskers and target. The present study involved Gaussian stimuli to measure the additivity of simultaneous masking for combinations of up to four spectrally separated maskers. According to most basilar membrane measurements, the maskers should be processed linearly at the characteristic frequency (CF) of the target. Assuming also compression of the target, all masker combinations should produce excess masking (exceeding linear additivity). The results for a pair of maskers flanking the target indeed showed excess masking. The amount of excess masking could be predicted by a model assuming summation of masker-evoked excitations in intensity units at the target CF and compression of the target, using compressive input/output functions derived from the nonsimultaneous masking study. However, the combinations of lower-frequency maskers showed much less excess masking than predicted by the model. This cannot easily be attributed to factors like off-frequency listening, combination tone perception, or between-masker suppression. It was better predicted, however, by assuming weighted intensity summation of masker excitations. The optimum weights for the lower-frequency maskers were smaller than one, consistent with partial masker compression as indicated by recent psychoacoustic data.	\N	\N
23927133	Much recent interest surrounds listeners' abilities to adapt to various transformations that distort speech. An extreme example is spectral rotation, in which the spectrum of low-pass filtered speech is inverted around a center frequency (2 kHz here). Spectral shape and its dynamics are completely altered, rendering speech virtually unintelligible initially. However, intonation, rhythm, and contrasts in periodicity and aperiodicity are largely unaffected. Four normal hearing adults underwent 6 h of training with spectrally-rotated speech using Continuous Discourse Tracking. They and an untrained control group completed pre- and post-training speech perception tests, for which talkers differed from the training talker. Significantly improved recognition of spectrally-rotated sentences was observed for trained, but not untrained, participants. However, there were no significant improvements in the identification of medial vowels in /bVd/ syllables or intervocalic consonants. Additional tests were performed with speech materials manipulated so as to isolate the contribution of various speech features. These showed that preserving intonational contrasts did not contribute to the comprehension of spectrally-rotated speech after training, and suggested that improvements involved adaptation to altered spectral shape and dynamics, rather than just learning to focus on speech features relatively unaffected by the transformation.	\N	\N
23933145	Prepulse inhibition (PPI) of startle is a measure of inhibitory function in which a weak leading stimulus suppresses the startle response to an intense stimulus. Usually, startle blink reflexes to an intense sound are used for measuring PPI. A recent magnetoencephalographic study showed that a similar phenomenon is observed for auditory change-related cortical response (Change-N1m) to an abrupt change in sound features. It has been well established that nicotine enhances PPI of startle. Therefore, in the present magnetoencephalographic study, the effects of acute nicotine on PPI of the Change-N1m were studied in 12 healthy subjects (two females and 10 males) under a repeated measures and placebo-controlled design. Nicotine (4 mg) was given as nicotine gum. The test Change-N1m response was elicited with an abrupt increase in sound pressure by 6 dB in a continuous background sound of 65 dB. PPI was produced by an insertion of a prepulse with a 3-dB-louder or 6-dB-weaker sound pressure than the background 75 ms before the test stimulus. Results show that nicotine tended to enhance the test Change-N1m response and significantly enhanced PPI for both prepulses. Therefore, nicotine's enhancing effect on PPI of the Change-N1m was similar to that on PPI of the startle. The present results suggest that the two measures share at least some mechanisms.	\N	\N
23973563	The physical intensity of a sound, usually expressed in dB on a logarithmic ratio scale, can easily be measured using technical equipment. Loudness is the perceptual correlate of sound intensity, and is usually determined by means of some sort of psychophysical scaling procedure. The interrelation of sound intensity and perceived loudness is still a matter of debate, and the physiological correlate of loudness perception in the human auditory pathway is not completely understood. Various studies indicate that the activation in human auditory cortex is more a representation of loudness sensation rather than of physical sound pressure level. This raises the questions (1), at what stage or stages in the ascending auditory pathway is the transformation of the physical stimulus into its perceptual correlate completed, and (2), to what extent other factors affecting individual loudness judgements might modulate the brain activation as registered by auditory neuroimaging. An overview is given about recent studies on the effects of sound intensity, duration, bandwidth and individual hearing status on the activation in the human auditory system, as measured by various approaches in auditory neuroimaging. This article is part of a Special Issue entitled Human Auditory Neuroimaging.	\N	\N
23974947	Tonal music is characterized by a continuous flow of tension and resolution. This flow of tension and resolution is closely related to processes of expectancy and prediction and is a key mediator of music-evoked emotions. However, the neural correlates of subjectively experienced tension and resolution have not yet been investigated. We acquired continuous ratings of musical tension for four piano pieces. In a subsequent functional magnetic resonance imaging experiment, we identified blood oxygen level-dependent signal increases related to musical tension in the left lateral orbitofrontal cortex (pars orbitalis of the inferior frontal gyrus). In addition, a region of interest analysis in bilateral amygdala showed activation in the right superficial amygdala during periods of increasing tension (compared with decreasing tension). This is the first neuroimaging study investigating the time-varying changes of the emotional experience of musical tension, revealing brain activity in key areas of affective processing.	\N	\N
23990061	We aim to evaluate the incidence and clinical manifestations of sensorineural hearing loss (SNHL) in adult patients with acute otitis media (AOM). Seventy-five patients (age > 18 years; 83 ears) diagnosed with AOM between January 2008 and March 2011 at our clinic were enrolled and retrospectively reviewed. We detected audiometrically confirmed SNHL during the course of AOM in eight patients. The clinical course, treatment, and audiometric final outcome of each case were reviewed. SNHL was associated with AOM in 8 out of 83 ears (9.3%). The mean age of patients was 57.5 years, and the mean follow-up period was 21.1 months (range 0.6-46.3 months). The most common symptom was tinnitus. Mean bone conduction hearing threshold was 39.5 dB in pure tone audiometry. All patients showed high-frequency HL, and three showed pan-frequency HL. All patients were treated with oral antibiotics at the initial visit. Seven ears were treated with a combination of oral steroids. Myringotomy was also performed. Seven of eight patients showed improvement; however, 8 kHz thresholds were not improved. This suggested that the inflammation spread through the round window. The mean duration of recovery was 18.6 days. SNHL associated with AOM in adult patients occurs during the early phases of the disease course. High-frequency hearing was commonly affected and was well treated with oral antibiotics, myringotomy, and steroid therapy. Audiometry can be helpful for treating adult patients with AOM. Active treatment, including myringotomy, should be performed during the early phase, if SNHL is suspected.	\N	\N
23992133	The development of presbycusis, or age-related hearing loss, is determined by a combination of genetic and environmental factors. The auditory periphery exhibits a progressive bilateral, symmetrical reduction of auditory sensitivity to sound from high to low frequencies. The central auditory nervous system shows symptoms of decline in age-related cognitive abilities, including difficulties in speech discrimination and reduced central auditory processing, ultimately resulting in auditory perceptual abnormalities. The pathophysiological mechanisms of presbycusis include excitotoxicity, oxidative stress, inflammation, aging and oxidative stress-induced DNA damage that results in apoptosis in the auditory pathway. However, the originating signals that trigger these mechanisms remain unclear. For instance, it is still unknown whether insulin is involved in auditory aging. Auditory aging has preclinical lesions, which manifest as asymptomatic loss of periphery auditory nerves and changes in the plasticity of the central auditory nervous system. Currently, the diagnosis of preclinical, reversible lesions depends on the detection of auditory impairment by functional imaging, and the identification of physiological and molecular biological markers. However, despite recent improvements in the application of these markers, they remain under-utilized in clinical practice. The application of antisenescent approaches to the prevention of auditory aging has produced inconsistent results. Future research will focus on the identification of markers for the diagnosis of preclinical auditory aging and the development of effective interventions.	\N	\N
23992488	Hearing in 75-year-old persons was measured in a population-based epidemiological study to determine whether hearing had changed over a span of 29 years. The study examined generational effects in three age cohorts: cohort 1 (n = 267), tested in 1976-77; cohort 4 (n = 197), in 1990-91; and cohort 6 (n = 570), in 2005. The same test procedures, using pure-tone audiometry and a short questionnaire, were applied to the three cohorts of 75-year-old residents in the same city. Hearing was essentially unchanged over the span of the investigation, almost three decades. Low-frequency hearing was up to about 10 dB poorer in the most recently studied cohort compared to the previously studied cohorts. This difference is attributed to methodological factors. Self-assessed hearing and tinnitus were mainly unchanged, or showed minor changes both for the better and for the worse. The hearing of 75-year-old persons, measured both with pure-tone audiometry and with a short questionnaire, has not changed at all, or only marginally, over three decades.	\N	\N
23994183	Speech comprehension relies on auditory as well as visual information, and is enhanced in healthy subjects, when audiovisual (AV) information is present. Patients with schizophrenia have been reported to have problems regarding this AV integration process, but little is known about which underlying neural processes are altered. Functional magnetic resonance imaging was performed in 15 schizophrenia patients (SP) and 15 healthy controls (HC) to study functional connectivity of Broca's area by means of a beta series correlation method during perception of audiovisually presented bisyllabic German nouns, in which audio and video either matched or did not match. Broca's area of SP showed stronger connectivity with supplementary motor cortex for incongruent trials whereas HC connectivity was stronger for congruent trials. The right posterior superior temporal sulcus (RpSTS) area showed differences in connectivity for congruent and incongruent trials in HC in contrast to SP where the connectivity was similar for both conditions. These smaller differences in connectivity in SP suggest a less adaptive processing of audiovisually congruent and incongruent speech. The findings imply that AV integration problems in schizophrenia are associated with maladaptive connectivity of Broca's and RpSTS area in particular when confronted with incongruent stimuli. Results are discussed in light of recent AV speech perception models.	\N	\N
23998484	Electrode impedance increases following implantation and undergoes transitory reduction with onset of electrical stimulation. The studies in this paper measured the changes in access resistance and polarization impedance in vivo before and following electrical stimulation, and recorded the time course of these changes. Impedance measures were recorded in (a) four cats following 6 months of cochlear implant use, and (b) three cochlear implant recipients with 1.5-5 years of cochlear implant experience. Both the experimental and clinical data exhibited a reduction in electrode impedance, of 20% and 5% respectively, within 15-30 minutes of stimulation onset. The majority of these changes occurred through reduction in polarization impedance. Cessation of stimulation was followed by an equivalent rise in impedance measures within 6-12 hours. Stimulus-induced reductions in impedance exhibit a rapid onset and are evident in both chronic in vivo models tested, even several years after implantation. Given that the impedance changes were dominated by the polarization component, these findings suggest that the electrical stimulation altered the electrode surface rather than the bulk tissue and fluid in the cochlea.	\N	\N
24001008	Classical theories of semantic memory assume that concepts are represented in a unitary amodal memory system. In challenging this classical view, pure or hybrid modality-specific theories propose that conceptual representations are grounded in the sensory-motor brain areas, which typically process sensory and action-related information. Although neuroimaging studies provided evidence for a functional-anatomical link between conceptual processing of sensory or action-related features and the sensory-motor brain systems, it has been argued that aspects of such sensory-motor activation may not directly reflect conceptual processing but rather strategic imagery or postconceptual elaboration. In the present ERP study, we investigated masked effects of acoustic and action-related conceptual features to probe unconscious automatic conceptual processing in isolation. Subliminal feature-specific ERP effects at frontocentral electrodes were observed, which differed with regard to polarity, topography, and underlying brain electrical sources in congruency with earlier findings under conscious viewing conditions. These findings suggest that conceptual acoustic and action representations can also be unconsciously accessed, thereby excluding any postconceptual strategic processes. This study therefore further substantiates a grounding of conceptual and semantic processing in action and perception.	\N	\N
24002965	Dual-system models of visual category learning posit the existence of an explicit, hypothesis-testing reflective system, as well as an implicit, procedural-based reflexive system. The reflective and reflexive learning systems are competitive and neurally dissociable. Relatively little is known about the role of these domain-general learning systems in speech category learning. Given the multidimensional, redundant, and variable nature of acoustic cues in speech categories, our working hypothesis is that speech categories are learned reflexively. To this end, we examined the relative contribution of these learning systems to speech learning in adults. Native English speakers learned to categorize Mandarin tone categories over 480 trials. The training protocol involved trial-by-trial feedback and multiple talkers. Experiments 1 and 2 examined the effect of manipulating the timing (immediate vs. delayed) and information content (full vs. minimal) of feedback. Dual-system models of visual category learning predict that delayed feedback and providing rich, informational feedback enhance reflective learning, while immediate and minimally informative feedback enhance reflexive learning. Across the two experiments, our results show that feedback manipulations that targeted reflexive learning enhanced category learning success. In Experiment 3, we examined the role of trial-to-trial talker information (mixed vs. blocked presentation) on speech category learning success. We hypothesized that the mixed condition would enhance reflexive learning by not allowing an association between talker-related acoustic cues and speech categories. Our results show that the mixed talker condition led to relatively greater accuracies. Our experiments demonstrate that speech categories are optimally learned by training methods that target the reflexive learning system.	\N	\N
24003904	In a system where tens of thousands of words are made up of a limited number of phonemes, many words are bound to sound alike. This similarity of the words in the lexicon as characterized by phonological neighbourhood density (PhND) has been shown to affect speed and accuracy of word comprehension and production. Whereas there is a consensus about the interfering nature of neighbourhood effects in comprehension, the language production literature offers a more contradictory picture with mainly facilitatory but also interfering effects reported on word production. Here we report both of these two types of effects in the same study. Multiple regression mixed models analyses were conducted on PhND effects on errors produced in a naming task by a group of 21 participants with aphasia. These participants produced more formal errors (interfering effect) for words in dense phonological neighbourhoods, but produced fewer nonwords and semantic errors (a facilitatory effect) with increasing density. In order to investigate the nature of these opposite effects of PhND, we further analysed a subset of formal errors and nonword errors by distinguishing errors differing on a single phoneme from the target (corresponding to the definition of phonological neighbours) from those differing on two or more phonemes. This analysis confirmed that only formal errors that were phonological neighbours of the target increased in dense neighbourhoods, while all other errors decreased. Based on additional observations favouring a lexical origin of these formal errors (they exceeded the probability of producing a real-word error by chance, were of a higher frequency, and preserved the grammatical category of the targets), we suggest that the interfering effect of PhND is due to competition between lexical neighbours and target words in dense neighbourhoods.	\N	\N
24003982	This study employed Boothroyd and Nittrouer's k (1988) to directly quantify effectiveness in native versus non-native listeners' use of semantic cues. Listeners were presented speech-perception-in-noise sentences processed at three levels of concurrent multi-talker babble and reverberation. For each condition, 50 sentences with multiple semantic cues and 50 with minimum semantic cues were randomly presented. Listeners verbally reported and wrote down the target words. The metric, k, was derived from percent-correct scores for sentences with and without semantics. Ten native and 33 non-native listeners participated. The presence of semantics increased recognition benefit by over 250% for natives, but access to semantics remained limited for non-native listeners (90-135%). The k was comparable across conditions for native listeners, but level-dependent for non-natives. The k for non-natives was significantly different from 1 in all conditions, suggesting semantic cues, though reduced in importance in difficult conditions, were helpful for non-natives. Non-natives as a group were not as effective in using semantics to facilitate English sentence recognition as natives. Poor listening conditions were particularly adverse to the use of semantics in non-natives, who may rely on clear acoustic-phonetic cues before benefitting from semantic cues when recognizing connected speech.	\N	\N
24005532	It has been hypothesized that selective serotonin reuptake inhibitor (SSRI)-induced sexual dysfunction can occur more frequently in patients with higher central serotonergic activity, and that this higher serotonergic activity can induce inhibition of sexual desire, ejaculation, and orgasm. Thus, the aim of this study was to determine the relationship between SSRI-induced sexual dysfunction and increased serotonin. Event-related potentials for the loudness dependence of auditory evoked potentials (LDAEP) were measured in 46 patients at a single time point. The subjects' scores on the Hamilton Depression Rating Scale and Antidepressant Side-Effect Checklist were also determined by the investigators at the same time point. All patients had received SSRI monotherapy. Overall, 37% (17/46) of the patients experienced some form of SSRI-induced sexual dysfunction: lack of sexual desire, impotence, orgasm, and menstrual abnormality or mastalgia were experienced by 21.7, 8.3, 15.2, and 20.6% of the patients, respectively. The subjects were thus divided into two groups, those with and without sexual dysfunction, and their data were compared. There was a tendency for the LDAEP to be lower in the group with sexual dysfunction (1.04 ± 0.77 μV) than the group without sexual dysfunction (1.45 ± 0.86 μV), although the difference was not statistically significant (p = 0.086). Furthermore, the distribution of the frequency of SSRI-induced sexual dysfunction differed marginally significantly between patients with low and high LDAEP, dichotomized according to the median LDAEP on the Cz electrode (χ² = 3.664, p = 0.056). There was a relatively high frequency of SSRI-induced sexual dysfunction in patients with low LDAEP.	\N	\N
24009759	Recent studies employing speech stimuli to investigate 'cocktail-party' listening have focused on entrainment of cortical activity to modulations at syllabic (5 Hz) and phonemic (20 Hz) rates. The data suggest that cortical modulation filters (CMFs) are dependent on the sound-frequency channel in which modulations are conveyed, potentially underpinning a strategy for separating speech from background noise. Here, we characterize modulation filters in human listeners using a novel behavioral method. Within an 'inverted' adaptive forced-choice increment detection task, listening level was varied whilst contrast was held constant for ramped increments with effective modulation rates between 0.5 and 33 Hz. Our data suggest that modulation filters are tonotopically organized (i.e., vary along the primary, frequency-organized, dimension). This suggests that the human auditory system is optimized to track rapid (phonemic) modulations at high sound-frequencies and slow (prosodic/syllabic) modulations at low frequencies.	\N	\N
24022792	Processing multiple complex features to create cohesive representations of objects is an essential aspect of both the visual and auditory systems. It is currently unclear whether these processes are entirely modality specific or whether there are amodal processes that contribute to complex object processing in both vision and audition. We investigated this using a dual-stream target detection task in which two concurrent streams of novel visual or auditory stimuli were presented. We manipulated the degree to which each stream taxed processing conjunctions of complex features. In two experiments, we found that concurrent visual tasks that both taxed conjunctive processing strongly interfered with each other but that concurrent auditory and visual tasks that both taxed conjunctive processing did not. These results suggest that resources for processing conjunctions of complex features within vision and audition are modality specific.	\N	\N
24023379	Effects of clicks and tonebursts on early and late auditory middle latency response (AMLR) components were evaluated in young and older cigarette smokers and nonsmokers. Participants (n = 49) were categorized by smoking and age into 4 groups: (a) older smokers, (b) older nonsmokers, (c) young smokers, and (d) young nonsmokers. Monaural, 2-channel AMLRs were acquired from Fz and Cz electrodes with 3 stimuli (clicks, 500 Hz, and 3000 Hz). Group differences included significantly higher V-Na amplitude in young adults and shorter Pb latency in older nonsmokers. Young smokers had a significantly higher Nb-Pb amplitude and shorter Nb latency than other groups. Toneburst stimuli yielded significantly longer V, Na, and Pa latencies compared to clicks. Pb latency was shorter at Fz than at Cz. Relative amplitudes were significantly higher at Fz than at Cz overall; Pa-Nb and Nb-Pb were significantly lower for 3000 Hz than for 500 Hz and clicks. Responses from young smokers revealed a higher amplitude and shorter latency for later AMLR waves, reflecting an arousal effect of smoking in cortical and subcortical generators. AMLR differences in older adults may be due to age-related neurochemical changes in the central nervous system. Stimulus and electrode differences plus smoking and aging effects can guide neurodiagnostic AMLR protocols, especially in young adult smokers.	\N	\N
24026024	To standardize the information for families of children having functional surgery for middle ear malformations, we describe the audiometric results of the subgroup of patients with the most favorable anatomic conditions: viable auditory canal, intact tympanic membrane, mobile stapes, and corresponding to a Jahrsdoerfer score of 8 or higher. Case series, tertiary referral center. Charts of patients undergoing functional surgery for congenital middle ear malformations were reviewed for demographic data, preoperative Jahrsdoerfer score, ossicular chain status, type of ossiculoplasty, and audiometric data before and 6 months postsurgery. Eighteen consecutive interventions were performed on 13 patients (average age of 9 years, 8 girls and 5 boys) between 2004 and 2011. The ossiculoplasties performed were as follows: incus repositioning (4), double-layer tragal cartilage (5), intact native chain reconstruction (3), and partial ossicular prosthesis (6). Mean air bone gap (ABG) was 40.8 ± 12.4 dB preoperatively and 20.9 ± 12.9 dB postoperatively (p < 0.0001). Mean preoperative and postoperative air conduction PTA thresholds were 49.9 ± 9.5 and 30.0 ± 14.1 dB, respectively (p < 0.0001). All ears operated on except one had air conduction improvement. There were no complications. Functional surgery for congenital middle ear malformations gives variable hearing outcomes. In this study, with the most favorable anatomic conditions, 12 ears (67%) of 18 had air conduction improvement below 30 dB.	\N	\N
24028890	Most hearing aid prescriptions focus on the optimization of a metric derived from the long-term average spectrum of speech, and do not consider how the prescribed values might distort the temporal envelope shape. A growing body of evidence suggests that such distortions can lead to systematic errors in speech perception, and therefore hearing aid prescriptions might benefit by including preservation of the temporal envelope shape in their rationale. To begin to explore this possibility, we designed a genetic algorithm (GA) to find the multiband compression settings that preserve the shape of the original temporal envelope while placing that envelope in the listener's audiometric dynamic range. The resulting prescription had a low compression threshold, short attack and release times, and a combination of compression ratio and gain that placed the output signal within the listener's audiometric dynamic range. Initial behavioral tests of individuals with impaired hearing revealed no difference in speech-in-noise perception between the GA and the NAL-NL2 prescription. However, gap detection performance was superior with the GA in comparison to NAL-NL2. Overall, this work is a proof of concept that consideration of temporal envelope distortions can be incorporated into hearing aid prescriptions.	\N	\N
24041778	Congenital amusia is a neuro-developmental disorder of music perception and production. Recent findings have demonstrated that this deficit is linked to an impaired short-term memory for tone sequences. As it has been shown before that non-musicians' implicit knowledge of musical regularities can improve short-term memory for tone information, the present study investigated if this type of implicit knowledge could also influence amusics' short-term memory performance. Congenital amusics and their matched controls, who were non-musicians, had to indicate whether sequences of five tones, presented in pairs, were the same or different; half of the pairs respected musical regularities (tonal sequences) and the other half did not (atonal sequences). As previously reported for non-musician participants, the control participants showed better performance (as measured with d') for tonal sequences than for atonal ones. While this improvement was not observed in amusics, both control and amusic participants showed faster response times for tonal sequences than for atonal sequences. These findings suggest that some implicit processing of tonal structures is potentially preserved in congenital amusia. This observation is encouraging as it strengthens the prospect of exploiting implicit knowledge to help reduce pitch perception and memory deficits in amusia.	\N	\N
24043402	Our previous studies using fMRI have demonstrated that activations in human auditory cortex (AC) are strongly dependent on the characteristics of the task. The present study tested whether source estimation of scalp-recorded event-related potentials (ERPs) can be used to investigate task-dependent AC activations. Subjects were presented with frequency-varying two-part tones during pitch discrimination, pitch n-back memory, and visual tasks identical to our previous fMRI study (Rinne et al., J Neurosci 29:13338-13343, 2009). ERPs and their minimum-norm source estimates in AC were strongly modulated by task at 200-700 ms from tone onset. As in the fMRI study, the pitch discrimination and pitch memory tasks were associated with distinct AC activation patterns. In the pitch discrimination task, increased activity in the anterior AC was detected relatively late at 300-700 ms from tone onset. Therefore, this activity was probably not associated with enhanced pitch processing but rather with the actual discrimination process (comparison between the two parts of tone). Increased activity in more posterior areas associated with the pitch memory task, in turn, occurred at 200-700 ms suggesting that this activity was related to operations on pitch categories after pitch analysis was completed. Finally, decreased activity associated with the pitch memory task occurred at 150-300 ms consistent with the notion that, in the demanding pitch memory task, spectrotemporal analysis is actively halted as soon as category information has been obtained. These results demonstrate that ERP source analysis can be used to complement fMRI to investigate task-dependent activations of human AC.	\N	\N
24043565	What conditions, if any, can fully prevent attentional capture (i.e., involuntary allocation of spatial attention to an irrelevant object) has been a matter of debate. In a previous study, Folk, Ester, and Troemel (Psychonomic Bulletin & Review 16:127-132, 2009) suggested that attentional capture can be blocked entirely when attention is already engaged in a different object. This conclusion relied on the finding that in a search for a known-color target in a rapid serial visual presentation stream, a peripheral distractor with the target color did not further impair target identification performance when a distractor also with the target color that appeared in the stream had already captured attention. In the present study, we argue that this conclusion is unwarranted, because the effects of the central and peripheral distractors could not be disentangled. In order to isolate the effect of the peripheral distractor, we introduced a distractor-target letter compatibility manipulation. Our results showed that the peripheral distractor summoned attention, irrespective of whether attention had just been engaged. We conclude that neither spatially focused attention nor attentional engagement is sufficient to prevent attentional capture.	\N	\N
24055624	This study investigated the perceptual relationship between acoustic and electric stimuli presented to CI users with functional contralateral hearing. Fourteen subjects with unilateral profound deafness implanted with a MED-EL CI scaled the perceptual differences between pure tones presented to the acoustic hearing ear and electric biphasic pulse trains presented to the implanted ear. The differences were analyzed with a multidimensional scaling (MDS) analysis. Additionally, speech performance in noise was tested using sentence material presented in different spatial configurations while patients listened with both their acoustic hearing and implanted ears. Results of alternating least squares scaling (ALSCAL) analysis consistently demonstrate that a change in place of stimulation is in the same perceptual dimension as a change in acoustic frequency. However, the relative perceptual differences between the acoustic and the electric stimuli varied greatly across subjects. A degree of perceptual separation between acoustic and electric stimulation (quantified by relative dimensional weightings from an INDSCAL analysis) was hypothesized that would indicate a change in perceptual quality, but also be predictive of performance with combined acoustic and electric hearing. Perceptual separation between acoustic and electric stimuli was observed for some subjects. However, no relationship between the degree of perceptual separation and performance was found.	\N	\N
24059596	To establish the reliability and validity of an automated hearing screening test system for children. Cross-sectional, within-subjects comparative study. Subjects were 325 first-grade and second-grade children (6-10 years old) from primary schools in Shenzhen, China. Using the conventional pure-tone screening test, with the pass/refer criterion set at 25 dB HL, as the 'gold standard', the sensitivity and specificity of the automated hearing screening test were 0.63 and 0.82, respectively. No specific pattern in the failure rates was observed to relate to the students' grade. There was no statistically significant age effect or gender effect. The results suggest that with further improvement in terms of its sensitivity and specificity, it may be feasible to use the automated hearing screening test system to conduct routine school hearing screenings.	\N	\N
24067501	To confirm an increased susceptibility to informational masking among individuals with single-sided deafness (SSD). To demonstrate a reduction in informational masking when SSD is treated with an integrated bone conduction hearing aid (IBC). To identify the acoustic cues that contribute to IBC-aided masking release. To determine the effects of device experience on the IBC advantage. Informational masking was evaluated with the coordinate-response measure. Participants performed the task by reporting color and number coordinates that changed randomly within target sentences. The target sentences were presented in free field accompanied by zero to three distracting sentences. Target and distracting sentences were spoken by different talkers and originated from different source locations, creating two sources of information for auditory streaming. Susceptibility to informational masking was inferred from the error rates of unaided SSD patients relative to normal controls. These baseline measures were derived by testing inexperienced IBC users without the device on the day of their initial fitting. The benefits of IBC-aided listening were assessed by measuring the aided performance of users who had at least 3 months' device experience. The acoustic basis of the listening advantage was isolated by correlating response errors with the voice pitch and location of distracting sentences. The effects of learning on cue effectiveness were evaluated by comparing the error rates of experienced and inexperienced users. Unaided SSD participants (inexperienced users) performed as well as normal controls when tested without distracting sentences but produced significantly higher error rates when tested with distracting sentences. Most errors involved responding with coordinates that were contained in distracting sentences. This increased susceptibility to informational masking was significantly reduced when experienced IBC users were tested with the device. 
The listening advantage was most strongly correlated with the availability of voice pitch cues, although performance was also influenced by the location of distracting sentences. Directional asymmetries appear to be dictated by location-dependent cues that are derived from the distinctive transmission characteristics of IBC stimulation. Experienced users made better use of these cues than inexperienced users. These results suggest that informational masking is a significant source of communication impairment among individuals with SSD. Despite the lateralization of auditory function, unaided SSD subjects experience informational masking when distractors occur in either the deaf or normal spatial hemifield. Restoration of aural sensitivity in the deaf hemifield with an IBC enhances speech intelligibility under complex listening conditions, presumably by providing additional sound-segregation cues that are derived from voice pitch and spatial location. The optimal use of these cues is not immediate, but a significant listening advantage is observed after 3 months of unstructured use.	\N	\N
24071587	Stereo vision has a well-known anisotropy: At low frequencies, horizontally oriented sinusoidal depth corrugations are easier to detect than vertically oriented corrugations (both defined by horizontal disparities). Previously, Serrano-Pedraza and Read (2010) suggested that this stereo anisotropy may arise because the stereo system uses multiple spatial-frequency disparity channels for detecting horizontally oriented modulations but only one for vertically oriented modulations. Here, we tested this hypothesis using the critical-band masking paradigm. In the first experiment, we measured disparity thresholds for horizontal and vertical sinusoids near the peak of the disparity sensitivity function (0.4 cycles/°), in the presence of either broadband or notched noise. We fitted the power-masking model to our results assuming a channel centered on 0.4 cycles/°. The estimated channel bandwidths were 2.95 octaves for horizontal and 2.62 octaves for vertical corrugations. In our second experiment we measured disparity thresholds for horizontal and vertical sinusoids of 0.1 cycles/° in the presence of band-pass noise centered on 0.4 cycles/° with a bandwidth of 0.5 octaves. This mask had only a small effect on the disparity thresholds, for either horizontal or vertical corrugations. We simulated the detection thresholds using the power-masking model with the parameters obtained in the first experiment and assuming either single-channel or multiple-channel detection. The multiple-channel model predicted the thresholds much better for both horizontal and vertical corrugations. We conclude that the human stereo system must contain multiple independent disparity channels for detecting horizontally oriented and vertically oriented depth modulations.	\N	\N
24073696	To investigate the effects of emotional music on visual processes, we analyzed visual evoked magnetic fields (VEF) on listening to emotional music in 14 healthy subjects. Positive and negative pieces of music were delivered during VEF recording following stimulation by emotionally neutral pictures of faces and landscapes. VEF components at 100 (M100) and 150 (M170) ms after stimulus onset were analyzed, and the estimated current strength for M170 following face stimulation was enhanced with negative compared to positive music in the right hemisphere. The equivalent current dipole for M100 and M170 was estimated in the primary visual cortex (V1) and inferior temporal area (IT), respectively. The present results indicate that background music showed a top-down control of the visual processes in IT, which is a core site responsible for the interpretation of facial expression. The emotional contents of music could alter visual processes, especially those involving the face.	\N	\N
24076424	For much of the past 30 years, investigations of auditory perception and language have been enhanced or even driven by the use of functional neuroimaging techniques that specialize in localization of central responses. Beginning with investigations using positron emission tomography (PET) and gradually shifting primarily to usage of functional magnetic resonance imaging (fMRI), auditory neuroimaging has greatly advanced our understanding of the organization and response properties of brain regions critical to the perception of and communication with the acoustic world in which we live. As the complexity of the questions being addressed has increased, the techniques, experiments and analyses applied have also become more nuanced and specialized. A brief review of the history of these investigations sets the stage for an overview and analysis of how these neuroimaging modalities are becoming ever more effective tools for understanding the auditory brain. We conclude with a brief discussion of open methodological issues as well as potential clinical applications for auditory neuroimaging. This article is part of a Special Issue entitled Human Auditory Neuroimaging.	\N	\N
24076425	Harmonic complexes that generate highly modulated temporal envelopes on the basilar membrane (BM) mask a tone less effectively than complexes that generate relatively flat temporal envelopes, because the non-linear active gain of the BM selectively amplifies a low-level tone in the dips of a modulated masker envelope. The present study examines a similar effect in speech recognition. Speech reception thresholds (SRTs) were measured for a voice masked by harmonic complexes with partials in sine phase (SP) or in random phase (RP). The masker's fundamental frequency (F0) was 50, 100 or 200 Hz. SRTs were considerably lower for SP than for RP maskers at 50-Hz F0, but the two converged at 100-Hz F0, while at 200-Hz F0, SRTs were a little higher for SP than RP maskers. The results were similar whether the target voice was male or female and whether the masker's spectral profile was flat or speech-shaped. Although listening in the masker dips has been shown to play a large role for artificial stimuli such as Schroeder-phase complexes at high levels, it contributes weakly to speech recognition in the presence of harmonic maskers with different crest factors at more moderate sound levels (65 dB SPL).	\N	\N
24086676	The auditory illusory perception "scale illusion" occurs when a tone of ascending scale is presented in one ear, a tone of descending scale is presented simultaneously in the other ear, and vice versa. Most listeners hear illusory percepts of smooth pitch contours of the higher half of the scale in the right ear and the lower half in the left ear. Little is known about neural processes underlying the scale illusion. In this magnetoencephalographic study, we recorded steady-state responses to amplitude-modulated short tones having illusion-inducing pitch sequences, where the sound level of the modulated tones was manipulated to decrease monotonically with increase in pitch. The steady-state responses were decomposed into right- and left-sound components by means of separate modulation frequencies. It was found that the time course of the magnitude of response components of illusion-perceiving listeners was significantly correlated with smooth pitch contour of illusory percepts and that the time course of response components of stimulus-perceiving listeners was significantly correlated with discontinuous pitch contour of stimulus percepts in addition to the contour of illusory percepts. The results suggest that the percept of illusory pitch sequence was represented in the neural activity in or near the primary auditory cortex, i.e., the site of generation of auditory steady-state response, and that perception of scale illusion is maintained by automatic low-level processing.	\N	\N
24089491	The strategies by which the central nervous system decodes the properties of sensory stimuli, such as sound source location, from the responses of a population of neurons are a matter of debate. We show, using the average firing rates of neurons in the inferior colliculus (IC) of awake rabbits, that prevailing decoding models of sound localization (summed population activity and the population vector) fail to localize sources accurately due to heterogeneity in azimuth tuning across the population. In contrast, a maximum-likelihood decoder operating on the pattern of activity across the population of neurons in one IC accurately localized sound sources in the contralateral hemifield, consistent with lesion studies, and did so with a precision consistent with rabbit psychophysical performance. The pattern decoder also predicts behavior in response to incongruent localization cues consistent with the long-standing "duplex" theory of sound localization. We further show that the pattern decoder accurately distinguishes two concurrent, spatially separated sources from a single source, consistent with human behavior. Decoder detection of small amounts of source separation directly in front is due to neural sensitivity to the interaural decorrelation of sound, at both low and high frequencies. The distinct patterns of IC activity between single and separated sound sources thereby provide a neural correlate for the ability to segregate and localize sources in everyday, multisource environments.	\N	\N
24095845	Pitch is derived by the auditory system through complex spectrotemporal processing. Pitch extraction is thought to depend on both spectral cues arising from lower harmonics that are resolved by cochlear filters in the inner ear, and on temporal cues arising from the pattern of action potentials contained in the cochlear output. Adults are capable of extracting pitch in the absence of robust spectral cues, taking advantage of the temporal cues that remain. However, recent behavioral evidence suggests that infants have difficulty discriminating between stimuli with different pitches when resolvable spectral cues are absent. In the current experiments, we used the mismatch negativity (MMN) component of the event-related potential derived from electroencephalographic (EEG) recordings to examine a cortical representation of pitch discrimination for iterated rippled noise (IRN) stimuli in 4- and 8-month-old infants. IRN stimuli are pitch-evoking sounds generated by repeatedly adding a segment of white noise to itself at a constant delay. We created IRN stimuli (delays of 5 and 6 ms creating pitch percepts of 200 and 167 Hz) and high-pass filtered them to remove all resolvable spectral pitch cues. In Experiment 1, we did not find EEG evidence that infants could detect the change in the pitch of these IRN stimuli. However, in Experiment 2, after a brief period of pitch-priming during which we added a sine wave component to the IRN stimulus at its perceived pitch, infants did show significant MMN in response to pitch changes in the IRN stimuli with sine waves removed. This suggests that (1) infants can use temporal cues to process pitch, although such processing is not mature, and (2) a short amount of pitch-priming experience can alter pitch representations in auditory cortex during infancy.	\N	\N
24105268	This study examined the ability of click auditory brainstem response (ABR) undertaken below the age of 6 months (from expected date of delivery) to differentiate between conductive and sensorineural hearing loss (SNHL), using the latency of wave V measured 20 dB above threshold. Subjects were recruited if they had an ABR threshold of ≥ 40 dB nHL and ≤ 70 dB nHL in one or both ears measured below the age of 6 months and they had also attended follow-up appointments for behavioral assessment of their hearing in which the type of hearing loss had been confirmed. Forty-five children (84 ears) with SNHL, 82 children (141 ears) with temporary conductive hearing loss (TCHL), and 5 children (10 ears) with permanent conductive hearing loss (PCHL) were recruited. The differences between mean wave V latencies measured 20 dB above ABR threshold were examined using the independent t-test for the groups of cases with SNHL, TCHL, and PCHL. Signal-detection theory was used to examine the relationship between sensitivity and specificity when the latency of wave V 20 dB above threshold was used to identify the presence of SNHL. Receiver operating characteristics were generated and the coordinates of the curve examined for the best compromise between sensitivity and false-alarm rate. The specificity, positive predictive value, and probability of missing a true case were determined for the most promising criteria. There were significant differences between the two groups with SNHL and TCHL. The mean latency of wave V 20 dB above threshold was 1 msec shorter in those with SNHL compared with those with TCHL. There were significant differences between children with PCHL and SNHL but no difference between those with PCHL and TCHL. When a criterion of < 7.6 msec was chosen to predict the presence of SNHL the test sensitivity was 0.98, test specificity 0.71, and positive predictive value was 0.66. 
Nine out of 10 of those with a latency 20 dB above threshold of < 7.0 msec had an SNHL. The latency of wave V 20 dB above threshold measured using click ABR is a useful indicator of the type of hearing loss in babies referred from newborn hearing screening.	\N	\N
24108804	Synchronizing movements with rhythmic inputs requires tight coupling of sensory and motor neural processes. Here, using a novel approach based on the recording of steady-state-evoked potentials (SS-EPs), we examine how distant brain areas supporting these processes coordinate their dynamics. The electroencephalogram was recorded while subjects listened to a 2.4-Hz auditory beat and tapped their hand on every second beat. When subjects tapped to the beat, the EEG was characterized by a 2.4-Hz SS-EP compatible with beat-related entrainment and a 1.2-Hz SS-EP compatible with movement-related entrainment, based on the results of source analysis. Most importantly, when compared with passive listening of the beat, we found evidence suggesting an interaction between sensory- and motor-related activities when subjects tapped to the beat, in the form of (1) additional SS-EP appearing at 3.6 Hz, compatible with a nonlinear product of sensorimotor integration; (2) phase coupling of beat- and movement-related activities; and (3) selective enhancement of beat-related activities over the hemisphere contralateral to the tapping, suggesting a top-down effect of movement-related activities on auditory beat processing. Taken together, our results are compatible with the view that rhythmic sensorimotor synchronization is supported by a dynamic coupling of sensory and motor related activities.	\N	\N
24110502	Bone-conducted ultrasound (BCU) is perceived even by the profoundly sensorineural deaf. A novel hearing aid using the perception of amplitude-modulated BCU (BCU hearing aid: BCUHA) has been developed. However, there is room for improvement, particularly in terms of articulation and sound quality. BCU speech is accompanied by a strong high-pitched tone and contains some distortion. In this study, transposed modulation, which can be expected to reduce the high-pitched tone, was newly employed as a modulation method in the BCUHA, and its resulting articulation, intelligibility, and sound quality were evaluated. The results showed that transposed modulation yielded articulation and intelligibility scores nearly equal to, and sound quality better than, those of the existing method, DSB-TC modulation. These results provide useful information for further development of the BCUHA.	\N	\N
24111102	Auditory Brainstem Responses (ABRs) are commonly used in clinical practice to determine hearing impairments and hearing thresholds. Although many research groups work on automatic recognition of ABRs - in order to decrease the acquisition times - measures to determine the quality of ABR measurements objectively are still missing. In fact, recently released new standards for electroencephalographic measurements in auditory examinations require an objective measurement quality assessment for neurodiagnostic devices. Thus there is a pressing need for the development and evaluation of such a quality control. In this study, we (a) propose a novel technique for the assessment of the ABR measurement quality and (b) evaluate and compare this technique to two other approaches which have been suggested in the literature, as required by the new standards.	\N	\N
24116424	Speech reception thresholds were obtained in normally hearing listeners for sentence targets masked by harmonic complexes constructed with different phase relationships. Maskers had either a constant fundamental frequency (F0), or had F0 changing over time, following a pitch contour extracted from natural speech. The median F0 of the target speech was very similar to that of the maskers. In experiment 1 differences in the masking produced by Schroeder positive and Schroeder negative phase complexes were small (around 1.5 dB) for moderate levels [60 dB sound pressure level (SPL)], but increased to around 6 dB for maskers at 80 dB SPL. Phase effects were typically around 1.5 dB larger for maskers that had naturally varying F0 contours than for maskers with constant F0. Experiment 2 showed that shaping the long-term spectrum of the maskers to match the target speech had no effect. Experiment 3 included additional phase relationships at moderate levels and found no effect of phase. Therefore, the phase relationship within harmonic complexes appears to have only minor effects on masking effectiveness, at least at moderate levels, and when targets and maskers are in the same F0 range.	\N	\N
24121087	Functional hemispheric differences for speech and language processing have been traditionally studied by using verbal dichotic-listening paradigms. The commonly observed right-ear preference for the report of dichotically presented syllables is taken to reflect the left hemispheric dominance for speech processing. However, the results of recent functional imaging studies also show that both hemispheres - not only the left - are engaged by dichotic listening, suggesting a more complex relationship between behavioral laterality and functional hemispheric activation asymmetries. In order to more closely examine the hemispheric differences underlying dichotic-listening performance, we report an analysis of functional magnetic resonance imaging (fMRI) data of 104 right-handed subjects, for the first time combining an interhemispheric difference and conjunction analysis. This approach allowed for a distinction of homotopic brain regions which showed symmetrical (i.e., brain region significantly activated in both hemispheres and no activation difference between the hemispheres), relative asymmetrical (i.e., activated in both hemispheres but significantly stronger in one than the other hemisphere), and absolute asymmetrical activation patterns (i.e., activated only in one hemisphere and this activation is significantly stronger than in the other hemisphere). Symmetrical activation was found in large clusters encompassing temporal, parietal, inferior frontal, and medial superior frontal regions. Relative and absolute left-ward asymmetries were found in the posterior superior temporal gyrus, located adjacent to symmetrically activated areas, and creating a lateral-medial gradient from symmetrical towards absolute asymmetrical activation within the peri-Sylvian region. Absolute leftward asymmetry was also found in the post-central and medial superior frontal gyri, while rightward asymmetries were found in middle temporal and middle frontal gyri. 
We conclude that dichotic listening engages a bihemispheric cortical network, showing a symmetrical and mostly leftward asymmetrical pattern. The functional (a)symmetry map obtained here might serve as a basis for future studies which - by studying the relevance of the regions identified here - clarify the relationship between behavioral laterality measures and hemispheric asymmetry.	\N	\N
24121711	The purpose of this study was to investigate the influence of contingent auditory feedback on the development of infant reaching. Eleven full-term infants were observed biweekly from the age of 10 weeks to 16 weeks, and their arm kinematics were recorded. Auditory feedback that was contingent on arm kinematics was provided in the form of: (a) the mother's voice; and (b) musical tones. Results showed that providing auditory feedback (mother's voice or musical tones): (i) increased the amplitude of exploratory arm movements before the onset of reaching; and (ii) increased the number of reaches at the onset of reaching. These results show that infants are able to use contingent auditory feedback to explore the relevant possibilities for action that are subsequently shaped into goal-directed movements.	\N	\N
24122619	To determine the effectiveness of simultaneous versus sequential bilateral cochlear implantation on postoperative outcomes in children with bilateral deafness and to evaluate the impact of the inter-implant interval and age at second implantation on postoperative outcomes in children who already received their first cochlear implant. PubMed, Embase, and Web of Science. All studies comparing the effects of simultaneous with sequential bilateral cochlear implantation on postoperative outcomes and those evaluating the impact of the inter-implant interval and age at second implantation were retrieved. Four studies compared the effects of simultaneous with sequential bilateral cochlear implantation. All studies lacked randomization. Of these, three reported better speech perception and expressive language development at one year of bilateral experience for simultaneous cochlear implantation. Of the nineteen publications on the impact of the inter-implant interval on postoperative outcomes, the risk of bias was low-moderate for seven studies which were derived from five different study populations. In two of these populations no impact of the inter-implant interval was found, while in three a longer inter-implant interval was associated with poorer speech and language development. Observational studies suggest that simultaneous implantation in children may be associated with improved speech and language development, and that a prolonged inter-implant interval between both implantations may have a negative impact on these postoperative outcomes. Randomized trials are, however, needed to demonstrate whether simultaneous implantation indeed is superior to sequential bilateral implantation in children with bilateral deafness. NA.	\N	\N
24125574	It is not unusual to find it stated as a fact that the left hemisphere is specialized for the processing of rapid, or temporal aspects of sound, and that the dominance of the left hemisphere in the perception of speech can be a consequence of this specialization. In this review we explore the history of this claim and assess the weight of this assumption. We will demonstrate that instead of a supposed sensitivity of the left temporal lobe for the acoustic properties of speech, it is the right temporal lobe which shows a marked preference for certain properties of sounds, for example longer durations, or variations in pitch. We finish by outlining some alternative factors that contribute to the left lateralization of speech perception.	\N	\N
24125858	Auditory verbal hallucinations (AVHs) are the experience of hearing voices in the absence of any speaker, often associated with a schizophrenia diagnosis. Prominent cognitive models of AVHs suggest they may be the result of inner speech being misattributed to an external or non-self source, due to atypical self- or reality monitoring. These arguments are supported by studies showing that people experiencing AVHs often show an externalising bias during monitoring tasks, and neuroimaging evidence which implicates superior temporal brain regions, both during AVHs and during tasks that measure verbal self-monitoring performance. Recently, efficacy of noninvasive neurostimulation techniques as a treatment option for AVHs has been tested. Meta-analyses show a moderate effect size in reduction of AVH frequency, but there has been little attempt to explain the therapeutic effect of neurostimulation in relation to existing cognitive models. This article reviews inner speech models of AVHs, and argues that a possible explanation for reduction in frequency following treatment may be modulation of activity in the brain regions involving the monitoring of inner speech.	\N	\N
24132709	Three cross-modal priming experiments examined the influence of preexposure to pictures and printed words on the speed of spoken word recognition. Targets for auditory lexical decision were spoken Dutch words and nonwords, presented in isolation (Experiments 1 and 2) or after a short phrase (Experiment 3). Auditory stimuli were preceded by primes, which were pictures (Experiments 1 and 3) or those pictures' printed names (Experiment 2). Prime-target pairs were phonologically onset related (e.g., pijl-pijn, arrow-pain), were from the same semantic category (e.g., pijl-zwaard, arrow-sword), or were unrelated on both dimensions. Phonological interference and semantic facilitation were observed in all experiments. Priming magnitude was similar for pictures and printed words and did not vary with picture viewing time or number of pictures in the display (either one or four). These effects arose even though participants were not explicitly instructed to name the pictures and where strategic naming would interfere with lexical decision making. This suggests that, by default, processing of related pictures and printed words influences how quickly we recognize spoken words.	\N	\N
24165303	Distortion-product otoacoustic emissions (DPOAEs) collected after sound pressure level (SPL) calibration are susceptible to standing waves that affect measurements at the plane of the probe microphone due to overlap of incident and reflected waves. These standing-wave effects can be as large as 20 dB, and may affect frequencies both above and below 4 kHz. It has been shown that forward pressure level (FPL) calibration minimizes standing-wave effects by isolating the forward-propagating component of the stimulus. Yet, previous work has failed to demonstrate more than a small difference in test performance and behavioral-threshold prediction with DPOAEs after SPL and FPL calibration. One potential limitation in prior studies is that measurements were restricted to octave and interoctave frequencies; as a consequence, data were not necessarily collected at the standing-wave null frequency. In the present study, DPOAE responses were measured with f2 set to each participant's standing-wave frequency in an effort to increase the possibility that differences in test performance and threshold prediction would be observed for SPL and FPL calibration methods. Data were collected from 42 normal-hearing participants and 93 participants with hearing loss. DPOAEs were measured with f2 set to 4 kHz and at each participant's notch frequency after SPL and FPL calibration. DPOAE input/output functions were obtained from -10 to 80 dB in 5 dB steps for each calibration/stimulus condition. Test performance was evaluated using clinical decision theory. Both area under receiver operating characteristic curves for all stimulus levels and cumulative distributions when L2 = 50 dB (a level at which the best performance was observed regardless of calibration method) were used to evaluate the accuracy with which auditory status was determined. A bootstrap procedure was used to evaluate the significance of the differences in test performance between SPL and FPL calibrations. 
DPOAE predictions of behavioral threshold were evaluated by correlating actual behavioral thresholds and predicted thresholds using a multiple linear regression model. First, larger DPOAE levels were measured after SPL calibration than after FPL calibration, which demonstrated the expected impact of standing waves. Second, for both FPL and SPL calibration, test performance was best for moderate stimulus levels. Third, differences in test performance between calibration methods were evident at low- and high-stimulus levels. Fourth, there were small but statistically significant improvements in test performance after FPL calibration for clinically relevant conditions. Fifth, calibration method had no effect on threshold prediction. Standing waves after SPL calibration have an impact on DPOAE levels. Although the effect of calibration method on test performance was small, test performance was better after FPL calibration than after SPL calibration. There was no effect of calibration method on predictions of behavioral threshold.	\N	\N
24167235	A phonological deficit is thought to affect most individuals with developmental dyslexia. The present study addresses whether the phonological deficit is caused by difficulties with perceptual learning of fine acoustic details. A demanding test of nonverbal auditory memory, "noise learning," was administered to both adults with dyslexia and control adult participants. On each trial, listeners had to decide whether a stimulus was a 1-s noise token or 2 abutting presentations of the same 0.5-s noise token (repeated noise). Without the listener's knowledge, the exact same noise tokens were presented over many trials. An improved ability to perform the task for such "reference" noises reflects learning of their acoustic details. Listeners with dyslexia did not differ from controls in any aspect of the task, qualitatively or quantitatively. They required the same amount of training to achieve discrimination of repeated from nonrepeated noises, and they learned the reference noises as often and as rapidly as the control group. However, they did show all the hallmarks of dyslexia, including a well-characterized phonological deficit. The data did not support the hypothesis that deficits in basic auditory processing or nonverbal learning and memory are the cause of the phonological deficit in dyslexia.	\N	\N
24174656	Despite the prevalence of poverty worldwide, little is known about how early socioeconomic adversity affects auditory brain function. Socioeconomically disadvantaged children are underexposed to linguistically and cognitively stimulating environments and overexposed to environmental toxins, including noise pollution. This kind of sensory impoverishment, we theorize, has extensive repercussions on how the brain processes sound. To characterize how this impoverishment affects auditory brain function, we compared two groups of normal-hearing human adolescents who attended the same schools and who were matched in age, sex, and ethnicity, but differed in their maternal education level, a correlate of socioeconomic status (SES). In addition to lower literacy levels and cognitive abilities, adolescents from lower maternal education backgrounds were found to have noisier neural activity than their classmates, as reflected by greater activity in the absence of auditory stimulation. Additionally, in the lower maternal education group, the neural response to speech was more erratic over repeated stimulation, with lower fidelity to the input signal. These weaker, more variable, and noisier responses are suggestive of an inefficient auditory system. By studying SES within a neuroscientific framework, we have the potential to expand our understanding of how experience molds the brain, in addition to informing intervention research aimed at closing the achievement gap between high-SES and low-SES children.	\N	\N
24180796	This article investigates the relationship between the shape of the mouthpiece and its acoustical properties in brass instruments. The hypothesis is that not only different volumes but also particular cup shapes affect the embouchure and the tone quality in both a physical and perceivable way. Three professional trumpet players were involved, and two different internal cup contours characterized by a "U" and a "V" shape with two types of throat junction (round and sharp) were chosen, based on a Vincent Bach 1 [1/2] C medium mouthpiece. A third intermediate contour was designed as a combination of these. Over 600 sound samples were produced under controlled conditions, the study involving four different stages: (1) Simulation of air-flow, (2) analysis of the sound spectra, (3) study of the players' subjective responses, and (4) perceptual analysis of their timbral differences. Results confirm the U shape is characterized by a stronger air recirculation and produces stronger spectral components above 8 kHz, compared to the V shape. A round throat junction may also be preferable to a sharp one in terms of playability. There is moderate agreement on the aural perception of these differences although the verbal attributes used to qualify these are not shared.	\N	\N
24181980	The role of visual cues in native listeners' perception of speech produced by nonnative speakers has not been extensively studied. Native perception of English sentences produced by native English and Korean speakers in audio-only and audiovisual conditions was examined. Korean speakers were rated as more accented in audiovisual than in the audio-only condition. Visual cues enhanced word intelligibility for native English speech but less so for Korean-accented speech. Reduced intelligibility of Korean-accented audiovisual speech was associated with implicit visual biases, suggesting that listener-related factors partially influence the efficiency of audiovisual integration for nonnative speech perception.	\N	\N
24184174	Expert musicians are able to time their actions accurately and consistently during a musical performance. We investigated how musical expertise influences the ability to reproduce auditory intervals and how this generalises across different techniques and sensory modalities. We first compared various reproduction strategies and interval lengths, to examine the effects in general and to optimise experimental conditions for testing the effect of music, and found that the effects were robust and consistent across different paradigms. Focussing on a 'ready-set-go' paradigm, subjects reproduced time intervals drawn from distributions varying in total length (176, 352 or 704 ms) or in the number of discrete intervals within the total length (3, 5, 11 or 21 discrete intervals). Overall, Musicians performed more veridically than Non-Musicians, and all subjects reproduced auditory-defined intervals more accurately than visually-defined intervals. However, Non-Musicians, particularly with visual stimuli, consistently exhibited a substantial and systematic regression towards the mean interval. When subjects judged intervals from distributions of longer total length they tended to regress more towards the mean, while the ability to discriminate between discrete intervals within the distribution had little influence on subject error. These results are consistent with a Bayesian model that minimizes reproduction errors by incorporating a central tendency prior weighted by the subject's own temporal precision relative to the current distribution of intervals. Finally, a strong correlation was observed between all durations of formal musical training and total reproduction errors in both modalities (accounting for 30% of the variance). Taken together these results demonstrate that formal musical training improves temporal reproduction, and that this improvement transfers from audition to vision. 
They further demonstrate the flexibility of sensorimotor mechanisms in adapting to different task conditions to minimise temporal estimation errors.	\N	\N
24192718	Dexamethasone administered prior to cochlear implantation has been shown to reduce the loss of residual hearing in experimental settings. However, its effect on the tissue response around the implant has not been extensively studied. In this study dexamethasone sodium phosphate was administered to guinea pigs via local delivery to the round window (2% dexamethasone for 120 min prior to surgery, 'local 2/120', or 20% dexamethasone for 30 min prior to surgery) or intravenously (2 mg/kg dexamethasone for 60 min) prior to implantation. Auditory brainstem responses (ABR) were monitored for 3 months, after which the cochleae were embedded in Spurr's resin and sectioned. The extent of the tissue response and the survival of the neurosensory structures were analysed. Both local 2/120 and systemically delivered dexamethasone improved ABR thresholds when compared with control animals. Systemic dexamethasone also reduced the tissue response around the electrode. This suggests that whilst both locally and systemically administered dexamethasone can protect residual hearing after cochlear implantation, their effects upon the tissue response to implantation may differ.	\N	\N
24198087	Cognitive skills, such as processing speed, memory functioning, and the ability to divide attention, are known to diminish with aging. The present study shows that, despite these changes, older adults can successfully compensate for degradations in speech perception. Critically, the older participants of this study were not pre-selected for high performance on cognitive tasks, but only screened for normal hearing. We measured the compensation for speech degradation using phonemic restoration, where intelligibility of degraded speech is enhanced using top-down repair mechanisms. Linguistic knowledge, Gestalt principles of perception, and expectations based on situational and linguistic context are used to effectively fill in the inaudible masked speech portions. A positive compensation effect was previously observed only with young normal-hearing people, but not with older hearing-impaired populations, leaving open the question of whether the lack of compensation was due to aging or due to age-related hearing problems. Older participants in the present study showed poorer intelligibility of degraded speech than the younger group, as expected from previous reports of aging effects. However, in conditions that induce top-down restoration, a robust compensation was observed. Speech perception by the older group was enhanced, and the enhancement effect was similar to that observed with the younger group. This effect was even stronger with slowed-down speech, which gives more time for cognitive processing. Based on previous research, the likely explanation for these observations is that older adults can overcome age-related cognitive deterioration by relying on linguistic skills and vocabulary that they have accumulated over their lifetime. Alternatively, or simultaneously, they may use different cerebral activation patterns or exert more mental effort. 
This positive finding on top-down restoration skills by the older individuals suggests that new cognitive training methods can teach older adults to effectively use compensatory mechanisms to cope with the complex listening environments of everyday life.	\N	\N
24198324	Temporal pole (TP) cortex is associated with higher-order sensory perception and/or recognition memory, as human patients with damage in this region show impaired performance during some tasks requiring recognition memory (Olson et al. 2007). The underlying mechanisms of TP processing are largely based on examination of the visual nervous system in humans and monkeys, while little is known about neuronal activity patterns in the auditory portion of this region, dorsal TP (dTP; Poremba et al. 2003). The present study examines single-unit activity of dTP in rhesus monkeys performing a delayed matching-to-sample task utilizing auditory stimuli, wherein two sounds are determined to be the same or different. Neurons of dTP encode several task-relevant events during the delayed matching-to-sample task, and encoding of auditory cues in this region is associated with accurate recognition performance. Population activity in dTP shows a match suppression mechanism to identical, repeated sound stimuli similar to that observed in the visual object identification pathway located ventral to dTP (Desimone 1996; Nakamura and Kubota 1996). However, in contrast to sustained visual delay-related activity in nearby analogous regions, auditory delay-related activity in dTP is transient and limited. Neurons in dTP respond selectively to different sound stimuli and often change their sound response preferences between experimental contexts. Current findings suggest a significant role for dTP in auditory recognition memory similar in many respects to the visual nervous system, while delay memory firing patterns are not prominent, which may relate to monkeys' shorter forgetting thresholds for auditory vs. visual objects.	\N	\N
24210181	A time interval between the onset and the offset of a continuous sound (filled interval) is often perceived to be longer than a time interval between two successive brief sounds (empty interval) of the same physical duration. The present study examined whether and how this phenomenon, sometimes called the filled duration illusion (FDI), occurs for short time intervals (40-520 ms). The investigation was conducted with the method of adjustment (Experiment 1) and the method of magnitude estimation (Experiment 2). When the method of adjustment was used, the FDI did not appear for the majority of the participants, but it appeared clearly for some participants. In the latter case, the amount of the FDI increased as the interval duration lengthened. The FDI was more likely to occur with magnitude estimation than with the method of adjustment. The participants who showed clear FDI with one method did not necessarily show such clear FDI with the other method.	\N	\N
24218156	Declarative memory evaluation is an essential step in the clinical and neuropsychological assessment of a variety of neurological disorders. It typically addresses the issue of normality/abnormality of an individual's performance. Another clinical application of the neuropsychological assessment of declarative memory is the longitudinal evaluation of an individual's performance change. In fact, in a variety of neurological conditions repeated assessments are needed to evaluate the modifications of a memory disorder as a function of time or in response to a pharmacological or rehabilitation treatment. This study was aimed at collecting data for measuring and interpreting performance change on a memory test for verbal material. For this purpose, we administered to 100 healthy subjects (age range 20-80 years; years of formal education range 8-17 years) three parallel forms of a test requiring the immediate and delayed recall of a 15-word list. The subjects performed the recall test three times (each time with a different list) at least 1 week apart. The order of the lists was randomized across subjects. Results revealed that performance on the three lists was highly correlated and did not vary as a function of the order of presentation. However, accuracy of recall was slightly better on a list compared to the others. Based on a method devised by Payne and Jones (J Clin Psychol 13:115-121, 1957), we provide normative data for establishing whether a discrepancy in recall accuracy on two versions of the test exceeds the discrepancy expected based on the performance of normal controls.	\N	\N
24218332	Sound localization is important for orienting and focusing attention and for segregating sounds from different sources in the environment. In humans, horizontal sound localization mainly relies on interaural differences in sound arrival time and sound level. Despite their perceptual importance, the neural processing of interaural time and level differences (ITDs and ILDs) remains poorly understood. Animal studies suggest that, in the brainstem, ITDs and ILDs are processed independently by different specialized circuits. The aim of the current study was to investigate whether, at higher processing levels, they remain independent or are integrated into a common code of sound laterality. For that, we measured late auditory cortical potentials in response to changes in sound lateralization elicited by perceptually matched changes in ITD and/or ILD. The responses to the ITD and ILD changes exhibited significant morphological differences. At the same time, however, they originated from overlapping areas of the cortex and showed clear evidence for functional coupling. These results suggest that the auditory cortex contains an integrated code of sound laterality, but also retains independent information about ITD and ILD cues. This cue-related information might be used to assess how consistent the cues are, and thus, how likely they would have arisen from the same source.	\N	\N
24224991	Today's compression hearing aids with noise reduction systems may not manage transient noises effectively because of the short duration of these sounds compared to the onset times of the compressors and/or noise reduction algorithms. The current study was designed to evaluate the effect of a transient noise reduction (TNR) algorithm on listening comfort, speech intelligibility in quiet, and preferred wearer gain in the presence of transients. A single-blinded, repeated-measures design was used. Thirteen experienced hearing aid users with bilaterally symmetrical (≤7.5 dB) sensorineural hearing loss participated in the study. Speech identification in quiet (no transient noise) was identical between the TNR On and the TNR Off conditions. The participants showed subjective preference for the TNR algorithm when "comfortable listening" was used as the criterion. Participants preferred less gain than the default prescription in the presence of transient noise sounds. However, the preferred gain was 2.9 dB higher when the TNR was activated than when it was deactivated. This translated to 12.1% improvement in phoneme identification over the TNR Off condition for soft speech. This study demonstrated that the use of the TNR algorithm would not negatively affect speech identification. The results also suggested that this algorithm may improve listening comfort in the presence of transient noise sounds and ensure consistent use of prescribed gain. Such an algorithm may ensure more consistent audibility across listening environments.	\N	\N
24225652	The aim of this study was to comprehensively evaluate the auditory phenotype in Niemann-Pick disease, type C1 (NPC1), to understand better the natural history of this complex, heterogeneous disorder, and to define further the baseline auditory deficits associated with NPC1 so that use of potentially ototoxic interventions (e.g., 2-hydroxypropyl-β-cyclodextrin) may be more appropriately monitored and understood. Fifty patients with NPC1 ranging in age from 4 months to 21 years (mean = 9.3 years) enrolled in a natural history/observational study at the National Institutes of Health. The auditory test battery included, when possible, immittance audiometry, pure-tone and speech audiometry, otoacoustic emission testing, and a neurotologic auditory brainstem response study. Longitudinal data were collected on a subset of patients. Over half of the cohort exhibited hearing loss involving the high frequencies ranging from a slight to moderate degree, and 74% of patients presented with clinically significant hearing loss involving the frequencies most important to speech understanding (0.5, 1, 2, 4 kHz). Despite the heterogeneity of the sample, results among patients were sufficiently consistent to implicate retrocochlear dysfunction in the majority (66%) of individuals, with (22%) or without (44%) accompanying cochlear involvement. Some patients (10%) presented with a profile for auditory neuropathy spectrum disorder. The combination of cross-sectional and longitudinal data indicates these patients are at risk for a progressive decline in auditory function. This is the largest cohort of patients with NPC1 evaluated comprehensively for auditory dysfunction, and results implicate the pathological processes of NPC1 in the manifestation of hearing loss. Patients with NPC1 should be monitored audiologically throughout their lives, beginning at the time of diagnosis. Clinicians and researchers should be aware of this historically overlooked aspect of the phenotype.	\N	\N
24227733	Previous imaging studies of congenital blindness have studied individuals with heterogeneous causes of blindness, which may influence the nature and extent of cross-modal plasticity. Here, we scanned a homogeneous group of blind people with bilateral congenital anophthalmia, a condition in which both eyes fail to develop, and, as a result, the visual pathway is not stimulated by either light or retinal waves. This model of congenital blindness presents an opportunity to investigate the effects of very early visual deafferentation on the functional organization of the brain. In anophthalmic animals, the occipital cortex receives direct subcortical auditory input. We hypothesized that this pattern of subcortical reorganization ought to result in a topographic mapping of auditory frequency information in the occipital cortex of anophthalmic people. Using functional MRI, we examined auditory-evoked activity to pure tones of high, medium, and low frequencies. Activity in the superior temporal cortex was significantly reduced in anophthalmic compared with sighted participants. In the occipital cortex, a region corresponding to the cytoarchitectural area V5/MT+ was activated in the anophthalmic participants but not in sighted controls. Whereas previous studies in the blind indicate that this cortical area is activated to auditory motion, our data show it is also active for trains of pure tone stimuli and in some anophthalmic participants shows a topographic mapping (tonotopy). Therefore, this region appears to be performing early sensory processing, possibly served by direct subcortical input from the pulvinar to V5/MT+.	\N	\N
24230923	The Rey Auditory-Verbal Learning Test is an easy-to-administer test that assesses many memory domains and is, therefore, widely used in the area of clinical neuropsychology. The purpose of this study was to provide normative data for an elderly population living in Spain. The sample of this study comprised 156 volunteers over 60 years of age, who were grouped into six different age groups. These groups comprised 10 participants between the ages of 61 and 65 in the first group, 23 participants (66-70) in the second, 28 participants (71-75) in the third, 35 participants (76-80) in the fourth, 32 participants (81-85) in the fifth and 28 participants (86-95) in the sixth group. Demographic data were collected and means, deviations, and ranges of all the measures were evaluated. Normative data were calculated from the percentiles, and then converted into age-corrected scaled scores with a mean of 10 and a standard deviation of 3.	\N	\N
24231418	In the basic sciences, many researchers now use gap pre-pulse inhibition of the acoustic startle reflex (GPIAS) to determine if an animal has tinnitus after exposure to an ototoxic drug or intense noise. Tinnitus is assumed to be present if the silent gap in an ongoing narrow band noise (NBN) fails to suppress the startle reflex response evoked by an intense noise burst. The lack of gap pre-pulse inhibition presumably occurs because tinnitus fills in the silent intervals in the background noise. To test the perceptual aspects of this hypothesis, we asked hearing impaired subjects with tinnitus if they could perceive 50 ms silent intervals presented in an NBN, which was located above, below or at the subject's tinnitus pitch. The same tests were performed on normal hearing subjects without tinnitus. All subjects, with and without tinnitus, could detect the 50 ms gaps. Thus, using stimulus parameters similar to those employed in animal and human GPIAS studies, we found that the tinnitus percept does not fill in the silent interval in a perceptual gap detection task; however, these findings do not rule out the possibility that tinnitus interferes with pre-attentive filtering of sensory stimuli in the GPIAS sensorimotor gating paradigm.	\N	\N
24232066	Compare preoperative and postoperative performance in patients undergoing cochlear implantation (CI) for unilateral severe-to-profound sensorineural hearing loss (single-sided deafness, SSD). IRB-approved, prospective. Tertiary center. Twenty-nine patients have undergone CI for SSD. SSD was due to Ménière's disease (MD) in 10 subjects; these also suffered from recalcitrant vertigo spells, and in these 10 patients along with 2 others the CI was placed simultaneously with a labyrinthectomy. CI with or without labyrinthectomy. CNC word and AzBio sentences in quiet were administered to the implanted ear. A multiple-loudspeaker sound localization test was administered in the bilateral listening condition. All data were collected preoperatively and 3, 6, and 12 months postoperatively, with postoperative data available for 19 subjects. Additionally, a tinnitus handicap questionnaire was administered preoperatively and 12 months postoperatively. CNC word and AzBio sentence scores showed improvement in the implanted ear. Sound localization appeared to improve in an experience-dependent fashion in some patients. Most patients reported diminished tinnitus after cochlear implantation. All patients undergoing labyrinthectomy experienced resolution of vertigo attacks. CI restores auditory function to the deafened ear. Additionally, the binaural input appears to improve sound localization for most patients. In patients with severe hearing loss and recalcitrant vertigo attacks because of MD, simultaneous labyrinthectomy and CI effectively relieve vertigo attacks and improve auditory function.	\N	\N
24234167	Are listeners able to adapt to a foreign-accented speaker who has, as is often the case, an inconsistent accent? Two groups of native Dutch listeners participated in a cross-modal priming experiment, either in a consistent-accent condition (German-accented items only) or in an inconsistent-accent condition (German-accented and nativelike pronunciations intermixed). The experimental words were identical for both groups (words with vowel substitutions characteristic of German-accented speech); additional contextual words differed in accentedness (German-accented or nativelike words). All items were spoken by the same speaker: a German native who could produce the accented forms but could also pass for a Dutch native speaker. Listeners in the consistent-accent group were able to adapt quickly to the speaker (i.e., showed facilitatory priming for words with vocalic substitutions). Listeners in the inconsistent-accent condition showed adaptation to words with vocalic substitutions only in the second half of the experiment. These results indicate that adaptation to foreign-accented speech is rapid. Accent inconsistency slows listeners down initially, but a short period of additional exposure is enough for them to adapt to the speaker. Listeners can therefore tolerate inconsistency in foreign-accented speech.	\N	\N
24238764	Two experiments examined when monolingual, English-learning 19-month-old infants learn a second object label. Two experimenters sat together. One labeled a novel object with one novel label, whereas the other labeled the same object with a different label in either the same or a different language. Infants were tested on their comprehension of each label immediately following its presentation. Infants mapped the first label at above chance levels, but they did so with the second label only when requested by the speaker who provided it (Experiment 1) or when the second experimenter labeled the object in a different language (Experiment 2). These results show that 19-month-olds learn second object labels but do not readily generalize them across speakers of the same language. The results highlight how speaker and language spoken guide infants' acceptance of second labels, supporting sociopragmatic views of word learning.	\N	\N
24256043	The procedure maximally retains the physiological structure of the middle ear and external auditory canal, thus effectively improving the patient's hearing ability. We explored the clinical outcomes of treating chronic suppurative otitis media using improved intact canal wall radical mastoidectomy with sandwich graft tympanoplasty. We chose to perform intact canal wall radical mastoidectomy with sandwich graft tympanoplasty in patients with chronic suppurative otitis media. A total of 170 patients were included in the study. Statistical analysis was carried out using software SPSS18.0, adjusted with the chi-squared test. In all, 140 cases were shown to have been treated effectively (82.35%, 140/170). The increased auditory threshold of preoperative bone conduction was not related to the duration of disease and/or the presence of cholesteatoma (p > 0.05), but was associated with ossicular chain disruption or fixation (p < 0.05), specifically the ossicular chain destruction/absorption, granulation tissue wrapping, and consequent fixation. During the procedure, the sleeve-like pedicle flap of external auditory canal and tympanic membrane is covered with graft, allowing good fixation with maintenance of the tympanic membrane's natural shape. The auditory threshold test revealed equal or above normal levels (30 dB) for 126 cases (74.12%, 126/170). The primary healing rate of tympanic membrane achieved was 96.47% (164/170).	\N	\N
24258458	While bilateral cochlear implants (CIs) provide some binaural benefits, these benefits are limited compared to those observed in normal-hearing (NH) listeners. The large frequency-to-electrode allocation bandwidths (BWs) in CIs compared to auditory filter BWs in NH listeners increases the interaural fluctuation rate available for binaural unmasking, which may limit binaural benefits. The purpose of this work was to investigate the effect of interaural fluctuation rate on correlation change discrimination and binaural masking-level differences in NH listeners presented a CI simulation using a pulsed-sine vocoder. In experiment 1, correlation-change just-noticeable differences (JNDs) and tone-in-noise thresholds were measured for narrowband noises with different BWs and center frequencies (CFs). The results suggest that the BW, CF, and/or interaural fluctuation rate are important factors for correlation change discrimination. In experiment 2, the interaural fluctuation rate was systematically varied and dissociated from changes in BW and CF by using a pulsed-sine vocoder. Results indicated that the interaural fluctuation rate did not affect correlation change JNDs for correlated reference noises; however, slow interaural fluctuations increased correlation change JNDs for uncorrelated reference noises. In experiment 3, the BW, CF, and vocoder pulse rate were varied while interaural fluctuation rate was held constant. JNDs increased for increasing BW and decreased for increasing CF. In summary, relatively fast interaural fluctuation rates are not detrimental for detecting changes in interaural correlation. Thus, limiting factors to binaural benefits in CI listeners could be a result of other temporal and/or spectral deficiencies from electrical stimulation.	\N	\N
24260183	In the real world, human speech recognition nearly always involves listening in background noise. The impact of such noise on speech signals and on intelligibility performance increases with the separation of the listener from the speaker. The present behavioral experiment provides an overview of the effects of such acoustic disturbances on speech perception in conditions approaching ecologically valid contexts. We analysed the intelligibility loss in spoken word lists with increasing listener-to-speaker distance in a typical low-level natural background noise. The noise was combined with the simple spherical amplitude attenuation due to distance, basically changing the signal-to-noise ratio (SNR). Therefore, our study draws attention to some of the most basic environmental constraints that have pervaded spoken communication throughout human history. We evaluated the ability of native French participants to recognize French monosyllabic words (spoken at 65.3 dB(A), reference at 1 meter) at distances from 11 to 33 meters, which corresponded to the SNRs most revealing of the progressive effect of the selected natural noise (-8.8 dB to -18.4 dB). Our results showed that in such conditions, identity of vowels is mostly preserved, with the striking peculiarity of the absence of confusion in vowels. The results also confirmed the functional role of consonants during lexical identification. The extensive analysis of recognition scores, confusion patterns and associated acoustic cues revealed that sonorant, sibilant and burst properties were the most important parameters influencing phoneme recognition. Altogether these analyses allowed us to extract a resistance scale from consonant recognition scores. We also identified specific perceptual consonant confusion groups depending on the place in the words (onset vs. coda). Finally, our data suggested that listeners may access some acoustic cues of the CV transition, opening interesting perspectives for future studies.	\N	\N
24271979	Integrating visual and auditory language information is critical for reading. Suppression and congruency effects in audiovisual paradigms with letters and speech sounds have provided information about low-level mechanisms of grapheme-phoneme integration during reading. However, the central question about how such processes relate to reading entire words remains unexplored. Using ERPs, we investigated whether audiovisual integration occurs for words already in beginning readers, and if so, whether this integration is reflected by differences in map strength or topography (aim 1); and moreover, whether such integration is associated with reading fluency (aim 2). A 128-channel EEG was recorded while 69 monolingual (Swiss)-German speaking first-graders performed a detection task with rare targets. Stimuli were presented in blocks either auditorily (A), visually (V) or audiovisually (matching: AVM; nonmatching: AVN). Corresponding ERPs were computed, and unimodal ERPs summated (A + V = sumAV). We applied TANOVAs to identify time windows with significant integration effects: suppression (sumAV-AVM) and congruency (AVN-AVM). They were further characterized using GFP and 3D-centroid analyses, and significant effects were correlated with reading fluency. The results suggest that audiovisual suppression effects occur for familiar German and unfamiliar English words, whereas audiovisual congruency effects can be found only for familiar German words, probably due to lexical-semantic processes involved. Moreover, congruency effects were characterized by topographic differences, indicating that different sources are active during processing of congruent compared to incongruent audiovisual words. Furthermore, no clear associations between audiovisual integration and reading fluency were found. The degree to which such associations develop in beginning readers remains open to further investigation.	\N	\N
24278326	The purpose of this study was to design and to verify a new hearing-aid fitting strategy (Aescu HRL-1) based on the acoustic features of Mandarin. The subjective and objective outcomes were compared to those fitted with NAL-NL1 (National Acoustic Laboratory Non-Linear, version 1) in Mandarin-speaking hearing-aid users. Fifteen subjects with sensorineural hearing loss participated in this preliminary study. Each subject wore a pair of four-channel hearing aids fitted with the Aescu HRL-1 and NAL-NL1 prescriptions alternately for 1 month. Objective and subjective tests including the Mandarin Monosyllable Recognition Test (MMRT), Mandarin Hearing in Noise Test (MHINT), International Outcome Inventory for Hearing Aids (IOI-HA), and a sound-quality questionnaire were used to evaluate the performance of the two prescriptions. The mean MMRT scores were 79.9% and 81.1% for NAL-NL1 and Aescu HRL-1 respectively; the difference was not statistically significant. The corresponding MHINT signal-to-noise ratios were 0.87 and 0.85 dB; again, no significant difference was found between the two strategies. However, in subjective questionnaires, overall, the sound-quality and IOI-HA scores were higher for Aescu HRL-1. The speech recognition performance based on Aescu HRL-1 is as good as that of NAL-NL1 for Mandarin-speaking hearing-aid users. Moreover, the subjects generally responded that Aescu HRL-1 provides a more natural, richer, and better sound quality than does NAL-NL1.	\N	\N
24296543	The acoustic basis of intelligibility associated with varied clear speech instructions was studied. Twelve healthy speakers read 18 sentences in 'habitual', 'clear', 'hearing impaired' and 'overenunciate' conditions. The latter 3 conditions are varieties of clear speech. Acoustic measures included tense and lax vowel space area, a measure of vowel spectral change, articulation rate and sentence-level vocal intensity. Sentences were mixed with multitalker babble to prevent ceiling effects and were orthographically transcribed by 40 listeners. Percent-correct scores were obtained for each speaker and condition. Regression analyses were used to quantify relationships between acoustic measures and intelligibility. Univariate regressions indicated that greater magnitudes of acoustic change in nonhabitual conditions were associated with greater increases in intelligibility. Multivariate regression analysis further indicated that lax vowel space, articulation rate and vocal intensity were significant predictors of intelligibility. Acoustic variables associated with intelligibility differed depending on whether relationships were examined using univariate or multivariate statistics. Multivariate statistics indicated that articulation rate was the strongest predictor of improvements in intelligibility above and beyond all other variables studied. The findings have implications for optimizing therapeutic use of clear speech for clinical populations.	\N	\N
24302571	The activity of sensory neural populations carries information about the environment. This may be extracted from neural activity using different strategies. In the auditory brainstem, a recent theory proposes that sound location in the horizontal plane is decoded from the relative summed activity of two populations in each hemisphere, whereas earlier theories hypothesized that the location was decoded from the identity of the most active cells. We tested the performance of various decoders of neural responses in increasingly complex acoustical situations, including spectrum variations, noise, and sound diffraction. We demonstrate that there is insufficient information in the pooled activity of each hemisphere to estimate sound direction in a reliable way consistent with behavior, whereas robust estimates can be obtained from neural activity by taking into account the heterogeneous tuning of cells. These estimates can still be obtained when only contralateral neural responses are used, consistently with unilateral lesion studies. DOI: http://dx.doi.org/10.7554/eLife.01312.001.	\N	\N
24311693	Dyslexia is a severe and persistent reading and spelling disorder caused by impairment in the ability to manipulate speech sounds. We combined functional magnetic resonance brain imaging with multivoxel pattern analysis and functional and structural connectivity analysis in an effort to disentangle whether dyslexics' phonological deficits are caused by poor quality of the phonetic representations or by difficulties in accessing intact phonetic representations. We found that phonetic representations are hosted bilaterally in primary and secondary auditory cortices and that their neural quality (in terms of robustness and distinctness) is intact in adults with dyslexia. However, the functional and structural connectivity between the bilateral auditory cortices and the left inferior frontal gyrus (a region involved in higher-level phonological processing) is significantly hampered in dyslexics, suggesting deficient access to otherwise intact phonetic representations.	\N	\N
24312408	Nucleus cochlear implant systems incorporate a fast-acting front-end automatic gain control (AGC), sometimes called a compression limiter. The objective of the present study was to determine the effect of replacing the front-end compression limiter with a newly proposed envelope profile limiter. A secondary objective was to investigate the effect of AGC speed on cochlear implant speech intelligibility. The envelope profile limiter was located after the filter bank and reduced the gain when the largest of the filter bank envelopes exceeded the compression threshold. The compression threshold was set equal to the saturation level of the loudness growth function (i.e. the envelope level that mapped to the maximum comfortable current level), ensuring that no envelope clipping occurred. To preserve the spectral profile, the same gain was applied to all channels. Experiment 1 compared sentence recognition with the front-end limiter and with the envelope profile limiter, each with two release times (75 and 625 ms). Six implant recipients were tested in quiet and in four-talker babble noise, at a high presentation level of 89 dB SPL. Overall, release time had a larger effect than the AGC type. With both AGC types, speech intelligibility was lower for the 75 ms release time than for the 625 ms release time. With the shorter release time, the envelope profile limiter provided higher group mean scores than the front-end limiter in quiet, but there was no significant difference in noise. Experiment 2 measured sentence recognition in noise as a function of presentation level, from 55 to 89 dB SPL. The envelope profile limiter with 625 ms release time yielded better scores than the front-end limiter with 75 ms release time. A take-home study showed no clear pattern of preferences. It is concluded that the envelope profile limiter is a feasible alternative to a front-end compression limiter.	\N	\N
24317426	Previous research demonstrates that meaningfully related sounds enhance visual sensitivity to point-light displays of human movement. Here we report two psychophysical studies that investigated whether, and if so when, this facilitation is modulated by the temporal relationship between auditory and visual stimuli. In Experiment 1, participants detected point-light walkers in masks while listening to footsteps that were either synchronous or out-of-phase with point-light footfalls. The relative timing of auditory and visual walking did not impact performance. Experiment 2 further tested the importance of multisensory timing by disrupting the rhythm of the auditory and visual streams. Participants detected point-light walkers while listening to footstep or tone sounds that were either synchronous or temporally random with regards to point-light footfalls. Heard footsteps improved visual sensitivity over heard tones regardless of timing. Taken together, these results suggest that during the detection of others' actions, the perceptual system makes use of meaningfully related sounds whether or not they are synchronous. These results are discussed in relation to the unity assumption theory as well as recent empirical data that suggest that temporal correspondence is not always a critical factor in multisensory perception and integration.	\N	\N
24329490	To link outcome measures used in audiological research to the ICF classification and thereby describe audiological research from the ICF perspective. Through a peer-reviewed or a joint linking procedure, link outcome measures to the ICF classification system using standardized ICF linking rules. Additional linking rules were developed in combination with the established rules to overcome difficulties when connecting audiological data to ICF. Absolute and relative frequencies of ICF categories were reported. The identified outcome measures from the previous study (Part I) constituted the empirical material. In total, 285 ICF categories were identified. The most prevalent categories were related to listening, hearing functions, auditory perceptions, emotions and the physical environment, such as noise and hearing aids. Categories related to communication showed lower relative frequencies, as did categories related to the social and attitudinal environment. Based on the linked outcome measures, communication as a research topic is subordinated to other research topics. The same conclusion can be drawn for research targeting the social and attitudinal environment of adults with HL. Difficulties in the linking procedure were highlighted and discussed, and suggestions for future revisions of the ICF from the audiological perspective were described.	\N	\N
24333301	Mutations in the connexin 26 gene (GJB2) are the most common genetic cause of deafness, leading to congenital bilateral non-syndromic sensorineural hearing loss. Here we report the generation of a mouse model for a connexin 26 (Cx26) mutation, in which cre-Sox10 drives excision of the Cx26 gene from non-sensory cells flanking the auditory epithelium. We determined that these conditional knockout mice, designated Gjb2-CKO, have a severe hearing loss. Immunocytochemistry of the auditory epithelium confirmed absence of Cx26 in the non-sensory cells. Histology of the organ of Corti and the spiral ganglion neurons (SGNs) performed at ages 1, 3, or 6 months revealed that in Gjb2-CKO mice, the organ of Corti began to degenerate in the basal cochlear turn at an early stage, and the degeneration rapidly spread to the apex. In addition, the density of SGNs in Rosenthal's canal decreased rapidly along a gradient from the base of the cochlea to the apex, where some SGNs survived until at least 6 months of age. Surviving neurons often clustered together and formed clumps of cells in the canal. We then assessed the influence of brain derived neurotrophic factor (BDNF) gene therapy on the SGNs of Gjb2-CKO mice by inoculating Adenovirus with the BDNF gene insert (Ad.BDNF) into the base of the cochlea via the scala tympani or scala media. We determined that over-expression of BDNF beginning around 1 month of age resulted in a significant rescue of neurons in Rosenthal's canal of the cochlear basal turn but not in the middle or apical portions. This data may be used to design therapies for enhancing the SGN physiological status in all GJB2 patients and especially in a sub-group of GJB2 patients where the hearing loss progresses due to ongoing degeneration of the auditory nerve, thereby improving the outcome of cochlear implant therapy in these ears.	\N	\N
24342151	The purpose of the study was to identify structural brain differences in school-age children with residual speech sound errors. Voxel based morphometry was used to compare gray and white matter volumes for 23 children with speech sound errors, ages 8;6-11;11, and 54 typically speaking children matched on age, oral language, and IQ. We hypothesized that regions associated with production and perception of speech sounds would differ between groups. Results indicated greater gray matter volumes for the speech sound error group relative to typically speaking controls in bilateral superior temporal gyrus. There was greater white matter volume in the corpus callosum for the speech sound error group, but less white matter volume in right lateral occipital gyrus. Results may indicate delays in neuronal pruning in critical speech regions or differences in the development of networks for speech perception and production.	\N	\N
24344364	Previous studies in both humans and animals have documented improved performance following discrimination training. This enhanced performance is often associated with cortical response changes. In this study, we tested the hypothesis that long-term speech training on multiple tasks can improve primary auditory cortex (A1) responses compared to rats trained on a single speech discrimination task or experimentally naïve rats. Specifically, we compared the percent of A1 responding to trained sounds, the responses to both trained and untrained sounds, receptive field properties of A1 neurons, and the neural discrimination of pairs of speech sounds in speech trained and naïve rats. Speech training led to accurate discrimination of consonant and vowel sounds, but did not enhance A1 response strength or the neural discrimination of these sounds. Speech training altered tone responses in rats trained on six speech discrimination tasks but not in rats trained on a single speech discrimination task. Extensive speech training resulted in broader frequency tuning, shorter onset latencies, a decreased driven response to tones, and caused a shift in the frequency map to favor tones in the range where speech sounds are the loudest. Both the number of trained tasks and the number of days of training strongly predict the percent of A1 responding to a low frequency tone. Rats trained on a single speech discrimination task performed less accurately than rats trained on multiple tasks and did not exhibit A1 response changes. Our results indicate that extensive speech training can reorganize the A1 frequency map, which may have downstream consequences on speech sound processing.	\N	\N
24344815	Dyslexia is commonly attributed to a phonological deficit, but whether it effectively compromises the phonological grammar or lower level systems is rarely explored. To address this question, we gauge the sensitivity of dyslexics to grammatical phonological restrictions on spoken onset clusters (e.g., bl in block). Across languages, certain onsets are preferred to others (e.g., blif ≻ bnif ≻ bdif, where ≻ indicates a preference). Here, we show that dyslexic participants (adult native speakers of Hebrew) are fully sensitive to these phonological restrictions, and they extend them irrespective of whether the onsets are attested in their language (e.g., bnif vs. bdif) or unattested (e.g., mlif vs. mdif). Dyslexics, however, showed reduced sensitivity to phonetic contrasts (e.g., blif vs. belif; ba vs. pa). Together, these results suggest that the known difficulties of dyslexics in speech processing could emanate not from the phonological grammar, but rather from lower level impairments to acoustic/phonetic encoding, lexical storage, and retrieval.	\N	\N
24349414	The diagnosis of tinnitus relies on self-report. Psychoacoustic measurements of tinnitus pitch and loudness are essential for assessing claims and discriminating true from false ones. For this reason, the quantification of tinnitus remains a challenging research goal. We aimed to: (1) assess the precision of a new tinnitus likeness rating procedure with a continuous-pitch presentation method, controlling for music training, and (2) test whether tinnitus psychoacoustic measurements have the sensitivity and specificity required to detect people faking tinnitus. Musicians and non-musicians with tinnitus, as well as simulated malingerers without tinnitus, were tested. Most were retested several weeks later. Tinnitus pitch matching was first assessed using the likeness rating method: pure tones from 0.25 to 16 kHz were presented randomly to participants, who had to rate the likeness of each tone to their tinnitus, and to adjust its level from 0 to 100 dB SPL. Tinnitus pitch matching was then assessed with a continuous-pitch method: participants had to match the pitch of their tinnitus to an external tone by moving their finger across a touch-sensitive strip, which generated a continuous pure tone from 0.5 to 20 kHz in 1-Hz steps. The predominant tinnitus pitch was consistent across both methods for both musicians and non-musicians, although musicians displayed better external tone pitch matching abilities. Simulated malingerers rated loudness much higher than did the other groups with a high degree of specificity (94.4%) and were unreliable in loudness (not pitch) matching from one session to the other. Retest data showed similar pitch matching responses for both methods for all participants. In conclusion, tinnitus pitch and loudness reliably correspond to the tinnitus percept, and psychoacoustic loudness matches are sensitive and specific to the presence of tinnitus.	\N	\N
24350693	To investigate the predicted threshold shift associated with the use of nonlinear hearing aids fitted to the NAL-NL2 or the DSL m[i/o] prescription for children with the same audiograms. For medium and high input levels, we asked: (1) How do predicted asymptotic threshold shifts (ATS) differ according to the choice of prescription? (2) How does predicted ATS vary with hearing level for gains prescribed by the two prescriptions? A mathematical model consisting of the modified power law combined with equations for predicting temporary threshold shift (Macrae, 1994b) was used to predict ATS. Predicted threshold shifts were determined for 57 audiograms at medium and high input levels. For the 57 audiograms, DSL m[i/o] gains for high input levels were associated with increased risk relative to NAL-NL2. The variation of ATS with hearing level suggests that NAL-NL2 gains became unsafe when hearing loss exceeded 90 dB HL. The gains prescribed by DSL m[i/o] became unsafe when hearing loss exceeded 80 dB HL at a medium input level, and 70 dB HL at a high input level. There is a risk of damage to hearing for children using nonlinear amplification. Vigilant checking for threshold shift is recommended.	\N	\N
24357104	Middle ear disease is the primary cause of hearing loss in children and has a significant impact on language development and academic performance. Multiple prognostic factors have previously been examined, but there is little published data regarding frequency-specific hearing outcomes. To examine the relationship between type I tympanoplasty in a pediatric population and frequency-specific hearing changes, as well as the relationship between several prognostic factors and graft retention. Retrospective medical chart review (February 2006 to October 2011) of 492 consecutive pediatric otolaryngology patients undergoing type I tympanoplasty for tympanic membrane (TM) perforation of any etiology at a tertiary-care pediatric otolaryngology practice. Type I tympanoplasty. Preoperative and postoperative audiometric data were collected for patients undergoing successful TM repair. It was hypothesized before data collection that conductive hearing would improve at all frequencies with no significant change in sensorineural hearing. Data collected included air conduction at 250 to 8000 Hz, speech reception thresholds, bone conduction at 500 to 4000 Hz, and air-bone gap at 500 to 4000 Hz. Demographic data obtained included sex, age, size, mechanism, location of perforation, and operative repair technique. Of 492 patients, 320 were excluded; results were thus examined for 172 patients. Surgery was successful for 73.8% of patients. Perforation size was significantly associated with repair success (mean [SD] surgical success rate of 38.6% [15.3%] vs surgical failure rate of 31.4% [15.0%]; P < .01); however, mean (SD) age (9.02 [3.89] years [surgical success] vs 8.52 [3.43] years [surgical failure]; P > .05) and repair technique (medial [73.08%] vs lateral [76.47%] graft success; P > .99) were not. 
Air conduction significantly improved from 250 to 2000 Hz (P < .001), did not significantly improve at 4000 Hz (P = .08), and there was a nonsignificant decline at 8000 Hz (P = .12). Speech reception threshold significantly improved (20 vs 15 dB; P < .001). This large review found an association of TM perforation size with surgical success and an improvement in speech reception threshold, air conduction at 250 to 2000 Hz, air-bone gap at 500 to 2000 Hz, and worsening bone conduction at 4000 Hz. Patients with high-frequency hearing loss due to TM perforation should not anticipate significant recovery from type I tympanoplasty. Hearing loss at higher frequencies may require postoperative hearing rehabilitation.	\N	\N
24364392	Previous research has shown that damage to the left temporal pole (LTP) is associated with impaired retrieval of words for unique entities, including names of famous people and landmarks. However, it is not known whether retrieving names for famous melodies is associated with the LTP. The aim of this study was to investigate the hypothesis that damage to the LTP would be associated with impaired naming of famous musical melodies. A Melody Naming Test was administered to patients with LTP damage, brain-damaged comparison (BDC) patients, and normal comparison participants (NC). The test included various well-known melodies (e.g., "Pop Goes the Weasel"). After hearing each melody, participants were asked to rate their familiarity with the melody and identify it by name. LTP patients named significantly fewer melodies than BDC and NC participants. Recognition of melodies did not differ significantly between groups. The findings suggest that LTP supports retrieval of names for famous melodies. More broadly, these results extend support for the theoretical notion that LTP is important for retrieving proper names for unique concepts, irrespective of stimulus modality or category.	\N	\N
24366693	The visual cues involved in auditory speech processing are not restricted to information from lip movements but also include head or chin gestures and facial expressions such as eyebrow movements. The fact that visual gestures precede the auditory signal implies that visual information may influence the auditory activity. As visual stimuli are very close in time to the auditory information for audiovisual syllables, the cortical response to them usually overlaps with that for the auditory stimulation; the neural dynamics underlying the visual facilitation for continuous speech therefore remain unclear. In this study, we used a three-word phrase to study continuous speech processing. We presented video clips with even (without emphasis) phrases as the frequent stimuli and with one word visually emphasized by the speaker as the non-frequent stimuli. Negativity in the resulting ERPs was detected after the start of the emphasizing articulatory movements but before the auditory stimulus, a finding that was confirmed by the statistical comparisons of the audiovisual and visual stimulation. No such negativity was present in the control visual-only condition. The propagation of this negativity was observed between the visual and fronto-temporal electrodes. Thus, in continuous speech, the visual modality evokes predictive coding for the auditory speech, which is analysed by the cerebral cortex in the context of the phrase even before the arrival of the corresponding auditory signal.	\N	\N
24372066	In this study, 35 young, healthy adults were tested on whether speech-like stimuli evoke a unique response in the auditory efferent system. To this end, descending cortical influences on medial olivocochlear (MOC) activity were indirectly evaluated by studying the effects of contralateral suppression on distortion product otoacoustic emissions (DPOAEs) under four conditions: (a) in the absence of any contralateral noise (Baseline), (b) presence of contralateral broadband noise (Noise Baseline), (c) vowel discrimination-in-noise task (VDN) and (d) tone discrimination-in-noise (TDN) task. A statistically significant release from suppression was evident across all tested DPOAE frequencies (1, 1.5 and 2 kHz) only for the VDN task (p < 0.05), which yielded greater release from suppression than the TDN task. These findings indicate that during active listening in the presence of noise, the MOC activity may be differentially modulated depending on the type of stimulus (vowel vs. tone). Specifically, in the presence of background noise, vowels may show a greater release from suppression in the cochlea than frequency, intensity and duration matched tones.	\N	\N
24376601	During sentence production, linguistic information (semantics, syntax, phonology) of words is retrieved and assembled into a meaningful utterance. There is still debate on how we assemble single words into more complex syntactic structures such as noun phrases or sentences. In the present study, event-related potentials (ERPs) were used to investigate the time course of syntactic planning. Thirty-three volunteers described visually animated scenes using naming formats varying in syntactic complexity: from simple words ('W', e.g., "triangle", "red", "square", "green", "to fly towards"), to noun phrases ('NP', e.g., "the red triangle", "the green square", "to fly towards"), to a sentence ('S', e.g., "The red triangle flies towards the green square."). Behaviourally, we observed an increase in errors and corrections with increasing syntactic complexity, indicating a successful experimental manipulation. In the ERPs following scene onset, syntactic complexity variations were found in a P300-like component ('S'/'NP'>'W') and a fronto-central negativity (linear increase with syntactic complexity). In addition, the scene could display two actions - unpredictable for the participant, as the disambiguation occurred only later in the animation. Time-locked to the moment of visual disambiguation of the action and thus the verb, we observed another P300 component ('S'>'NP'/'W'). The data show for the first time evidence of sensitivity to syntactic planning within the P300 time window, time-locked to visual events critical for syntactic planning. We discuss the findings in the light of current syntactic planning views.	\N	\N
24376662	Previous research suggests that deficits in attention-emotion interaction are implicated in schizophrenia symptoms. Although disruption in auditory processing is crucial in the pathophysiology of schizophrenia, deficits in interaction between emotional processing of auditorily presented language stimuli and auditory attention have not yet been clarified. To address this issue, the current study used a dichotic listening task to examine 22 patients with schizophrenia and 24 age-, sex-, parental socioeconomic background-, handedness-, dexterous ear-, and intelligence quotient-matched healthy controls. The participants completed a word recognition task on the attended side in which a word with emotionally valenced content (negative/positive/neutral) was presented to one ear and a different neutral word was presented to the other ear. Participants selectively attended to either ear. In the control subjects, presentation of negative but not positive word stimuli provoked a significantly prolonged reaction time compared with presentation of neutral word stimuli. This interference effect for negative words existed whether or not subjects directed attention to the negative words. This interference effect was significantly smaller in the patients with schizophrenia than in the healthy controls. Furthermore, the smaller interference effect was significantly correlated with severe positive symptoms and delusional behavior in the patients with schizophrenia. The present findings suggest that aberrant interaction between semantic processing of negative emotional content and auditory attention plays a role in production of positive symptoms in schizophrenia.	\N	\N
24384081	Large variations in perceptual directional microphone benefit, which far exceed the variation expected from physical performance measures of directional microphones, have been reported in the literature. The cause for the individual variation has not been systematically investigated. To determine the factors that are responsible for the individual variation in reported perceptual directional benefit. A correlational study. Physical performance measures of the directional microphones obtained after they had been fitted to individuals, cognitive abilities of individuals, and measurement errors were related to perceptual directional benefit scores. Fifty-nine hearing-impaired adults with varied degrees of hearing loss participated in the study. All participants were bilaterally fitted with a Motion behind-the-ear device (500 M, 501 SX, or 501 P) from Siemens according to the National Acoustic Laboratories' non-linear prescription, version two (NAL-NL2). Using the Bamford-Kowal-Bench (BKB) sentences, the perceptual directional benefit was obtained as the difference in speech reception threshold measured in babble noise (SRTn) with the devices in directional (fixed hypercardioid) and in omnidirectional mode. The SRTn measurements were repeated three times with each microphone mode. Physical performance measures of the directional microphone included the angle of the microphone ports to loudspeaker axis, the frequency range dominated by amplified sound, the in situ signal-to-noise ratio (SNR), and the in situ three-dimensional, articulation-index weighted directivity index (3D AI-DI). The cognitive tests included auditory selective attention, speed of processing, and working memory. Intraparticipant variation on the repeated SRTn's and the interparticipant variation on the average SRTn were used to determine the effect of measurement error. A multiple regression analysis was used to determine the effect of other factors. 
Measurement errors explained 52% of the variation in perceptual directional microphone benefit (95% confidence interval [CI]: 34-78%), while another 37% of variation was explained primarily by the physical performance of the directional microphones after they were fitted to individuals. The most contributing factor was the in situ 3D AI-DI measured across the low frequencies. Repeated SRTn measurements are needed to obtain a reliable indication of the perceptual directional benefit in an individual. Further, to obtain optimum benefit from directional microphones, the effectiveness of the microphones should be maximized across the low frequencies.	\N	\N
24386719	Listener retention of silent, gap-length duration was studied. Just noticeable differences (JNDs) for gap length within standard and comparison stimuli were obtained for intervals with and without intervening noise bursts, including a condition with gapped intervening bursts. Outcomes indicate that gap duration itself can be determinant. Also, JNDs were similar whether intervening stimuli were present or absent, differing from results reported for pitch, loudness, and timbre retention. The latter suggests additional/alternative cortical resources might be employed for retention of auditory-temporal information.	\N	\N
24408329	To investigate the auditory behavior of patients with chronic renal failure (CRF) undergoing kidney transplantation. Thirty patients were evaluated, 10 (33.33%) females and 20 (66.67%) males, aged 13 to 26 years (average, 16.97 years; standard deviation, 3.60 years). Patients underwent the following procedures: anamnesis, otolaryngological examination, audiological evaluation (pure tone and high frequency), acoustic impedance measurements and central auditory processing evaluation. A control group was used to compare the high-frequency audiometry results. The following observations were made: absence of auditory complaints at the time of anamnesis; pure-tone audiometry was predominantly normal; patients presented lower hearing levels at the high-frequency audiometry, when compared to the control group, and as for the acoustic impedance measurements, type A curves were predominant; there was a change of the central auditory processing for 14 patients (46.67%) in the Staggered Spondaic Word Test (SSW); there was a significant association between age and the result of the pure-tone audiometry, that is, hearing sensitivity in thresholds from 250 Hz to 8,000 Hz decreased with advancing age; and the relation between the type of donor and the SSW test result was significant. Rates were higher when the patients had been transplanted from deceased donors compared to living donors. There were no changes in conventional audiological and high-frequency evaluation, or in the central auditory processing. Professionals involved in the care of kidney transplantation recipients must be better informed about the care, prevention, and early identification of auditory disorders.	\N	\N
24408330	The objective of this research was to assess the auditory abilities of Portuguese children and compare such abilities to the score of the Scale of Auditory Behaviors (SAB). Fifty-one children were evaluated with audiometry, speech audiometry, acoustic immittance measures, and eight behavioral tests involving dichotic listening, monotic listening, temporal processing, and sound localization. Their parents filled in the SAB questionnaire adapted to European A. SAB scores and auditory test scores were submitted to Pearson's correlation coefficient. There was a significant correlation between the score on the SAB questionnaire and the auditory processing tests. The greatest coefficient was observed in the temporal processing test (p=0.000). There was a correlation between the SAB score and performance in the auditory processing tests, suggesting that the SAB may be used for auditory processing screening.	\N	\N
24413019	The neural underpinnings of auditory information processing have often been investigated using the odd-ball paradigm, in which infrequent sounds (deviants) are presented within a regular train of frequent stimuli (standards). Traditionally, this paradigm has been applied using either high temporal resolution (EEG) or high spatial resolution (fMRI, PET). However, used separately, these techniques cannot provide information on both the location and time course of particular neural processes. The goal of this study was to investigate the neural correlates of auditory processes with a fine spatio-temporal resolution. A simultaneous auditory evoked potentials (AEP) and functional magnetic resonance imaging (fMRI) technique (AEP-fMRI), together with an odd-ball paradigm, were used. Six healthy volunteers, aged 20-35 years, participated in an odd-ball simultaneous AEP-fMRI experiment. AEP in response to acoustic stimuli were used to model bioelectric intracerebral generators, and electrophysiological results were integrated with fMRI data. fMRI activation evoked by standard stimuli was found to occur mainly in the primary auditory cortex. Activity in these regions overlapped with intracerebral bioelectric sources (dipoles) of the N1 component. Dipoles of the N1/P2 complex in response to standard stimuli were also found in the auditory pathway between the thalamus and the auditory cortex. Deviant stimuli induced fMRI activity in the anterior cingulate gyrus, insula, and parietal lobes. The present study showed that neural processes evoked by standard stimuli occur predominantly in subcortical and cortical structures of the auditory pathway. Deviants activate areas non-specific for auditory information processing.	\N	\N
24419006	Three experiments investigated the role of memory and rehearsal in a dichotic emotion recognition task by manipulating the response procedure as well as the interval between encoding and retrieval while taking into account order of report. For all experiments, right-handed undergraduates were presented with dichotic pairs of the words bower, dower, power, and tower pronounced in a sad, angry, happy, or neutral tone of voice. Participants were asked to report the two emotions presented on each trial by clicking on the corresponding drawings or words on a computer screen, either following no delay or a five second delay. Experiment 1 applied the delay conditions as a between-subjects factor whereas it was a within-subject factor in Experiment 2. In Experiments 1 and 2, more correct responses occurred for the left than the right ear, reflecting a left ear advantage (LEA) that was slightly larger with a nonverbal than a verbal response. The LEA was also found to be larger with no delay than with the 5s delay. In addition, participants typically responded first to the left ear stimulus. In fact, the first response produced a LEA whereas the second response produced a right ear advantage. Experiment 3 involved a concurrent task during the delay to prevent rehearsal. In Experiment 3, the pattern of results supported the claim that rehearsal could account for the findings of the first two experiments. The findings are interpreted in the context of the role of rehearsal and memory in models of dichotic listening.	\N	\N
24429136	How humans solve the cocktail party problem remains unknown. However, progress has been made recently thanks to the realization that cortical activity tracks the amplitude envelope of speech. This has led to the development of regression methods for studying the neurophysiology of continuous speech. One such method, known as stimulus-reconstruction, has been successfully utilized with cortical surface recordings and magnetoencephalography (MEG). However, the former is invasive and gives a relatively restricted view of processing along the auditory hierarchy, whereas the latter is expensive and rare. Thus it would be extremely useful for research in many populations if stimulus-reconstruction was effective using electroencephalography (EEG), a widely available and inexpensive technology. Here we show that single-trial (≈60 s) unaveraged EEG data can be decoded to determine attentional selection in a naturalistic multispeaker environment. Furthermore, we show a significant correlation between our EEG-based measure of attention and performance on a high-level attention task. In addition, by attempting to decode attention at individual latencies, we identify neural processing at ∼200 ms as being critical for solving the cocktail party problem. These findings open up new avenues for studying the ongoing dynamics of cognition using EEG and for developing effective and natural brain-computer interfaces.	\N	\N
24429520	Historically, the study of speech processing has emphasized a strong link between auditory perceptual input and motor production output. A kind of 'parity' is essential, as both perception- and production-based representations must form a unified interface to facilitate access to higher-order language processes such as syntax and semantics, believed to be computed in the dominant, typically left hemisphere. Although various theories have been proposed to unite perception and production, the underlying neural mechanisms are unclear. Early models of speech and language processing proposed that perceptual processing occurred in the left posterior superior temporal gyrus (Wernicke's area) and motor production processes occurred in the left inferior frontal gyrus (Broca's area). Sensory activity was proposed to link to production activity through connecting fibre tracts, forming the left lateralized speech sensory-motor system. Although recent evidence indicates that speech perception occurs bilaterally, prevailing models maintain that the speech sensory-motor system is left lateralized and facilitates the transformation from sensory-based auditory representations to motor-based production representations. However, evidence for the lateralized computation of sensory-motor speech transformations is indirect and primarily comes from stroke patients that have speech repetition deficits (conduction aphasia) and studies using covert speech and haemodynamic functional imaging. Whether the speech sensory-motor system is lateralized, like higher-order language processes, or bilateral, like speech perception, is controversial. Here we use direct neural recordings in subjects performing sensory-motor tasks involving overt speech production to show that sensory-motor transformations occur bilaterally. 
We demonstrate that electrodes over bilateral inferior frontal, inferior parietal, superior temporal, premotor and somatosensory cortices exhibit robust sensory-motor neural responses during both perception and production in an overt word-repetition task. Using a non-word transformation task, we show that bilateral sensory-motor responses can perform transformations between speech-perception- and speech-production-based representations. These results establish a bilateral sublexical speech sensory-motor system.	\N	\N
24431427	The new DSM-5 diagnostic criteria for autism spectrum disorders (ASDs) include sensory disturbances in addition to the well-established language, communication, and social deficits. One sensory disturbance seen in ASD is an impaired ability to integrate multisensory information into a unified percept. This may arise from an underlying impairment in which individuals with ASD have difficulty perceiving the temporal relationship between cross-modal inputs, an important cue for multisensory integration. Such impairments in multisensory processing may cascade into higher-level deficits, impairing day-to-day functioning on tasks, such as speech perception. To investigate multisensory temporal processing deficits in ASD and their links to speech processing, the current study mapped performance on a number of multisensory temporal tasks (with both simple and complex stimuli) onto the ability of individuals with ASD to perceptually bind audiovisual speech signals. High-functioning children with ASD were compared with a group of typically developing children. Performance on the multisensory temporal tasks varied with stimulus complexity for both groups; less precise temporal processing was observed with increasing stimulus complexity. Notably, individuals with ASD showed a speech-specific deficit in multisensory temporal processing. Most importantly, the strength of perceptual binding of audiovisual speech observed in individuals with ASD was strongly related to their low-level multisensory temporal processing abilities. Collectively, the results represent the first to illustrate links between multisensory temporal function and speech processing in ASD, strongly suggesting that deficits in low-level sensory processing may cascade into higher-order domains, such as language and communication.	\N	\N
24437771	Auditory deprivation and stimulation can change the threshold of the acoustic middle ear reflex as well as loudness in adult listeners. However, it has remained unclear whether changes in these measures are due to the same mechanism. In this study, deprivation was achieved using a monaural earplug that was worn by listeners for 7 days. Acoustic reflex thresholds (ARTs) and categorical loudness ratings were measured using a blinded design in which the experimenter was unaware of which ear had been plugged. Immediately after terminating unilateral deprivation, ARTs were obtained at a lower sound pressure level in the ear that had been fitted with an earplug and at a higher sound pressure level in the control ear. In contrast, categorical judgments of loudness changed in the same direction in both ears with a given stimulus level reported as louder after unilateral deprivation. The relationship between changes to the ART and loudness judgments was not statistically significant. For both the ARTs and the categorical loudness judgments, most of the changes had disappeared within 24 h after earplug removal. The changes in ARTs, as a consequence of unilateral sound deprivation, are consistent with a gain control mechanism; however, the lack of relationship with the categorical loudness judgments, and the different pattern of findings for each measure, suggests the possibility of multiple gain mechanisms.	\N	\N
24437774	The discrimination of interaural phase differences (IPDs) requires accurate binaural temporal processing and has been used as a measure of sensitivity to temporal envelope and temporal fine structure (TFS). Previous studies found that TFS-IPD discrimination declined with age and with sensorineural hearing loss (SNHL), but age and SNHL have often been confounded. The aim of this study was to determine the independent contributions of age and SNHL to TFS and envelope IPD discrimination by using a sample of adults with a wide range of ages and SNHL. A two-interval, two-alternative forced-choice procedure was used to measure IPD discrimination thresholds for 20-Hz amplitude-modulated tones with carrier frequencies of 250 or 500 Hz when the IPD was in either the stimulus envelope or TFS. There were positive correlations between absolute thresholds and TFS-IPD thresholds, but not envelope-IPD thresholds, when age was accounted for. This supports the idea that SNHL affects TFS processing independently of age. Age was positively correlated with envelope-IPD thresholds at both carrier frequencies and TFS-IPD thresholds at 500 Hz, when absolute thresholds were accounted for. These results suggest that age negatively affects the binaural processing of envelope and TFS at some frequencies independently of SNHL.	\N	\N
24441742	Abnormal hearing tests have been noted in human immunodeficiency virus (HIV)-infected patients in several studies, but the nature of the hearing deficit has not been clearly defined. The authors performed a cross-sectional study of both HIV+ and HIV- individuals in Tanzania by using an audiological test battery. The authors hypothesized that HIV+ adults would have a higher prevalence of abnormal central and peripheral hearing test results compared with HIV- controls. In addition, they anticipated that the prevalence of abnormal hearing assessments would increase with antiretroviral therapy (ART) use and treatment for tuberculosis (TB). Pure-tone thresholds, distortion product otoacoustic emissions (DPOAEs), tympanometry, and a gap-detection test were performed using a laptop-based hearing testing system on 751 subjects (100 HIV- in the United States, plus 651 in Dar es Salaam, Tanzania, including 449 HIV+ [130 ART- and 319 ART+] and 202 HIV- subjects). No U.S. subjects had a history of TB treatment. In Tanzania, 204 of the HIV+ and 23 of the HIV- subjects had a history of TB treatment. Subjects completed a video and audio questionnaire about their hearing, as well as a health history questionnaire. HIV+ subjects had reduced DPOAE levels compared with HIV- subjects, but their hearing thresholds, tympanometry results, and gap-detection thresholds were similar. Within the HIV+ group, those on ART reported significantly greater difficulties understanding speech in noise, and were significantly more likely to report that they had difficulty understanding speech than the ART- group. The ART+ group had a significantly higher mean gap-detection threshold compared with the ART- group. No effects of TB treatment were seen. 
The fact that the ART+/ART- groups did not differ in measures of peripheral hearing ability (DPOAEs, thresholds), or middle ear measures (tympanometry), but that the ART+ group had significantly more trouble understanding speech and had higher gap-detection thresholds indicates a central processing deficit. These data suggest that: (1) hearing deficits in HIV+ individuals could be a CNS side effect of HIV infection, (2) certain ART regimens might produce CNS side effects that manifest themselves as hearing difficulties, and/or (3) some ART regimens may treat CNS HIV inadequately, perhaps due to insufficient CNS drug levels, which is reflected as a central hearing deficit. Monitoring of central hearing parameters could be used to track central effects of either HIV or ART.	\N	\N
24447236	This discussion paper aims to synthesise the literature on patient-centred care from a range of health professions and to relate this to the field of rehabilitative audiology. Through review of the literature, this paper addresses five questions: What is patient-centred care? How is patient-centred care measured? What are the outcomes of patient-centred care? What are the factors contributing to patient-centred care? What are the implications for audiological rehabilitation? Literature review and synthesis. Publications were identified by structured searches in PubMed, Cinahl, Web of Knowledge, and PsychInfo, and by inspecting the reference lists of relevant articles. Few publications from within the audiology profession address this topic and consequently a review and synthesis of literature from other areas of health were used to answer the proposed questions. This paper concludes that patient-centred care is in line with the aims and scope of practice for audiological rehabilitation. However, there is emerging evidence that we still need to inform the conceptualisation of patient-centred audiological rehabilitation. A definition of patient-centred audiological rehabilitation is needed to facilitate studies into the nature and outcomes of it in audiological rehabilitation practice.	\N	\N
24456399	Crossmodal integration of auditory and visual information, such as phonemes and graphemes, is a critical skill for fluent reading. Previous work has demonstrated that white matter connectivity along the arcuate fasciculus (AF) is predicted by reading skill and that crossmodal processing particularly activates the posterior STS (pSTS). However, the relationship between this crossmodal activation and white matter integrity has not been previously reported. We investigated the interrelationship of crossmodal integration, both in terms of behavioral performance and pSTS activity, with AF tract coherence using a rhyme judgment task in a group of 47 children with a range of reading abilities. We demonstrate that both response accuracy and pSTS activity for crossmodal (auditory-visual) rhyme judgments were predictive of fractional anisotropy along the left AF. Unimodal (auditory-only or visual-only) pSTS activity was not significantly related to AF connectivity. Furthermore, activity in other reading-related ROIs did not show the same AV-only AF coherence relationship, and AV pSTS activity was not related to connectivity along other language-related tracts. This study is the first to directly show that crossmodal brain activity is specifically related to connectivity in the AF, supporting its role in phoneme-grapheme integration ability. More generally, this study helps to define an interdependent neural network for reading-related integration.	\N	\N
24488957	In categorical perception (CP), continuous physical signals are mapped to discrete perceptual bins: mental categories not found in the physical world. CP has been demonstrated across multiple sensory modalities and, in audition, for certain over-learned speech and musical sounds. The neural basis of auditory CP, however, remains ambiguous, including its robustness in nonspeech processes and the relative roles of left/right hemispheres; primary/nonprimary cortices; and ventral/dorsal perceptual processing streams. Here, highly trained musicians listened to 2-tone musical intervals, which they perceive categorically while undergoing functional magnetic resonance imaging. Multivariate pattern analyses were performed after grouping sounds by interval quality (determined by frequency ratio between tones) or pitch height (perceived noncategorically, frequency ratios remain constant). Distributed activity patterns in spheres of voxels were used to determine sound sample identities. For intervals, significant decoding accuracy was observed in the right superior temporal and left intraparietal sulci, with smaller peaks observed homologously in contralateral hemispheres. For pitch height, no significant decoding accuracy was observed, consistent with the non-CP of this dimension. These results suggest that similar mechanisms are operative for nonspeech categories as for speech; espouse roles for 2 segregated processing streams; and support hierarchical processing models for CP.	\N	\N
24489819	This study tested the hypothesis that the previously reported advantage of musicians over non-musicians in understanding speech in noise arises from more efficient or robust coding of periodic voiced speech, particularly in fluctuating backgrounds. Speech intelligibility was measured in listeners with extensive musical training, and in those with very little musical training or experience, using normal (voiced) or whispered (unvoiced) grammatically correct nonsense sentences in noise that was spectrally shaped to match the long-term spectrum of the speech, and was either continuous or gated with a 16-Hz square wave. Performance was also measured in clinical speech-in-noise tests and in pitch discrimination. Musicians exhibited enhanced pitch discrimination, as expected. However, no systematic or statistically significant advantage for musicians over non-musicians was found in understanding either voiced or whispered sentences in either continuous or gated noise. Musicians also showed no statistically significant advantage in the clinical speech-in-noise tests. Overall, the results provide no evidence for a significant difference between young adult musicians and non-musicians in their ability to understand speech in noise.	\N	\N
24490946	Studies of face recognition in older adults (60 years of age and older) report increases in false alarms over younger adults (usually 18-30 years of age), but no age differences in hits. To examine this phenomenon, we compared older and younger adults in categorical perception of faces. We hypothesized that face representations in older adults would be broadly tuned, resulting in overlapping representations, manifested by a shallower slope in identity categorization than in younger adults, and age-related reductions in the advantage for between-categories, as compared with within-category, face discrimination. We morphed faces to change linearly from one identity to another. We used familiar or unfamiliar faces in separate conditions to examine the role of familiarity. Categorical perception was assessed in an identity-classification task and a discrimination task. Older adults showed a shallower slope and poorer discrimination compared with younger adults, and both groups exhibited better performance with familiar than unfamiliar faces. Enhanced discriminability for between-categories as compared with within-category faces was seen for both familiar and unfamiliar faces in younger adults, but only for familiar faces in older adults. The more broadly tuned representations of unfamiliar faces in older adults may lead to misidentification and greater false alarms for unfamiliar faces, but not for familiar faces.	\N	\N
24496288	This study evaluated effects of nonlinear frequency compression (NLFC) processing in children with hearing loss for consonant identification in quiet and for spondee identification in competing noise or speech. It was predicted that participants would benefit from NLFC for consonant identification in quiet when access to high-frequency information was critical, but that NLFC would be less beneficial, or even detrimental, when identification relied on mid-frequency cues. Further, it was hypothesized that NLFC could result in greater susceptibility to masking in the spondee task. The rationale for these predictions is that improved access to high-frequency information comes at the cost of decreased spectral resolution. A repeated-measures design compared speech-perception outcomes in 17 pediatric hearing aid users (9 to 17 years of age) wearing Naida V SP "laboratory" hearing aids with NLFC on and off. Data were also collected in an initial baseline session in which children wore their personal hearing aids. Children with a wide range of audiometric configurations were included, but all participants were full-time users of hearing aids with active NLFC. For each hearing aid condition, speech perception was assessed in the sound field by using a closed-set 12-alternative consonant-vowel identification measure in quiet, and a closed-set four-alternative spondee-identification measure in a speech-shaped noise or in a two-talker speech masker. No significant differences in performance were observed between laboratory hearing aid conditions with NLFC activated or deactivated for either speech-perception measure. An unexpected finding was that the majority of participants had no difficulty identifying the high-frequency consonant /s/ even when NLFC was deactivated. 
Investigation into individual differences revealed that subjects with a greater difference in audible bandwidth with NLFC on versus NLFC off were less likely to demonstrate improvements in high-frequency consonant identification in quiet, but were more likely to demonstrate improvements in spondee identification in speech-shaped noise. Group results observed in the initial baseline assessment using personal aids fitted with more aggressive NLFC settings than used in laboratory aids indicated better consonant identification accuracy in quiet. However, spondee identification in the two-talker masker was poorer with personal compared with laboratory hearing aids. Comparisons across personal and laboratory hearing aids are tempered, however, by the potential of an order effect. The observation of comparable performance with NLFC on and NLFC off in the laboratory aids provides evidence that NLFC is neither detrimental nor advantageous when modest in strength. Results with personal hearing aids fitted with stronger compression settings than laboratory aids (NLFC on) highlight the critical need for further research to determine the impact of NLFC processing on speech perception for a wider range of speech-perception measures and compression settings.	\N	\N
24503772	The bone-anchored hearing device (BAHD) was not introduced in China until 2010. To our knowledge, this is the first study to assess the efficacy of the BAHD in Chinese Mandarin-speaking patients with bilateral aural atresia. To evaluate the speech recognition of Chinese Mandarin-speaking patients with BAHDs as well as patients' satisfaction using 2 questionnaires. A retrospective case review of 16 patients with bilateral aural atresia conducted at a tertiary referral center. A BAHD was implanted during auricle reconstruction surgery or after the auricle was rebuilt. A surgical method to combine the BAHD implantation with the second stage of ear reconstruction was introduced. Speech audiometry test and mean pure-tone threshold results were compared among patients with unaided hearing and those with BAHDs. Scores from the BAHD user questionnaire and Glasgow Children's Benefit Inventory (GCBI) were used to measure patients' satisfaction and subjective health benefit. The mean (SD) speech discrimination scores measured in a sound field with a presentation level of 45 dB HL (hearing level) were 6.7% (7.4%) unaided and 86.5% (4.4%) with a BAHD. Scores with a presentation level of 65 dB HL were 56.5% (7.4%) unaided and 90.1% (3.4%) with a BAHD. The speech reception threshold was 60.6 (7.5) dB HL unaided and 24.7 (5.0) dB HL with a BAHD. The mean (SD) pure-tone threshold of the patients was 61.6 (7.8) dB HL unaided and 23.8 (5.9) dB HL with a BAHD. The BAHD application questionnaire demonstrated excellent patient satisfaction. The mean (SD) benefit score of GCBI was 45.6 (14.4). For aural atresia, the BAHD has been one of the most reliable methods of auditory rehabilitation. It can improve the patient's word recognition performance and quality of life. The technique of BAHD implantation combined with auricular reconstruction in a 2-stages-in-1 surgery and the modified incision of patients with reconstructed auricle proved to be safe and effective.	\N	\N
24508369	Acoustic communication requires gathering, transforming, and interpreting diverse sound cues. To achieve this, all the spatial and temporal features of complex sound stimuli must be captured in the firing patterns of the primary sensory neurons and then accurately transmitted along auditory pathways for additional processing. The mammalian auditory system relies on several synapses with unique properties in order to meet this task: the auditory ribbon synapses, the endbulb of Held, and the calyx of Held. Each of these synapses develops morphological and electrophysiological characteristics that enable the remarkably precise signal transmission necessary for conveying the minuscule differences in timing that underlie sound localization. In this article, we review the current knowledge of how these synapses develop and mature to acquire the specialized features necessary for the sense of hearing.	\N	\N
24508791	Amblyopia is a developmental disorder that results in both monocular and binocular deficits. Although traditional treatment in clinical practice (i.e., refractive correction, or occlusion by patching and penalization of the fellow eye) is effective in restoring monocular visual acuity, there is little information on how binocular function, especially stereopsis, responds to traditional amblyopia treatment. We aim to evaluate the effects of perceptual learning on stereopsis in observers with amblyopia in the current study. Eleven observers (21.1 ± 5.1 years, six females) with anisometropic or ametropic amblyopia were trained to judge depth in 10 to 13 sessions. Red-green glasses were used to present three different texture anaglyphs with different disparities but a fixed exposure duration. Stereoacuity was assessed with the Fly Stereo Acuity Test and visual acuity was assessed with the Chinese Tumbling E Chart before and after training. Averaged across observers, training significantly reduced disparity threshold from 776.7″ to 490.4″ (P < 0.01) and improved stereoacuity from 200.3″ to 81.6″ (P < 0.01). Interestingly, visual acuity also significantly improved from 0.44 to 0.35 logMAR (approximately 0.9 lines, P < 0.05) in the amblyopic eye after training. Moreover, the learning effects in two of the three retested observers were largely retained over a 5-month period. Perceptual learning is effective in improving stereo vision in observers with amblyopia. These results, together with previous evidence, suggest that structured monocular and binocular training might be necessary to fully recover degraded visual functions in amblyopia.	\N	\N
24514158	This study analyses the meaning spaces of German pitch contours using two modes of melodic movement: continuous or in steps of sustained pitch. Both the continuous and stepped movements are represented by a set of five basic patterns, the latter being derived from the former. Thirty-six German native speakers judged the pattern sets on a 12-scale semantic differential. The semantic profiles confirm that stepped contours can be conceived of as stylized intonation, in a formal as well as in a functional sense. On the one hand, continuous (non-stylized) and stepped (stylized) contours are assigned different overall meanings (especially on the scales astonished - commonplace and interested - not interested). On the other hand, listeners organize the two contour sets in a similar fashion, which speaks in favour of parallel pattern inventories of continuous and stepped movement, respectively. However, the meaning space of the stylized patterns is affected by formal restrictions, for instance in the step transformation of continuous rises.	\N	\N
24525262	What are the temporal dynamics of perceptual sampling during visual search tasks, and how do they differ between a difficult (or inefficient) and an easy (or efficient) task? Does attention focus intermittently on the stimuli, or are the stimuli processed continuously over time? We addressed these questions by way of a new paradigm using periodic fluctuations of stimulus information during a difficult (color-orientation conjunction) and an easy (+ among Ls) search task. On each stimulus, we applied a dynamic visual noise that oscillated at a given frequency (2-20 Hz, 2-Hz steps) and phase (four cardinal phase angles) for 500 ms. We estimated the dynamics of attentional sampling by computing an inverse Fourier transform on subjects' d-primes. In both tasks, the sampling function presented a significant peak at 2 Hz; we showed that this peak could be explained by nonperiodic search strategies such as increased sensitivity to stimulus onset and offset. Specifically in the difficult task, however, a second, higher-frequency peak was observed at 9 to 10 Hz, with a similar phase for all subjects; this isolated frequency component necessarily entails oscillatory attentional dynamics. In a second experiment, we presented difficult search arrays with dynamic noise that was modulated by the previously obtained grand-average attention sampling function or by its converse function (in both cases omitting the 2 Hz component to focus on genuine oscillatory dynamics). We verified that performance was higher in the latter than in the former case, even for subjects who had not participated in the first experiment. This study supports the idea of a periodic sampling of attention during a difficult search task. Although further experiments will be needed to extend these findings to other search tasks, the present report validates the usefulness of this novel paradigm for measuring the temporal dynamics of attention.	\N	\N
24533757	The effect of deactivating indiscriminable cochlear implant (CI) electrodes for unilaterally implanted adults was evaluated using the BKB (Bamford-Kowal-Bench) sentence test in quiet and in pink noise (signal-to-noise ratio of +10 dBA) and the adaptive Coordinate Response Measure (CRM). Each CI recipient who failed electrode differentiation (ED) in at least one electrode-pair, based on results of a pure-tone pitch-ranking task, received two research programmes to try out in a cross-over study. Research programmes (RP) either employed discriminable electrodes only or the most discriminable two-thirds of the electrodes in the electrode array for CI recipients failing ED for more than a third of the electrodes. The participants were also asked to subjectively report improvement of or decline in sound quality in everyday listening situations. There was significant improvement in CRM speech reception thresholds (SRTs) (Z = -3.24, N = 15, P = 0.001), BKB sentence scores in quiet (t = 3.17, df = 24, P < 0.005) and also in pink noise (t = 2.26, df = 19, P < 0.005) after deactivating indiscriminable electrodes.	\N	\N
24548324	Not much is known about how people comprehend ironic utterances, and to date, most studies have simply compared processing of ironic versus non-ironic statements. A key aspect of the graded salience hypothesis, distinguishing it from other accounts (such as the standard pragmatic view and direct access view), is that it predicts differences between processing of familiar and unfamiliar ironies. Specifically, if an ironic utterance is familiar, then the ironic interpretation should be available without the need for extra inferential processes, whereas for unfamiliar ironies, the literal interpretation would be computed first, and a mismatch with context would lead to a re-interpretation of the statement as being ironic. We recorded participants' eye movements while they were reading (Experiment 1), and electrical brain activity while they were listening to (Experiment 2), familiar and unfamiliar ironies compared to non-ironic controls. Results show disruption to eye movements and an N400-like effect for unfamiliar ironies only, supporting the predictions of the graded salience hypothesis. In addition, in Experiment 2, a late positivity was found for both familiar and unfamiliar ironic materials, compared to non-ironic controls. We interpret this positivity as reflecting ongoing conflict between the literal and ironic interpretations of the utterance.	\N	\N
24557002	The rapidly evolving field of early diagnostics after the introduction of newborn hearing screening requires rapid, valid, and objective methods, which have to be thoroughly evaluated in adults before use in infants. The aim was to study cross-correlation analysis of interleaved auditory brainstem responses (ABRs) in a wide dynamic range in normal-hearing adults. Off-line analysis allowed for comparison with psychoacoustical click threshold (PCT), pure-tone threshold, and determination of ABR input/output function. Specifically, nonfiltered and band-pass filtered ABRs were studied in various time segments along with time elapsed for ensemble of sweeps reaching a specific detection criterion. Fourteen healthy normal-hearing subjects (18 to 35 years of age, 50% females) without any history of noise exposure participated. They all had pure-tone thresholds better than 20 dB HL (125 to 8000 Hz). ABRs were recorded in both ears using 100 μsec clicks, from 71.5 dB nHL down to -18.5 dB nHL, in 10 dB steps (repetition rate, 39 Hz; time window, 15 msec; filter, 30 to 8000 Hz). The number of sweeps increased from 2000 at 71.5 dB nHL, up to 30000 at -18.5 dB nHL. Each sweep was stored in a data base for off-line analysis. Cross-correlation analysis between two subaverages of interleaved responses was performed in the time domain for nonfiltered and digitally band-pass filtered (300 to 1500 Hz) entire and time-windowed (1 to 11 and 5 to 11 msec) responses. PCTs were measured using a Bekesy technique with the same insert phone and stimulus as used for the ABR (repetition rate, 20 Hz). Time elapsed (≈ number of accepted sweeps/repetition rate) for the ensemble of sweeps needed to reach a cross-correlation coefficient (ρ) of 0.70 (=3.7 dB signal-to-noise ratio [SNR]) was analyzed. Mean cross-correlation coefficients exceeded 0.90 in both ears at stimulus levels ≥11.5 dB nHL for the entire nonfiltered ABR. 
At 1.5 dB nHL, mean(SD) ρ was 0.53(0.32) and 0.44(0.40) for left and right ears, respectively (n = 14) (=0 dB SNR). In comparison, mean(SD) PCT was -1.9(2.9) and -2.5(3.2) dB nHL for left and right ears, respectively (n = 14), while mean pure-tone average (500 to 2000 Hz) was 2.5 dB HL (n = 28). Almost no effect of band-pass filtering or reduced analysis time window existed. Average time elapsed needed to reach ρ = 0.70 was approximately 20 seconds or less at stimulus levels ≥41.5 dB nHL, and ≈30 seconds at 31.5 dB nHL. The average (interpolated) stimulus level corresponding to ρ=0.70 for the entire nonfiltered ABR was 6.5 dB nHL (n = 28), which coincided with the estimated psychoacoustical threshold for single clicks. ABR could be identified in a short period of time using cross-correlation analysis between interleaved responses. The average stimulus level corresponding to 0 dB SNR in the entire nonfiltered ABR occurred at 1.5 dB nHL, 4 dB above the average PCT. The mean input/output function for the ensemble of sweeps required to reach ρ = 0.70 increased monotonically with increasing stimulus level, in parallel with the ABR based on all sweeps (≥1.5 dB nHL). Time domain cross-correlation analysis of ABR might form the basis for automatic response identification and future threshold-seeking procedures.	\N	\N
24564623	To evaluate the hearing of adolescents with diabetes mellitus type 1 (DM1) by otoacoustic emissions (OAEs), and by comparing different tests with pure-tone audiometry to identify potential early cochlear impairments. Pure-tone audiometry, transient evoked otoacoustic emissions (TEOAEs), and distortion product otoacoustic emissions (DPOAEs) were performed in a group of adolescents with and without DM1. Clinical characteristics, disease duration, and glycated haemoglobin levels were studied. Participants were 40 adolescents with DM1 and 40 healthy subjects. Sensorineural hearing loss, affecting frequencies of 6000 and 8000 Hz, was found only in DM1 subjects when compared to the controls (7.7% vs. 0%, p < 0.05). A higher prevalence of cochlear damage was detected by DPOAE responses: 32% in the diabetic group vs. 3.7% in the control group. Absent TEOAE responses were observed in only three individuals, all from the diabetic group (5.1% of the tests performed in the diabetic group). Additionally, hearing thresholds were better in diabetic subjects with good control when compared to ones with regular or poor control (p = 0.00). Hearing thresholds were higher in poorly controlled diabetics when compared to subjects with good (p = 0.000) or regular control (p = 0.006). Early evidence of cochlear damage was detected in adolescents with DM1 leading to hearing loss at high frequencies. Abnormal DPOAE responses were found more frequently than alterations in TEOAEs and pure-tone audiometry, suggesting that DPOAE evaluation is the most sensitive test and could be used for monitoring the progression of cochlear damage during the early stages of hearing impairment.	\N	\N
24564624	Detailed information on the hearing threshold levels (HTLs) of young Australians was gathered as part of a large-scale study of the relationship between HTL and leisure-noise exposure in young Australians. HTL data for the study population (18-35 year olds) were carefully collected, as well as otoscopy, tympanometry, contra-lateral acoustic reflexes, and otoacoustic emissions (transient and distortion product), together with a comprehensive hearing health history - both past and present. The sample cohort consisted of 1407 individuals, females and males. Prior to analysis, HTL data were filtered according to both a 'Low' and 'High' set of exclusion criteria. The results obtained for both high-screen and low-screen datasets were around +5 dB above the traditionally accepted values of audiometric zero. This is consistent with previously published reports. Comparison with 'ISO 7029 Acoustics: Statistical distribution of hearing thresholds as a function of age' indicated that threshold values for this dataset have a similar distribution to those of the Standard. These data provide a suitable reference HTL ('normative') database for young Australians.	\N	\N
24564688	To characterize the impulse noise exposure and auditory risk for youth recreational firearm users engaged in outdoor target shooting events. The youth shooting positions are typically standing or sitting at a table, which places the firearm closer to the ground or reflective surface when compared to adult shooters. Acoustic characteristics were examined and the auditory risk estimates were evaluated using contemporary damage-risk criteria for unprotected adult listeners and the 120-dB peak limit suggested by the World Health Organization (1999) for children. Impulses were generated by 26 firearm/ammunition configurations representing rifles, shotguns, and pistols used by youth. Measurements were obtained relative to a youth shooter's left ear. All firearms generated peak levels that exceeded the 120-dB peak limit suggested by the WHO for children. In general, shooting from the seated position over a tabletop increases the peak levels and LAeq8, and reduces the unprotected maximum permissible exposures (MPEs) for both rifles and pistols. Pistols pose the greatest auditory risk when fired over a tabletop. Youth should utilize smaller caliber weapons, preferably from the standing position, and always wear hearing protection whenever engaging in shooting activities to reduce the risk for auditory damage.	\N	\N
24568928	A new study reports that activations of superior temporal regions for speech are normal in dyslexia, although these regions are less well connected to downstream frontal regions. These findings support the hypothesis of a deficit in the access to phonological representations rather than in the representations themselves.	\N	\N
24569986	Viewing behavior exhibits temporal and spatial structure that is independent of stimulus content and task goals. One example of such structure is horizontal biases, which are likely rooted in left-right asymmetries of the visual and attentional systems. Here, we studied the existence, extent, and mechanisms of this bias. Left- and right-handed subjects explored scenes from different image categories, presented in original and mirrored versions. We also varied the spatial spectral content of the images and the timing of stimulus onset. We found a marked leftward bias at the start of exploration that was independent of image category. This left bias was followed by a weak bias to the right that persisted for several seconds. This asymmetry was found in the majority of right-handers but not in left-handers. Neither low- nor high-pass filtering of the stimuli influenced the bias. This argues against mechanisms related to the hemispheric segregation of global versus local visual processing. Introducing a delay in stimulus onset after offset of a central fixation spot also had no influence. The bias was present even when stimuli were presented continuously and without any requirement to fixate, associated with both fixation- and saccade-contingent image changes. This suggests the bias is not caused by structural asymmetries in fixation control. Instead, the pervasive horizontal bias is compatible with known asymmetries of higher-level attentional areas related to the detection of novel events.	\N	\N
24580021	To evaluate the impact on voice of 2 hours of continuous oral reading. Fifty normophonic women underwent two sessions of voice loading in which the required intensity level varied: 60-65 dB(A) for the first session, and 70-75 dB(A) for the second session. Ten expert judges evaluated the breathiness of one sentence recorded before and after each loading session. Pairs of stimuli were presented randomly to the judges, who were asked to designate the breathiest sample. A significant decrease in breathiness was observed following both sessions, suggesting an improvement of voice subsequent to loading. When comparing the two intensity levels, no difference was found for breathiness after vocal loading.	\N	\N
24584899	The ability of humans to echolocate has been recognized since the 1940s. Little is known about what determines individual differences in echolocation ability, however. Although hearing ability has been suggested as an important factor in blind people and sighted-trained echolocators, there is evidence to suggest that this may not be the case for sighted novices. Therefore, non-auditory aspects of human cognition might be relevant. Previous brain imaging studies have shown activation of the early 'visual', i.e. calcarine, cortex during echolocation in blind echolocation experts, and also during visual imagery in blind and sighted people. Therefore, here we investigated the relationship between echolocation ability and vividness of visual imagery (VVI). Twenty-four sighted echolocation novices completed Marks' (Br J Psychol 1:17-24, 1973) VVI questionnaire and they also performed an echolocation size-discrimination task. Furthermore, they participated in a battery of auditory tests that determined their ability to detect fluctuations in sound frequency and intensity, as well as hearing differences between the right and left ear. A correlational analysis revealed a significant relationship between participants' VVI and echolocation ability, i.e. participants with stronger VVI also had higher echolocation ability, even when differences in auditory abilities were taken into account. In terms of underlying mechanisms, we suggest that either the use of visual imagery is a strategy for echolocation, or that visual imagery and echolocation both depend on the ability to recruit calcarine cortex for cognitive tasks that do not rely on retinal input.	\N	\N
24588528	To investigate the occurrence of 27 chronic medical conditions in a cohort of adults with and without hearing impairment, and to examine the association between these conditions and hearing ability. The National Longitudinal Study on Hearing (NL-SH study) is a large prospective study among adults aged 18 to 70 years, conducted via the internet in the Netherlands. Hearing ability was measured with a digits-in-noise test and comorbidity was assessed through self-report. Cross-sectional data of 890 hearing-impaired and 975 normally-hearing adults were analyzed. Both descriptive statistics and multinomial logistic regression analyses were conducted. Of the NL-SH participants with insufficient or poor hearing ability, 78.5% reported suffering from at least one additional chronic condition. This proportion was larger than in the normally-hearing group (68.6% with one or more chronic conditions and 37.7% with two or more). After adjustment for age and gender, 'dizziness causing falling', 'diabetes' and 'arthritis types other than osteoarthritis and rheumatic arthritis' were significantly associated with poor hearing ability. Our results show that some previously reported associations occur not only in older age groups, but also in younger cohorts. Comorbidity is relevant in the rehabilitation (multi-disciplinary care) and the clinical encounter.	\N	\N
24598525	Changes in amplitude and frequency jointly determine much of the communicative significance of complex acoustic signals, including human speech. We have previously described responses of neurons in the core auditory cortex of awake rhesus macaques to sinusoidal amplitude modulation (SAM) signals. Here we report a complementary study of sinusoidal frequency modulation (SFM) in the same neurons. Responses to SFM were analogous to SAM responses in that changes in multiple parameters defining SFM stimuli (e.g., modulation frequency, modulation depth, carrier frequency) were robustly encoded in the temporal dynamics of the spike trains. For example, changes in the carrier frequency produced highly reproducible changes in shapes of the modulation period histogram, consistent with the notion that the instantaneous probability of discharge mirrors the moment-by-moment spectrum at low modulation rates. The upper limit for phase locking was similar across SAM and SFM within neurons, suggesting shared biophysical constraints on temporal processing. Using spike train classification methods, we found that neural thresholds for modulation depth discrimination are typically far lower than would be predicted from frequency tuning to static tones. This "dynamic hyperacuity" suggests a substantial central enhancement of the neural representation of frequency changes relative to the auditory periphery. Spike timing information was superior to average rate information when discriminating among SFM signals, and even when discriminating among static tones varying in frequency. This finding held even when differences in total spike count across stimuli were normalized, indicating both the primacy and generality of temporal response dynamics in cortical auditory processing.	\N	\N
24606277	While many studies have assessed the efficacy of similarity-based cues for auditory stream segregation, much less is known about whether and how the larger-scale structure of sound sequences supports stream formation and the choice of sound organization. Two experiments investigated the effects of musical melody and rhythm on the segregation of two interleaved tone sequences. The two sets of tones fully overlapped in pitch range but differed from each other in interaural time and intensity. Unbeknownst to the listener, separately, each of the interleaved sequences was created from the notes of a different song. In different experimental conditions, the notes and/or their timing could either follow those of the songs or they could be scrambled or, in case of timing, set to be isochronous. Listeners were asked to continuously report whether they heard a single coherent sequence (integrated) or two concurrent streams (segregated). Although temporal overlap between tones from the two streams proved to be the strongest cue for stream segregation, significant effects of tonality and familiarity with the songs were also observed. These results suggest that the regular temporal patterns are utilized as cues in auditory stream segregation and that long-term memory is involved in this process.	\N	\N
24606289	For assessing hearing aid algorithms, a method is sought to shift the threshold of a speech-in-noise test to (mostly positive) signal-to-noise ratios (SNRs) that allow discrimination across algorithmic settings and are most relevant for hearing-impaired listeners in daily life. Hence, time-compressed speech with higher speech rates was evaluated to parametrically increase the difficulty of the test while preserving most of the relevant acoustical speech cues. A uniform and a non-uniform algorithm were used to compress the sentences of the German Oldenburg Sentence Test at different speech rates. In comparison, the non-uniform algorithm exhibited greater deviations from the targeted time compression, as well as greater changes of the phoneme duration, spectra, and modulation spectra. Speech intelligibility for fast Oldenburg sentences in background noise at different SNRs was determined with 48 normal-hearing listeners. The results confirmed decreasing intelligibility with increasing speech rate. Speech had to be compressed to more than 30% of its original length to reach 50% intelligibility at positive SNRs. Characteristics influencing the discrimination ability of the test for assessing effective SNR changes were investigated. Subjective and objective measures indicated a clear advantage of the uniform algorithm in comparison to the non-uniform algorithm for the application in speech-in-noise tests.	\N	\N
24606291	The present study investigated the importance of overall segment amplitude and intrinsic segment amplitude modulation of consonants and vowels to sentence intelligibility. Sentences were processed according to three conditions that replaced consonant or vowel segments with noise matched to the long-term average speech spectrum. Segments were replaced with (1) low-level noise that distorted the overall sentence envelope, (2) segment-level noise that restored the overall syllabic amplitude modulation of the sentence, and (3) segment-modulated noise that further restored faster temporal envelope modulations during the vowel. Results from the first experiment demonstrated an incremental benefit with increasing resolution of the vowel temporal envelope. However, amplitude modulations of replaced consonant segments had a comparatively minimal effect on overall sentence intelligibility scores. A second experiment selectively noise-masked preserved vowel segments in order to equate overall performance of consonant-replaced sentences to that of the vowel-replaced sentences. Results demonstrated no significant effect of restoring consonant modulations during the interrupting noise when existing vowel cues were degraded. A third experiment demonstrated greater perceived sentence continuity with the preservation or addition of vowel envelope modulations. Overall, results support previous investigations demonstrating the importance of vowel envelope modulations to the intelligibility of interrupted sentences.	\N	\N
24606310	Musicians have been shown to better perceive pitch and timbre cues in speech and music, compared to non-musicians. It is unclear whether this "musician advantage" persists under conditions of spectro-temporal degradation, as experienced by cochlear-implant (CI) users. In this study, gender categorization was measured in normal-hearing musicians and non-musicians listening to acoustic CI simulations. Recordings of Dutch words were synthesized to systematically vary fundamental frequency, vocal-tract length, or both to create voices from the female source talker to a synthesized male talker. Results showed an overall musician effect, mainly due to musicians weighting fundamental frequency more than non-musicians in CI simulations.	\N	\N
24606491	This research employed a forward-masking paradigm to estimate the current spread of monopolar (MP) and bipolar (BP) maskers, with current amplitudes adjusted to elicit the same loudness. Since the spatial separation between active and return electrodes is smaller in BP than in MP configurations, the BP current spread is more localized and presumably superior in terms of speech intelligibility. Because matching the loudness requires higher current in BP than in MP stimulation, previous forward-masking studies show that BP current spread is not consistently narrower across subjects or electrodes within a subject. The present forward-masking measures of current spread differ from those of previous studies by using the same BP probe electrode configuration for both MP and BP masker configurations, and adjusting the current levels of the MP and BP maskers so as to match them in loudness. With this method, the estimate of masker current spread would not be contaminated by differences in probe current spread. Forward masking was studied in four cochlear implant patients, two females and two males, with speech recognition scores higher than 50%; that is, their auditory-nerve survival status was more than adequate to carry out the experiments. The data showed that MP and BP masker configurations produce equivalent masking patterns (and current spreads) in three participants. A fourth participant displayed asymmetrical patterns with enhancement rather than masking in some cases, especially when the probe and masker were at the same location. This study showed equivalent masking patterns for MP and BP maskers when the BP masker current amplitude was increased to match the loudness of the MP masker, and the same BP probe configuration is used with both maskers. This finding could help to explain why cochlear implant users often fail to accrue higher speech intelligibility benefit from BP stimulation.	\N	\N
24610168	Longitudinal analysis of audiometric data of a large population of noise-exposed workers provides insight into the development of noise-induced hearing loss (NIHL) as a function of noise exposure and age, particularly during the first decade of noise exposure. Data of pure-tone audiometry of 17,930 construction workers who underwent periodic occupational hearing screening at least twice during a 4-year period were available for analysis. These concerned all follow-up measurements of the baseline cohort described by Leensen et al. (Int Arch Occup Environ Health 84:577-590, 2011). Linear mixed models explored the relationship between the annual rate of change in hearing and noise exposure level, exposure duration, and age. Data of 3,111 workers who were tested on three occasions were used to investigate the pattern of hearing loss development. The mean annual deterioration in hearing in this study population was 0.54 dB/yr, and this became larger with increasing noise exposure level and increasing age. Remarkably, during the first decade of noise exposure, an improvement in hearing threshold levels (HTLs) was observed. The change in hearing over three measurements showed a concave development of hearing loss as a function of time, which corresponds to NIHL development. Overall, hearing deteriorated over the measurement period. Because HTLs at follow-up were better than those obtained at baseline, no statement can be made about the NIHL development during the first decade of noise exposure. This improvement in HTLs rather resembles the result of measurement variation in occupational screening audiometry than an actual improvement in hearing ability.	\N	\N
24611446	The role in which two tones are first encountered in an unattended oddball sequence affects how deviance detection, reflected by mismatch negativity, treats them later when the roles reverse: a "primacy bias." We tested whether this effect is modulated by previous behavioral relevance assigned to the two tones. To this end, sequences in which the roles of the two tones alternated were preceded by a go/no-go task in which tones were presented with equal probability. Half of the participants were asked to respond to the short sounds, the other half to long sounds. Primacy bias was initially abolished but returned dependent upon the go-stimulus that the participant was assigned. Results demonstrate a long-term impact of prior learning on deviance detection, and that even when prior importance/equivalence is learned, the bias ultimately returns. Results are discussed in terms of persistent go-stimulus specific changes in responsiveness to sound.	\N	\N
24616979	The aim of this study is to evaluate the development of auditory performance and speech intelligibility within the first year after hearing aid fitting in children with moderate or severe hearing loss, investigate the effects of hearing level on auditory performance and speech intelligibility and provide a clinical database for their hearing and speech habilitation. Twenty-nine children participated in this study, ranging in age at hearing aid fitting from 3 to 8 years old with a mean of 5.6 years old. 19 were boys and 10 were girls. According to their hearing level, they were divided into two groups. 14 children were in group of moderate hearing loss (41-60 dB HL). 15 children were in group of severe hearing loss (61-80 dB HL). The categories of auditory performance (CAP) and speech intelligibility rating (SIR) were used to evaluate their auditory performance and speech intelligibility. The evaluation was performed before hearing aid fitting and 1, 3, 6, 9, 12 months after fitting. There was significant difference in mean score of CAP between group of moderate hearing loss and severe hearing loss before hearing aid fitting (P < 0.05). However, no significant differences were observed between these two groups at 1, 3, 6, 9, 12 months after fitting (P > 0.05). There was also significant difference in mean score of SIR between group of moderate hearing loss and severe hearing loss before hearing aid fitting (P < 0.05). However, no significant differences were also observed between these two groups at 1, 3, 6, 9, 12 months after fitting (P > 0.05). The mean scores of CAP for group of moderate hearing loss at 6, 9, 12 months after fitting were significantly superior in comparison with the score before hearing aid fitting (P < 0.05). The mean scores of SIR for group of moderate hearing loss at 6, 9, 12 months after fitting were significantly superior in comparison with the score before hearing aid fitting (P < 0.05); the mean score at 12 months after fitting was also significantly superior in comparison with the score at 1 month after fitting (P < 0.05). The mean scores of CAP for group of severe hearing loss at 3, 6, 9, 12 months after fitting were significantly superior in comparison with the score before hearing aid fitting (P < 0.05); the mean scores at 9, 12 months after fitting were significantly superior in comparison with the score at 1 month after fitting (P < 0.05). The mean scores of SIR for group of severe hearing loss at 3, 6, 9, 12 months after fitting were significantly superior in comparison with the score before hearing aid fitting (P < 0.05); the mean scores at 6, 9, 12 months after fitting were also significantly superior in comparison with the score at 1 month after fitting (P < 0.05). Auditory performance and speech intelligibility in children with moderate or severe hearing loss improved significantly within the first year after hearing aid fitting. The development followed different trajectories.	\N	\N
24621149	Since being approved in 2009, bilateral simultaneous cochlear implantation (CI) has been the standard treatment for children in the UK who meet the criteria for CI. The aim was to report surgical outcomes of bilateral CI in the UK. Between January 2010 and December 2011, 14 UK CI centres collected data prospectively: demographics, aetiology, use of imaging, device type, surgery duration, use of intra-operative electrophysiology, length of stay, and post-operative complications. 1397 CI procedures in 961 CI recipients were included; 436 bilateral simultaneous, 394 bilateral sequential, and 131 unilateral. The majority (85%) were congenitally deaf. The commonest causes of acquired deafness were meningitis and cytomegalovirus infection. The median age for congenitally deaf bilateral simultaneous CI was 2.2 years, mean surgical duration 4.5 hours. 6.3% surgeries were day case procedures. Eight cases (2.0%) of planned bilateral CI had unilateral surgery. The overall major complication rate was 1.6% (0.9% excluding device failures), including explantation due to infection (0.2%), cerebrospinal fluid leak (0.2%), and meningitis (0.1%). There were no permanent facial nerve palsies and no deaths. Sixty-two (6.5%) immediate minor complications included 12 (1.3%) children with significant vestibular impairment. The complication rate was similar following bilateral CI compared to sequential and unilateral CI, and is comparable to other published series. This prospective multi-centre audit provides evidence that bilateral paediatric CI is a safe procedure in the UK, thus endorsing its role as a major therapeutic intervention in childhood deafness.	\N	\N
24626890	Auditory steady-state responses (ASSR) are an important tool to detect objectively frequency-specific hearing thresholds. Pure-tone audiometry is the gold-standard for hearing evaluation, although sometimes it may be inconclusive, especially in children and uncooperative adults. Compare pure tone thresholds (PT) with ASSR thresholds in normal hearing subjects. In this prospective cross-sectional study we included 26 adults (n = 52 ears) of both genders, without any hearing complaints or otologic diseases and normal pure-tone thresholds. All subjects had clinical history, otomicroscopy, audiometry and immittance measurements. This evaluation was followed by the ASSR test. The mean pure-tone and ASSR thresholds for each frequency were calculated. The mean difference between PT and ASSR thresholds was 7.12 for 500 Hz, 7.6 for 1000 Hz, 8.27 for 2000 Hz and 9.71 dB for 4000 Hz. There was no difference between PT and ASSR means at any frequency. ASSR thresholds were comparable to pure-tone thresholds in normal hearing adults. Nevertheless it should not be used as the only method of hearing evaluation.	\N	\N
24627225	The goal of this study was to tease apart the roles of phonological awareness (pA) and phonological short-term memory (pSTM) in sentence comprehension, sentence production, and word reading. Children 6 to 10 years of age (N = 377) completed standardized tests of pA ('Elision') and pSTM ('Nonword Repetition') from the Comprehensive Test of Phonological Processing. Concepts and Following Directions (CFD) and Formulated Sentences (FS) were taken from the Clinical Evaluation of Language Fundamentals-Fourth Edition, as measures of sentence comprehension and production, respectively. Children also completed the Word Identification (Word Id) and Word Attack (Word Att) subtests of the Woodcock Reading Mastery Test-Third Edition. Hierarchical multiple regression analyses controlling for age and nonverbal IQ revealed that Elision was the only significant predictor of CFD and FS. While Elision was the strongest predictor of Word Id and Word Att, Nonword Repetition accounted for additional variance in both reading measures. These results emphasize the usefulness of breaking down phonological processing into multiple components and they also have implications for language- and reading-disordered populations.	\N	\N
24630052	To investigate the clinical usefulness of the LS-chirp auditory brainstem response for estimation of behavioral thresholds in young children with mild to severe hearing losses. 68 infants (136 ears) aged 6-12 months (mean age=9.2 months) with bilateral mild to severe hearing losses were studied at Children's Hospital of Fudan University. In all cases, the children were referred for LS-chirp ABR and visual reinforcement audiometric (VRA) measurements. The low-frequency band chirp (LF-chirp) thresholds (frequency band=0.1-0.85kHz) were compared to the average VRA thresholds (frequency band=0.25-0.5kHz), whereas the high-frequency band chirp (HF-chirp) thresholds (frequency band=1-10kHz) were compared to the average VRA thresholds (frequency band=1-4kHz) using statistical correlation coefficient values. The LS-chirp ABR thresholds are very close to behavioral hearing levels. The mean differences between chirp-ABR and VRA thresholds were within 5dBHL for all measurements. The smallest mean threshold difference (<3dBHL) was obtained for the severe hearing loss group. The correlation coefficient values (r) were 0.97 at low-frequency and high-frequency bands. For each carrier frequency, the best correlations between chirp-ABR thresholds and VRA thresholds were obtained at VRA frequency of 0.25kHz/LF-chirp (r=0.98) and VRA frequency of 1kHz/HF-chirp (r=0.98). This study demonstrates the effectiveness of using chirp-ABR to predict frequency-specific thresholds, especially at low and middle frequencies. LS-chirp ABR thresholds predicted behavioral thresholds better in patients with severe hearing losses than in those with mild hearing losses. The use of chirp-ABR testing ensures higher sensitivity and accuracy than that of auditory steady-state evoked response (ASSR) for measuring frequency-specific thresholds in young children.	\N	\N
24631260	The left anterior temporal lobe (LATL) is robustly implicated in semantic processing by a growing body of literature. However, these results have emerged from two distinct bodies of work, addressing two different processing levels. On the one hand, the LATL has been characterized as a 'semantic hub' that binds features of concepts across a distributed network, based on results from semantic dementia and hemodynamic findings on the categorization of specific compared to basic exemplars. On the other, the LATL has been implicated in combinatorial operations in language, as shown by increased activity in this region associated with the processing of sentences and of basic phrases. The present work aimed to reconcile these two literatures by independently manipulating combination and concept specificity within a minimal MEG paradigm. Participants viewed simple nouns that denoted either low specificity (fish) or high specificity categories (trout) presented in either combinatorial (spotted fish/trout) or non-combinatorial contexts (xhsl fish/trout). By combining these paradigms from the two literatures, we directly compared the engagement of the LATL in semantic memory vs. semantic composition. Our results indicate that although noun specificity subtly modulates the LATL activity elicited by single nouns, it most robustly affects the size of the composition effect when these nouns are adjectivally modified, with low specificity nouns eliciting a much larger effect. We conclude that these findings are compatible with an account in which the specificity and composition effects arise from a shared mechanism of meaning specification.	\N	\N
24632323	This study investigated audiovisual synchrony perception in a rhythmic context, where the sound was not consequent upon the observed movement. Participants judged synchrony between a bouncing point-light figure and an auditory rhythm in two experiments. Two questions were of interest: (1) whether the reference in the visual movement, with which the auditory beat should coincide, relies on a position or a velocity cue; (2) whether the figure form and motion profile affect synchrony perception. Experiment 1 required synchrony judgment with regard to the same (lowest) position of the movement in four visual conditions: two figure forms (human or non-human) combined with two motion profiles (human or ball trajectory). Whereas figure form did not affect synchrony perception, the point of subjective simultaneity differed between the two motions, suggesting that participants adopted the peak velocity in each downward trajectory as their visual reference. Experiment 2 further demonstrated that, when judgment was required with regard to the highest position, the maximal synchrony response was considerably low for ball motion, which lacked a peak velocity in the upward trajectory. The finding of peak velocity as a cue parallels results of visuomotor synchronization tasks employing biological stimuli, suggesting that synchrony judgment with rhythmic motions relies on the perceived visual beat.	\N	\N
24636747	Similar to other zona pellucida mutations in the alpha-tectorin (TECTA) gene, the p.Y1870C alteration in DFNA8/12 causes prelingual, nonsyndromic, autosomal dominant hearing loss. Here we investigated the effect of p.Y1870C on reverse transduction by audiometric studies in the family. Pure tone audiometry, brainstem evoked response audiometry, the Freiburger test for speech understanding and transient evoked and distortion product otoacoustic emissions were assessed in three available affected members bearing p.Y1870C. Pure tone audiometry showed U-shaped curves with moderate to severe degrees of hearing impairment confirmed by brainstem evoked response audiometry. Transient evoked and distortion product otoacoustic emissions were completely absent in all affected family members whereas word recognition scores were up to 95%. Although the missense p.Y1870C TECTA mutation leads to complete failure of the cochlear amplifier in humans, very high speech perception scores can be achieved with appropriate therapy.	\N	\N
24639033	Although individuals with autism are known to have significant communication problems, the cellular mechanisms responsible for impaired communication are poorly understood. Valproic acid (VPA) is an anticonvulsant that is a known risk factor for autism in prenatally exposed children. Prenatal VPA exposure in rats causes numerous neural and behavioral abnormalities that mimic autism. We predicted that VPA exposure may lead to auditory processing impairments which may contribute to the deficits in communication observed in individuals with autism. In this study, we document auditory cortex responses in rats prenatally exposed to VPA. We recorded local field potentials and multiunit responses to speech sounds in primary auditory cortex, anterior auditory field, ventral auditory field, and posterior auditory field in VPA exposed and control rats. Prenatal VPA exposure severely degrades the precise spatiotemporal patterns evoked by speech sounds in secondary, but not primary auditory cortex. This result parallels findings in humans and suggests that secondary auditory fields may be more sensitive to environmental disturbances and may provide insight into possible mechanisms related to auditory deficits in individuals with autism.	\N	\N
24647432	Different brain areas integrate information over different timescales, and this capacity to accumulate information increases from early sensory areas to higher order perceptual and cognitive areas. It is currently unknown whether the timescale capacity of each brain area is fixed or whether it adaptively rescales depending on the rate at which information arrives from the world. Here, using functional MRI, we measured brain responses to an auditory narrative presented at different rates. We asked whether neural responses to slowed (speeded) versions of the narrative could be compressed (stretched) to match neural responses to the original narrative. Temporal rescaling was observed in early auditory regions (which accumulate information over short timescales) as well as linguistic and extra-linguistic brain areas (which can accumulate information over long timescales). The temporal rescaling phenomenon started to break down for stimuli presented at double speed, and intelligibility was also impaired for these stimuli. These data suggest that 1) the rate of neural information processing can be rescaled according to the rate of incoming information, both in early sensory regions as well as in higher order cortexes, and 2) the rescaling of neural dynamics is confined to a range of rates that match the range of behavioral performance.	\N	\N
24657592	Fragile X syndrome (FXS) is a common inherited cause of intellectual disability that results from a CGG repeat expansion in the FMR1 gene. Large repeat expansions trigger both transcriptional and translational suppression of Fragile X protein (FMRP) production. Fragile X-associated Tremor/Ataxia Syndrome (FXTAS) is an allelic neurodegenerative disease caused by smaller "pre-mutation" CGG repeat expansions that enhance FMR1 transcription but lead to translational inefficiency and reduced FMRP expression in animal models. Sensorimotor gating as measured by pre-pulse inhibition (PPI) is altered in both FXS patients and Fmr1 knock out (KO) mice. Similarly, FXTAS patients have demonstrated PPI deficits. Recent work suggests there may be overlapping synaptic defects between Fmr1 KO and CGG knock-in premutation mouse models (CGG KI). We therefore sought to interrogate PPI in CGG KI mice. Using a quiet PPI protocol more akin to human testing conditions, we find that Fmr1 KO animals have significantly impaired PPI. Using this same protocol, we find CGG KI mice demonstrate an age-dependent impairment in PPI compared to wild type (WT) controls. This study describes a novel phenotype in CGG KI mice that can be used in future therapeutic development targeting premutation associated symptoms.	\N	\N
24660803	This study investigates the extent to which age-related language processing difficulties are due to a decline in sensory processes or to a deterioration of cognitive factors, specifically, attentional control. Two facets of attentional control were examined: inhibition of irrelevant information and divided attention. Younger and older adults were asked to categorize the initial phoneme of spoken syllables ("Was it m or n?"), trying to ignore the lexical status of the syllables. The phonemes were manipulated to range in eight steps from m to n. Participants also did a discrimination task on syllable pairs ("Were the initial sounds the same or different?"). Categorization and discrimination were performed under either divided attention (concurrent visual-search task) or focused attention (no visual task). The results showed that even when the younger and older adults were matched on their discrimination scores: (1) the older adults had more difficulty inhibiting lexical knowledge than did younger adults, (2) divided attention weakened lexical inhibition in both younger and older adults, and (3) divided attention impaired sound discrimination more in older than younger listeners. The results confirm the independent and combined contribution of sensory decline and deficit in attentional control to language processing difficulties associated with aging. The relative weight of these variables and their mechanisms of action are discussed in the context of theories of aging and language.	\N	\N
24663012	The present study aimed to vocally assess a group of rock singers who use growl voice and reinforced falsetto. A group of 21 rock singers and a control group of 18 pop singers were included. Singing and speaking voice was assessed through acoustic, perceptual, functional and laryngoscopic analysis. No significant differences were observed between groups in most of the analyses. Acoustic and perceptual analysis of the experimental group demonstrated normality of speaking voice. Endoscopic evaluation showed that most rock singers presented during singing voice a high vertical laryngeal position, pharyngeal compression and laryngeal supraglottic compression. Supraglottic activity during speaking voice tasks was also observed. However, overall vocal fold integrity was demonstrated in most of the participants. Slightly abnormal observations were demonstrated in few of them. Singing voice handicap index revealed that the most affected variable was the physical sphere, followed by the social and emotional spheres. Although growl voice and reinforced falsetto represent laryngeal and pharyngeal hyperfunctional activity, they did not seem to contribute to the presence of any major vocal fold disorder in our subjects. Nevertheless, we cannot rule out the possibility that more evident vocal fold disorders could be found in singers who use these techniques more often and during a longer period of time.	\N	\N
24672005	The inner ear receives two types of efferent feedback from the brainstem: one pathway provides gain control on outer hair cells' contribution to cochlear amplification, and the other modulates the excitability of the cochlear nerve. Although efferent feedback can protect hair cells from acoustic injury and thereby minimize noise-induced permanent threshold shifts, most prior studies focused on high-intensity exposures (>100 dB SPL). Here, we show that efferents are essential for long-term maintenance of cochlear function in mice aged 1 year post-de-efferentation without purposeful acoustic overexposure. Cochlear de-efferentation was achieved by surgical lesion of efferent pathways in the brainstem and was assessed by quantitative analysis of immunostained efferent terminals in outer and inner hair cell areas. The resultant loss of efferent feedback accelerated the age-related amplitude reduction in cochlear neural responses, as seen in auditory brainstem responses, and increased the loss of synapses between hair cells and the terminals of cochlear nerve fibers, as seen in confocal analysis of the organ of Corti immunostained for presynaptic and postsynaptic markers. This type of neuropathy, also seen after moderate noise exposure, has been termed "hidden hearing loss", because it does not affect thresholds, but can be seen in the suprathreshold amplitudes of cochlear neural responses, and likely causes problems with hearing in a noisy environment, a classic symptom of age-related hearing loss in humans. Since efferent reflex strength varies among individuals and can be measured noninvasively, a weak reflex may be an important risk factor, and prognostic indicator, for age-related hearing impairment.	\N	\N
24681401	Voice control is critical to communication. To date, studies have used behavioral, electrophysiological and functional data to investigate the neural correlates of voice control using perturbation tasks, but have yet to examine the interactions of these neural regions. The goal of this study was to use structural equation modeling of functional neuroimaging data to examine network properties of voice with and without perturbation. Results showed that the presence of a pitch shift, which was processed as an error in vocalization, altered connections between right STG and left STG. Other regions that revealed differences in connectivity during error detection and correction included bilateral inferior frontal gyrus, and the primary and premotor cortices. Results indicated that STG plays a critical role in voice control, specifically, during error detection and correction. Additionally, pitch perturbation elicits changes in the voice network that suggest the right hemisphere is critical to pitch modulation.	\N	\N
24681402	Converging evidence suggests that understanding our first language (L1) results in reactivation of experiential sensorimotor traces in the brain. Surprisingly, little is known regarding the involvement of these processes during second-language (L2) processing. Participants saw L1 or L2 words referring to entities with a typical location (e.g., star, mole) (Experiments 1 and 2) or to an emotion (e.g., happy, sad) (Experiment 3). Participants responded to the words' ink color with an upward or downward arm movement. Despite word meaning being fully task-irrelevant, L2 automatically activated motor responses similar to L1 even when L2 was acquired rather late in life (age >11). Specifically, words such as star facilitated upward, and words such as root facilitated downward responses. Additionally, words referring to positive emotions facilitated upward, and words referring to negative emotions facilitated downward responses. In summary, our study suggests that reactivation of experiential traces is not limited to L1 processing.	\N	\N
24684405	We sought to determine whether the results of audiological tests and tinnitus characteristics, particularly tinnitus pitch and minimum masking level (MML), depend on tinnitus etiology, and what other etiology-specific tinnitus characteristics there are. The patients answered questions concerning tinnitus laterality, duration, character, aggravation, alleviation, previous treatment, and circumstances of onset. The results of tympanometry, pure-tone audiometry, distortion-product otoacoustic emissions, tinnitus likeness spectrum, MML, and uncomfortable loudness level were evaluated. Patients with several tinnitus etiological factors were excluded. The remaining participants were divided into groups according to medical history: acute acoustic trauma: 67 ears; chronic acoustic trauma: 82; prolonged use of oral estrogen and progesterone contraceptives: 46; Ménière's disease: 25; congenital hearing loss: 19; sensorineural sudden deafness: 40; dull head trauma: 19; viral labyrinthitis: 53; stroke: 6; presbycusis: 152. Data of 509 ears were analysed. Tinnitus pitch was highest in patients with acute acoustic trauma and lowest in patients receiving estrogen and progesterone. MML was lowest after acute acoustic trauma and in congenital hearing loss, and highest after a stroke and in the case of presbytinnitus. Tinnitus pitch and MML are etiology dependent.	\N	\N
24686520	PURPOSE The purpose of this study was to investigate how linguistic knowledge interacts with indexical knowledge in older children's perception under demanding listening conditions created by extensive talker variability. METHOD Twenty-five 9- to 12-year-old children, 12 from North Carolina (NC) and 13 from Wisconsin (WI), identified 12 vowels in isolated /hVd/ words produced by 120 talkers representing the 2 dialects (NC and WI), both genders, and 3 age groups (generations) of residents from the same geographic locations as the listeners. RESULTS Identification rates were higher for responses to talkers from the same dialect as the listeners and for female speech. Listeners were sensitive to systematic positional variations in vowels and their dynamic structure (formant movement) associated with generational differences in vowel pronunciation resulting from sound change in a speech community. Overall identification rate was 71.7%, which is 8.5% lower than for the adults responding to the same stimuli in Jacewicz and Fox (2012). CONCLUSION Typically developing older children were successful in dealing with both phonetic and indexical variation related to talker dialect, gender, and generation. They were less consistent than the adults, most likely because of less efficient encoding of acoustic-phonetic information in the speech of multiple talkers and relative inexperience with indexical variation.	\N	\N
24686901	PURPOSE The aim of this study was to evaluate the understanding of English sentences produced by native (English) and nonnative (Spanish) talkers by listeners with normal hearing (NH) and listeners with cochlear implants (CIs). METHOD Sentence recognition in noise was measured in adult subjects with CIs and subjects with NH, all of whom were native talkers of American English. Test sentences were from the Hearing in Noise Test (HINT) database and were produced in English by four native and eight nonnative talkers. Subjects also rated the intelligibility and accent for each talker. RESULTS The speech recognition thresholds in noise of subjects with CIs and subjects with NH were 4.23 dB and 1.32 dB poorer with nonnative talkers than with native talkers, respectively. Performance was significantly correlated with talker intelligibility and accent ratings for subjects with CIs but only correlated with talker intelligibility ratings for subjects with NH. For all subjects, performance with individual nonnative talkers was significantly correlated with talkers' number of years of residence in the United States. CONCLUSION CI users exhibited a larger deficit in speech understanding with nonnative talkers than did subjects with NH, relative to native talkers. Nonnative talkers' experience with native culture contributed strongly to speech understanding in noise, intelligibility ratings, and accent ratings of both listeners with NH and listeners with CIs.	\N	\N
24686915	The purpose of this study was to develop a task to evaluate children's English and Spanish speech perception abilities in either noise or competing speech maskers. Eight bilingual Spanish-English and 8 age-matched monolingual English children (ages 4.9-16.4 years) were tested. A forced-choice, picture-pointing paradigm was selected for adaptively estimating masked speech reception thresholds. Speech stimuli were spoken by simultaneous bilingual Spanish-English talkers. The target stimuli were 30 disyllabic English and Spanish words, familiar to 5-year-olds and easily illustrated. Competing stimuli included either 2-talker English or 2-talker Spanish speech (corresponding to target language) and spectrally matched noise. For both groups of children, regardless of test language, performance was significantly worse for the 2-talker than for the noise masker condition. No difference in performance was found between bilingual and monolingual children. Bilingual children performed significantly better in English than in Spanish in competing speech. For all listening conditions, performance improved with increasing age. Results indicated that the stimuli and task were appropriate for speech recognition testing in both languages, providing a more conventional measure of speech-in-noise perception as well as a measure of complex listening. Further research is needed to determine performance for Spanish-dominant listeners and to evaluate the feasibility of implementation into routine clinical use.	\N	\N
24687018	Although a number of questionnaires are available to assess hearing aid benefit and general hearing disability, relatively few investigate spatial hearing ability in more complex listening situations. The aim of this study was to document the performance of individuals with normal hearing using the Spatial Hearing Questionnaire (SHQ; Tyler, Perreau, & Ji, 2009) and to compare performance with published data from cochlear implant (CI) users. Fifty-one participants with normal hearing participated. All participants completed the 24-item SHQ. Also, a factor analysis and reliability tests were performed. Performance on the SHQ was high (87%) for the participants with normal hearing. Subjective ratings varied across different listening situations: Understanding speech in quiet (98%) was rated higher than sound localization (84%) and understanding speech in a background of noise (85%). Compared with previously published data (Tyler, Perreau, & Ji, 2009), listeners with normal hearing rated their spatial hearing ability significantly better than bilateral and unilateral CI users. Results confirmed that the SHQ is a reliable measure of spatial hearing ability for listeners with normal hearing. Overall, results indicated that the SHQ is able to capture expected differences between individuals with normal hearing and CI users. These new data can be used as targets following the provision of hearing devices.	\N	\N
24687041	To describe the inheritance patterns and auditory phenotype features of 3 Canadian families with mutations in 2 X-linked "deafness" genes (DFNX). Audiological, medical, and family histories were collected and family members interviewed to compare hearing thresholds and case histories between cases with mutations in SMPX versus POU3F4. The family pedigrees reveal characteristic X-linked inheritance patterns. Phenotypic features associated with the SMPX (DFNX4) mutation include early onset in males with rapid progression from mild and flat to sloping sensorineural loss, with highly variable onset and hearing loss severity in females. In contrast, phenotypic features associated with the POU3F4 (DFNX2) mutation are characterized by an early onset, mixed hearing loss with fluctuation in males, and a normal hearing phenotype reported for females. The study shows how this unique inheritance pattern and both gender and mutation-specific phenotype variations can alert audiologists to the presence of X-linked genetic etiologies in their clinical practice. By incorporating this knowledge into clinical decision making, audiologists can facilitate the early identification of X-linked hearing loss and contribute to the effective team management of affected families.	\N	\N
24704377	Perceptual synchrony and multisensory integration both vary as a function of stimulus onset asynchrony, but evidence from behavioral, patient, and lesion studies supports some dissociation between these two processes. Although it has been found that both perceptual synchrony and multisensory integration are recalibrated after exposure to asynchronous multisensory stimuli, no studies have directly compared these two recalibration patterns. We addressed this by using McGurk speech and requiring participants to perform simultaneity judgments and a syllable identification task in separate sessions. The results revealed that after exposure to asynchrony, both perceptual synchrony and McGurk fusion shifted toward the temporal lag. The recalibration aftereffects (i.e., the magnitude of shifts) of these two processes did not differ significantly and were not significantly correlated. In addition, McGurk fusion increased strongly in the direction of the temporal lag, which could not be fully explained by fusion shifts. Thus, the present research implies that the recalibration patterns of explicit and implicit timing, represented by perceptual synchrony and multisensory integration, show both similarities and differences.	\N	\N
24709357	Although there is an extensive literature on the neural correlates of consciousness (NCC), the subject is far from settled. In this paper we present a novel experimental paradigm, based on binocular rivalry, to study internally and externally generated conscious experiences. We called this procedure bimodal rivalry. In addition, and assuming the non-linear nature of the EEG signals, we propose the use of fractal dimension to characterize the complexity of the EEG signal associated with each percept. Analysis of the data showed a significant difference in complexity between the internally generated and externally generated percepts. Moreover, EEG complexity was dissimilar for externally generated auditory and visual percepts. These results support fractal dimension analyses as a new tool to characterize conscious perception.	\N	\N
24715101	The current study investigated the mechanism underlying subliminal inhibition using the negative compatibility effect (NCE) paradigm. We hypothesized that a decrease in prime activation affects the subsequent inhibitory process, delaying onset of inhibition and reducing its strength. Two experiments tested this hypothesis using arrow stimuli as primes and targets. Two different irrelevant masks (i.e., a mask sharing no prime features) were presented in succession in each trial to not only ensure that primes were processed subliminally, but also avoid feature updating between primes and masks. Prime/target compatibility and prime background density were manipulated in Experiment 1. Results showed that under the subliminal inhibitory condition, the NCE disappears when the density increases (i.e., pixel density in the prime's background of 25 %) in Experiment 1. However, when we fixed the prime's background at the density of 25 % and manipulated prime/target compatibility as well as the inter-stimulus interval (ISI) between mask and target in Experiment 2, behavioral results showed marginally significant NCEs in the 150-ms ISI condition. Electrophysiological evidence showed the lateralized readiness potential for compatible trials was significantly more positive than that for incompatible trials during the two consecutive time windows (i.e., 400-450 and 450-500 ms) in the 150-ms ISI condition. In addition, the NCE size was significantly smaller in Experiment 2 than in Experiment 1. All of the results support predictions of the continuous subliminal inhibitory mechanism hypothesis, which posits that decreases in prime activation strength lead to delay in inhibitory onset and decline in inhibitory strength.	\N	\N
24727491	The binaural cues used by terrestrial animals for sound localization in azimuth may not always suffice for accurate sound localization underwater. The purpose of this research was to examine the theoretical limits of interaural timing and level differences available underwater using computational and physical models. A paired-hydrophone system was used to record sounds transmitted underwater and recordings were analyzed using neural networks calibrated to reflect the auditory capabilities of terrestrial mammals. Estimates of source direction based on temporal differences were most accurate for frequencies between 0.5 and 1.75 kHz, with greater resolution toward the midline (2°), and lower resolution toward the periphery (9°). Level cues also changed systematically with source azimuth, even at lower frequencies than expected from theoretical calculations, suggesting that binaural mechanical coupling (e.g., through bone conduction) might, in principle, facilitate underwater sound localization. Overall, the relatively limited ability of the model to estimate source position using temporal and level difference cues underwater suggests that animals such as whales may use additional cues to accurately localize conspecifics and predators at long distances.	\N	\N
24735233	The dichotic listening task is typically administered by presenting a consonant-vowel (CV) syllable to each ear and asking the participant to report the syllable heard most clearly. The results tend to show more reports of the right ear syllable than of the left ear syllable, an effect called the right ear advantage (REA). The REA is assumed to be due to the crossing over of auditory fibres and the processing of language stimuli being lateralised to left temporal areas. However, the tendency for most dichotic listening experiments to use only CV syllable stimuli limits the extent to which the conclusions can be generalised to also apply to other speech phonemes. The current study re-examines the REA in dichotic listening by using both CV and vowel-consonant (VC) syllables and combinations thereof. Results showed a replication of the REA response pattern for both CV and VC syllables, thus indicating that the general assumption of left-side localisation of processing can be applied for both types of stimuli. Further, on trials where a CV is presented in one ear and a VC is presented in the other ear, the CV is selected more often than the VC, indicating that these phonemes have an acoustic or processing advantage.	\N	\N
24735850	To form a coherent percept of the environment, the brain needs to bind sensory signals emanating from a common source, but to segregate those from different sources [1]. Temporal correlations and synchrony act as prominent cues for multisensory integration [2-4], but the neural mechanisms by which such cues are identified remain unclear. Predictive coding suggests that the brain iteratively optimizes an internal model of its environment by minimizing the errors between its predictions and the sensory inputs [5,6]. This model enables the brain to predict the temporal evolution of natural audiovisual inputs and their statistical (for example, temporal) relationship. A prediction of this theory is that asynchronous audiovisual signals violating the model's predictions induce an error signal that depends on the directionality of the audiovisual asynchrony. As the visual system generates the dominant temporal predictions for visual leading asynchrony, the delayed auditory inputs are expected to generate a prediction error signal in the auditory system (and vice versa for auditory leading asynchrony). Using functional magnetic resonance imaging (fMRI), we measured participants' brain responses to synchronous, visual leading and auditory leading movies of speech, sinewave speech or music. In line with predictive coding, auditory leading asynchrony elicited a prediction error in visual cortices and visual leading asynchrony in auditory cortices. Our results reveal predictive coding as a generic mechanism to temporally bind signals from multiple senses into a coherent percept.	\N	\N
24736111	The aim of this study was to determine the relationship between serum albumin, affective prosody, and symptoms of attention-deficit hyperactivity disorder (ADHD) found coincidentally in a recently published study. Here, serum albumin levels were assessed as a covariate. Twenty healthy male adults (controls) and 20 adult male patients with ADHD participated in the study on two study days. Serum albumin levels and performance in an affective prosody task were assessed, and correlations were determined. Serum albumin had a significant correlation with performance on an affective prosody task on both of the 2 study days. The same correlations were not significant in the healthy control group. There was no difference in the serum albumin level between patients with ADHD and healthy controls. The association between serum albumin and affective prosody in adults with ADHD is a novel finding. However, to date, there is no clear theory that explains this association. Future research should analyze whether serum albumin causes changes in performance in affective prosody using experimental designs.	\N	\N
24736186	We investigated whether unattended visual, auditory and tactile stimuli compete for capacity-limited early sensory processing across senses. In three experiments, we probed competitive audio-visual, visuo-tactile and audio-tactile stimulus interactions. To this end, continuous visual, auditory and tactile stimulus streams ('reference' stimuli) were frequency-tagged to elicit steady-state responses (SSRs). These electrophysiological oscillatory brain responses indexed ongoing stimulus processing in corresponding senses. To induce competition, we introduced transient frequency-tagged stimuli in same and/or different senses ('competitors') during reference presentation. Participants performed a separate visual discrimination task at central fixation to control for attentional biases of sensory processing. A comparison of reference-driven SSR amplitudes between competitor-present and competitor-absent periods revealed reduced amplitudes when a competitor was presented in the same sensory modality as the reference. Reduced amplitudes indicated the competitor's suppressive influence on reference stimulus processing. Crucially, no such suppression was found when a competitor was presented in a modality different from that of the reference. These results strongly suggest that early sensory competition is exclusively modality-specific and does not extend across senses. We discuss consequences of these findings for modeling the neural mechanisms underlying intermodal attention.	\N	\N
24738537	Neuroscientific and musicological approaches to music cognition indicate that listeners familiarized in the Western tonal tradition expect a musical phrase boundary at predictable time intervals. However, phrase boundary prediction processes in music remain untested. We analyzed event-related potentials (ERPs) and event-related induced power changes at the onset and offset of a boundary pause. We made comparisons with modified melodies, where the pause was omitted and filled by tones. The offset of the pause elicited a closure positive shift (CPS), indexing phrase boundary detection. The onset of the filling tones elicited significant increases in theta and beta powers. In addition, the P2 component was larger when the filling tones started than when they ended. The responses to boundary omission suggest that listeners expected to hear a boundary pause. Therefore, boundary prediction seems to coexist with boundary detection in music segmentation.	\N	\N
24744448	Interest in the perception of the material of objects has been growing. While material perception is a critical ability for animals to properly regulate behavioral interactions with surrounding objects (e.g., eating), little is known about its underlying processing. Vision and audition provide useful information for material perception; using only its visual appearance or impact sound, we can infer what an object is made from. However, what material is perceived when the visual appearance of one material is combined with the impact sound of another, and what are the rules that govern cross-modal integration of material information? We addressed these questions by asking 16 human participants to rate how likely it was that audiovisual stimuli (48 combinations of visual appearances of six materials and impact sounds of eight materials) along with visual-only stimuli and auditory-only stimuli fell into each of 13 material categories. The results indicated strong interactions between audiovisual material perceptions; for example, the appearance of glass paired with a pepper sound is perceived as transparent plastic. Rating material-category likelihoods follow a multiplicative integration rule in that the categories judged to be likely are consistent with both visual and auditory stimuli. On the other hand, rating-material properties, such as roughness and hardness, follow a weighted average rule. Despite a difference in their integration calculations, both rules can be interpreted as optimal Bayesian integration of independent audiovisual estimations for the two types of material judgment, respectively.	\N	\N
24750038	Alarms are ubiquitous in anaesthetic practice, but their net effect on anaesthesiologists' performance and patient safety is debated. In this study, 27 anaesthesiologists performed two simulation sessions in random order; one session was programmed to include an alarm condition, with a standard, frequent, clearly audible alarm sound. During these sessions, adverse events were simulated and anaesthesiologists' response times to these events were recorded. Perceived workload was assessed with the NASA Task Load Index. Response times to adverse events and perceived workload were similar in both groups. Pooled response times to atrial fibrillation and desaturation were fast, with a median (IQR [range]) of 8 (4-14 [1-41]) s and 9 (6-16 [1-44]) s, respectively. Pooled response times to an ST segment elevation on the ECG and an obstructed intravenous line were significantly slower, with median (IQR [range]) times of 34 (21-76 [4-300]) s and 227 (95-399 [2-600]) s, respectively (p < 0.001). This study shows that in a simulated anaesthesia environment, response times to adverse events are similar in the absence or presence of an audible alarm, and that response times to various critical events differ.	\N	\N
24751750	To prospectively evaluate hearing outcomes in patients with osteogenesis imperfecta undergoing primary stapes surgery and to isolate prognostic factors for success. A nonrandomized, open, prospective case series. A tertiary referral center. Twenty-five consecutive patients who underwent 32 primary stapedotomies for osteogenesis imperfecta with evidence of stapes fixation and available postoperative pure-tone audiometry. Primary stapedotomy with vein graft interposition and reconstruction with a regular Teflon piston or bucket handle-type piston. Preoperative and postoperative audiometric evaluation using conventional 4-frequency (0.5, 1, 2, and 4 kHz) audiometry. Air-conduction thresholds, bone-conduction thresholds, and air-bone gap were measured. The overall audiometric results as well as the results of audiometric evaluation at 3 months and at least 1 year after surgery were used. Overall, postoperative air-bone gap closure to within 10 dB was achieved in 88% of cases. Mean (standard deviation) gain in air-conduction threshold was 22 (9.4) dB for the entire case series, and mean (standard deviation) air-bone gap closure was 22 (9.0) dB. Backward multivariate logistic regression showed that a model with preoperative air-bone gap closure and intraoperatively established incus length accurately predicts success after primary stapes surgery. Stapes surgery is a feasible and safe treatment option in patients with osteogenesis imperfecta. Success is associated with preoperative air-bone gap and intraoperatively established incus length.	\N	\N
24754219	Diachronic velar palatalization is taken as the case study for modeling the emergence of a new phoneme category. The spread of a palatalized variant through the lexicon is treated as a stochastic classification task for the listener/learner. The model combines two measures of similarity to determine classification within an exemplar-theoretic framework: acoustic distance and phonotactic expectation. There are three model outcomes: contrast, allophony, or contextual neutralization between the plain and palatalized velars. It is shown, through a series of simulations, that these can be predicted from the distribution of sounds within the pre-change lexicons, namely, the ratio of the /k-vowel/ sequences containing naturally palatalizing vowels (i, I, e), to those containing non-palatalizers. "Unnatural" phonotactic associations can arise in individual lexicons, but are sharply limited due to the large size of the lexicon and the local nature of the phoneme changes. "Anti-natural" distributions, which categorically violate the proposed implicational relationship between palatalization and frontness/height, are absent. This work provides an explicit and restrictive model of phoneme change. The results also serve as an existence proof for an outcome-blind mechanism of avoiding over-generation.	\N	\N
24755208	A wide range of literature is available on the features of ataxic dysarthria, investigating segmental and prosodic characteristics by acoustic and perceptual means. However, very few studies have been published that look closely at the relationship between the observed phonetic disturbances and their perceptual sequelae, particularly in the area of prosody. The aim of the current study was therefore to examine the stress production of eight individuals with ataxic dysarthria and matched healthy controls, and to relate the results of phonological and perceptual evaluations to phonetic performances to better understand the relationship between these three components for speech outcomes. Speakers performed a sentence stress task which was analysed phonologically in terms of inventory, distribution, implementation and function of pitch accentuation. These data were then evaluated in relation to previously published phonetic and perceptual results on the same speaker group by the authors. Results indicated that the speakers with ataxia used a wide range of pitch patterns, but pitch-accented a higher number of words, and produced shorter phrases. The increased number of pitch accents per phrase was furthermore reflected in a reduced percentage of de-accented words in post-focal position. Perceptual results established this pattern as the main cause for listener errors in identifying the intended stressed item in an utterance. In addition, the performances of two speakers are discussed in greater detail. Although they were unable to de-accent, they nevertheless marked stress appropriately through phonetic compensatory strategies. After reading this article the reader will be able to (1) explain the relevance of phonology and phonetics in the perception of stress production in ataxic dysarthria; (2) describe the different levels of intonational analysis; and (3) understand the observed intonation patterns in ataxic dysarthria as well as the compensatory mechanisms speakers may adopt to produce stress.	\N	\N
24763046	Obesity-related disorders are closely associated with the development of age-related hearing impairment (ARHI). Adiponectin (APN) exerts protective effects against obesity-related conditions including endothelial dysfunction and atherosclerosis. Here, we investigated the impact of APN on ARHI. APN-knockout (APN-KO) mice developed exacerbation of hearing impairment, particularly in the high frequency range, compared with wild-type (WT) mice. Supplementation with APN prevented the hearing impairment in APN-KO mice. At 2 months of age, the cochlear blood flow and capillary density of the stria vascularis (SV) were significantly reduced in APN-KO mice as compared with WT mice. APN-KO mice also showed a significant increase in terminal deoxynucleotidyl transferase dUTP nick end labeling (TUNEL)-positive apoptotic cells in the organ of Corti in the cochlea at 2 months of age. At the age of 6 months, hair cells were lost at the organ of Corti in APN-KO mice. In cultured auditory HEI-OC1 cells, APN reduced apoptotic activity under hypoxic conditions. Clinically, plasma APN levels were significantly lower in humans with ARHI. Multiple logistic regression analysis identified APN as a significant and independent predictor of ARHI. Our observations indicate that APN has an important role in preventing ARHI.	\N	\N
24764261	We investigated the relative effects of simple and complex auditory-visual discrimination training using an adapted alternating treatments design to establish derived stimulus relations in 2 children who had been diagnosed with autism and 1 typically developing peer. Emergence of untrained conditional relations was observed after training in both conditions, with a possible advantage of simple-sample training for 1 participant. Results of generalization and follow-up probes were mixed.	\N	\N
24769166	Although alterations of the limbic system have been linked to tinnitus persistence, the neural networks underlying such alteration are unclear. The present study investigated the effect of tinnitus on emotional processing in middle-aged adults using functional magnetic resonance imaging and stimuli from the International Affective Digital Sounds database. There were three groups of participants: bilateral hearing loss with tinnitus (TIN), age- and gender-matched controls with bilateral hearing loss without tinnitus (HL) and matched normal hearing controls without tinnitus (NH). In the scanner, subjects rated sounds as pleasant, unpleasant, or neutral. The TIN and NH groups, but not the HL group, responded faster to affective sounds compared to neutral sounds. The TIN group had elevated response in bilateral parahippocampus and right insula compared to the NH group, and left parahippocampus compared to HL controls for pleasant relative to neutral sounds. A region-of-interest analysis detected increased activation for NH controls in the right amygdala when responding to affective stimuli, but failed to find a similar heightened response in the TIN and HL groups. All three groups showed increased response in auditory cortices for the affective relative to neutral sounds comparisons. Our results suggest that the emotional processing network is altered in tinnitus to rely on the parahippocampus and insula, rather than the amygdala, and this alteration may maintain a select advantage for the rapid processing of affective stimuli despite the hearing loss. The complex interaction of tinnitus and the limbic system should be accounted for in development of new tinnitus management strategies.	\N	\N
24769280	Lexical access during speech comprehension comprises numerous computations, including activation, competition, and selection. The spatio-temporal profile of these processes involves neural activity in peri-auditory cortices at least as early as 200 ms after stimulation. Their oscillatory dynamics are less well understood, although reports link alpha band de-synchronization with lexical processing. We used magnetoencephalography (MEG) to examine whether these alpha-related oscillations reflect the speed of lexical access, as would be predicted if they index lexical activation. In an auditory semantic priming protocol, monosyllabic nouns were presented while participants performed a lexical decision task. Spatially-localizing beamforming was used to examine spectro-temporal effects in left and right auditory cortex time-locked to target word onset. Alpha and beta de-synchronization (10-20 Hz ERD) was attenuated for words following a related prime compared to an unrelated prime beginning about 270 ms after stimulus onset. This timing is consistent with how information about word identity unfolds incrementally in speech, quantified in information-theoretic terms. These findings suggest that alpha de-synchronization during auditory word processing is associated with early stages of lexical access.	\N	\N
24769430	In this study, meaningful social stimuli were used as probes in a task requiring judgments of semantic appropriateness to test the contextual integration ability of people with Williams syndrome (WS), as opposed to the meaningless syllables used in audiovisual studies (the McGurk effect). Participants were presented with background auditory primes followed by targets that were either congruent or incongruent with the prime. Two modes of target were presented: a visual target (AV task) or an auditory target (AA task). Participants were asked to respond yes to contextually appropriate pairs and no to those that were contextually inappropriate. The congruency effect was measured as an index of successful central coherence. Similar to normally developing controls, people with WS showed shorter response latencies and greater accuracy in recognizing congruent pairs compared with incongruent pairs. Their performance did not differ from that of controls matched by mental age, but was inferior to that of controls matched by chronological age. The results revealed generalized contextual integration for auditory primes in both tasks, consistent with previous studies using visual presentation of social-related stimuli in people with WS (Hsu, 2013a, 2013c). Further demonstration of the presence of a modality effect on contextual coherence implies that cross-modal learning may be advantageous compared with unimodal learning.	\N	\N
24783989	To assist the human operator, modern auditory interfaces increasingly rely on sound spatialisation to display auditory information and warning signals. However, we often operate in environments that apply vibrations to the whole body, e.g. when driving a vehicle. Here, we report three experiments investigating the effect of sinusoidal vibrations along the vertical axis on spatial hearing. The first was a free-field, narrow-band noise localisation experiment with 5-Hz vibration at 0.88 m/s². The other experiments used headphone-based sound lateralisation tasks. Experiment 2 investigated the effect of vibration frequency (4 vs. 8 Hz) at two different magnitudes (0.83 vs. 1.65 m/s²) on a left-right discrimination one-interval forced-choice task. Experiment 3 assessed the effect on a two-interval forced-choice location discrimination task with respect to the central and two peripheral reference locations. In spite of the broad range of methods, none of the experiments show a reliable effect of whole-body vibrations on localisation performance. We report three experiments that used both free-field localisation and headphone lateralisation tasks to assess their sensitivity to whole-body vibrations at low frequencies. None of the experiments show a reliable effect of either frequency or magnitude of whole-body vibrations on localisation performance.	\N	\N
24809252	The aim of this work was to investigate perceived loudness change in response to melodies that increase (up-ramp) or decrease (down-ramp) in acoustic intensity, and the interaction with other musical factors such as melodic contour, tempo, and tonality (tonal/atonal). A within-subjects design manipulated direction of linear intensity change (up-ramp, down-ramp), melodic contour (ascending, descending), tempo, and tonality, using single ramp trials and paired ramp trials, where single up-ramps and down-ramps were assembled to create continuous up-ramp/down-ramp or down-ramp/up-ramp pairs. Twenty-nine (Exp 1) and thirty-six (Exp 2) participants rated loudness continuously in response to trials with monophonic 13-note piano melodies lasting either 6.4 s or 12 s. Linear correlation coefficients >.89 between loudness and time show that time-series loudness responses to dynamic up-ramp and down-ramp melodies are essentially linear across all melodies. Therefore, 'indirect' loudness change derived from the difference in loudness at the beginning and end points of the continuous response was calculated. Down-ramps were perceived to change significantly more in loudness than up-ramps in both tonalities and at a relatively slow tempo. Loudness change was also greater for down-ramps presented with a congruent descending melodic contour, relative to an incongruent pairing (down-ramp and ascending melodic contour). No differential effect of intensity ramp/melodic contour congruency was observed for up-ramps. In paired ramp trials assessing the possible impact of ramp context, loudness change in response to up-ramps was significantly greater when preceded by down-ramps, than when not preceded by another ramp. Ramp context did not affect down-ramp perception. The contributions to the fields of music perception and psychoacoustics are discussed in the context of real-time perception of music, principles of music composition, and performance of musical dynamics.	\N	\N
24809744	Since Köhler's experiments in the 1920s, researchers have demonstrated a correspondence between words and shapes. Dubbed the "Bouba-Kiki" effect, these auditory-visual associations extend across cultures and are thought to be universal. More recently the effect has been shown in other modalities including taste, suggesting the effect is independent of vision. The study presented here tested the "Bouba-Kiki" effect in the auditory-haptic modalities, using 2D cut-outs and 3D models based on Köhler's original drawings. Presented with shapes they could feel but not see, sighted participants showed a robust "Bouba-Kiki" effect. However, in a sample of people with a range of visual impairments, from congenital total blindness to partial sight, the effect was significantly less pronounced. The findings suggest that, in the absence of a direct visual stimulus, visual imagery plays a role in crossmodal integration.	\N	\N
24811450	Mice are emerging as an important behavioral model for studies of auditory perception and acoustic communication. These mammals frequently produce ultrasonic vocalizations, although the details of how these vocalizations are used for communication are not entirely understood. An important step in determining how they might be differentiating their calls is to measure discrimination and identification of the dimensions of various acoustic stimuli. Here, behavioral operant conditioning methods were employed to assess frequency difference limens for pure tones. We found that their thresholds were similar to those in other rodents but higher than in humans. We also asked mice, in an identification paradigm, whether they would use frequency or duration differences to classify stimuli varying on those two dimensions. We found that the mice classified the stimuli based on frequency rather than duration.	\N	\N
24815249	Personal audio refers to the creation of a listening zone within which a person, or a group of people, hears a given sound program, without being annoyed by other sound programs being reproduced in the same space. Generally, these different sound zones are created by arrays of loudspeakers. Although these devices have the capacity to achieve different sound zones in an anechoic environment, they are ultimately used in normal rooms, which are reverberant environments. At high frequencies, reflections from the room surfaces create a diffuse pressure component which is uniform throughout the room volume and thus decreases the directional characteristics of the device. This paper shows how the reverberant performance of an array can be modeled, knowing the anechoic performance of the radiator and the acoustic characteristics of the room. A formulation is presented whose results are compared to practical measurements in reverberant environments. Due to reflections from the room surfaces, pressure variations are introduced in the transfer responses of the array. This aspect is assessed by means of simulations where random noise is added to create uncertainties, and by performing measurements in a real environment. These results show how the robustness of an array is increased when it is designed for use in a reverberant environment.	\N	\N
24815280	Recent studies on binary masking techniques make the assumption that each time-frequency (T-F) unit contributes an equal amount to the overall intelligibility of speech. The present study demonstrated that the importance of each T-F unit to speech intelligibility varies in accordance with speech content. Specifically, T-F units are categorized into two classes, speech-present T-F units and speech-absent T-F units. Results indicate that the importance of each speech-present T-F unit to speech intelligibility is highly related to the loudness of its target component, while the importance of each speech-absent T-F unit varies according to the loudness of its masker component. Two types of mask errors are also considered, which include miss and false alarm errors. Consistent with previous work, false alarm errors are shown to be more harmful to speech intelligibility than miss errors when the mixture signal-to-noise ratio (SNR) is below 0 dB. However, the relative importance between the two types of error is conditioned on the SNR level of the input speech signal. Based on these observations, a mask-based objective measure, the loudness weighted hit-false, is proposed for predicting speech intelligibility. The proposed objective measure shows significantly higher correlation with intelligibility compared to two existing mask-based objective measures.	\N	\N
24815292	Behind-the-ear (BTE) processors of cochlear implant (CI) devices offer little to almost no protection from wind noise in most incidence angles. To assess speech intelligibility, eight CI recipients were tested in 3 and 9 m/s wind. Results indicated that speech intelligibility decreased substantially when the wind velocity, and in turn the wind sound pressure level, increased. A two-microphone wind noise suppression strategy was developed. Scores obtained with this strategy indicated substantial gains in speech intelligibility over other conventional noise reduction strategies tested.	\N	\N
24820112	This study aimed to propose an ototoxicity grading system sensitive to the effect of ototoxicity on specific daily life situations like speech intelligibility and the perception of ultra-high sounds and to test its feasibility compared to current criteria. Pure tone averages (PTAs) for speech perception (1-2-4 kHz) and ultra-high frequencies (8-10-12.5 kHz) were incorporated. Threshold shift and hearing level posttreatment were taken into account. Criteria were tested on head and neck cancer patients treated with (chemo-)radiotherapy ([C]RT) and compared with the Common Terminology Criteria for Adverse Events version 4 (CTCAEv4) and the American Speech-Language-Hearing Association criteria (ASHA). Grades 1 and 2 were based on threshold shifts from baseline (in dB) and subjective complaints. Grades 3 and 4 were defined as treatment-induced hearing loss of ≥ 35 dB at PTA 1-2-4 kHz and ≥ 70 dB at PTA 1-2-4 kHz, respectively. In high-dose cisplatin CRT incidences by the new criteria, CTCAEv4 and ASHA were comparable (78%-88%). In RT and low-dose cisplatin CRT, incidences were 36% to 39% in the new criteria versus 22% to 53% in CTCAEv4 and ASHA. The new criteria show an increased sensitivity to ototoxicity compared to CTCAEv4 and ASHA and provide insight into the effect of hearing loss on certain daily life situations. The new grading system seems feasible for clinic and research purposes.	\N	\N
24834939	Our results indicated that electric acoustic stimulation (EAS) is beneficial for Japanese-speaking patients, including those with less residual hearing at lower frequencies. Comparable outcomes for the patients with less residual hearing indicated that current audiological criteria for EAS could be expanded. Successful hearing preservation results, together with the progressive nature of loss of residual hearing in these patients, mean that minimally invasive full insertion of medium/long electrodes in cochlear implantation (CI) surgery is a desirable solution. The minimally invasive concepts that have been obtained through EAS surgery are, in fact, crucial for all CI patients. This study was conducted to evaluate hearing preservation results and speech discrimination outcomes of hearing preservation surgeries using medium/long electrodes. A total of 32 consecutive minimally invasive hearing preservation CIs (using a round window approach with deep insertion of a flexible electrode) were performed in 30 Japanese patients (two were bilateral cases), including patients with less residual hearing. Hearing preservation rates as well as speech discrimination/perception scores were investigated on a multicenter basis. Postoperative evaluation after full insertion of the flexible electrodes (24 mm, 31.5 mm) showed that residual hearing was well preserved in all 32 ears. In all patients, speech discrimination and perception scores were improved postoperatively.	\N	\N
24840132	Helmets provide soldiers with ballistic and fragmentation protection but impair auditory spatial processing. Missed auditory information can be fatal for a soldier; therefore, helmet design requires compromise between protection and optimal acoustics. Twelve soldiers localised two sound signals presented from six azimuth angles and three levels of elevation presented at two intensity levels and with three background noises. Each participant completed the task while wearing no helmet and with two U.S. Army infantry helmets - the Personnel Armor System for Ground Troops (PASGT) helmet and the Advanced Combat Helmet (ACH). Results showed a significant effect of helmet type on the size of both azimuth and elevation error. The effects of level, background noise, azimuth and elevation were found to be significant. There was no effect of sound signal type. As hypothesised, localisation accuracy was greatest when soldiers did not wear a helmet, followed by the ACH. Performance was worst with the PASGT helmet.	\N	\N
24840711	Test data were used to explore the neurocognitive processing of a group of children with cochlear implants (CIs) whose language development is below expectations. This cross-sectional study examines the relationship between neurocognitive processing, as assessed by the Kaufman Assessment Battery for Children-Second Edition, and verbal language standard scores, assessed using either the Comprehensive Assessment of Spoken Language or the Clinical Evaluation of Language Fundamentals in 22 school-age children with CIs. Processing scores of CI recipients with language scores below expectations were compared to those of children meeting or exceeding language expectations. Multiple linear regression estimated the associations of simultaneous and sequential processing with language scores. Though simultaneous processing scores between the two groups were similar, the mean sequential processing score (91.2) in the below expectations group (n = 13) was significantly lower (P = 0.002) than that of children (n = 9) meeting expectations (110.8). After adjusting for age at implantation, a 10-point higher sequential processing score was associated with a 7.4 higher language score (P = 0.027). Simultaneous processing capacity was at least within the average range of cognitive performance, and was not associated with language performance in children with CIs. Conversely, reduced sequential processing capacity was significantly associated with lower language scores. Neurocognitive skills, specifically cognitive sequencing, serial ordering, and auditory-verbal memory may be targets for therapeutic intervention. Intensive cognitive and educational habilitation and in milieu intervention may improve language learning in children with CIs.	\N	\N
24841996	Auditory objects, like their visual counterparts, are perceptually defined constructs, but nevertheless must arise from underlying neural circuitry. Using magnetoencephalography (MEG) recordings of the neural responses of human subjects listening to complex auditory scenes, we review studies that demonstrate that auditory objects are indeed neurally represented in auditory cortex. The studies use neural responses obtained from different experiments in which subjects selectively listen to one of two competing auditory streams embedded in a variety of auditory scenes. The auditory streams overlap spatially and often spectrally. In particular, the studies demonstrate that selective attentional gain does not act globally on the entire auditory scene, but rather acts differentially on the separate auditory streams. This stream-based attentional gain is then used as a tool to individually analyze the different neural representations of the competing auditory streams. The neural representation of the attended stream, located in posterior auditory cortex, dominates the neural responses. Critically, when the intensities of the attended and background streams are separately varied over a wide intensity range, the neural representation of the attended speech adapts only to the intensity of that speaker, irrespective of the intensity of the background speaker. This demonstrates object-level intensity gain control in addition to the above object-level selective attentional gain. Overall, these results indicate that concurrently streaming auditory objects, even if spectrally overlapping and not resolvable at the auditory periphery, are individually neurally encoded in auditory cortex, as separate objects.	\N	\N
24847936	False physiologic monitor alarms are extremely common in the hospital environment. High false alarm rates have the potential to lead to alarm fatigue, leading nurses to delay their responses to alarms, ignore alarms, or disable them entirely. Recent evidence from the U.S. Food and Drug Administration (FDA) and The Joint Commission has demonstrated a link between alarm fatigue and patient deaths. Yet, very little scientific effort has focused on the rigorous quantitative measurement of alarms and responses in the hospital setting. We developed a system using multiple temporarily mounted, minimally obtrusive video cameras in hospitalized patients' rooms to characterize physiologic monitor alarms and nurse responses as a proxy for alarm fatigue. This allowed us to efficiently categorize each alarm's cause, technical validity, actionable characteristics, and determine the nurse's response time. We describe and illustrate the methods we used to acquire the video, synchronize and process the video, manage the large digital files, integrate the video with data from the physiologic monitor alarm network, archive the video to secure servers, and perform expert review and annotation using alarm "bookmarks." We discuss the technical and logistical challenges we encountered, including the root causes of hardware failures as well as issues with consent, confidentiality, protection of the video from litigation, and Hawthorne-like effects. The description of this video method may be useful to multidisciplinary teams interested in evaluating physiologic monitor alarms and alarm responses to better characterize alarm fatigue and other patient safety issues in clinical settings.	\N	\N
24848460	Behavioral and neural findings demonstrate that animals can locate low-frequency sounds along the azimuth by detecting microsecond interaural time differences (ITDs). Information about ITDs is also available in the amplitude modulations (i.e., envelope) of high-frequency sounds. Since medial superior olivary (MSO) neurons encode low-frequency ITDs, we asked whether they employ a similar mechanism to process envelope ITDs with high-frequency carriers, and how effective this mechanism is compared with the processing of low-frequency sound. We developed a novel hybrid in vitro dynamic-clamp approach, which enabled us to mimic synaptic input to brain-slice neurons in response to virtual sound and to create conditions that cannot be achieved naturally but are useful for testing our hypotheses. For each simulated ear, a virtual sound, computer generated, was used as input to a computational auditory-nerve model. Model spike times were converted into synaptic input for MSO neurons, and ITD tuning curves were derived for several virtual-sound conditions: low-frequency pure tones, high-frequency tones modulated with two types of envelope, and speech sequences. Computational models were used to verify the physiological findings and explain the biophysical mechanism underlying the observed ITD coding. Both recordings and simulations indicate that MSO neurons are sensitive to ITDs carried by spectrotemporally complex virtual sounds, including speech tokens. Our findings strongly suggest that MSO neurons can encode ITDs across a broad-frequency spectrum using an input-slope-based coincidence-detection mechanism. Our data also provide an explanation at the cellular level for human localization performance involving high-frequency sound described by previous investigators.	\N	\N
24851353	The purpose of this chapter is to describe the vocabulary development and promising, evidence-based vocabulary interventions for English learners (ELs) from preschool through second grade. To achieve this purpose, we have taken six steps. First, we describe the elements of language development in the native language (L1) and a second language (L2) and how these elements relate to three phases of reading development (i.e., the prereading phase, the learning to read phase, and the reading to learn phase). We contend that in order for ELs to succeed in school, they need a strong language foundation prior to entering kindergarten. This language foundation needs to continue developing during the "learning to read" and "reading to learn" phases. Second, we describe the limitations of current practice in preschool for ELs related to vocabulary instruction and to family involvement to support children's language development. Third, we report curricular challenges faced by ELs in early elementary school, and we relate these challenges to the increase in reading and language demands outlined in the Common Core State Standards (CCSS). Specific language activities that can help meet some of the demands are provided in a table. Fourth, we synthesize the research on evidence-based vocabulary instruction and intervention and discuss implications for practice with ELs. Fifth, we describe two intervention projects under development that have the potential to improve EL vocabulary and language proficiency in the early grades. We conclude with a summary of the chapter and provide additional resources on the topic.	\N	\N
24856412	The objective of this study was to examine the relationship between perceived discrimination and psychotic experiences (PE) using validated measures of discrimination and a racially/ethnically diverse population-level sample. Data were drawn from two population-level surveys (The National Latino and Asian American Survey and The National Survey of American Life), which were analyzed together using survey weights and stratification variables. The analytic sample (N=8990) consisted of Latino, Asian, African-American, and Afro-Caribbean adults living in the United States. Separate unadjusted and adjusted multivariable logistic regression models were used, first to examine the crude bivariate relationship between perceived discrimination and PE, and second to examine the relationship adjusting for demographic variables. Adjusted logistic regression models were also used to examine the relationships between perceived discrimination and specific sub-types of PE (auditory and visual hallucinatory experiences, and delusional ideation). When compared to individuals who did not report any discrimination, those who reported the highest levels of discrimination were significantly more likely to report both 12-month PE (adjusted OR=4.590, p<0.001) and lifetime PE (adjusted OR=4.270, p<0.001). This held true for visual hallucinatory experiences (adjusted OR=3.745, p<0.001), auditory hallucinatory experiences (adjusted OR=5.649, p<0.001), and delusional ideation (adjusted OR=7.208, p<0.001). Perceived discrimination is associated with the increased probability of reporting psychotic experiences in a linear fashion in the US general population.	\N	\N
24861540	Considerable evidence suggests that performance across a variety of cognitive tasks is effectively supported by the use of verbal and nonverbal strategies. Studies exploring the usefulness of such strategies in children with specific language impairment (SLI) are scarce and report inconsistent findings. This study examined the effects of induced labelling and auditory cues on the performance of children with and without SLI during a categorization task. Sixty-six school-age children (22 with SLI, 22 age-matched controls, 22 language-matched controls) completed three versions of a computer-based categorization task: one baseline, one requiring overt labelling and one with auditory cues (tones) on randomized trial blocks. Labelling had no effect on performance for typically developing children but resulted in lower accuracy and longer reaction time in children with SLI. The presence of tones had no effect on accuracy but resulted in faster reaction time and post-error slowing across groups. Verbal strategy use was ineffective for typically developing children and negatively affected children with SLI. All children showed faster performance and increased performance monitoring as a result of tones. Overall, effects of strategy use in children appear to vary based on task demands, strategy domain, age and language ability. Results suggest that children with SLI may benefit from auditory cues in their clinical intervention but that further research is needed to determine when and how verbal strategies might similarly support performance in this population.	\N	\N
24867743	The prevalence of deformational plagiocephaly has risen dramatically in recent years, now affecting 15 percent or more of infants. Prior research using developmental scales suggests that these children may be at elevated risk for developmental delays. However, the low positive predictive value of such instruments in identifying long-term impairment, coupled with their poor reliability in infants, warrants the development of methods to more precisely measure brain function in craniofacial patients. Event-related potentials offer a direct measure of cortical activity that is highly applicable to young populations and has been implemented in other disorders to predict long-term cognitive functioning. The current study used event-related potentials to contrast neural correlates of auditory perception in infants with deformational plagiocephaly and typically developing children. Event-related potentials were recorded while 16 infants with deformational plagiocephaly and 18 nonaffected controls passively listened to speech sounds. Given prior research suggesting their association with subsequent functioning, analyses focused on the P150 and N450 event-related potential components. Deformational plagiocephaly patients and normal controls showed comparable cortical responses to speech sounds at both auditory event-related potential components. Children with deformational plagiocephaly demonstrate neural responses to language that are consistent with normative expectations and comparable to those of typical children. These results indicate that head shape deformity secondary to supine sleep is not associated with impairments in auditory processing. The applicability of the current methods in early infancy suggests that electrophysiologic brain recordings represent a promising method of monitoring brain development in children with cranial disorders. Risk, II.	\N	\N
24869441	Hearing preservation surgery requires an especially atraumatic technique. With preoperative anatomical data on the size of the patient's cochlea, the surgeon can plan the insertion depth. In this study we evaluated the relation between hearing preservation rate and angular insertion depth, estimated intraoperatively and determined postoperatively from the insertion angle measured on radiological assessment and the calculations given by Escude. There was no statistically significant relation between insertion depth angle, whether estimated intraoperatively or measured and calculated postoperatively, and hearing preservation rate in the group. This analysis confirms the atraumaticity of insertion in hearing preservation surgery.	\N	\N
24869443	The aim was to establish whether complex signal processing is beneficial for users of bone anchored hearing aids. We reviewed and analyzed two studies from our own group, each comparing a speech processor with basic digital signal processing (either Baha Divino or Baha Intenso) and a processor with complex digital signal processing (either Baha BP100 or Baha BP110 power). The main differences between basic and complex signal processing are the number of audiologist accessible frequency channels and the availability and complexity of the directional multi-microphone noise reduction and loudness compression systems. Both studies show a small, statistically non-significant improvement of speech understanding in quiet with the complex digital signal processing. The average improvement for speech in noise is +0.9 dB, if speech and noise are emitted both from the front of the listener. If noise is emitted from the rear and speech from the front of the listener, the advantage of the devices with complex digital signal processing as opposed to those with basic signal processing increases, on average, to +3.2 dB (range +2.3 … +5.1 dB, p ≤ 0.0032). Complex digital signal processing does indeed improve speech understanding, especially in noise coming from the rear. This finding has been supported by another study, which has been published recently by a different research group. When compared to basic digital signal processing, complex digital signal processing can increase speech understanding of users of bone anchored hearing aids. The benefit is most significant for speech understanding in noise.	\N	\N
24908093	Inconsistent information from different modalities can be delusive for perception. This phenomenon can be observed with simultaneously presented inconsistent numbers of brief flashes and short tones. The conflict of bimodal information is reflected in double flash or fission, and flash fusion illusions, respectively. The temporal resolution of the visual system plays a fundamental role in the development of these illusions. Because the parallel dorsal and ventral pathways have different temporal resolutions, we presume that these pathways play different roles in the illusions. We used pathway-optimized stimuli to induce the illusions on separately driven visual streams. Our results show that both pathways support the double flash illusion, while the presence of the fusion illusion depends on the activated pathway. The dorsal pathway, which has better temporal resolution, does not support fusion, while the ventral pathway, which has worse temporal resolution, shows fusion strongly.	\N	\N
24909603	The current pupillometry study examined the impact of speech-perception training on word recognition and cognitive effort in older adults with hearing loss. Trainees identified more words at the follow-up than at the baseline session. Training also resulted in an overall larger and faster peaking pupillary response, even when controlling for performance and reaction time. Perceptual and cognitive capacities affected the peak amplitude of the pupil response across participants but did not diminish the impact of training on the other pupil metrics. Thus, we demonstrated that pupillometry can be used to characterize training-related and individual differences in effort during a challenging listening task. Importantly, the results indicate that speech-perception training not only affects overall word recognition, but also a physiological metric of cognitive effort, which has the potential to be a biomarker of hearing loss intervention outcome.	\N	\N
24911919	It is usually easy to understand speech, but when several people are talking at once it becomes difficult. The brain must select one speech stream and ignore distracting streams. We tested a theory about the neural and computational mechanisms of attentional selection. The theory is that oscillating signals in brain networks phase-lock with amplitude fluctuations in speech. By doing this, brain-wide networks acquire information from the selected speech, but ignore other speech signals on the basis of their non-preferred dynamics. Two predictions were supported: first, attentional selection boosted the power of neuroelectric signals that were phase-locked with attended speech, but not ignored speech. Second, this phase selectivity was associated with better recall of the attended speech.	\N	\N
24919347	Auditory stimuli often facilitate visual perception. Audiovisual integration requires spatial and/or temporal proximity between visual and auditory stimuli; additionally, sensory processing speed affects the audiovisual integration process. In the present study we examined the relationship between processing speed and the auditory facilitation effect on visual representations by manipulating dot quantity patterns. We hypothesized that the auditory facilitation effect would be observed in longer interstimulus interval conditions with more dot quantities. This is because more processing time would be required to integrate visual and auditory stimuli. During a backward masking paradigm used in experiment 1, the auditory facilitation effect depended on dot quantity among patterns and the interval between visual stimuli and masks. Moreover, differences between dot quantities in the processing time required to integrate visual and auditory stimuli were confirmed in a same-different discrimination task in experiment 2. Therefore, dot quantity affects sensory processing time, and a longer processing time is required for integrating visual and auditory stimuli when visual dot quantity is high.	\N	\N
24920615	The human voice carries speech as well as important nonlinguistic signals that influence our social interactions. Among these cues that impact our behavior and communication with other people is the perceived emotional state of the speaker. A theoretical framework for the neural processing stages of emotional prosody has suggested that auditory emotion is perceived in multiple steps (Schirmer and Kotz, 2006) involving low-level auditory analysis and integration of the acoustic information followed by higher-level cognition. Empirical evidence for this multistep processing chain, however, is still sparse. We examined this question using functional magnetic resonance imaging and a continuous carry-over design (Aguirre, 2007) to measure brain activity while volunteers listened to non-speech-affective vocalizations morphed on a continuum between anger and fear. Analyses dissociated neuronal adaptation effects induced by similarity in perceived emotional content between consecutive stimuli from those induced by their acoustic similarity. We found that bilateral voice-sensitive auditory regions as well as right amygdala coded the physical difference between consecutive stimuli. In contrast, activity in bilateral anterior insulae, medial superior frontal cortex, precuneus, and subcortical regions such as bilateral hippocampi depended predominantly on the perceptual difference between morphs. Our results suggest that the processing of vocal affect recognition is a multistep process involving largely distinct neural networks. Amygdala and auditory areas predominantly code emotion-related acoustic information while more anterior insular and prefrontal regions respond to the abstract, cognitive representation of vocal affect.	\N	\N
24923315	The present study examines the articulation and acoustics of the typologically rare and understudied 'whistled' fricative sound in Xitsonga, a Southern Bantu language. Using ultrasound imaging and video recording, we examine the lingual and labial articulation of the whistled fricative. For the acoustic analysis, we employ the multitaper spectral analysis, which ensures reliable spectral estimates. The results revealed an interplay between multiple articulators involved in the production of the sound: the retroflex lingual gesture and the narrowing of the lower lip toward the upper teeth. Acoustically, the spectra of the whistled fricative are more peaked and compact than the acoustically similar palatoalveolar fricative, and the differences manifest themselves most clearly in two acoustic parameters, dynamic amplitude (Ad) and M2 (variance). The acoustic differences are also manifested in F2 and F3 in the surrounding vowels. Additionally, the 'whistled' fricative in Xitsonga is not quite whistled, contrary to the label given to the sound in previous studies. Building on the current articulatory and acoustic results, we discuss two different aerodynamic models for the whistled fricatives in Southern Bantu languages and conclude that the whistled fricative in Xitsonga is best characterized as a retroflex segment accompanied by weak whistling.	\N	\N
24923465	It has been suggested that high-frequency audiometry (HFA) could represent a useful preventive measure in exposed workers. The aim was to investigate the effects of age, ultrasound and noise on high-frequency hearing thresholds. We tested 24 industrial ultrasound-exposed subjects, 113 industrial noise-exposed subjects and 148 non-exposed subjects. Each subject was tested with both conventional-frequency (0.125-8 kHz) and high-frequency (9-18 kHz) audiometry. The hearing threshold at high frequency deteriorated as a function of age, especially in subjects more than 30 years old. The ultrasound-exposed subjects had significantly higher hearing thresholds than the non-exposed ones at the high frequencies, being greatest from 10 to 14 kHz. This hearing loss was already significantly evident in subjects with exposure <5 years and increased with years of exposure and advancing age. The noise exposure group had significantly higher hearing thresholds than the non-exposed group at the conventional frequencies 4 and 6 kHz and at the high frequency of 14 kHz. After stratification for age, there was a significant difference between the two groups at 9-10 and 14-15 kHz only for those under 30 years of age. Multivariate analysis indicated that age was the primary predictor, and noise and ultrasound exposure the secondary predictors of hearing thresholds in the high-frequency range. The results suggest that HFA could be useful in the early diagnosis of noise-induced hearing loss in younger groups of workers (under 30 years of age).	\N	\N
24923619	The investigators compared event-related potential (ERP) amplitudes and event-related oscillations across a broad frequency range during an auditory oddball task using a comprehensive analysis approach to describe shared and unique neural auditory processing characteristics among healthy subjects (HP), schizophrenia probands (SZ) and their first-degree relatives, and bipolar disorder I with psychosis probands (BDP) and their first-degree relatives. This Bipolar-Schizophrenia Network on Intermediate Phenotypes sample consisted of clinically stable SZ (n = 229) and BDP (n = 188), HP (n = 284), first-degree relatives of schizophrenia probands (n = 264), and first-degree relatives of bipolar disorder I with psychosis probands (n = 239). They were administered an auditory oddball task in the electroencephalography environment. Principal components analysis derived data-driven frequency bands of evoked power. Spatial principal components analysis reduced ERP and frequency data to component waveforms for each subject. Clusters of time bins with significant group differences on response magnitude were assessed for proband/relative differences from HP and familiality. Nine variables survived a linear discriminant analysis between HP, SZ, and BDP. Of those, two showed evidence (deficit in relatives and familiality) as genetic risk markers more specific to SZ (N1, P3b), one was specific to BDP (P2) and one for psychosis in general (N2). This study provides support for both shared and unique deficits in early sensory and late cognitive processing across psychotic diagnostic groups. Additional ERP and time-frequency component alterations (frontal N2/P2, late high, early, mid, and low frequency) may provide insight into deficits in underlying neural architecture and potential protective/compensatory mechanisms in unaffected relatives.	\N	\N
24933411	Cortical auditory evoked potentials (CAEPs) were obtained for vowel tokens presented in an oddball stimulus paradigm. Perceptual measures of vowel discrimination were obtained using a visually-reinforced head-turn paradigm. The hypothesis was that CAEP latencies and amplitudes would differ as a function of vowel type and be correlated with perceptual performance. Twenty normally hearing infants aged 4-12 months were evaluated. CAEP component amplitudes and latencies were measured in response to the standard, frequent token /a/ and for infrequent, deviant tokens /i/, /o/ and /u/, presented at rates of 1 and 2 tokens/s. The perceptual task required infants to make a behavioral response for trials that contained two different vowel tokens, and ignore those in which the tokens were the same. CAEP amplitudes were larger in response to the deviant tokens, when compared to the control condition in which /a/ served as both standard and deviant. This was also seen in waveforms derived by subtracting the response to standard /a/ from the responses to deviant tokens. CAEP component latencies in derived responses at 2/s also demonstrated some sensitivity to vowel contrast type. The average hit rate for the perceptual task was 68.5%, with a 25.7% false alarm rate. There were modest correlations of CAEP amplitudes and latencies with perceptual performance. The CAEP amplitude differences for vowel contrasts could be used as an indicator of the underlying neural capacity to encode spectro-temporal differences in vowel sounds. This technique holds promise for translation to clinical methods for evaluating speech perception.	\N	\N
24936778	To understand the third mobile window effect of chronic otitis media with cholesteatoma with inner ear fistula on the bone conduction threshold, we examined changes in the bone conduction audiogram after tympanoplasty with mastoidectomy for chronic otitis media with cholesteatoma with canal fistula. Retrospective case review. Tertiary referral center. According to the intraoperative classification of Dornhoffer and Milewski, we focused especially on Type IIa (anatomic bony fistula with no perilymph leak). We checked the bone conduction threshold at least 3 times: just before, just after, and 6 months after surgery in 20 ears with Type IIa lateral semicircular canal fistula. Tympanoplasty with mastoidectomy. Bone conduction thresholds before and after tympanoplasty with mastoidectomy. Compared with the preoperative bone conduction threshold, 6 cases were better, 12 cases were unchanged, and 2 cases were worse within the first postoperative week. Finally, 1 case was better, 15 cases were unchanged, and 4 cases were worse at the sixth postoperative month. Patients with a better bone conduction threshold in the low-tone frequencies immediately after surgery had a tendency to show no preoperative fistula symptoms. Postoperative spontaneous nystagmus had a tendency to be observed in patients with a worse bone conduction threshold in the high-tone frequencies. The better bone conduction threshold at low-tone frequencies immediately after tympanoplasty with mastoidectomy and no preoperative fistula symptoms might imply the third mobile window theory. The worse bone conduction threshold in high-tone frequencies with spontaneous nystagmus after surgery might indicate inner ear damage.	\N	\N
24937187	Change deafness is the failure to notice changes in an auditory scene. In this study, we sought to determine if change deafness is a perceptual error, rather than only a reflection of verbal memory limitations. We also examined how successful encoding of objects within a scene is related to successful detection of changes. Event-related potentials (ERPs) were recorded while listeners completed a change-detection and an object-encoding task with scenes composed of recognizable sounds or unrecognizable temporally scrambled versions of the recognizable sounds. More change deafness occurred for the unrecognizable, compared to recognizable sounds, indicating that change deafness is a perceptual error and not solely a product of verbal memory. ERPs from both the recognizable and unrecognizable scenes revealed an enhanced P3b (at PZ/1/2, POZ/3/4 from 350 to 750ms) to detected changes, a marker that conscious change detection has occurred. Recognizable scenes resulted in an enhanced T400 (at T8/TP8, C6/CP6 from 315 to 660ms) to detected changes, possibly indicating activation of established memory representations. Unrecognizable scenes elicited an enhanced P3a (at FCZ/1/2 from 280 to 600ms) to detected changes, indicating enhanced orienting to acoustic change. Performance on the object-encoding task revealed that change deafness was reduced, but not eliminated, when performance on the object-encoding task was accurate.	\N	\N
24937544	Executive functions (EF) are cognitive capacities that allow for planned, controlled behavior and strongly correlate with academic abilities. Several extracurricular activities have been shown to improve EF; however, the relationship between musical training and EF remains unclear due to methodological limitations in previous studies. To explore this further, two experiments were performed: one with 30 adults with and without musical training and one with 27 musically trained and untrained children (matched for general cognitive abilities and socioeconomic variables) with a standardized EF battery. Furthermore, the neural correlates of EF skills in musically trained and untrained children were investigated using fMRI. Adult musicians compared to non-musicians showed enhanced performance on measures of cognitive flexibility, working memory, and verbal fluency. Musically trained children showed enhanced performance on measures of verbal fluency and processing speed, and significantly greater activation in pre-SMA/SMA and right VLPFC during rule representation and task-switching compared to musically untrained children. Overall, musicians show enhanced performance on several constructs of EF, and musically trained children further show heightened brain activation in traditional EF regions during task-switching. These results support the working hypothesis that musical training may promote the development and maintenance of certain EF skills, which could mediate the previously reported links between musical training and enhanced cognitive skills and academic achievement.	\N	\N
24949818	During childhood, verbal learning and memory are important for academic performance. Recent functional MRI studies have reported on the functional correlates of verbal memory proficiency, but few have reported the underlying structural correlates. The present study sought to test the relationship between fronto-temporal white matter integrity and verbal memory proficiency in children. Diffusion weighted images were collected from 17 Black children (age 8-11 years) who also completed the California Verbal Learning Test. To index white matter integrity, fractional anisotropy values were calculated for bilateral uncinate fasciculus. The results revealed that low anisotropy values corresponded to poor verbal memory, whereas high anisotropy values corresponded to significantly better verbal memory scores. These findings suggest that a greater degree of myelination and cohesiveness of axonal fibers in uncinate fasciculus underlie better verbal memory proficiency in children.	\N	\N
24952106	This study evaluated the clinical effectiveness of wireless contralateral routing of offside signals hearing aids (CROS) in patients with severe to profound unilateral sensorineural hearing loss (USNHL). Twenty-one patients with USNHL were enrolled in this prospective study. The change of subjective satisfaction was evaluated using three questionnaires (K-HHIE, K-IOI-HA, K-SSQ). Changes in objective measurements were evaluated with sound localization test (SLT) and hearing in noise test (HINT). These tests were performed at pre-CROS fitting, 2 and 4 weeks after use of CROS. Subjects were grouped according to the age: young (<40 years) vs. old (≥40 years) group. The average K-HHIE and K-SSQ scores significantly improved with the use of CROS. SLT result revealed that hit rate and error degree improved in the young group and lateralization ability improved in both groups. In quiet environments, the reception threshold for speech also indicated a significant benefit in the young group. When the noise was presented to the normal ear, HINT revealed benefit of CROS, while loss of performance with CROS use was significant when noise was presented to the impaired ear. Wireless CROS provided increased satisfaction and overall improvement of localization and hearing. Although true binaural hearing cannot be obtained, CROS is a practical option for rehabilitation of USNHL.	\N	\N
24959621	Accurate and effective voice activity detection (VAD) is a fundamental step for robust speech or speaker recognition. In this study, we proposed a hierarchical framework approach for VAD and speech enhancement. The modified Wiener filter (MWF) approach is utilized for noise reduction in the speech enhancement block. For the feature selection and voting block, several discriminating features were employed in a voting paradigm for the consideration of reliability and discriminative power. Effectiveness of the proposed approach is compared and evaluated to other VAD techniques by using two well-known databases, namely, TIMIT database and NOISEX-92 database. Experimental results show that the proposed method performs well under a variety of noisy conditions.	\N	\N
24960432	Research on unconscious or unaware vision has demonstrated that unconscious processing can be flexibly adapted to the current goals of human agents. The present review focuses on one area of research, masked visual priming. This method uses visual stimuli presented in a temporal sequence to lower the visibility of one of these stimuli. In this way, a stimulus can be masked and even rendered invisible. Despite its invisibility, a masked stimulus if used as a prime can influence a variety of executive functions, such as response activation, semantic processing, or attention shifting. There are also limitations on the processing of masked primes. While masked priming research demonstrates the top-down dependent usage of unconscious vision during task-set execution it also highlights that the set-up of a new task-set depends on conscious vision as its input. This basic distinction captures a major qualitative difference between conscious and unconscious vision.	\N	\N
24961249	The sound-induced flash illusion (SIFI) is a multisensory perceptual phenomenon in which the number of brief visual stimuli perceived by an observer is influenced by the number of concurrently presented sounds. While the strength of this illusion has been shown to be modulated by the temporal congruence of the stimuli from each modality, there is conflicting evidence regarding its dependence upon their spatial congruence. We addressed this question by examining SIFIs under conditions in which the spatial reliability of the visual stimuli was degraded and different sound localization cues were presented using either free-field or closed-field stimulation. The likelihood of reporting a SIFI varied with the spatial cue composition of the auditory stimulus and was highest when binaural cues were presented over headphones. SIFIs were more common for small flashes than for large flashes, and for small flashes at peripheral locations, subjects experienced a greater number of illusory fusion events than fission events. However, the SIFI was not dependent on the spatial proximity of the audiovisual stimuli, but was instead determined primarily by differences in subjects' underlying sensitivity across the visual field to the number of flashes presented. Our findings indicate that the influence of auditory stimulation on visual numerosity judgments can occur independently of the spatial relationship between the stimuli.	\N	\N
24972303	Groove-based rhythm is a basic and much appreciated feature of Western popular music. It is commonly associated with dance, movement and pleasure and is characterized by the repetition of a basic rhythmic pattern. At various points in the musical course, drum breaks occur, representing a change compared to the repeated pattern of the groove. In the present experiment, we investigated the brain response to such drum breaks in a repetitive groove. Participants were scanned with functional magnetic resonance imaging (fMRI) while listening to a previously unheard naturalistic groove with drum breaks at uneven intervals. The rhythmic pattern and the timing of its different parts as performed were the only aspects that changed from the repetitive sections to the breaks. Differences in blood oxygen level-dependent activation were analyzed. In contrast to the repetitive parts, the drum breaks activated the left cerebellum, the right inferior frontal gyrus (RIFG), and the superior temporal gyri (STG) bilaterally. A tapping test using the same stimulus showed an increase in the standard deviation of inter-tap-intervals in the breaks versus the repetitive parts, indicating extra challenges for auditory-motor integration in the drum breaks. Both the RIFG and STG have been associated with structural irregularity and increase in musical-syntactical complexity in several earlier studies, whereas the left cerebellum is known to play a part in timing. Together these areas may be recruited in the breaks due to a prediction error process whereby the internal model is being updated. This concurs with previous research suggesting a network for predictive feed-forward control that comprises the cerebellum and the cortical areas that were activated in the breaks.	\N	\N
24972535	The aim of this study was to establish a multiparameter voice assessment profile using an objective multiparameter test and subjective voice quality assessment. We assessed 50 patients with voice disorders before and after operation. The assessment incorporates (1) subjective voice quality assessment, (2) patients' self-assessment, and (3) objective acoustic analysis. The subjective voice quality assessment uses the GRBAS system, proposed by the Japanese Society for Logopedics and Phoniatrics, to evaluate the grade of hoarseness (G). Patients' self-assessment is modified based on the Chinese version of the voice handicap index (VHI) scale, composed of functional (F), physiological (P), and emotional (E) parts, and a total score (T). The acoustic analysis evaluates the patients' voice samples with the voice analysis software "Dr. Speech". Three parameters, jitter (J), shimmer (S), and normalized noise energy (NNE), were included in the analysis. We observed high correlations among subentries F, P, and the total score T of the VHI scale in patients' subjective assessment. Parameter E does not correlate well with other assessed parameters. The Chinese version of VHI, which incorporates multiple factors including age, education, and especially cultural difference, may account for the inconsistent correlation in parameter E. In the objective acoustic analysis, high correlation among the three parameters J, S, and NNE is observed. Systemic assessment combining a subjective voice quality assessment, an objective acoustic analysis, and a self-assessment is helpful in clinical practice in the diagnosis and treatment of voice disorders. The E component in VHI scale assessment may not be a reliable parameter to evaluate treatment outcome.	\N	\N
24975453	The Hearing Implant Sound Quality Index (HISQUI19) seems to be a valid tool for quantifying the self-perceived level of auditory benefit that cochlear implant (CI) users experience in everyday listening situations. Additional research is, however, required. To develop and validate a user-friendly instrument for quantifying the self-perceived level of auditory benefit that CI users experience in everyday listening situations. This was an explorative, uncontrolled, single-group, cross-sectional study. Items for the HISQUI19 were decided upon using user input and verified by professionals. The HISQUI19 was assessed on 75 CI users from hearing implant centres in Germany and Austria to determine the questions. The HISQUI19, consisting of 19 items scored on a 7-point Likert scale, was validated. Subjects older than 60 years at time of implantation did not have significantly higher mean values than subjects younger than 60 years. Gender and whether subjects are unilateral or bilateral implant CI users did not influence self-perceived functioning. Subjects with ≤20 years of hearing loss reported no significantly higher functioning than those with >20 years of hearing loss.	\N	\N
24980742	The motivation for infants' non-word vocalizations in the second half of the first year of life and later is unclear. This study of hearing infants and infants with profound hearing loss with and without cochlear implants addressed the hypothesis that vocalizations are primarily motivated by auditory feedback. Early access to cochlear implants has created unique conditions of auditory manipulation that permit empirical tests of relations between auditory perception and infant behavior. Evidence from two separate tests of the research hypothesis showed that, before cochlear implantation, infants with profound hearing loss vocalized significantly less often than hearing infants; however, soon after cochlear implantation, they vocalized at levels commensurate with hearing peers. In contrast, vocal behaviors that are typically considered reflexive or emotion-based signals (e.g., crying) were infrequent overall and did not vary with auditory access. These results support the hypothesis that auditory feedback is a critical component motivating early vocalization frequency.	\N	\N
24990679	The human music faculty might have evolved from rudimentary components that occur in non-human animals. The evolutionary history of these rudimentary perceptual features is not well understood and rarely extends beyond a consideration of vertebrates that possess a cochlea. One such antecedent is a preferential response to what humans perceive as consonant harmonic sounds, which are common in many animal vocal repertoires. We tested the phonotactic response of female túngara frogs (Physalaemus pustulosus) to variations in the frequency ratios of their harmonically structured mating call to determine whether frequency ratio influences attraction to acoustic stimuli in this vertebrate that lacks a cochlea. We found that the ratio of frequencies present in acoustic stimuli did not influence female response. Instead, the amount of inner ear stimulation predicted female preference behaviour. We conclude that the harmonic relationships that characterize the vocalizations of these frogs did not evolve in response to a preference for frequency intervals with low-integer ratios. Instead, the presence of harmonics in their mating call, and perhaps in the vocalizations of many other animals, is more likely due to the biomechanics of sound production rather than any preference for 'more musical' sounds.	\N	\N
24993544	Three experiments investigated memory for semantic information with the goal of determining boundary conditions for the manifestation of semantic auditory distraction. Irrelevant speech disrupted the free recall of semantic category-exemplars to an equal degree regardless of whether the speech coincided with presentation or test phases of the task (Experiment 1), and this occurred regardless of whether it comprised random words or coherent sentences (Experiment 2). The effects of background speech were greater when the irrelevant speech was semantically related to the to-be-remembered material, but only when the irrelevant words were high in output dominance (Experiment 3). The implications of these findings in relation to the processing of task material and the processing of background speech are discussed.	\N	\N
24993633	To compare the Naida CI UltraZoom adaptive beamformer and T-Mic settings in a real life environment. Speech reception thresholds (SRTs) were measured in a moderately reverberant room, using the German Oldenburger sentence test. The speech signal was always presented from the front loudspeaker at 0° azimuth and fixed masking noise was presented either simultaneously from all eight loudspeakers around the subject at 0°, ±45°, ±90°, ±135°, and 180° azimuth or from five loudspeakers positioned at ±70°, ±135°, and 180° azimuth. In the third test setup, an additional roving noise was added to the six loudspeaker arrangement. There was a significant difference in mean SRTs between the Naida CI T-Mic and UltraZoom in each of the three test setups. The largest improvements were seen in the six speaker roving and fixed noise conditions. Adding ClearVoice to the Naida CI T-Mic setting significantly improved the SRT in both fixed noise conditions, but not in the roving noise condition. In each setup, the lowest SRTs were obtained with the UltraZoom plus ClearVoice setting. The degree of improvement was consistent with previous beamforming studies. In the most challenging listening situation, with noise from eight speakers and speech and noise presented coincidentally from the front, UltraZoom still provided a significant benefit. When a moving noise source was added, the improvement in SRT provided by UltraZoom was maintained. When tested in challenging and realistic noise environments, the Naida CI UltraZoom adaptive beamformer resulted in significantly lower mean SRTs than when the T-Mic alone was used.	\N	\N
24995902	Neuroplasticity (NPL), neuromodulation (NM), and neuroprotection (NPT) are ongoing biophysiological processes that are linked together in sensory systems, the goal being the maintenance of a homeostasis of normal sensory function in the central nervous system. It is hypothesized that when the balance between excitatory and inhibitory action is broken in sensory systems, predominantly due to neuromodulatory activity with reduced induced inhibition so that excitation predominates, sensory circuits become plastic with adaptation at synaptic levels to environmental inputs (1). Tinnitus, an aberrant auditory sensation, is clinically considered, for all clinical types, to reflect a failure of NPL, NM, and NPT to maintain normal auditory function at synaptic levels in sensory cortex and projected to downstream levels in the central auditory system in brain and sensorineural elements in ear. Clinically, the tinnitus sensation becomes behaviorally manifest with varying degrees of annoyance, reflecting a principle of sensory physiology that each sensation has components, i.e., sensory, affect/behavior, psychomotor and memory. Modalities of tinnitus therapies, e.g., instrumentation, pharmacology, surgery, target a particular component of tinnitus, with resultant activation of neuromodulators at multiple neuromodulatory centers in brain and ear. Effective neuromodulation at sensory neuronal synaptic levels results in NPL in sensory cortex, NPT and tinnitus relief. Functional brain imaging, metabolic (PET brain) and electrophysiology quantitative electroencephalography (QEEG) data in a cochlear implant soft failure patient demonstrate what is clinically considered to reflect NPL, NM, NPT. The reader is provided with a rationale for tinnitus diagnosis and treatment, with a focus on ES, reflecting the biology underlying NPL, NM, NPT.	\N	\N
25013945	Music as alternate engagement (MAE) can be used effectively to distract children during painful or anxiety-provoking medical procedures. For such interventions to be successful, it would seem important to assess the degree to which a child can attend to musical stimuli. The purposes of this study were as follows: (a) To establish construct validity by determining the extent to which the Music Attentiveness Screening Assessment (MASA) measures auditory attention; and (b) to gather evidence regarding MASA test-retest and inter-observer reliability. The Auditory Attention (AA) subtest from the NEPSY-II (NEPSY, Second Edition) and the two items from MASA were administered to a nonclinical sample of children (N = 50) aged 5 to 9 years. There was a statistically significant proportion of AA score variance shared with MASA (both items), R² = .21, F(2, 47) = 6.34, p = .004. Test-retest reliability on the first MASA item was moderately high (Pearson r = .84) while on the second item it was lower (r = .63). Similarly, interobserver agreement was high for Item I (intraclass correlation coefficient [ICC] = .95) and lower for Item II (ICC = .71). Evidence suggests that MASA measures, at least in part, auditory attention. Despite this finding, a large proportion of unexplained variance remains. Furthermore, reliability estimates (test-retest and interobserver agreement) differ between both items. These findings are discussed with particular attention paid to the ways in which MASA should be revised and further study conducted.	\N	\N
25016092	Lesion and neuroimaging studies indicate that the insula mediates motor aspects of speech production, specifically, articulatory control. Although it has direct connections to Broca's area, the canonical speech production region, the insula is also broadly connected with other speech and language centres, and may play a role in coordinating higher-order cognitive aspects of speech and language production. The extent of the insula's involvement in speech and language processing was assessed using the Activation Likelihood Estimation (ALE) method. Meta-analyses of 42 fMRI studies with healthy adults were performed, comparing insula activation during performance of language (expressive and receptive) and speech (production and perception) tasks. Both tasks activated bilateral anterior insulae. However, speech perception tasks preferentially activated the left dorsal mid-insula, whereas expressive language tasks activated left ventral mid-insula. Results suggest distinct regions of the mid-insula play different roles in speech and language processing.	\N	\N
25026154	The purpose of this study was to compare 3 T and 1.5 T fMRI results during emotional music listening. Stimuli comprised psychoacoustically balanced instrumental musical pieces, with three different affective expressions (fear, neutral, joy). Participants (N=32) were split into two groups, one subjected to fMRI scanning using 3 T and another group scanned using 1.5 T. Whole brain t-tests (corrected for multiple comparisons) compared joy and fear in each of the two groups. The 3 T group showed significant activity differences between joy and fear localized in bilateral superficial amygdala, bilateral hippocampus and bilateral auditory cortex. The 1.5 T group showed significant activity differences between joy and fear localized in bilateral auditory cortex and cuneus. This is the first study to compare results obtained under different field strengths with regard to affective processes elicited by means of auditory/musical stimulation. The findings raise concern over false negatives in the superficial amygdala and hippocampus in affective studies conducted under 1.5 T and caution that imaging improvements due to increasing magnetic field strength can be influenced by region-specific characteristics.	\N	\N
25031365	The neural mechanisms underlying the attainment of fear memory accuracy for appropriate discriminative responses to aversive and nonaversive stimuli are unclear. Considerable evidence indicates that coactivator of transcription and histone acetyltransferase cAMP response element binding protein (CREB) binding protein (CBP) is critically required for normal neural function. CBP hypofunction leads to severe psychopathological symptoms in human and cognitive abnormalities in genetic mutant mice with severity dependent on the neural locus and developmental time of the gene inactivation. Here, we showed that an acute hypofunction of CBP in the medial prefrontal cortex (mPFC) results in a disruption of fear memory accuracy in mice. In addition, interruption of CREB function in the mPFC also leads to a deficit in auditory discrimination of fearful stimuli. While mice with deficient CBP/CREB signaling in the mPFC maintain normal responses to aversive stimuli, they exhibit abnormal responses to similar but nonrelevant stimuli when compared to control animals. These data indicate that improvement of fear memory accuracy involves mPFC-dependent suppression of fear responses to nonrelevant stimuli. Evidence from a context discriminatory task and a newly developed task that depends on the ability to distinguish discrete auditory cues indicated that CBP-dependent neural signaling within the mPFC circuitry is an important component of the mechanism for disambiguating the meaning of fear signals with two opposing values: aversive and nonaversive.	\N	\N
25032683	Categorization is an important cognitive process. However, the correct categorization of a stimulus is often challenging because categories can have overlapping boundaries. Whereas perceptual categorization has been extensively studied in vision, the analogous phenomenon in audition has yet to be systematically explored. Here, we test whether and how human subjects learn to use category distributions and prior probabilities, as well as whether subjects employ an optimal decision strategy when making auditory-category decisions. We asked subjects to classify the frequency of a tone burst into one of two overlapping, uniform categories according to the perceived tone frequency. We systematically varied the prior probability of presenting a tone burst with a frequency originating from one versus the other category. Most subjects learned these changes in prior probabilities early in testing and used this information to influence categorization. We also measured each subject's frequency-discrimination thresholds (i.e., their sensory uncertainty levels). We tested each subject's average behavior against variations of a Bayesian model that either led to optimal or sub-optimal decision behavior (i.e. probability matching). In both predicting and fitting each subject's average behavior, we found that probability matching provided a better account of human decision behavior. The model fits confirmed that subjects were able to learn category prior probabilities and approximate forms of the category distributions. Finally, we systematically explored the potential ways that additional noise sources could influence categorization behavior. We found that an optimal decision strategy can produce probability-matching behavior if it utilized non-stationary category distributions and prior probabilities formed over a short stimulus history. Our work extends previous findings into the auditory domain and reformulates the issue of categorization in a manner that can help to interpret the results of previous research within a generative framework.	\N	\N
25033791	Noise has the potential to impair cognitive performance. For nonnative speakers, the effect of noise on performance is more severe than for their native counterparts. What remains unknown is the effectiveness of countermeasures such as noise attenuating devices in such circumstances. Therefore, the main aim of the present research was to examine the effectiveness of active noise attenuating countermeasures in the presence of simulated aircraft noise for both native and nonnative English speakers. Thirty-two participants, half native English speakers and half native German speakers, completed four recognition (cued) recall tasks presented in English under four different audio conditions, all in the presence of simulated aircraft noise. The results of the research indicated that in simulated aircraft noise at 65 dB(A), performance of nonnative English speakers was poorer than for native English speakers. The beneficial effects of noise cancelling headphones in improving the signal to noise ratio led to an improved performance for nonnative speakers. These results have particular importance for organizations operating in a safety-critical environment such as aviation.	\N	\N
25046122	Behavioral investigations of the acquisition of "some" have shown that children favor its logical interpretation ("some and possibly all"). Adults, however, use the pragmatic interpretation ("some but not all") derived by a scalar implicature. Certain experimental manipulations increase children's rates of adult-like responses, indicating that children are capable of computing implicatures. A functional MRI (fMRI) study examining adults linked the left inferior frontal gyrus (IFG) to implicature computation, and prefrontal regions, the left middle frontal gyrus (MFG), and medial frontal gyrus (MeFG), to processing the mismatch between implicatures and the context in which they were presented. In the current fMRI study, we aimed to determine whether children's failure to give pragmatic interpretations to "some" results from a failure in implicature computation or in implicature-mismatch processing. We explored children's brain activations with the same experimental task administered to adults. In a region-of-interest analysis, children showed an activational pattern similar to the one observed in adults in the left IFG with increased activations for the implicature conditions. By contrast, in the left MFG, children showed decreased activation for the mismatched implicatures compared with matched and no implicature conditions. No difference between the conditions was observed in the MeFG. For both implicature conditions, no activation in the left IFG was observed when comparing adults and children directly. However, for mismatched implicatures, adults showed greater activation in the prefrontal regions compared with children. Our results suggest that children may have an adult-like computation of implicatures (even when their behavior does not necessarily indicate that), but they fail in resolving implicature-mismatch situations.	\N	\N
25056109	There is converging evidence for the notion that pain affects a broad range of attentional domains. This study investigated the influence of pain on the involuntary capture of attention as indexed by the P3a component in the event-related potential derived from the electroencephalogram. Participants performed in an auditory oddball task in a pain-free and a pain condition during which they submerged a hand in cold water. Novel, infrequent and unexpected auditory stimuli were presented randomly in a series of frequent standard and infrequent target tones. P3a and P3b amplitudes were observed to novel, unexpected and target-related stimuli, respectively. Both electrophysiological components were characterized by reduced amplitudes in the pain compared with the pain-free condition. Hit rate and reaction time to target stimuli did not differ between the two conditions presumably because the experimental task was not difficult enough to exceed attentional capacities under pain conditions. These results indicate that voluntary attention serving the maintenance and control of ongoing information processing (reflected by the P3b amplitude) is impaired by pain. In addition, the involuntary capture of attention and orientation to novel, unexpected information (measured by the P3a) is also impaired by pain. Thus, neurophysiological measures examined in this study support the theoretical positions proposing that pain can reduce attentional processing capacity. These findings have potentially important implications at the theoretical level for our understanding of the interplay of pain and cognition, and at the therapeutic level for the clinical treatment of individuals experiencing ongoing pain.	\N	\N
25064434	The present magnetoencephalography study used the cortically constrained minimum-norm estimates of human brain activity to elucidate functional roles of neural generators for detecting different magnitudes of lexical tone changes. A multiple-deviant oddball paradigm was used in which the syllable "yi" with a low-dipping tone (T3) was the common standard sound and the same syllable with a high-level tone (T1) or a high-rising tone (T2) were the large and small deviant sounds, respectively. The data revealed a larger magnetic mismatch field (MMNm) for the large deviant in the left hemisphere. The source analysis also confirmed that the MMNm to lexical tone changes was generated in bilateral superior temporal gyri and only the large deviant revealed left lateralization. A set of frontal generators was activated at a later time and revealed differential sensitivities to the degree of deviance. The left anterior insula, the right anterior cingulate cortex, and the right ventral orbital frontal cortex were activated when detecting a large deviant, whereas the right frontal-opercular region was sensitive to the small deviant. These frontal generators were thought to be associated with various top-down mechanisms for attentional modulation. The time frequency (TF) analysis showed that large deviants yielded large theta band (5-7 Hz) activity over the left anterior scalp and the left central scalp, while small deviants yielded large alpha band activity (9-11 Hz) over the posterior scalp. The results of TF analyses implied that mechanisms of working memory and functional inhibition are involved in the processes of acoustic change detection.	\N	\N
25074900	Studies of visual masking have provided a wide range of important insights into the processes involved in visual coding. However, very few of these studies have employed natural scenes as masks. Little is known on how the particular features found in natural scenes affect visual detection thresholds and how the results obtained using unnatural masks relate to the results obtained using natural masks. To address this issue, this paper describes a psychophysical study designed to obtain local contrast detection thresholds for a database of natural images. Via a three-alternative forced-choice experiment, we measured thresholds for detecting 3.7 cycles/° vertically oriented log-Gabor noise targets placed within an 85 × 85-pixels patch (1.9° patch) drawn from 30 natural images from the CSIQ image database (Larson & Chandler, Journal of Electronic Imaging, 2010). Thus, for each image, we obtained a masking map in which each entry in the map denotes the root mean squared contrast threshold for detecting the log-Gabor noise target at the corresponding spatial location in the image. From qualitative observations we found that detection thresholds were affected by several patch properties such as visual complexity, fineness of textures, sharpness, and overall luminance. Our quantitative analysis shows that except for the sharpness measure (correlation coefficient of 0.7), the other tested low-level mask features showed a weak correlation (correlation coefficients less than or equal to 0.52) with the detection thresholds. Furthermore, we evaluated the performance of a computational contrast gain control model that performed fairly well with an average correlation coefficient of 0.79 in predicting the local contrast detection thresholds. We also describe specific choices of parameters for the gain control model. The objective of this database is to provide researchers with a large ground-truth dataset in order to further investigate the properties of the human visual system using natural masks.	\N	\N
25080602	In an ever-changing environment, selecting appropriate responses in conflicting situations is essential for biological survival and social success and requires cognitive control, which is mediated by dorsomedial prefrontal cortex (DMPFC) and dorsolateral prefrontal cortex (DLPFC). How these brain regions communicate during conflict processing (detection, resolution, and adaptation), however, is still unknown. The Stroop task provides a well-established paradigm to investigate the cognitive mechanisms mediating such response conflict. Here, we explore the oscillatory patterns within and between the DMPFC and DLPFC in human epilepsy patients with intracranial EEG electrodes during an auditory Stroop experiment. Data from the DLPFC were obtained from 12 patients. Thereof four patients had additional DMPFC electrodes available for interaction analyses. Our results show that an early θ (4-8 Hz) modulated enhancement of DLPFC γ-band (30-100 Hz) activity constituted a prerequisite for later successful conflict processing. Subsequent conflict detection was reflected in a DMPFC θ power increase that causally entrained DLPFC θ activity (DMPFC to DLPFC). Conflict resolution was thereafter completed by coupling of DLPFC γ power to DMPFC θ oscillations. Finally, conflict adaptation was related to increased postresponse DLPFC γ-band activity and to θ coupling in the reverse direction (DLPFC to DMPFC). These results draw a detailed picture on how two regions in the prefrontal cortex communicate to resolve cognitive conflicts. In conclusion, our data show that conflict detection, control, and adaptation are supported by a sequence of processes that use the interplay of θ and γ oscillations within and between DMPFC and DLPFC.	\N	\N
25090306	It is widely acknowledged that individuals with elevated depressive symptoms exhibit deficits in inter-personal communication. Research has primarily focused on speech production in individuals with elevated depressive symptoms. Little is known about speech perception in individuals with elevated depressive symptoms, especially in challenging listening conditions. Here, we examined speech perception in young adults with low- or high-depressive (HD) symptoms in the presence of a range of maskers. Maskers were selected to reflect various levels of informational masking (IM), which refers to cognitive interference due to signal and masker similarity, and energetic masking (EM), which refers to peripheral interference due to signal degradation by the masker. Speech intelligibility data revealed that individuals with HD symptoms did not differ from those with low-depressive symptoms during EM, but they exhibited a selective deficit during IM. Since IM is a common occurrence in real-world social settings, this listening deficit may exacerbate communicative difficulties.	\N	\N
25092665	What do we hear when someone speaks and what does auditory cortex (AC) do with that sound? Given how meaningful speech is, it might be hypothesized that AC is most active when other people talk so that their productions get decoded. Here, neuroimaging meta-analyses show the opposite: AC is least active and sometimes deactivated when participants listened to meaningful speech compared to less meaningful sounds. Results are explained by an active hypothesis-and-test mechanism where speech production (SP) regions are neurally re-used to predict auditory objects associated with available context. By this model, more AC activity for less meaningful sounds occurs because predictions are less successful from context, requiring further hypotheses be tested. This also explains the large overlap of AC co-activity for less meaningful sounds with meta-analyses of SP. An experiment showed a similar pattern of results for non-verbal context. Specifically, words produced less activity in AC and SP regions when preceded by co-speech gestures that visually described those words compared to those words without gestures. Results collectively suggest that what we 'hear' during real-world speech perception may come more from the brain than our ears and that the function of AC is to confirm or deny internal predictions about the identity of sounds.	\N	\N
25096108	The summation of loudness across ears is often studied by measuring the level difference required for equal loudness (LDEL) of monaural and diotic sounds. Typically, the LDEL is ∼5-6 dB, consistent with the idea that a diotic sound is ∼1.5 times as loud as the same sound presented monaurally at the same level, as predicted by the loudness model of Moore and Glasberg [J. Acoust. Soc. Am. 121, 1604-1612 (2007)]. One might expect that the LDEL would be <5-6 dB for hearing-impaired listeners, because loudness recruitment leads to a more rapid change of loudness for a given change in level. However, previous data sometimes showed similar LDEL values for normal-hearing and hearing-impaired listeners. Here, the LDEL was measured for hearing-impaired listeners using narrowband and broadband noises centered at 500 Hz, where audiometric thresholds were near-normal, and at 3000 or 4000 Hz, where audiometric thresholds were elevated. The mean LDEL was 5.6 dB at 500 Hz and 4.2 dB at the higher center frequencies. The results were predicted reasonably well by an extension of the loudness model of Moore and Glasberg.	\N	\N
25096138	The effects of audiovisual versus auditory training for speech-in-noise identification were examined in 60 young participants. The training conditions were audiovisual training, auditory-only training, and no training (n = 20 each). In the training groups, gated consonants and words were presented at 0 dB signal-to-noise ratio; stimuli were either audiovisual or auditory-only. The no-training group watched a movie clip without performing a speech identification task. Speech-in-noise identification was measured before and after the training (or control activity). Results showed that only audiovisual training improved speech-in-noise identification, demonstrating superiority over auditory-only training.	\N	\N
25113242	This study investigates the influence of rhythmic expectancies on language processing. It is assumed that language rhythm involves an alternation of strong and weak beats within a linguistic domain. Hence, in some contexts rhythmically induced stress shifts occur in order to comply with the Rhythm Rule. In English, this rule operates to prevent clashes of stressed adjacent syllables or lapses of adjacent unstressed syllables. While previous studies investigated effects on speech production and perception, this study focuses on brain responses to structures either obeying or deviating from this rule. Event-related potentials show that rhythmic regularity is relevant for language processing: rhythmic deviations evoked different ERP components reflecting the deviance from rhythmic expectancies. An N400 effect found for shifted items reflects higher costs in lexical processing due to stress deviation. The overall results disentangle lexical and rhythmical influences on language processing and complement the findings of previous studies on rhythmical processing.	\N	\N
25118042	To compare the fitting time requirements and the efficiency in achieving improvements in speech perception during the first 6 months after initial stimulation of computer-assisted fitting with the Fitting to Outcome eXpert (FOX) and a standard clinical fitting procedure. Twenty-seven post-lingually deafened adults, newly implanted recipients of the Advanced Bionics HiRes 90K™ cochlear implant from Germany, the UK, and France took part in a controlled, randomized, clinical study. Speech perception was measured for all participants and fitting times were compared across groups programmed using FOX and conventional programming methods. The fitting time for FOX was significantly reduced at 14 days (P < 0.001) but equivalent over the 6-month period. The groups were not well matched for duration of deafness; therefore, speech perception could not be compared across groups. Despite including more objective measures of performance than a standard fitting approach and the adjustment of a greater range of parameters during initial fitting, FOX did not add to the overall fitting time when compared to the conventional approach. FOX significantly reduced the fitting time in the first 2 weeks and by providing a standard fitting protocol, reduced variability across centres. FOX computer-assisted fitting can be successfully used at switch on, in different clinical environments, reducing fitting time in the first 2 weeks and is efficient at providing a usable program.	\N	\N
25121623	The current study provides evidence that the absence of a syntactically expected item leads to a sustained cognitive processing demand. Event-related potentials were measured at the omission of a syntactically expected object argument in a speech sequence. English monolingual adults listened to paired sentences. The first sentence in the pair established a context. The second sentence provided a response to the first sentence that was either grammatically correct by containing an overt object argument in the form of a pronoun, or was syntactically unacceptable by omitting the expected object pronoun. Event-related potentials measured at the omission of the object argument showed a prolonged positivity for 100-600 ms with a broad scalp distribution, and for 600-1000 ms with a focus in the anterior region. This observed omitted stimulus potential may contain characteristics of the P300 component, associated with the detection of the deviation of an expected stimulus, and the classical P600 related to syntactic reanalysis. Further, the late anterior P600 may indicate an increased memory demand in sentence comprehension. Thus, this linguistic omitted stimulus potential is a cognitive indicator of language processing that can be used to investigate the organization of linguistic knowledge.	\N	\N
25126691	To evaluate the relationship between conductive hearing loss and maxillary constriction. A total of 120 people, aged from 7 to 40 years, who were referred to an audiologist when taking out health insurance or for school pre-registration check-up, were selected for this study. A total of 60 participants who had hearing threshold levels greater than 15 dB in both ears were chosen as the conductive hearing loss group. The remaining 60, with normal hearing thresholds of less than 15 dB, were used as the control group. All participants were referred to an orthodontic clinic. Participants who had a posterior crossbite and high palatal vault were considered to suffer from maxillary constriction. There were no significant differences between the sex ratios and mean ages of the groups. However, participants with conductive hearing loss were 3.5 times more likely than controls to suffer from maxillary constriction. Patients who suffer from conductive hearing loss are likely to show a maxillary abnormality when examined by an orthodontist.	\N	\N
25139422	We investigated global integration (wrap-up) processes at the boundaries of musical phrases by comparing the effects of well and non-well formed phrases on event-related potentials time-locked to two boundary points: the onset and the offset of the boundary pause. The Closure Positive Shift, which is elicited at the boundary offset, was not modulated by the quality of phrase structure (well vs. non-well formed). In contrast, the boundary onset potentials showed different patterns for well and non-well formed phrases. Our results help to specify the functional meaning of the Closure Positive Shift in music, shed light on the large-scale structural integration of musical input, and raise new hypotheses concerning shared resources between music and language.	\N	\N
25150964	Timbre is an important attribute of sound both in music and nature. Previously, using an operant conditioning paradigm, we found that black-capped chickadees and humans show similar response patterns in discriminating triadic chords of the same timbre and transferred this discrimination to a novel key center (novel absolute pitch). The current study examined how varying the timbre of the chords influenced discrimination. Using a similar operant conditioning procedure, we trained humans (Experiment 1) and chickadees (Experiments 2 and 3) to discriminate a major chord from 6 other chord types that had semitone deviations from the major chord. The pattern of errors of the 2 species replicated our previous findings. We then tested participants with novel timbres. We found that humans readily transferred their discrimination to novel timbres, suggesting they were attending to triadic pitch relations. The chickadees failed to transfer to novel timbres, suggesting they were using a different strategy to perform the original chord discrimination. We conducted an acoustic analysis examining frequency ranges that are biologically relevant to chickadees. We found that the relative intensity within each chord of the frequencies used in black-capped chickadee song significantly correlated with chickadees' percent response during probe testing. In Experiment 3, we trained a new set of chickadees by including either expanded pitch or timbre training before testing. Although chickadees showed some transfer to novel chords following this expanded training, we found that neither type of expanded training helped the chickadees when probe tested with novel stimuli.	\N	\N
25151640	Congenital amusia has been described as a lifelong deficit of music perception and production, notably including amusic individuals' difficulty in recognizing a familiar tune without the aid of lyrics. The present study aimed to evaluate whether amusic individuals might have acquired long-term knowledge of familiar music, and to test for the minimal amount of acoustic information necessary to access this knowledge (if any) in amusia. Segments of familiar and unfamiliar instrumental musical pieces were presented with increasing duration (250, 500, 1000 msec etc.), and participants provided familiarity judgments for each segment. Results showed that amusic individuals succeeded in differentiating familiar from unfamiliar excerpts with as little acoustic information as did control participants (i.e., within 500 msec). The findings reveal that amusic individuals have stored musical pieces in long-term memory (LTM), and, together with other recent findings, they suggest that congenital amusia might impair conscious access to music processing rather than music processing per se.	\N	\N
25158372	Cortical auditory-evoked potentials (CAEPs), an objective measure of human speech encoding in individuals with normal or impaired auditory systems, can be used to assess the outcomes of hearing aids and cochlear implants in infants, or in young children who cannot co-operate for behavioural speech discrimination testing. The current study aimed to determine whether naturally produced speech stimuli /m/, /g/ and /t/ evoke distinct CAEP response patterns that can be reliably recorded and differentiated, based on their spectral information and whether the CAEP could be an electrophysiological measure to differentiate between these speech sounds. CAEPs were recorded from 18 school-aged children with normal hearing, tested in two groups: younger (5 - 7 years) and older children (8 - 12 years). Cortical responses differed in their P1 and N2 latencies and amplitudes in response to /m/, /g/ and /t/ sounds (from low-, mid- and high-frequency regions, respectively). The largest amplitude of the P1 and N2 component was for /g/ and the smallest was for /t/. The P1 latency in both age groups did not show any significant difference between these speech sounds. The N2 latency showed a significant change in the younger group but not in the older group. The N2 latency of the speech sound /g/ was always noted earlier in both groups. This study demonstrates that spectrally different speech sounds are encoded differentially at the cortical level, and evoke distinct CAEP response patterns. CAEP latencies and amplitudes may provide an objective indication that spectrally different speech sounds are encoded differently at the cortical level.	\N	\N
25158615	To evaluate methods for measuring long-term benefits of cochlear implantation in a patient with single-sided deafness (SSD) with respect to spatial hearing and to document improved quality of life because of reduced tinnitus. A single adult male with profound right-sided sensorineural hearing loss and normal hearing in the left ear who underwent right-sided cochlear implantation. The subject was evaluated at 6, 9, 12, and 18 months after implantation on speech intelligibility with specific target-masker configurations, sound localization accuracy, audiologic performance, and tinnitus handicap. Testing conditions involved the acoustic (NH) ear only, the cochlear implant (CI) ear (acoustic ear plugged), and the bilateral condition (CI+NH). Measures of spatial hearing included speech intelligibility improvement because of spatial release from masking (SRM) and sound localization. In addition, traditional measures known as "head shadow," "binaural squelch," and "binaural summation" were evaluated. The best indicator for improved speech intelligibility was SRM, in which both ears are activated, but the relative locations of target and masker(s) are manipulated. Measures that compare performance with a single ear to performance using bilateral auditory input indicated evidence of the ability to integrate inputs across the ears, possibly reflecting early binaural processing, with 12 months of bilateral input. Sound localization accuracy improved with addition of the implant, and a large improvement with respect to tinnitus handicap was observed. Cochlear implantation resulted in improved sound localization accuracy when compared with performance using only the NH ear, and reduced tinnitus handicap was observed with use of the implant. The use of SRM addresses some of the current limitations of traditional measures of spatial and binaural hearing, as spatial cues related to target and maskers are manipulated, rather than the ear(s) tested. 
Sound testing methods and calculations described here are therefore recommended for assessing performance of a larger sample size of individuals with SSD who receive a CI.	\N	\N
25167217	Objective To investigate the effect of increasing phase duration (pulse width, T-pulse) using a biphasic pulse composed of an initial anodic active phase followed by a balancing cathodic phase on the electrically evoked auditory brainstem responses (eABRs) recorded at the time of cochlear implantation. Design eABRs recorded during 188 surgeries for cochlear implantation from 1999 to 2006 in a single center were retrospectively reviewed by two independent observers. All patients were fitted with a NEURELEC cochlear implant (CI) device, initially DIGISONIC® then DIGISONIC SP® (2004-2006). Result Immediately following cochlear implantation, stimulation by the CI resulted in reliable wave III and V eABR waveforms (mean wave III latency 2.23 ± 0.38 ms SD and wave V latency 4.28 ± 0.42 ms SD). Latencies followed an apical to basal gradient (0.32 ms increase in mean eV latency and 0.12 ms for eIII latency). With increasing phase duration, wave III and wave V latencies significantly decreased in association with a shortening of the eIII-eV interwave gap, while amplitudes of both waves increased. Conclusion The impact of increasing phase duration on latency and amplitude of brainstem responses in a large set of patients implanted with NEURELEC CIs was reported.	\N	\N
25170794	The neural resonance theory of musical meter explains musical beat tracking as the result of entrainment of neural oscillations to the beat frequency and its higher harmonics. This theory has gained empirical support from experiments using simple, abstract stimuli. However, to date there has been no empirical evidence for a role of neural entrainment in the perception of the beat of ecologically valid music. Here we presented participants with a single pop song with a superimposed bassoon sound. This stimulus was either lined up with the beat of the music or shifted away from the beat by 25% of the average interbeat interval. Both conditions elicited a neural response at the beat frequency. However, although the on-the-beat condition elicited a clear response at the first harmonic of the beat, this frequency was absent in the neural response to the off-the-beat condition. These results support a role for neural entrainment in tracking the metrical structure of real music and show that neural meter tracking can be disrupted by the presentation of contradictory rhythmic cues.	\N	\N
25176617	Verbal memory is a fundamental prerequisite for language learning. This study investigated 7-month-olds' (N = 62) ability to remember the identity and order of elements in a multisyllabic word. The results indicate that infants detect changes in the order of edge syllables, or the identity of the middle syllables, but fail to encode the order of middle syllables. This suggests that the representational format of multisyllabic words is determined by core mnemonic biases, which favor accurate encoding of edges and limit the encoding of temporal order for internal segments. The studies support accounts proposing that content and order are encoded separately; in addition, the data show that this dissociation occurs early in development.	\N	\N
25185802	In two studies based on Stanley Milgram's original pilots, we present the first systematic examination of cyranoids as social psychological research tools. A cyranoid is created by cooperatively joining in real-time the body of one person with speech generated by another via covert speech shadowing. The resulting hybrid persona can subsequently interact with third parties face-to-face. We show that naïve interlocutors perceive a cyranoid to be a unified, autonomously communicating person, evidence for a phenomenon Milgram termed the "cyranic illusion." We also show that creating cyranoids composed of contrasting identities (a child speaking adult-generated words and vice versa) can be used to study how stereotyping and person perception are mediated by inner (dispositional) vs. outer (physical) identity. Our results establish the cyranoid method as a unique means of obtaining experimental control over inner and outer identities within social interactions rich in mundane realism.	\N	\N
25188354	Although active listening is an influential behavior, which can affect the social responses of others, the neural correlates underlying its perception have remained unclear. Sensing active listening in social interactions is accompanied by an improvement in the recollected impressions of relevant experiences and is thought to arouse positive feelings. We therefore hypothesized that the recognition of active listening activates the reward system, and that the emotional appraisal of experiences that had been subject to active listening would be improved. To test these hypotheses, we conducted functional magnetic resonance imaging (fMRI) on participants viewing assessments of their own personal experiences made by evaluators with or without active listening attitude. Subjects rated evaluators who showed active listening more positively. Furthermore, they rated episodes more positively when they were evaluated by individuals showing active listening. Neural activation in the ventral striatum was enhanced by perceiving active listening, suggesting that this was processed as rewarding. It also activated the right anterior insula, representing positive emotional reappraisal processes. Furthermore, the mentalizing network was activated when participants were being evaluated, irrespective of active listening behavior. Therefore, perceiving active listening appeared to result in positive emotional appraisal and to invoke mental state attribution to the active listener.	\N	\N
25190323	In a standard center cueing paradigm, participants are asked to identify a target object presented either to the left or the right of a center cue (e.g., eye gaze, head-turn, arrow, etc.). When the center cue is non-predictive (e.g., the arrow points to the correct location of the target only 50 % of the time), the target can still be identified faster at the validly cued location than at the invalidly cued location. However, the abrupt onset of an object can elicit reflexive attention orientation. It is important to investigate whether this abrupt onset effect interferes with the cueing effect elicited by center cues because this interference effect, if it exists, should be controlled for in order to improve the test validity of the center cueing task. In an attentional cueing paradigm, we examined how the abrupt appearance of an exogenous target object mitigates the influence of center cues involving either a head turn (Experiment 1) or an arrow (Experiment 2). In Experiment 1, a non-predictive head-turn cue was followed by a target object (circle or square) presented in the left or right visual field. In the non-distractor condition, the target object was presented by itself. In this case, it is assumed that the sudden appearance of the target provides an orienting cue to the observer. To equalize the cueing effect of the target object, we presented a competing distractor object (triangle) in the opposite visual field to the target object. The participant's task was to categorize the target object as either a circle or square while ignoring the non-target triangle object in the opposite visual field. In Experiment 2, the arrow version of the cued recognition task was used, in which a single-headed arrow pointed to the object. The results from both experiments showed that both the non-predictive head-turn and arrow cues produced a reliable cueing effect in the distractor and non-distractor conditions. 
However, the magnitude of the cueing effect was greater in the distractor condition than in the non-distractor condition, suggesting that the abrupt onset of the target object acts like an exogenous signal, thereby reducing the impact of the internal head turn and arrow cues.	\N	\N
25190394	Factors that might affect perceptual pitch match between acoustic and electric stimulation were examined in 25 bimodal listeners using magnitude estimation. Pre-operative acoustic thresholds in both ears, and duration of severe-profound loss, were first examined as correlates with degree of match between the measured pitch and that predicted by the spiral ganglion frequency-position model. The degree of match was examined with respect to (1) the ratio between the measured and predicted pitch percept on the most apical electrode and (2) the ratio between the slope of the measured and predicted pitch function. Second, effect of listening experience was examined to assess whether adaptation occurred over time to match the frequency assignment to electrodes. Pre-experience pitch estimates on the apical electrode were within the predicted range in only 28% of subjects, and the slope of the electrical pitch function was lower than predicted in all except one subject. Subjects with poorer hearing tended to have a lower pitch and a shallower electrical pitch function than predicted by the model. Pre-operative hearing thresholds in the contralateral ear and hearing loss duration were not correlated with the degree of pitch match, and there was no significant group effect of listening experience.	\N	\N
25190407	Normal-hearing (NH) listeners make use of context, speech redundancy and top-down linguistic processes to perceptually restore inaudible or masked portions of speech. Previous research has shown poorer perception and restoration of interrupted speech in CI users and NH listeners tested with acoustic simulations of CIs. Three hypotheses were investigated: (1) training with CI simulations of interrupted sentences can teach listeners to use the high-level restoration mechanisms more effectively, (2) phonemic restoration benefit, an increase in intelligibility of interrupted sentences once its silent gaps are filled with noise, can be induced with training, and (3) perceptual learning of interrupted sentences can be reflected in clinical speech audiometry. To test these hypotheses, NH listeners were trained using periodically interrupted sentences, also spectrally degraded with a noiseband vocoder as CI simulation. Feedback was presented by displaying the sentence text and playing back both the intact and the interrupted CI simulation of the sentence. Training induced no phonemic restoration benefit, and learning was not transferred to speech audiometry measured with words. However, a significant improvement was observed in overall intelligibility of interrupted spectrally degraded sentences, with or without filler noise, suggesting possibly better use of restoration mechanisms as a result of training.	\N	\N
25194209	Roles of subcortical structures in language processing are vague, but, interestingly, basal ganglia and thalamic Deep Brain Stimulation can go along with reduced lexical capacities. To deepen the understanding of this impact, we assessed word processing as a function of thalamic versus subthalamic Deep Brain Stimulation. Ten essential tremor patients treated with thalamic and 14 Parkinson's disease patients with subthalamic Deep Brain Stimulation performed an acoustic Lexical Decision Task ON and OFF stimulation. Combined analysis of task performance and event-related potentials allowed the determination of processing speed, priming effects, and N400 as neurophysiological correlate of lexical stimulus processing. Twelve age-matched healthy participants acted as control subjects. Thalamic Deep Brain Stimulation prolonged word decisions and reduced N400 potentials. No comparable ON-OFF effects were present in patients with subthalamic Deep Brain Stimulation. In the latter group of patients with Parkinson's disease, N400 amplitudes were, however, abnormally low, whether under active or inactive Deep Brain Stimulation. In conclusion, performance speed and N400 appear to be influenced by state functions, modulated by thalamic, but not subthalamic Deep Brain Stimulation, compatible with concepts of thalamo-cortical engagement in word processing. Clinically, these findings specify cognitive sequels of Deep Brain Stimulation in a target-specific way.	\N	\N
25196041	Cognitive enhancement resulting from nicotinic acetylcholine receptor stimulation may be evidenced by increased efficiency of the auditory-frontal cortex network of auditory discrimination, which is impaired in schizophrenia, a cognitive disorder associated with excessive tobacco use. Investigating automatic (preattentive) detection of acoustic change with the mismatch negativity (MMN) brain event-related potential in response to nicotine in individuals with varying baseline levels of auditory discrimination may provide useful insight into the cholinergic regulation of this neural network and its potential amelioration with novel nicotinic agents. Sixty healthy, non-smoking male volunteers were presented with an 'optimal' multi-feature MMN paradigm in a randomized, placebo controlled double-blind design with 6 mg of nicotine gum. Participants with low, medium, and high baseline amplitudes responded differently to nicotine (vs. placebo), and nicotine response was feature specific. Whereas MMN in individuals with high amplitudes was diminished by nicotine, MMN increased in those with low amplitudes. Nicotine effects were not shown in medium amplitude participants. These findings provide preliminary support for the role of nicotinic neurotransmission in sensory memory processing of auditory change and suggest that nicotinic receptor modulation can both enhance and diminish change detection, depending on baseline MMN and its eliciting stimulus feature.	\N	\N
25201816	Evidence suggests that deafness-induced changes in visual perception, cognition and attention may compensate for a hearing loss. Such alterations, however, may also negatively influence adaptation to a cochlear implant. This study investigated whether involuntary attentional capture by salient visual stimuli is altered in children who use a cochlear implant. Thirteen experienced implant users (aged 8-16 years) and age-matched normally hearing children were presented with a rapid sequence of simultaneous visual and auditory events. Participants were tasked with detecting numbers presented in a specified color and identifying a change in the tonal frequency whilst ignoring irrelevant visual distractors. Compared to visual distractors that did not possess the target-defining characteristic, target-colored distractors were associated with a decrement in visual performance (response time and accuracy), demonstrating a contingent capture of involuntary attention. Visual distractors did not, however, impair auditory task performance. Importantly, detection performance for the visual and auditory targets did not differ between the groups. These results suggest that proficient cochlear implant users demonstrate normal capture of visuospatial attention by stimuli that match top-down control settings.	\N	\N
25208843	When an action produces an effect, both events are perceived to be shifted in time toward each other. This shift is called Intentional Binding (IB) effect. First evidence shows that this shift does not depend on the statistical predictability of the produced effect's identity (Desantis, Hughes, & Waszak, 2012). We confirm this result by comparing the perceived duration of action-effect intervals before valid and invalid action effects using the method of constant stimuli. The perceived duration of action-effect intervals did not differ for valid and invalid effects. This result was true for different durations of the action-effect interval (Experiments 1-4: 250 ms, Experiments 1 & 2: 400 ms), different effect modalities (Experiments 1 & 3: visual, Experiments 2-4: auditive), and two types of validity variations (Experiments 1 & 2: 80% valid, Experiments 3 & 4: 100% valid vs. random). We validated our results by using a clock paradigm and a numerical duration estimation task (Experiment 4). We conclude that the IB effect is not the result of internal prediction due to action-effect bindings, but might rely on higher-order processes.	\N	\N
25214304	Prior studies of spatial negative priming indicate that distractor-assigned keypress responses are inhibited as part of visual, but not auditory, processing. However, recent evidence suggests that static keypress responses are not directly activated by spatially presented sounds and, therefore, might not call for an inhibitory process. In order to investigate the role of response inhibition in auditory processing, we used spatially directed responses that have been shown to result in direct response activation to irrelevant sounds. Participants localized a target sound by performing manual joystick responses (Experiment 1) or head movements (Experiment 2B) while ignoring a concurrent distractor sound. Relations between prime distractor and probe target were systematically manipulated (repeated vs. changed) with respect to identity and location. Experiment 2A investigated the influence of distractor sounds on spatial parameters of head movements toward target locations and showed that distractor-assigned responses are immediately inhibited to prevent false responding in the ongoing trial. Interestingly, performance in Experiments 1 and 2B was not generally impaired when the probe target appeared at the location of the former prime distractor and required a previously withheld and presumably inhibited response. Instead, performance was impaired only when prime distractor and probe target mismatched in terms of location or identity, which fully conforms to the feature-mismatching hypothesis. Together, the results suggest that response inhibition operates in auditory processing when response activation is provided but is presumably too short-lived to affect responding on the subsequent trial.	\N	\N
25215617	Previous studies of frequency discrimination training (FDT) for tinnitus used repetitive task-based training programmes relying on extrinsic factors to motivate participation. Studies reported limited improvement in tinnitus symptoms. To evaluate FDT exploiting intrinsic motivations by integrating training with computer-gameplay. Sixty participants were randomly assigned to train on either a conventional task-based training, or one of two interactive game-based training platforms over six weeks. Outcomes included assessment of motivation, tinnitus handicap, and performance on tests of attention. Participants reported greater intrinsic motivation to train on the interactive game-based platforms, yet compliance of all three groups was similar (∼ 70%) and changes in self-reported tinnitus severity were not significant. There was no difference between groups in terms of change in tinnitus severity or performance on measures of attention. FDT can be integrated within an intrinsically motivating game. Whilst this may improve participant experience, in this instance it did not translate to additional compliance or therapeutic benefit. ClinicalTrials.gov NCT02095262.	\N	\N
25218167	Knowing the context of a discourse is an essential prerequisite for comprehension. Here we used functional magnetic resonance imaging (fMRI) to disclose brain networks supporting context-dependent speech comprehension. During fMRI, 20 participants listened to 1-min spoken narratives preceded by pictures that were either contextually matching or mismatching with the narrative. Matching pictures increased narrative comprehension, decreased hemodynamic activity in Broca's area, and enhanced its functional connectivity with left anterior superior frontal gyrus, bilateral inferior parietal cortex, as well as anterior and posterior cingulate cortex. Further, the anterior (BA 45) and posterior (BA 44) portions of Broca's area differed in their functional connectivity patterns. Both BA 44 and BA 45 showed increased connectivity with right angular gyrus and supramarginal gyrus. Whereas BA 44 showed increased connectivity with left angular gyrus, left inferior/middle temporal gyrus and left postcentral gyrus, BA 45 showed increased connectivity with right posterior cingulate cortex, right anterior inferior frontal gyrus, lateral occipital cortex and anterior cingulate cortex. Our results suggest that a fronto-parietal functional network supports context-dependent narrative comprehension, and that Broca's area is involved in resolving ambiguity from speech when appropriate contextual cues are lacking.	\N	\N
25223106	A sound's duration provides important information about the event producing it. Although many of the sounds we hear every day are 'percussive' in nature (i.e., resulting from two objects impacting) and therefore exhibit decaying/damped amplitude envelopes, perceptual experiments frequently use tones synthesized with 'flat' or abruptly ending envelopes. Such sounds afford an estimation strategy involving calculating the elapsed time between tone onset and offset, a strategy that would be problematic for ecologically pervasive decaying sounds. Here we compare duration judgments for tones with percussive (i.e., gradually decaying) and flat (i.e., abruptly ending) amplitude envelopes, finding evidence for the use of different strategies. This result is discussed in terms of its implications for dominant theories and models of sensory perception that are often assessed using artificial sounds (i.e., 'flat tones') affording strategies that may not be optimal or even available for everyday listening.	\N	\N
25224031	In human and nonhuman primates, the cortical motor system comprises a collection of brain areas primarily related to motor control. Existing evidence suggests that no other mammalian group has the number, extension, and complexity of motor-related areas observed in the frontal lobe of primates. Such diversity is probably related to the wide behavioral flexibility that primates display. Indeed, recent comparative anatomical, psychophysical, and neurophysiological studies suggest that the evolution of the motor cortical areas closely correlates with the emergence of high cognitive abilities. Advances in understanding the cortical motor system have shown that these areas are also related to functions previously linked to higher-order associative areas. In addition, experimental observations have shown that the classical distinction between perceptual and motor functions is not strictly followed across cortical areas. In this paper, we review evidence suggesting that evolution of the motor system had a role in the shaping of different cognitive functions in primates. We argue that the increase in the complexity of the motor system has contributed to the emergence of new abilities observed in human and nonhuman primates, including the recognition and imitation of the actions of others, speech perception and production, and the execution and appreciation of the rhythmic structure of music.	\N	\N
25226375	We seek to determine the extent of age-related decline in speech perception performance among cochlear implant recipients as quantified by various metrics. Retrospective chart review. Tertiary referral center. The records of 70 postlingually deafened adults who received cochlear implants between 2004 and 2013 were reviewed. Unilateral cochlear implantation. Postoperative AzBio and Consonant-Nucleus-Consonant (CNC) scores at greater than 3 months postactivation. Group analyses comparing patients aged 65 years and older (elderly) with younger adult patients (control). In addition, multivariate linear regression analyses were performed that incorporated preoperative pure-tone audiograms, duration of deafness, duration of follow-up, sex, and laterality of the implanted ear to quantitate the dependence of AzBio and CNC results on age at implantation (AAI). Performance on AzBio for the control and elderly groups were 74.6% ± 4.1% and 59.5% ± 4.5% (p = 0.032), respectively. Performance on CNC scores were 63.9% ± 3.4% and 55.3% ± 3.3% (p = 0.098), respectively. Multiple linear regression showed a significant correlation of AzBio with AAI, whereas CNC did not correlate significantly (correlation coefficients = -0.006 and -0.003, p = 0.019 and 0.081, respectively). Patients implanted at a later age performed more poorly on AzBio sentences. A similar trend was noted with CNC scores although not significant. The variability in correlation coefficients and significance between both speech perception tests and AAI suggests that, as patients age, their performance on each individual test will be affected to a varying degree.	\N	\N
25234731	Mandarin Chinese is a lexical tone language that has four tones, with a change in tone denoting a change in lexical meaning. There are few studies regarding lexical tone identification abilities in deafened children using either cochlear implants (CIs) or hearing aids (HAs). Furthermore, no study has compared the lexical tone identification abilities of deafened children with their hearing devices turned on and off. The present study aimed to investigate the lexical tone identification abilities of deafened children with CIs or HAs. Forty prelingually deafened children (20 with CIs and 20 with HAs) participated in the study. In the HA group, 20 children were binaurally aided. In the CI group, all of the children were unilaterally implanted. All of the subjects completed a computerized lexical tone pairs test with their hearing devices turned on and off. The correct answers of all items were recorded as the total score and the correct answers of the tone pairs were recorded as subtotal scores. No significant differences in the tone pair identification scores were found between the CI group and HA group either with the devices turned on or off (t=1.62, p=0.11; t=1.863, p=0.07, respectively). The scores in the aided condition were higher than in the unaided condition regardless of the device used (t=22.09, p<0.001, in the HA group; t=20.20, p<0.001, in the CI group). Significantly higher scores were found in the tone pairs that contained tone 4. Age at fitting of the devices was correlated with tone identification abilities in both the CI and HA groups. Other demographic factors were not correlated with tone identification ability. The hearing device, whether a hearing aid or cochlear implant, is beneficial for tone identification. The lexical tone identification abilities were similar regardless of whether the subjects wore a HA or CI. Lexical tone pairs with different durations and dissimilar tone contour patterns are more easily identified. 
Receiving devices at earlier age tends to produce better lexical tone identification abilities in prelingually deafened children.	\N	\N
25234885	This study examined the ability of listeners to utilize syntactic structure to extract a target stream of speech from among competing sounds. Target talkers were identified by voice or location, which was held constant throughout a test utterance, and paired with correct or incorrect (random word order) target sentence syntax. Both voice and location provided reliable cues for identifying target speech even when other features varied unpredictably. The target sentences were masked either by predominantly energetic maskers (noise bursts) or by predominantly informational maskers (similar speech in random word order). When the maskers were noise bursts, target sentence syntax had relatively minor effects on identification performance. However, when the maskers were other talkers, correct target sentence syntax resulted in significantly better speech identification performance than incorrect syntax. Furthermore, conformance to correct syntax alone was sufficient to accurately identify the target speech. The results were interpreted as supporting the idea that the predictability of the elements comprising streams of speech, as manifested by syntactic structure, is an important factor in binding words together into coherent streams. Furthermore, these findings suggest that predictability is particularly important for maintaining the coherence of an auditory stream over time under conditions high in informational masking.	\N	\N
25235005	The contribution of recovered envelopes (RENVs) to the utilization of temporal-fine structure (TFS) speech cues was examined in normal-hearing listeners. Consonant identification experiments used speech stimuli processed to present TFS or RENV cues. Experiment 1 examined the effects of exposure and presentation order using 16-band TFS speech and 40-band RENV speech recovered from 16-band TFS speech. Prior exposure to TFS speech aided in the reception of RENV speech. Performance on the two conditions was similar (∼50%-correct) for experienced listeners as was the pattern of consonant confusions. Experiment 2 examined the effect of varying the number of RENV bands recovered from 16-band TFS speech. Mean identification scores decreased as the number of RENV bands decreased from 40 to 8 and were only slightly above chance levels for 16 and 8 bands. Experiment 3 examined the effect of varying the number of bands in the TFS speech from which 40-band RENV speech was constructed. Performance fell from 85%- to 31%-correct as the number of TFS bands increased from 1 to 32. Overall, these results suggest that the interpretation of previous studies that have used TFS speech may have been confounded with the presence of RENVs.	\N	\N
25243615	It is well established that categorising the emotional content of facial expressions may differ depending on contextual information. Whether this malleability is observed in the auditory domain and in genuine emotion expressions is poorly explored. We examined the perception of authentic laughter and crying in the context of happy, neutral and sad facial expressions. Participants rated the vocalisations on separate unipolar scales of happiness and sadness and on arousal. Although they were instructed to focus exclusively on the vocalisations, consistent context effects were found: For both laughter and crying, emotion judgements were shifted towards the information expressed by the face. These modulations were independent of response latencies and were larger for more emotionally ambiguous vocalisations. No effects of context were found for arousal ratings. These findings suggest that the automatic encoding of contextual information during emotion perception generalises across modalities, to purely non-verbal vocalisations, and is not confined to acted expressions.	\N	\N
25247311	Neonatal necrotizing enterocolitis (NEC) is associated with an increased incidence of poor neurodevelopment. The knowledge of underlying neurophysiology is very limited, and the influence of NEC on the preterm brainstem is very poorly understood. This study aimed to assess the effect of NEC on the immature auditory brainstem while excluding any possible confounding effect of preterm birth. We recorded and analyzed brainstem auditory evoked response (BAER) at different click rates in preterm babies (30-34 weeks gestation) after NEC. The results were compared with those in age-matched healthy preterm babies who had no NEC. At click rate 21/s, the latencies of BAER waves I and III in the preterm NEC babies were similar to those in babies without NEC. However, wave V latency was longer in the NEC babies than in those without NEC. The I-V interpeak interval was also longer in the NEC babies than in those without NEC. These abnormalities persisted at the higher click rates of 51 and 91/s. Wave I amplitude in the preterm NEC babies did not differ significantly from that in those without NEC, but wave III and V amplitudes were smaller than in those without NEC at all click rates (21-91/s). Compared with healthy preterm babies, preterm babies after NEC showed a major increase in wave V latency and I-V interval at all click rates (21-91/s). Brainstem auditory function is impaired in preterm NEC babies after excluding the possible confounding effect of preterm birth. Neonatal NEC and associated perinatal conditions adversely affect the premature brainstem.	\N	\N
25248101	Using an auditory variant of task switching, we examined the ability to intentionally switch attention in a dichotic-listening task. In our study, participants responded selectively to one of two simultaneously presented auditory number words (spoken by a female and a male, one for each ear) by categorizing its numerical magnitude. The mapping of gender (female vs. male) and ear (left vs. right) was unpredictable. The to-be-attended feature for gender or ear, respectively, was indicated by a visual selection cue prior to auditory stimulus onset. In Experiment 1, explicitly cued switches of the relevant feature dimension (e.g., from gender to ear) and switches of the relevant feature within a dimension (e.g., from male to female) occurred in an unpredictable manner. We found large performance costs when the relevant feature switched, but switches of the relevant feature dimension incurred only small additional costs. The feature-switch costs were larger in ear-relevant than in gender-relevant trials. In Experiment 2, we replicated these findings using a simplified design (i.e., only within-dimension switches with blocked dimensions). In Experiment 3, we examined preparation effects by manipulating the cueing interval and found a preparation benefit only when ear was cued. Together, our data suggest that the large part of attentional switch costs arises from reconfiguration at the level of relevant auditory features (e.g., left vs. right) rather than feature dimensions (ear vs. gender). Additionally, our findings suggest that ear-based target selection benefits more from preparation time (i.e., time to direct attention to one ear) than gender-based target selection.	\N	\N
25255036	In human face-to-face communication, language comprehension is a multi-modal, situated activity. However, little is known about how we combine information from different modalities during comprehension, and how perceived communicative intentions, often signaled through visual signals, influence this process. We explored this question by simulating a multi-party communication context in which a speaker alternated her gaze between two recipients. Participants viewed speech-only or speech+gesture object-related messages when being addressed (direct gaze) or unaddressed (gaze averted to other participant). They were then asked to choose which of two object images matched the speaker's preceding message. Unaddressed recipients responded significantly more slowly than addressees for speech-only utterances. However, perceiving the same speech accompanied by gestures sped unaddressed recipients up to a level identical to that of addressees. That is, when unaddressed recipients' speech processing suffers, gestures can enhance the comprehension of a speaker's message. We discuss our findings with respect to two hypotheses attempting to account for how social eye gaze may modulate multi-modal language comprehension.	\N	\N
25261772	Amazing progress has been made in providing useful hearing to hearing-impaired individuals using cochlear implants, but challenges remain. One such challenge is understanding the effects of partial degeneration of the auditory nerve, the target of cochlear implant stimulation. Here we review studies from our human and animal laboratories aimed at characterizing the health of the implanted cochlea and the auditory nerve. We use the data on cochlear and neural health to guide rehabilitation strategies. The data also motivate the development of tissue-engineering procedures to preserve or build a healthy cochlea and improve performance obtained by cochlear implant recipients or eventually replace the need for a cochlear implant. This article is part of a Special Issue entitled <Lasker Award>.	\N	\N
25263528	Although children can use social categories to intelligently select informants, children's preference for in-group informants has not been consistently demonstrated across age and context. This research clarifies the extent to which children use social categories to guide learning by presenting participants with a live or video-recorded action demonstration by a linguistic in-group and/or out-group model. Participants' (N = 104) propensity to imitate these actions was assessed. Nineteen-month-olds did not selectively imitate the actions of the in-group model in live contexts, though in-group preferences were found after watching the demonstration on video. Three-year-olds selectively imitated the actions demonstrated by the in-group member regardless of context. These results indicate that in-group preferences have a more nuanced effect on social learning than previous research has indicated.	\N	\N
25269621	Despite the fact that no invariant acoustic property corresponds to a single stop consonant coupled with different vowels (e.g., [da], [de], and [du]), adults effortlessly identify the same consonant embedded in different syllables. In so doing, they solve the invariance problem. Can 3- and 6-month-olds solve it as well? To answer this question, we developed a novel methodology based on pupillometry. In Experiment 1, we demonstrated for the first time that infants are sensitive to the distinction between frequent and infrequent acoustic stimuli, showing greater pupil dilation in response to infrequent stimuli. Building on this effect, in Experiment 2, we showed that 6-month-olds, but not 3-month-olds, solve the invariance problem. Moreover, this ability develops before, and therefore independently of, the ability to produce well-formed syllables.	\N	\N
25275862	To compare within-subject bilateral-binaural and bimodal complementary abilities between bimodal (cochlear implant and hearing aid; CI/HA) and bilateral CI hearing (CI/CI), thereby enabling better-informed counseling of experienced CI/HA users contemplating contralateral implantation. Comparative within-subject case review. Outpatient hearing clinic. Ten experienced adult CI/HA users with severe-to-profound hearing loss in the HA ear, who converted to CI/CI between 2 and 11 years after initial implantation. Task-specific testing of bilateral-binaural hearing (sound lateralization, binaural summation/redundancy/unmasking, head-shadow effect), bimodal complementary benefit (contribution of low-frequency information), and a self-report Speech, Spatial, and Qualities of Hearing (SSQ) questionnaire, all before and 1 year after contralateral cochlear implantation. Test result differences between CI/HA and CI/CI conditions. CI/CI hearing was better than CI/HA for speech lateralization and for perception of semantically unpredictable sentences in speech noise with speech at 0 degrees and noise at +90 degrees azimuth on the old CI side. CI/HA was better than CI/CI only for differences between perception of natural prosody speech and of speech with flattened fundamental frequency (F0) contour with speech and noise in front (at 0 degrees azimuth). Total scores on the SSQ questionnaire were higher in CI/CI than in CI/HA users. Counseling regarding contralateral implantation for CI/HA users with severe-to-profound hearing loss in the HA ear, though generally positive, should consider individual functional needs, and cover expectations about the expected trade-off between gaining improved understanding and speech lateralization in challenging listening conditions and losing some low-frequency cues still available with CI/HA hearing.	\N	\N
25280122	Both psychological stress and predictive signals relating to expected sensory input are believed to influence perception, an influence which, when disrupted, may contribute to the generation of auditory hallucinations. The effect of stress and semantic expectation on auditory perception was therefore examined in healthy participants using an auditory signal detection task requiring the detection of speech from within white noise. Trait anxiety was found to predict the extent to which stress influenced response bias, resulting in more anxious participants adopting a more liberal criterion, and therefore experiencing more false positives, when under stress. While semantic expectation was found to increase sensitivity, its presence also generated a shift in response bias towards reporting a signal, suggesting that the erroneous perception of speech became more likely. These findings provide a potential cognitive mechanism that may explain the impact of stress on hallucination-proneness, by suggesting that stress has the tendency to alter response bias in highly anxious individuals. These results also provide support for the idea that top-down processes such as those relating to semantic expectation may contribute to the generation of auditory hallucinations.	\N	\N
25281311	Interactions between ourselves and the external world are mediated by a multisensory representation of the space surrounding the body, i.e. the peripersonal space (PPS). In particular, a special interplay is observed among tactile stimuli delivered on a body part, e.g. the hand, and visual or auditory external inputs presented close to, but not far from, the same body part, e.g. within hand PPS. This coding of multisensory stimuli as a function of their distance from the hand has a role in upper limb actions. However, it remains unclear whether PPS representation affects the motor system only when stimuli occur specifically at the hand location or when they move within a continuous portion of space where the hand can potentially act. Here, in order to study these two alternative hypotheses, we assessed the critical distance at which moving sounds have a direct effect on hand corticospinal excitability by using Transcranial Magnetic Stimulation (TMS). Specifically, TMS single pulses were delivered when a sound source was perceived at six different positions in space: from very close to subjects' hand (15 cm) to far away (90 cm). Moreover, sound direction was manipulated to test if stimuli approaching and receding from the hand might have the same relevance for the motor system. MEP amplitude was enhanced when sounds were delivered within a limited distance from the hand (around 60 cm) as compared to when the sounds were beyond this space. This effect captures the spatial boundaries within which PPS representation modulates hand cortico-motor excitability. This spatially-dependent modulation of corticospinal activity was not further affected by the sound direction. Such findings support a strict link between the multisensory representation of the space around the body and the motor representation of potential approaching or defensive acts within that space.	\N	\N
25282057	A fundamental question in language development is how infants start to assign meaning to words. Here, using three Electroencephalogram (EEG)-based measures of brain activity, we establish that preverbal 11-month-old infants are sensitive to the non-arbitrary correspondences between language sounds and concepts, that is, to sound symbolism. In each trial, infant participants were presented with a visual stimulus (e.g., a round shape) followed by a novel spoken word that either sound-symbolically matched ("moma") or mismatched ("kipi") the shape. Amplitude increase in the gamma band showed perceptual integration of visual and auditory stimuli in the match condition within 300 msec of word onset. Furthermore, phase synchronization between electrodes at around 400 msec revealed intensified large-scale, left-hemispheric communication between brain regions in the mismatch condition as compared to the match condition, indicating heightened processing effort when integration was more demanding. Finally, event-related brain potentials showed an increased adult-like N400 response - an index of semantic integration difficulty - in the mismatch as compared to the match condition. Together, these findings suggest that 11-month-old infants spontaneously map auditory language onto visual experience by recruiting a cross-modal perceptual processing system and a nascent semantic network within the first year of life.	\N	\N
25301567	Previous research has shown that the human auditory system continuously monitors its acoustic environment, detecting a variety of irregularities (e.g., deviance from prior stimulation regularity in pitch, loudness, duration, and (perceived) sound source location). Detection of irregularities can be inferred from a component of the event-related brain potential (ERP), referred to as the mismatch negativity (MMN), even in conditions in which participants are instructed to ignore the auditory stimulation. The current study extends previous findings by demonstrating that auditory irregularities brought about by a change in room acoustics elicit an MMN in a passive oddball protocol (acoustic stimuli with differing room acoustics, that were otherwise identical, were employed as standard and deviant stimuli), in which participants watched a fiction movie (silent with subtitles). The majority of participants reported no awareness of any changes in the auditory stimulation; only one out of 14 participants reported becoming aware of changing room acoustics or sound source location. Together, these findings suggest automatic monitoring of room acoustics.	\N	\N
25305712	The pathophysiology of nonfluent primary progressive aphasia (nfvPPA) remains poorly understood. Here, we quantitatively compared speech parameters in patients with nfvPPA versus healthy older individuals under altered auditory feedback, which has been shown to modulate normal speech output. Patients (n=15) and healthy volunteers (n=17) were recorded while reading aloud under delayed auditory feedback [DAF] with latency 0, 50 or 200 ms and under DAF at 200 ms plus 0.5 octave upward pitch shift. DAF in healthy older individuals was associated with reduced speech rate and emergence of speech sound errors, particularly at latency 200 ms. Up to a third of the healthy older group under DAF showed speech slowing and frequency of speech sound errors within the range of the nfvPPA cohort. Our findings suggest that (in addition to any anterior, primary language output disorder) these key features of nfvPPA may reflect distorted speech input signal processing, as simulated by DAF. DAF may constitute a novel candidate pathophysiological model of posterior dorsal cortical language pathway dysfunction in nfvPPA.	\N	\N
25306203	Reduced auditory P300 amplitude is a robust schizophrenia deficit exhibiting the qualities of a viable genetic endophenotype. These include heritability, test-retest reliability, and trait-like stability. Recent evidence suggests that P300 may also serve as a predictive biomarker for transition to psychosis during the schizophrenia prodrome. Historically, the utility of the P300 has been limited by its clinical nonspecificity, cross-site measurement variability, and required EEG expertise. The Consortium on the Genetics of Schizophrenia (COGS-2) study provided an opportunity to examine the consistency of the measure across multiple sites with varying degrees of EEG experience, and to identify important modulating factors that contribute to measurement variability. Auditory P300 was acquired from 649 controls and 587 patients at 5 sites. An overall patient deficit was observed with effect size 0.62. Each site independently observed a significant patient deficit, but site differences also existed. In patients, site differences reflected clinical differences in positive symptomatology and functional capacity. In controls, site differences reflected differences in racial stratification, smoking and substance use history. These factors differentially suppressed the P300 response, but only in control subjects. This led to an attenuated patient-control difference among smokers and among African Americans with history of substance use. These findings indicate that the P300 can be adequately assessed quantitatively, across sites, without substantial EEG expertise. Measurements are suitable for both genetic endophenotype analyses and studies of psychosis risk and conversion. However, careful attention must be given to selection of appropriate comparison samples to avoid misleading false negative results.	\N	\N
25313714	Perception of spoken language requires attention to acoustic as well as visible phonetic information. This article reviews the known differences in audiovisual speech perception in children with autism spectrum disorders (ASD) and specifies the need for interventions that address this construct. Elements of an audiovisual training program are described. This researcher-developed program delivered via an iPad app presents natural speech in the context of increasing noise, but supported with a speaking face. Children are cued to attend to visible articulatory information to assist in perception of the spoken words. Data from four children with ASD ages 8-10 are presented showing that the children improved their performance on an untrained auditory speech-in-noise task.	\N	\N
25319676	Subjective tinnitus is the perception of sound in the absence of a corresponding external sound for which there is no known medical etiology. For a minority of individuals with tinnitus, the condition impacts their ability to lead a normal lifestyle and is severely debilitating. There is no known cure for tinnitus, so current therapy focuses on reducing the effect of tinnitus on the patient's quality of life. Tinnitus retraining therapy (TRT) uses nonpsychiatric tinnitus-specific educational counseling and sound therapy in a habituation-based protocol to reduce the patient's tinnitus-evoked negative reaction to, and awareness of, the tinnitus, with the ultimate goal of reducing the tinnitus impact on the patient's quality of life. Some studies support the efficacy of TRT, but no trial to date has compared TRT with the current standard of care or evaluated the separate contributions of TRT counseling and sound therapy. The Tinnitus Retraining Therapy Trial (TRTT) is a randomized, double-blind, placebo-controlled, multicenter trial for individuals with intolerable tinnitus. The TRTT is enrolling active-duty and retired military personnel and their dependents with functionally adequate hearing sensitivity and severe tinnitus at US Air Force, Navy, and Army medical centers. Eligible study participants are randomized to TRT, partial TRT, or standard care to determine the efficacy of TRT and its components (TRT counseling and sound therapy). The primary outcome is change in score on the Tinnitus Questionnaire assessed longitudinally between baseline and follow-up (3, 6, 12, and 18 months following treatment). Secondary outcomes include subscale score changes in the Tinnitus Questionnaire, overall and subscale score changes in the Tinnitus Functional Index and Tinnitus Handicap Inventory, and change in the visual analog scale of the TRT Interview Form. Audiological outcomes include tinnitus pitch and loudness match and measures of loudness discomfort levels. The incidence of depression as a safety measure is assessed at each visit using the Beck Depression Inventory Fast Screen. Clinicaltrials.gov NCT01177137.	\N	\N
25324150	Recent literature reviews have highlighted the need to better understand the relation between speaker and listener behavior when teaching learners with autism spectrum disorders (ASD). The current study used a modified parallel-treatments design to compare directly the degree to which tact and listener behavior emerged during instruction in the opposite relation for 4 children with ASD. Results showed tact training to be either equally or more efficient than listener training for all participants. However, varied patterns of emergent responding across participants indicate a need for further research. Data on collateral responding during instruction did not suggest that the presence or absence of overt collateral behaviors were predictive of emergence. The results highlight the importance for clinicians and educators to assess emergent tact and listener repertoires periodically.	\N	\N
25324726	Our concepts of sound localization in the vertebrate brain are widely based on the general assumption that both the ability to detect air-borne sounds and the neuronal processing are homologous in archosaurs (present day crocodiles and birds) and mammals. Yet studies repeatedly report conflicting results on the neuronal circuits and mechanisms, in particular the role of inhibition, as well as the coding strategies between avian and mammalian model systems. Here we argue that mammalian and avian phylogeny of spatial hearing is characterized by a convergent evolution of hearing air-borne sounds rather than by homology. In particular, the different evolutionary origins of tympanic ears and the different availability of binaural cues in early mammals and archosaurs imposed distinct constraints on the respective binaural processing mechanisms. The role of synaptic inhibition in generating binaural spatial sensitivity in mammals is highlighted, as it reveals a unifying principle of mammalian circuit design for encoding sound position. Together, we combine evolutionary, anatomical and physiological arguments for making a clear distinction between mammalian processing mechanisms and coding strategies and those of archosaurs. We emphasize that a consideration of the convergent nature of neuronal mechanisms will significantly increase the explanatory power of studies of spatial processing in both mammals and birds.	\N	\N
25325783	Visual crowding is generally thought to affect recognition mostly or only at the level of feature combination. Calling this assertion into question, recent studies have shown that if a target object and its flankers belong to different categories crowding is weaker than if they belong to the same category. Nevertheless, these results can be explained in terms of featural differences between categories. The current study tests if category-level (i.e., high-level) interference in crowding occurs when featural differences are controlled for. First, replicating previous results, we found lower critical spacing for targets and flankers belonging to different categories. Second, we observed the same, albeit weaker, category-specific effect when objects in both categories had the exact same feature set, suggesting that category-specific effects persist even when featural differences are fully controlled for. Third, we manipulated the semantic content of the flankers while keeping their feature set constant, by using upright or rotated objects, and found that meaning modulated crowding. An exclusively feature-based account of crowding would predict no differences due to such changes in meaning. We conclude that crowding results from not only the well-documented feature-level interactions but also additional interactions at a level where objects are grouped by meaning.	\N	\N
25326606	What is the perceptual fate of invisible stimuli: are they processed at all, and does their processing have consequences for the perception of other stimuli? As has been shown previously in the somatosensory system, even stimuli that are too weak to be consciously detected can influence our perception: Subliminal stimulation impairs perception of near-threshold stimuli and causes a functional deactivation in the somatosensory cortex. In a recent study, we showed that subliminal visual stimuli lead to similar responses, indicated by an increase in alpha-band power as measured with electroencephalography (EEG). In the current study, we investigated whether a behavioral inhibitory mechanism also exists within the visual system. We tested the detection of peripheral visual target stimuli under three different conditions: Target stimuli were presented alone or embedded in a concurrent train of subliminal stimuli either at the same location as the target or in the opposite hemifield. Subliminal stimuli were invisible due to their low contrast, not due to a masking procedure. We demonstrate that target detection was impaired by the subliminal stimuli, but only when they were presented at the same location as the target. This finding indicates that subliminal, low-intensity stimuli induce a similar inhibitory effect in the visual system as has been observed in the somatosensory system. In line with previous reports, we propose that the function underlying this effect is the inhibition of spurious noise by the visual system.	\N	\N
25332098	Development and evolution of auditory hindbrain nuclei are two major unsolved issues in hearing research. Recent characterization of transgenic mice identified the rhombomeric origins of mammalian auditory nuclei and unraveled genes involved in their formation. Here, we provide an overview on these data by assembling them into rhombomere-specific gene regulatory networks (GRNs), as they underlie developmental and evolutionary processes. To explore evolutionary mechanisms, we compare the GRNs operating in the mammalian auditory hindbrain with data available from the inner ear and other vertebrate groups. Finally, we propose that the availability of genomic sequences from all major vertebrate taxa and novel genetic techniques for non-model organisms provide an unprecedented opportunity to investigate development and evolution of the auditory hindbrain by comparative molecular approaches. The dissection of the molecular mechanisms leading to auditory structures will also provide an important framework for auditory processing disorders, a clinical problem difficult to tackle so far. These data will, therefore, foster basic and clinical hearing research alike.	\N	\N
25344346	It is increasingly recognized that motor routines dynamically shape the processing of sensory inflow (e.g., when hand movements are used to feel a texture or identify an object). In the present research, we captured the shaping of auditory perception by movement in humans by taking advantage of a specific context: music. Participants listened to a repeated rhythmical sequence before and after moving their bodies to this rhythm in a specific meter. We found that the brain responses to the rhythm (as recorded with electroencephalography) after body movement were significantly enhanced at frequencies related to the meter to which the participants had moved. These results provide evidence that body movement can selectively shape the subsequent internal representation of auditory rhythms.	\N	\N
25346316	In the United States, falls are the leading cause of accidental deaths in adults aged over 65 years. Epidemiologic studies indicate that there is a correlation between hearing loss and the risk of falling among older people. The vestibular, proprioceptive, and visual systems are known to contribute to postural stability, but the contribution of audition to maintaining balance has not yet been determined. Cross-sectional study to measure postural stability in bilateral hearing-aid users aged over 65 years in aided and unaided conditions. Balance was assessed using the Romberg on foam test and the tandem stance test. Tests were administered in the presence of a point-source broadband white-noise sound (0-4 kHz) in both unaided and aided conditions in the dark. Subjective measures of balance were made using the Activities-specific Balance Confidence Scale. Performance was significantly better in the aided than the unaided condition (P = 0.005 for both tests). No statistically significant relationship between improvement in balance and hearing was identified. Participants did not report that they perceived a difference in balance between the two conditions. These results indicate that hearing aids are a novel treatment modality for imbalance in older adults with hearing loss and suggest that wearing hearing aids may offer a significant public-health benefit for avoiding falls in this population.	\N	\N
25350757	This study used event-related potentials (ERPs) to assess effects of low-level prenatal lead exposure on auditory recognition memory in 2-month-old infants. Infants were divided into four groups according to cord-blood lead concentration: (1) <2.00 μg/dL, (2) 2.00-2.99 μg/dL, (3) 3.0-3.7 μg/dL, and (4) ≥3.7 μg/dL. The first group showed the normally expected differences in P2, P750, and late slow wave (LSW) amplitudes elicited by mothers' and strangers' voices. These differences were not observed for one or more ERP components in the other groups. Thus, there was electrophysiological evidence of poorer auditory recognition memory at 2 months with cord-blood lead ≥2.00 μg/dL.	\N	\N
25358027	To evaluate a speech-processing strategy in which the lowest frequency channel is conveyed using an asymmetric pulse shape and "phantom stimulation", where current is injected into one intra-cochlear electrode and where the return current is shared between an intra-cochlear and an extra-cochlear electrode. This strategy is expected to provide more selective excitation of the cochlear apex, compared to a standard strategy where the lowest-frequency channel is conveyed by symmetric pulses in monopolar mode. In both strategies all other channels were conveyed by monopolar stimulation. Within-subjects comparison between the two strategies. Four experiments: (1) discrimination between the strategies, controlling for loudness differences, (2) consonant identification, (3) recognition of lowpass-filtered sentences in quiet, (4) sentence recognition in the presence of a competing speaker. Eight users of the Advanced Bionics CII/Hi-Res 90k cochlear implant. Listeners could easily discriminate between the two strategies but no consistent differences in performance were observed. The proposed method does not improve speech perception, at least in the short term.	\N	\N
25358716	Information processing of all acoustic stimuli involves temporal lobe regions referred to as auditory cortices, which receive direct afferents from the auditory thalamus. However, the perception of music (as well as speech or spoken language) is a complex process that also involves secondary and association cortices that form a large functional network. Using different analytical techniques and stimulation paradigms, several studies have shown that certain areas are particularly sensitive to specific acoustic characteristics inherent to music (e.g., rhythm). This chapter reviews the functional anatomy of the auditory cortices, and highlights specific experiments that suggest the existence of distinct cortical networks for the perception of music and speech.	\N	\N
25373970	Otoacoustic emission (OAE) tests of the medial-olivocochlear reflex (MOCR) in humans were assessed for viability as clinical assays. Two reflection-source OAEs [TEOAEs: transient-evoked otoacoustic emissions evoked by a 47 dB sound pressure level (SPL) chirp; and discrete-tone SFOAEs: stimulus-frequency otoacoustic emissions evoked by 40 dB SPL tones, and assessed with a 60 dB SPL suppressor] were compared in 27 normal-hearing adults. The MOCR elicitor was a 60 dB SPL contralateral broadband noise. An estimate of MOCR strength, MOCR%, was defined as the vector difference between OAEs measured with and without the elicitor, normalized by OAE magnitude (without elicitor). An MOCR was reliably detected in most ears. Within subjects, MOCR strength was correlated across frequency bands and across OAE type. The ratio of across-subject variability to within-subject variability ranged from 2 to 15, with wideband TEOAEs and averaged SFOAEs giving the highest ratios. MOCR strength in individual ears was reliably classified into low, normal, and high groups. SFOAEs using 1.5 to 2 kHz tones and TEOAEs in the 0.5 to 2.5 kHz band gave the best statistical results. TEOAEs had more clinical advantages. Both assays could be made faster for clinical applications, such as screening for individual susceptibility to acoustic trauma in a hearing-conservation program.	\N	\N
25385771	The ability to perceive a regular beat in music and synchronize to this beat is a widespread human skill. Fundamental to musical behaviour, beat and meter refer to the perception of periodicities while listening to musical rhythms and often involve spontaneous entrainment to move on these periodicities. Here, we present a novel experimental approach inspired by the frequency-tagging approach to understand the perception and production of rhythmic inputs. This approach is illustrated here by recording the human electroencephalogram responses at beat and meter frequencies elicited in various contexts: mental imagery of meter, spontaneous induction of a beat from rhythmic patterns, multisensory integration and sensorimotor synchronization. Collectively, our observations support the view that entrainment and resonance phenomena subtend the processing of musical rhythms in the human brain. More generally, they highlight the potential of this approach to help us understand the link between the phenomenology of musical beat and meter and the bias towards periodicities arising under certain circumstances in the nervous system. Entrainment to music provides a highly valuable framework to explore general entrainment mechanisms as embodied in the human brain.	\N	\N
25385777	Physiological rhythms are sensitive to social interactions and could contribute to defining social rhythms. Nevertheless, our knowledge of the implications of breathing in conversational turn exchanges remains limited. In this paper, we addressed the idea that breathing may contribute to timing and coordination between dialogue partners. The relationships between turns and breathing were analysed in unconstrained face-to-face conversations involving female speakers. No overall relationship between breathing and turn-taking rates was observed, as breathing rate was specific to the subjects' activity in dialogue (listening versus taking the turn versus holding the turn). A general inter-personal coordination of breathing over the whole conversation was not evident. However, specific coordinative patterns were observed in shorter time-windows when participants engaged in taking turns. The type of turn-taking had an effect on the respective coordination in breathing. Most of the smooth and interrupted turns were taken just after an inhalation, with specific profiles of alignment to partner breathing. Unsuccessful attempts to take the turn were initiated late in the exhalation phase and with no clear inter-personal coordination. Finally, breathing profiles at turn-taking were different than those at turn-holding. The results support the idea that breathing is actively involved in turn-taking and turn-holding.	\N	\N
25401380	Interference between a target and simultaneous maskers occurs both at the cochlear level through energetic masking and more centrally through informational masking (IM). Hence, quantifying the amount of IM requires a strict control of the energetic component. Presenting target and maskers on different sides (i.e., dichotically) reduces energetic masking but provides listeners with important lateralization cues that also drastically reduce IM. The main purpose of this study (Experiment 1) was to evaluate a "switch" manipulation aiming at restoring most of the IM despite dichotic listening. Experiment 2 was designed to investigate the source of the difficulty induced by this switching dichotic condition. In Experiment 1, the authors presented 60 normal-hearing young adults with a detection task in which a regularly repeating target was embedded in a randomly varying background masker. The authors evaluated spatial masking release induced by three different dichotic listening conditions in comparison with a diotic baseline. Dichotic stimuli were presented in either a nonswitching or a switching condition. In the latter case, the presentation sides of dichotic target and maskers alternated several times throughout 10 sec sequences. The impact of the number of switches on IM was investigated parametrically, with both pure and complex tone sequences. In Experiment 2, the authors compared performance of 13 young, normal-hearing listeners in a monotic and dichotic version of the rapidly switching condition, using pure-tone sequences. When target and maskers switched rapidly within sequences, IM was significantly stronger than in nonswitching dichotic sequences and was comparable with the masking effect induced by diotic sequences. Furthermore, Experiment 2 suggests that rapidly switching target and maskers prevent listeners from relying on lateralization cues inherent to the dichotic condition, hence preserving important amounts of IM. This paradigm thus provides an original tool to isolate IM in signal and maskers having overlapping spectra.	\N	\N
25412406	Speech comprehension studies have generally focused on the isolation and function of regions with positive blood oxygen level dependent (BOLD) signals with respect to a resting baseline. Although regions with negative BOLD signals in comparison to a resting baseline have been reported in language-related tasks, their relationship to regions of positive signals is not fully appreciated. Based on the emerging notion that the negative signals may represent an active function in language tasks, the authors test the hypothesis that negative BOLD signals during receptive language are more associated with comprehension than content-free versions of the same stimuli. Regions associated with comprehension of speech were isolated by comparing responses to passive listening to natural speech to two incomprehensible versions of the same speech: one that was digitally time reversed and one that was muffled by removal of high frequencies. The signal polarity was determined by comparing the BOLD signal during each speech condition to the BOLD signal during a resting baseline. As expected, stimulation-induced positive signals relative to resting baseline were observed in the canonical language areas with varying signal amplitudes for each condition. Negative BOLD responses relative to resting baseline were observed primarily in frontoparietal regions and were specific to the natural speech condition. However, the BOLD signal remained indistinguishable from baseline for the unintelligible speech conditions. Variations in connectivity between brain regions with positive and negative signals were also specifically related to the comprehension of natural speech. These observations of anticorrelated signals related to speech comprehension are consistent with emerging models of cooperative roles represented by BOLD signals of opposite polarity.	\N	\N
25415467	To determine differences in speech perception outcomes for patients who received a CI422 and a Contour cochlear implant. Retrospective case review. Tertiary referral center. Thirty-two adults who underwent cochlear implantation. Cochlear implantation using a CI422 or Contour device. Bamford-Kowal-Bench (BKB) speech perception scores at 3 and 9 months after activation. The mean BKB scores at 3 months for the CI422 device were 86.0% in quiet and 55.1% in noise. This compares with 86.0% in quiet and 62.3% in noise for the Contour device. At 9 months, the mean BKB scores were 85.9% in quiet and 67.1% in noise for the CI422 and 90.1% in quiet and 77.6% in noise for the Contour device. There was no statistically significant difference (p > 0.05) between speech perception outcomes at 3 or 9 months. This study suggests that CI422 and Contour electrode both improve speech perception outcomes postoperatively, and there does not appear to be any significant difference in outcome between the two types of devices.	\N	\N
25415938	Embodied music cognition stresses the role of the human body as mediator for the encoding and decoding of musical expression. In this paper, we set up a low dimensional functional model that accounts for 70% of the variability in the expressive body movement responses to music. With the functional principal component analysis, we modeled individual body movements as a linear combination of a group average and a number of eigenfunctions. The group average and the eigenfunctions are common to all subjects and make up what we call the commonalities. An individual performance is then characterized by a set of scores (the individualities), one score per eigenfunction. The model is based on experimental data which finds high levels of coherence/consistency between participants when grouped according to musical education. This shows an ontogenetic effect. Participants without formal musical education focus on the torso for the expression of basic musical structure (tempo). Musically trained participants decode additional structural elements in the music and focus on body parts having more degrees of freedom (such as the hands). Our results confirm earlier studies that different body parts move differently along with the music.	\N	\N
25421408	Individuals lip read themselves more accurately than they lip read others when only the visual speech signal is available (Tye-Murray et al., Psychonomic Bulletin & Review, 20, 115-119, 2013). This self-advantage for vision-only speech recognition is consistent with the common-coding hypothesis (Prinz, European Journal of Cognitive Psychology, 9, 129-154, 1997), which posits (1) that observing an action activates the same motor plan representation as actually performing that action and (2) that observing one's own actions activates motor plan representations more than the others' actions because of greater congruity between percepts and corresponding motor plans. The present study extends this line of research to audiovisual speech recognition by examining whether there is a self-advantage when the visual signal is added to the auditory signal under poor listening conditions. Participants were assigned to sub-groups for round-robin testing in which each participant was paired with every member of their subgroup, including themselves, serving as both talker and listener/observer. On average, the benefit participants obtained from the visual signal when they were the talker was greater than when the talker was someone else and also was greater than the benefit others obtained from observing as well as listening to them. Moreover, the self-advantage in audiovisual speech recognition was significant after statistically controlling for individual differences in both participants' ability to benefit from a visual speech signal and the extent to which their own visual speech signal benefited others. These findings are consistent with our previous finding of a self-advantage in lip reading and with the hypothesis of a common code for action perception and motor plan representation.	\N	\N
25436670	Sound waves emitted by two or more simultaneous sources reach the ear as one complex waveform. Auditory scene analysis involves parsing a complex waveform into separate perceptual representations of the sound sources [Bregman, A. S. Auditory scene analysis: The perceptual organization of sounds. London: MIT Press, 1990]. Harmonicity provides an important cue for auditory scene analysis. Normally, harmonics at integer multiples of a fundamental frequency are perceived as one sound with a pitch corresponding to the fundamental frequency. However, when one harmonic in such a complex, pitch-evoking sound is sufficiently mistuned, that harmonic emerges from the complex tone and is perceived as a separate auditory object. Previous work has shown that the percept of two objects is indexed in both children and adults by the object-related negativity component of the ERP derived from EEG recordings [Alain, C., Arnott, S. T., & Picton, T. W. Bottom-up and top-down influences on auditory scene analysis: Evidence from event-related brain potentials. Journal of Experimental Psychology: Human Perception and Performance, 27, 1072-1089, 2001]. Here we examine the emergence of object-related responses to an 8% harmonic mistuning in infants between 2 and 12 months of age. Two-month-old infants showed no significant object-related response. However, in 4- to 12-month-old infants, a significant frontally positive component was present, and by 8-12 months, a significant frontocentral object-related negativity was present, similar to that seen in older children and adults. This is in accordance with previous research demonstrating that infants younger than 4 months of age do not integrate harmonic information to perceive pitch when the fundamental is missing [He, C., Hotson, L., & Trainor, L. J. Maturation of cortical mismatch responses to occasional pitch change in early infancy: Effects of presentation rate and magnitude of change. Neuropsychologia, 47, 218-229, 2009]. The results indicate that the ability to use harmonic information to segregate simultaneous sounds emerges at the cortical level between 2 and 4 months of age.	\N	\N
25445239	In this study we validate passive language fMRI protocols designed for clinical application in pediatric epilepsy surgical planning as they do not require overt participation from patients. We introduced a set of quality checks that assess reliability of noninvasive fMRI mappings utilized for clinical purposes. We initially compared two fMRI language mapping paradigms, one active in nature (requiring participation from the patient) and the other passive in nature (requiring no participation from the patient). Group-level analysis in a healthy control cohort demonstrated similar activation of the putative language centers of the brain in the inferior frontal (IFG) and temporoparietal (TPG) regions. Additionally, we showed that passive language fMRI produced more left-lateralized activation in TPG (LI=+0.45) compared to the active task, with similarly robust left-lateralized IFG (LI=+0.24) activations using the passive task. We validated our recommended fMRI mapping protocols in a cohort of 15 pediatric epilepsy patients by direct comparison against the invasive clinical gold standards. We found that language-specific TPG activation by fMRI agreed to within 9.2 mm with subdural localizations by invasive functional mapping in the same patients, and language dominance by fMRI agreed with Wada test results at 80% congruency in TPG and 73% congruency in IFG. Lastly, we tested the recommended passive language fMRI protocols in a cohort of very young patients and confirmed reliable language-specific activation patterns in that challenging cohort. We concluded that language activation maps can be reliably achieved using the passive language fMRI protocols we proposed even in very young (average 7.5 years old) or sedated pediatric epilepsy patients.	\N	\N
25449865	Utilizing the high temporal resolution of event-related potentials (ERPs), we examined the effects of temporal reliability of sounds on visual detection. Significantly faster reaction times to visual target stimuli were observed when reliable temporal information was provided by a task-irrelevant auditory stimulus. Three main ERP components related to the effects of auditory temporal reliability were found: the first at 180-240 ms over a wide central area, the second at 300-400 ms over an anterior area, and the third at 300-380 ms over bilateral temporal areas. Our results support the hypothesis that temporal reliability affects visual detection and indicate that auditory facilitation of visual detection is partly due to spread of attention and thus results from implicit temporal linking of auditory and visual information at a relatively late processing stage.	\N	\N
25473957	To examine the differences in quality of life for vestibular schwannoma patients undergoing conservative management, gamma knife, and surgery. Vestibular schwannoma patients without a diagnosis of NF2. Vestibular schwannoma treatment or conservative management. Penn Acoustic Neuroma Quality of Life (PANQOL) survey scores (0-100). One hundred eighty-six patients (98 conservative, 49 gamma knife, 39 surgery) were included. Mean patient age (years) was significantly lower in the surgery group (49 ± 14) than in both the conservative (58 ± 13) and gamma knife (59 ± 12) groups (p < 0.001). Mean follow-up time was 2.6 years. Tumor size (mm) was found to be significantly different between the conservative (8 ± 4.8), gamma knife (18 ± 5.9), and surgery (22 ± 8.3) groups (p < 0.001). Speech recognition threshold and speech discrimination percentage were significantly better for the conservative group compared to the gamma knife or surgery groups (p < 0.001). The hearing domain scores seemed better for the conservative group (62 ± 26) when compared to the surgery group (47 ± 25). The general and total domain scores were similar for all treatment groups, whereas the quality-of-life scores for gamma knife and surgery were similar. Although the surgery group's significantly larger tumors and worse hearing were apparent in specific PANQOL domains, all patients achieved a similar general level of quality of life.	\N	\N
25474416	One of the major complaints of people with a single-sided deafness is the inability to localize sound sources. Evidence suggests that subjects with a hearing loss can benefit from the use of a cochlear implant (CI) in sound localization. This study aimed to determine the effect of CI use on localization ability in unilaterally deafened subjects. Sixteen adult subjects with postlingual unilateral deafness, fitted with a CI on the deaf side, were included in this study. The auditory speech sounds evaluation (A§E) localization test was used to determine localization with a CI on (binaural) and a CI off (monaural). The root mean square error was used as a measure of the subject's localization performance. Stratified analyses were performed to test the influence of gender, age of implantation (<55 years and >55 years), and the duration of deafness (<10 years and >10 years) on localization ability. Subjects with a CI on localized significantly better than without a CI. Gender, age, and the duration of deafness had no effect on the localization ability of the subjects. Cochlear implantation is effective in improving localization abilities in subjects with unilateral deafness. The root mean square error dropped significantly with binaural hearing compared to monaural hearing.	\N	\N
25477777	For multimodal Human-Computer Interaction (HCI), it is very useful to identify the modalities on which the user is currently processing information. This would enable a system to select complementary output modalities to reduce the user's workload. In this paper, we develop a hybrid Brain-Computer Interface (BCI) which uses Electroencephalography (EEG) and functional Near Infrared Spectroscopy (fNIRS) to discriminate and detect visual and auditory stimulus processing. We describe the experimental setup we used for collection of our data corpus with 12 subjects. On this data, we performed cross-validation evaluation, of which we report accuracy for different classification conditions. The results show that the subject-dependent systems achieved a classification accuracy of 97.8% for discriminating visual and auditory perception processes from each other and a classification accuracy of up to 94.8% for detecting modality-specific processes independently of other cognitive activity. The same classification conditions could also be discriminated in a subject-independent fashion with accuracy of up to 94.6 and 86.7%, respectively. We also look at the contributions of the two signal types and show that the fusion of classifiers using different features significantly increases accuracy.	\N	\N
25480056	Mounting evidence suggests that listeners perceptually compensate for the adverse effects of reverberation in rooms when listening to speech monaurally. However, it is not clear whether the underlying perceptual mechanism would be at all effective in the high levels of stimulus uncertainty that are present in everyday listening. Three experiments investigated monaural compensation with a consonant identification task in which listeners heard different speech on each trial. Consonant confusions frequently arose when a greater degree of reverberation was added to a test-word than to its surrounding context, but compensation became apparent in conditions where the context reverberation was increased to match that of the test-word; here, the confusions were largely resolved. A second experiment shows that information from the test-word itself can also effect compensation. Finally, the time course of compensation was examined by applying reverberation to a portion of the preceding context; consonant identification improves as this portion increases in duration. These findings indicate a monaural compensation mechanism that is likely to be effective in everyday listening, allowing listeners to recalibrate as their reverberant environment changes.	\N	\N
25486827	Temporal summation in masking was measured together with the resolution of the masker's spectral structure, in order to find psychoacoustic characteristics for estimating speech intelligibility and to detect the manifestation of peripheral processes in human auditory perception. For this, detection thresholds of a test signal with different durations were determined. The test signal was a pulse with a Gaussian envelope and a sine-wave carrier. It was presented simultaneously with a noise masker. The minimal pulse duration was inversely proportional to the width of the critical band of hearing at the pulse's center frequency. The maximal pulse duration was always 50 ms. We adopted pulses with durations of 1-10 ms as a model of consonants and pulses with durations of 20-50 ms as a model of vowels. Two types of band-pass noise with a rippled amplitude spectrum were used as maskers. The central frequency of one masker coincided with a spectral hump and the central frequency of the other with a spectral dip. When the central frequencies of the pulses and maskers were equal, the first and second maskers were called on- and off-maskers, respectively. If the auditory system could discriminate the rippled structure of the masker's spectrum, the difference between the detection thresholds of the pulses presented with each of the maskers was not equal to zero. This difference in detection thresholds allows us to estimate the resolution of the masker's spectrum, i.e., frequency selectivity. Changes in the central frequencies of the pulse and masker allow us to evaluate hearing ability in a given frequency domain. Changes in the masker level allow us to find the influence of the nonlinear dynamic properties of the cochlea on temporal summation and frequency selectivity. This paper presents the results of measurements of temporal summation in masking, obtained in two frequency domains, 2 and 4 kHz, in 4 subjects with normal hearing and in 1 subject with age-related hearing loss who complained about deterioration of speech intelligibility. Temporal summation increased and the resolution of the rippled structure of the amplitude spectra improved for maskers with average levels. We believe the reasons could lie in the properties of peripheral pulse coding, such as (1) the stabilization zones of excitation of the basilar membrane, the range of the characteristic frequencies, the number of excited auditory nerve fibers, and the number of spikes generated by the fibers, due to the nonlinearity of the dynamic properties of the cochlea, and (2) increasing synchronization of the response of the excited fibers and shortening of this response in time, due to the fibers' refractory properties.	\N	\N
25514452	Temporal processing ability has been linked to speech understanding ability and older adults often complain of difficulty understanding speech in difficult listening situations. Temporal processing can be evaluated using gap detection procedures. There is some research showing that gap detection can be evaluated using an electrophysiological procedure. However, there is currently no research establishing gap detection threshold using the N1-P2 response. The purposes of the current study were to 1) determine gap detection thresholds in younger and older normal-hearing adults using an electrophysiological measure, 2) compare the electrophysiological gap detection threshold and behavioral gap detection threshold within each group, and 3) investigate the effect of age on each gap detection measure. This study utilized an older adult group and younger adult group to compare performance on an electrophysiological and behavioral gap detection procedure. The subjects in this study were 11 younger, normal-hearing adults (mean = 22 yrs) and 11 older, normal-hearing adults (mean = 64.36 yrs). All subjects completed an adaptive behavioral gap detection procedure in order to determine their behavioral gap detection threshold (BGDT). Subjects also completed an electrophysiologic gap detection procedure to determine their electrophysiologic gap detection threshold (EGDT). Older adults demonstrated significantly larger gap detection thresholds than the younger adults. However, EGDT and BGDT were not significantly different in either group. The mean difference between EGDT and BGDT for all subjects was 0.43 msec. Older adults show poorer gap detection ability when compared to younger adults. However, this study shows that gap detection thresholds can be measured using evoked potential recordings and yield results similar to a behavioral measure.	\N	\N
25517630	Auditory processing disorder patients may have deficits in auditory temporal resolution. This study explored: (1) the ear-specific norms for young adults using the adaptive tests of temporal resolution (ATTR); (2) the reliability of ATTR using two different modes of stimuli presentation; and (3) the concurrent validity of ATTR with reference to the gaps-in-noise (GIN) test. GIN and ATTR were administered through a standard audiometer and headphones. As ATTR can also be completed using a computer with commercially available headphones, thresholds from these two variants were compared. Thirty normal-hearing young adults were recruited. The mean ATTR gap detection thresholds (GDTs) derived under audiometer administration were 4.60 ms (SD 1.49) and 4.97 ms (SD 1.98) for the left and right ear, respectively. The mean approximated thresholds (A.th.), the GIN measure equivalent to the GDT, were 5.37 ms (SD 0.98) and 5.33 ms (SD 1.07) for the left and right ears, respectively. No significant threshold difference was found between the ATTR variants. A positive, moderate correlation was found, and Bland-Altman plot analysis revealed good agreement, between GDT and A.th. ATTR and GIN results were moderately associated. Moreover, the ATTR was found to have high test-retest reliability and high specificity for the current participants.	\N	\N
25521593	A new approach for the segregation of monaural sound mixtures is presented based on the principle of temporal coherence and using auditory cortical representations. Temporal coherence is the notion that perceived sources emit coherently modulated features that evoke highly-coincident neural response patterns. By clustering the feature channels with coincident responses and reconstructing their input, one may segregate the underlying source from the simultaneously interfering signals that are uncorrelated with it. The proposed algorithm requires no prior information or training on the sources. It can, however, gracefully incorporate cognitive functions and influences such as memories of a target source or attention to a specific set of its attributes so as to segregate it from its background. Aside from its unusual structure and computational innovations, the proposed model provides testable hypotheses of the physiological mechanisms of this ubiquitous and remarkable perceptual ability, and of its psychophysical manifestations in navigating complex sensory environments.	\N	\N
25534365	The temporal masking curve (TMC) method is a behavioral technique for inferring human cochlear compression. The method relies on the assumptions that in the absence of compression, forward-masking recovery is independent of masker level and probe frequency. The present study aimed at testing the validity of these assumptions. Masking recovery was investigated for eight listeners with sensorineural hearing loss carefully selected to have absent or nearly absent distortion product otoacoustic emissions. It is assumed that for these listeners basilar membrane responses are linear, hence that masking recovery is independent of basilar membrane compression. TMCs for probe frequencies of 0.5, 1, 2, 4, and 6 kHz were available for these listeners from a previous study. The dataset included TMCs for masker frequencies equal to the probe frequencies plus reference TMCs measured using a high-frequency probe and a low, off-frequency masker. All of the TMCs were fitted using linear regression, and the resulting slope and intercept values were taken as indicative of masking recovery and masker level, respectively. Results for on-frequency TMCs suggest that forward-masking recovery is generally independent of probe frequency and of masker level and hence that it would be reasonable to use a reference TMC for a high-frequency probe to infer cochlear compression at lower frequencies. Results further show, however, that reference TMCs were sometimes shallower than corresponding on-frequency TMCs for identical probe frequencies, hence that compression could be overestimated in these cases. We discuss possible reasons for this result and the conditions when it might occur.	\N	\N
25536846	We explored the functional units of speech segmentation in Japanese using dichotic presentation and a detection task requiring no intentional sublexical analysis. Indeed, illusory perception of a target word might result from preattentive migration of phonemes, morae, or syllables from one ear to the other. In Experiment 1, Japanese listeners detected targets presented in hiragana and/or kanji. Phoneme migrations did occur, suggesting that orthography-independent sublexical constituents play some role in segmentation. However, syllable and especially mora migrations were more numerous. This pattern of results was not observed in French speakers (Experiment 2), suggesting that it reflects native segmentation in Japanese. To control for the intervention of kanji representations (many words are written in kanji, and one kanji often corresponds to one syllable), in Experiment 3, Japanese listeners were presented with target loanwords that can be written only in katakana. Again, phoneme migrations occurred, while the first mora and syllable led to similar rates of illusory percepts. No migration occurred for the second, "special" mora (/J/ or /N/), probably because this constitutes the latter part of a heavy syllable. Overall, these findings suggest that multiple units, such as morae, syllables, and even phonemes, function independently of orthographic knowledge in Japanese preattentive speech segmentation.	\N	\N
25546030	A study was conducted to determine whether modifications to input compression and input frequency response characteristics can improve music-listening satisfaction in cochlear implant users. Experiment 1 compared three pre-processed versions of music and speech stimuli in a laboratory setting: original, compressed, and flattened frequency response. Music excerpts comprised three music genres (classical, country, and jazz), and a running speech excerpt was compared. Experiment 2 implemented a flattened input frequency response in the speech processor program. In a take-home trial, participants compared unaltered and flattened frequency responses. Ten and twelve adult Nucleus Freedom cochlear implant users participated in Experiments 1 and 2, respectively. Experiment 1 revealed a significant preference for music stimuli with a flattened frequency response compared to both original and compressed stimuli, whereas there was a significant preference for the original (rising) frequency response for speech stimuli. Experiment 2 revealed no significant mean preference for the flattened frequency response, with 9 of 11 subjects preferring the rising frequency response. Input compression did not alter music enjoyment. Comparison of the two experiments indicated that individual frequency response preferences may depend on the genre or familiarity, and particularly whether the music contained lyrics.	\N	\N
25556198	The auditory steady-state response, which measures the ability of neural ensembles to entrain to rhythmic auditory stimuli, has been used in human electroencephalogram studies to assess sensory processing and electrical oscillatory deficits. Patients with schizophrenia show a deficit in auditory steady-state response at 40 Hz, and therefore this may be a useful biomarker to study this disorder. We used auditory steady-state response recordings from the primary auditory cortex, hippocampus, and vertex electroencephalogram sites in awake behaving rats to determine whether pharmacological impairment of excitatory or inhibitory neurotransmission mimics auditory steady-state response abnormalities in schizophrenia. We found the most robust response to auditory stimuli in the primary auditory cortex, in line with previous studies suggesting this region is the primary generator of the auditory steady-state response in humans. Acute MK-801 (0.1 mg/kg i.p.) increased primary auditory cortex intertrial coherence during auditory steady-state response at 20 and 40 Hz. Chronic MK-801 (21-day exposure at this daily dose) had no significant effect on 40-Hz auditory steady-state response. Furthermore, we found no effect of acute or chronic picrotoxin (a GABA-A antagonist) on intertrial coherence. Our data indicate that acute N-methyl-d-aspartate receptor antagonism increases synchronous activity in the primary auditory cortex in a frequency-specific manner, supporting the widely held view that acute N-methyl-d-aspartate antagonism augments gamma oscillations. Thus, rodent auditory steady-state response could be a valuable method to study the cortical ability to support synchronous activity at specific frequencies.	\N	\N
25561538	Some of the psychological abilities that underlie human speech are shared with other species. One hallmark of speech is that linguistic context affects both how speech sounds are categorized into phonemes, and how different versions of phonemes are produced. We here confirm earlier findings that swamp sparrows categorically perceive the notes that constitute their learned songs and then investigate how categorical boundaries differ according to context. We clustered notes according to their acoustic structure, and found statistical evidence for clustering into 10 population-wide note types. Examining how three related types were perceived, we found, in both discrimination and labeling tests, that an "intermediate" note type is categorized with a "short" type when it occurs at the beginning of a song syllable, but with a "long" type at the end of a syllable. In sum, three produced note-type clusters appear to be underlain by two perceived categories. Thus, in birdsong, as in human speech, categorical perception is context-dependent, and as is the case for human phonology, there is a complex relationship between underlying categorical representations and surface forms. Our results therefore suggest that complex phonology can evolve even in the absence of rich linguistic components, like syntax and semantics.	\N	\N
25565661	The objective of this study was to examine the hypothesis that between-channel gap detection, which includes between-frequency and between-ear gap detection, and perception of stop consonants, which is mediated by the length of voice-onset time (VOT), share common mechanisms, namely relative-timing operation in monitoring separate perceptual channels. The authors measured gap detection thresholds and identification functions of /ba/ and /pa/ along VOT in 49 native young adult Japanese listeners. There were three gap detection tasks. In the between-frequency task, the leading and trailing markers differed in terms of center frequency (Fc). The leading marker was a broadband noise of 10 to 20,000 Hz. The trailing marker was a 0.5-octave band-passed noise of 1000-, 2000-, 4000-, or 8000-Hz Fc. In the between-ear task, the two markers were spectrally identical but presented to separate ears. In the within-frequency task, the two spectrally identical markers were presented to the same ear. The /ba/-/pa/ identification functions were obtained in a task in which the listeners were presented synthesized speech stimuli of varying VOTs from 10 to 46 msec and asked to identify them as /ba/ or /pa/. The between-ear gap thresholds were significantly positively correlated with the between-frequency gap thresholds (except those obtained with the trailing marker of 4000-Hz Fc). The between-ear gap thresholds were not significantly correlated with the within-frequency gap thresholds, which were significantly correlated with all the between-frequency gap thresholds. The VOT boundaries and slopes of /ba/-/pa/ identification functions were not significantly correlated with any of these gap thresholds. There was a close relation between the between-ear and between-frequency gap detection, supporting the view that these two types of gap detection share common mechanisms of between-channel gap detection. 
However, there was no evidence for a relation between the perception of stop consonants and the between-frequency/ear gap detection in native Japanese speakers.	\N	\N
25571013	Binaural beat (BB) illusions are experienced as continuous central pulsations when two sounds with slightly different frequencies are delivered to each ear. It has been shown that steady-state auditory evoked potentials (AEPs) to BBs can be captured and investigated. The authors recently developed a new method of evoking transient AEPs to binaural beats using frequency modulated stimuli. This methodology was able to create single BBs in predetermined intervals with varying carrier frequencies. This study examines the effects of the BB duration and the frequency modulating component of the stimulus on the binaural beats and their evoked potentials. Normal hearing subjects were tested with a set of four durations (25, 50, 100, and 200 ms) with two stimulation configurations, binaural dichotic (binaural beats) and diotic (frequency modulation). The results showed that, of the durations tested, the 100 ms beat evoked the largest-amplitude responses. The frequency modulation effect showed a decrease in peak amplitudes with increasing beat duration until their complete disappearance at 200 ms. Even though the frequency modulation effects were absent at 200 ms, the binaural beats were still perceived and captured as evoked potentials.	\N	\N
25571287	The pathologic auditory sensation in decompensated tinnitus patients is accompanied by the inability to habituate even temporarily to this sound. This disability might originate from simultaneous activation of brain areas for the appraisal of the stimulus valence, such as the limbic system. This coactivation of limbic areas is likely to modulate the degree and persistence of selective attention assigned to the tinnitus stream, which in turn could also explain interindividual differences in tinnitus loudness perception. Preliminary studies demonstrate that the amount of allocated attention and the habituation deficit can be mapped to changes in auditory late evoked responses (ALRs). Utilizing a numerical model for the simulation of ALRs, we were able to predict a general habituation behavior in two patient groups with different degrees of tinnitus severity. Evaluating the instantaneous phase of simulated and measured ALRs by its von Mises concentration parameter, we verify a habituation deficit relative to the degree of decompensation and thus provide additional support for our neurofunctional model of limbic influences on neural processing of sensory information.	\N	\N
25577901	People with one eye show altered sensory processing. Such changes might reflect a central reweighting of sensory information that might impact on how multisensory cues are integrated. We assessed whether people who lost an eye early in life differ from controls with respect to audiovisual integration. In order to quantify the relative weightings assigned to each sensory system, participants were asked to spatially localize audiovisual events that have been previously shown to be optimally combined and perceptually fused from the point of view of location in a normal population, where the auditory and visual components were spatially disparate. There was no difference in the variability of localizing unimodal visual and auditory targets by people with one eye compared to controls. People with one eye did, however, demonstrate slower reaction times to localize visual stimuli compared to auditory stimuli and were slower than binocular and eye-patched control groups. When localizing bimodal targets, the weightings assigned to each sensory modality in both people with one eye and controls were predictable from their unimodal performance, in accordance with Maximum Likelihood Estimation, and the time it took all three groups to localize the bimodal targets was faster than for vision alone. Despite demonstrating a longer response time to visual stimuli, people with one eye appear to integrate the auditory and visual components of multisensory events optimally when determining spatial location.	\N	\N
25597464	There is a paucity of published studies examining how children with hearing loss understand speech over the telephone. Previous studies on adults with hearing aids have suggested that adults with bilateral hearing aids experience significant difficulty recognizing speech on the telephone when listening with one ear, but the provision of telephone input to both ears substantially improved speech understanding. The objectives of this study were to measure speech recognition in quiet and in noise for a group of older children with hearing loss over the telephone and to evaluate the effects of binaural hearing (e.g., DuoPhone) on speech recognition over the telephone. A cross-sectional, repeated-measures design was used in this study. A total of 14 children, ages 6-14 yr, participated in the study. Participants were obtained using convenience sampling from a nonprofit clinic population. Speech recognition in quiet and in noise with binaural versus monaural telephone input was compared in pediatric participants. Monosyllabic word recognition was assessed in quiet and classroom noise set at 50 dBA in conditions with monaural and binaural (DuoPhone) telephone input. The children's speech recognition in quiet and in noise was significantly better with binaural telephone input relative to monaural telephone input. To obtain optimal performance on the telephone, the following considerations may apply: (1) use of amplification with binaural streaming capabilities (e.g., DuoPhone), (2) counseling of family and children on how to best use the telephone, (3) provision of telecoil with microphone attenuation for improved signal-to-noise ratio, and (4) use of probe tube measures to verify the appropriateness of the telephone programs.	\N	\N
25597465	In order to differentiate between a conductive hearing loss (CHL) and a sensorineural hearing loss (SNHL) in the hearing-impaired individual, we compared thresholds to air conduction (AC) and bone conduction (BC) auditory stimulation. The presence of a gap between these thresholds (an air-bone gap) is taken as a sign of a CHL, whereas similar threshold elevations reflect an SNHL. This is based on the assumption that BC stimulation directly excites the inner ear, bypassing the middle ear. However, several of the classic mechanisms of BC stimulation such as ossicular chain inertia and the occlusion effect involve middle ear structures. An additional mode of auditory stimulation, called soft tissue conduction (STC; also called nonosseous BC) has been demonstrated, in which the clinical bone vibrator elicits hearing when it is applied to soft tissue sites on the head, neck, and thorax. The purpose of this study was to assess the relative contributions of threshold determinations to stimulation by STC, in addition to AC and osseous BC, to the differential diagnosis between a CHL and an SNHL. Baseline auditory thresholds were determined in normal participants to AC (supra-aural earphones), BC (B71 bone vibrator at the mastoid, with 5 N application force), and STC (B71 bone vibrator applied to the submental area and to the submandibular triangle with 5 N application force) stimulation in response to 0.5, 1.0, 2.0, and 4.0 kHz tones. A CHL was then simulated in the participants by means of an ear plug. Separately, an SNHL was simulated in these participants with 30 dB effective masking. The study sample consisted of 10 normal-hearing participants (4 males; 6 females, aged 20-30 yr). AC, BC, and STC thresholds were determined in the initial normal state and in the presence of each of the simulations. The earplug-induced CHL simulation led to a mean AC threshold elevation of 21-37 dB (depending on frequency), but not of BC and STC thresholds. 
The masking-induced SNHL led to a mean elevation of AC, BC, and STC thresholds (23-36 dB, depending on frequency). In each type of simulation, the BC threshold shift was similar to that of the STC threshold shift. These results, which show a similar threshold shift for STC and for BC as a result of these simulations, together with additional clinical and laboratory findings, provide evidence that BC thresholds likely represent the threshold of the nonosseous BC (STC) component of multicomponent BC at the BC stimulation site, and thereby succeed in clinical practice to contribute to the differential diagnosis. This also provides evidence that STC (nonosseous BC) stimulation at low intensities probably does not involve components of the middle ear, represents true cochlear function, and therefore can also contribute to a differential diagnosis (e.g., in situations where the clinical bone vibrator cannot be applied to the mastoid or forehead with a 5 N force, such as in severe skull fracture).	\N	\N
25605693	Tetramethylpyrazine has been suggested to have a therapeutic effect on impaired hearing that is induced by aminoglycoside antibiotics. However, its effectiveness on streptomycin ototoxicity and its cellular mechanisms are relatively unknown. Here we investigate the protective effect of tetramethylpyrazine on streptomycin-induced ototoxicity in guinea pig cochlea. Prospective randomized laboratory study. Hearing Research Laboratory of China Medical University. Adult guinea pigs were randomized to 4 groups. Hearing sensitivity of guinea pigs was tested by auditory brainstem response measurements before streptomycin exposure and again 10 days later. The cochlear tissues were prepared for electron microscopy and immunohistochemical staining of heat shock protein 70 (HSP70). The effect of tetramethylpyrazine on streptomycin-induced activation of caspase-3 was evaluated by Western blotting. Co-therapy with tetramethylpyrazine reduced a profound streptomycin-induced auditory threshold shift compared with streptomycin treatment alone (P = .0002 or P = .00008). Tetramethylpyrazine also attenuated the structural disruption in streptomycin-treated outer hair cells and marginal cells of the stria vascularis, as shown by transmission electron microscopy and scanning electron microscopy, respectively. Moreover, tetramethylpyrazine decreased the streptomycin-stimulated expression of HSP70 and caspase-3. The correlation analysis demonstrated that HSP70 expression had a positive correlation with auditory brainstem response thresholds (|R| = 0.6-0.9, P = .0073 or P = .0169). Our data suggest that the protective effect of tetramethylpyrazine on hearing function is associated with the reduction of stress response and inhibition of apoptosis. Tetramethylpyrazine may have therapeutic potential for patients with ototoxicity diseases.	\N	\N
25611857	This study reviewed whether advanced age should be a consideration when revision cochlear implantation is warranted. To examine whether age at revision cochlear implantation is related to postrevision speech perception performance. A retrospective analysis was performed in an academic tertiary care center. Participants included 14 younger adults (<65 years) and 15 older adults (≥65 years) who underwent revision cochlear implantation. Revision cochlear implantation. Speech perception performance, as measured with consonant-nucleus-consonant [CNC] words in quiet, at the best prerevision interval as well as the 3- and 6-month postrevision intervals were compared between the 2 cohorts. The CNC word test consists of 10 lists of 50 phonemically balanced monosyllabic words, scored with a range of 0% to 100% correct. Both cohorts experienced a restoration in speech perception scores after revision cochlear implantation compared with their best performance before the revision (mean [SD] CNC word test scores for the younger cohort: 43.9% [25.6%] before revision and 47.7% [21.3%] at 3 months and 47.6% [19.8%] at 6 months after revision; for the older cohort: 36.3% [19.1%] before revision and 35.3% [17.2%] at 3 months and 39.9% [16.3%] at 6 months after revision; F₂,₅₄= 0.93; P = .40). There was no interaction between age at revision surgery and speech perception performance at each assessment interval (F₂,₅₄= 0.51; P = .60). In this study, age at revision cochlear implantation was not related to postrevision speech perception performance. Advanced age should not be considered a contraindication to revision cochlear implantation.	\N	\N
25613931	To study electrical stimulation, auditory functionality, and language development in patients with inner ear malformations involving the anterior labyrinth who underwent cochlear implantation. Retrospective case review. Reference hospital for cochlear implantation. Review of 14 cases of severe hearing loss with major (common cavity deformity and cochlear hypoplasia) or minor (e.g., incomplete partition and basal turn aplasia) malformations. After cochlear implantation, data were gathered on the threshold (THR) and maximum comfort level (MCL) of the electrical stimulation and the number of functioning electrodes. Auditory responses to speech (EARS protocol) subtests were used to evaluate auditory functionality and language acquisition at 6, 12, and 24 months post-implantation. Tests used were: LIP profile, MTP (3, 6 and 12 words), OLD (open set test) and CLD (close set test). Results were compared with findings in a control group of 28 cochlear implantation patients without these malformations and with congenital hearing loss. The mean THR was 11.02 μC in patients with malformations versus 3.5 μC in those without, a significant difference. The THR also significantly differed between groups with major and minor malformations. Fewer functioning electrodes were used in patients with malformations. Auditory functionality scores were better in controls than in patients with malformations, who scored ≤50%, with the lowest scores in those with major malformations. Patients with inner ear malformations undergoing cochlear implantation require greater stimuli to obtain an auditory response and have worse auditory functionality outcomes; these differences are greater in those with major versus minor malformations. Nevertheless, cochlear implantation appears to be beneficial for all patients with these malformations to a greater or lesser extent.	\N	\N
25617593	We examined 4- and 6-month-old infants' sensitivity to the perceptual association between pitch and object size. Crossmodal correspondence effects were observed in 6-month-old infants but not in younger infants, suggesting that experience and/or further maturation is needed to fully develop this crossmodal association.	\N	\N
25618049	An acoustic survey of secondary schools in England has been undertaken. Room acoustic parameters and background noise levels were measured in 185 unoccupied spaces in 13 schools to provide information on the typical acoustic environment of secondary schools. The unoccupied acoustic and noise data were correlated with various physical characteristics of the spaces. Room height and the amount of glazing were related to the unoccupied reverberation time and therefore need to be controlled to reduce reverberation to suitable levels for teaching and learning. Further analysis of the unoccupied data showed that the introduction of legislation relating to school acoustics in England and Wales in 2003 approximately doubled the number of school spaces complying with current standards. Noise levels were also measured during 274 lessons to examine typical levels generated during teaching activities in secondary schools and to investigate the influence of acoustic design on working noise levels in the classroom. Comparison of unoccupied and occupied data showed that unoccupied acoustic conditions affect the noise levels occurring during lessons. They were also related to the time spent in disruption to the lessons (e.g., students talking or shouting) and so may also have an impact upon student behavior in the classroom.	\N	\N
25618071	Listeners can use pitch changes in speech to identify talkers. Individuals exhibit large variability in sensitivity to pitch and in accuracy perceiving talker identity. In particular, people who have musical training or long-term tone language use are found to have enhanced pitch perception. In the present study, the influence of pitch experience on talker identification was investigated as listeners identified talkers in native language as well as non-native languages. Experiment 1 was designed to explore the influence of pitch experience on talker identification in two groups of individuals with potential advantages for pitch processing: musicians and tone language speakers. Experiment 2 further investigated individual differences in pitch processing and the contribution to talker identification by testing a mediation model. Cumulatively, the results suggested that (a) musical training confers an advantage for talker identification, supporting a shared resources hypothesis regarding music and language and (b) linguistic use of lexical tones also increases accuracy in hearing talker identity. Importantly, these two types of hearing experience enhance talker identification by sharpening pitch perception skills in a domain-general manner.	\N	\N
25618091	This study reports a finding about vocal expressions of emotion in Mandarin Chinese. Production and perception experiments used the same tone and mixed tone sequences to test whether pitch variation is restricted due to the presence of lexical tones. Results showed that the restriction of pitch variation occurred in all high level tone sequences (tone 1 group) with the expression of happiness but did not happen for other dynamic tone groups. However, perception analysis revealed that all the emotions in every tone group received high identification rates; this indicates that listeners used other cues for encoding happiness in the tone 1 group. This study demonstrates that the restriction of pitch variation does not affect the perception of vocal emotions.	\N	\N
25618101	Speech perception studies generally focus on the acoustic information present in the frequency regions below 6 kHz. Recent evidence suggests that there is perceptually relevant information in the higher frequencies, including information affecting speech intelligibility. This experiment examined whether listeners are able to accurately identify a subset of vowels and consonants in CV-context when only high-frequency (above 5 kHz) acoustic information is available (through high-pass filtering and masking of lower frequency energy). The findings reveal that listeners are capable of extracting information from these higher frequency regions to accurately identify certain consonants and vowels.	\N	\N
25628152	The aim of the present study was to investigate whether the saliency effect for word beginnings reported in children with dyslexia (Marshall & Van der Lely, 2009) can be found also in typically developing children. Thirty-four typically developing Italian children aged 8-10 years completed two specifically designed tasks: a production task and a perception task. Both tasks used nonwords containing clusters consisting of plosive plus liquid (e.g. pl). Clusters could be either in a stressed or in an unstressed syllable and could be either in initial position (first syllable) or in medial position (second syllable). In the production task, children were asked to repeat the nonwords. In the perception task, the children were asked to discriminate between two nonwords differing in one phoneme belonging to a cluster by reporting whether two repetitions were the same or different. Results from the production task showed that children are more accurate in repeating stressed than unstressed syllables, but there was no difference with respect to position of the cluster. Results from the perception task showed that children performed more accurately when discriminating word initial contrasts than when discriminating word medial contrasts, especially if the cluster was unstressed. Implications of this finding for clinical assessments are discussed.	\N	\N
25630393	The objective of this study was to test if stimulating multiple electrodes can improve temporal pitch ranking performance at low and high stimulation rates. Temporal pitch cues are usually based on modifying the stimulation rate of the implant and thereby provide a continuum of pitches on a single electrode up to approximately 300 Hz. Ten cochlear implant subjects were asked to pitch rank stimuli presented with direct electrical stimulation. The pulses were applied on one, three, six, or eleven electrodes. In one of the conditions the current amplitude of each pulse was randomly varied between 0 and 100%. Their frequency ranged from 100 up to 500 pps. Listeners showed the previously reported performance pattern in most conditions with very good performance at the lowest standard rates and deteriorating performance to near chance level at the highest rate tested. Performance with eleven electrodes was significantly better than performance with one electrode at 500 pps. Stimulating on multiple electrodes can improve temporal pitch perception.	\N	\N
25634776	To compare some perceptual and acoustic characteristics of the voices of children who use the advanced combination encoder (ACE) or fine structure processing (FSP) speech coding strategies, and to investigate whether these characteristics differ from children with normal hearing. Acoustic analysis of the sustained vowel /a/ was performed using the multi-dimensional voice program (MDVP). Analyses of sequential and spontaneous speech were performed using the real time pitch. Perceptual analyses of these samples were performed using visual-analogic scales of pre-selected parameters. Seventy-six children from three years to five years and 11 months of age participated. Twenty-eight were users of ACE, 23 were users of FSP, and 25 were children with normal hearing. Although both groups with CI presented with some deviated vocal features, the users of ACE presented with voice quality more like children with normal hearing than the users of FSP. Sound processing of ACE appeared to provide better conditions for auditory monitoring of the voice, and consequently, for better control of the voice production. However, these findings need to be further investigated due to the lack of comparative studies published to understand exactly which attributes of sound processing are responsible for differences in performance.	\N	\N
25636271	The major purpose of this study was to explore the changes in the local/global gamma-band neural synchronies during target/non-target processing due to task difficulty under an auditory three-stimulus oddball paradigm. Multichannel event-related potentials (ERPs) were recorded from fifteen healthy participants during the oddball task. In addition to the conventional ERP analysis, we investigated the modulations in gamma-band activity (GBA) and inter-regional gamma-band phase synchrony (GBPS) for infrequent target and non-target processing due to task difficulty. The most notable finding was that the difficulty-related changes in inter-regional GBPS (33-35 Hz) at P300 epoch (350-600 ms) completely differed for target and non-target processing. As task difficulty increased, the GBPS significantly reduced for target processing but increased for non-target processing. This result contrasts with the local neural synchrony in gamma-bands, which was not affected by task difficulty. Another major finding was that the spatial patterns of functional connectivity were dissociated for target and non-target processing with regard to the difficult task. The spatial pattern for target processing was compatible with the top-down attention network, whereas that for the non-target corresponded to the bottom-up attention network. Overall, we found that the inter-regional gamma-band neural synchronies during target/non-target processing change significantly with task difficulty and that this change is dissociated between target and non-target processing. Our results indicate that large-scale neural synchrony is more relevant for the difference in information processing between target and non-target stimuli.	\N	\N
25638938	When one hears footsteps in the hall, one is able to instantly recognise it as a person: this is an everyday example of auditory biological motion perception. Despite the familiarity of this experience, research into this phenomenon is in its infancy compared with visual biological motion perception. Here, two experiments explored sensitivity to, and recognition of, auditory stimuli of biological and nonbiological origin. We hypothesised that the cadence of a walker gives rise to a temporal pattern of impact sounds that facilitates the recognition of human motion from auditory stimuli alone. First, a series of detection tasks compared sensitivity with three carefully matched impact sounds: footsteps, a ball bouncing, and drumbeats. Unexpectedly, participants were no more sensitive to footsteps than to impact sounds of nonbiological origin. In the second experiment, participants made discriminations between pairs of the same stimuli, in a series of recognition tasks in which the temporal pattern of impact sounds was manipulated to be either that of a walker or the pattern more typical of the source event (a ball bouncing or a drumbeat). Under these conditions, there was evidence that both temporal and nontemporal cues were important in recognising these stimuli. It is proposed that the interval between footsteps, which reflects a walker's cadence, is a cue for the recognition of the sounds of a human walking.	\N	\N
25646513	Neural overlap in processing music and speech, as measured by the co-activation of brain regions in neuroimaging studies, may suggest that parts of the neural circuitries established for language may have been recycled during evolution for musicality, or vice versa that musicality served as a springboard for language emergence. Such a perspective has important implications for several topics of general interest besides evolutionary origins. For instance, neural overlap is an important premise for the possibility of music training to influence language acquisition and literacy. However, neural overlap in processing music and speech does not entail sharing neural circuitries. Neural separability between music and speech may occur in overlapping brain regions. In this paper, we review the evidence and outline the issues faced in interpreting such neural data, and argue that converging evidence from several methodologies is needed before neural overlap is taken as evidence of sharing.	\N	\N
25653354	Perceptual phase entrainment improves speech intelligibility by phase-locking the brain's high-excitability and low-excitability phases to relevant or irrelevant events in the speech input. However, it remains unclear whether phase entrainment to speech can be explained by a passive "following" of rhythmic changes in sound amplitude and spectral content or whether entrainment entails an active tracking of higher-level cues: in everyday speech, rhythmic fluctuations in low-level and high-level features always covary. Here, we resolve this issue by constructing novel speech/noise stimuli with intelligible speech but without systematic changes in sound amplitude and spectral content. The probability of detecting a tone pip, presented to human listeners at random moments during our speech/noise stimuli, was significantly modulated by the rhythmic changes in high-level information. Thus, perception can entrain to the speech rhythm even without concurrent fluctuations in sound amplitude or spectral content. Strikingly, the actual entrainment phase depended on the tone-pip frequency, with tone pips within and beyond the principal frequency range of the speech sound modulated in opposite fashion. This result suggests that only those neural populations processing the actually presented frequencies are set to their high-excitability phase, whereas other populations are entrained to the opposite, low-excitability phase. Furthermore, we show that the perceptual entrainment is strongly reduced when speech intelligibility is abolished by presenting speech/noise stimuli in reverse, indicating that linguistic information plays an important role for the observed perceptual entrainment.	\N	\N
25656953	This study investigated the sensitivity of 9-month-old infants to the alignment between prosodic and gesture prominences in pointing-speech combinations. Results revealed that the perception of prominence is multimodal and that infants are aware of the timing of gesture-speech combinations well before they can produce them.	\N	\N
25665752	This study compared the timing of appearance of three components of age-related hearing loss that determine the pattern and severity of presbycusis: the functional and structural pathologies of sensory cells and neurons and changes in gap detection (GD), the latter as an indicator of auditory temporal processing. Using UM-HET4 mice, genetically heterogeneous mice derived from four inbred strains, we studied the integrity of inner and outer hair cells by position along the cochlear spiral, inner hair cell-auditory nerve connections, spiral ganglion neurons (SGN), and determined auditory thresholds, as well as pre-pulse and gap inhibition of the acoustic startle reflex (ASR). Comparisons were made between mice of 5-7, 22-24 and 27-29 months of age. There was individual variability among mice in the onset and extent of age-related auditory pathology. At 22-24 months of age a moderate to large loss of outer hair cells was restricted to the apical third of the cochlea and threshold shifts in the auditory brain stem response were minimal. There was also a large and significant loss of inner hair cell-auditory nerve connections and a significant reduction in GD. The expression of Ntf3 in the cochlea was significantly reduced. At 27-29 months of age there was no further change in the mean number of synaptic connections per inner hair cell or in GD, but a moderate to large loss of outer hair cells was found across all cochlear turns as well as significantly increased ABR threshold shifts at 4, 12, 24 and 48 kHz. A statistical analysis of correlations on an individual animal basis revealed that neither the hair cell loss nor the ABR threshold shifts correlated with loss of GD or with the loss of connections, consistent with independent pathological mechanisms.	\N	\N
25669257	Individual factors beyond the audiogram, such as age and cognitive abilities, can influence speech intelligibility and speech quality judgments. This paper develops a neural network framework for combining multiple subject factors into a single model that predicts speech intelligibility and quality for a nonlinear hearing-aid processing strategy. The nonlinear processing approach used in the paper is frequency compression, which is intended to improve the audibility of high-frequency speech sounds by shifting them to lower frequency regions where listeners with high-frequency loss have better hearing thresholds. An ensemble averaging approach is used for the neural network to avoid the problems associated with overfitting. Models are developed for two subject groups, one having nearly normal hearing and the other mild-to-moderate sloping losses.	\N	\N
25673838	Critical periods are developmental windows during which the stimuli an animal encounters can reshape response properties in the affected system to a profound degree. Despite this window's importance, the neural mechanisms that regulate it are not completely understood. Pioneering studies in visual cortex initially indicated that norepinephrine (NE) permits ocular dominance column plasticity during the critical period, but later research has suggested otherwise. More recent work implicating NE in experience-dependent plasticity in the adult auditory cortex led us to re-examine the role of NE in critical period plasticity. Here, we exposed dopamine β-hydroxylase knock-out (Dbh(-/-)) mice, which lack NE completely from birth, to a biased acoustic environment during the auditory cortical critical period. This manipulation led to a redistribution of best frequencies (BFs) across auditory cortex in our control mice, consistent with prior work. By contrast, Dbh(-/-) mice failed to exhibit the expected redistribution of BFs, even though NE-deficient and NE-competent mice showed comparable auditory cortical organization when reared in a quiet colony environment. These data suggest that while intrinsic tonotopic patterning of auditory cortical circuitry occurs independently from NE, NE is required for critical period plasticity in auditory cortex.	\N	\N
25685775	The aim of this research was to analyze temporal auditory processing and phonological awareness in school-age children with benign childhood epilepsy with centrotemporal spikes (BECTS). Patient group (GI) consisted of 13 children diagnosed with BECTS. Control group (GII) consisted of 17 healthy children. After neurological and peripheral audiological assessment, children underwent a behavioral auditory evaluation and phonological awareness assessment. The procedures applied were: Gaps-in-Noise test (GIN), Duration Pattern test, and Phonological Awareness test (PCF). Results were compared between the groups and a correlation analysis was performed between temporal tasks and phonological awareness performance. GII performed significantly better than the children with BECTS (GI) in both GIN and Duration Pattern test (P < 0.001). GI performed significantly worse in all of the 4 categories of phonological awareness assessed: syllabic (P = 0.001), phonemic (P = 0.006), rhyme (P = 0.015) and alliteration (P = 0.010). Statistical analysis showed a significant positive correlation between the phonological awareness assessment and Duration Pattern test (P < 0.001). From the analysis of the results, it was concluded that children with BECTS may have difficulties in temporal resolution, temporal ordering, and phonological awareness skills. A correlation was observed between auditory temporal processing and phonological awareness in the studied sample.	\N	\N
25693304	When we interact with objects in our environment, as a general rule we are not aware of the proximal stimulation they provide, but we directly experience the external object. This process of assigning an external cause is known as distal attribution. It is extremely difficult to measure how distal attribution emerges because it arises so early in life and appears to be automatic. Sensory substitution systems give us the possibility to measure the process as it occurs online. With these devices, objects in our environment produce novel proximal stimulation patterns and individuals have to establish the link between the proximal stimulation and the distal object. This review disentangles the contributing factors that allow the nervous system to assign a distal cause, thereby creating the experience of an external world. In particular, it highlights the role of the assumption of a stable world, the role of movement, and finally that of calibration. From the existing sensory substitution literature it appears that distal attribution breaks down when one of these principles is violated and as such the review provides an important piece to the puzzle of distal attribution.	\N	\N
25698006	The present study investigated the possibility that the human auditory system demonstrates frequency selectivity to spectro-temporal amplitude modulations. Threshold modulation depth for detecting sinusoidal spectro-temporal modulations was measured using a generalized masked threshold pattern paradigm with narrowband masker modulations. Four target spectro-temporal modulations were examined, differing in their temporal and spectral modulation frequencies: a temporal modulation of -8, 8, or 16 Hz combined with a spectral modulation of 1 cycle/octave and a temporal modulation of 4 Hz combined with a spectral modulation of 0.5 cycles/octave. The temporal center frequencies of the masker modulation ranged from 0.25 to 4 times the target temporal modulation. The spectral masker-modulation center-frequencies were 0, 0.5, 1, 1.5, and 2 times the target spectral modulation. For all target modulations, the pattern of average thresholds for the eight normal-hearing listeners was consistent with the hypothesis of a spectro-temporal modulation filter. Such a pattern of modulation-frequency sensitivity was predicted on the basis of psychoacoustical data for purely temporal amplitude modulations and purely spectral amplitude modulations. An analysis of separability indicates that, for the present data set, selectivity in the spectro-temporal modulation domain can be described by a combination of a purely spectral and a purely temporal modulation filter function.	\N	\N
25721795	Music and speech are skills that require high temporal precision of motor output. A key question is how humans achieve this timing precision given the poor temporal resolution of somatosensory feedback, which is classically considered to drive motor learning. We hypothesise that auditory feedback critically contributes to learn timing, and that, similarly to visuo-spatial learning models, learning proceeds by correcting a proportion of perceived timing errors. Thirty-six participants learned to tap a sequence regularly in time. For participants in the synchronous-sound group, a tone was presented simultaneously with every keystroke. For the jittered-sound group, the tone was presented after a random delay of 10-190 ms following the keystroke, thus degrading the temporal information that the sound provided about the movement. For the mute group, no keystroke-triggered sound was presented. In line with the model predictions, participants in the synchronous-sound group were able to improve tapping regularity, whereas the jittered-sound and mute group were not. The improved tapping regularity of the synchronous-sound group also transferred to a novel sequence and was maintained when sound was subsequently removed. The present findings provide evidence that humans engage in auditory feedback error-based learning to improve movement quality (here reduce variability in sequence tapping). We thus elucidate the mechanism by which high temporal precision of movement can be achieved through sound in a way that may not be possible with less temporally precise somatosensory modalities. Furthermore, the finding that sound-supported learning generalises to novel sequences suggests potential rehabilitation applications.	\N	\N
25724819	Older adults, even with clinically normal hearing sensitivity, often report difficulty understanding speech in the presence of background noise. Part of this difficulty may be related to age-related degradations in the neural representation of speech sounds, such as formant transitions. Frequency-following responses (FFRs), which are dependent on phase-locked neural activity, were elicited using sounds consisting of linear frequency sweeps, which may be viewed as simple models of formant transitions. Eighteen adults (ten younger, 22-24 years old, and nine older, 51-67 years old) were tested. FFRs were elicited by tonal sweeps in six conditions. Two directions of frequency change, rising or falling, were used for each of three rates of frequency change. Stimulus-to-response cross correlations revealed that older adults had significantly poorer representation of the tonal sweeps, and that FFRs became poorer for faster rates of change. An additional FFR signal-to-noise ratio analysis based on time windows revealed that across the FFR waveforms and rates of frequency change, older adults had smaller (poorer) signal-to-noise ratios. These results indicate that older adults, even with clinically-normal hearing sensitivity, have degraded phase-locked neural representations of dynamic frequency.	\N	\N
25726264	Speech is a complex acoustic signal showing a quasiperiodic structure at several timescales. Integrated neural signals recorded in the cortex also show periodicity at different timescales. In this chapter we outline the neural mechanisms that potentially allow the auditory cortex to segment and encode continuous speech. This chapter focuses on how the human auditory cortex uses the temporal structure of the acoustic signal to extract phonemes and syllables, the two major constituents of connected speech. We argue that the quasiperiodic structure of collective neural activity in auditory cortex represents the ideal mechanical infrastructure to fractionate continuous speech into linguistic constituents of variable sizes.	\N	\N
25726287	Neural disorders of the auditory nerve are associated with particular disorders of auditory perceptions dependent on processing of acoustic temporal cues. These include: (1) speech perception; (2) localizing a sound's origin in space; and (3) identifying sounds in background noise. Auditory neuropathy (AN) is a consequence of: (1) presynaptic disorders affecting inner hair cell ribbon synapses; (2) postsynaptic disorders of auditory nerve dendrites; and (3) postsynaptic disorders of auditory nerve axons. The etiologies of these disorders are diverse, similar to other cranial or peripheral neuropathies. The pathologies cause attenuated and dyssynchronous auditory nerve discharges. Therapies and management of patients with AN are reviewed.	\N	\N
25726290	Neglect is a neurologic disorder, typically associated with lesions of the right hemisphere, in which patients are biased towards their ipsilesional - usually right - side of space while awareness for their contralesional - usually left - side is reduced or absent. Neglect is a multimodal disorder that often includes deficits in the auditory domain. Classically, auditory extinction, in which left-sided sounds that are correctly perceived in isolation are not detected in the presence of synchronous right-sided stimulation, has been considered the primary sign of auditory neglect. However, auditory extinction can also be observed after unilateral auditory cortex lesions and is thus not specific for neglect. Recent research has shown that patients with neglect are also impaired in maintaining sustained attention, on both sides, a fact that is reflected by an impairment of auditory target detection in continuous stimulation conditions. Perhaps the most impressive auditory symptom in full-blown neglect is alloacusis, in which patients mislocalize left-sided sound sources to their right, although even patients with less severe neglect still often show disturbance of auditory spatial perception, most commonly a lateralization bias towards the right. We discuss how these various disorders may be explained by a single model of neglect and review emerging interventions for patient rehabilitation.	\N	\N
25730449	To assess whether recombinant growth factor (hGH) therapy has an effect on cochlear implant (CI) performance. Two pediatric CI recipients (S1, S2) who underwent treatment with hGH for short stature were identified for review. S1 has bilateral labyrinthine dysplasia and received implants at ages 10 months (right) and 4 years 3 months (left). S2 was diagnosed with severe to progressive sensorineural hearing loss bilaterally and received a CI at age 9 years 10 months (left). Case series. Cochlear implant, hGH, and speech perception data were collected. Phonetically Balanced Kindergarten (PBK) and Consonant Nucleus Consonant (CNC) word recognition scores were reviewed to assess auditory perception. Electrode impedances, threshold levels, and comfort levels were also reviewed. After 4 months of hGH, word recognition scores for S1 were observed to decrease from 90 to 72% (right) and were stable at 40% (left). Despite troubleshooting, performance continued to decline bilaterally to 52% (right) and 28% (left), and the decision was made to discontinue hGH. One month after cessation of hGH, word recognition scores began improving to 74% (right) and 68% (left). Word recognition scores for S2 were observed to have decreased from 92% the previous year to 82% after taking hGH for 2 months. Given both our previous experience with S1 and discussions with S2's parents, hGH was discontinued after 10 months of therapy. Two months after cessation of hGH, S2's word recognition had improved to 86% (left). Our case studies illustrate that implanted children undergoing treatment with hGH may experience a decrease in speech perception, which recovers after the cessation of treatment. Since hGH use has become more prevalent in recent years, it is important to inquire whether children undergoing, or who have undergone, implantation are receiving hGH so that they may be appropriately monitored.	\N	\N
25731581	While potentially improving audibility for listeners with considerable high frequency hearing loss, the effects of implementing nonlinear frequency compression (NFC) for listeners with moderate high frequency hearing loss are unclear. The purpose of this study was to investigate the effects of activating NFC for listeners who are not traditionally considered candidates for this technology. Participants wore study hearing aids with NFC activated for a 3-4 week trial period. After the trial period, they were tested with NFC and with conventional processing on measures of consonant discrimination threshold in quiet, consonant recognition in quiet, sentence recognition in noise, and acceptableness of sound quality of speech and music. Seventeen adult listeners with symmetrical, mild to moderate sensorineural hearing loss participated. Better ear, high frequency pure-tone averages (4, 6, and 8 kHz) were 60 dB HL or better. Activating NFC resulted in lower (better) thresholds for discrimination of /s/, whose spectral center was 9 kHz. There were no other significant effects of NFC compared to conventional processing. These data suggest that the benefits, and detriments, of activating NFC may be limited for this population.	\N	\N
25731582	To evaluate wideband amplification and non-linear frequency compression (NLFC) as a means to improve speech recognition for children with mild/moderate hearing loss. Randomized within-subject design with repeated measures across test conditions. Eleven children with mild to moderate hearing loss were evaluated with: (1) Phonak BTE without NLFC, (2) Phonak BTE with NLFC, and (3) Oticon BTE with wideband response extending to 8000 Hz. Use of NLFC provided better detection and recognition of high-frequency stimuli (e.g. /sh/ and /s/). No difference in performance between conditions was observed for speech recognition when measured with the University of Western Ontario (UWO) plurals test and the UWO distinctive features difference test. Finally, there were no differences between conditions on the BKB-SIN test. Children with mild to moderate hearing loss have good access to high-frequency phonemes presented at fixed levels (e.g. 50 to 60 dBA) with both wideband and NLFC technology. Similarly, sentence recognition in noise was similar with wideband and NLFC. Adaptive test procedures that probe performance at lower input levels showed small but significant improvements in the detection and recognition of the phonemes /s/ and /sh/ with NLFC condition when compared to the NLFC Off and wideband conditions.	\N	\N
25733362	With the growing number of older adults receiving cochlear implants (CI), there is general agreement that substantial benefits can be gained. Nonetheless, variability in speech perception performance is high, and the relative contribution and interactions among peripheral, central-auditory, and cognitive factors are not fully understood. The goal of the present study was to compare auditory-cognitive processing in older-adult CI recipients with that of older normal-hearing (NH) listeners by means of behavioral and electrophysiologic manifestations of a high-load cognitive task. Auditory event-related potentials (AERPs) were recorded from 9 older postlingually deafened adults with CI (age at CI >60) and 10 age-matched listeners with NH, while performing an auditory Stroop task. Participants were required to classify the speaker's gender (male/female) that produced the words 'mother' or 'father' while ignoring the irrelevant congruent or incongruent word meaning. Older CI and NH listeners exhibited comparable reaction time, performance accuracy, and initial sensory-perceptual processing (i.e. N1 potential). Nonetheless, older CI recipients showed substantially prolonged and less efficient perceptual processing (i.e. P3 potential). Congruency effects manifested in longer reaction time (i.e. Stroop effect), execution time, and P3 latency to incongruent versus congruent stimuli in both groups in a similar fashion; however, markedly prolonged P3 and shortened execution time were evident in older CI recipients. Collectively, older adults (CI and NH) employed a combined perceptual and postperceptual conflict processing strategy; nonetheless, the relative allotment of perceptual resources was substantially enhanced to maintain adequate performance in CI recipients. In sum, the recording of AERPs together with the simultaneously obtained behavioral measures during a Stroop task exposed a differential time course of auditory-cognitive processing in older CI recipients that was not manifested in the behavioral end products of processing. These data may have implications regarding clinical evaluation and rehabilitation procedures that should be tailored specifically for this unique group of patients.	\N	\N
25734571	This study assesses attention and response control through visual and auditory stimuli in a primary care pediatric sample. The sample consisted of 191 participants aged between 7 and 13 years old. It was divided into 2 groups: (a) 90 children with ADHD, according to diagnostic (DSM-IV-TR) (APA, 2002) and clinical (ADHD Rating Scale-IV) (DuPaul, Power, Anastopoulos, & Reid, 1998) criteria, and (b) 101 children without a history of ADHD. The aims were: (a) to determine and compare the performance of both groups in attention and response control, (b) to identify attention and response control deficits in the ADHD group. Assessments were carried out using the Integrated Visual and Auditory Continuous Performance Test (IVA/CPT, Sandford & Turner, 2002). Results showed that the ADHD group had visual and auditory attention deficits, F(3, 170) = 14.38; p < .01, deficits in fine motor regulation (Welch's t-test = 44.768; p < .001) and sensory/motor activity (Welch's t-test = 95.683, p < .001; Welch's t-test = 79.537, p < .001). Both groups exhibited a similar performance in response control, F(3, 170) = .93, p = .43. Children with ADHD showed inattention, mental processing speed deficits, and loss of concentration with visual stimuli. Both groups yielded a better performance in attention with auditory stimuli.	\N	\N
25740521	Speech recognition in noise can be challenging for older adults and elicits elevated activity throughout a cingulo-opercular network that is hypothesized to monitor and modify behaviors to optimize performance. A word recognition in noise experiment was used to test the hypothesis that cingulo-opercular engagement provides performance benefit for older adults. Healthy older adults (N = 31; 50-81 years of age; mean pure tone thresholds <32 dB HL from 0.25 to 8 kHz, best ear; species: human) performed word recognition in multitalker babble at 2 signal-to-noise ratios (SNR = +3 or +10 dB) during a sparse sampling fMRI experiment. Elevated cingulo-opercular activity was associated with an increased likelihood of correct recognition on the following trial independently of SNR and performance on the preceding trial. The cingulo-opercular effect increased for participants with the best overall performance. These effects were lower for older adults compared with a younger, normal-hearing adult sample (N = 18). Visual cortex activity also predicted trial-level recognition for the older adults, which resulted from discrete decreases in activity before errors and occurred for the oldest adults with the poorest recognition. Participants demonstrating larger visual cortex effects also had reduced fractional anisotropy in an anterior portion of the left inferior frontal-occipital fasciculus, which projects between frontal and occipital regions where activity predicted word recognition. Together, the results indicate that older adults experience performance benefit from elevated cingulo-opercular activity, but not to the same extent as younger adults, and that declines in attentional control can limit word recognition.	\N	\N
25770375	Arthur Lessac developed a voice training approach that concentrated on three energies: structural action, tonal action, and consonant action. In Lessac-Madsen Resonant Voice Therapy (LMRVT), speech-language pathologists help patients achieve a resonant voice through structural posturing and awareness of tonal changes. However, LMRVT may not necessarily include the third component of Lessac's approach: consonant action. This study examines the effect that increased effort on consonant production has on the speaking voice, particularly regarding vocal loudness and projection. Audio samples were collected from eight actor participants who read a monologue using three distinct styles: normal articulation, poor articulation (elicited using a bite block), and overarticulation (elicited using a Lessac-based training intervention). Twenty graduate students of speech-language pathology listened to speech samples from the different conditions and made comparative judgments regarding articulation, loudness, and projection. Group results showed a strong correlation between the articulatory condition and the level of perceived loudness and projection. That is, as precision of articulation increased, the ratings of perceived loudness and projection increased, as well. These findings indicate that articulation treatment may have a positive influence on the perception of vocal loudness and projection. This has implications for future directions in expanding voice therapy modalities.	\N	\N
25773636	Most people derive pleasure from music. Neuroimaging studies show that the reward system of the human brain is central to this experience. Specifically, the dorsal and ventral striatum release dopamine when listening to pleasurable music, and activity in these structures also codes the reward value of musical excerpts. Moreover, the striatum interacts with cortical mechanisms involved in perception and valuation of musical stimuli. Recent studies have begun to explore individual differences in the way that this complex system functions. Development of a questionnaire for music reward experiences has allowed the identification of separable factors associated with musical pleasure, described as music-seeking, emotion-evocation, mood regulation, sensorimotor, and social factors. Applying this questionnaire to a large sample uncovered approximately 5% of the population with low sensitivity to musical reward in the absence of generalized anhedonia or depression. Further study of this group revealed that there are individuals who respond normally both behaviorally and psychophysiologically to rewards other than music (e.g., monetary value) but do not experience pleasure from music despite normal music perception ability and preserved ability to identify intended emotions in musical passages. This specific music anhedonia bears further study, as it may shed light on the function and dysfunction of the reward system.	\N	\N
25774428	The perception of near-threshold visual stimuli has been shown to depend in part on the phase (i.e., time in the cycle) of ongoing alpha (8-13 Hz) oscillations in the visual cortex relative to the onset of that stimulus. However, it is currently unknown whether the phase of the ongoing alpha activity can be manipulated by top-down factors such as attention or expectancy. Using three variants of a cross-modal attention paradigm with constant predictable stimulus onsets, we examined if cues signaling to attend to either the visual or the auditory domain influenced the phase of alpha oscillations in the associated sensory cortices. Importantly, intermixed in all three experiments, we included trials without a target to estimate the phase at target presentation without contamination from the early evoked responses. For these blank trials, at the time of expected target and distractor onset, we examined (1) the degree of the uniformity in phase angles across trials, (2) differences in phase angle uniformity compared with a pretarget baseline, and (3) phase angle differences between visual and auditory target conditions. Across all three experiments, we found that, although the cues induced a modulation in alpha power in occipital electrodes, neither the visual condition nor the auditory cue condition induced any significant phase-locking across trials during expected target or distractor presentation. These results suggest that, although alpha power can be modulated by top-down factors such as attention and expectation, the phase of the ongoing alpha oscillation is not under such control.	\N	\N
25774653	The effect of stimulation history on the perception of a current event can yield two opposite effects, namely: adaptation or hysteresis. The perception of the current event thus goes in the opposite or in the same direction as prior stimulation, respectively. In audiovisual (AV) synchrony perception, adaptation effects have primarily been reported. Here, we tested if perceptual hysteresis could also be observed over adaptation in AV timing perception by varying different experimental conditions. Participants were asked to judge the synchrony of the last (test) stimulus of an AV sequence with either constant or gradually changing AV intervals (constant and dynamic condition, respectively). The onset timing of the test stimulus could be cued or not (prospective vs. retrospective condition, respectively). We observed hysteretic effects for AV synchrony judgments in the retrospective condition that were independent of the constant or dynamic nature of the adapted stimuli; these effects disappeared in the prospective condition. The present findings suggest that knowing when to estimate a stimulus property has a crucial impact on perceptual simultaneity judgments. Our results extend beyond AV timing perception, and have strong implications regarding the comparative study of hysteresis and adaptation phenomena.	\N	\N
25781179	Lifestyle factors including smoking, noise exposure from MP3 players and alcohol consumption are considered risk factors that affect hearing synergistically. However, little is known about the association of cigarette smoking with hearing impairment among subjects whose lifestyle does not include MP3 player use or alcohol consumption. Here we show the influence of smoking on hearing among Bangladeshi subjects who maintain a lifestyle devoid of MP3 player use and alcohol consumption. A total of 184 subjects (smokers: 90; non-smokers: 94) were included, taking into account their duration and frequency of smoking. The mean hearing thresholds of non-smokers at 1, 4, 8 and 12 kHz were 5.63 ± 2.10, 8.56 ± 5.75, 21.06 ± 11.06 and 40.79 ± 20.36 decibels (dB), respectively, and those of the smokers were 7 ± 3.8, 13.27 ± 8.4, 30.66 ± 12.50 and 56.88 ± 21.58 dB, respectively. The hearing thresholds of the smokers at 4, 8 and 12 kHz were significantly (p<0.05) higher than those of the non-smokers, while no significant difference was observed at 1 kHz. We also observed no significant difference in auditory thresholds among smoker subgroups based on smoking frequency. In contrast, subjects who had smoked for a longer duration (>5 years) showed a higher auditory threshold (62.16 ± 19.87 dB) at 12 kHz compared with that (41.52 ± 19.21 dB) of subjects who had smoked for 1-5 years, and the difference was statistically significant (p<0.0002). In this study, the Brinkman Index (BI) of smokers ranged from 6 to 440, and the adjusted odds ratio showed a positive correlation between hearing loss and smoking when adjusted for age and body mass index (BMI). In addition, age, but not BMI, also played a positive role in hearing impairment at all frequencies. Thus, these findings suggest that cigarette smoking affects hearing level at all the frequencies tested but most significantly at extra-high frequencies.	\N	\N
25786320	The spatial specificity of auditory approaching and withdrawing aftereffects was investigated in an anechoic chamber. The adapting and testing stimuli were presented from loudspeakers located in front of the subject at distances of 1.1 m (near) and 4.5 m (far) from the listener's head. Approach and withdrawal of stimuli were simulated by increasing or decreasing the amplitude of the wide-noise impulse sequence. The listeners were required to determine the movement direction of the test stimulus following each 5-s adaptation period. The listeners' "withdrawal" responses were used to plot psychometric functions and to quantitatively assess the auditory aftereffect. The data summarized for all 8 participants indicated that the asymmetry of approaching and withdrawing aftereffects depended on the spatial localization of adaptor and test. The asymmetry of aftereffects was largest when adaptor and test were presented from the same loudspeaker (either near or far). Adaptation to approach induced a directionally dependent displacement of the psychometric functions relative to the control condition without adaptation, whereas adaptation to withdrawal did not. The magnitude of the approaching aftereffect was greater when adaptor and test were located in the near spatial domain than when they came from the far domain. When adaptor and test were presented from distinct loudspeakers, the magnitude of the approaching aftereffect decreased in comparison to the same spatial localization, but the aftereffect after adaptation to withdrawal increased. As a result, directionally dependent displacements of the psychometric functions relative to the control condition were observed after adaptation to both approach and withdrawal. The discrepancy between the psychometric functions obtained after adaptation to approach and to withdrawal in the near and far spatial domains was greater when adaptor and test shared the same localization than when their localizations were distinct. We assume that the observed peculiarities of approaching and withdrawing aftereffects reflect their spatial specificity. It is possible that these spatial peculiarities are associated with specialized mechanisms for the analysis of motion at different distances from the subject.	\N	\N
25786957	Performing a secondary task while listening to speech has a detrimental effect on speech processing, but the locus of the disruption within the speech system is poorly understood. Recent research has shown that cognitive load imposed by a concurrent visual task increases dependency on lexical knowledge during speech processing, but it does not affect lexical activation per se. This suggests that "lexical drift" under cognitive load occurs either as a post-lexical bias at the decisional level or as a secondary consequence of reduced perceptual sensitivity. This study aimed to adjudicate between these alternatives using a forced-choice task that required listeners to identify noise-degraded spoken words with or without the addition of a concurrent visual task. Adding cognitive load increased the likelihood that listeners would select a word acoustically similar to the target even though its frequency was lower than that of the target. Thus, there was no evidence that cognitive load led to a high-frequency response bias. Rather, cognitive load seems to disrupt sublexical encoding, possibly by impairing perceptual acuity at the auditory periphery.	\N	\N
25788705	Spatial crowding refers to impaired target identification when the target is surrounded by other stimuli in space. Temporal crowding refers to impaired target identification when the target is surrounded by other stimuli in time. Previously, when spatial and temporal crowding were measured in the fovea, they were interrelated with amblyopic observers but almost absent with normal observers (Bonneh, Sagi, & Polat, 2007). In the current study, we examined whether reliable temporal crowding can be found for normal observers with peripheral presentation (9° of eccentricity), and whether similar relations between temporal and spatial crowding will emerge. To that end, we presented a sequence of three displays separated by a varying interstimulus interval (ISI). Each display included either one letter (Experiments 1a, 1b, 1c) or three letters separated by a varying interletter spacing (Experiments 2a, 2b). One of these displays included an oriented T. Observers indicated the T's orientation. As expected, we found spatial crowding: accuracy improved as the interletter spacing increased. Critically, we also found temporal crowding: in all experiments accuracy increased as the ISI increased, even when only stimulus-onset asynchronies (SOAs) larger than 150 ms were included, ensuring this effect does not reflect mere ordinary masking. Thus, with peripheral presentation, temporal crowding also emerged for normal observers. However, only a weak interaction between temporal and spatial crowding was found.	\N	\N
25798581	Contralateral masking is the phenomenon where a masker presented to one ear affects the ability to detect a signal in the opposite ear. For normal hearing listeners, contralateral masking results in masking patterns that are both sharper and dramatically smaller in magnitude than ipsilateral masking. The goal of this study was to investigate whether medial olivocochlear (MOC) efferents are needed for the sharpness and relatively small magnitude of the contralateral masking function. To do this, bilateral cochlear implant patients were tested because, by directly stimulating the auditory nerve, cochlear implants circumvent the effects of the MOC efferents. The results indicated that, as with normal hearing listeners, the contralateral masking function was sharper than the ipsilateral masking function. However, although there was a reduction in the magnitude of the contralateral masking function compared to the ipsilateral masking function, it was relatively modest. This is in sharp contrast to the results of normal hearing listeners where the magnitude of the contralateral masking function is greatly reduced. These results suggest that MOC function may not play a large role in the sharpness of the contralateral masking function but may play a considerable role in the magnitude of the contralateral masking function.	\N	\N
25816820	The Fukuda stepping test is commonly used to assess peripheral vestibular function. It has, however, been suggested that its maximal sensitivity and specificity are 70 per cent and 50 per cent, respectively. This study was undertaken to evaluate environmental factors that may influence the reliability of this assessment and hence to 'sharpen' its use in a clinical setting. Forty-four participants aged between 20 and 43 years were asked to perform the Fukuda stepping test in both a standard clinic room and a soundproofed room under the following conditions in a randomised order: on the floor versus on foam; with and without a sound-localising source; and with and without ear defenders. Significant differences in the extent of rotation were found when comparing the results obtained in several settings, including standing on the floor in a standard room versus a soundproofed room (p = 0.036), and standing on foam in a standard room versus a soundproofed room (p = 0.015). Our results suggest that certain alterations to the test environment may improve the sensitivity of this clinical examination.	\N	\N
25832187	Previous studies have shown that discrimination sensitivity in 2AFC tasks depends on the presentation order of the standard and comparison stimulus. The present study examined whether this so-called Type B effect generalizes across different standard magnitudes. Therefore, Experiment 1 employed an auditory duration discrimination task with short (100 ms) and long (1,000 ms) standard durations and a constant interstimulus interval (ISI) of 1,000 ms. For both standard durations, a clear Type B effect emerged. In Experiment 2, discrimination sensitivity was assessed for short (300 ms) and long (1,000 ms) ISIs and a constant standard duration of 100 ms, in order to examine whether the Type B effect diminishes or even reverses when both stimuli are presented in rapid succession, as was suggested by previous studies. In the short, but not the long ISI condition, the Type B effect was virtually eliminated. Taken together, the present experiments suggest that the Type B effect is robust across standard magnitude, but diminishes when the time interval between both stimuli is reduced. This result pattern is discussed within the framework of the Internal Reference Model and the Sensation Weighting Model. It is also demonstrated that both models provide a quantitative account of the present results.	\N	\N
25878263	Amplitude modulations are fundamental features of natural signals, including human speech and nonhuman primate vocalizations. Because natural signals frequently occur in the context of other competing signals, we used a forward-masking paradigm to investigate how the modulation context of a prior signal affects cortical responses to subsequent modulated sounds. Psychophysical "modulation masking," in which the presentation of a modulated "masker" signal elevates the threshold for detecting the modulation of a subsequent stimulus, has been interpreted as evidence of a central modulation filterbank and modeled accordingly. Whether cortical modulation tuning is compatible with such models remains unknown. By recording responses to pairs of sinusoidally amplitude modulated (SAM) tones in the auditory cortex of awake squirrel monkeys, we show that the prior presentation of the SAM masker elicited persistent and tuned suppression of the firing rate to subsequent SAM signals. Population averages of these effects are compatible with adaptation in broadly tuned modulation channels. In contrast, modulation context had little effect on the synchrony of the cortical representation of the second SAM stimuli and the tuning of such effects did not match that observed for firing rate. Our results suggest that, although the temporal representation of modulated signals is more robust to changes in stimulus context than representations based on average firing rate, this representation is not fully exploited and psychophysical modulation masking more closely mirrors physiological rate suppression and that rate tuning for a given stimulus feature in a given neuron's signal pathway appears sufficient to engender context-sensitive cortical adaptation.	\N	\N
25885195	To verify the effect of long-term use of hearing aids with frequency compression on verbal behavior tests and daily activities. Thirty-two adults, aged between 30 and 60 years old, with moderate to severe sensorineural hearing loss at high frequencies with steeply sloping configuration were divided into two groups: 16 with the hearing aid frequency compression algorithm enabled and 16 with it not enabled. All participants underwent detection tests of consonant sounds, monosyllable recognition in quiet environments, identification of fricative monosyllables, and the Abbreviated Profile of Hearing Aid Benefit (APHAB) questionnaire five times throughout a 12-month trial. Detection of consonant sounds, recognition of monosyllables in quiet environments and identification of fricative monosyllables improved significantly with frequency compression enabled. Participants' APHAB scores improved whether they were adapted to the frequency compression or not. Frequency compression provides the anticipated improvement in audibility, detection of high-frequency consonant sounds, and recognition of monosyllables.	\N	\N
25913551	Since 1972, the World Health Organization (WHO) has declared noise as a pollutant. Over the last decades, the quality of the urban environment has attracted the interest of researchers due to the growing urban sprawl, especially in developing countries. The objective of this study was to evaluate the effects of noise exposure in six urban soundscapes: areas with high and low levels of noise in scenarios of leisure, work, and home. Cross-sectional study. The study was conducted in two steps: evaluation of noise levels, with the development of noise maps, and health-related inquiries. 180 individuals were interviewed, 60 in each scenario, divided into 30 exposed to a high level of noise and 30 to a low level. Chi-Square test and Ordered Logistic Regression Model (P < 0.005). 70% of the interviewees reported noticing some source of noise in the selected scenarios, and an association was observed between exposure and perception of some source of noise (P < 0.001). 41.7% of the interviewees reported some degree of annoyance, which was associated with exposure (P < 0.001). There was also an association between exposure in different scenarios and reports of poor quality of sleep (P < 0.001). In the scenarios of work and home, the chance of reporting annoyance increased when compared with the scenario of leisure. We conclude that the use of this sort of assessment may clarify the relationship between urban noise exposure and health.	\N	\N
25914528	The possible relationship between audiometric hearing thresholds and cognitive performance on language tests was analyzed in a cross-sectional cohort of older adults aged ≥65 years (N=98) with different degrees of cognitive impairment. Participants were distributed into two groups according to Reisberg's Global Deterioration Scale (GDS): a normal/predementia group (GDS scores 1-3) and a moderate/moderately severe dementia group (GDS scores 4 and 5). Hearing loss (pure-tone audiometry) and receptive and production-based language function (Verbal Fluency Test, Boston Naming Test, and Token Test) were assessed. Results showed that the dementia group achieved significantly lower scores than the predementia group in all language tests. A moderate negative correlation between hearing loss and verbal comprehension (r=-0.298; P<0.003) was observed in the predementia group (r=-0.363; P<0.007). However, no significant relationship between hearing loss and verbal fluency and naming scores was observed, regardless of cognitive impairment. In the predementia group, reduced hearing level partially explains comprehension performance but not language production. In the dementia group, hearing loss cannot be considered as an explanatory factor of poor receptive and production-based language performance. These results are suggestive of cognitive rather than simply auditory problems to explain the language impairment in the elderly.	\N	\N
25920851	Broadened auditory filters associated with sensorineural hearing loss have clearly been shown to diminish speech recognition in noise for adults, but far less is known about potential effects for children. This study examined speech recognition in noise for adults and children using simulated auditory filters of different widths. Specifically, 5 groups (20 listeners each) of adults or children (5 and 7 yrs), were asked to recognize sentences in speech-shaped noise. Seven-year-olds listened at 0 dB signal-to-noise ratio (SNR) only; 5-yr-olds listened at +3 or 0 dB SNR; and adults listened at 0 or -3 dB SNR. Sentence materials were processed both to smear the speech spectrum (i.e., simulate broadened filters), and to enhance the spectrum (i.e., simulate narrowed filters). Results showed: (1) Spectral smearing diminished recognition for listeners of all ages; (2) spectral enhancement did not improve recognition, and in fact diminished it somewhat; and (3) interactions were observed between smearing and SNR, but only for adults. That interaction made age effects difficult to gauge. Nonetheless, it was concluded that efforts to diagnose the extent of broadening of auditory filters and to develop techniques to correct this condition could benefit patients with hearing loss, especially children.	\N	\N
25994736	A follow-up experiment to those conducted by Brown and Yost [(2011). J. Acoust. Soc. Am. 130, 358-364; (2013). Basic Aspects of Hearing: Physiology and Perception (Springer, London, UK)] examined interaural time difference (ITD) discrimination for a low-frequency target noise band flanked by monotic noise bands that were either lower-frequency than the target band, higher-frequency, or both. The flanking bands were either spectrally contiguous with the target band or spectrally separated. Significant interference in ITD processing occurred in the presence of the high-frequency flanking band. Results are discussed by way of a comparison of the conditions in the present study to those in studies of binaural interference. The possible role of attention is also discussed.	\N	\N
25997868	In cochlear implant (CI) recipients with unilateral hearing loss (UHL) and normal hearing (NH) in the contralateral ear, the central auditory system receives signals of different auditory modalities, i.e. electrically via the CI ear as well as acoustically via the NH ear. The present study investigates binaural integration of bimodal stimulation in the central auditory system of 10 CI subjects with UHL by applying a modified version of the Rapidly Alternating Speech Perception (RASP) test to characterise speech recognition ability under monotic and dichotic listening arrangements. Subsequently, the results for each monotic and dichotic test condition were compared to quantify the binaural benefit from CI usage. The study results demonstrate significantly improved speech recognition under dichotic compared to monotic listening conditions, providing evidence that there is binaural integration of acoustically and electrically transmitted speech segments in the central nervous system at brainstem and cortical levels. In contrast to more commonly used tests of binaural integration, such as localisation, the RASP test provides the clinical option to investigate binaural integration involving structures at the cortical level.	\N	\N
25998097	Recommendation for cochlear implant (CI) treatment for individuals with severe to profound single-sided deafness (SSD) and asymmetrical hearing loss (AHL) is on the rise. This raises the need for greater consistency in the definition of CI candidacy for these cases and in the assessment methods of patient-related benefits to permit effective comparison and interpretation of the outcomes with both conventional and implantable options across studies. During a dedicated seminar on implant treatment in AHL patients, the panellists of the closing round table reviewed the clinical experience presented with the aim to define clear audiometric characteristics for both AHL and SSD cases, as well as a common data set enabling consistent evaluation of hearing benefits in this population. The panellists agreed on a clear differentiation between AHL and SSD CI candidates, defining average pure-tone thresholds up to 4 kHz for better and poorer ears. Agreement was reached on a minimum set of assessment procedures, and included the necessity of trials with conventional CROS/BICROS hearing aids and bone conduction devices before considering CI treatment. Objective assessment of sound localisation abilities was identified as the most relevant criterion to quantify performance before and after treatment. In parallel, subjective assessment of overall hearing ability was recommended via the Speech, Spatial and Qualities of hearing questionnaire. Longitudinal follow-up of these parameters and the hours of daily use were considered essential to reflect the potential treatment benefits for this population. The consistency in the data collection and its report will further support health authorities in their decision on acceptable gains from available hearing loss treatment options.	\N	\N
26017796	To compare the efficacy and feasibility of teleaudiometry with that of sweep audiometry in elementary school children, using pure-tone audiometry as the gold standard. A total of 243 students with a mean age of 8.3 years participated in the study. Of these, 118 were boys, and 125 were girls. The following procedures were performed: teleaudiometry screening with software that evaluates hearing at frequencies of 1000, 2000 and 4000 Hz at 25 dBHL; sweep audiometry screening in an acoustic booth (20 dBHL at the same frequencies); pure-tone audiometry thresholds in an acoustic booth (frequencies of 500, 1000, 2000 and 4000 Hz); and acoustic immittance measurements. The diagnostic capacities of the teleaudiometry/sweep audiometry screening methods were as follows: sensitivity = 58%/65%; specificity = 86%/99%; positive predictive value = 51%/91%; negative predictive value = 89%/92%; and accuracy = 81%/92%. Teleaudiometry and sweep audiometry showed moderate agreement. Furthermore, the use of these methods in series with immittance testing improved the specificity, whereas parallel testing improved the sensitivity. Teleaudiometry was found to be reliable and feasible for screening hearing in school children. Moreover, teleaudiometry is the preferred method for remote areas where specialized personnel and specific equipment are not available, and its use may reduce the costs of hearing screening programs.	\N	\N
26025759	Our fMRI study investigates auditory rhyme processing in spoken language to further elucidate the topic of functional lateralization of language processing. During scanning, 14 subjects listened to four different types of versed word strings and subsequently performed either a rhyme or a meter detection task. Our results show lateralization to auditory-related temporal regions in the right hemisphere irrespective of task. As for the left hemisphere we report responses in the supramarginal gyrus as well as in the opercular part of the inferior frontal gyrus modulated by the presence of regular meter and rhyme. The interaction of rhyme and meter was associated with increased involvement of the superior temporal sulcus and the putamen of the right hemisphere. Overall, these findings support the notion of right-hemispheric specialization for suprasegmental analyses during processing of spoken sentences and provide neuroimaging evidence for the influence of metrics on auditory rhyme processing.	\N	\N
26055197	The timely diagnosis and treatment of acquired hearing loss in the pediatric population has significant implications for a child's development. Audiological assessment in children, however, carries both technological and logistical challenges. Typically, specialized methods (such as play audiometry) are required to maintain the child's attention and can be resource intensive. These challenges were previously addressed by a novel, calibrated, interactive play audiometer for Apple® iOS® called "ShoeBOX Audiometry". This device has potential applications for deployment in environments where traditional clinical audiometry is either unavailable or impractical. The objective of this study was to assess the screening capability of the tablet audiometer in an uncontrolled environment using consumer ear-bud headphones. Consecutive patients presenting to the Audiology Clinic at the Children's Hospital of Eastern Ontario (ages 4 and older) were recruited. Participants' hearing was evaluated using the tablet audiometer calibrated to Apple® In-Ear headphones. The warble tone thresholds obtained were compared to gold standard measurements taken with a traditional clinical audiometer inside a soundbooth. 80 patients were enrolled. The majority of participants were capable of completing an audiologic assessment using the tablet computer. Due to ambient noise levels outside a soundbooth, thresholds obtained at 500 Hz were not consistent with traditional audiometry. Excluding 500 Hz thresholds, the tablet audiometer demonstrated strong negative predictive value (89.7%) as well as strong sensitivity (91.2%) for hearing loss. Thresholds obtained in an uncontrolled setting are not reflective of diagnostic thresholds due to the uncalibrated nature of the headphones and variability of the setting without a booth. Nevertheless, the tablet audiometer proved to be both a valid and sensitive instrument for unsupervised screening of warble-tone thresholds in children.	\N	\N
26065403	To date, there have been less than 30 cases of cochlear implantation (CI) in patients with superficial siderosis (SS) reported in the literature. The primary objective of the current study is to evaluate CI outcomes in six additional patients (seven ears) with SS and sensorineural hearing loss (SNHL) and to perform a systematic review of the literature. Case series and systematic review of the literature. Two tertiary academic CI centers. All patients with SS who underwent CI between 2007 and 2014. Cochlear implantation. Pre- and post-implantation speech perception scores and durability of benefit. A total of seven ears (four males; median age 52 yr) with SS and SNHL met inclusion criteria. All patients developed progressive bilateral SNHL that was no longer amenable to conventional hearing aids. Additional presenting symptoms included vestibulopathy (n = 4), cerebellar ataxia (n = 3), mild dementia (n = 1), and myelopathy (n = 1). All patients underwent uncomplicated CI, and intraoperative device telemetry revealed normal responses in all electrodes. The median postoperative auditory threshold average was 32.5 dB HL (range 16-36 dB) and the median postoperative CNC word score was 51% (range 46-64%). The median duration of follow-up was 15.5 months (range 3-64 mo). All patients demonstrated initial improvement in speech perception testing. Two patients had performance decline and worsening dementia resulting from progressive SS. Cochlear implantation is a viable strategy for auditory rehabilitation in patients with SS and associated SNHL. Most individuals enjoy benefit from CI; however, patients should be counseled regarding the risks of performance decline with progressive SS.	\N	\N
26093425	Natural auditory scenes often consist of several sound sources overlapping in time, but separated in space. Yet, location is not fully exploited in auditory grouping: spatially separated sounds can get perceptually fused into a single auditory object and this leads to difficulties in the identification and localization of concurrent sounds. Here, the brain mechanisms responsible for grouping across spatial locations were explored in magnetoencephalography (MEG) recordings. The results show that the cortical representation of a vowel spatially separated into two locations reflects the perceived location of the speech sound rather than the physical locations of the individual components. In other words, the auditory scene is neurally rearranged to bring components into spatial alignment when they were deemed to belong to the same object. This renders the original spatial information unavailable at the level of the auditory cortex and may contribute to difficulties in concurrent sound segregation.	\N	\N
26093429	Sound focusing is to create a concentrated acoustic field in the region surrounded by a loudspeaker array. This problem was tackled in the previous research via the Helmholtz integral approach, brightness control, acoustic contrast control, etc. In this paper, the same problem was revisited from the perspective of beamforming. A source array model is reformulated in terms of the steering matrix between the source and the field points, which lends itself to the use of beamforming algorithms such as minimum variance distortionless response (MVDR) and linearly constrained minimum variance (LCMV) originally intended for sensor arrays. The beamforming methods are compared with the conventional methods in terms of beam pattern, directional index, and control effort. Objective tests are conducted to assess the audio quality by using perceptual evaluation of audio quality (PEAQ). Experiments of produced sound field and listening tests are conducted in a listening room, with results processed using analysis of variance and regression analysis. In contrast to the conventional energy-based methods, the results have shown that the proposed methods are phase-sensitive in light of the distortionless constraint in formulating the array filters, which helps enhance audio quality and focusing performance.	\N	\N
26093435	Working memory capacity has been linked to performance on many higher cognitive tasks, including the ability to perceive speech in noise. Current efforts to train working memory have demonstrated that working memory performance can be improved, suggesting that working memory training may lead to improved speech perception in noise. A further advantage of working memory training to improve speech perception in noise is that working memory training materials are often simple, such as letters or digits, making them easily translatable across languages. The current effort tested the hypothesis that working memory training would be associated with improved speech perception in noise and that materials would easily translate across languages. Native Mandarin Chinese and native English speakers completed ten days of reversed digit span training. Reading span and speech perception in noise both significantly improved following training, whereas untrained controls showed no gains. These data suggest that working memory training may be used to improve listeners' speech perception in noise and that the materials may be quickly adapted to a wide variety of listeners.	\N	\N
26093448	Physiological measures of neural activity in the auditory cortex have revealed plasticity following unilateral deafness. Central projections from the remaining ear reorganize to produce a stronger cortical response than normal. However, little is known about the perceptual consequences of this increase. One possibility is improved sound intensity discrimination. Intensity difference limens were measured in 11 individuals with unilateral deafness that were previously shown to exhibit increased cortical activity to sounds heard by the intact ear. Significantly smaller mean difference limens were observed compared with controls. These results provide evidence of the perceptual consequences of plasticity in humans following unilateral deafness.	\N	\N
26107084	To examine the voice and personality characteristics of patients diagnosed with organic dysphonia secondary to vocal fold immobility. The study comprised patients of both genders, attending the Clinic School of Speech Therapy of the Federal University of Paraíba, with otorhinolaryngological diagnosis of vocal fold immobility and speech therapy diagnosis of dysphonia. The self-assessment of voice was measured through a Vocal Screening Protocol and Voice Symptoms Scale (VoiSS), the voice was collected for auditory-perceptive evaluation, and the Factorial Personality Battery (FPB) was used. Descriptive statistical analysis was performed to determine the frequency, mean, and standard deviation of the studied variables. Eight patients participated in the study, of both genders, with average age of 40.4 ± 16.9 years. The more frequent risk factors were the personal ones (4.7 ± 2.1). In the VoiSS, the patients presented a higher average in the limitation score (34.1 ± 15.7). From the auditory-perceptive evaluation, moderate intensity of vocal deviation was obtained, with predominant vocal roughness (57.7 ± 25.2). In the FPB, the patients had an average higher than the cutoff scores in neuroticism (3.8 ± 1.4) and accomplishment (5.2 ± 1.0). The predominant vocal parameter was roughness. The patients reported few risk factors that compromise vocal behavior, and neuroticism and accomplishment stood out as factors in their personality. Thus, individuals with vocal fold immobility show personality characteristics that may be a reflection of their voice disorder, not a factor that determines their dysphonia.	\N	\N
26121827	To assess the clinical effectiveness of cochlear implantation in prelingually deaf children implanted at different ages, so as to provide reasonable expectations for patients and guidance for clinical treatment. Electronic databases PubMed, YZ365.COM, WANFANG DATA, CMJD, CHKD, CNKI were searched using relevant keywords. Extracted data included author, year of publication, diagnosis, etc. Reported treatment outcomes were clustered into speech discrimination and hearing abilities. Meta-analyses were performed on studies with numerical results using random or fixed effects models. There were eight randomized controlled studies including 442 patients. Comparing speech perception of prelingually deaf children implanted younger than three years old (experimental group) and at 3-6 years old (control group), results at three and six months after operation showed that the experimental group performed significantly worse than the control group; results at 12 months after operation showed that the experimental group performed significantly better than the control group. Comparing hearing abilities, results at three and six months after operation showed that the experimental group performed significantly worse than the control group; results at 12 months after operation showed that the experimental group performed significantly better than the control group. Comparing speech perception of children implanted younger or older than 4.5 years old showed that 1.5-2 years after operation children implanted younger than 4.5 years of age performed significantly better than children implanted older than 4.5 years old. Comparing speech perception of 7-12 years old children showed that 3, 6, and 12 months after operation patients of 7-12 years old performed significantly better than those children older than 12 years old. 
Comparing speech perception of patients implanted younger or older than 18 years old (7-14 years old was group A, > 14-18 years old was group B, older than 18 years old was group C) showed that one and four years after operation A > B > C, and there were significant differences among them. Comparing warble tone threshold average (WTA) showed that one year after operation A < B < C, and there were significant differences among them. However, four years after operation, there was no significant difference among them. Prelingually deafened patients implanted younger than three years old who persist in scientific rehabilitation training over a long period can achieve the optimal recovery. Older patients are advised to receive cochlear implantation as early as possible. The longer they have been implanted, the better their results. Moreover, the younger they are at implantation, the faster their postoperative language progress. Further controlled studies with longer follow-up periods and more participants may make the evidence for the effectiveness of cochlear implantation more reliable.	\N	\N
26152053	During the first years of life, sensory modalities communicate with each other. This process is fundamental for the development of unisensory and multisensory skills. The absence of one sensory input impacts on the development of other modalities. Since 2008 we have studied these aspects and developed our cross-sensory calibration theory. This theory emerged from the observation that children start to integrate multisensory information (such as vision and touch) only after 8-10 years of age. Before this age the more accurate sense teaches (calibrates) the others; when one calibrating modality is missing, the other modalities are impaired. Children with visual disability have problems in understanding the haptic or auditory perception of space and children with motor disabilities have problems in understanding the visual dimension of objects. This review presents our recent studies on multisensory integration and cross-sensory calibration in children and adults with and without sensory and motor disabilities. The goal of this review is to show the importance of interaction between sensory systems during the early period of life in order for correct perceptual development to occur.	\N	\N
26152058	Echolocation can be used by blind and sighted humans to navigate their environment. The current study investigated the neural activity underlying processing of path direction during walking. Brain activity was measured with fMRI in three blind echolocation experts, and three blind and three sighted novices. During scanning, participants listened to binaural recordings that had been made prior to scanning while echolocation experts had echolocated during walking along a corridor which could continue to the left, right, or straight ahead. Participants also listened to control sounds that contained ambient sounds and clicks, but no echoes. The task was to decide if the corridor in the recording continued to the left, right, or straight ahead, or if they were listening to a control sound. All participants successfully dissociated echo from no echo sounds, however, echolocation experts were superior at direction detection. We found brain activations associated with processing of path direction (contrast: echo vs. no echo) in superior parietal lobule (SPL) and inferior frontal cortex in each group. In sighted novices, additional activation occurred in the inferior parietal lobule (IPL) and middle and superior frontal areas. Within the framework of the dorso-dorsal and ventro-dorsal pathway proposed by Rizzolatti and Matelli (2003), our results suggest that blind participants may automatically assign directional meaning to the echoes, while sighted participants may apply more conscious, high-level spatial processes. High similarity of SPL and IFC activations across all three groups, in combination with previous research, also suggest that all participants recruited a multimodal spatial processing system for action (here: locomotion).	\N	\N
26177161	Code-blends (simultaneous words and signs) are a unique characteristic of bimodal bilingual communication. Using fMRI, we investigated code-blend comprehension in hearing native ASL-English bilinguals who made a semantic decision (edible?) about signs, audiovisual words, and semantically equivalent code-blends. English and ASL recruited a similar fronto-temporal network with expected modality differences: stronger activation for English in auditory regions of bilateral superior temporal cortex, and stronger activation for ASL in bilateral occipitotemporal visual regions and left parietal cortex. Code-blend comprehension elicited activity in a combination of these regions, and no cognitive control regions were additionally recruited. Furthermore, code-blends elicited reduced activation relative to ASL presented alone in bilateral prefrontal and visual extrastriate cortices, and relative to English alone in auditory association cortex. Consistent with behavioral facilitation observed during semantic decisions, the findings suggest that redundant semantic content induces more efficient neural processing in language and sensory regions during bimodal language integration.	\N	\N
26185045	Discourse structure enables us to generate expectations based upon linguistic material that has already been introduced. The present magnetoencephalography (MEG) study addresses auditory perception of test sentences in which discourse coherence was manipulated by using presuppositions (PSP) that either correspond or fail to correspond to items in preceding context sentences with respect to uniqueness and existence. Context violations yielded delayed auditory M50 and enhanced auditory M200 cross-correlation responses to syllable onsets within an analysis window of 1.5s following the PSP trigger words. Furthermore, discourse incoherence yielded suppression of spectral power within an expanded alpha band ranging from 6 to 16Hz. This effect showed a bimodal temporal distribution, being significant in an early time window of 0.0-0.5s following the PSP trigger and a late interval of 2.0-2.5s. These findings indicate anticipatory top-down mechanisms interacting with various aspects of bottom-up processing during speech perception.	\N	\N
26185046	A number of studies have shown that from an early age, bilinguals outperform their monolingual peers on executive control tasks. We previously found that bilingual children and adults also display greater attention to unexpected language switches within speech. Here, we investigated the effect of a bilingual upbringing on speech perception in one language. We recorded monolingual and bilingual toddlers' event-related potentials (ERPs) to spoken words preceded by pictures. Words matching the picture prime elicited an early frontal positivity in bilingual participants only, whereas later ERP amplitudes associated with semantic processing did not differ between groups. These results add to the growing body of evidence that bilingualism increases overall attention during speech perception whilst semantic integration is unaffected.	\N	\N
26200250	Event-related brain potentials (ERPs) demonstrate that human auditory cortical responses are sensitive to changes in static pitch as indexed by the pitch onset response (POR), a negativity generated at the initiation of acoustic periodicity. Yet, it is still unclear if this brain signature is sensitive to dynamic, time-varying properties of pitch more characteristic of those found in naturalistic speech and music. Neuroelectric PORs were recorded in response to contrastive pitch patterns differing in their pitch height, time-variance, and directionality (i.e., rise vs. fall). Broadband noise followed by contiguous iterated rippled noise (producing salient pitch sweeps) was used to temporally separate neural activity coding the onset of acoustic energy from the onset of time-varying pitch. Analysis of PORs revealed distinct modulations in response latency that distinguished static from time-varying pitch contours (steady-state<dynamic) and pitch height (high<low). However, PORs were insensitive to the direction of pitch sweeps (rise=fall). Our findings suggest that the POR signature provides a useful neural index of auditory cortical pitch processing for some, but not all pitch-evoking stimuli.	\N	\N
26222937	To analyze the occurrence of acoustic reflex and its threshold on newborns using the 226 and 1,000 Hz probes. Thirty-six newborns with "PASS" results in newborn hearing screening and tympanogram with one or two peaks for both probe tones were included. Group I comprised 20 full-term newborns without risk indicator for hearing loss, and Group II comprised 16 newborns with at least one risk indicator. The study about ipsilateral acoustic reflex thresholds was conducted in 500, 1,000, 2,000, and 4,000 Hz. The groups presented the acoustic reflex thresholds between 50 and 100 dB for both probe tones. In the comparison between the probes, there were differences in all frequencies evaluated in Group I, with the lowest threshold mean for the 1,000 Hz probe. In Group II, differences were detected at 2,000 Hz. The mean acoustic reflex thresholds were similar in both groups for the 226 Hz probe. There was a difference for the 1,000 Hz probe in all tested frequencies. The percentage of response was higher in both groups for the 1,000 Hz probe. The kappa test showed extremely poor agreement in the comparison of results between both probes. The occurrence of acoustic reflex was higher in newborns and its thresholds were lower with the 1,000 Hz probe both for healthy newborns and for newborns at risk.	\N	\N
26290244	Plasticity in the visual cortex of blind individuals provides a rare window into the mechanisms of cortical specialization. In the absence of visual input, occipital ("visual") brain regions respond to sound and spoken language. Here, we examined the time course and developmental mechanism of this plasticity in blind children. Nineteen blind and 40 sighted children and adolescents (4-17 years old) listened to stories and two auditory control conditions (unfamiliar foreign speech, and music). We find that "visual" cortices of young blind (but not sighted) children respond to sound. Responses to nonlanguage sounds increased between the ages of 4 and 17. By contrast, occipital responses to spoken language were maximal by age 4 and were not related to Braille learning. These findings suggest that occipital plasticity for spoken language is independent of plasticity for Braille and for sound. We conclude that in the absence of visual input, spoken language colonizes the visual system during brain development. Our findings suggest that early in life, human cortex has a remarkably broad computational capacity. The same cortical tissue can take on visual perception and language functions. Studies of plasticity provide key insights into how experience shapes the human brain. The "visual" cortex of adults who are blind from birth responds to touch, sound, and spoken language. To date, all existing studies have been conducted with adults, so little is known about the developmental trajectory of plasticity. We used fMRI to study the emergence of "visual" cortex responses to sound and spoken language in blind children and adolescents. We find that "visual" cortex responses to sound increase between 4 and 17 years of age. By contrast, responses to spoken language are present by 4 years of age and are not related to Braille-learning. These findings suggest that, early in development, human cortex can take on a strikingly wide range of functions.	\N	\N
26323201	To determine if differences between dyslexic and typical readers in their reading scores and verbal IQ are evident as early as first grade and whether the trajectory of these differences increases or decreases from childhood to adolescence. The subjects were the 414 participants comprising the Connecticut Longitudinal Study, a sample survey cohort, assessed yearly from 1st to 12th grade on measures of reading and IQ. Statistical analysis employed longitudinal models based on growth curves and multiple groups. As early as first grade, compared with typical readers, dyslexic readers had lower reading scores and verbal IQ, and their trajectories over time never converge with those of typical readers. These data demonstrate that such differences are not so much a function of increasing disparities over time but instead because of differences already present in first grade between typical and dyslexic readers. The achievement gap between typical and dyslexic readers is evident as early as first grade, and this gap persists into adolescence. These findings provide strong evidence and impetus for early identification of and intervention for young children at risk for dyslexia. Implementing effective reading programs as early as kindergarten or even preschool offers the potential to close the achievement gap.	\N	\N
26336746	The dynamics of theta-band activity during sessions of listening to an acoustic image of one's own EEG were investigated. The acoustic image of the EEG was generated with a significant reduction of musical properties. It is shown that the increase in theta-range activity depends on the level of synchronization and consistency of the presentation of the acoustic image of one's own EEG relative to the current bioelectrical activity of the brain. The maximum increase in theta-range activity was observed with minimum time delay and maximum consistency of the sounds with the current EEG. It is concluded that the increase in theta-range activity while listening to the acoustic image of one's own EEG is determined by the correlation of the sounds with the current bioelectrical activity of the brain.	\N	\N
26377472	Human cortex is comprised of specialized networks that support functions, such as visual motion perception and language processing. How do genes and experience contribute to this specialization? Studies of plasticity offer unique insights into this question. In congenitally blind individuals, "visual" cortex responds to auditory and tactile stimuli. Remarkably, recent evidence suggests that occipital areas participate in language processing. We asked whether in blindness, occipital cortices: (1) develop domain-specific responses to language and (2) respond to a highly specialized aspect of language: syntactic movement. Nineteen congenitally blind and 18 sighted participants took part in two fMRI experiments. We report that in congenitally blind individuals, but not in sighted controls, "visual" cortex is more active during sentence comprehension than during a sequence memory task with nonwords, or a symbolic math task. This suggests that areas of occipital cortex become selective for language, relative to other similar higher-cognitive tasks. Crucially, we find that these occipital areas respond more to sentences with syntactic movement but do not respond to the difficulty of math equations. We conclude that regions within the visual cortex of blind adults are involved in syntactic processing. Our findings suggest that the cognitive function of human cortical areas is largely determined by input during development. Human cortex is made up of specialized regions that perform different functions, such as visual motion perception and language processing. How do genes and experience contribute to this specialization? Studies of plasticity show that cortical areas can change function from one sensory modality to another. Here we demonstrate that input during development can alter cortical function even more dramatically. In blindness a subset of "visual" areas becomes specialized for language processing. 
Crucially, we find that the same "visual" areas respond to a highly specialized and uniquely human aspect of language: syntactic movement. These data suggest that human cortex has broad functional capacity during development, and input plays a major role in determining functional specialization.	\N	\N
26380997	The usage of personal listening devices (PLDs) is associated with risks of hearing loss. The aim of this study is to evaluate the effects of music exposure from these devices on high-frequency hearing thresholds of PLD users. A total of 282 young adults were questioned regarding their listening habits and symptoms associated with PLD listening. Their audiogram thresholds were determined at high (3-8 kHz) frequencies and extended high frequencies (EHFs, 9-16 kHz). The preferred listening volumes of PLD users were used to compute their overall 8-h equivalent music exposure levels (LAeq8h). Approximately 80% of the subjects were regular PLD users. Of these, 20.1% had LAeq8h of ≥75 dBA, while 4.4% of them had LAeq8h of ≥85 dBA, which carries a high risk of hearing damage. Compared with those exposed to LAeq8h of <75 dBA, subjects who had LAeq8h of ≥75 dBA reported a significantly higher incidence of tinnitus and difficulty in hearing others immediately after using PLDs. PLD users who were exposed to LAeq8h of ≥75 dBA and had been using their devices for ≥4 years also showed significantly higher mean audiogram thresholds compared with non-users at most EHFs tested. In addition, the thresholds of PLD users at EHFs showed a weak but significant positive correlation with their LAeq8h. The present findings suggest that excessive exposure to music among PLD users may lead to initial effects on their hearing at very high frequencies.	\N	\N
26536965	It has been shown that musicians are at risk of noise-induced hearing loss. The aim of the study was to evaluate the temporary changes in hearing of orchestral musicians after group rehearsals. The study group comprised 18 orchestral musicians, aged 30-58 years (mean: 40 years), with 12-40 years (mean: 22 years) of professional experience. The temporary changes in hearing after group rehearsals were determined using transient-evoked otoacoustic emissions (TEOAEs). Noise exposures during group rehearsals were also evaluated. Musicians' hearing threshold levels were higher (worse) than expected for the equivalent non-noise-exposed population. Moreover, high-frequency notched audiograms were observed in some of them. During rehearsals, musicians were exposed to orchestral noise at A-weighted equivalent-continuous sound pressure levels (normalized to an 8-h working day) ranging from 75.6 to 83.1 dB (mean: 79.4 dB). After rehearsals, significant reductions of TEOAE amplitudes (approx. 0.7 dB) were noted both for the total response and for the frequency bands of 2000 and 3000 Hz. However, there were no significant differences between pre- and postexposure reproducibility of TEOAE. The results confirm that orchestral musicians are at risk of hearing loss due to their professional activities, even at exposures to orchestral noise below the limit values for occupational noise.	\N	\N
26538659	Deficits in auditory emotion recognition (AER) are a core feature of schizophrenia and a key component of social cognitive impairment. AER deficits are tied behaviorally to impaired ability to interpret tonal ("prosodic") features of speech that normally convey emotion, such as modulations in base pitch (F0M) and pitch variability (F0SD). These modulations can be recreated using synthetic frequency modulated (FM) tones that mimic the prosodic contours of specific emotional stimuli. The present study investigates neural mechanisms underlying impaired AER using a combined event-related potential/resting-state functional connectivity (rsfMRI) approach in 84 schizophrenia/schizoaffective disorder patients and 66 healthy comparison subjects. Mismatch negativity (MMN) to FM tones was assessed in 43 patients/36 controls. rsfMRI between auditory cortex and medial temporal (insula) regions was assessed in 55 patients/51 controls. The relationship between AER, MMN to FM tones, and rsfMRI was assessed in the subset who performed all assessments (14 patients, 21 controls). As predicted, patients showed robust reductions in MMN across FM stimulus type (p = 0.005), particularly to modulations in F0M, along with impairments in AER and FM tone discrimination. MMN source analysis indicated dipoles in both auditory cortex and anterior insula, whereas rsfMRI analyses showed reduced auditory-insula connectivity. MMN to FM tones and functional connectivity together accounted for ∼50% of the variance in AER performance across individuals. These findings demonstrate that impaired preattentive processing of tonal information and reduced auditory-insula connectivity are critical determinants of social cognitive dysfunction in schizophrenia, and thus represent key targets for future research and clinical intervention. 
Schizophrenia patients show deficits in the ability to infer emotion based upon tone of voice [auditory emotion recognition (AER)] that drive impairments in social cognition and global functional outcome. This study evaluated neural substrates of impaired AER in schizophrenia using a combined event-related potential/resting-state fMRI approach. Patients showed impaired mismatch negativity response to emotionally relevant frequency modulated tones along with impaired functional connectivity between auditory and medial temporal (anterior insula) cortex. These deficits contributed in parallel to impaired AER and accounted for ∼50% of variance in AER performance. Overall, these findings demonstrate the importance of both auditory-level dysfunction and impaired auditory/insula connectivity in the pathophysiology of social cognitive dysfunction in schizophrenia.	\N	\N
26562889	Visual search is an essential task for many lifesaving professions; airport security personnel search baggage X-ray images for dangerous items and radiologists examine radiographs for tumors. Accuracy is critical for such searches; however, there are potentially negative influences that can affect performance; for example, the displays can be cluttered and can contain multiple targets. Previous research has demonstrated that clutter can hurt search performance and a second target is less likely to be detected in a multiple-target search after a first target has been found, which raises a concern: how does clutter affect multiple-target search performance? The current study explored clutter in a multiple-target search paradigm, where there could be one or two targets present, and targets appeared in varying levels of clutter. There was a significant interaction between clutter and target number: Increasing levels of clutter did not affect single-target detection but did reduce detection of a second target. Multiple-target search accuracy is known to be sensitive to contextual influences, and the current results reveal a specific effect wherein clutter disproportionally affected multiple-target search accuracy. These results suggest that the detection and processing of a first target might enhance the masking effects of clutter around a second target.	\N	\N
26575193	We examined short-term memory for sequences of visual stimuli embedded in varying multisensory contexts. In two experiments, subjects judged the structure of the visual sequences while disregarding concurrent, but task-irrelevant auditory sequences. Stimuli were eight-item sequences in which varying luminances and frequencies were presented concurrently and rapidly (at 8 Hz). Subjects judged whether the final four items in a visual sequence identically replicated the first four items. Luminances and frequencies in each sequence were either perceptually correlated (Congruent) or were unrelated to one another (Incongruent). Experiment 1 showed that, despite encouragement to ignore the auditory stream, subjects' categorization of visual sequences was strongly influenced by the accompanying auditory sequences. Moreover, this influence tracked the similarity between a stimulus's separate audio and visual sequences, demonstrating that task-irrelevant auditory sequences underwent a considerable degree of processing. Using a variant of Hebb's repetition design, Experiment 2 compared musically trained subjects and subjects who had little or no musical training on the same task as used in Experiment 1. Test sequences included some that intermittently and randomly recurred, which produced better performance than sequences that were generated anew for each trial. The auditory component of a recurring audiovisual sequence influenced musically trained subjects more than it did other subjects. This result demonstrates that stimulus-selective, task-irrelevant learning of sequences can occur even when such learning is an incidental by-product of the task being performed.	\N	\N
26753216	The stress response has been well documented in past music therapy literature. However, hypometabolism, or the relaxation response, has received much less attention. Music therapists have long utilized various music-assisted relaxation techniques with both live and recorded music to elicit such a response. The ongoing proliferations of relaxation music through commercial media and the dire lack of evidence to support such claims warrant attention from healthcare professionals and music therapists. The purpose of these 3 studies was to investigate the correlational relationships between 12 psychophysical properties of music, preference, familiarity, and degree of perceived relaxation in music. Fourteen music therapists recommended and analyzed 30 selections of relaxation music. A group of 80 healthy adults then rated their familiarity, preference, and degree of perceived relaxation in the music. The analysis provided a detailed description of the intrinsic properties in music that were perceived to be relaxing by listeners. These properties included tempo, mode, harmonic, rhythmic, instrumental, and melodic complexities, timbre, vocalization/lyrics, pitch range, dynamic variations, and contour. In addition, music preference was highly correlated with listeners' perception of relaxation in music for both music therapists and healthy adults. The correlation between familiarity and degree of relaxation reached significance in the healthy adult group. Results from this study provided an in-depth operational definition of the intrinsic parameters in relaxation music and also highlighted the importance of preference and familiarity in eliciting the relaxation response.	\N	\N
26891543	PROBLEM/OBJECTIVES: Maxillary constriction and high palatal arch are associated with increased risk of chronic eustachian tube dysfunction and conductive hearing loss (CHL) due to chronic effusion. However, this relationship has not been clearly demonstrated. This study assessed CHL in school children with a narrowed maxilla and deep palatal vault. Thirty-two children with maxillary constriction were randomly selected for the study group and 28 children with normal transverse maxillary development were selected for the control group. Pure-tone audiograms were obtained for all children, and hearing levels and air-bone gaps were measured. Air-bone gap measurements in the control group ranged from 5.50 to 14.50 decibels (dB), and in the study group they were between 5.00 and 24.00 dB. In the study group, 14 (43.8%) children had slight CHL, and the remaining 18 (56.2%) children had normal hearing levels. In the control group, all of the children had normal hearing levels. Hearing levels and air-bone gaps were greater in the study group than the control group. This study showed that children with a narrowed maxilla and deep palatal vault may have slight CHL. Therefore, the onset of CHL should be followed with hearing screening programs.	\N	\N
26941686	Actions that produce sounds infuse our daily lives. Some of these sounds are a natural consequence of physical interactions (such as a clang resulting from dropping a pan), but others are artificially designed (such as a beep resulting from a keypress). Although the relationship between actions and sounds has previously been examined, the frame of reference of these associations is still unknown, despite it being a fundamental property of a psychological representation. For example, when an association is created between a keypress and a tone, it is unclear whether the frame of reference is egocentric (gesture-sound association) or exocentric (key-sound association). This question is especially important for artificially created associations, which occur in technology that pairs sounds with actions, such as gestural interfaces, virtual or augmented reality, and simple buttons that produce tones. The frame of reference could directly influence the learnability, the ease of use, the extent of immersion, and many other factors of the interaction. To explore whether action-sound associations are egocentric or exocentric, an experiment was implemented using a computer keyboard's number pad wherein moving a finger from one key to another produced a sound, thus creating an action-sound association. Half of the participants received egocentric instructions to move their finger with a particular gesture. The other half of the participants received exocentric instructions to move their finger to a particular number on the keypad. All participants were performing the same actions, and only the framing of the action varied between conditions by altering task instructions. Participants in the egocentric condition learned the gesture-sound association, as revealed by a priming paradigm. However, the exocentric condition showed no priming effects. This finding suggests that action-sound associations are egocentric in nature. 
A second part of the same session further confirmed the egocentric nature of these associations by showing no change in the priming effect after moving to a different starting location. Our findings are consistent with an egocentric representation of action-sound associations, which could have implications for applications that utilize these associations.	\N	\N
22619989	The paper concerns the noise exposure of State Police officers associated with the use of firearms. The noise generated by these weapons is of short duration and high intensity. The research was carried out during firearm training sessions of State Police officers to assess exposure to noise. The results of the audiometric and phonometric investigations demonstrated a significant exposure and a temporary increase in the hearing threshold above the frequency of 6000 Hz. Even taking into account the attenuation provided by the use of headphones, the exposure was shown to be above the statutory limits, as was confirmed by the temporary hearing threshold shift.	\N	\N
