The processing and perception of size information in speech sounds
- 1 January 2005
- journal article
- research article
- Published by Acoustical Society of America (ASA) in The Journal of the Acoustical Society of America
- Vol. 117 (1), 305-318
- https://doi.org/10.1121/1.1828637
Abstract
There is information in speech sounds about the length of the vocal tract; specifically, as a child grows, the resonators in the vocal tract grow and the formant frequencies of the vowels decrease. It has been hypothesized that the auditory system applies a scale transform to all sounds to segregate size information from resonator shape information, and thereby enhance both size perception and speech recognition [Irino and Patterson, Speech Commun. 36, 181–203 (2002)]. This paper describes size discrimination experiments and vowel recognition experiments designed to provide evidence for an auditory scaling mechanism. Vowels were scaled to represent people with vocal tracts much longer and shorter than normal, and with pitches much higher and lower than normal. The results of the discrimination experiments show that listeners can make fine judgments about the relative size of speakers, and they can do so for vowels scaled well beyond the normal range. Similarly, the recognition experiments show good performance for vowels in the normal range, and for vowels scaled well beyond the normal range of experience. Together, the experiments support the hypothesis that the auditory system automatically normalizes for the size information in communication sounds.This publication has 32 references indexed in Scilit:
- STRAIGHT: A new speech synthesizer for vowel formant discriminationAcoustics Research Letters Online, 2004
- Restructuring speech representations using a pitch-adaptive time–frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in soundsSpeech Communication, 1999
- A time-domain, level-dependent auditory filter: The gammachirpThe Journal of the Acoustical Society of America, 1997
- The Lack of A Priori Distinctions Between Learning AlgorithmsNeural Computation, 1996
- Time-domain modeling of peripheral auditory processing: A modular architecture and a software platformThe Journal of the Acoustical Society of America, 1995
- The identification of vowel-like harmonic complexes: Effects of component phase, level, and fundamental frequencyThe Journal of the Acoustical Society of America, 1995
- The scale representationIEEE Transactions on Signal Processing, 1993
- Physiologic and acoustic differences between male and female voicesThe Journal of the Acoustical Society of America, 1989
- Signal and masker uncertainty in intensity discriminationThe Journal of the Acoustical Society of America, 1981
- Control Methods Used in a Study of the VowelsThe Journal of the Acoustical Society of America, 1952