Analysis, Synthesis, and Perception of Voice Quality Variations Among Female and Male Talkers
Overview
Voice quality variations include a set of voicing sound source modifications ranging from laryngealized to normal to breathy phonation. Analysis of reiterant imitations of two sentences by ten female and six male talkers has shown that the potential acoustic cues to this type of voice quality variation include: (1) an increase in the relative amplitude of the fundamental frequency component as open quotient increases; (2) an increase in the amount of aspiration noise that replaces higher frequency harmonics as the arytenoids become more separated; (3) an increase in lower formant bandwidths; and (4) the introduction of extra pole-zero pairs in the vocal-tract transfer function associated with tracheal coupling. The relative perceptual importance of these cues for signaling a breathy voice quality has been assessed using a new voicing source model for synthesis of more natural male and female voices. The new formant synthesizer, KLSYN88, is fully documented here. Results of the perception study indicate that, contrary to previous research emphasizing the importance of increased amplitude of the fundamental component, aspiration noise is perceptually most important. Without aspiration noise, increasing the amplitude of the fundamental component may instead induce a sensation of nasality in a high-pitched voice. Further results of the acoustic analysis include the observations that: (1) over the course of a sentence, the acoustic manifestations of breathiness vary considerably, tending to increase in unstressed syllables, in utterance-final syllables, and at the margins of voiceless consonants; (2) on average, females are more breathy than males, but there are very large differences between subjects within each gender; (3) many utterances appear to end in a "breathy-laryngealized" type of vibration; and (4) diplophonic irregularities in the timing of glottal periods occur frequently, especially at the end of an utterance. Diplophonia and other deviations from perfect periodicity may be important aspects of naturalness in synthesis.
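Cue (1), the relative amplitude of the fundamental component, is often quantified as the level difference between the first and second harmonics (H1 minus H2, in dB). The sketch below illustrates that idea on a toy two-harmonic signal. It is an assumption for illustration only, not necessarily the measurement procedure used in the study; the function names `harmonic_level_db`, `h1_minus_h2_db`, and `toy_vowel`, and all parameter values, are hypothetical.

```python
import numpy as np


def harmonic_level_db(signal, sample_rate, freq, search_hz=20.0):
    """Peak spectral level in dB within +/- search_hz of freq (hypothetical helper)."""
    windowed = signal * np.hanning(len(signal))  # Hann window limits spectral leakage
    spectrum = np.abs(np.fft.rfft(windowed))
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / sample_rate)
    band = (freqs >= freq - search_hz) & (freqs <= freq + search_hz)
    return 20.0 * np.log10(spectrum[band].max() + 1e-12)


def h1_minus_h2_db(signal, sample_rate, f0):
    """Level of the fundamental relative to the second harmonic, in dB."""
    h1 = harmonic_level_db(signal, sample_rate, f0)
    h2 = harmonic_level_db(signal, sample_rate, 2.0 * f0)
    return h1 - h2


def toy_vowel(f0, h1_amp, h2_amp, sample_rate=16000, duration=0.5):
    """Two-sinusoid stand-in for a voiced sound (assumed test signal, not real speech)."""
    t = np.arange(int(sample_rate * duration)) / sample_rate
    return h1_amp * np.sin(2 * np.pi * f0 * t) + h2_amp * np.sin(2 * np.pi * 2 * f0 * t)


if __name__ == "__main__":
    sr = 16000
    modal = toy_vowel(200.0, h1_amp=1.0, h2_amp=1.0, sample_rate=sr)
    breathy = toy_vowel(200.0, h1_amp=1.0, h2_amp=0.5, sample_rate=sr)
    print("modal-like   H1-H2 (dB):", round(h1_minus_h2_db(modal, sr, 200.0), 2))
    print("breathy-like H1-H2 (dB):", round(h1_minus_h2_db(breathy, sr, 200.0), 2))
```

On real speech, the fundamental frequency would first have to be estimated, and the harmonic levels are also shaped by the first formant, so a raw H1 minus H2 value is only a rough proxy for open quotient. As the abstract notes, this cue alone was not the perceptually dominant one; aspiration noise was.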