TY - JOUR
T1 - Using lexical stress, speech rate, rhythm, and pauses to characterize and normalize second language speech intelligibility
AU - Kang, Okim
AU - Hirschi, Kevin
AU - Hansen, John
AU - Looney, Stephen
AU - Miao, Yongzhi
N1 - Publisher Copyright:
© 2022 Acoustical Society of America. All rights reserved.
PY - 2022/12/5
Y1 - 2022/12/5
N2 - While a range of measures based on speech production, language, and perception are possible for the predication and estimation of speech intelligibility (Kang et al., 2020), what constitutes second language (L2) intelligibility remains under-defined. Prosodic and temporal features (i.e., stress, speech rate, rhythm, and pause placement) have been shown to impact listener perception (Kang et al., 2020), but their relationship with highly intelligible speech is yet unclear. This study aimed to characterize the L2 speech intelligibility. Acoustic analyses, including PRAAT and Python scripts, were conducted on 405 speech samples (30 seconds each) from 102 L2 English speakers with a wide variety of backgrounds, proficiency levels, and intelligibility levels. The results indicate that highly intelligible speakers of English employ between 2-4 syllables per second and that higher or lower speeds are less intelligible. Silent pauses between 0.3 and 0.8 seconds were associated with the highest levels of intelligibility. Rhythm, as measured by Δ syllable length of all content syllables, was marginally associated with intelligibility. Finally, lexical stress accuracy did not interfere substantially with intelligibility until less than 70% of the polysyllabic words were incorrect. Findings inform the fields of first and second language research as well as language education and pathology.
AB - While a range of measures based on speech production, language, and perception are possible for the predication and estimation of speech intelligibility (Kang et al., 2020), what constitutes second language (L2) intelligibility remains under-defined. Prosodic and temporal features (i.e., stress, speech rate, rhythm, and pause placement) have been shown to impact listener perception (Kang et al., 2020), but their relationship with highly intelligible speech is yet unclear. This study aimed to characterize the L2 speech intelligibility. Acoustic analyses, including PRAAT and Python scripts, were conducted on 405 speech samples (30 seconds each) from 102 L2 English speakers with a wide variety of backgrounds, proficiency levels, and intelligibility levels. The results indicate that highly intelligible speakers of English employ between 2-4 syllables per second and that higher or lower speeds are less intelligible. Silent pauses between 0.3 and 0.8 seconds were associated with the highest levels of intelligibility. Rhythm, as measured by Δ syllable length of all content syllables, was marginally associated with intelligibility. Finally, lexical stress accuracy did not interfere substantially with intelligibility until less than 70% of the polysyllabic words were incorrect. Findings inform the fields of first and second language research as well as language education and pathology.
UR - http://www.scopus.com/inward/record.url?scp=85176498846&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85176498846&partnerID=8YFLogxK
U2 - 10.1121/2.0001790
DO - 10.1121/2.0001790
M3 - Conference article
AN - SCOPUS:85176498846
SN - 1939-800X
VL - 50
JO - Proceedings of Meetings on Acoustics
JF - Proceedings of Meetings on Acoustics
IS - 1
M1 - 060008
T2 - 183rd Meeting of the Acoustical Society of America, ASA 2022
Y2 - 5 December 2022 through 9 December 2022
ER -