Examining the effect of high-frequency information on the classification of conversationally produced English fricativesa)

Viktor Kharlamov, Daniel Brenner, Benjamin V. Tucker

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

This study examines the role of frequencies above 8 kHz in the classification of conversational speech fricatives [f, v, θ, ð, s, z, ʃ, ʒ, h] in random forest modeling. Prior research has mostly focused on spectral measures for fricative categorization using frequency information below 8 kHz. The contribution of higher frequencies has received only limited attention, especially for non-laboratory speech. In the present study, we use a corpus of sociolinguistic interview recordings from Western Canadian English sampled at 44.1 and 16 kHz. For both sampling rates, we analyze spectral measures obtained using Fourier analysis and the multitaper method, and we also compare models without and with amplitudinal measures. Results show that while frequency information above 8 kHz does not improve classification accuracy in random forest analyses, inclusion of such frequencies can affect the relative importance of specific measures. This includes a decreased contribution of center of gravity and an increased contribution of spectral standard deviation for the higher sampling rate. We also find no major differences in classification accuracy between Fourier and multitaper measures. The inclusion of power measures improves model accuracy but does not change the overall importance of spectral measures.

Original languageEnglish (US)
Pages (from-to)1896-1902
Number of pages7
JournalJournal of the Acoustical Society of America
Volume154
Issue number3
DOIs
StatePublished - Sep 1 2023

ASJC Scopus subject areas

  • Arts and Humanities (miscellaneous)
  • Acoustics and Ultrasonics

Fingerprint

Dive into the research topics of 'Examining the effect of high-frequency information on the classification of conversationally produced English fricativesa)'. Together they form a unique fingerprint.

Cite this