Robust Emotional Stressed Speech Detection Using Weighted Frequency Subbands

Hansen, John H. L.; Kim, Wooil; Rahurkar, Mandar; Ruzanski, Evan; Meyerhoff, James

doi:10.1155/2011/906789

Research Article
Open access
Published: 07 March 2011

EURASIP Journal on Advances in Signal Processing volume 2011, Article number: 906789 (2011)

Abstract

The problem of detecting psychological stress from speech is challenging due to differences in how speakers convey stress. Changes in speech production due to speaker state are not linearly dependent on changes in stress. Research is further complicated by the existence of different stress types and the lack of metrics capable of discriminating stress levels. This study addresses the problem of automatic detection of speech under stress using a previously developed feature extraction scheme based on the Teager Energy Operator (TEO). To improve detection performance, (i) a selected sub-band frequency partitioned weighting scheme and (ii) a weighting scheme for all frequency bands are proposed. Using the traditional TEO-based feature vector with a closed-speaker Hidden Markov Model-trained stressed speech classifier, error rates of 22.5/13.0% for stress/neutral speech are obtained. With the new weighted sub-band detection scheme, closed-speaker error rates are reduced to 4.7/4.6% for stress/neutral detection, a relative error reduction of 79.1/64.6%, respectively. For the open-speaker case, stress/neutral speech detection error rates of 69.7/16.2% using traditional features are reduced to 13.1/4.0% (a relative 81.3/75.4% reduction) with the proposed automatic frequency sub-band weighting scheme. Finally, issues related to speaker dependent/independent scenarios, vowel duration, and mismatched vowel type on stress detection performance are discussed.
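
For readers unfamiliar with the two quantities the abstract relies on, the following is a minimal sketch and not the authors' feature extraction pipeline. It assumes the standard discrete-time form of the Teager Energy Operator, Psi[x(n)] = x(n)^2 - x(n-1) x(n+1), and the usual definition of relative error reduction, (baseline - new) / baseline. The function names `teager_energy` and `relative_error_reduction` are hypothetical and chosen only for illustration; the sub-band partitioning and weighting described in the paper are not reproduced here.

```python
import numpy as np


def teager_energy(x):
    """Discrete Teager Energy Operator: psi[n] = x[n]^2 - x[n-1] * x[n+1].

    Returns an array two samples shorter than the input, because the
    first and last samples lack a neighbour on one side.
    """
    x = np.asarray(x, dtype=float)
    return x[1:-1] ** 2 - x[:-2] * x[2:]


def relative_error_reduction(baseline, new):
    """Relative error reduction in percent: 100 * (baseline - new) / baseline."""
    return 100.0 * (baseline - new) / baseline


# For a pure sinusoid A*cos(W*n + phi), the TEO output is the constant
# A^2 * sin^2(W), which depends on both amplitude and frequency.
n = np.arange(200)
amplitude, omega = 2.0, 0.3
x = amplitude * np.cos(omega * n + 0.5)
psi = teager_energy(x)

# Closed-speaker error rates quoted in the abstract (stress / neutral).
closed_stress = relative_error_reduction(22.5, 4.7)
closed_neutral = relative_error_reduction(13.0, 4.6)
```

Applied to the closed-speaker error rates quoted above, the helper gives values that round to the stated 79.1% and 64.6% reductions.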

Publisher note

To access the full article, please see PDF.

Author information

Authors and Affiliations

Center for Robust Speech Systems (CRSS), Erik Jonsson School of Engineering and Computer Science, The University of Texas at Dallas, Richardson, TX, 75083-0688, USA
John H. L. Hansen, Wooil Kim, Mandar Rahurkar, Evan Ruzanski & James Meyerhoff

Corresponding author

Correspondence to John H. L. Hansen.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Hansen, J.H.L., Kim, W., Rahurkar, M. et al. Robust Emotional Stressed Speech Detection Using Weighted Frequency Subbands. EURASIP J. Adv. Signal Process. 2011, 906789 (2011). https://doi.org/10.1155/2011/906789

Received: 25 September 2010
Revised: 10 December 2010
Accepted: 10 February 2011
Published: 07 March 2011
DOI: https://doi.org/10.1155/2011/906789
