Identification of speech transients using variable frame rate analysis and wavelet packets

Daniel M. Rasetshwane, J. Robert Boston, Ching Chung Li

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Scopus citations

Abstract

Speech transients are important cues for identifying and discriminating speech sounds. Yoo et al and Tantibundhit et al were successful in identifying speech transients and, emphasizing them, improving the intelligibility of speech in noise [3] [4]. However, their methods are computationally intensive and unsuitable for real-time applications. This paper presents a method to identify and emphasize speech transients that combines subband decomposition by the wavelet packet transform with variable frame rate (VFR) analysis and unvoiced consonant detection. The VFR analysis is applied to each wavelet packet to define a transitivity function that describes the extent to which the wavelet coefficients of that packet are changing. Unvoiced consonant detection is used to identify unvoiced consonant intervals and the transitivity function is amplified during these intervals. The wavelet coefficients are multiplied by the transitivity function for that packet, amplifying the coefficients localized at times when they are changing and attenuating coefficients at times when they are steady. Inverse transform of the modified wavelet packet coefficients produces a signal corresponding to speech transients similar to the transients identified by Yoo et al and Tantibundhit et al. A preliminary implementation of the algorithm runs more efficiently.

Original languageEnglish (US)
Title of host publication28th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS'06
Pages1727-1730
Number of pages4
DOIs
StatePublished - 2006
Event28th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS'06 - New York, NY, United States
Duration: Aug 30 2006Sep 3 2006

Publication series

NameAnnual International Conference of the IEEE Engineering in Medicine and Biology - Proceedings
ISSN (Print)0589-1019

Conference

Conference28th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS'06
CountryUnited States
CityNew York, NY
Period8/30/069/3/06

ASJC Scopus subject areas

  • Signal Processing
  • Biomedical Engineering
  • Computer Vision and Pattern Recognition
  • Health Informatics

Fingerprint Dive into the research topics of 'Identification of speech transients using variable frame rate analysis and wavelet packets'. Together they form a unique fingerprint.

  • Cite this

    Rasetshwane, D. M., Boston, J. R., & Li, C. C. (2006). Identification of speech transients using variable frame rate analysis and wavelet packets. In 28th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS'06 (pp. 1727-1730). [4030147] (Annual International Conference of the IEEE Engineering in Medicine and Biology - Proceedings). https://doi.org/10.1109/IEMBS.2006.260720