Time-frequency voiced and unvoiced excitation models for harmonic speech systems

  • Miguel Arjona Ramírez University of São Paulo
Keywords: speech analysis, speech coding, sparse representations, modulation transform, time-frequency analysis, voiced-unvoiced decision

Abstract

Time-frequency voiced and unvoiced models are proposed for the excitation of a harmonic autoregressive wideband speech analysis-synthesis system. The time-frequency voiced excitation (TFVEX) model has low time resolution defined by the concentration of the excitation signal distribution in the modulation domain while the time-frequency unvoiced excitation (TFUNEX) model has cycle time discrimination with lower amplitude resolution and while the frequency resolution for both models is an octave. The speech reconstructed by the compound TFUVEX unvoiced-voiced model is rated above the speech degraded by a modulated noise reference unit (MNRU) at 25 dB in listening tests while yielding a parametric compression of over ten times.

Author Biography

Miguel Arjona Ramírez, University of São Paulo

Associate Professor

Department of Electronic Systems Engineering, Escola Politécnica

 

Published
01-12-2014
How to Cite
Arjona Ramírez, M. (2014). Time-frequency voiced and unvoiced excitation models for harmonic speech systems. Journal of Communication and Information Systems, 29(1). https://doi.org/10.14209/jcis.2014.6
Section
Regular Papers