Um Sistema de Conversão Texto-Fala Para o Português Falado no Brasil

  • Flávio Olmos Simões
  • Fábio Violaro
  • Plínio A. Barbosa
  • Eleonora C. Albano
Keywords: sintese de fala, conversão texto-fala, síntese da voz, sistemas de processarnento da fala, interação homem-máquina, modelos linguísticos.

Abstract

Text-to-speech synthesis is the subject treated in this work. Most of the difficulties related to this task are discussed, and an implementation of a Brazilian Portuguese text-to-speech concatenative synthesis system is presented. The system uses an inventory of 2,450 pre-recorded speech segments, and is able to employ both TD~PSOLA and hybrid synthesis techniques. The use of carefully chosen linguistic criteria, mainly during phonetic transcription and also during the creation of the speech segments inventory, is the main contribution of this work. The phonetic notation employed distinguishes two kinds of phonetic segments (full and reduced), on the basis of the extension of coarticulation phenomena. The main criterion underlying the building of the speech segments inventory was to preserve reduced segments and vowel clusters. Nevertheless, some of the reduced segments were split, aiming at reducing the size of the inventory. Once again, in this case, specific linguistic criteria were employed, in order to minimize spectral discontinuities after concatenation.
Published
12-02-2018
How to Cite
Simões, F., Violaro, F., Barbosa, P., & Albano, E. (2018). Um Sistema de Conversão Texto-Fala Para o Português Falado no Brasil. Journal of Communication and Information Systems, 15(2). https://doi.org/10.14209/jcis.2000.8
Section
Regular Papers