Ayuda
Ir al contenido

Multilingual and crosslingual acoustic modelling for automatic speech recognition

  • Autores: Frank Diehl
  • Directores de la Tesis: María Asunción Moreno Bilbao (dir. tes.)
  • Lectura: En la Universitat Politècnica de Catalunya (UPC) ( España ) en 2007
  • Idioma: español
  • Tribunal Calificador de la Tesis: José Bernardo Mariño Acebal (presid.), Enric Monte Moreno (secret.), Zdravco Kacic (voc.), Carmen García Mateo (voc.), Daniel Tapias Merino (voc.)
  • Materias:
  • Texto completo no disponible (Saber más ...)
  • Resumen
    • This thesis studies the definition, implementation and validation of multilingual and crosslingual acoustic models for automatic speech recognition (ASR), The acoustic model constitutes one of the basic building blocks of an automatic speech recognition system. In today's state-of-the-art ASR systems it is common practise to extract the parameters of the acoustic model from a acoustic template database. It has been shown that this methodology results in high performance ASR systems. However, a principal drawback of this procedure consist in its dependency on suitable speech databases to train the models, and the inevitable dependency of the final target system on the language used for training the models. That is, in case of acoustic model training, a acoustic model can hardly be build if no or only a limited amount of speech material of a target language is available, and, during recognition, the ASR system is fixed to the language which was used to train it.

      Multilingual and crosslingual acoustic modelling is seen as a potential way to overcome these drawbacks at least partly. The basic idea consists in sharing acoustic knowledge between languages, or to reuse already available acoustic knowledge from one or more source languages for a target language.

      The thesis on hand thus focuses on two major aspects of multilingual and crosslingual acoustic modelling: acoustic model definition and acoustic model adaptation.

      In case of acoustic model definition the stress lies on the definition of suitable linguistic features. Linguistic features constitute the input domain of the phonetic-acoustic decision tree which is used to define context dependent acoustic models. Usually such features are derived knowledge-based by a linguistic expert which is familiar with both, the source and the target language. However, linguistic experts which are familiar with all concerned languages might be hard to find. Thus, in a multilingual but also in the crosslingual enviro


Fundación Dialnet

Dialnet Plus

  • Más información sobre Dialnet Plus

Opciones de compartir

Opciones de entorno