Multilingual and crosslingual acoustic modelling for automatic speech recognition

Frank Diehl

Author: Frank Diehl
Thesis supervisor: María Asunción Moreno Bilbao
Defended: at the Universitat Politècnica de Catalunya (UPC) (Spain) in 2007
Language: Spanish
Thesis committee: José Bernardo Mariño Acebal (chair), Enric Monte Moreno (secretary), Zdravco Kacic (member), Carmen García Mateo (member), Daniel Tapias Merino (member)
Subjects:
- Mathematics
  - Probability
    - Markov processes
    - Stochastic processes
  - Statistics
    - Statistical inference techniques
Full text not available
Abstract
- This thesis studies the definition, implementation and validation of multilingual and crosslingual acoustic models for automatic speech recognition (ASR). The acoustic model constitutes one of the basic building blocks of an automatic speech recognition system. In today's state-of-the-art ASR systems it is common practice to extract the parameters of the acoustic model from an acoustic template database. It has been shown that this methodology results in high-performance ASR systems. However, a principal drawback of this procedure is its dependency on suitable speech databases to train the models, and the inevitable dependency of the final target system on the language used for training the models. That is, during training, an acoustic model can hardly be built if no speech material, or only a limited amount of it, is available for the target language, and, during recognition, the ASR system is fixed to the language that was used to train it.
  
  Multilingual and crosslingual acoustic modelling is seen as a potential way to overcome these drawbacks, at least in part. The basic idea consists in sharing acoustic knowledge between languages, or in reusing acoustic knowledge already available from one or more source languages for a target language.
  
  This thesis therefore focuses on two major aspects of multilingual and crosslingual acoustic modelling: acoustic model definition and acoustic model adaptation.
  
  In the case of acoustic model definition, the emphasis lies on the definition of suitable linguistic features. Linguistic features constitute the input domain of the phonetic-acoustic decision tree that is used to define context-dependent acoustic models. Usually such features are derived from expert knowledge by a linguist who is familiar with both the source and the target language. However, linguistic experts who are familiar with all the languages concerned might be hard to find. Thus, in a multilingual but also in the crosslingual enviro