Cross-language phoneme mapping for phonetic search keyword spotting in continuous speech of under-resourced languages

Ella Tetariy, Yossi Bar-Yosef, Vered Silber-Varod, Michal Gishri, Ruthi Alon-Lavi, Vered Aharonson, Irit Opher, Ami Moyal

Abstract


As automatic speech recognition-based applications become increasingly common in a wide variety of market segments, thereis a growing need to support more languages. However, for many languages, the language resources needed to train speechrecognition engines are either limited or completely non-existent, and the process of acquiring or constructing new languageresources is both long and costly. This paper suggests a methodology that enables Phonetic Search Keyword Spotting to beimplemented in a large speech database of any given under-resourced language using cross-language phoneme mappings toanother language. The phoneme mapping enables a speech recognition engine from a sufficiently resourced and well-trainedsource language to be used for phoneme recognition in the new target language. The keyword search is then performed overa lattice of target language phonemes. Three cross-language phoneme mapping techniques are examined: knowledge-based,data-driven and phoneme recognition performance-based. The results suggest that Phonetic Search Keyword Spotting basedon the cross-language phoneme mapping approach proposed herein can serve as a quick initial solution for validating keywordspotting applications in new, under-resourced languages.

Full Text:

PDF


DOI: https://doi.org/10.5430/air.v4n2p72

Refbacks

  • There are currently no refbacks.


Artificial Intelligence Research

ISSN 1927-6974 (Print)   ISSN 1927-6982 (Online)

Copyright © Sciedu Press 
To make sure that you can receive messages from us, please add the 'Sciedupress.com' domain to your e-mail 'safe list'. If you do not receive e-mail in your 'inbox', check your 'bulk mail' or 'junk mail' folders.