An ensemble of grapheme and phoneme-based models for automatic English to Kannada back-transliteration Online publication date: Fri, 26-Feb-2021
by B.S. Sowmya Lakshmi; B.R. Shambhavi
International Journal of Intelligence and Sustainable Computing (IJISC), Vol. 1, No. 2, 2021
Abstract: The task of mapping graphemes or phonemes of one language into phoneme approximations of another language is known as machine transliteration. In this paper, three machine transliteration approaches, 'grapheme-based model', 'phoneme-based model' and 'hybrid model' have been proposed to achieve back transliteration of Romanised Kannada word to its native script Kannada, a resource poor language. A bilingual corpus of around 3 lakh words is built, which comprises of pairs of Romanised Kannada word with its corresponding word in Kannada script. The paradigms are assessed with 3,000 Romanised Kannada test words. Hybrid model achieved better accuracy of 85.93% when compared with other two models.
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Intelligence and Sustainable Computing (IJISC):
Login with your Inderscience username and password:
Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.
If you still need assistance, please email subs@inderscience.com