Design and Implementation of Amharic Text-to-Speech System for Visual-Impaired and Blind Students

No Thumbnail Available
Date
2024-10-28
Authors
Walelign S.
Zewdie, E.
Yaregal, A
Shegaw, A.M.
Mastewal, M.
Journal Title
Journal ISSN
Volume Title
Publisher
MIRG
Abstract
This project focuses on the development of an advanced Amharic Text-to-Speech (TTS) system for visually impaired and blind students, with a primary emphasis on enhancing accessibility and usability. The comprehensive methodology encompasses Corpus Collection and Preprocessing, involving the assembly of a diverse Amharic language corpus and its meticulous preprocessing. Phonetic and Prosodic Modeling techniques are employed to capture the nuances of Amharic pronunciation. Additionally, the integration of Tacotron 2 and WaveGlow models, along with the training process, is detailed. The project extends its impact through the seamless integration of the TTS system into a mobile application, with a user-friendly interface designed specifically for visually impaired users. The anticipated outcome is a versatile and inclusive platform that empowers users to convert written text into spoken Amharic effortlessly. The success of the project is evaluated through extensive user testing, ensuring accessibility, usability, and naturalness in the synthesized speech for the targeted user group.
Description
Scholarly article
Keywords
Citation
Tewabe W., Mossie Z., Assabie Y., Anagaw S. and Mekonen M. (2023). “Design and Implementation of Amharic Text-to-Speech System for Visual-Impaired and Blind Students". In Proceedings of the International Conference on Artificial Intelligence and Robotics (MIRG-ICAIR 2023), pp. 39-45, MIRG
Collections