Improving Naturalness of Neural-based TTS System Trained with Arabic Limited Data
| Sawalha, Layan | ||
| Alradhi, Mohammad | ||
| 2023-03-13T16:07:09Z | ||
| 2023-03-13T16:07:09Z | ||
| 2023 | ||
AbstractIn this paper, we investigated different approaches, a neural network speech synthesis system and a non-autoregressive text-to-speech (TTS) model. In the neural network speech synthesis, we showed how a baseline system based on Merlin is used for TTS synthesis to produce the most human-like voice; typically, it is only implemented with a front-end text processor and a WORLD vocoder. Here, we first adapted Continuous and Ahocoder vocoders; and then we investigated the effectiveness of each vocoder’s techniques to produce the highest quality speech. In the non-autoregressive TTS model, we implemented the state-of-the-results Fastspeech2 system, which provided high-quality speech synthesis in a timely manner without controllability and robustness problems. Here, we focused on integrating a different language but with limited data while maintaining its high-quality produced sounds. Through objective and subjective evaluations, we verify that our method can outperform the baseline system with full data. | ||
| http://hdl.handle.net/10890/40714 | ||
| en | ||
| Improving Naturalness of Neural-based TTS System Trained with Arabic Limited Data | ||
| Konferenciaközlemény | ||
| Open access | ||
| Szerző | ||
| 2023.02.07 | ||
| Budapest | ||
| 1st Workshop on Intelligent Infocommunication Networks, Systems and Services (WI2NS2) | ||
| 2023.02.07 | ||
| 978-963-421-902-6 | ||
| Budapest University of Technology and Economics | ||
| Online | ||
| 1st Workshop on Intelligent Infocommunication Networks, Systems and Services | ||
| Post print | ||
| Faculty of Electrical Engineering and Informatics | ||
| 71 | ||
| 10.3311/WINS2023-013 | ||
| 75 | ||
| Text-to-speech | ||
| TTS | ||
| Machine Learning | ||
| Deep Learning | ||
| Deep Neural Networks | ||
| Speech Synthesis | ||
| Konferenciacikk | ||
| Budapest University of Technology and Economics |
Files
Original bundle
- Name:
- WINS2023-013.pdf
- Size:
- 394.77 KB
- Format:
- Adobe Portable Document Format
- Description:
- WINS2023-013.pdf