Műegyetemi Digitális Archívum

Conformer-based Automatic Speech Recognition for Arabic Dialects

Hassairi, Amin
Mihajlik, Péter
2025-02-20T13:51:27Z
2025-02-20T13:51:27Z
2025

Abstract

Automatic Speech Recognition has shown a significant upward trend in recent years. This paper investigates an ASR system for the Arabic language, developed using the Conformer-CTC character-based model within the NeMo framework. The system leverages the latest deep learning techniques, focusing on the conformer architecture combined with Connectionist Temporal Classification for sequence-to-sequence learning. The model is supervised, using labeled training data to map the input audio to text. The Mozilla Common Voice 11.0 dataset, which offers diverse spoken Arabic samples, is used for training. This paper details the model training process, including configuration setup, data processing, and optimization strategies. The performance of the model is evaluated, offering insights into the challenges and effectiveness of the Conformer-CTC character-based model for Arabic speech recognition tasks.

http://hdl.handle.net/10890/58910
en
Conformer-based Automatic Speech Recognition for Arabic Dialects
Könyvfejezet
Open access
Szerző
2025-02-03
Budapest
3rd Workshop on Intelligent Infocommunication Networks, Systems and Services
2025-02-20
978-963-421-982-8
Budapest University of Technology and Economics
Budapest
3rd Workshop on Intelligent Infocommunication Networks, Systems and Services
Post print
Faculty of Electrical Engineering and Informatics
21
10.3311/WINS2025-004
26
ASR
Conformer-CTC
Character-based
Arabic language
Deep Learning
Konferenciacikk
Budapest University of Technology and Economics

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
WINS_04_d.pdf
Size:
790.43 KB
Format:
Adobe Portable Document Format
Description:
WINS_04_d.pdf