Műegyetemi Digitális Archivum
    • magyar
    • English
  • English 
    • magyar
    • English
  • Login
View Item 
  •   DSpace Home
  • 1. Tudományos közlemények, publikációk
  • Konferenciák gyűjteményei
  • 1st Workshop on Intelligent Infocommunication Networks, Systems and Services
  • View Item
  •   DSpace Home
  • 1. Tudományos közlemények, publikációk
  • Konferenciák gyűjteményei
  • 1st Workshop on Intelligent Infocommunication Networks, Systems and Services
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

A Comparison of Data Augmentation Methods on Ultrasound Tongue Images for Articulatory- to-Acoustic Mapping towards Silent Speech Interfaces

Thumbnail
View/Open
WINS2023-011.pdf (2.643Mb)
Metadata
Show full item record
Link to refer to this document:
10.3311/WINS2023-011
Collections
  • 1st Workshop on Intelligent Infocommunication Networks, Systems and Services [19]
Abstract
Silent Speech Interfaces (SSI), being a subfield of speech technology, break the limitations of automatic speech recognition when acoustic signals cannot be produced or clearly captured. SSI focuses on the articulation process of speech production in order to map articulatory data into acoustics. Ultrasound tongue imaging (UTI), a non-invasive, clinically safe technique to view the shape, position, and movements of the tongue, has recently become popular in the process of collecting articulatory data of the tongue movement. It has already been shown that data augmentation can be helpful for solving the overfitting problem and improving the generalization ability of deep neural networks. In this paper, we discuss the preliminary implementation and comparison of data augmentation methods on Azerbaijani ultrasound and speech recordings that has been recorded by us. These strategies include consecutive and intermittent time masking, sinusoidal noise injection, and random scaling. We explore the generation of new data samples using the provided methods on the dataset. We use mean-squared error validation loss as an evaluation metric to measure the performance of all the above data augmentation methods.
Title
A Comparison of Data Augmentation Methods on Ultrasound Tongue Images for Articulatory- to-Acoustic Mapping towards Silent Speech Interfaces
Author
Ibrahimov, Ibrahim
Gosztolya, Gábor
Csapó, Tamás Gábor
Date of issue
2023
Access level
Open access
Copyright owner
Szerző
Conference title
1st Workshop on Intelligent Infocommunication Networks, Systems and Services (WI2NS2)
Conference place
Budapest
Conference date
2023.02.07
Language
en
Page
59 - 64
Subject
data augmentation, silent speech interfaces, ultrasound tongue imaging, speech technology
Version
Post print
Identifiers
DOI: 10.3311/WINS2023-011
Title of the container document
1st Workshop on Intelligent Infocommunication Networks, Systems and Services
ISBN, e-ISBN
978-963-421-902-6
Document type
Konferenciaközlemény
Document genre
Konferenciacikk
University
Budapest University of Technology and Economics
Faculty
Faculty of Electrical Engineering and Informatics

Content by
Theme by 
Atmire NV
DSpace software copyright © 2002-2016  DuraSpace
Contact Us | Send Feedback

Content by
DSpace software copyright © 2002-2016  DuraSpace
Contact Us | Send Feedback
Theme by 
Atmire NV
 

 

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

LoginRegister

Content by
Theme by 
Atmire NV
DSpace software copyright © 2002-2016  DuraSpace
Contact Us | Send Feedback

Content by
DSpace software copyright © 2002-2016  DuraSpace
Contact Us | Send Feedback
Theme by 
Atmire NV