Derin öğrenme ile Türkçe ses işaretlerinden rakam tanıma
Yükleniyor...
Tarih
2024-02-14
Yazarlar
Dergi Başlığı
Dergi ISSN
Cilt Başlığı
Yayıncı
Batman Üniversitesi Lisansüstü Eğitim Enstitüsü
Erişim Hakkı
info:eu-repo/semantics/openAccess
Özet
In today's rapidly advancing technological landscape, speech-based recognition systems play a crucial role in various fields. Sound, beyond being a fundamental means of communication among indi-viduals, serves as a critical factor in applications such as automation, security, and user experience. The effective utilization of sound in digital environments is made possible, particularly through the develop-ment of speech recognition technologies. These technologies have the capability to analyze sound sig-nals, comprehend spoken language, and perform various tasks. Digital digit classification, particularly, constitutes a significant application area for these speech recognition technologies. Digital digit classifica-tion involves the development of systems that can accurately recognize and distinguish digital digits obtained from sound signals. This has a wide range of potential applications, from telecommunication systems to voice command systems and from speech-based security applications to various industrial and commercial applications. In this context, the importance of speech-based digital digit classification spans from everyday life to industrial applications. This study aims to contribute to technological ad-vancements in this field by evaluating different machine learning models used in the process of digital digit classification from sound signals. SVM, LSTM, and CNN models were assessed for digital digit clas-sification from sound signals, with the CNN model achieving the highest success rate at 81.94% in the 80-20 training-test ratio. The CNN model demonstrated high performance, particularly achieving a 98.2% success rate for the digit "Six (6)." Different success rates were observed among other digits, with high performance for "One (1)" and "Nine (9)" but lower success rates for "Three (3)," "Four (4)," and "Eight (8)." In the scope of the study, evaluations conducted under different training-test ratios revealed that the LSTM model exhibited the highest success at a 50-50 training-test ratio. SVM achieved its highest suc-cess rate at an 80-20 training-test ratio. However, overall, deep learning models, specifically LSTM and CNN, outperformed SVM, indicating that these numerical results highlight the effective use of deep learn-ing models, especially in sound-based digital recognition applications.
Açıklama
Anahtar Kelimeler
CNN, CWT, LSTM, Ses Dijit Sınıflandırma, Sound Digit Classification
Kaynak
WoS Q Değeri
Scopus Q Değeri
Cilt
Sayı
Künye
Eroğlu, A. (2023). Derin öğrenme ile Türkçe ses işaretlerinden rakam tanıma. (Yayınlanmamış Yüksek Lisans Tezi). Batman Üniversitesi Lisansüstü Eğitim Enstitüsü, Batman.