Türkçe konuşma tanıma için kütüphane oluşturma çalışması ve konuşma tanıma uygulaması

Kestepe, Sedat

Türkçe konuşma tanıma için kütüphane oluşturma çalışması ve konuşma tanıma uygulaması

dc.contributor.advisor	Kasapbaşı, Mustafa Cem
dc.contributor.author	Kestepe, Sedat
dc.date.accessioned	2024-10-10T18:25:22Z
dc.date.available	2024-10-10T18:25:22Z
dc.date.issued	2022
dc.department	Enstitüler, Fen Bilimleri Enstitüsü, Bilgisayar Mühendisliği Ana Bilim Dalı	en_US
dc.description	Fen Bilimleri Enstitüsü, Bilgisayar Mühendisliği Ana Bilim Dalı	en_US
dc.description.abstract	Günümüz dünyasında günlük yaşamla entegre olan Otomatik Konuşma Tanıma (ASR) sistemleri oldukça yaygındır. Ancak, Konuşma Tanıma'daki ilerlemelerin hızı tüm diller için aynı değildir. ASR, İngilizce dili için bir gereklilik olsa da, Türkçe gibi diğer diller için daha fazla gelişme bekleyen bir alandır. Sondan eklemeli bir dil olduğu için diğerlerine göre türetilebilecek kelime sayısında kesin bir sınırlama yoktur. Ayrıca Türkçe mevcut konuşma kütüphanelerinin sayısında da bir eksiklik söz konusudur. Bu çalışmada Türkçe Otomatik Konuşma Tanıma uygulamalarının önündeki temel engellerden biri olan Türkçe konuşma kütüphanesine duyulan ihtiyaca katkı sağlanması amaçlanmıştır. Türkçe içerik sağlayıcılar tarafından sıklıkla kullanılan bir internet platformundan videolar kullanılarak yeni bir Türkçe konuşma kütüphanesi oluşturulması için bir çalışma yapılmıştır. Ardından, forced alignment ve speech detection teknikleri kullanılarak konuşma kütüphanesinin kalitesi artırıldı. Ortaya çıkan konuşma kütüphanesi en gelişmiş Konuşma Tanıma teknikleri ile test edildi ve sonuç başarım oranları karşılaştırıldı. Çalışmanın sonuçları umut verici görünmektedir ve daha sonraki çalışmalarda kullanılacaklar.	en_US
dc.description.abstract	In today's world, Automatic Speech Recognition (ASR) systems which are integrated with daily life are quite common. However, the speed of advancements in Speech Recognition is not the same for all languages. While ASR is a requirement for the English language, it is still an area that expects more advancements for other languages like Turkish. Since it is an agglutinative language, compared to others, there are no certain limits on the number of words that can be derived. Moreover, there is a lacking of enough Turkish corpus as well. In this study, it is aimed to contribute to the need for Turkish speech corpus which is one of the main obstacles to Turkish Automatic Speech Recognition applications. A study is carried out for the construction of a new Turkish speech corpus using videos from an internet platform that is often used by Turkish content providers. Then the quality of the corpus was increased using forced alignment and speech detection techniques. The resulting corpus was tested and compared against the state-of-art Speech recognition techniques. The results of the study look promising and they are going to be used in further studies.	en_US
dc.identifier.endpage	30	en_US
dc.identifier.startpage	1	en_US
dc.identifier.uri	https://tez.yok.gov.tr/UlusalTezMerkezi/TezGoster?key=r4I1HnmXxFQovUpyAyUmxOSQyd-mUa66cae5PdH7EVNPbYFaPSqIBJmpIeO4I-nS
dc.identifier.uri	https://hdl.handle.net/11467/7569
dc.identifier.yoktezid	784684	en_US
dc.language.iso	tr	en_US
dc.publisher	İstanbul Ticaret Üniversitesi	en_US
dc.relation.publicationcategory	Tez	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.snmz	2024_Tez	en_US
dc.subject	Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol	en_US
dc.subject	Computer Engineering and Computer Science and Control	en_US
dc.title	Türkçe konuşma tanıma için kütüphane oluşturma çalışması ve konuşma tanıma uygulaması	en_US
dc.title.alternative	A corpus construction study for Turkish automatic speech recognition	en_US
dc.type	Master Thesis	en_US

Koleksiyon

Fen Bilimleri Enstitüsü Tez Koleksiyonu

Türkçe konuşma tanıma için kütüphane oluşturma çalışması ve konuşma tanıma uygulaması

Dosyalar

Koleksiyon