Case study on well-known topic modeling methods for document classification
dc.authorid | 0000-0002-1941-6693 | en_US |
dc.contributor.author | Özdemirci, Süleyman | |
dc.contributor.author | Turan, Metin | |
dc.date.accessioned | 2021-12-27T07:33:26Z | |
dc.date.available | 2021-12-27T07:33:26Z | |
dc.date.issued | 2021 | en_US |
dc.department | Fakülteler, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü | en_US |
dc.description.abstract | Topic modeling has numerous applications like text categorization, topic clustering, document tagging, feature extraction on wide document collections. In this study, practical exploration method of topic modeling of Latent Dirichlet Allocation, transformers based machine learning method Bidirectional Encoder Representations from Transformers and Term Frequency — Inverse Document Frequency method were applied to the document set separately. It includes sport and education articles collected from internet by graduate students, 801 number totally. The purpose of this study is to observe which method best suits to the topic modeling and if possible in order to increase the accuracy rate via ensemble of these methods. As a result of this study, it was observed that even it has some disadvantages, BERT classified the documents with the correct topic with an average of %92.6 success ratio, overwhelming the others. | en_US |
dc.identifier.endpage | 1309 | en_US |
dc.identifier.isbn | 978-1-7281-8501-9 | |
dc.identifier.scopus | 2-s2.0-85102556721 | en_US |
dc.identifier.scopusquality | N/A | en_US |
dc.identifier.startpage | 1304 | en_US |
dc.identifier.uri | https://hdl.handle.net/11467/5139 | |
dc.identifier.wos | WOS:000722293800220 | en_US |
dc.identifier.wosquality | N/A | en_US |
dc.indekslendigikaynak | Web of Science | en_US |
dc.indekslendigikaynak | Scopus | en_US |
dc.language.iso | en | en_US |
dc.publisher | IEEE | en_US |
dc.relation.ispartof | Proceedings of the Sixth International Conference on Inventive Computation Technologies [ICICT 2021] | en_US |
dc.relation.publicationcategory | Konferans Öğesi - Uluslararası - İdari Personel ve Öğrenci | en_US |
dc.rights | info:eu-repo/semantics/embargoedAccess | en_US |
dc.subject | Classification | en_US |
dc.subject | Topic modeling | en_US |
dc.subject | LDA | en_US |
dc.subject | BERT | en_US |
dc.subject | TF-IDF | en_US |
dc.title | Case study on well-known topic modeling methods for document classification | en_US |
dc.type | Conference Object | en_US |
Dosyalar
Orijinal paket
1 - 1 / 1
Küçük Resim Yok
- İsim:
- Case_Study_on_well-known_Topic_Modeling_Methods_for_Document_Classification.pdf
- Boyut:
- 1.41 MB
- Biçim:
- Adobe Portable Document Format
- Açıklama:
Lisans paketi
1 - 1 / 1
Küçük Resim Yok
- İsim:
- license.txt
- Boyut:
- 1.56 KB
- Biçim:
- Item-specific license agreed upon to submission
- Açıklama: