Bilimsel yayınlardan anahtar kelime çıkarımı

Birdevrim, Ahmet Sina

Bilimsel yayınlardan anahtar kelime çıkarımı

Date

2018

Authors

Birdevrim, Ahmet Sina

Publisher

İstanbul Ticaret Üniversitesi

Access Rights

info:eu-repo/semantics/openAccess

Abstract

Akademik alanlarda yapılan çalışmalar okuyucuya, bilim ve teknoloji alanındaki ilerlemeler hakkında bilgi vermeyi amaçlamaktadır. Öte yandan okuyucuların bu bilimsel çalışmalarda aramakta oldukları bilgilere doğru ve hızlı bir şekilde ulaşabilmeleri gerekmektedir. Bu tür ihtiyaçlardan dolayı makale gibi belgelerde anahtar kelimelerin kullanılması, okuyucunun aranan bilgiye kolay ulaşmasını sağlar. Anahtar kelimeler çok sayıda metin tabanlı materyalin analizini kolaylaştırdığı gibi, istenen bilgiye hızlı ve kolay erişimi de sağlar. Bu verileri çıkarmak için otomatik anahtar kelime çıkarma algoritmaları kullanılabilir. Otomatik anahtar kelime çıkarma algoritmaları, belirli bir metinde yer alan en açık kelimeleri veya cümleleri ayıklamayı amaçlar. Bu amaç için en sık kullanılan algoritmalar TF-IDF ve RAKE dır. Fakat bu yaklaşımlar, pek çok metinde kullanılan aynı anlamı taşıyan farklı kelimeleri göz önüne alamamaktadır. Bu tezde geliştirilen algoritma ile verilen bir metindeki eşanlamlı kelimler tek bir çatı altında toplanarak bu kelimelerin sıklığı arttırılarak algoritmaların başarım oranları iyileştirmeye çalışılmıştır. Uygulanan bu yöntem diğer algoritmalar karşılaştırılmıştır ve sonuçlar ortaya konulmuştur.
The academic studies and publications aim to give insights to the reader about the progress in Science & Technology. On the other hand, readers should be able to reach the information they seek both promptly and accurately. For this reason, Keywords are used as search tools in obtaining the desired information from the articles and publications. Keywords facilitate the analysis of a large number of textual materials as well as provides rapid and easy access to desirable information. Automatic keyword algorithms, the method based on extracting the most explicit words or phrases in a given text, can be used to extract this data. TF-IDF and RAKE are the most commonly used algorithms for this purpose. The working principle of these methods do not take into account various form of words that have the same meaning. This study introduces a further algorithm that gathers synonym words in a text under one cluster and hence provides these words with increased frequencies. The proposed algorithm is compared with the other algorithms and some implications are presented.

Description

Fen Bilimleri Enstitüsü, Bilgisayar Mühendisliği Ana Bilim Dalı

Keywords

Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol, Computer Engineering and Computer Science and Control

URI

https://tez.yok.gov.tr/UlusalTezMerkezi/TezGoster?key=fS4sqEZr79C_n60Rk6MjFWT0SLn9cQ7GY20C85YUalnWQZJI-l2DSpqqAu7Qpfqs
https://hdl.handle.net/11467/7556

Collections

Fen Bilimleri Enstitüsü Tez Koleksiyonu

Full item page

Bilimsel yayınlardan anahtar kelime çıkarımı

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Access Rights

Abstract

Description

Keywords

Journal or Series

WoS Q Value

Scopus Q Value

Volume

Issue

Citation

URI

Collections