A Quantitative Based Research on the Production of Image Captioning

Date

2023

Publisher

Ismail Saritas

Access Rights

info:eu-repo/semantics/restrictedAccess

Abstract

Modern systems can discern the context of an image and annotate it with relevant captions by combining computer vision and natural language processing, a technique referred to as 'image caption production.' This article aims to review and analyze image captioning techniques that have evolved over the past few decades, including the Attention Model, Region-Level Caption Detection, Semantic Content-Based Models, and Multimodal Models, among others. The techniques are compared on various datasets using criteria such as precision, recall, F1 score, and accuracy. The article offers a comprehensive structural examination of contemporary image captioning methods. Researchers can use the insights from this analysis to develop improved approaches that avoid the shortcomings of older methods while retaining their strengths.

Keywords

Attention Model; Image Caption; Multimodal Model; Region Level Captions; Semantic Content

Source

International Journal of Intelligent Systems and Applications in Engineering

Scopus Q Value

N/A

Volume

11

Issue

4

Citation