A Quantitative Based Research on the Production of Image Captioning

Ajibade, Samuel-Soma M.; Zaidi, Abdelhamid; Maidin, Siti Sarah; Ishak, Wan Hussain Wan; Adetunla, Adedotun

A Quantitative Based Research on the Production of Image Captioning

Tarih

2023

Yazarlar

Ajibade, Samuel-Soma M.

Zaidi, Abdelhamid

Maidin, Siti Sarah

Ishak, Wan Hussain Wan

Adetunla, Adedotun

Yayıncı

Ismail Saritas

Erişim Hakkı

info:eu-repo/semantics/restrictedAccess

Özet

It is widely recognized that modern systems can discern the context of an image and enrich it with relevant captions through the fusion of computer vision and natural language processing, a technique referred to as 'image caption production.' This article aims to shed light on and analyze various image captioning techniques that have evolved over the past few decades, including the Attention Model, Region-Level Caption Detection, Semantic Content-Based Models, Multimodal Models, and more. The evaluation of these techniques employs diverse criteria such as Precision Rate, Recall Rate, F1 Score, Accuracy Rate, among others, while employing various datasets for comparison. This article offers a comprehensive structural examination of contemporary image captioning methods. Researchers can leverage the insights from this analysis to develop innovative, improved approaches that sidestep the shortcomings of older methods while retaining their beneficial aspects.

Anahtar Kelimeler

Attention Model; Image Caption; Multimodal Model; Region Level Captions; Semantic Content

Kaynak

International Journal of Intelligent Systems and Applications in Engineering

Scopus Q Değeri

N/A

Cilt

11

Sayı

4

Bağlantı

https://hdl.handle.net/11467/7039

Koleksiyon

Scopus İndeksli Yayınlar Koleksiyonu

Detaylı Öğe Kaydı

A Quantitative Based Research on the Production of Image Captioning

Tarih

Yazarlar

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Erişim Hakkı

Özet

Açıklama

Anahtar Kelimeler

Kaynak

WoS Q Değeri

Scopus Q Değeri

Cilt

Sayı

Künye

Bağlantı

Koleksiyon