Neural Twins Talk & Alternative Calculations

التفاصيل البيبلوغرافية
العنوان: Neural Twins Talk & Alternative Calculations
المؤلفون: Zohourianshahzadi, Zanyar, Kalita, Jugal K.
المصدر: International Journal of Semantic Computing, 2021, 93-116
سنة النشر: 2021
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computer Vision and Pattern Recognition
الوصف: Inspired by how the human brain employs a higher number of neural pathways when describing a highly focused subject, we show that deep attentive models used for the main vision-language task of image captioning, could be extended to achieve better performance. Image captioning bridges a gap between computer vision and natural language processing. Automated image captioning is used as a tool to eliminate the need for human agent for creating descriptive captions for unseen images.Automated image captioning is challenging and yet interesting. One reason is that AI based systems capable of generating sentences that describe an input image could be used in a wide variety of tasks beyond generating captions for unseen images found on web or uploaded to social media. For example, in biology and medical sciences, these systems could provide researchers and physicians with a brief linguistic description of relevant images, potentially expediting their work.
Comment: This paper was published at World Scientific Journal, International Journal of Semantic Computing. This is a preprint version that was submitted to the journal before final publication. arXiv admin note: substantial text overlap with arXiv:2009.12524
نوع الوثيقة: Working Paper
DOI: 10.1142/S1793351X21500045
URL الوصول: http://arxiv.org/abs/2108.02807
رقم الأكسشن: edsarx.2108.02807
قاعدة البيانات: arXiv
الوصف
DOI:10.1142/S1793351X21500045