20 Artículos

A Context Semantic Auxiliary Network for Image Captioning

Acceso

en línea

Jianying Li and Xiangjun Shao

Image captioning is a challenging task, which generates a sentence for a given image. The earlier captioning methods mainly decode the visual features to generate caption sentences for the image. However, the visual features lack the context semantic inf... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 14 Num: 0 Par: 7 Año: 2023

Bi-LS-AttM: A Bidirectional LSTM and Attention Mechanism Model for Improving Image Captioning

Acceso

en línea

Tian Xie, Weiping Ding, Jinbao Zhang, Xusen Wan and Jiehua Wang

The discipline of automatic image captioning represents an integration of two pivotal branches of artificial intelligence, namely computer vision (CV) and natural language processing (NLP). The principal functionality of this technology lies in transmuti... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 13 Num: 0 Par: 13 Año: 2023

Novel Object Captioning with Semantic Match from External Knowledge

Acceso

en línea

Sen Du, Hong Zhu, Guangfeng Lin, Dong Wang and Jing Shi

Automatically describing the content of an image is a challenging task that is on the edge between natural language and computer vision. The current image caption models can describe the objects that are frequently seen in the training set very well, but... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 13 Num: 0 Par: 13 Año: 2023

Im2Graph: A Weakly Supervised Approach for Generating Holistic Scene Graphs from Regional Dependencies

Acceso

en línea

Swarnendu Ghosh, Teresa Gonçalves and Nibaran Das

Conceptual representations of images involving descriptions of entities and their relations are often represented using scene graphs. Such scene graphs can express relational concepts by using sets of triplets ⟨subject—predicate&... ver más

Revista: Future Internet Formato: Electrónico

Tabla de contenido: Vol: 15 Num: 0 Par: 2 Año: 2023

Reconsidering Tourism Destination Images by Exploring Similarities between Travelogue Texts and Photographs

Acceso

en línea

Xin Zhang, Xiaoqian Lu, Xiaolan Zhou and Chaohai Shen

With the rise of user-generated content (UGC) and deep learning technology, more and more researchers construct and measure the tourism destination image (TDI) through online travelogues. However, due to the impact of COVID-19 prevention and control, the... ver más

Revista: ISPRS International Journal of Geo-Information Formato: Electrónico

Tabla de contenido: Vol: 11 Num: 0 Par: 11 Año: 2022

Leverage Boosting and Transformer on Text-Image Matching for Cheap Fakes Detection

Acceso

en línea

Tuan-Vinh La, Minh-Son Dao, Duy-Dong Le, Kim-Phung Thai, Quoc-Hung Nguyen and Thuy-Kieu Phan-Thi

The explosive growth of the social media community has increased many kinds of misinformation and is attracting tremendous attention from the research community. One of the most prevalent ways of misleading news is cheapfakes. Cheapfakes utilize non-AI t... ver más

Revista: Algorithms Formato: Electrónico

Tabla de contenido: Vol: 15 Num: 0 Par: 11 Año: 2022

Goal-Driven Visual Question Generation from Radiology Images

Acceso

en línea

Mourad Sarrouti, Asma Ben Abacha and Dina Demner-Fushman

Visual Question Generation (VQG) from images is a rising research topic in both fields of natural language processing and computer vision. Although there are some recent efforts towards generating questions from images in the open domain, the VQG task in... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 12 Num: 0 Par: 8 Año: 2021

Learn and Tell: Learning Priors for Image Caption Generation

Acceso

en línea

Pei Liu, Dezhong Peng and Ming Zhang

In this work, we propose a novel priors-based attention neural network (PANN) for image captioning, which aims at incorporating two kinds of priors, i.e., the probabilities being mentioned for local region proposals (PBM priors) and part-of-speech clues ... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 10 Num: 0 Par: 19 Año: 2020

Variational Autoencoder-Based Multiple Image Captioning Using a Caption Attention Map

Acceso

en línea

Boeun Kim, Saim Shin and Hyedong Jung

Image captioning is a promising research topic that is applicable to services that search for desired content in a large amount of video data and a situation explanation service for visually impaired people. Previous research on image captioning has been... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 9 Num: 0 Par: 13 Año: 2019

Dense Model for Automatic Image Description Generation with Game Theoretic Optimization

Acceso

en línea

Sreela S R and Sumam Mary Idicula

Due to the rapid growth of deep learning technologies, automatic image description generation is an interesting problem in computer vision and natural language generation. It helps to improve access to photo collections on social media and gives guidance... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 10 Num: 0 Par: 11 Año: 2019

« Anterior Página: 1 de 2 Siguiente »