|
|
|
Jianying Li and Xiangjun Shao
Image captioning is a challenging task, which generates a sentence for a given image. The earlier captioning methods mainly decode the visual features to generate caption sentences for the image. However, the visual features lack the context semantic inf...
ver más
|
|
|
|
|
|
|
Tian Xie, Weiping Ding, Jinbao Zhang, Xusen Wan and Jiehua Wang
The discipline of automatic image captioning represents an integration of two pivotal branches of artificial intelligence, namely computer vision (CV) and natural language processing (NLP). The principal functionality of this technology lies in transmuti...
ver más
|
|
|
|
|
|
|
Sen Du, Hong Zhu, Guangfeng Lin, Dong Wang and Jing Shi
Automatically describing the content of an image is a challenging task that is on the edge between natural language and computer vision. The current image caption models can describe the objects that are frequently seen in the training set very well, but...
ver más
|
|
|
|
|
|
|
Swarnendu Ghosh, Teresa Gonçalves and Nibaran Das
Conceptual representations of images involving descriptions of entities and their relations are often represented using scene graphs. Such scene graphs can express relational concepts by using sets of triplets ⟨subject—predicate&...
ver más
|
|
|
|
|
|
|
Xin Zhang, Xiaoqian Lu, Xiaolan Zhou and Chaohai Shen
With the rise of user-generated content (UGC) and deep learning technology, more and more researchers construct and measure the tourism destination image (TDI) through online travelogues. However, due to the impact of COVID-19 prevention and control, the...
ver más
|
|
|
|
|
|
|
Tuan-Vinh La, Minh-Son Dao, Duy-Dong Le, Kim-Phung Thai, Quoc-Hung Nguyen and Thuy-Kieu Phan-Thi
The explosive growth of the social media community has increased many kinds of misinformation and is attracting tremendous attention from the research community. One of the most prevalent ways of misleading news is cheapfakes. Cheapfakes utilize non-AI t...
ver más
|
|
|
|
|
|
|
Mourad Sarrouti, Asma Ben Abacha and Dina Demner-Fushman
Visual Question Generation (VQG) from images is a rising research topic in both fields of natural language processing and computer vision. Although there are some recent efforts towards generating questions from images in the open domain, the VQG task in...
ver más
|
|
|
|
|
|
|
Pei Liu, Dezhong Peng and Ming Zhang
In this work, we propose a novel priors-based attention neural network (PANN) for image captioning, which aims at incorporating two kinds of priors, i.e., the probabilities being mentioned for local region proposals (PBM priors) and part-of-speech clues ...
ver más
|
|
|
|
|
|
|
Boeun Kim, Saim Shin and Hyedong Jung
Image captioning is a promising research topic that is applicable to services that search for desired content in a large amount of video data and a situation explanation service for visually impaired people. Previous research on image captioning has been...
ver más
|
|
|
|
|
|
|
Sreela S R and Sumam Mary Idicula
Due to the rapid growth of deep learning technologies, automatic image description generation is an interesting problem in computer vision and natural language generation. It helps to improve access to photo collections on social media and gives guidance...
ver más
|
|
|
|