|
|
|
Oscar Ondeng, Heywood Ouma and Peter Akuon
Visual understanding is a research area that bridges the gap between computer vision and natural language processing. Image captioning is a visual understanding task in which natural language descriptions of images are automatically generated using visio...
ver más
|
|
|
|
|
|
|
Xinjing Zhang and Qixun Zhou
Human pose estimation, as the basis of advanced computer vision, has a wide application perspective. In existing studies, the high-capacity model based on the heatmap method can achieve accurate recognition results, but it encounters many difficulties wh...
ver más
|
|
|
|
|
|
|
Hyeong-Ju Kang
Object detection in many real applications requires the capability of detecting small objects in a system with limited resources. Convolutional neural networks (CNNs) show high performance in object detection, but they are not adequate to resource-limite...
ver más
|
|
|
|
|
|
|
Jianying Li and Xiangjun Shao
Image captioning is a challenging task, which generates a sentence for a given image. The earlier captioning methods mainly decode the visual features to generate caption sentences for the image. However, the visual features lack the context semantic inf...
ver más
|
|
|
|
|
|
|
Yun-Wei Lin, Yuh-Hwan Liu, Yi-Bing Lin and Jian-Chang Hong
Deep learning models are often trained with a large amount of labeled data to improve the accuracy for moving object detection in new fields. However, the model may not be robust enough due to insufficient training data in the new field, resulting in som...
ver más
|
|
|
|
|
|
|
Chunxian Wang, Xiaoxing Wang, Yiwen Wang, Shengchao Hu, Hongyang Chen, Xuehai Gu, Junchi Yan and Tao He
Neural architecture search (NAS) is a popular branch of automatic machine learning (AutoML), which aims to search for efficient network structures. Many prior works have explored a wide range of search algorithms for classification tasks, and have achiev...
ver más
|
|
|
|
|
|
|
Deepika Kumar, Varun Srivastava, Daniela Elena Popescu and Jude D. Hemanth
Image captioning is oriented towards describing an image with the best possible use of words that can provide a semantic, relatable meaning of the scenario inscribed. Different models can be used to accomplish this arduous task depending on the context a...
ver más
|
|
|
|
|
|
|
Shahriar Shakir Sumit, Dayang Rohaya Awang Rambli, Seyedali Mirjalili, Muhammad Mudassir Ejaz and M. Saef Ullah Miah
Human detection is a special application of object recognition and is considered one of the greatest challenges in computer vision. It is the starting point of a number of applications, including public safety and security surveillance around the world. ...
ver más
|
|
|
|
|
|
|
Qian Zhang, Jie Ren, Hong Liang, Ying Yang and Lu Chen
Small object detection becomes a challenging problem in computer vision due to low resolution and less feature information. Making full use of high-resolution features is an important factor in improving small object detection. In this paper, to improve ...
ver más
|
|
|
|
|
|
|
Hyobin Sunwoo, Wonjun Choi, Seunguk Na, Cheekyeong Kim and Seokjae Heo
As reconstruction and redevelopment accelerate, the generation of construction waste increases, and construction waste treatment technology is being developed accordingly, especially using artificial intelligence (AI). The majority of AI research project...
ver más
|
|
|
|