29 Artículos

A Review of Transformer-Based Approaches for Image Captioning

Acceso

en línea

Oscar Ondeng, Heywood Ouma and Peter Akuon

Visual understanding is a research area that bridges the gap between computer vision and natural language processing. Image captioning is a visual understanding task in which natural language descriptions of images are automatically generated using visio... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 13 Num: 0 Par: 19 Año: 2023

RepNet: A Lightweight Human Pose Regression Network Based on Re-Parameterization

Acceso

en línea

Xinjing Zhang and Qixun Zhou

Human pose estimation, as the basis of advanced computer vision, has a wide application perspective. In existing studies, the high-capacity model based on the heatmap method can achieve accurate recognition results, but it encounters many difficulties wh... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 13 Num: 0 Par: 16 Año: 2023

SSDLiteX: Enhancing SSDLite for Small Object Detection

Acceso

en línea

Hyeong-Ju Kang

Object detection in many real applications requires the capability of detecting small objects in a system with limited resources. Convolutional neural networks (CNNs) show high performance in object detection, but they are not adequate to resource-limite... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 13 Num: 0 Par: 21 Año: 2023

A Context Semantic Auxiliary Network for Image Captioning

Acceso

en línea

Jianying Li and Xiangjun Shao

Image captioning is a challenging task, which generates a sentence for a given image. The earlier captioning methods mainly decode the visual features to generate caption sentences for the image. However, the visual features lack the context semantic inf... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 14 Num: 0 Par: 7 Año: 2023

FenceTalk: Exploring False Negatives in Moving Object Detection

Acceso

en línea

Yun-Wei Lin, Yuh-Hwan Liu, Yi-Bing Lin and Jian-Chang Hong

Deep learning models are often trained with a large amount of labeled data to improve the accuracy for moving object detection in new fields. However, the model may not be robust enough due to insufficient training data in the new field, resulting in som... ver más

Revista: Algorithms Formato: Electrónico

Tabla de contenido: Vol: 16 Num: 0 Par: 10 Año: 2023

FastDARTSDet: Fast Differentiable Architecture Joint Search on Backbone and FPN for Object Detection

Acceso

en línea

Chunxian Wang, Xiaoxing Wang, Yiwen Wang, Shengchao Hu, Hongyang Chen, Xuehai Gu, Junchi Yan and Tao He

Neural architecture search (NAS) is a popular branch of automatic machine learning (AutoML), which aims to search for efficient network structures. Many prior works have explored a wide range of search algorithms for classification tasks, and have achiev... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 12 Num: 0 Par: 20 Año: 2022

Dual-Modal Transformer with Enhanced Inter- and Intra-Modality Interactions for Image Captioning

Acceso

en línea

Deepika Kumar, Varun Srivastava, Daniela Elena Popescu and Jude D. Hemanth

Image captioning is oriented towards describing an image with the best possible use of words that can provide a semantic, relatable meaning of the scenario inscribed. Different models can be used to accomplish this arduous task depending on the context a... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 12 Num: 0 Par: 13 Año: 2022

ReSTiNet: On Improving the Performance of Tiny-YOLO-Based CNN Architecture for Applications in Human Detection

Acceso

en línea

Shahriar Shakir Sumit, Dayang Rohaya Awang Rambli, Seyedali Mirjalili, Muhammad Mudassir Ejaz and M. Saef Ullah Miah

Human detection is a special application of object recognition and is considered one of the greatest challenges in computer vision. It is the starting point of a number of applications, including public safety and security surveillance around the world. ... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 12 Num: 0 Par: 18 Año: 2022

BFE-Net: Bidirectional Multi-Scale Feature Enhancement for Small Object Detection

Acceso

en línea

Qian Zhang, Jie Ren, Hong Liang, Ying Yang and Lu Chen

Small object detection becomes a challenging problem in computer vision due to low resolution and less feature information. Making full use of high-resolution features is an important factor in improving small object detection. In this paper, to improve ... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 12 Num: 0 Par: 7 Año: 2022

Comparison of the Performance of Artificial Intelligence Models Depending on the Labelled Image by Different User Levels

Acceso

en línea

Hyobin Sunwoo, Wonjun Choi, Seunguk Na, Cheekyeong Kim and Seokjae Heo

As reconstruction and redevelopment accelerate, the generation of construction waste increases, and construction waste treatment technology is being developed accordingly, especially using artificial intelligence (AI). The majority of AI research project... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 12 Num: 0 Par: 6 Año: 2022

« Anterior Página: 1 de 2 Siguiente »