30 Artículos

Data Augmentation with Cross-Modal Variational Autoencoders (DACMVA) for Cancer Survival Prediction

Acceso

en línea

Sara Rajaram and Cassie S. Mitchell

The ability to translate Generative Adversarial Networks (GANs) and Variational Autoencoders (VAEs) into different modalities and data types is essential to improve Deep Learning (DL) for predictive medicine. This work presents DACMVA, a novel framework ... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 15 Num: 0 Par: 1 Año: 2024

A Survey of Full-Cycle Cross-Modal Retrieval: From a Representation Learning Perspective

Acceso

en línea

Suping Wang, Ligu Zhu, Lei Shi, Hao Mo and Songfu Tan

Cross-modal retrieval aims to elucidate information fusion, imitate human learning, and advance the field. Although previous reviews have primarily focused on binary and real-value coding methods, there is a scarcity of techniques grounded in deep repres... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 13 Num: 0 Par: 7 Año: 2023

A Cross-Modal Hash Retrieval Method with Fused Triples

Acceso

en línea

Wenxiao Li, Hongyan Mei, Yutian Li, Jiayao Yu, Xing Zhang, Xiaorong Xue and Jiahao Wang

Due to the fast retrieval speed and low storage cost, cross-modal hashing has become the primary method for cross-modal retrieval. Since the emergence of deep cross-modal hashing methods, cross-modal retrieval significantly improved. However, the existin... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 13 Num: 0 Par: 18 Año: 2023

Bimodal Fusion Network with Multi-Head Attention for Multimodal Sentiment Analysis

Acceso

en línea

Rui Zhang, Chengrong Xue, Qingfu Qi, Liyuan Lin, Jing Zhang and Lun Zhang

The enrichment of social media expression makes multimodal sentiment analysis a research hotspot. However, modality heterogeneity brings great difficulties to effective cross-modal fusion, especially the modality alignment problem and the uncontrolled ve... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 13 Num: 0 Par: 3 Año: 2023

MM-ConvBERT-LMS: Detecting Malicious Web Pages via Multi-Modal Learning and Pre-Trained Model

Acceso

en línea

Xin Tong, Bo Jin, Jingya Wang, Ying Yang, Qiwei Suo and Yong Wu

In recent years, the number of malicious web pages has increased dramatically, posing a great challenge to network security. While current machine learning-based detection methods have emerged as a promising alternative to traditional detection technique... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 13 Num: 0 Par: 5 Año: 2023

Intuitively Searching for the Rare Colors from Digital Artwork Collections by Text Description: A Case Demonstration of Japanese Ukiyo-e Print Retrieval

Acceso

en línea

Kangying Li, Jiayun Wang, Biligsaikhan Batjargal and Akira Maeda

In recent years, artworks have been increasingly digitized and built into databases, and such databases have become convenient tools for researchers. Researchers who retrieve artwork are not only researchers of humanities, but also researchers of materia... ver más

Revista: Future Internet Formato: Electrónico

Tabla de contenido: Vol: 14 Num: 0 Par: 7 Año: 2022

DA-GAN: Dual Attention Generative Adversarial Network for Cross-Modal Retrieval

Acceso

en línea

Liewu Cai, Lei Zhu, Hongyan Zhang and Xinghui Zhu

Cross-modal retrieval aims to search samples of one modality via queries of other modalities, which is a hot issue in the community of multimedia. However, two main challenges, i.e., heterogeneity gap and semantic interaction across different modalities,... ver más

Revista: Future Internet Formato: Electrónico

Tabla de contenido: Vol: 14 Num: 0 Par: 2 Año: 2022

Large-Scale Multimodal Piano Music Identification Using Marketplace Fingerprinting

Acceso

en línea

Daniel Yang, Arya Goutam, Kevin Ji and TJ Tsai

This paper studies the problem of identifying piano music in various modalities using a single, unified approach called marketplace fingerprinting. The key defining characteristic of marketplace fingerprinting is choice: we consider a broad range of fing... ver más

Revista: Algorithms Formato: Electrónico

Tabla de contenido: Vol: 15 Num: 0 Par: 5 Año: 2022

Cross-Modal Manifold Propagation for Image Recommendation

Acceso

en línea

Meng Jian, Jingjing Guo, Xin Fu, Lifang Wu and Ting Jia

The growing complex user intention gap and information overload are obstacles for users to access the desired content. User interactions and the involved content indicate rich evidence of users? interests. It is required to investigate interaction charac... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 12 Num: 0 Par: 6 Año: 2022

Induction Mechanism of Auditory-Assisted Vision for Target Search Localization in Mixed Reality (MR) Environments

Acceso

en línea

Wei Wang, Ning Xu, Sina Dang, Xuefeng Hong and Jue Qu

In MR (mixed reality) environments, visual searches are often used for search and localization missions. There are some problems with search and localization technologies, such as a limited field of view and information overload. They are unable to satis... ver más

Revista: Aerospace Formato: Electrónico

Tabla de contenido: Vol: 9 Num: 0 Par: 7 Año: 2022

« Anterior Página: 1 de 2 Siguiente »