155 Artículos

Uncertainty in Visual Generative AI

Acceso

en línea

Kara Combs, Adam Moyer and Trevor J. Bihl

Recently, generative artificial intelligence (GAI) has impressed the world with its ability to create text, images, and videos. However, there are still areas in which GAI produces undesirable or unintended results due to being ?uncertain?. Before wider ... ver más

Revista: Algorithms Formato: Electrónico

Tabla de contenido: Vol: 17 Num: 0 Par: 4 Año: 2024

Advanced Techniques for Geospatial Referencing in Online Media Repositories

Acceso

en línea

Dominik Warch, Patrick Stellbauer and Pascal Neis

In the digital transformation era, video media libraries? untapped potential is immense, restricted primarily by their non-machine-readable nature and basic search functionalities limited to standard metadata. This study presents a novel multimodal metho... ver más

Revista: Future Internet Formato: Electrónico

Tabla de contenido: Vol: 16 Num: 0 Par: 3 Año: 2024

Integrating an Image-Generative Tool on Creative Design Brainstorming Process of a Safavid Mosque Architecture Conceptual Form

Acceso

en línea

Aref Maksoud, Aya Elshabshiri, Amani Saeed Hilal Humaid Alzaabi and Aseel Hussien

The study aims to understand to what extent employing A.I. image-generative tools in architectural concept brainstorming demonstrates effectiveness, accuracy, and adherence to text and image inputs, and evaluate the utilization of A.I. image-generative t... ver más

Revista: Buildings Formato: Electrónico

Tabla de contenido: Vol: 14 Num: 0 Par: 3 Año: 2024

Image Watermarking Using Discrete Wavelet Transform and Singular Value Decomposition for Enhanced Imperceptibility and Robustness

Acceso

en línea

Mahbuba Begum, Sumaita Binte Shorif, Mohammad Shorif Uddin, Jannatul Ferdush, Tony Jan, Alistair Barros and Md Whaiduzzaman

Digital multimedia elements such as text, image, audio, and video can be easily manipulated because of the rapid rise of multimedia technology, making data protection a prime concern. Hence, copyright protection, content authentication, and integrity ver... ver más

Revista: Algorithms Formato: Electrónico

Tabla de contenido: Vol: 17 Num: 0 Par: 1 Año: 2024

Mixture of Attention Variants for Modal Fusion in Multi-Modal Sentiment Analysis

Acceso

en línea

Chao He, Xinghua Zhang, Dongqing Song, Yingshan Shen, Chengjie Mao, Huosheng Wen, Dingju Zhu and Lihua Cai

With the popularization of better network access and the penetration of personal smartphones in today?s world, the explosion of multi-modal data, particularly opinionated video messages, has created urgent demands and immense opportunities for Multi-Moda... ver más

Revista: Big Data and Cognitive Computing Formato: Electrónico

Tabla de contenido: Vol: 8 Num: 0 Par: 2 Año: 2024

A Multi-View Interactive Approach for Multimodal Sarcasm Detection in Social Internet of Things with Knowledge Enhancement

Acceso

en línea

Hao Liu, Bo Yang and Zhiwen Yu

Multimodal sarcasm detection is a developing research field in social Internet of Things, which is the foundation of artificial intelligence and human psychology research. Sarcastic comments issued on social media often imply people?s real attitudes towa... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 14 Num: 0 Par: 5 Año: 2024

Sequential Brain CT Image Captioning Based on the Pre-Trained Classifiers and a Language Model

Acceso

en línea

Jin-Woo Kong, Byoung-Doo Oh, Chulho Kim and Yu-Seop Kim

Intracerebral hemorrhage (ICH) is a severe cerebrovascular disorder that poses a life-threatening risk, necessitating swift diagnosis and treatment. While CT scans are the most effective diagnostic tool for detecting cerebral hemorrhage, their interpreta... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 14 Num: 0 Par: 3 Año: 2024

A Survey of AI Techniques in IoT Applications with Use Case Investigations in the Smart Environmental Monitoring and Analytics in Real-Time IoT Platform

Acceso

en línea

Yohanes Yohanie Fridelin Panduman, Nobuo Funabiki, Evianita Dewi Fajrianti, Shihao Fang and Sritrusta Sukaridhoto

In this paper, we have developed the SEMAR (Smart Environmental Monitoring and Analytics in Real-Time) IoT application server platform for fast deployments of IoT application systems. It provides various integration capabilities for the collection, displ... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 15 Num: 0 Par: 3 Año: 2024

A Study on Generating Webtoons Using Multilingual Text-to-Image Models

Acceso

en línea

Kyungho Yu, Hyoungju Kim, Jeongin Kim, Chanjun Chun and Pankoo Kim

Text-to-image technology enables computers to create images from text by simulating the human process of forming mental images. GAN-based text-to-image technology involves extracting features from input text; subsequently, they are combined with noise an... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 13 Num: 0 Par: 12 Año: 2023

CapGAN: Text-to-Image Synthesis Using Capsule GANs

Acceso

en línea

Maryam Omar, Hafeez Ur Rehman, Omar Bin Samin, Moutaz Alazab, Gianfranco Politano and Alfredo Benso

Text-to-image synthesis is one of the most critical and challenging problems of generative modeling. It is of substantial importance in the area of automatic learning, especially for image creation, modification, analysis and optimization. A number of wo... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 14 Num: 0 Par: 10 Año: 2023

« Anterior Página: 1 de 10 Siguiente »