Kara Combs, Adam Moyer and Trevor J. Bihl
Recently, generative artificial intelligence (GAI) has impressed the world with its ability to create text, images, and videos. However, there are still areas in which GAI produces undesirable or unintended results due to being "uncertain". Before wider ...
Dominik Warch, Patrick Stellbauer and Pascal Neis
In the digital transformation era, video media libraries' untapped potential is immense, restricted primarily by their non-machine-readable nature and basic search functionalities limited to standard metadata. This study presents a novel multimodal metho...
Aref Maksoud, Aya Elshabshiri, Amani Saeed Hilal Humaid Alzaabi and Aseel Hussien
The study aims to understand to what extent employing A.I. image-generative tools in architectural concept brainstorming demonstrates effectiveness, accuracy, and adherence to text and image inputs, and evaluate the utilization of A.I. image-generative t...
Mahbuba Begum, Sumaita Binte Shorif, Mohammad Shorif Uddin, Jannatul Ferdush, Tony Jan, Alistair Barros and Md Whaiduzzaman
Digital multimedia elements such as text, image, audio, and video can be easily manipulated because of the rapid rise of multimedia technology, making data protection a prime concern. Hence, copyright protection, content authentication, and integrity ver...
Yohanes Yohanie Fridelin Panduman, Nobuo Funabiki, Evianita Dewi Fajrianti, Shihao Fang and Sritrusta Sukaridhoto
In this paper, we have developed the SEMAR (Smart Environmental Monitoring and Analytics in Real-Time) IoT application server platform for fast deployments of IoT application systems. It provides various integration capabilities for the collection, displ...
Chao He, Xinghua Zhang, Dongqing Song, Yingshan Shen, Chengjie Mao, Huosheng Wen, Dingju Zhu and Lihua Cai
With the popularization of better network access and the penetration of personal smartphones in today's world, the explosion of multi-modal data, particularly opinionated video messages, has created urgent demands and immense opportunities for Multi-Moda...
Hao Liu, Bo Yang and Zhiwen Yu
Multimodal sarcasm detection is a developing research field in social Internet of Things, which is the foundation of artificial intelligence and human psychology research. Sarcastic comments issued on social media often imply people's real attitudes towa...
Jin-Woo Kong, Byoung-Doo Oh, Chulho Kim and Yu-Seop Kim
Intracerebral hemorrhage (ICH) is a severe cerebrovascular disorder that poses a life-threatening risk, necessitating swift diagnosis and treatment. While CT scans are the most effective diagnostic tool for detecting cerebral hemorrhage, their interpreta...
Kyungho Yu, Hyoungju Kim, Jeongin Kim, Chanjun Chun and Pankoo Kim
Text-to-image technology enables computers to create images from text by simulating the human process of forming mental images. GAN-based text-to-image technology involves extracting features from input text; subsequently, they are combined with noise an...
Maryam Omar, Hafeez Ur Rehman, Omar Bin Samin, Moutaz Alazab, Gianfranco Politano and Alfredo Benso
Text-to-image synthesis is one of the most critical and challenging problems of generative modeling. It is of substantial importance in the area of automatic learning, especially for image creation, modification, analysis and optimization. A number of wo...