Haohan Shi, Xiyu Shi and Safak Dogan
Audio inpainting plays an important role in addressing incomplete, damaged, or missing audio signals, contributing to improved quality of service and overall user experience in multimedia communications over the Internet and mobile networks. This paper p...
Jingwen Yang and Ruohua Zhou
Whisper speaker recognition (WSR) has received extensive attention from researchers in recent years, and it plays an important role in medical, judicial, and other fields. Among them, the establishment of a whisper dataset is very important for the study...
Peranut Nimitsurachat and Peter Washington
Emotion recognition models using audio input data can enable the development of interactive systems with applications in mental healthcare, marketing, gaming, and social media analysis. While the field of affective computing using audio data is rich, a m...
Qiwei Shen, Junjie Xu, Jiahao Mei, Xingjiao Wu and Daoguo Dong
Jun-Hwa Kim and Chee Sun Won
Dominik Warch, Patrick Stellbauer and Pascal Neis
In the digital transformation era, video media libraries' untapped potential is immense, restricted primarily by their non-machine-readable nature and basic search functionalities limited to standard metadata. This study presents a novel multimodal metho...
Urszula Libal and Pawel Biernacki
An automatic honey bee classification system based on audio signals for tracking the frequency of workers and drones entering and leaving a hive.
Mohamed Dhiaeddine Messaoudi, Bob-Antoine J. Menelas and Hamid Mcheick
This research introduces an innovative smart cane architecture designed to empower visually impaired individuals. Integrating advanced sensors and social media connectivity, the smart cane enhances accessibility and encourages physical activity. Three me...
Mingyoung Jeng, Alvir Nobel, Vinayak Jha, David Levy, Dylan Kneidel, Manu Chaudhary, Ishraq Islam, Evan Baumgartner, Eade Vanderhoof, Audrey Facer, Manish Singh, Abina Arshad and Esam El-Araby
Convolutional neural networks (CNNs) have proven to be a very efficient class of machine learning (ML) architectures for handling multidimensional data by maintaining data locality, especially in the field of computer vision. Data pooling, a major compon...
Mahbuba Begum, Sumaita Binte Shorif, Mohammad Shorif Uddin, Jannatul Ferdush, Tony Jan, Alistair Barros and Md Whaiduzzaman
Digital multimedia elements such as text, image, audio, and video can be easily manipulated because of the rapid rise of multimedia technology, making data protection a prime concern. Hence, copyright protection, content authentication, and integrity ver...