Haohan Shi, Xiyu Shi and Safak Dogan
Audio inpainting plays an important role in addressing incomplete, damaged, or missing audio signals, contributing to improved quality of service and overall user experience in multimedia communications over the Internet and mobile networks. This paper p...
Peranut Nimitsurachat and Peter Washington
Emotion recognition models using audio input data can enable the development of interactive systems with applications in mental healthcare, marketing, gaming, and social media analysis. While the field of affective computing using audio data is rich, a m...
Jacob Bushur and Chao Chen
The introduction of artificial neural networks to speech recognition applications has sparked the rapid development and popularization of digital assistants. These digital assistants constantly monitor the audio captured by a microphone for a small set o...
Jun Yuan, Qiang Zhao, Wei Wang, Xiangsheng Meng, Jun Li and Qin Li
As a typical active noise control algorithm, Filtered-x Least Mean Square (FxLMS) is widely used in the field of audio denoising. In this study, an audio denoising coprocessor based on Reduced Instruction Set Computer-V (RISC-V), a custom instructio...
Jaeun Seo, Daeho Lee and Inyoung Park
Despite the high expectations of the voice shopping market, the impact of reviews and product types on voice commerce has yet to be explored. The purpose of this study is to investigate the effect of reviews and product types on users' trust and purchase...
Hai-Yan Yao, Wang-Gen Wan and Xiang Li
Analysis of pedestrians' motion is important to real-world applications in public scenes. Because of complex temporal and spatial factors, trajectory prediction is a challenging task. With the recent development of attention mechanisms, transformer netw...
Nefeli Dourou, Valeria Bruschi, Susanna Spinsante and Stefania Cecchi
Using equalization to improve the listening experience is a well-established topic in the audio community. Finding a general equalization curve is a difficult task because of spectral content influenced by the reproduction system (loudspeakers and roo...
Nikolaos M. Papadakis, Ioanna Aroni and Georgios E. Stavroulakis
MPEG-1 Layer 3 (MP3) is one of the most popular compression formats used for sound and especially for music. However, during the coding process, the MP3 algorithm negatively affects the spectral and dynamic characteristics of the audio file being compres...
Zuhragvl Aysa, Mijit Ablimit, Hankiz Yilahun and Askar Hamdulla
In multi-lingual, multi-speaker environments (e.g., international conference scenarios), speech, language, and background sounds can overlap. In real-world scenarios, source separation techniques are needed to separate target sounds. Downstream tasks, su...
Tianyun Liu, Diqun Yan, Rangding Wang, Nan Yan and Gang Chen
The number of channels is one of the important criteria in regard to digital audio quality. Generally, stereo audio with two channels can provide better perceptual quality than mono audio. To seek illegal commercial benefit, one might convert a mono audi...