28 Artículos

Speech Inpainting Based on Multi-Layer Long Short-Term Memory Networks

Acceso

en línea

Haohan Shi, Xiyu Shi and Safak Dogan

Audio inpainting plays an important role in addressing incomplete, damaged, or missing audio signals, contributing to improved quality of service and overall user experience in multimedia communications over the Internet and mobile networks. This paper p... ver más

Revista: Future Internet Formato: Electrónico

Tabla de contenido: Vol: 16 Num: 0 Par: 2 Año: 2024

Audio-Based Emotion Recognition Using Self-Supervised Learning on an Engineered Feature Space

Acceso

en línea

Peranut Nimitsurachat and Peter Washington

Emotion recognition models using audio input data can enable the development of interactive systems with applications in mental healthcare, marketing, gaming, and social media analysis. While the field of affective computing using audio data is rich, a m... ver más

Revista: AI Formato: Electrónico

Tabla de contenido: Vol: 5 Num: 0 Par: 1 Año: 2024

Neural Network Exploration for Keyword Spotting on Edge Devices

Acceso

en línea

Jacob Bushur and Chao Chen

The introduction of artificial neural networks to speech recognition applications has sparked the rapid development and popularization of digital assistants. These digital assistants constantly monitor the audio captured by a microphone for a small set o... ver más

Revista: Future Internet Formato: Electrónico

Tabla de contenido: Vol: 15 Num: 0 Par: 6 Año: 2023

Audio Denoising Coprocessor Based on RISC-V Custom Instruction Set Extension

Acceso

en línea

Jun Yuan, Qiang Zhao, Wei Wang, Xiangsheng Meng, Jun Li and Qin Li

As a typical active noise control algorithm, Filtered-x Least Mean Square (FxLMS) is widely used in the field of audio denoising. In this study, an audio denoising coprocessor based on Retrenched Injunction System Computer-V (RISC-V), a custom instructio... ver más

Revista: Acoustics Formato: Electrónico

Tabla de contenido: Vol: 4 Num: 0 Par: 3 Año: 2022

Can Voice Reviews Enhance Trust in Voice Shopping? The Effects of Voice Reviews on Trust and Purchase Intention in Voice Shopping

Acceso

en línea

Jaeun Seo, Daeho Lee and Inyoung Park

Despite the high expectations of the voice shopping market, the impact of reviews and product types on voice commerce has yet to be explored. The purpose of this study is to investigate the effect of reviews and product types on users? trust and purchase... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 12 Num: 0 Par: 20 Año: 2022

End-to-End Pedestrian Trajectory Forecasting with Transformer Network

Acceso

en línea

Hai-Yan Yao, Wang-Gen Wan and Xiang Li

Analysis of pedestrians? motion is important to real-world applications in public scenes. Due to the complex temporal and spatial factors, trajectory prediction is a challenging task. With the development of attention mechanism recently, transformer netw... ver más

Revista: ISPRS International Journal of Geo-Information Formato: Electrónico

Tabla de contenido: Vol: 11 Num: 0 Par: 1 Año: 2022

The Influence of Listeners? Mood on Equalization-Based Listening Experience

Acceso

en línea

Nefeli Dourou, Valeria Bruschi, Susanna Spinsante and Stefania Cecchi

Using equalization to improve sound listening experience is a well-established topic among the audio society. Finding a general equalization curve is a difficult task because of spectral content influenced by the reproduction system (loudspeakers and roo... ver más

Revista: Acoustics Formato: Electrónico

Tabla de contenido: Vol: 4 Num: 0 Par: 3 Año: 2022

Effectiveness of MP3 Coding Depends on the Music Genre: Evaluation Using Semantic Differential Scales

Acceso

en línea

Nikolaos M. Papadakis, Ioanna Aroni and Georgios E. Stavroulakis

MPEG-1 Layer 3 (MP3) is one of the most popular compression formats used for sound and especially for music. However, during the coding process, the MP3 algorithm negatively affects the spectral and dynamic characteristics of the audio file being compres... ver más

Revista: Acoustics Formato: Electrónico

Tabla de contenido: Vol: 4 Num: 0 Par: 3 Año: 2022

Language Identification-Based Evaluation of Single Channel Speech Separation of Overlapped Speeches

Acceso

en línea

Zuhragvl Aysa, Mijit Ablimit, Hankiz Yilahun and Askar Hamdulla

In multi-lingual, multi-speaker environments (e.g., international conference scenarios), speech, language, and background sounds can overlap. In real-world scenarios, source separation techniques are needed to separate target sounds. Downstream tasks, su... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 13 Num: 0 Par: 10 Año: 2022

Identification of Fake Stereo Audio Using SVM and CNN

Acceso

en línea

Tianyun Liu, Diqun Yan, Rangding Wang, Nan Yan and Gang Chen

The number of channels is one of the important criteria in regard to digital audio quality. Generally, stereo audio with two channels can provide better perceptual quality than mono audio. To seek illegal commercial benefit, one might convert a mono audi... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 12 Num: 0 Par: 7 Año: 2021

« Anterior Página: 1 de 2 Siguiente »