Redirigiendo al acceso original de articulo en 18 segundos...
ARTÍCULO
TITULO

Ensemble-Based Short Text Similarity: An Easy Approach for Multilingual Datasets Using Transformers and WordNet in Real-World Scenarios

Isabella Gagliardi and Maria Teresa Artese    

Resumen

When integrating data from different sources, there are problems of synonymy, different languages, and concepts of different granularity. This paper proposes a simple yet effective approach to evaluate the semantic similarity of short texts, especially keywords. The method is capable of matching keywords from different sources and languages by exploiting transformers and WordNet-based methods. Key features of the approach include its unsupervised pipeline, mitigation of the lack of context in keywords, scalability for large archives, support for multiple languages and real-world scenarios adaptation capabilities. The work aims to provide a versatile tool for different cultural heritage archives without requiring complex customization. The paper aims to explore different approaches to identifying similarities in 1- or n-gram tags, evaluate and compare different pre-trained language models, and define integrated methods to overcome limitations. Tests to validate the approach have been conducted using the QueryLab portal, a search engine for cultural heritage archives, to evaluate the proposed pipeline.

 Artículos similares

       
 
Nejc Co?, Reza Ahmadian and Roger A. Falconer    
Understanding the impact of various hydraulic structures, such as coastal reservoirs and tidal range impoundments, has been one of the key challenges of hydro?environmental engineering in recent years. Over the last half-century, several proposals for ti... ver más
Revista: Water

 
Ashraf Abdelkarim and Ahmed F.D. Gaber    
This study aims to assess the impact of flash floods in the Wadi Nu?man basin on urban areas, east of Mecca, which are subjected to frequent floods, during the period from 1988?2019. By producing and analyzing the maps of the regions, an integrated appro... ver más
Revista: Water

 
Young Hwan Choi and Joong Hoon Kim    
This study compares the performance of self-adaptive optimization approaches in efficient water distribution systems (WDS) design and presents a guide for the selection of the appropriate method employing optimization utilizing the characteristic of each... ver más
Revista: Water

 
António Carlos Pinheiro Fernandes, Luís Filipe Sanches Fernandes, Daniela Patrícia Salgado Terêncio, Rui Manuel Vitor Cortes and Fernando António Leal Pacheco    
Interactions between pollution sources, water contamination, and ecological integrity are complex phenomena and hard to access. To comprehend this subject of study, it is crucial to use advanced statistical tools, which can unveil cause-effect relationsh... ver más
Revista: Water

 
M.H.J.P. Gunarathna, Kazuhito Sakai, Tamotsu Nakandakari, Kazuro Momii and M.K.N. Kumari    
Poor data availability on soil hydraulic properties in tropical regions hampers many studies, including crop and environmental modeling. The high cost and effort of measurement and the increasing demand for such data have driven researchers to search for... ver más
Revista: Water