Inicio  /  Information  /  Vol: 12 Par: 5 (2021)  /  Artículo
ARTÍCULO
TITULO

Exploring Reusability and Reproducibility for a Research Infrastructure for L1 and L2 Learner Corpora

Alexander König    
Jennifer-Carmen Frey and Egon W. Stemle    

Resumen

Up until today research in various educational and linguistic domains such as learner corpus research, writing research, or second language acquisition has produced a substantial amount of research data in the form of L1 and L2 learner corpora. However, the multitude of individual solutions combined with domain-inherent obstacles in data sharing have so far hampered comparability, reusability and reproducibility of data and research results. In this article, we present work in creating a digital infrastructure for L1 and L2 learner corpora and populating it with data collected in the past. We embed our infrastructure efforts in the broader field of infrastructures for scientific research, drawing from technical solutions and frameworks from research data management, among which the FAIR guiding principles for data stewardship. We share our experiences from integrating some L1 and L2 learner corpora from concluded projects into the infrastructure while trying to ensure compliance with the FAIR principles and the standards we established for reproducibility, discussing how far research data that has been collected in the past can be made comparable, reusable and reproducible. Our results show that some basic needs for providing comparable and reusable data are covered by existing general infrastructure solutions and can be exploited for domain-specific infrastructures such as the one presented in this article. Other aspects need genuinely domain-driven approaches. The solutions found for the corpora in the presented infrastructure can only be a preliminary attempt, and further community involvement would be needed to provide templates and models acknowledged and promoted by the community. Furthermore, forward-looking data management would be needed starting from the beginning of new corpus creation projects to ensure that all requirements for FAIR data can be met.

 Artículos similares

       
 
S. Rahma, R. A. E. Putra     Pág. 76 - 83
The main role of a transportation network is providing optimum services for transportation network. Over the time, the population is increasing and the needs of reliable transportation network also increased. Transportation network consists of node and c... ver más

 
Nanda Nurisman, Trika Agnestasia Tarigan     Pág. 162 - 168
Labuhan Jukung Beach is one of the beaches in Kru, which is located on Krui Bay, West Coast District. This beach is a tourist beach directly adjacent to the Indian Ocean, so it has a high wave. Based on wind data from 2008 ? 2017 that be analyzed in this... ver más

 
I. Oktaviani, M. Asril, Y. Aryanti, S. S. Leksikowati     Pág. 47 - 52
The conversion of agricultural land and plantation into an area with high human activity can affect the biodiversity contained in it. The biodiversity of a region can be surveyed and collect in a systematic database to know the wealth of flora and fauna ... ver más

 
Houaria ABED, Lynda ZAOUI     Pág. 97 - 113
Recent years have witnessed great interest in developing methods for content-based image retrieval (CBIR). Generally, the image search results which are returned by an image search engine contain multiple topics, and organizing the results into different... ver más

 
Hugo López-Fernández     Pág. 22 - 25
Mass spectrometry using matrix assisted laser desorption ionization coupled to time of flight analyzers (MALDI-TOF MS) has become popular during the last decade due to its high speed, sensitivity and robustness for detecting proteins and peptides. This a... ver más