ARTICLE
TITLE

Usage of the Term Big Data in Biomedical Publications: A Text Mining Approach

Allard J. van Altena    
Perry D. Moerland    
Aeilko H. Zwinderman and Sílvia Delgado Olabarriaga    

Abstract

In this study, we attempt to assess the value of the term Big Data when used by researchers in their publications. For this purpose, we systematically collected a corpus of biomedical publications that use and do not use the term Big Data. These documents were used as input to a machine learning classifier to determine how well they can be separated into two groups and to determine the most distinguishing classification features. We generated 100 classifiers that could correctly distinguish between Big Data and non-Big Data documents with an area under the Receiver Operating Characteristic (ROC) curve of 0.96. The differences between the two groups were characterized by terms specific to Big Data themes (such as "computational", "mining", and "challenges") and also by terms that indicate the research field, such as "genomics". The ROC curves when plotted for various time intervals showed no difference over time. We conclude that there is a detectable and stable difference between publications that use the term Big Data and those that do not. Furthermore, the use of the term Big Data within a publication seems to indicate a distinct type of research in the biomedical field. Therefore, we conclude that value can be attributed to the term Big Data when used in a publication and this value has not changed over time.
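For readers unfamiliar with the metric quoted in the abstract, the following is a minimal sketch of how an area under the ROC curve can be computed from classifier scores. It is not the authors' implementation: the abstract does not specify the classifier or tooling, and the function name `roc_auc` and the toy labels and scores are hypothetical, chosen only for illustration. The sketch uses the pairwise interpretation of the AUC, that is, the probability that a randomly chosen Big Data document receives a higher score than a randomly chosen non-Big Data document.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    labels: 1 for a (hypothetical) Big Data document, 0 otherwise.
    scores: classifier scores, higher meaning "more likely Big Data".
    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one document from each class")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data, not taken from the study.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.2, 0.1]
auc = roc_auc(labels, scores)
print(f"AUC = {auc:.3f}")
```

An AUC of 0.5 corresponds to chance-level separation and 1.0 to perfect separation, so the reported 0.96 indicates that the two groups of documents are highly distinguishable.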
