ARTICLE
TITLE

Evaluating the Representativeness of Socio-Demographic Variables over Time for Geo-Social Media Data

Andreas Petutschnig
Bernd Resch
Stefan Lang
Clemens Havas

Abstract

Geo-social media data are widely used as a data source to model populations and processes in a variety of contexts. However, if the data do not adequately represent the population they are drawn from, analysis results will be biased. If left unaddressed, these biases may lead to false interpretations and conclusions. In this paper, we propose a generic methodology for investigating the representativeness of geo-social media data for population groups of similar statistical predictive power based on reference data. The groups are designed to be spatially coherent regions with similar prediction errors. Based on these units, we investigate the influence of different socio-demographic covariates on representativeness. We perform experiments based on over 1.6 billion tweets and 90 socio-demographic covariates. We demonstrate that the representativeness of Twitter data varies strongly over time and space. Our results show that densely populated areas tend to be consistently underrepresented in non-spatial models. Some covariates, such as the number of people aged 20 years, have strongly varying effects on the prediction models over time, whereas others are much more stable. The spatial effects can most frequently be explained using spatial error models, indicating spatially related errors that point to the need for additional covariates. Finally, we provide hints for interpreting the results of our approach for researchers using the concepts presented in this paper.
