Inicio  /  Future Internet  /  Vol: 15 Par: 3 (2023)  /  Artículo
ARTÍCULO
TITULO

Data Protection and Multi-Database Data-Driven Models

Lili Jiang and Vicenç Torra    

Resumen

Anonymization and data masking have effects on data-driven models. Different anonymization methods have been developed to provide a good trade-off between privacy guarantees and data utility. Nevertheless, the effects of data protection (e.g., data microaggregation and noise addition) on data integration and on data-driven models (e.g., machine learning models) built from these data are not known. In this paper, we study how data protection affects data integration, and the corresponding effects on the results of machine learning models built from the outcome of the data integration process. The experimental results show that the levels of protection that prevent proper database integration do not affect machine learning models that learn from the integrated database to the same degree. Concretely, our preliminary analysis and experiments show that data protection techniques have a lower level of impact on data integration than on machine learning models.

 Artículos similares

       
 
Meng Li, Jiqiang Liu and Yeping Yang    
Data governance is an extremely important protection and management measure throughout the entire life cycle of data. However, there are still data governance issues, such as data security risks, data privacy breaches, and difficulties in data management... ver más
Revista: Future Internet

 
Lijun Zu, Wenyu Qi, Hongyi Li, Xiaohua Men, Zhihui Lu, Jiawei Ye and Liang Zhang    
The digital transformation of banks has led to a paradigm shift, promoting the open sharing of data and services with third-party providers through APIs, SDKs, and other technological means. While data sharing brings personalized, convenient, and enriche... ver más
Revista: Future Internet

 
Mosaad Ali Hussein Ali, Farag M. Mewafy, Wei Qian, Ajibola Richard Faruwa, Ali Shebl, Saleh Dabaa and Hussein A. Saleem    
The effective detection and monitoring of mining tailings? leachates (MTLs) plays a pivotal role in environmental protection and remediation efforts. Electrical resistivity tomography (ERT) is a non-invasive technique widely employed for mapping subsurfa... ver más
Revista: Water

 
Ze Liu, Jingzhao Zhou, Xiaoyang Yang, Zechuan Zhao and Yang Lv    
Water resource modeling is an important means of studying the distribution, change, utilization, and management of water resources. By establishing various models, water resources can be quantitatively described and predicted, providing a scientific basi... ver más
Revista: Water

 
Jakob Benisch, Björn Helm, Xin Chang and Peter Krebs    
The European Union Water Framework Directive (2000/60/EC; WFD) aims to achieve a good ecological and chemical status of all bodies of surface water by 2027. The development of integrated guidance on surface water chemical monitoring (e.g., WFD Guidance D... ver más
Revista: Water