Redirigiendo al acceso original de articulo en 20 segundos...
Inicio  /  Information  /  Vol: 15 Par: 3 (2024)  /  Artículo
ARTÍCULO
TITULO

Algorithm-Based Data Generation (ADG) Engine for Dual-Mode User Behavioral Data Analytics

Iman I. M. Abu Sulayman    
Peter Voege and Abdelkader Ouda    

Resumen

The increasing significance of data analytics in modern information analysis is underpinned by vast amounts of user data. However, it is only feasible to amass sufficient data for various tasks in specific data-gathering contexts that either have limited security information or are associated with older applications. There are numerous scenarios where a domain is too new, too specialized, too secure, or data are too sparsely available to adequately support data analytics endeavors. In such cases, synthetic data generation becomes necessary to facilitate further analysis. To address this challenge, we have developed an Algorithm-based Data Generation (ADG) Engine that enables data generation without the need for initial data, relying instead on user behavior patterns, including both normal and abnormal behavior. The ADG Engine uses a structured database system to keep track of users across different types of activity. It then uses all of this information to make the generated data as real as possible. Our efforts are particularly focused on data analytics, achieved by generating abnormalities within the data and allowing users to customize the generation of normal and abnormal data ratios. In situations where obtaining additional data through conventional means would be impractical or impossible, especially in the case of specific characteristics like anomaly percentages, algorithmically generated datasets provide a viable alternative. In this paper, we introduce the ADG Engine, which can create coherent datasets for multiple users engaged in different activities and across various platforms, entirely from scratch. The ADG Engine incorporates normal and abnormal ratios within each data platform through the application of core algorithms for time-based and numeric-based anomaly generation. The resulting abnormal percentage is compared against the expected values and ranges from 0.13 to 0.17 abnormal data instances in each column. Along with the normal/abnormal ratio, the results strongly suggest that the ADG Engine has successfully completed its primary task.

 Artículos similares

       
 
Yongchao Hui, Yuehua Cheng, Bin Jiang and Lei Yang    
This research presents a novel data-based multi-parameter health assessment method to meet the growing need for the in-orbit health assessment of satellite components. This method analyzed changes in component health status by calculating distribution de... ver más
Revista: Aerospace

 
Bin Cheng, Ping Chen, Xin Zhang, Keyu Fang, Xiaoli Qin and Wei Liu    
With the rapid development of ubiquitous data collection and data analysis, data privacy in a recommended system is facing more and more challenges. Differential privacy technology can provide strict privacy protection while reducing the risk of privacy ... ver más
Revista: Applied Sciences

 
Moli Chen, Xunkai Wei, Hao Wang and Zhenhe Jiang    
Compared to maximum state parameters, such as maximum altitude and Mach, the number of hovers and S turns can be used as process parameters representing the complexity of military aircraft maneuvers when classifying big flight mission data to compile fli... ver más
Revista: Applied Sciences

 
Yunfei Zhang, Hongzhen Xu and Xiaojun Yu    
An improved recommendation algorithm based on Conditional Variational Autoencoder (CVAE) and Constrained Probabilistic Matrix Factorization (CPMF) is proposed to address the issues of poor recommendation performance in traditional user-based collaborativ... ver más
Revista: Applied Sciences

 
Xiaoyu Liu, Wenqian Peng, Liuyin Xie and Xiang Zhang    
In this paper, a novel bat algorithm based on the quantum computing concept and pyramid neural network (PNN) is presented and applied to the electromagnetic motor optimization problem. Due to the problems of high loss, high temperature rise and threateni... ver más
Revista: Applied Sciences