Inicio  /  Information  /  Vol: 13 Par: 10 (2022)  /  Artículo
ARTÍCULO
TITULO

Fast Component Density Clustering in Spatial Databases: A Novel Algorithm

Bilal Bataineh    

Resumen

Clustering analysis is a significant technique in various fields, including unsupervised machine learning, data mining, pattern recognition, and image analysis. Many clustering algorithms are currently used, but almost all of them encounter various challenges, such as low accuracy, required number of clusters, slow processing, inability to produce non-spherical shaped clusters, and unstable performance with respect to data characteristics and size. In this research, a novel clustering algorithm called the fast component density clustering in spatial databases (FCDCSD) is proposed by utilizing a density-based clustering technique to address the aforementioned existing challenges. First, from the smallest to the largest point in the spatial field, each point is labeled with a temporary value, and the adjacent values in one component are stored in a set. Then, all sets with shared values are merged and resolved to obtain a single value that is representative of the merged sets. These values represent final cluster values; that is, the temporary equivalents in the dataset are replaced to generate the final clusters. If some noise appears, then a post-process is performed, and values are assigned to the nearest cluster based on a set of rules. Various synthetic datasets were used in the experiments to evaluate the efficiency of the proposed method. Results indicate that FCDCSD is generally superior to affinity propagation, agglomerative hierarchical, k-means, mean-shift, spectral, and density-based spatial clustering of applications with noise, ordering points for identifying clustering structures, and Gaussian mixture clustering methods.

 Artículos similares

       
 
Sana Toumi, Yassine Amirat, Elhoussin Elbouchikhi, Zhibin Zhou and Mohamed Benbouzid    
This article deals with the techno-economic optimal sizing of a tidal stream turbine (TST)?battery system. In this study, the TST system consists of a turbine rotor and a permanent magnet synchronous generator (PMSG) associated with a three-phase convert... ver más

 
Fanchang Meng, Zili Zhang, Yanhui Kang, Chengjun Cui, Dezhao Wang, Xinxin Zhang and Weihu Zhou    
With the advent of the era of big data and the vigorous development of consumer electronics, the demand for higher-speed processing capacity for gigantic amounts of data is increasing; this requires finer and far more numerous connections between dies as... ver más
Revista: Applied Sciences

 
Nikola Ivkovic, Robert Kudelic and Marin Golub    
Ant colony optimization (ACO) is a well-known class of swarm intelligence algorithms suitable for solving many NP-hard problems. An important component of such algorithms is a record of pheromone trails that reflect colonies? experiences with previously ... ver más
Revista: Algorithms

 
Yedam Na, Bonmoo Koo, Taeyoon Park, Jonghyeok Park and Wook-Hee Kim    
With the increasing capacity and cost-efficiency of DRAM in multi-core environments, in-memory databases have emerged as fundamental solutions for delivering high performance. The index structure is a crucial component of the in-memory database, which, l... ver más
Revista: Applied Sciences

 
Guoping Xiang, Tao Jiang, Qingwen Yang, Shenghua Cui, Ling Zhu, Yuhang He and Huajin Li    
The Daguangbao (DGB) landslide was the largest landslide that was triggered by the 2008 Wenchuan earthquake with a magnitude of Ms8.0. The sliding surface of this landslide was constrained on a bedding fault 400 m below the ground surface. Seismic record... ver más
Revista: Applied Sciences