Algorithms, Vol. 15, No. 4 (2022)

Performance of Parallel K-Means Algorithms in Java

Libero Nigro    

Abstract

K-means is a well-known clustering algorithm, often used for its simplicity and potential efficiency. Its properties and limitations have been investigated in many works reported in the literature. K-means, though, suffers from computational problems when dealing with large datasets with many dimensions and a large number of clusters. Therefore, many authors have proposed and experimented with different techniques for the parallel execution of K-means. This paper describes a novel approach to parallel K-means that targets today's commodity multicore machines with shared memory. Two reference implementations in Java are developed and their performance is compared. The first is structured according to a map/reduce schema that leverages the built-in multi-threaded concurrency that Java automatically provides to parallel streams. The second, allocated on the available cores, exploits the parallel programming model of the Theatre actor system, which is control-based, totally lock-free, and purposely relies on threads as coarse-grain "programming-in-the-large" units. The experimental results confirm that good execution performance can be achieved through the implicit and intuitive use of Java concurrency in parallel streams. However, better execution performance can be guaranteed by the modular Theatre implementation, which proves more adequate for exploiting the computational resources.
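
The map/reduce schema over parallel streams mentioned in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the class name `ParallelKMeansSketch`, the helper `Acc`, and the methods `nearest`, `step`, and `fit` are hypothetical names chosen for illustration, and the sketch assumes dense `double[]` points, squared Euclidean distance, and that an empty cluster simply keeps its previous centroid. The map phase assigns each point to its nearest centroid; the reduce phase merges per-thread partial sums through the three-argument `Stream.collect`. The Theatre-based variant is not sketched, because its API is not described here.

```java
import java.util.Arrays;

// Hypothetical illustration of one way to write K-means with Java parallel streams.
final class ParallelKMeansSketch {

    // Per-thread partial result: coordinate sums and point counts for each cluster.
    static final class Acc {
        final double[][] sum;
        final long[] count;

        Acc(int k, int dim) {
            sum = new double[k][dim];
            count = new long[k];
        }

        // Accumulate point p into cluster c.
        void add(double[] p, int c) {
            for (int j = 0; j < p.length; j++) sum[c][j] += p[j];
            count[c]++;
        }

        // Fold another partial result into this one (the "reduce" combiner).
        void merge(Acc o) {
            for (int c = 0; c < count.length; c++) {
                for (int j = 0; j < sum[c].length; j++) sum[c][j] += o.sum[c][j];
                count[c] += o.count[c];
            }
        }
    }

    // "Map" phase: index of the centroid closest to p (squared Euclidean distance).
    static int nearest(double[] p, double[][] centroids) {
        int best = 0;
        double bestDist = Double.MAX_VALUE;
        for (int c = 0; c < centroids.length; c++) {
            double d = 0.0;
            for (int j = 0; j < p.length; j++) {
                double diff = p[j] - centroids[c][j];
                d += diff * diff;
            }
            if (d < bestDist) {
                bestDist = d;
                best = c;
            }
        }
        return best;
    }

    // One K-means iteration: assign points in parallel, then recompute centroids.
    static double[][] step(double[][] points, double[][] centroids) {
        int k = centroids.length;
        int dim = centroids[0].length;
        // Each worker thread fills its own Acc; the stream library merges them.
        Acc acc = Arrays.stream(points).parallel()
                .collect(() -> new Acc(k, dim),
                         (a, p) -> a.add(p, nearest(p, centroids)),
                         Acc::merge);
        double[][] next = new double[k][dim];
        for (int c = 0; c < k; c++) {
            for (int j = 0; j < dim; j++) {
                // Assumption: an empty cluster keeps its previous centroid.
                next[c][j] = acc.count[c] == 0 ? centroids[c][j] : acc.sum[c][j] / acc.count[c];
            }
        }
        return next;
    }

    // Repeat step() until no centroid coordinate moves by eps or more, or maxIter is reached.
    static double[][] fit(double[][] points, double[][] initial, int maxIter, double eps) {
        double[][] current = initial;
        for (int it = 0; it < maxIter; it++) {
            double[][] next = step(points, current);
            double shift = 0.0;
            for (int c = 0; c < next.length; c++) {
                for (int j = 0; j < next[c].length; j++) {
                    shift = Math.max(shift, Math.abs(next[c][j] - current[c][j]));
                }
            }
            current = next;
            if (shift < eps) break;
        }
        return current;
    }

    public static void main(String[] args) {
        double[][] points = {{0, 0}, {0, 2}, {10, 10}, {10, 12}};
        double[][] initial = {{0, 0}, {10, 10}};
        System.out.println(Arrays.deepToString(fit(points, initial, 50, 1e-9)));
    }
}
```

Because every worker accumulates into its own `Acc` and the partial results are combined only at merge time, the sketch needs no explicit locks. The order of floating-point additions can differ between runs, so centroids should be compared with a tolerance rather than for exact equality.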
