Redirigiendo al acceso original de articulo en 15 segundos...
Inicio  /  Information  /  Vol: 13 Par: 12 (2022)  /  Artículo
ARTÍCULO
TITULO

Fast Training Set Size Reduction Using Simple Space Partitioning Algorithms

Stefanos Ougiaroglou    
Theodoros Mastromanolis    
Georgios Evangelidis and Dionisis Margaris    

Resumen

The Reduction by Space Partitioning (RSP3) algorithm is a well-known data reduction technique. It summarizes the training data and generates representative prototypes. Its goal is to reduce the computational cost of an instance-based classifier without penalty in accuracy. The algorithm keeps on dividing the initial training data into subsets until all of them become homogeneous, i.e., they contain instances of the same class. To divide a non-homogeneous subset, the algorithm computes its two furthest instances and assigns all instances to their closest furthest instance. This is a very expensive computational task, since all distances among the instances of a non-homogeneous subset must be calculated. Moreover, noise in the training data leads to a large number of small homogeneous subsets, many of which have only one instance. These instances are probably noise, but the algorithm mistakenly generates prototypes for these subsets. This paper proposes simple and fast variations of RSP3 that avoid the computationally costly partitioning tasks and remove the noisy training instances. The experimental study conducted on sixteen datasets and the corresponding statistical tests show that the proposed variations of the algorithm are much faster and achieve higher reduction rates than the conventional RSP3 without negatively affecting the accuracy.

 Artículos similares

       
 
Nikolaos Makrakis, Prodromos N. Psarropoulos and Yiannis Tsompanakis    
Large-scale lifelines in seismic-prone regions very frequently cross areas that are characterized by active tectonic faulting, as complete avoidance might be techno-economically unfeasible. The resulting Permanent Ground Displacements (PGDs) constitute a... ver más
Revista: Infrastructures

 
Zuwei Tan, Runze Li and Yufei Zhang    
The inlet is one of the most important components of a hypersonic vehicle. The design and optimization of the hypersonic inlet is of great significance to the research and development of hypersonic vehicles. In recent years, artificial intelligence techn... ver más
Revista: Aerospace

 
Youngki Park and Youhyun Shin    
In this paper, we introduce an efficient approach to multi-label image classification that is particularly suited for scenarios requiring rapid adaptation to new classes with minimal training data. Unlike conventional methods that rely solely on neural n... ver más
Revista: Applied Sciences

 
Guilherme Perin, Lichao Wu and Stjepan Picek    
The adoption of deep neural networks for profiling side-channel attacks opened new perspectives for leakage detection. Recent publications showed that cryptographic implementations featuring different countermeasures could be broken without feature selec... ver más
Revista: Algorithms

 
Jong-Wook Kim, Jin-Young Choi, Eun-Ju Ha and Jae-Ho Choi    
Seniors who live alone at home are at risk of falling and injuring themselves and, thus, may need a mobile robot that monitors and recognizes their poses automatically. Even though deep learning methods are actively evolving in this area, they have limit... ver más
Revista: Applied Sciences