Journal: Information, Vol. 13, No. 12 (2022)
ARTICLE
TITLE

Fast Training Set Size Reduction Using Simple Space Partitioning Algorithms

Stefanos Ougiaroglou    
Theodoros Mastromanolis    
Georgios Evangelidis and Dionisis Margaris    

Abstract

Reduction by Space Partitioning (RSP3) is a well-known data reduction technique. It summarizes the training data by generating representative prototypes, with the goal of reducing the computational cost of an instance-based classifier without a penalty in accuracy. The algorithm keeps dividing the initial training data into subsets until every subset is homogeneous, i.e., contains instances of a single class. To divide a non-homogeneous subset, the algorithm finds its two most distant instances and assigns every instance in the subset to the closer of the two. This step is computationally expensive, since all pairwise distances among the instances of the non-homogeneous subset must be calculated. Moreover, noise in the training data leads to a large number of small homogeneous subsets, many of which contain only one instance. These instances are probably noise, yet the algorithm still generates prototypes for them. This paper proposes simple and fast variations of RSP3 that avoid the computationally costly partitioning tasks and remove the noisy training instances. An experimental study on sixteen datasets, together with the corresponding statistical tests, shows that the proposed variations are much faster and achieve higher reduction rates than conventional RSP3 without negatively affecting accuracy.
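The partitioning loop of conventional RSP3, as the abstract describes it, can be sketched in a few lines of Python. This is an illustrative reading, not the authors' implementation: the function names are hypothetical, Euclidean distance is assumed, each prototype is assumed to be the mean of its homogeneous subset, and the handling of identical points with different labels is an added safeguard that the abstract does not mention.

```python
import numpy as np


def split_by_farthest_pair(X):
    """Return a boolean mask that splits the rows of X around its two
    mutually farthest rows (assumption: Euclidean distance)."""
    # All pairwise squared distances: the quadratic step the abstract
    # identifies as the expensive part of conventional RSP3.
    diff = X[:, None, :] - X[None, :, :]
    d2 = np.einsum("ijk,ijk->ij", diff, diff)
    a, b = np.unravel_index(np.argmax(d2), d2.shape)
    # Each row goes to whichever of the two farthest rows is closer.
    return d2[:, a] <= d2[:, b]


def rsp3_sketch(X, y):
    """Split until every subset is homogeneous, then emit one prototype
    per subset (assumption: the prototype is the subset mean)."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    prototypes, labels = [], []
    stack = [np.arange(len(X))]  # subsets are stored as index arrays
    while stack:
        idx = stack.pop()
        classes = np.unique(y[idx])
        if len(classes) == 1:
            # Homogeneous subset: summarize it with a single prototype.
            prototypes.append(X[idx].mean(axis=0))
            labels.append(classes[0])
            continue
        mask = split_by_farthest_pair(X[idx])
        if mask.all():
            # Safeguard (not from the abstract): all points coincide but
            # carry different labels, so emit one prototype per class.
            for c in classes:
                sel = idx[y[idx] == c]
                prototypes.append(X[sel].mean(axis=0))
                labels.append(c)
            continue
        stack.append(idx[mask])
        stack.append(idx[~mask])
    return np.array(prototypes), np.array(labels)
```

Note that a single mislabeled instance inside a dense region ends up isolated in its own one-instance subset and still receives a prototype, which is the noise behavior the abstract criticizes.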

Similar articles

       
 
Xinzhi Liu, Jun Yu, Toru Kurihara, Congzhong Wu, Zhao Niu and Shu Zhan    
It seems difficult to recognize an object from its background with similar color using conventional segmentation methods. An efficient way is to utilize hyperspectral images that contain more wave bands and richer information than only RGB components. Pa...
Journal: Applied Sciences

 
Nikolaos Makrakis, Prodromos N. Psarropoulos and Yiannis Tsompanakis    
Large-scale lifelines in seismic-prone regions very frequently cross areas that are characterized by active tectonic faulting, as complete avoidance might be techno-economically unfeasible. The resulting Permanent Ground Displacements (PGDs) constitute a...
Journal: Infrastructures

 
Guilherme Perin, Lichao Wu and Stjepan Picek    
The adoption of deep neural networks for profiling side-channel attacks opened new perspectives for leakage detection. Recent publications showed that cryptographic implementations featuring different countermeasures could be broken without feature selec...
Journal: Algorithms

 
Youngki Park and Youhyun Shin    
In this paper, we introduce an efficient approach to multi-label image classification that is particularly suited for scenarios requiring rapid adaptation to new classes with minimal training data. Unlike conventional methods that rely solely on neural n...
Journal: Applied Sciences

 
Guangchao Yang, Jigang Zhang, Zhehao Ma and Weixiao Xu    
The steel tube-reinforced concrete (STRC) shear wall plays an important role in the seismic design of high-rise building structures. Due to the synergistic collaboration between steel tubes and concrete, they effectively enhance the ductility and energy ...
Journal: Applied Sciences