ARTÍCULO
TITULO

ONLINE FUZZY CLUSTERING OF HIGH DIMENSION DATA STREAMS BASED ON NEURAL NETWORK ENSEMBLES

Yevgeniy Bodyanskiy    
Iryna Perova    
Polina Zhernova    

Resumen

The subject matter of the article is fuzzy clustering of high-dimensional data based on the ensemble approach, provided that a number and shape of clusters are not known. The goal of the work is to create the neuro-fuzzy approach for clustering data when the data stream is fed for online processing and a number and shape of clusters are unknown. The following tasks are solved in the article - the input feature space is compressed in the online mode; the model of neural network ensembles for data clustering is built; the ensemble of neuro-fuzzy networks for clustering high-dimensional data is developed; the approach for clustering data in the online mode is worked out. The following results are obtained - the main idea of the proposed approach is based on a modification of the fuzzy C-means algorithm. To reduce the dimension of the input space, the modified Hebb-Sanger network is suggested to be used; this net is characterized by the increased speed and is built on the basis of the modified Oja neurons. A speed-optimized learning algorithm for the Oja neuron is proposed. Such a network implements the method of principal components in the online mode with high speed. Conclusions. In the event the reduction-compression procedure cannot be used due to the probability of losing the physical meaning of the original space, a new clustering criterion was introduced; this criterion contains both a well-known polynomial fuzzifier and the weighment of individual components of the deviations of presented images from cluster centroids. The recurrent modification based on the algorithms proposed in this article is introduced. A mathematical model is developed to determine the quality of clustering with the use of the Xi-Beni index, which was modified for the online mode. The experimental results confirm the fact that the proposed system enables solving a wide range of Data Mining tasks when data sets are processed online, provided that a number and shape of clusters are not known and there is a large number of observations as well.

 Artículos similares

       
 
Diju Gao, Weixi Xie, Chunteng Bao, Bin Liu and Jiaxing Zhuang    
In order to realize accurate and fast firefighting at sea, a control method for water cannons of unmanned fireboats considering wind and ship motion disturbances is presented. This method combines information fusion, computer vision, and prediction techn... ver más

 
Shuxin Jin, Mai Hao and Ming Cai    
Driven by economic development and environmental protection, vehicles are gradually renovating their power to renewable energy. As an essential part of renewable energy, photovoltaic (PV) energy is highly valued and studied worldwide. Future social devel... ver más
Revista: Applied Sciences

 
Chidentree Treestayapun and Aldo Jonathan Muñoz-Vázquez    
Memory properties of fractional-order operators are considered for an input-output data model for highly uncertain nonlinear systems. The model arises by relating the fractional-order variation of the output to the fractional-order variation of the input... ver más
Revista: Applied Sciences

 
Julian Estevez, Jose Manuel Lopez-Guede, Gorka Garate and Manuel Graña    
This paper deals with the control of a team of unmanned air vehicles (UAVs), specifically quadrotors, for which their mission is the transportation of a deformable linear object (DLO), i.e., a cable, hose or similar object in quasi-stationary state, whil... ver más
Revista: Applied Sciences

 
Hongjun Fan, Hossein Enshaei and Shantha Gamini Jayasinghe    
Liquified natural gas (LNG) as a marine fuel has gained momentum as the maritime industry moves towards a sustainable future. Since unwanted LNG release may lead to severe consequences, performing quantitative risk assessment (QRA) for LNG bunkering oper... ver más