ARTÍCULO
TITULO

Combination of Reduction Detection Using TOPSIS for Gene Expression Data Analysis

Jogeswar Tripathy    
Rasmita Dash    
Binod Kumar Pattanayak    
Sambit Kumar Mishra    
Tapas Kumar Mishra and Deepak Puthal    

Resumen

In high-dimensional data analysis, Feature Selection (FS) is one of the most fundamental issues in machine learning and requires the attention of researchers. These datasets are characterized by huge space due to a high number of features, out of which only a few are significant for analysis. Thus, significant feature extraction is crucial. There are various techniques available for feature selection; among them, the filter techniques are significant in this community, as they can be used with any type of learning algorithm and drastically lower the running time of optimization algorithms and improve the performance of the model. Furthermore, the application of a filter approach depends on the characteristics of the dataset as well as on the machine learning model. Thus, to avoid these issues in this research, a combination of feature reduction (CFR) is considered designing a pipeline of filter approaches for high-dimensional microarray data classification. Considering four filter approaches, sixteen combinations of pipelines are generated. The feature subset is reduced in different levels, and ultimately, the significant feature set is evaluated. The pipelined filter techniques are Correlation-Based Feature Selection (CBFS), Chi-Square Test (CST), Information Gain (InG), and Relief Feature Selection (RFS), and the classification techniques are Decision Tree (DT), Logistic Regression (LR), Random Forest (RF), and k-Nearest Neighbor (k-NN). The performance of CFR depends highly on the datasets as well as on the classifiers. Thereafter, the Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS) method is used for ranking all reduction combinations and evaluating the superior filter combination among all.

 Artículos similares

       
 
Gary Reyes, Vivian Estrada, Roberto Tolozano-Benites and Victor Maquilón    
The steady increase in data generation by GPS systems poses storage challenges. Previous studies show the need to address trajectory compression. The demand for accuracy and the magnitude of data require effective compression strategies to reduce storage... ver más

 
Vincent Oriez, Nga Thi-Thanh Pham, Jérôme Peydecastaing, Philippe Behra and Pierre-Yves Pontalier    
Sugarcane bagasse (SCB), a by-product of the sugar industry, is composed mainly of cellulose, hemicelluloses, and lignin, and can be used to replace petrochemical polymers in various applications. In this work, SCB was treated under mild alkaline conditi... ver más

 
Xavier Flete, Nicolas Binder, Yannick Bousquet and Sandrine Cros    
In the current study, full-stage unsteady simulations were performed to investigate rotating instability inception mechanisms in a particularly large tip clearance centrifugal compressor with a vaneless diffuser and a volute. Four operating points along ... ver más

 
Rahimeh Maghsoudi, Saman Javadi, Mojtaba Shourian and Golmar Golmohammadi    
Determining optimal exploitation from aquifers is always a major challenge, especially for aquifers facing a drop in their groundwater level. In aquifers with artificial recharge, more complex algorithms are required to determine the optimal exploitation... ver más
Revista: Hydrology

 
Vivek Venishetty, Prem B. Parajuli and Dipesh Nepal    
Best management practices (BMPs) are management operations that reduce pollution and improve water quality. This study assessed the spatial variability of BMPs effectiveness within the Yazoo River Watershed (YRW) using Soil and Water Assessment Tool (SWA... ver más
Revista: Hydrology