Inicio  /  Future Internet  /  Vol: 12 Par: 3 (2020)  /  Artículo
ARTÍCULO
TITULO

Feature Selection Algorithms as One of the Python Data Analytical Tools

Nikita Pilnenskiy and Ivan Smetannikov    

Resumen

With the current trend of rapidly growing popularity of the Python programming language for machine learning applications, the gap between machine learning engineer needs and existing Python tools increases. Especially, it is noticeable for more classical machine learning fields, namely, feature selection, as the community attention in the last decade has mainly shifted to neural networks. This paper has two main purposes. First, we perform an overview of existing open-source Python and Python-compatible feature selection libraries, show their problems, if any, and demonstrate the gap between these libraries and the modern state of feature selection field. Then, we present new open-source scikit-learn compatible ITMO FS (Information Technologies, Mechanics and Optics University feature selection) library that is currently under development, explain how its architecture covers modern views on feature selection, and provide some code examples on how to use it with Python and its performance compared with other Python feature selection libraries.

 Artículos similares

       
 
Lianlian He, Hao Li and Rui Zhang    
Recent advances in knowledge graphs show great promise to link various data together to provide a semantic network. Place is an important part in the big picture of the knowledge graph since it serves as a powerful glue to link any data to its georeferen... ver más

 
Ancilon Leuch Alencar, Marcelo Dornbusch Lopes, Anita Maria da Rocha Fernandes, Julio Cesar Santos dos Anjos, Juan Francisco De Paz Santana and Valderi Reis Quietinho Leithardt    
In the current era of social media, the proliferation of images sourced from unreliable origins underscores the pressing need for robust methods to detect forged content, particularly amidst the rapid evolution of image manipulation technologies. Existin... ver más
Revista: Future Internet

 
Yan Chen and Chunchun Hu    
Accurate prediction of fine particulate matter (PM2.5) concentration is crucial for improving environmental conditions and effectively controlling air pollution. However, some existing studies could ignore the nonlinearity and spatial correlation of time... ver más

 
Georg Gartner, Olesia Ignateva, Bibigul Zhunis and Johanna Pühringer    
Maps are the culmination of numerous choices, with many offering multiple alternatives. Not all of these choices are inherently guided by default, clarity, or universally accepted best practices, guidelines, or recommendations. In the realm of cartograph... ver más

 
Sepideh Molaei, Stefano Cirillo and Giandomenico Solimando    
MicroRNAs (miRNAs) play a crucial role in cancer development, but not all miRNAs are equally significant in cancer detection. Traditional methods face challenges in effectively identifying cancer-associated miRNAs due to data complexity and volume. This ... ver más