Redirigiendo al acceso original de articulo en 15 segundos...
Inicio  /  Applied Sciences  /  Vol: 12 Par: 13 (2022)  /  Artículo
ARTÍCULO
TITULO

Getting over High-Dimensionality: How Multidimensional Projection Methods Can Assist Data Science

Evandro S. Ortigossa    
Fábio Felix Dias and Diego Carvalho do Nascimento    

Resumen

The exploration and analysis of multidimensional data can be pretty complex tasks, requiring sophisticated tools able to transform large amounts of data bearing multiple parameters into helpful information. Multidimensional projection techniques figure as powerful tools for transforming multidimensional data into visual information according to similarity features. Integrating this class of methods into a framework devoted to data sciences can contribute to generating more expressive means of visual analytics. Although the Principal Component Analysis (PCA) is a well-known method in this context, it is not the only one, and, sometimes, its abilities and limitations are not adequately discussed or taken into consideration by users. Therefore, knowing in-depth multidimensional projection techniques, their strengths, and the possible distortions they can create is of significant importance for researchers developing knowledge-discovery systems. This research presents a comprehensive overview of current state-of-the-art multidimensional projection techniques and shows example codes in Python and R languages, all available on the internet. The survey segment discusses the different types of techniques applied to multidimensional projection tasks from their background, application processes, capabilities, and limitations, opening the internal processes of the methods and demystifying their concepts. We also illustrate two problems, from a genetic experiment (supervised) and text mining (non-supervised), presenting solutions through multidimensional projection application. Finally, we brought elements that reverberate the competitiveness of multidimensional projection techniques towards high-dimension data visualization, commonly needed in data sciences solutions.

 Artículos similares

       
 
Pedro Madureira, Nuno Cardoso, Filipe Sousa, Waldir Moreira, Antonio Oliveira-Jr, Marco Bazzani and Philip Gouverneur    
The population is getting old, and the use of technology has improved the quality of life of the senior population. This is confirmed by the increasing number of solutions targeting healthy and active ageing. Such solutions keep track of the daily routin... ver más
Revista: Information

 
Yong Jun Cho    
The theoretical treatment of statistical properties relevant to nonlinear random waves of finite bandwidth, such as the joint distribution of wave crest and its associated wave period, is an overdue task hampered by the complicated form of the analytical... ver más

 
Tobias Hecht, Stefan Kratzert and Klaus Bengler    
Automated driving research as a key topic in the automotive industry is currently undergoing change. Research is shifting from unexpected and time-critical take-over situations to human machine interface (HMI) design for predictable transitions. Furtherm... ver más
Revista: Information

 
Rana Saeed Al-Maroof,Said Abdelrahim Salloum,Ahmad Qasim AlHamadand,Khaled Shaalan     Pág. pp. 157 - 178
The importance of using Google Translate (GT) has become dominantly more effective. Most researchers, professors, and students rely on its translation as an immediate source of getting the information in different countries all over the world. However, t... ver más

 
Sridhara Nayak,Tetsuya Takemi     Pág. 159 - 165
Recent IPCC reports suggest that the world is getting warmer. Consequently, the concentration of atmospheric water vapor, which determines the water for precipitation, is substantially increasing in accordance with the Clausius-Clapeyron (CC) relationshi... ver más
Revista: Atmósfera