Algorithms, Vol. 14, Issue 12 (2021)

A Visual Mining Approach to Improved Multiple-Instance Learning

Sonia Castelo, Moacir Ponti and Rosane Minghim

Abstract

Multiple-instance learning (MIL) is a machine learning paradigm that aims to classify sets (bags) of objects (instances), with labels assigned only to the bags. The problem is often addressed by selecting one instance to represent each bag, which transforms the MIL problem into a standard supervised learning problem. Visualization can be a useful tool for assessing learning scenarios by incorporating the users' knowledge into the classification process. Because current visualization techniques cannot handle multiple-instance learning, we propose a multiscale tree-based visualization called MILTree to support MIL problems. The first level of the tree represents the bags, and the second level represents the instances belonging to each bag, allowing users to understand MIL datasets intuitively. In addition, we propose two new instance selection methods for MIL, which help users improve the model even further. Our methods handle both binary and multiclass scenarios. In our experiments, SVM was used to build the classifiers. With the support of the MILTree layout, the initial classification model was updated by changing the training set, which is composed of the prototype instances. Experimental results validate the effectiveness of our approach, showing that visual mining with MILTree can support exploring and improving models in MIL scenarios and that our instance selection methods outperform the currently available alternatives in most cases.
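The general idea of representing each bag by one prototype instance, and thereby reducing MIL to ordinary supervised learning, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the selection heuristic (distance to the centre of all negative-bag instances) is not one of the two selection methods proposed in the paper, the toy data is invented, and a nearest-centroid classifier stands in for the SVM so that the example needs only the Python standard library.

```python
from math import dist
from statistics import fmean


def centroid(points):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    return [fmean(axis) for axis in zip(*points)]


def select_prototype(bag, negative_center, positive):
    """Pick one instance to stand in for the whole bag.

    Assumed heuristic (NOT the paper's method): a positive bag is represented
    by its instance farthest from the centre of all negative-bag instances,
    a negative bag by its nearest one.
    """
    pick = max if positive else min
    return pick(bag, key=lambda x: dist(x, negative_center))


def bags_to_training_set(bags, labels):
    """Turn labelled bags into one labelled prototype instance per bag."""
    negatives = [x for bag, y in zip(bags, labels) if y == 0 for x in bag]
    neg_center = centroid(negatives)
    prototypes = [
        select_prototype(bag, neg_center, y == 1) for bag, y in zip(bags, labels)
    ]
    return prototypes, list(labels)


def fit_nearest_centroid(X, y):
    """Stand-in for the SVM: store one centroid per class label."""
    return {label: centroid([x for x, t in zip(X, y) if t == label]) for label in set(y)}


def predict_instance(model, x):
    """Return the label whose centroid is closest to x."""
    return min(model, key=lambda label: dist(x, model[label]))


def predict_bag(model, bag):
    """Standard MIL assumption: a bag is positive if any instance is positive."""
    return int(any(predict_instance(model, x) == 1 for x in bag))


# Hypothetical toy data: each bag is a list of 2-D instances; labels are per bag.
bags = [
    [[0.0, 0.0], [0.2, 0.1], [3.0, 3.0]],  # positive bag
    [[0.1, 0.3], [2.8, 3.2]],              # positive bag
    [[0.0, 0.2], [0.4, 0.0]],              # negative bag
    [[0.2, 0.2], [0.1, 0.0]],              # negative bag
]
labels = [1, 1, 0, 0]

prototypes, y = bags_to_training_set(bags, labels)
model = fit_nearest_centroid(prototypes, y)

print(prototypes)
print(predict_bag(model, [[0.1, 0.1], [3.1, 2.9]]))
print(predict_bag(model, [[0.1, 0.1], [0.3, 0.2]]))
```

Updating the model, as the abstract describes, would amount to changing which instance serves as a bag's prototype and refitting on the new training set. In this sketch that means editing `prototypes` and calling `fit_nearest_centroid` again.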
