ARTÍCULO
TITULO

Selection of Informative Operations in the Construction of Linear Non-elementary Regression Models

Mikhail Bazilevskiy    

Resumen

This article is devoted to one of the main problems of regression analysis ? the choice of regression model structural specification. The work is based on the linear non-elementary regressions proposed earlier by the author, which, in addition to explanatory variables, include binary operations of all their possible pairs. In such models, with an increase in the number of explanatory variables, the number of binary operations increases significantly. The aim of this work is to develop selection algorithms in linear non-elementary regressions of the most informative variables and operations. An algorithm for approximate estimation of linear non-elementary regressions using the ordinary least squares is considered. The problem of selection of informative operations is formulated. Two strategies for constructing linear non-elementary regressions are proposed. In the first of them there are no restrictions on the number of occurrences of explanatory variables in the model and on the number of binary operations. In the second, the model contains the largest number of binary operations, and each explanatory variable is included in it only once. Using combinatorics, the computational complexity of each of these strategies was determined. It turned out that the problem of constructing a linear non-elementary model based on the second strategy is solved in practice much faster than a similar problem based on the first strategy. The proposed algorithms were implemented using the Gretl package as a special program. With the help of it, high-quality linear non-elementary regression models of freight rail transportation in the Irkutsk region were built

 Artículos similares

       
 
Sergii Babichev, Lyudmyla Yasinska-Damri, Igor Liakh and Jirí ?kvor    
The development of hybrid models focused on gene expression data processing for the allocation of differentially expressed and mutually correlated genes is one of the current directions in modern bioinformatics. The solution to this problem can allow us ... ver más
Revista: Applied Sciences

 
Giulia Boccacci, Francesca Frasca, Chiara Bertolin and Anna Maria Siani    
Among non-destructive testing (NDT) techniques applied to structural health monitoring in existing timber structures, ranging from visual inspection to more sophisticated analysis, acoustic emission (AE) is currently seldomly used to detect mechanical st... ver más
Revista: Applied Sciences

 
Consolata Gakii, Paul O. Mireji and Richard Rimiru    
Analysis of high-dimensional data, with more features (p" role="presentation">??p p ) than observations (N" role="presentation">??N N ) (p>N" role="presentation">??>??p>N p > N ), places significant demand in cost and memory computational usage at... ver más
Revista: Algorithms

 
Edina Chandiwana, Caston Sigauke and Alphonce Bere    
Probabilistic solar power forecasting has been critical in Southern Africa because of major shortages of power due to climatic changes and other factors over the past decade. This paper discusses Gaussian process regression (GPR) coupled with core vector... ver más
Revista: Algorithms

 
Jose Francisco Saenz-Cogollo and Maurizio Agelli    
Finding an optimal combination of features and classifier is still an open problem in the development of automatic heartbeat classification systems, especially when applications that involve resource-constrained devices are considered. In this paper, a n... ver más
Revista: Algorithms