Abstract
The limited availability of data is a factor that hinders the application of high-performance machine learning algorithms. Methods that improve model accuracy while shortening the required observation period can therefore serve as an effective prediction tool in understudied areas. This paper examines the relationship between the size of a data set and the predictive capability of machine learning models, and determines how the number of observations affects the accuracy and robustness of models built with ensemble algorithms and regularized regression algorithms. The experiments track the change in the weighted mean absolute error as the data set is reduced and identify the algorithms most resistant to this reduction. A lower limit is established on the data set size at which ensemble algorithms can still detect regularities and build a stable model in regression tasks, provided the target variable depends non-linearly on the predictors and the data are not strongly affected by anomalies and noise. The effect of automated Bayesian hyperparameter optimization on model accuracy under data reduction is also examined, and the models for which prior hyperparameter optimization with the tree-structured Parzen estimator is most beneficial are identified.
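The sketch below illustrates, in a minimal and hypothetical form, the kind of experiment the abstract describes: an ensemble regressor is tuned with a tree-structured Parzen estimator (via Optuna's TPESampler) on progressively smaller training samples, and its held-out error is tracked as the number of observations shrinks. The synthetic non-linear data set (`make_friedman1`), the gradient-boosting model, the parameter ranges, the sample sizes, and the use of plain (unweighted) MAE are all illustrative assumptions, not the paper's actual setup.

```python
import optuna
from sklearn.datasets import make_friedman1
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.metrics import mean_absolute_error
from sklearn.model_selection import cross_val_score, train_test_split

# Synthetic regression task with a non-linear target and moderate noise.
X, y = make_friedman1(n_samples=2000, noise=0.5, random_state=0)
X_train_full, X_test, y_train_full, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0)

for n_obs in (1400, 700, 350, 175):            # progressively smaller training sets
    X_train, y_train = X_train_full[:n_obs], y_train_full[:n_obs]

    def objective(trial):
        # Hyperparameter search space for the ensemble model (illustrative ranges).
        params = {
            "n_estimators": trial.suggest_int("n_estimators", 50, 400),
            "max_depth": trial.suggest_int("max_depth", 2, 6),
            "learning_rate": trial.suggest_float("learning_rate", 1e-3, 0.3, log=True),
        }
        model = GradientBoostingRegressor(random_state=0, **params)
        # 3-fold cross-validated MAE computed on the reduced training sample only.
        scores = cross_val_score(model, X_train, y_train, cv=3,
                                 scoring="neg_mean_absolute_error")
        return -scores.mean()

    # Bayesian optimization with a tree-structured Parzen estimator.
    study = optuna.create_study(direction="minimize",
                                sampler=optuna.samplers.TPESampler(seed=0))
    study.optimize(objective, n_trials=30)

    # Refit with the best hyperparameters and report the held-out error.
    best = GradientBoostingRegressor(random_state=0, **study.best_params)
    best.fit(X_train, y_train)
    test_mae = mean_absolute_error(y_test, best.predict(X_test))
    print(f"n_obs={n_obs:4d}  held-out MAE={test_mae:.3f}")
```

Comparing the printed MAE values across the decreasing values of `n_obs` gives a simple picture of how quickly a tuned ensemble model degrades as observations are removed, which is the relationship the study quantifies.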