Algorithms, Vol. 13, No. 5 (2020)
ARTICLE
TITLE

The Effect of Different Deep Network Architectures upon CNN-Based Gaze Tracking

Hui-Hui Chen    
Bor-Jiunn Hwang    
Jung-Shyr Wu and Po-Ting Liu    

Abstract

In this paper, we explore the effect of using different convolutional layers, batch normalization (BN) and the global average pooling (GAP) layer on a convolutional neural network (CNN) based gaze tracking system. A novel method is proposed for building a training dataset that is closer to human visual behavior: the participant's face images, captured while the participant watches videos, are labeled with the gaze points retrieved from an eye tracker. The participants can move their heads freely; therefore, realistic and natural images can be obtained without too many restrictions. The labeled data are classified according to the gaze coordinates and the area of interest on the screen. Varied network architectures are then applied to estimate and compare the effects of the number of convolutional layers, BN, and the use of a GAP layer instead of the fully connected layer. Three schemes (the single-eye image, the double-eye image and the facial image), with data augmentation, are fed into the neural network to train it and evaluate its efficiency. The input image of the eye or face for an eye tracking system is mostly a small-sized image with relatively few features. The results show that BN and GAP help overcome the difficulty of training models and reduce the number of network parameters. The accuracy is significantly improved when GAP and BN are used at the same time. Overall, the face scheme has the highest accuracy, 0.883, when BN and GAP are used together. Additionally, compared with the case in which the fully connected layer is set to 512, the number of parameters is reduced by less than 50% and the accuracy is improved by about 2%. A detection accuracy comparison of our model with the existing George and Routray methods shows that our proposed method improves prediction accuracy by more than 6%.
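To make the parameter argument concrete, the following minimal sketch counts the learnable parameters of a classifier head built with a fully connected layer of 512 units and of a head that uses GAP instead. It is not the authors' architecture: the feature map size (4 x 4 x 64), the number of gaze classes (9) and all function names are assumptions chosen only for illustration, so the resulting counts do not correspond to the figures reported in the abstract.

```python
# Illustrative parameter counting for two classifier heads.
# All sizes below are assumptions, not values taken from the paper.

def dense_params(in_features, out_features):
    """Weights plus biases of one fully connected layer."""
    return in_features * out_features + out_features


def bn_params(channels):
    """Learnable scale and shift per channel (running statistics not counted)."""
    return 2 * channels


def fc_head_params(height, width, channels, units, n_classes):
    """Flatten -> dense(units) -> dense(n_classes)."""
    flat = height * width * channels
    return dense_params(flat, units) + dense_params(units, n_classes)


def gap_head_params(channels, n_classes):
    """GAP -> dense(n_classes). GAP itself has no learnable parameters:
    it only averages each channel of the feature map to one number."""
    return dense_params(channels, n_classes)


def global_average_pool(feature_map):
    """Average each channel of a [channel][row][col] nested list."""
    return [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in feature_map
    ]


if __name__ == "__main__":
    H, W, C = 4, 4, 64   # assumed size of the last convolutional feature map
    FC_UNITS = 512       # fully connected width mentioned in the abstract
    N_CLASSES = 9        # assumed number of screen regions

    print("FC head :", fc_head_params(H, W, C, FC_UNITS, N_CLASSES))
    print("GAP head:", gap_head_params(C, N_CLASSES))
    print("BN layer:", bn_params(C))
```

Under these assumed sizes, nearly all parameters of the fully connected head sit in the first dense layer, whose size grows with the spatial extent of the feature map, while the GAP head depends only on the channel count. How much this saves in a full network depends on how many parameters the convolutional layers themselves hold.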
