Resumen
The video camera is essential for reliable activity monitoring, and a robust analysis helps in efficient interpretation. The systematic assessment of classroom activity through videos can help understand engagement levels from the perspective of both students and teachers. This practice can also help in robot-assistive classroom monitoring in the context of human?robot interaction. Therefore, we propose a novel algorithm for student?teacher activity recognition using 3D CNN (STAR-3D). The experiment is carried out using India?s indigenously developed supercomputer PARAM Shivay by the Centre for Development of Advanced Computing (C-DAC), Pune, India, under the National Supercomputing Mission (NSM), with a peak performance of 837 TeraFlops. The EduNet dataset (registered under the trademark of the DRSTATM dataset), a self-developed video dataset for classroom activities with 20 action classes, is used to train the model. Due to the unavailability of similar datasets containing both students? and teachers? actions, training, testing, and validation are only carried out on the EduNet dataset with 83.5% accuracy. To the best of our knowledge, this is the first attempt to develop an end-to-end algorithm that recognises both the students? and teachers? activities in the classroom environment, and it mainly focuses on school levels (K-12). In addition, a comparison with other approaches in the same domain shows our work?s novelty. This novel algorithm will also influence the researcher in exploring research on the ?Convergence of High-Performance Computing and Artificial Intelligence?. We also present future research directions to integrate the STAR-3D algorithm with robots for classroom monitoring.