JISE


  [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16]


Journal of Information Science and Engineering, Vol. 39 No. 1, pp. 19-37


Adoption of Gesture Interactive Robot in Music Perception Education with Deep Learning Approach


JIA-XIN HU1, YU SONG2 AND YI-YAO ZHANG3,+
1School of Music and Dance
Qiqihar University
Qiqihar, 161000 P.R. China
E-mail: teacherhu0727@163.com

2Development Research Centre of Music Industry
Communication University of China
Beijing, 100024 P.R. China
E-mail: rain07122003@163.com

3School of Art and Communication
Beijing Normal University
Beijing, 100875 P.R. China

+E-mail: 11112018044@bnu.edu.cn


This work intends to help students perceive music, study music, create music, and realize the “human-computer interaction” music teaching mode. A distributed design pattern is adopted to design a gesture interactive robot suitable for music education. First, the client is designed. The client gesture acquisition module employs a dual-channel convolutional neural network (DCCNN) for gesture recognition. The convolutional layer of the constructed DCCNN contains convolution kernels with two sizes, which operate on the image. Second, the server is designed, which recognizes the collected gesture instruction data through two-stream convolutional neural network (CNN). This network cuts the gesture instruction data into K segments, and sparsely samples each segment into a short sequence. The optical flow algorithm is employed to extract the optical flow features of each short sequence. Finally, the performance of the robot is tested. The results show that the combination of convolution kernels with sizes of 5×5 and 7×7 has a recognition accuracy of 98%, suggesting that DCCNN can effectively collect gesture command data. After training, DCCNN’s gesture recognition accuracy rate reaches 90%, which is higher than main-stream dynamic gesture recognition algorithms under the same conditions. In addition, the recognition accuracy of the gesture interactive robot is above 90%, suggesting that this robot can meet normal requirements and has good reliability and stability. It is also recommended to be utilized in music perception teaching to provide a basis for establishing a multi-sensory music teaching model.


Keywords: robot, gesture recognition, DCCNN, two-stream convolutional neural networks, deep learning

  Retrieve PDF document (JISE_202301_05.pdf)