Deep learning speech recognition for residential assistant robot
Description
This work presents the design and validation of a voice assistant to command robotic tasks in a residential environment, as a support for people who require isolation or support due to body motor problems. The preprocessing of a database of 3600 audios of 8 different categories of words like “paper”, “glass” or “robot”, that allow to conform commands such as "carry paper" or "bring medicine", obtaining a matrix array of Mel frequencies and its derivatives, as inputs to a convolutional neural network that presents an accuracy of 96.9% in the discrimination of the categories. The command recognition tests involve recognizing groups of three words starting with "robot", for example, "robot bring glass", and allow identifying 8 different actions per voice command, with an accuracy of 88.75%.
Files
10 21832.pdf
Files
(958.7 kB)
Name | Size | Download all |
---|---|---|
md5:0e6c9f49e506073edfcd13e84b6a3c95
|
958.7 kB | Preview Download |