  • A Study on Speech Recogniti...
    Zinchenko, Kateryna; Chien-Yu Wu; Kai-Tai Song

    IEEE Transactions on Industrial Informatics, 04/2017, Volume: 13, Issue: 2
    Journal Article

    Speech recognition is common in electronic appliances and personal services, but its use for industrial and medical purposes is rare because of motion ambiguity. For minimally invasive surgical robotic assistants, this ambiguity arises because the robotic motion is not calibrated to the camera images. This paper presents a design for a speech recognition interface for a HIWIN robotic endoscope holder. A new intentional speech control is proposed to control movement over long distances. To decrease ambiguity, a voice-to-motion calibration method is proposed that compares the degree of change in the endoscope image for a voice command. The speech recognition algorithm is implemented on Ubuntu OS using CMU Sphinx. The control signal is sent to the robot controller by serial-port communication through an RS232 cable. The experimental results show that the proposed intentional speech control strategy has a navigation precision of up to 3.1° of angular displacement for the endoscope. The overall system processing time, including robotic motion, is 3.22 s for a speech duration of ~1.8 s. The reference image navigation range is from 2.5 mm for a speech duration of ~0.5 s up to 6 mm for a speech duration of ~1.8 s, in a setup with the camera tip located 5 cm from the remote center of motion point.
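
The figures quoted in the abstract can be related with a little arithmetic. The sketch below is only an illustration and is not taken from the paper. It assumes the camera tip moves on a circle of radius 5 cm about the remote center of motion, and it reads the millimetre ranges as displacement at the tip, which the abstract does not state. The function names are hypothetical.

```python
import math

# Camera tip is 5 cm from the remote center of motion (figure from the abstract).
TIP_RADIUS_MM = 50.0


def arc_length_mm(angle_deg, radius_mm=TIP_RADIUS_MM):
    """Arc travelled by the tip for a given rotation about the pivot.

    Assumption: the tip moves on a circle centred on the pivot point.
    """
    return radius_mm * math.radians(angle_deg)


def angle_deg_for_arc(arc_mm, radius_mm=TIP_RADIUS_MM):
    """Rotation about the pivot needed for a given arc at the tip."""
    return math.degrees(arc_mm / radius_mm)


# Time not spent speaking: total processing time minus speech duration.
# Both values are reported in the abstract; 1.8 s is approximate there.
overhead_s = 3.22 - 1.8

if __name__ == "__main__":
    print(f"3.1 deg at 50 mm: {arc_length_mm(3.1):.2f} mm of arc")
    print(f"2.5 mm of arc: {angle_deg_for_arc(2.5):.2f} deg")
    print(f"6.0 mm of arc: {angle_deg_for_arc(6.0):.2f} deg")
    print(f"Non-speech overhead: {overhead_s:.2f} s")
```

Under these assumptions, roughly 1.4 s of the reported 3.22 s is spent on recognition, communication, and robot motion rather than on the utterance itself.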