Akademska digitalna zbirka SLovenije - logo
E-resources
Full text
Peer reviewed
  • Deep learning-based human a...
    Moutinho, Duarte; F. Rocha, Luís; Costa, Carlos M.; Teixeira, Luís F.; Veiga, Germano

    Robotics and computer-integrated manufacturing, April 2023, 2023-04-00, Volume: 80
    Journal Article

    Human–Robot Collaboration is a critical component of Industry 4.0, contributing to a transition towards more flexible production systems that are quickly adjustable to changing production requirements. This paper aims to increase the natural collaboration level of a robotic engine assembly station by proposing a cognitive system powered by computer vision and deep learning to interpret implicit communication cues of the operator. The proposed system, which is based on a residual convolutional neural network with 34 layers and a long-short term memory recurrent neural network (ResNet-34 + LSTM), obtains assembly context through action recognition of the tasks performed by the operator. The assembly context was then integrated in a collaborative assembly plan capable of autonomously commanding the robot tasks. The proposed model showed a great performance, achieving an accuracy of 96.65% and a temporal mean intersection over union (mIoU) of 94.11% for the action recognition of the considered assembly. Moreover, a task-oriented evaluation showed that the proposed cognitive system was able to leverage the performed human action recognition to command the adequate robot actions with near-perfect accuracy. As such, the proposed system was considered as successful at increasing the natural collaboration level of the considered assembly station. •Implicit communication cues are crucial for effortless Human–Robot Collaboration.•Human actions contain task-focused information that provide operation context.•A Deep Learning cognitive system was proposed to interpret task-focused human actions.•The proposed cognitive system was applied in a real collaborative assembly scenario.•Great accuracy was achieved at recognizing human actions and commanding robot tasks.