Controlling Industrial Robots with High-Level Verbal Commands

Choi, Dongkyu; Shi, Wei; Liang, Ying Siu; Yeo, Kheng Hui; Kim, Jung-Jae

doi:10.1007/978-3-030-90525-5_19

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 13086)

Included in the following conference series:

International Conference on Social Robotics

Abstract

Industrial robots today are still mostly pre-programmed to perform a specific task. Despite previous academic research on human-robot interaction, adopting such systems in industrial settings is not trivial and has rarely been done. In this paper, we introduce a robotic system that we control with high-level verbal commands, leveraging some of the latest neural approaches to language understanding and a cognitive architecture for goal-directed but reactive execution. We show that a large-scale pre-trained language model can be effectively fine-tuned to translate verbal instructions into robot tasks, and that it does so better than other semantic parsing methods. We also show that our system can handle, through dialogue, a variety of exceptions that arise during human-robot interaction, including unknown tasks, user interruptions, and changes in the world state.

This research is supported by A*STAR under its Human-Robot Collaborative AI for Advanced Manufacturing and Engineering (Award A18A2b0046).

Notes

1.
https://github.com/donglixp/coarse2fine.

Author information

Authors and Affiliations

Institute of High Performance Computing, Agency for Science, Technology and Research, Singapore, Singapore
Dongkyu Choi & Ying Siu Liang
Institute for Infocomm Research, Agency for Science, Technology and Research, Singapore, Singapore
Wei Shi, Kheng Hui Yeo & Jung-Jae Kim

Corresponding author

Correspondence to Dongkyu Choi.

Editor information

Editors and Affiliations

Department of Electronic and Communication Engineering, National University of Singapore, Faculty of Engineering, Singapore, Singapore
Haizhou Li
The National University of Singapore, Singapore, Singapore
Shuzhi Sam Ge
A*STAR Institute for Infocomm Research, Singapore, Singapore
Yan Wu
Center for Human Technologies, Istituto Italiano Tecnologia, Genoa, Italy
Agnieszka Wykowska
Department of Electrical Engineering and Computer Science, Wichita State University, Wichita, KS, USA
Hongsheng He
Qingdao University, Qingdao, China
Xiaorui Liu
School of Cyber Science and Technology, Beihang University, Beijing, Beijing, China
Dongyu Li
Social Cognition Human-Robot Interaction, Istituto Italiano di Tecnologia, Genoa, Italy
Jairo Perez-Osorio

Cite this paper

Choi, D., Shi, W., Liang, Y.S., Yeo, K.H., Kim, J.-J. (2021). Controlling Industrial Robots with High-Level Verbal Commands. In: Li, H., et al. Social Robotics. ICSR 2021. Lecture Notes in Computer Science, vol 13086. Springer, Cham. https://doi.org/10.1007/978-3-030-90525-5_19

DOI: https://doi.org/10.1007/978-3-030-90525-5_19
Published: 02 November 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-90524-8
Online ISBN: 978-3-030-90525-5
eBook Packages: Computer Science, Computer Science (R0)
