Learning optimal controllers: a dynamical motion primitive approach
Authors: Hugo Tadashi M. Kussaba, Abdalla Swikir, Fan Wu, Anastasija Demerdjieva, Gitta Kutyniok, Sami Haddadin
Published in: IFAC-PapersOnLine, vol. 56, no. 2 (2023)
DOI: 10.1016/j.ifacol.2023.10.1242
No images available.
Abstract
No abstract available.
Tags
No tag(s) or keyword(s) available.
Videos
No video(s) available.
Downloads
No additional files.
BibTeX
If you want to cite this work, you can use the following BibTeX file:
@article{KusabaSWDKH23,
title = {Learning optimal controllers: a dynamical motion primitive approach},
journal = {IFAC-PapersOnLine},
volume = {56},
number = {2},
pages = {4776-4782},
year = {2023},
note = {22nd IFAC World Congress},
issn = {2405-8963},
doi = {https://doi.org/10.1016/j.ifacol.2023.10.1242},
url = {https://www.sciencedirect.com/science/article/pii/S2405896323016464},
author = {Hugo T.M. Kussaba and Abdalla Swikir and Fan Wu and Anastasija Demerdjieva and Gitta Kutyniok and Sami Haddadin},
keywords = {Real-time optimal control, Learning from control, Autonomous robotic systems},
abstract = {Real-time computation of optimal control is a challenging problem and, to solve this difficulty, many frameworks proposed to use learning techniques to learn (possibly sub-optimal) controllers and enable their usage in an online fashion. Among these techniques, the optimal motion framework is a simple, yet powerful technique, that obtained success in many complex real-world applications. The main idea of this approach is to take advantage of dynamic motion primitives, a widely used tool in robotics to learn trajectories from demonstrations. While usually these demonstrations come from humans, the optimal motion framework is based on demonstrations coming from optimal solutions, such as the ones obtained by numeric solvers. As usual in many learning techniques, a drawback of this approach is that it is hard to estimate the suboptimality of learned solutions, since finding easily computable and non-trivial upper bounds to the error between an optimal solution and a learned solution is, in general, unfeasible. However, we show in this paper that it is possible to estimate this error for a broad class of problems. Furthermore, we apply this estimation technique to achieve a novel and more efficient sampling scheme to be used within the optimal motion framework, enabling the usage of this framework in some scenarios where the computational resources are limited.}
}