Boosting Performance of Visual Servoing Using Deep Reinforcement Learning From Multiple Demonstrations

Abstract

In this study, knowledge of multiple controllers was used and combined with deep reinforcement learning (RL) to train a visual servoing (VS) technique. Deep RL algorithms were successful in solving complicated control problems, however they generally require a large amount of data before they achieve an acceptable performance. We developed a method that generates online hyper-volume action bounds from demonstrations of multiple controllers (experts) to address the issue of insufficient data in RL. The agent then continues to explore the created bounds to find more optimized solutions and gain more rewards. By doing this, we cut out pointless agent explorations, which results in a reduction in training time as well as an improvement in performance of the trained policy. During the training process, we used domain randomization and domain adaptation to make the VS approach robust in the real world. As a result, we showed a 51% decrease in training time to achieve the desired level of performance, compared to the case when RL was used solely. The findings showed that the developed method outperformed other baseline VS methods (image-based VS, position-based VS, and hybrid-decoupled VS) in terms of VS error convergence speed and maintained higher manipulability.

Publication DOI: https://doi.org/10.1109/ACCESS.2023.3256724
Divisions: College of Engineering & Physical Sciences > School of Computer Science and Digital Technologies > Applied AI & Robotics
College of Engineering & Physical Sciences > School of Computer Science and Digital Technologies
Aston University (General)
Funding Information: This work was supported in part by the project ‘‘Reuse and Recycling of Lithium-Ion Batteries (RELIB),’’ and in part by The Faraday Institution under Grant FIRG005.
Additional Information: This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
Uncontrolled Keywords: Visual servoing,reinforcement learning,online action bounding,reinforcement learning from demonstrations,manipulability
Publication ISSN: 2169-3536
Last Modified: 01 Sep 2025 07:39
Date Deposited: 29 Aug 2025 14:38
Full Text Link:
Related URLs: https://ieeexpl ... cument/10068197 (Publisher URL)
PURE Output Type: Article
Published Date: 2023-03-13
Accepted Date: 2023-03-08
Authors: Aflakian, Ali
Rastegharpanah, Alireza (ORCID Profile 0000-0003-4264-6857)
Stolkin, Rustam

Download

[img]

Version: Published Version

License: Creative Commons Attribution


Export / Share Citation


Statistics

Additional statistics for this record