Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Real-time implementation of deep reinforcement learning controller for speed tracking of robotic fish through data-assisted modeling
Electronics & Instrumentation Engineering, School of EEE, SASTRA University, Thanjavur, Tamil Nadu (IND).
Electronics & Instrumentation Engineering, School of EEE, SASTRA University, Thanjavur, Tamil Nadu (IND).
Electronics and Communication Engineering, School of EEE, SASTRA University, Thanjavur, Tamil Nadu (IND).
University West, Department of Engineering Science, Division of Production Systems. (KAMPT)ORCID iD: 0000-0002-4091-7732
2023 (English)In: Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering Science, ISSN 0954-4062, p. 1-2Article in journal (Refereed) Epub ahead of print
Abstract [en]

This article proposes real-time speed tracking of two-link surface swimming robotic fish using a deep reinforcement learning (DRL) controller. Hydrodynamic modelling of robotic fish is done by virtue of Newtonian dynamics and Lighthill’s kinematic model. However, this includes external unsteady reactive forces that cannot be modeled accurately due to the distributed nature of hydrodynamic behavior. Therefore, a novel data-assisted dynamic model and control method is proposed for the speed tracking of robotic fish. Initially, the cruise speed motion data are collected through experiments. The water-resistance coefficient is estimated using the least mean square fit, which is then adopted in the model. Subsequently, a closed-loop discrete-time DRL controller trained through a soft actor-critic (SAC) agent is implemented through simulations. SAC overcomes the brittleness problem encountered by other policy gradient approaches by encouraging the policy network for maximum exploration and not assigning a higher probability to any single part of actions. Due to this robustness in the policy learning, the convergence error becomes low in RL-SAC than RL-DDPG controller. The simulation results verify that the DRL-SAC control with data-assisted modelling substantially improves the speed tracking performance. Further, this controller is validated in real-time, and it is observed that the SAC-trained controller tracks the desired speed more accurately than the DDPG controller.

Place, publisher, year, edition, pages
Sage Publications, 2023. p. 1-2
Keywords [en]
speed tracking, robotic fish, data-assisted modeling
National Category
Manufacturing, Surface and Joining Technology
Research subject
Production Technology
Identifiers
URN: urn:nbn:se:hv:diva-20057DOI: 10.1177/09544062231174127Scopus ID: 2-s2.0-85159707159OAI: oai:DiVA.org:hv-20057DiVA, id: diva2:1766468
Available from: 2023-06-13 Created: 2023-06-13 Last updated: 2023-06-13

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Ramasamy, Sudha

Search in DiVA

By author/editor
Ramasamy, Sudha
By organisation
Division of Production Systems
Manufacturing, Surface and Joining Technology

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 12 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf