https://github.com/asad-shahid/intelligent-task-learning

https://github.com/asad-shahid/intelligent-task-learning

Science Score: 23.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in README
  • Academic publication links
    Links to: arxiv.org, springer.com
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.3%) to scientific vocabulary
Last synced: 9 months ago · JSON representation

Repository

Basic Info
  • Host: GitHub
  • Owner: Asad-Shahid
  • Language: Python
  • Default Branch: master
  • Size: 8.47 MB
Statistics
  • Stars: 9
  • Watchers: 1
  • Forks: 2
  • Open Issues: 0
  • Releases: 0
Created almost 6 years ago · Last pushed over 2 years ago

https://github.com/Asad-Shahid/Intelligent-Task-Learning/blob/master/

# Intelligent-Task-Learning
The Repository for the _Autonomous Robots 2022 Journal article:_

"[Continuous control actions learning and adaptation for robotic manipulation through reinforcement learning](https://link.springer.com/article/10.1007/s10514-022-10034-z)" 

This contains a python toolkit for learning a grasping task with Franka Emika Panda Robot. The robot can be trained to grasp the cube, avoid obstacles and learn to manage redundancy using modern Reinforcement Learning algorithms of [Proximal Policy Optimization (PPO)](https://arxiv.org/abs/1707.06347) and [Soft Actor-Critic (SAC)](https://arxiv.org/abs/1812.05905). It is powered by [MuJoCo physics engine](http://www.mujoco.org/) 


# How to cite
```
@article{shahid2022continuous,
  title={Continuous control actions learning and adaptation for robotic manipulation through reinforcement learning},
  author={Shahid, Asad Ali and Piga, Dario and Braghin, Francesco and Roveda, Loris},
  journal={Autonomous Robots},
  pages={1--16},
  year={2022},
  publisher={Springer}
}
```

# New Experiment (Dynamic Enviornment)
The adapted policy can grasp the moving cube in 30 mints of retraining.
![](adapted_policy.gif)

Before, the base grasping policy trained on a static cube is not able to grasp the moving cube.
![](base_policy.gif)

# Simulation Video
[![Video](https://img.youtube.com/vi/aX55Zc2XMTE/maxres3.jpg)](https://www.youtube.com/watch?v=aX55Zc2XMTE)

# Experimental validation
[![Video](https://github.com/Asad-Shahid/Intelligent-Task-Learning/blob/master/exp_image.png)](https://drive.google.com/file/d/1zlS-_HIWMlIAvrxqGNGRyMbuDfQrws8z/view)


## Installation

To use this toolkit, it is required to first install [MuJoCo 200](https://www.roboti.us/index.html) and then [mujoco-py](https://github.com/openai/mujoco-py) from Open AI. mujoco-py allows using MuJoCo from python interface.
The installation requires python 3.6 or higher. It is recommended to install all the required packages under a conda virtual environment


## References
This toolit is mainly developed based on [Surreal Robotics Suite](https://github.com/StanfordVL/robosuite) and the Reinforcement learning part is referenced from
[this repo](https://github.com/clvrai/furniture)

Owner

  • Name: Asad A. Shahid
  • Login: Asad-Shahid
  • Kind: user

GitHub Events

Total
  • Fork event: 1
Last Year
  • Fork event: 1

Dependencies

panda/environment.yml pypi
  • atari-py ==0.2.6
  • benedict ==0.3.2
  • cloudpickle ==1.2.2
  • colorlog ==4.1.0
  • configparser ==4.0.2
  • cython ==0.28.6
  • dill ==0.3.1.1
  • docker-pycreds ==0.4.0
  • fasteners ==0.15
  • future ==0.18.2
  • gitdb2 ==3.0.2
  • gitpython ==3.0.8
  • glfw ==1.10.1
  • gql ==0.2.0
  • graphql-core ==1.1
  • gym ==0.15.6
  • h5py ==2.10.0
  • hjson ==3.0.1
  • idna ==2.9
  • imageio ==2.6.1
  • imageio-ffmpeg ==0.4.0
  • imutils ==0.5.3
  • ipdb ==0.13.2
  • lockfile ==0.12.2
  • monotonic ==1.5
  • moviepy ==1.0.1
  • mpi4py ==3.0.3
  • mujoco-py ==1.50.1.68
  • numpy ==1.18.1
  • nvidia-ml-py3 ==7.352.0
  • opencv-python ==4.2.0.32
  • pathtools ==0.1.2
  • proglog ==0.1.9
  • progressbar2 ==3.47.0
  • promise ==2.3
  • psutil ==5.7.0
  • pybullet ==2.6.5
  • pycparser ==2.19
  • pyglet ==1.4.10
  • pyquaternion ==0.9.5
  • python-utils ==2.3.0
  • pyyaml ==5.3
  • requests ==2.23.0
  • sentry-sdk ==0.14.1
  • shortuuid ==0.5.0
  • smmap2 ==2.0.5
  • stable-baselines ==2.9.0
  • subprocess32 ==3.5.4
  • tensorflow-gpu ==1.14.0
  • tensorplex ==0.9.post1
  • terminaltables ==3.1.0
  • torchx ==0.9
  • tqdm ==4.43.0
  • wandb ==0.8.28
  • watchdog ==0.10.2
  • zmq ==0.0.0