https://github.com/asad-shahid/intelligent-task-learning

Science Score: 23.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
○
codemeta.json file
○
.zenodo.json file
✓
DOI references
Found 1 DOI reference(s) in README
✓
Academic publication links
Links to: arxiv.org, springer.com
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (9.3%) to scientific vocabulary

Last synced: 9 months ago · JSON representation

Repository

Basic Info

Host: GitHub
Owner: Asad-Shahid
Language: Python
Default Branch: master
Size: 8.47 MB

Statistics

Stars: 9
Watchers: 1
Forks: 2
Open Issues: 0
Releases: 0

Created almost 6 years ago · Last pushed over 2 years ago

https://github.com/Asad-Shahid/Intelligent-Task-Learning/blob/master/

# Intelligent-Task-Learning
The Repository for the _Autonomous Robots 2022 Journal article:_

"[Continuous control actions learning and adaptation for robotic manipulation through reinforcement learning](https://link.springer.com/article/10.1007/s10514-022-10034-z)" 

This contains a python toolkit for learning a grasping task with Franka Emika Panda Robot. The robot can be trained to grasp the cube, avoid obstacles and learn to manage redundancy using modern Reinforcement Learning algorithms of [Proximal Policy Optimization (PPO)](https://arxiv.org/abs/1707.06347) and [Soft Actor-Critic (SAC)](https://arxiv.org/abs/1812.05905). It is powered by [MuJoCo physics engine](http://www.mujoco.org/) 


# How to cite
```
@article{shahid2022continuous,
  title={Continuous control actions learning and adaptation for robotic manipulation through reinforcement learning},
  author={Shahid, Asad Ali and Piga, Dario and Braghin, Francesco and Roveda, Loris},
  journal={Autonomous Robots},
  pages={1--16},
  year={2022},
  publisher={Springer}
}
```

# New Experiment (Dynamic Enviornment)
The adapted policy can grasp the moving cube in 30 mints of retraining.
![](adapted_policy.gif)

Before, the base grasping policy trained on a static cube is not able to grasp the moving cube.
![](base_policy.gif)

# Simulation Video
[![Video](https://img.youtube.com/vi/aX55Zc2XMTE/maxres3.jpg)](https://www.youtube.com/watch?v=aX55Zc2XMTE)

# Experimental validation
[![Video](https://github.com/Asad-Shahid/Intelligent-Task-Learning/blob/master/exp_image.png)](https://drive.google.com/file/d/1zlS-_HIWMlIAvrxqGNGRyMbuDfQrws8z/view)


## Installation

To use this toolkit, it is required to first install [MuJoCo 200](https://www.roboti.us/index.html) and then [mujoco-py](https://github.com/openai/mujoco-py) from Open AI. mujoco-py allows using MuJoCo from python interface.
The installation requires python 3.6 or higher. It is recommended to install all the required packages under a conda virtual environment


## References
This toolit is mainly developed based on [Surreal Robotics Suite](https://github.com/StanfordVL/robosuite) and the Reinforcement learning part is referenced from
[this repo](https://github.com/clvrai/furniture)

Owner

Name: Asad A. Shahid
Login: Asad-Shahid
Kind: user

Repositories: 2
Profile: https://github.com/Asad-Shahid

GitHub Events

Total

Fork event: 1

Last Year

Fork event: 1

Dependencies

panda/environment.yml pypi

atari-py ==0.2.6
benedict ==0.3.2
cloudpickle ==1.2.2
colorlog ==4.1.0
configparser ==4.0.2
cython ==0.28.6
dill ==0.3.1.1
docker-pycreds ==0.4.0
fasteners ==0.15
future ==0.18.2
gitdb2 ==3.0.2
gitpython ==3.0.8
glfw ==1.10.1
gql ==0.2.0
graphql-core ==1.1
gym ==0.15.6
h5py ==2.10.0
hjson ==3.0.1
idna ==2.9
imageio ==2.6.1
imageio-ffmpeg ==0.4.0
imutils ==0.5.3
ipdb ==0.13.2
lockfile ==0.12.2
monotonic ==1.5
moviepy ==1.0.1
mpi4py ==3.0.3
mujoco-py ==1.50.1.68
numpy ==1.18.1
nvidia-ml-py3 ==7.352.0
opencv-python ==4.2.0.32
pathtools ==0.1.2
proglog ==0.1.9
progressbar2 ==3.47.0
promise ==2.3
psutil ==5.7.0
pybullet ==2.6.5
pycparser ==2.19
pyglet ==1.4.10
pyquaternion ==0.9.5
python-utils ==2.3.0
pyyaml ==5.3
requests ==2.23.0
sentry-sdk ==0.14.1
shortuuid ==0.5.0
smmap2 ==2.0.5
stable-baselines ==2.9.0
subprocess32 ==3.5.4
tensorflow-gpu ==1.14.0
tensorplex ==0.9.post1
terminaltables ==3.1.0
torchx ==0.9
tqdm ==4.43.0
wandb ==0.8.28
watchdog ==0.10.2
zmq ==0.0.0

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science