https://github.com/asad-shahid/intelligent-task-learning
Science Score: 23.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
✓DOI references
Found 1 DOI reference(s) in README -
✓Academic publication links
Links to: arxiv.org, springer.com -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (9.3%) to scientific vocabulary
Last synced: 9 months ago
·
JSON representation
Repository
Basic Info
- Host: GitHub
- Owner: Asad-Shahid
- Language: Python
- Default Branch: master
- Size: 8.47 MB
Statistics
- Stars: 9
- Watchers: 1
- Forks: 2
- Open Issues: 0
- Releases: 0
Created almost 6 years ago
· Last pushed over 2 years ago
https://github.com/Asad-Shahid/Intelligent-Task-Learning/blob/master/
# Intelligent-Task-Learning
The Repository for the _Autonomous Robots 2022 Journal article:_
"[Continuous control actions learning and adaptation for robotic manipulation through reinforcement learning](https://link.springer.com/article/10.1007/s10514-022-10034-z)"
This contains a python toolkit for learning a grasping task with Franka Emika Panda Robot. The robot can be trained to grasp the cube, avoid obstacles and learn to manage redundancy using modern Reinforcement Learning algorithms of [Proximal Policy Optimization (PPO)](https://arxiv.org/abs/1707.06347) and [Soft Actor-Critic (SAC)](https://arxiv.org/abs/1812.05905). It is powered by [MuJoCo physics engine](http://www.mujoco.org/)
# How to cite
```
@article{shahid2022continuous,
title={Continuous control actions learning and adaptation for robotic manipulation through reinforcement learning},
author={Shahid, Asad Ali and Piga, Dario and Braghin, Francesco and Roveda, Loris},
journal={Autonomous Robots},
pages={1--16},
year={2022},
publisher={Springer}
}
```
# New Experiment (Dynamic Enviornment)
The adapted policy can grasp the moving cube in 30 mints of retraining.

Before, the base grasping policy trained on a static cube is not able to grasp the moving cube.

# Simulation Video
[](https://www.youtube.com/watch?v=aX55Zc2XMTE)
# Experimental validation
[](https://drive.google.com/file/d/1zlS-_HIWMlIAvrxqGNGRyMbuDfQrws8z/view)
## Installation
To use this toolkit, it is required to first install [MuJoCo 200](https://www.roboti.us/index.html) and then [mujoco-py](https://github.com/openai/mujoco-py) from Open AI. mujoco-py allows using MuJoCo from python interface.
The installation requires python 3.6 or higher. It is recommended to install all the required packages under a conda virtual environment
## References
This toolit is mainly developed based on [Surreal Robotics Suite](https://github.com/StanfordVL/robosuite) and the Reinforcement learning part is referenced from
[this repo](https://github.com/clvrai/furniture)
Owner
- Name: Asad A. Shahid
- Login: Asad-Shahid
- Kind: user
- Repositories: 2
- Profile: https://github.com/Asad-Shahid
GitHub Events
Total
- Fork event: 1
Last Year
- Fork event: 1
Dependencies
panda/environment.yml
pypi
- atari-py ==0.2.6
- benedict ==0.3.2
- cloudpickle ==1.2.2
- colorlog ==4.1.0
- configparser ==4.0.2
- cython ==0.28.6
- dill ==0.3.1.1
- docker-pycreds ==0.4.0
- fasteners ==0.15
- future ==0.18.2
- gitdb2 ==3.0.2
- gitpython ==3.0.8
- glfw ==1.10.1
- gql ==0.2.0
- graphql-core ==1.1
- gym ==0.15.6
- h5py ==2.10.0
- hjson ==3.0.1
- idna ==2.9
- imageio ==2.6.1
- imageio-ffmpeg ==0.4.0
- imutils ==0.5.3
- ipdb ==0.13.2
- lockfile ==0.12.2
- monotonic ==1.5
- moviepy ==1.0.1
- mpi4py ==3.0.3
- mujoco-py ==1.50.1.68
- numpy ==1.18.1
- nvidia-ml-py3 ==7.352.0
- opencv-python ==4.2.0.32
- pathtools ==0.1.2
- proglog ==0.1.9
- progressbar2 ==3.47.0
- promise ==2.3
- psutil ==5.7.0
- pybullet ==2.6.5
- pycparser ==2.19
- pyglet ==1.4.10
- pyquaternion ==0.9.5
- python-utils ==2.3.0
- pyyaml ==5.3
- requests ==2.23.0
- sentry-sdk ==0.14.1
- shortuuid ==0.5.0
- smmap2 ==2.0.5
- stable-baselines ==2.9.0
- subprocess32 ==3.5.4
- tensorflow-gpu ==1.14.0
- tensorplex ==0.9.post1
- terminaltables ==3.1.0
- torchx ==0.9
- tqdm ==4.43.0
- wandb ==0.8.28
- watchdog ==0.10.2
- zmq ==0.0.0