agents

Reproducible results for the various types of reinforcement algorithms I have implemented

https://github.com/beegass/agents

Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
✓
DOI references
Found 1 DOI reference(s) in README
✓
Academic publication links
Links to: arxiv.org
○
Committers with academic emails
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (7.5%) to scientific vocabulary

Last synced: 10 months ago · JSON representation ·

Repository

Reproducible results for the various types of reinforcement algorithms I have implemented

Basic Info

Host: GitHub
Owner: BeeGass
License: other
Language: Python
Default Branch: master
Homepage:
Size: 6.72 MB

Statistics

Stars: 2
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Created over 4 years ago · Last pushed about 4 years ago

Metadata Files

Readme License Citation

RL Agents
Jax | Flux | PyTorch

A collection of Reinforcement Learning (RL) Methods I have implemented in jax/flax, flux and pytorch with particular effort put into readability and reproducibility.

Python

Requirements For Jax

Python >= 3.8
jax

Installation

$ git clone https://github.com/BeeGass/Agents.git

Usage

$ cd Agents/agents-jax $ python main.py

Requirements For PyTorch

PyTorch >= 1.10

Usage

$ cd Agents/agents-pytorch $ python main.py

Julia

Requirements For Flux

TODO
TODO

Usage

``` $ cd Agents/agents-flux $ # TBA

```

Config File Template yaml TBA

Weights And Biases Integration TBA

Preliminary RL Implementations

| Model | NumPy/Vanilla | Jax/Flax | Flux | Config | Paper | |:-------------------------------------------- |:-------------:|:--------:|:-------:|:-------:|:-------------------------------------------------------------------------------------------- | | Policy Evaluation | ☑ | ☐ | ☐ | ☐ | DS595-RL-Projects | | Policy Improvement | ☑ | ☐ | ☐ | ☐ | DS595-RL-Projects | | Policy Iteration | ☑ | ☐ | ☐ | ☐ | DS595-RL-Projects | | Value Iteration | ☑ | ☐ | ☐ | ☐ | DS595-RL-Projects | | On-policy first visit Monte-Carlo prediction | ☑ | ☐ | ☐ | ☐ | DS595-RL-Projects | | On-policy first visit Monte-Carlo control | ☑ | ☐ | ☐ | ☐ | DS595-RL-Projects | | Sarsa (on-policy TD control) | ☑ | ☐ | ☐ | ☐ | DS595-RL-Projects | | Q-learing (off-policy TD control) | ☑ | ☐ | ☐ | ☐ | DS595-RL-Projects |

Off-Policy Results

| Model | PyTorch | Jax/Flax | Flux | Config | Paper | |:----------- |:-------:|:--------:|:-------:|:-------:|:------------------------------------------ | | DQN | ☐ | ☐ | ☐ | ☐ | Link | | DDPG | ☐ | ☐ | ☐ | ☐ | Link | | DRQN | ☐ | ☐ | ☐ | ☐ | Link | | Dueling-DQN | ☐ | ☐ | ☐ | ☐ | Link | | Double-DQN | ☐ | ☐ | ☐ | ☐ | Link | | PER | ☐ | ☐ | ☐ | ☐ | Link | | Rainbow | ☐ | ☐ | ☐ | ☐ | Link |

Policy Results

| Model | PyTorch | Jax/Flax | Flux | Config | Paper | |:----- |:-------:|:--------:|:-------:|:-------:|:---------------------------------------- | | PPO | ☐ | ☐ | ☐ | ☐ | Link | | TRPO | ☐ | ☐ | ☐ | ☐ | Link | | SAC | ☐ | ☐ | ☐ | ☐ | Link | | A2C | ☐ | ☐ | ☐ | ☐ | Link | | A3C | ☐ | ☐ | ☐ | ☐ | Link | | TD3 | ☐ | ☐ | ☐ | ☐ | Link |

Fun Stuff

| Model | PyTorch | Jax/Flax | Flux | Config | Paper | |:------------------- |:-------:|:--------:|:-------:|:-------:|:-------------------------------------------- | | World Models | ☐ | ☐ | ☐ | ☐ | Link | | Dream to Control | ☐ | ☐ | ☐ | ☐ | Link | | Dream to Control v2 | ☐ | ☐ | ☐ | ☐ | Link |

Citation

bib @software{Gass_Agents_2021, author = {Gass, B.A., Gass, B.A.}, doi = {10.5281/zenodo.1234}, month = {12}, title = {{Agents}}, url = {https://github.com/BeeGass/Agents}, version = {1.0.0}, year = {2021} }

Owner

Name: Bryan
Login: BeeGass
Kind: user
Location: Cambridge, MA
Company: @USArmyResearchLab

Website: onlygass.dev
Twitter: BeeAGass
Repositories: 14
Profile: https://github.com/BeeGass

Research Engineer interested in SSMs

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Gass, B.A."
  given-names: "Gass, B.A."
  orcid: "https://orcid.org/0000-0000-0000-0000"
title: "Agents"
version: 1.0.0
doi: 10.5281/zenodo.1234
date-released: 2021-12-06
url: "https://github.com/BeeGass/Agents"

GitHub Events

Total

Last Year

Committers

Last synced: over 1 year ago

All Time

Total Commits: 33
Total Committers: 2
Avg Commits per committer: 16.5
Development Distribution Score (DDS): 0.303

Past Year

Commits: 0
Committers: 0
Avg Commits per committer: 0.0
Development Distribution Score (DDS): 0.0

Top Committers

Name	Email	Commits
Bryan	4****s	23
Bryan Gass	b**3@l**m	10

Issues and Pull Requests

Last synced: over 1 year ago

All Time

Total issues: 0
Total pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Total issue authors: 0
Total pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 0
Pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Issue authors: 0
Pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

agents

Science Score: 67.0%

Repository

Basic Info

Statistics

Metadata Files

README.md

RL Agents Jax | Flux | PyTorch

Python

Requirements For Jax

Installation

Usage

Requirements For PyTorch

Usage

Julia

Requirements For Flux

Usage

```

Preliminary RL Implementations

Off-Policy Results

Policy Results

Fun Stuff

Citation

Owner

Citation (CITATION.cff)

GitHub Events

Total

Last Year

Committers

All Time

Past Year

Top Committers

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels

RL Agents
Jax | Flux | PyTorch