agents

Reproducible results for the various types of reinforcement algorithms I have implemented

https://github.com/beegass/agents

Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in README
  • Academic publication links
    Links to: arxiv.org
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (7.5%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

Reproducible results for the various types of reinforcement algorithms I have implemented

Basic Info
  • Host: GitHub
  • Owner: BeeGass
  • License: other
  • Language: Python
  • Default Branch: master
  • Homepage:
  • Size: 6.72 MB
Statistics
  • Stars: 2
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created over 4 years ago · Last pushed almost 4 years ago
Metadata Files
Readme License Citation

README.md

RL Agents
Jax | Flux | PyTorch

A collection of Reinforcement Learning (RL) Methods I have implemented in jax/flax, flux and pytorch with particular effort put into readability and reproducibility.

Python

Requirements For Jax

  • Python >= 3.8
  • jax

Installation

$ git clone https://github.com/BeeGass/Agents.git

Usage

$ cd Agents/agents-jax $ python main.py

Requirements For PyTorch

  • PyTorch >= 1.10

Usage

$ cd Agents/agents-pytorch $ python main.py

Julia

Requirements For Flux

  • TODO
  • TODO

Usage

``` $ cd Agents/agents-flux $ # TBA

```

Config File Template yaml TBA

Weights And Biases Integration TBA


Preliminary RL Implementations

| Model | NumPy/Vanilla | Jax/Flax | Flux | Config | Paper | |:-------------------------------------------- |:-------------:|:--------:|:-------:|:-------:|:-------------------------------------------------------------------------------------------- | | Policy Evaluation | ☑ | ☐ | ☐ | ☐ | DS595-RL-Projects | | Policy Improvement | ☑ | ☐ | ☐ | ☐ | DS595-RL-Projects | | Policy Iteration | ☑ | ☐ | ☐ | ☐ | DS595-RL-Projects | | Value Iteration | ☑ | ☐ | ☐ | ☐ | DS595-RL-Projects | | On-policy first visit Monte-Carlo prediction | ☑ | ☐ | ☐ | ☐ | DS595-RL-Projects | | On-policy first visit Monte-Carlo control | ☑ | ☐ | ☐ | ☐ | DS595-RL-Projects | | Sarsa (on-policy TD control) | ☑ | ☐ | ☐ | ☐ | DS595-RL-Projects | | Q-learing (off-policy TD control) | ☑ | ☐ | ☐ | ☐ | DS595-RL-Projects |


Off-Policy Results

| Model | PyTorch | Jax/Flax | Flux | Config | Paper | |:----------- |:-------:|:--------:|:-------:|:-------:|:------------------------------------------ | | DQN | ☐ | ☐ | ☐ | ☐ | Link | | DDPG | ☐ | ☐ | ☐ | ☐ | Link | | DRQN | ☐ | ☐ | ☐ | ☐ | Link | | Dueling-DQN | ☐ | ☐ | ☐ | ☐ | Link | | Double-DQN | ☐ | ☐ | ☐ | ☐ | Link | | PER | ☐ | ☐ | ☐ | ☐ | Link | | Rainbow | ☐ | ☐ | ☐ | ☐ | Link |

Policy Results

| Model | PyTorch | Jax/Flax | Flux | Config | Paper | |:----- |:-------:|:--------:|:-------:|:-------:|:---------------------------------------- | | PPO | ☐ | ☐ | ☐ | ☐ | Link | | TRPO | ☐ | ☐ | ☐ | ☐ | Link | | SAC | ☐ | ☐ | ☐ | ☐ | Link | | A2C | ☐ | ☐ | ☐ | ☐ | Link | | A3C | ☐ | ☐ | ☐ | ☐ | Link | | TD3 | ☐ | ☐ | ☐ | ☐ | Link |

Fun Stuff

| Model | PyTorch | Jax/Flax | Flux | Config | Paper | |:------------------- |:-------:|:--------:|:-------:|:-------:|:-------------------------------------------- | | World Models | ☐ | ☐ | ☐ | ☐ | Link | | Dream to Control | ☐ | ☐ | ☐ | ☐ | Link | | Dream to Control v2 | ☐ | ☐ | ☐ | ☐ | Link |

Citation

bib @software{Gass_Agents_2021, author = {Gass, B.A., Gass, B.A.}, doi = {10.5281/zenodo.1234}, month = {12}, title = {{Agents}}, url = {https://github.com/BeeGass/Agents}, version = {1.0.0}, year = {2021} }

Owner

  • Name: Bryan
  • Login: BeeGass
  • Kind: user
  • Location: Cambridge, MA
  • Company: @USArmyResearchLab

Research Engineer interested in SSMs

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Gass, B.A."
  given-names: "Gass, B.A."
  orcid: "https://orcid.org/0000-0000-0000-0000"
title: "Agents"
version: 1.0.0
doi: 10.5281/zenodo.1234
date-released: 2021-12-06
url: "https://github.com/BeeGass/Agents"

GitHub Events

Total
Last Year

Committers

Last synced: over 1 year ago

All Time
  • Total Commits: 33
  • Total Committers: 2
  • Avg Commits per committer: 16.5
  • Development Distribution Score (DDS): 0.303
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Bryan 4****s 23
Bryan Gass b****3@l****m 10

Issues and Pull Requests

Last synced: over 1 year ago

All Time
  • Total issues: 0
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels