sheeprl
Distributed Reinforcement Learning accelerated by Lightning Fabric
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file: Found CITATION.cff file
- ✓ codemeta.json file: Found codemeta.json file
- ✓ .zenodo.json file: Found .zenodo.json file
- ○ DOI references
- ○ Academic publication links
- ○ Committers with academic emails
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: Low similarity (4.8%) to scientific vocabulary
Keywords
Repository
Distributed Reinforcement Learning accelerated by Lightning Fabric
Basic Info
- Host: GitHub
- Owner: Eclectic-Sheep
- License: apache-2.0
- Language: Python
- Default Branch: main
- Homepage: https://eclecticsheep.ai
- Size: 21.8 MB
Statistics
- Stars: 388
- Watchers: 4
- Forks: 54
- Open Issues: 25
- Releases: 25
Topics
Metadata Files
README.md
⚡ SheepRL 🐑
| Environment | Total frames | Training time | Test reward | Paper reward | GPUs |
|---|---|---|---|---|---|
| Crafter | 1M | 1d 3h | 12.1 | 11.7 | 1-V100 |
| Atari-MsPacman | 100K | 14h | 1542 | 1327 | 1-3080 |
| Atari-Boxing | 100K | 14h | 84 | 78 | 1-3080 |
| DOA++ (w/o optimizations)<sup>1</sup> | 7M | 18d 22h | 2726/3328<sup>2</sup> | N.A. | 1-3080 |
| Minecraft-Nav (w/o optimizations) | 8M | 16d 4h | 27% >= 70, 14% >= 100 | N.A. | 1-V100 |

1. For comparison: 1M in 2d 7h vs 1M in 1d 5h (before and after optimizations resp.)
2. Best leaderboard score in DIAMBRA (11/7/2023)
Benchmarks
The training times of our implementations compared to the ones of Stable Baselines3 are shown below:
| Algorithm | Devices | SheepRL v0.4.0 | SheepRL v0.4.9 | SheepRL v0.5.2 (Numpy Buffers) | SheepRL v0.5.5 (Numpy Buffers) | StableBaselines3<sup>1</sup> |
|---|---|---|---|---|---|---|
| PPO | 1 device | 192.31s ± 1.11 | 138.3s ± 0.16 | 80.81s ± 0.68 | 81.27s ± 0.47 | 77.21s ± 0.36 |
| | 2 devices | 85.42s ± 2.27 | 59.53s ± 0.78 | 46.09s ± 0.59 | 36.88s ± 0.30 | N.D. |
| A2C | 1 device | N.D. | N.D. | N.D. | 84.76s ± 0.37 | 84.22s ± 0.99 |
| | 2 devices | N.D. | N.D. | N.D. | 28.95s ± 0.75 | N.D. |
| SAC | 1 device | 421.37s ± 5.27 | 363.74s ± 3.44 | 318.06s ± 4.46 | 320.21s ± 6.29 | 336.06s ± 12.26 |
| | 2 devices | 264.29s ± 1.81 | 238.88s ± 4.97 | 210.07s ± 27 | 225.95s ± 3.65 | N.D. |
| Dreamer V1 | 1 device | 4201.23s | N.D. | 2921.38s | 2207.13s | N.D. |
| Dreamer V2 | 1 device | 1874.62s | N.D. | 1148.1s | 906.42s | N.D. |
| Dreamer V3 | 1 device | 2022.99s | N.D. | 1378.01s | 1589.30s | N.D. |
> [!NOTE]
> All experiments were run on 4 CPUs in Lightning Studio. All benchmarks except the Dreamer ones were run 5 times, and we report the mean and the standard deviation of the runs. We disabled the test function, the logging, and the checkpoints. Moreover, the models were not registered using MLFlow.
>
> The Dreamer benchmarks were run once, with logging and checkpoints enabled but without running the test function.
1. The StableBaselines3 version is `v2.2.1`; please install the package with `pip install stable-baselines3==2.2.1`
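The "mean ± std" entries in the benchmark table can be derived from raw timings along the following lines. This is a minimal sketch: the timing values are made-up placeholders, `summarize` is a hypothetical helper that is not part of SheepRL, and the use of the sample standard deviation is an assumption, since the text does not say which estimator was used.

```python
import statistics


def summarize(timings_s):
    """Return (mean, std) of a list of wall-clock timings in seconds.

    Assumption: sample standard deviation (n - 1 denominator).
    """
    mean = statistics.mean(timings_s)
    std = statistics.stdev(timings_s)
    return mean, std


# Placeholder timings for five hypothetical runs, NOT measured values.
runs = [80.0, 81.0, 82.0, 80.5, 81.5]
mean, std = summarize(runs)
print(f"{mean:.2f}s ± {std:.2f}")
```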
What
An easy-to-use framework for reinforcement learning in PyTorch, accelerated with Lightning Fabric.
The algorithms sheeped by sheeprl out-of-the-box are:
| Algorithm | Coupled | Decoupled | Recurrent | Vector obs | Pixel obs | Status |
| ------------------------- | ------------------ | ------------------ | ------------------ | ------------------ | ------------------ | ------------------ |
| A2C | :heavy_check_mark: | :x: | :x: | :heavy_check_mark: | :x: | :heavy_check_mark: |
| A3C | :heavy_check_mark: | :x: | :x: | :heavy_check_mark: | :x: | :construction: |
| PPO | :heavy_check_mark: | :heavy_check_mark: | :x: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
| PPO Recurrent | :heavy_check_mark: | :x: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
| SAC | :heavy_check_mark: | :heavy_check_mark: | :x: | :heavy_check_mark: | :x: | :heavy_check_mark: |
| SAC-AE | :heavy_check_mark: | :x: | :x: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
| DroQ | :heavy_check_mark: | :x: | :x: | :heavy_check_mark: | :x: | :heavy_check_mark: |
| Dreamer-V1 | :heavy_check_mark: | :x: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
| Dreamer-V2 | :heavy_check_mark: | :x: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
| Dreamer-V3 | :heavy_check_mark: | :x: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
| Plan2Explore (Dreamer V1) | :heavy_check_mark: | :x: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
| Plan2Explore (Dreamer V2) | :heavy_check_mark: | :x: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
| Plan2Explore (Dreamer V3) | :heavy_check_mark: | :x: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |

and more are coming soon! Open a PR if you have any particular request :sheep:
The actions supported by sheeprl agents are:

| Algorithm | Continuous | Discrete | Multi-Discrete |
| ------------------------- | ------------------ | ------------------ | ------------------ |
| A2C | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
| A3C | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
| PPO | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
| PPO Recurrent | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
| SAC | :heavy_check_mark: | :x: | :x: |
| SAC-AE | :heavy_check_mark: | :x: | :x: |
| DroQ | :heavy_check_mark: | :x: | :x: |
| Dreamer-V1 | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
| Dreamer-V2 | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
| Dreamer-V3 | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
| Plan2Explore (Dreamer V1) | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
| Plan2Explore (Dreamer V2) | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
| Plan2Explore (Dreamer V3) | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |

The environments supported by sheeprl are:
| Environment | Installation command | More info | Status |
| ------------------ | ---------------------------- | ----------------------------------------------- | ------------------ |
| Classic Control | `pip install sheeprl` | | :heavy_check_mark: |
| Box2D | `pip install sheeprl[box2d]` | Please install swig first with `pip install swig` | :heavy_check_mark: |
| Mujoco (Gymnasium) | `pip install sheeprl[mujoco]` | how_to/mujoco | :heavy_check_mark: |
| Atari | `pip install sheeprl[atari]` | how_to/atari | :heavy_check_mark: |
| DeepMind Control | `pip install sheeprl[dmc]` | how_to/dmc | :heavy_check_mark: |
| MineRL | `pip install sheeprl[minerl]` | how_to/minerl | :heavy_check_mark: |
| MineDojo | `pip install sheeprl[minedojo]` | how_to/minedojo | :heavy_check_mark: |
| DIAMBRA | `pip install sheeprl[diambra]` | how_to/diambra | :heavy_check_mark: |
| Crafter | `pip install sheeprl[crafter]` | https://github.com/danijar/crafter | :heavy_check_mark: |
| Super Mario Bros | `pip install sheeprl[supermario]` | https://github.com/Kautenja/gym-super-mario-bros/tree/master | :heavy_check_mark: |
Why
We want to provide a framework for RL algorithms that is at the same time simple and scalable thanks to Lightning Fabric.
Moreover, in many RL repositories, the RL algorithm is tightly coupled with the environment, making it harder to extend them beyond the gym interface. We want to provide a framework that makes it easy to decouple the RL algorithm from the environment, so that it can be used with any environment.
How to use it
Installation
Three options exist for installing SheepRL:
- Install the latest version directly from the PyPI index
- Clone the repo and install the local version
- pip-install the framework using the GitHub clone URL
Instructions for the three methods are shown below.
Install SheepRL from PyPI
You can install the latest version of SheepRL with
```bash
pip install sheeprl
```
> [!NOTE]
> To install optional dependencies, run for example `pip install sheeprl[atari,box2d,dev,mujoco,test]`.
> For detailed information about all the optional dependencies you can install, please have a look at the What section.
Cloning and installing a local version
First, clone the repo with:
```bash
git clone https://github.com/Eclectic-Sheep/sheeprl.git
cd sheeprl
```
From inside the newly created folder run
```bash
pip install .
```
> [!NOTE]
> To install optional dependencies, run for example `pip install .[atari,box2d,dev,mujoco,test]`.
Installing the framework from the GitHub repo
If you haven't already done so, create an environment with your choice of venv or conda.
The example uses Python's standard venv module and assumes macOS or Linux.
```sh
# create a virtual environment
python3 -m venv .venv
# activate the environment
source .venv/bin/activate
# if you do not wish to install extras such as mujoco, atari do
pip install "sheeprl @ git+https://github.com/Eclectic-Sheep/sheeprl.git"
# or, to install with atari and mujoco environment support, do
pip install "sheeprl[atari,mujoco,dev] @ git+https://github.com/Eclectic-Sheep/sheeprl.git"
# or, to install with box2d environment support, do
pip install swig
pip install "sheeprl[box2d] @ git+https://github.com/Eclectic-Sheep/sheeprl.git"
# or, to install with minedojo environment support, do
pip install "sheeprl[minedojo,dev] @ git+https://github.com/Eclectic-Sheep/sheeprl.git"
# or, to install with minerl environment support, do
pip install "sheeprl[minerl,dev] @ git+https://github.com/Eclectic-Sheep/sheeprl.git"
# or, to install with diambra environment support, do
pip install "sheeprl[diambra,dev] @ git+https://github.com/Eclectic-Sheep/sheeprl.git"
# or, to install with super mario bros environment support, do
pip install "sheeprl[supermario,dev] @ git+https://github.com/Eclectic-Sheep/sheeprl.git"
# or, to install several extras at once, do
pip install swig
pip install "sheeprl[box2d,atari,mujoco,minerl,supermario,dev,test] @ git+https://github.com/Eclectic-Sheep/sheeprl.git"
```
Additional: installing on an M-series Mac
> [!CAUTION]
> If you are on an M-series Mac and encounter an error attributed to box2d-py during installation, you need to install SWIG using the instructions shown below.

It is recommended to use homebrew to install SWIG to support Gym.
```sh
# if needed install homebrew
/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"
# then, do
brew install swig
# then attempt to pip install with the preferred method, such as
pip install "sheeprl[atari,box2d,mujoco,dev,test] @ git+https://github.com/Eclectic-Sheep/sheeprl.git"
```
Additional: MineRL and MineDojo
> [!NOTE]
> If you want to install the minedojo or minerl environment support, Java JDK 8 is required: you can install it by following the instructions at this link.

> [!CAUTION]
> MineRL and MineDojo environments have conflicting requirements, so DO NOT install them together with the `pip install sheeprl[minerl,minedojo]` command. Instead, install them individually with either `pip install sheeprl[minerl]` or `pip install sheeprl[minedojo]` before running an experiment with the MineRL or MineDojo environment, respectively.
Run an experiment with SheepRL
Now you can use one of the already available algorithms, or create your own. For example, to train a PPO agent on the CartPole environment with only vector-like observations, just run
```bash
python sheeprl.py exp=ppo env=gym env.id=CartPole-v1
```
if you have installed from a cloned repo, or
```bash
sheeprl exp=ppo env=gym env.id=CartPole-v1
```
if you have installed SheepRL from PyPI.
Similarly, you can check all the available algorithms with
```bash
python sheeprl/available_agents.py
```
if you have installed from a cloned repo, or
```bash
sheeprl-agents
```
if you have installed SheepRL from PyPI.
That's all it takes to train an agent with SheepRL! 🎉
Before you start using the SheepRL framework, it is highly recommended that you read the following instructional documents:
- How to run experiments
- How to modify the default configs
- How to work with steps
- How to select observations
Moreover, there are other useful documents in the `howto` folder; these documents contain guidance on how to properly use the framework.
:chart_with_upwards_trend: Check your results
Once you have trained an agent, a new folder called `logs` will be created, containing the logs of the training. You can visualize them with TensorBoard:
```bash
tensorboard --logdir logs
```
https://github.com/Eclectic-Sheep/sheeprl/assets/7341604/46ad4acd-180d-449d-b46a-25b4a1f038d9
:nerd_face: More about running an algorithm
What you run is the PPO algorithm with the default configuration. But you can also change the configuration by passing arguments to the script.
For example, in the default configuration, the number of parallel environments is 4. Let's try to change it to 8 by passing the `env.num_envs` argument:
```bash
sheeprl exp=ppo env=gym env.id=CartPole-v1 env.num_envs=8
```
All the available arguments, with their descriptions, are listed in the `sheeprl/configs` directory. You can find more information about the hierarchy of configs here.
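Dotted `key=value` arguments such as `env.num_envs=8` address nested entries of the configuration. The stdlib-only sketch below illustrates that idea. Note that `apply_overrides` and the sample `config` dictionary are hypothetical: they do not reproduce SheepRL's actual config machinery, and the only values taken from the text are the ones quoted in the commands above.

```python
import ast


def apply_overrides(config, overrides):
    """Apply 'a.b.c=value' style overrides to a nested dict in place."""
    for item in overrides:
        dotted_key, raw_value = item.split("=", 1)
        *parents, leaf = dotted_key.split(".")
        node = config
        for name in parents:
            # Walk (or create) the nested dictionaries along the dotted path.
            node = node.setdefault(name, {})
        try:
            # Interpret numbers, booleans, etc.; fall back to plain strings.
            value = ast.literal_eval(raw_value)
        except (ValueError, SyntaxError):
            value = raw_value
        node[leaf] = value
    return config


# Hypothetical starting config mirroring the command shown above.
config = {"exp": "ppo", "env": {"id": "CartPole-v1", "num_envs": 4}}
apply_overrides(config, ["env.num_envs=8", "env.id=CartPole-v1"])
print(config["env"]["num_envs"])  # 8
```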
Running with Lightning Fabric
To run the algorithm with Lightning Fabric, you need to specify the Fabric parameters through the CLI. For example, to run the PPO algorithm with 4 parallel environments on 2 devices, you can run:
```bash
sheeprl fabric.accelerator=cpu fabric.strategy=ddp fabric.devices=2 exp=ppo env=gym env.id=CartPole-v1
```
You can check the available parameters for Lightning Fabric here.
Evaluate your Agents
You can easily evaluate your trained agents from checkpoints: training configurations are retrieved automatically.
```bash
sheeprl-eval checkpoint_path=/path/to/checkpoint.ckpt fabric.accelerator=gpu env.capture_video=True
```
For more information, check the corresponding howto.
:book: Repository structure
The repository is structured as follows:
- `algos`: contains the implementations of the algorithms. Each algorithm is in a separate folder, and (possibly) contains the following files:
  - `<algorithm>.py`: contains the implementation of the algorithm.
  - `<algorithm>_decoupled.py`: contains the implementation of the decoupled version of the algorithm, if present.
  - `agent`: optional, contains the implementation of the agent.
  - `loss.py`: contains the implementation of the loss functions of the algorithm.
  - `utils.py`: contains utility functions for the algorithm.
- `configs`: contains the default configs of the algorithms.
- `data`: contains the implementation of the data buffers.
- `envs`: contains the implementation of the environment wrappers.
- `models`: contains the implementation of some standard models (building blocks), like the multi-layer perceptron (MLP) or a simple convolutional network (NatureCNN).
- `utils`: contains utility functions for the framework.
Coupled vs Decoupled
In the coupled version of an algorithm, the agent interacts with the environment and executes the training loop.
In the decoupled version, one process (the player) is responsible only for interacting with the environment, and all the other processes (the trainers) are responsible for executing the training loop. The player and the trainers communicate through distributed collectives, adopting the abstraction provided by Fabric's TorchCollective.
Coupled
The algorithm is implemented in the <algorithm>.py file.
There are 2 functions inside this script:
- `main()`: initializes all the components of the algorithm, and executes the interactions with the environment. Once enough data is collected, the training loop is executed by calling the `train()` function.
- `train()`: executes the training loop. It samples a batch of data from the buffer, computes the loss, and updates the parameters of the agent.
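The division of labour between the two functions can be pictured with a deliberately tiny, stdlib-only sketch. Everything in it is a stand-in chosen for illustration: the "environment" emits a constant reward, the "agent" is a single scalar parameter, and the signatures of `main()` and `train()` are hypothetical rather than those used in the repository.

```python
import random


def train(buffer, params, batch_size=4, lr=0.1):
    """Toy training step: sample a batch and nudge a scalar parameter."""
    batch = random.sample(buffer, batch_size)
    # Stand-in loss: mean squared distance between the parameter and the rewards.
    grad = sum(2 * (params["w"] - reward) for _, reward in batch) / batch_size
    params["w"] -= lr * grad
    return params


def main(total_steps=32, rollout_steps=8, seed=0):
    """Collect data from a toy environment, then call train() once enough is gathered."""
    random.seed(seed)
    params, buffer, updates = {"w": 0.0}, [], 0
    for step in range(1, total_steps + 1):
        obs = step      # placeholder observation
        reward = 1.0    # placeholder constant reward
        buffer.append((obs, reward))
        if step % rollout_steps == 0:  # enough data collected
            params = train(buffer, params)
            buffer.clear()
            updates += 1
    return params, updates


params, updates = main()
print(updates)  # 4
```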
Decoupled
The decoupled version of an algorithm is implemented in the <algorithm>_decoupled.py file.
There are 3 functions inside this script:
- `main()`: initializes all the components of the algorithm, the collectives for the communication between the player and the trainers, and calls the `player()` and `trainer()` functions.
- `player()`: executes the interactions with the environment. It samples an action from the policy network, executes it in the environment, and stores the transition in the buffer. After a predefined number of interactions with the environment, the player randomly splits the collected data into almost equal chunks and sends them separately to the trainers. It then waits for the trainers to finish the agent update.
- `trainer()`: executes the training loop. It receives a chunk of data from the player, computes the loss, and updates the parameters of the agent. After the agent has been updated, the first of the trainers sends back the updated agent weights to the player, which can interact again with the environment.
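The player's "randomly split into almost equal chunks" step can be pictured with a small stdlib sketch. It only illustrates the partitioning logic: `split_for_trainers` is a hypothetical name, the transitions are stand-in integers, and the real implementation exchanges data through distributed collectives rather than returning Python lists.

```python
import random


def split_for_trainers(transitions, num_trainers, seed=None):
    """Shuffle transitions and split them into near-equal chunks.

    Chunk sizes differ by at most one element.
    """
    rng = random.Random(seed)
    shuffled = list(transitions)
    rng.shuffle(shuffled)
    base, extra = divmod(len(shuffled), num_trainers)
    chunks, start = [], 0
    for rank in range(num_trainers):
        # The first `extra` trainers receive one additional transition.
        size = base + (1 if rank < extra else 0)
        chunks.append(shuffled[start:start + size])
        start += size
    return chunks


chunks = split_for_trainers(range(10), num_trainers=3, seed=0)
print([len(c) for c in chunks])  # [4, 3, 3]
```

Every transition ends up in exactly one chunk, so each trainer works on a disjoint share of the collected data.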
Algorithms implementation
For details about the implementation, check the README.md file inside the folder of each algorithm.
All algorithms are kept as simple as possible, in a CleanRL fashion. But to allow for more flexibility and also more clarity, we tried to abstract away anything that is not strictly related to the training loop of the algorithm.
For example, we decided to create a models folder with already-made models that can be composed to create the model of the agent.
For each algorithm, losses are kept in a separate module, so that their implementation is clear and can be easily utilized for the decoupled or the recurrent version of the algorithm.
:card_index_dividers: Buffer
For the buffer implementation, we chose to use a wrapper around a dictionary of Numpy arrays.
To enable a simple way to work with numpy memory-mapped arrays, we implemented the sheeprl.utils.memmap.MemmapArray, a container that handles the memory-mapped arrays.
This flexibility makes it very simple to implement, with the classes ReplayBuffer, SequentialReplayBuffer, EpisodeBuffer, and EnvIndependentReplayBuffer, all the buffers needed for on-policy and off-policy algorithms.
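For readers unfamiliar with memory-mapped arrays, the sketch below uses NumPy's own `numpy.memmap` directly. It does not use or reproduce the `MemmapArray` API, whose interface is not described here; the file name and array shape are arbitrary choices made for the example.

```python
import os
import tempfile

import numpy as np

with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "observations.memmap")

    # Create a disk-backed array of shape (T, B, obs_dim).
    writer = np.memmap(path, dtype=np.float32, mode="w+", shape=(16, 4, 3))
    writer[0] = 1.0   # write the first timestep for all environments
    writer.flush()    # make sure the data reaches the file
    del writer        # release the mapping

    # Re-open the same file read-only; pages are loaded from disk on access.
    reader = np.memmap(path, dtype=np.float32, mode="r", shape=(16, 4, 3))
    first_step_sum = float(reader[0].sum())
    del reader

print(first_step_sum)  # 12.0
```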
:mag: Technical details
The shape of the Numpy arrays in the dictionary is (T, B, *), where T is the number of timesteps, B is the number of parallel environments, and * is the shape of the data.
For the ReplayBuffer to be used as a RolloutBuffer, the proper buffer_size must be specified. For example, for PPO, the buffer_size must be [T, B], where T is the number of timesteps and B is the number of parallel environments.
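The `(T, B, *)` layout can be illustrated with a toy dictionary-of-arrays buffer. This is a sketch under stated assumptions, not the library's `ReplayBuffer`: the class name `DictBuffer`, its constructor arguments, and the `add`/`sample` signatures are all hypothetical.

```python
import numpy as np


class DictBuffer:
    """Toy circular buffer: a dict of arrays shaped (T, B, *data_shape)."""

    def __init__(self, buffer_size, n_envs, specs):
        # specs maps each key to (data_shape, dtype).
        self.buffer_size = buffer_size
        self.n_envs = n_envs
        self.data = {
            key: np.zeros((buffer_size, n_envs, *shape), dtype=dtype)
            for key, (shape, dtype) in specs.items()
        }
        self.pos = 0
        self.full = False

    def add(self, step):
        """Store one timestep; every value has shape (B, *data_shape)."""
        for key, value in step.items():
            self.data[key][self.pos] = value
        self.pos = (self.pos + 1) % self.buffer_size
        self.full = self.full or self.pos == 0

    def sample(self, batch_size, rng):
        """Uniformly sample (timestep, env) pairs from the filled region."""
        upper = self.buffer_size if self.full else self.pos
        t_idx = rng.integers(0, upper, size=batch_size)
        b_idx = rng.integers(0, self.n_envs, size=batch_size)
        return {key: arr[t_idx, b_idx] for key, arr in self.data.items()}


specs = {"observations": ((3,), np.float32), "rewards": ((1,), np.float32)}
buffer = DictBuffer(buffer_size=8, n_envs=4, specs=specs)
for t in range(8):
    buffer.add({"observations": np.full((4, 3), t), "rewards": np.ones((4, 1))})
batch = buffer.sample(batch_size=5, rng=np.random.default_rng(0))
print(buffer.data["observations"].shape, batch["observations"].shape)  # (8, 4, 3) (5, 3)
```

Here `T = 8` and `B = 4`, so filling the buffer exactly once corresponds to one rollout of 8 steps across 4 parallel environments.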
:bow: Contributing
The best way to contribute is by opening an issue to discuss a new feature or a bug, or by opening a PR to fix a bug or to add a new feature.
:mailbox_with_no_mail: Contacts
You can contact us for any further questions or discussions:
- Federico Belotti: belo.fede@outlook.com
- Davide Angioni: davide.angioni@orobix.com
- Refik Can Malli: refikcan.malli@orobix.com
- Michele Milesi: michele.milesi@orobix.com
Owner
- Name: EclecticSheep
- Login: Eclectic-Sheep
- Kind: organization
- Location: Italy
- Website: https://eclecticsheep.ai/
- Repositories: 1
- Profile: https://github.com/Eclectic-Sheep
Citation (CITATION.cff)
cff-version: 1.2.0
message: "Sheep us in your research paper! Use this for citations 🐑"
title: "SheepRL"
abstract: "A easy to use, easy to customize, production oriented framework for State of the Art Reinforcement Learning."
date-released: 2023-05-17
authors:
- name: "EclecticSheep"
- family-names: "Angioni"
  given-names: "Davide"
- family-names: "Belotti"
  given-names: "Federico"
- family-names: "Can Malli"
  given-names: "Refik"
- family-names: "Milesi"
  given-names: "Michele"
version: 0.5.7
license: "Apache-2.0"
DOI: 10.5281/zenodo.11030899
url: "https://eclecticsheep.ai"
repository-code: "https://github.com/Eclectic-Sheep/sheeprl/"
keywords:
- reinforcement learning
- deep learning
- artificial intelligence
GitHub Events
Total
- Issues event: 17
- Watch event: 81
- Delete event: 6
- Issue comment event: 17
- Push event: 69
- Pull request review event: 2
- Pull request review comment event: 2
- Fork event: 18
- Create event: 1
Last Year
- Issues event: 17
- Watch event: 81
- Delete event: 6
- Issue comment event: 17
- Push event: 69
- Pull request review event: 2
- Pull request review comment event: 2
- Fork event: 18
- Create event: 1
Committers
Last synced: 7 months ago
Top Committers
| Name | Email | Commits |
|---|---|---|
| belerico_t | f****i@o****m | 431 |
| Michele Milesi | m****i@s****t | 275 |
| Davide Angioni | d****i@o****m | 11 |
| pre-commit-ci[bot] | 6****] | 5 |
| DavideTr8 | 6****8 | 5 |
| orobix | s****1@o****m | 3 |
| Refik Can Malli | r****i@g****m | 1 |
| Locatelli Alex | 1****x | 1 |
| Justin G | j****n@g****v | 1 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 77
- Total pull requests: 108
- Average time to close issues: 22 days
- Average time to close pull requests: 5 days
- Total issue authors: 30
- Total pull request authors: 6
- Average comments per issue: 3.6
- Average comments per pull request: 0.07
- Merged pull requests: 90
- Bot issues: 0
- Bot pull requests: 4
Past Year
- Issues: 13
- Pull requests: 3
- Average time to close issues: about 1 month
- Average time to close pull requests: 12 days
- Issue authors: 11
- Pull request authors: 2
- Average comments per issue: 1.62
- Average comments per pull request: 1.0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- belerico (9)
- anthony0727 (6)
- defrag-bambino (6)
- LYK-love (6)
- Disastorm (6)
- LucaVendruscolo (5)
- geranim0 (4)
- zichunxx (3)
- chrisgao99 (3)
- michele-milesi (2)
- redzhepdx (2)
- crocodile13 (1)
- ll7 (1)
- brodequin-slaps (1)
- elisaparga19 (1)
Pull Request Authors
- belerico (66)
- michele-milesi (35)
- DavideTr8 (4)
- LucaVendruscolo (4)
- pre-commit-ci[bot] (3)
- LocaAlex (2)
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 1
- Total downloads:
  - pypi 89 last-month
- Total dependent packages: 1
- Total dependent repositories: 0
- Total versions: 13
- Total maintainers: 3
pypi.org: sheeprl
High-quality, single file and distributed implementation of Deep Reinforcement Learning algorithms with production-friendly features
- Documentation: https://sheeprl.readthedocs.io/
- License: Apache License, Version 2.0 (Copyright 2023 Eclectic Sheep Team)
- Latest release: 0.5.7 (published over 1 year ago)
Rankings
Maintainers (3)
Dependencies
- actions/checkout v3 composite
- actions/setup-python v4 composite
- gymnasium [box2d]==0.28.*
- lightning ==2.0.*
- lightning-utilities <0.9
- moviepy ==1.0.*
- python-dotenv >=1.0.0
- tensorboard >=2.10
- tensordict ==0.1.*