rl-heat-treatment

https://github.com/nima-siboni/rl-heat-treatment

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (11.7%) to scientific vocabulary

Last synced: 11 months ago · JSON representation ·

Repository

Basic Info

Host: GitHub
Owner: nima-siboni
Language: Python
Default Branch: main
Size: 35.2 KB

Statistics

Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Created almost 4 years ago · Last pushed over 2 years ago

Metadata Files

Readme Citation

Reinforcement Learning for Heat-treatment

This is a reinforcement learning solution for finding the optimal heat-treatment in material sciences. The reinforcement learning is done with RLlib on a 2D phase-field environment, Furnace-Env.

0 -- Installation

0.1 -- Creating the conda env

buildoutcfg conda create -n rl python=3.8

0.2 -- Installing the required packages

Activate the conda env: buildoutcfg conda activate rl

0.3 -- Update your pip

buildoutcfg pip install --upgrade pip

0.4 -- Clone the project

buildoutcfg git clone git@github.com:nima-siboni/RL-Heat-Treatment.git cd RL-Heat-Treatment

0.4 -- Install the requirements

Install the requirements in the env buildoutcfg pip install -r requirements.txt

0.5 -- Modifying RLlib (for `EpsilonGreedyCautious`)

If you want to use Caution-Epsilon-Greedy instead of the standard Epsilon-Greedy of RLlib, you need to follow these 3 steps: * First configure the exploration config in agent_config.cfg: python "exploration_config": {"type": "EpsilonGreedyCautious"} # or "EpsilonGreedy" * Then place the epsilon_greedy_cautious.py where RLlib saves its explorations; For example under conda environment directory followed by envs/rl/lib/python3.8/site-packages/ray/rllib/utils/exploration * In the same directory modify the __init__.py by importing the new epsilon scheduler and adding it to list of all explorations __all__: ```python from ray.rllib.utils.exploration.epsilongreedycautious import EpsilonGreedyCautious

all = [ ... "EpsilonGreedyCautious", ... ] ```

0.6 -- Installing the package

It is an optional step. You can install this repository as a package by running python pip install -e . from the main directory of the repo.

Have fun!

1 -- Configurations

The configuration of the enviornment, the agent, and the training are done by the following files * env_config.cfg: a config file for the environment, for details visit Furnace-Env. * agent_config.cfg: here one can set the RLlib's algorithm config, and finally * training_config.cfg: common training/logging configs are set here.

2 -- Learning

To start learning, execute the following command within the RL-Heat-Treatment: python experience and learn.py

3 -- Evaluating the trained agent

To evaluate the agent's performance, in agents_performance.py, set: * the directory from where the agent should be loaded or you can alternatively set the find_the_latest_agent = True to load the last checkpoint: buildoutcfg checkpoint_dir = 'training_results/checkpoints/checkpoint_000031/checkpoint-31' * the directory the results should be outputted buildoutcfg results_directory = './agents_performance_analysis/' Executing this file, creates a directory with one subdirectory for each evaluation episode containing: * the initial phase field, * the final phase field, * a GIF of intermediate PFs, * temperature vs step profile, * the density vs steps, * energy cost vs steps, * accumulated rewards vs steps, and * G2 vs steps.

To have an statistical analysis of the performance of the agent over many episodes, use performance_analysis.py to produce histograms and joint plots of the performance of the trained agent.

Owner

Name: semi-ergodic
Login: nima-siboni
Kind: user

Repositories: 2
Profile: https://github.com/nima-siboni

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "H. SIBONI"
  given-names: "NIMA"
- family-names: "REZAEI MIANROODI"
  given-names: "JABER"
title: "RL-Heat-Treatment"
version: 1.0.0
date-released: 2022-09-18
url: "https://github.com/nima-siboni/RL-Heat-Treatment"

GitHub Events

Total

Last Year

Dependencies

requirements.txt pypi

gymnasium ==0.28.1
numpy ==1.21.0
packaging ==21.0
pandas *
pre-commit *
protobuf ==3.20.1
pydantic *
ray ==2.8.0
ray *
seaborn *
tensorboard ==2.8.0
tensorflow ==2.8.0
tqdm *

setup.py pypi

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

rl-heat-treatment

Science Score: 44.0%

Repository

Basic Info

Statistics

Metadata Files

README.md

Reinforcement Learning for Heat-treatment

0 -- Installation

0.1 -- Creating the conda env

0.2 -- Installing the required packages

0.3 -- Update your pip

0.4 -- Clone the project

0.4 -- Install the requirements

0.5 -- Modifying RLlib (for `EpsilonGreedyCautious`)

0.6 -- Installing the package

1 -- Configurations

2 -- Learning

3 -- Evaluating the trained agent

Owner

Citation (CITATION.cff)

GitHub Events

Total

Last Year

Dependencies

rl-heat-treatment

Science Score: 44.0%

Repository

Basic Info

Statistics

Metadata Files

README.md

Reinforcement Learning for Heat-treatment

0 -- Installation

0.1 -- Creating the conda env

0.2 -- Installing the required packages

0.3 -- Update your pip

0.4 -- Clone the project

0.4 -- Install the requirements

0.5 -- Modifying RLlib (for EpsilonGreedyCautious)

0.6 -- Installing the package

1 -- Configurations

2 -- Learning

3 -- Evaluating the trained agent

Owner

Citation (CITATION.cff)

GitHub Events

Total

Last Year

Dependencies

0.5 -- Modifying RLlib (for `EpsilonGreedyCautious`)