https://github.com/gscriva/pixel-cnn

A convolutional autoregressive model to boost Monte Carlo Simulations.

Science Score: 10.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
○
codemeta.json file
○
.zenodo.json file
○
DOI references
✓
Academic publication links
Links to: arxiv.org
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (14.5%) to scientific vocabulary

Keywords

deep-learning montecarlo-simulation pytorch

Last synced: 8 months ago · JSON representation

Repository

A convolutional autoregressive model to boost Monte Carlo Simulations.

Basic Info

Host: GitHub
Owner: gscriva
License: mit
Language: Python
Default Branch: main
Homepage:
Size: 67.4 KB

Statistics

Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Topics

deep-learning montecarlo-simulation pytorch

Created about 5 years ago · Last pushed over 2 years ago

Metadata Files

Readme License

PixelCNN for MCMC Ansatz

This is a custom implementation using PyTorch Lightning of the autoregressive PixelCNN model, see also: * Pixel Recurrent Neural Networks * Conditional Image Generation with PixelCNN Decoders * PixelCNN++: Improving the PixelCNN with Discretized Logistic Mixture Likelihood and Other Modifications * PixelSNAIL: An Improved Autoregressive Generative Model

Build the env

We highly recommend to create an isolated environment, e.g., using conda conda create -n <env> python==3.9 and then install all the dependencies running pip install -r requirements.txt.

Usage

To use, create your user in Weights and Biases (if you don't have one) and login typing wandb login.

Then you have to add train, validation and test datasets, which are .npy array of size [N,L,L] (L is the lattice size), in the correct folder, which is data/.

As last step remember to create a .env file (using the template template.env) that should look as bash export YOUR_TRAIN_DATASET_PATH="/your/root/to/train/dataset" export YOUR_VAL_DATASET_PATH="/your/root/to/validation/dataset" export YOUR_TEST_DATASET_PATH="/your/root/to/test/dataset" export PROJECT_ROOT="/your/project/root" So now you add to your repository the following files bash . ├── data # dataset dir │ ├── train_dataset.npy │ ├── validation_dataset.npy │ ├── test_dataset.npy ├── .env # system-specific env variables, e.g. PROJECT_ROOT │ To run training just type python src/run.py. If you need to modify default parameters just change the conf/* files according to your prefereces.

Best 2 checkpoints, i.e., best two models according to the validation loss, are saved in /your/project/root/wandb/online-run-YYYYMMDD_HHMMSS-hash/files/pixel-cnn/hash/checkpoints/epoch=XX-step=XXXX.ckpt.

To load and use the model to generate new sample run python src/generate.py --ckpt_path <ckpt_path> --num_sample <num>.

About

Generic template to bootstrap your PyTorch project. Click on and avoid writing boilerplate code for:

PyTorch Lightning, lightweight PyTorch wrapper for high-performance AI research.
Hydra, a framework for elegantly configuring complex applications.
DVC, track large files, directories, or ML models. Think "Git for data".
Weights and Biases, organize and analyze machine learning experiments. (educational account available)
Streamlit, turns data scripts into shareable web apps in minutes.

nn-template is opinionated so you don't have to be. If you use this template, please add to your README.

Usage Examples

Checkout the mwe branch to view a minimum working example on MNIST.

Structure

bash . ├── .cache ├── conf # hydra compositional config │ ├── data │ ├── default.yaml # current experiment configuration │ ├── hydra │ ├── logging │ ├── model │ ├── optim │ └── train ├── data # datasets ├── .env # system-specific env variables, e.g. PROJECT_ROOT ├── requirements.txt # basic requirements ├── src │ ├── common # common modules and utilities │ ├── pl_data # PyTorch Lightning datamodules and datasets │ ├── pl_modules # PyTorch Lightning modules │ ├── run.py # entry point to run current conf │ └── ui # interactive streamlit apps └── wandb # local experiments (auto-generated)

Data Version Control

DVC runs alongside git and uses the current commit hash to version control the data.

Initialize the dvc repository:

bash $ dvc init

To start tracking a file or directory, use dvc add:

bash $ dvc add data/ImageNet

DVC stores information about the added file (or a directory) in a special .dvc file named data/ImageNet.dvc, a small text file with a human-readable format. This file can be easily versioned like source code with Git, as a placeholder for the original data (which gets listed in .gitignore):

bash git add data/ImageNet.dvc data/.gitignore git commit -m "Add raw data"

Making changes

When you make a change to a file or directory, run dvc add again to track the latest version:

bash $ dvc add data/ImageNet

Switching between versions

The regular workflow is to use git checkout first to switch a branch, checkout a commit, or a revision of a .dvc file, and then run dvc checkout to sync data:

bash $ git checkout <...> $ dvc checkout

Weights and Biases

Weights & Biases helps you keep track of your machine learning projects. Use tools to log hyperparameters and output metrics from your runs, then visualize and compare results and quickly share findings with your colleagues.

This is an example of a simple dashboard.

Quickstart

Read more in the docs. Particularly useful the log method, accessible from inside a PyTorch Lightning module with self.logger.experiment.log.

W&B is our logger of choice, but that is a purely subjective decision. Since we are using Lightning, you can replace wandb with the logger you prefer (you can even build your own). More about Lightning loggers here.

Hydra

Hydra is an open-source Python framework that simplifies the development of research and other complex applications. The key feature is the ability to dynamically create a hierarchical configuration by composition and override it through config files and the command line. The name Hydra comes from its ability to run multiple similar jobs - much like a Hydra with multiple heads.

The basic functionalities are intuitive: it is enough to change the configuration files in conf/* accordingly to your preferences. Everything will be logged in wandb automatically.

Consider creating new root configurations conf/myawesomeexp.yaml instead of always using the default conf/default.yaml.

Sweeps

You can easily perform hyperparameters sweeps, which override the configuration defined in /conf/*.

The easiest one is the grid-search. It executes the code with every possible combinations of the specified hyperparameters:

bash PYTHONPATH=. python src/run.py -m optim.optimizer.lr=0.02,0.002,0.0002 optim.lr_scheduler.T_mult=1,2 optim.optimizer.weight_decay=0,1e-5

You can explore aggregate statistics or compare and analyze each run in the W&B dashboard.

We recommend to go through at least the Basic Tutorial, and the docs about Instantiating objects with Hydra.

PyTorch Lightning

Lightning makes coding complex networks simple. It is not a high level framework like keras, but forces a neat code organization and encapsulation.

You should be somewhat familiar with PyTorch and PyTorch Lightning before using this template.

Environment Variables

System specific variables (e.g. absolute paths to datasets) should not be under version control, otherwise there will be conflicts between different users.

The best way to handle system specific variables is through environment variables.

You can define new environment variables in a .env file in the project root. A copy of this file (e.g. .env.template) can be under version control to ease new project configurations.

To define a new variable write inside .env:

bash export MY_VAR=/home/user/my_system_path

You can dynamically resolve the variable name from Python code with:

python get_env('MY_VAR')

and in the Hydra .yaml configuration files with:

yaml ${oc.env:MY_VAR}

Owner

Name: Giuseppe Scriva
Login: gscriva
Kind: user
Location: Zurich
Company: University of Camerino

Website: https://cutt.ly/gN7rbGs
Repositories: 2
Profile: https://github.com/gscriva

Just another infinite monkey trying to write his Divine Comedy.

GitHub Events

Total

Last Year

Dependencies

requirements.txt pypi

dvc *
hydra-core ==1.1.0.dev5
matplotlib *
python-dotenv *
pytorch-lightning ==1.2.5
stqdm *
streamlit ==0.79.0
torch ==1.8.1
torchtext ==0.9.1
torchvision ==0.9.1
wandb ==0.10.23

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

https://github.com/gscriva/pixel-cnn

Science Score: 10.0%

Keywords

Repository

Basic Info

Statistics

Topics

Metadata Files

README.md

PixelCNN for MCMC Ansatz

Build the env

Usage

About

Usage Examples

Structure

Data Version Control

Making changes

Switching between versions

Weights and Biases

Quickstart

Hydra

Sweeps

PyTorch Lightning

Environment Variables

Owner

GitHub Events

Total

Last Year

Dependencies