Delve

Delve: Neural Network Feature Variance Analysis - Published in JOSS (2022)

https://github.com/delve-team/delve

Science Score: 98.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 5 DOI reference(s) in README and JOSS metadata
  • Academic publication links
    Links to: arxiv.org, springer.com, zenodo.org
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
    Published in Journal of Open Source Software

Keywords

convolutional-neural-networks deep-learning layer-saturation model-training neural-dynamics pruning pytorch training-monitor visualization

Scientific Fields

Earth and Environmental Sciences Physical Sciences - 40% confidence
Sociology Social Sciences - 40% confidence
Last synced: 4 months ago

Repository

PyTorch model training and layer saturation monitor

Basic Info
Statistics
  • Stars: 82
  • Watchers: 3
  • Forks: 13
  • Open Issues: 2
  • Releases: 30
Topics
convolutional-neural-networks deep-learning layer-saturation model-training neural-dynamics pruning pytorch training-monitor visualization
Created over 7 years ago · Last pushed over 2 years ago
Metadata Files
Readme Changelog License Citation

README.rst

Delve: Deep Live Visualization and Evaluation |logo|
====================================================

|PyPI version| |Tests| |codecov.io| |License: MIT| |DOI| 

Delve is a Python package for analyzing the inference dynamics of your model.

.. image:: https://raw.githubusercontent.com/justinshenk/playground/master/saturation_demo.gif
   :alt: playground

Use Delve if you need a lightweight PyTorch extension that: 

- Gives you insight into the inference dynamics of your architecture 
- Allows you to optimize and adjust neural network models to your dataset without much trial and error
- Allows you to analyze the eigenspaces of your data at different stages of inference
- Provides basic tooling for experiment logging

Motivation
----------

Designing a deep neural network is a trial-and-error-heavy process that
mostly revolves around comparing performance metrics of different runs.
One of the key issues with this development process is that the results
of these metrics do not translate easily into concrete design
improvements. Delve provides you with spectral analysis tools that allow
you to investigate the inference dynamics evolving in the model while
training. This allows you to spot underutilized and unused layers,
mismatches between object size and neural architecture, and other
inefficiencies. These observations can be translated directly into
design changes to the architecture even before the model has fully
converged, allowing for a quicker and more guided design process.

This work is closely related to Maithra Raghu (Google Brain) et al.'s work on SVCCA:

- "Maithra Raghu on the differences between wide and deep networks", 2020 `[YouTube] `_
- "SVCCA: Singular Vector Canonical Correlation Analysis for Deep Learning and Interpretability", 2017 `[arXiv] `_
  
Installation
------------

.. code:: bash

   pip install delve

Using Layer Saturation to improve model performance
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The saturation metric is the core feature of Delve. By default,
saturation is a value between 0 and 1.0 computed for any convolutional,
LSTM, or dense layer in the network. The saturation describes the
percentage of eigendirections required to explain 99% of the
variance. Simply speaking, it tells you how much your data is “filling
up” the individual layers inside your model.
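
Under the hood this boils down to an eigendecomposition of the covariance
matrix of a layer's output features. The sketch below is a rough, hypothetical
illustration of that computation (not Delve's actual implementation; the
function and variable names are made up):

.. code:: python

   import numpy as np

   def saturation(features: np.ndarray, delta: float = 0.99) -> float:
       """Fraction of eigendirections needed to explain `delta` of the
       variance of a (samples x units) activation matrix."""
       cov = np.cov(features, rowvar=False)            # feature covariance
       eigvals = np.linalg.eigvalsh(cov)[::-1]         # eigenvalues, descending
       explained = np.cumsum(eigvals) / eigvals.sum()  # cumulative variance ratio
       k = int(np.searchsorted(explained, delta)) + 1  # smallest k reaching delta
       return k / features.shape[1]                    # normalize by layer width

   # e.g. 2048 samples from a 128-unit dense layer
   acts = np.random.randn(2048, 128)
   print(saturation(acts))  # close to 1.0 for isotropic random activations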

In the image below you can see how saturation portrays inefficiencies
in your neural network. The depicted model is a ResNet18 trained on 32
pixel images, which is far too small for a model with a receptive field
exceeding 400 pixels in the final layers.

.. image:: https://raw.githubusercontent.com/delve-team/delve/master/images/resnet.PNG
   :alt: resnet.PNG

To visualize what this poorly chosen input resolution does to the
inference, we trained logistic regressions on the output of every layer
to solve the same task as the model. You can clearly see that only the
first half of the model (at best) is improving the intermediate solutions
of our logistic regression “probes”. The layers following this
contribute nothing to the quality of the prediction! You can also see
that saturation is extremely low for these layers!
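
Such a linear “probe” can be reproduced with a forward hook that collects a
layer's outputs and an off-the-shelf logistic regression. The snippet below is
a minimal sketch of the idea (names are illustrative, not the exact setup used
for the figure):

.. code:: python

   import torch
   from sklearn.linear_model import LogisticRegression

   def probe_layer(model, layer, loader, device="cpu"):
       """Fit a logistic regression on one layer's outputs and report
       its training accuracy as a proxy for linear separability."""
       feats, targets = [], []
       # the hook stores the flattened activations of `layer`
       handle = layer.register_forward_hook(
           lambda mod, inp, out: feats.append(out.flatten(1).detach().cpu()))
       model.eval()
       with torch.no_grad():
           for images, labels in loader:
               model(images.to(device))
               targets.append(labels)
       handle.remove()
       X = torch.cat(feats).numpy()
       y = torch.cat(targets).numpy()
       return LogisticRegression(max_iter=1000).fit(X, y).score(X, y)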

We call this a *tail* and it can be removed by either increasing the
input resolution or (which is more economical) reducing the receptive
field size to match the object size of your dataset.

.. figure:: https://raw.githubusercontent.com/delve-team/delve/master/images/resnetBetter.PNG
   :alt: resnetBetter.PNG

We can do this by removing the first two downsampling layers, which
quarters the growth of the receptive field of the network. This not
only reduces the number of parameters but also makes better use of the
available parameters by letting more layers contribute effectively!
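
To see why removing downsampling layers has such a big effect, recall that the
receptive field of a stack of convolution and pooling layers grows layer by
layer according to the standard recurrence ``r_out = r_in + (k - 1) * j`` and
``j_out = j_in * s``, where ``k`` is the kernel size, ``s`` the stride, and
``j`` the cumulative stride (“jump”). A small sketch with illustrative layer
configurations (not the exact ResNet18 definition):

.. code:: python

   def receptive_field(layers):
       """layers: list of (kernel_size, stride) tuples, applied in order."""
       r, j = 1, 1              # receptive field size and cumulative stride
       for k, s in layers:
           r += (k - 1) * j     # each layer widens the field by (k - 1) * jump
           j *= s               # strided layers multiply the jump
       return r

   # stem with two stride-2 operations followed by a few 3x3 convolutions
   print(receptive_field([(7, 2), (3, 2), (3, 1), (3, 1)]))  # -> 27
   # the same stack without the stride-2 operations grows far more slowly
   print(receptive_field([(7, 1), (3, 1), (3, 1), (3, 1)]))  # -> 13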

**For more details check our publications on this topic:**

- `Spectral Analysis of Latent Representations `__
- `Feature Space Saturation during Training `__
- `(Input) Size Matters for CNN Classifiers `__
- `Should you go deeper? Optimizing Convolutional Neural Networks without training `__
- Go with the Flow: the distribution of information processing in multi-path networks (soon)

Demo
----

.. code:: python


   import torch
   from delve import SaturationTracker
   from torch.cuda import is_available
   from torch.nn import CrossEntropyLoss
   from torchvision.datasets import CIFAR10
   from torchvision.transforms import ToTensor, Compose
   from torch.utils.data.dataloader import DataLoader
   from torch.optim import Adam
   from torchvision.models.vgg import vgg16

   from tqdm import tqdm

   if __name__ == "__main__":

     # set up the compute device
     device = "cuda:0" if is_available() else "cpu"

     # Get some data
     train_data = CIFAR10(root="./tmp", train=True,
                          download=True, transform=Compose([ToTensor()]))
     test_data = CIFAR10(root="./tmp", train=False, download=True, transform=Compose([ToTensor()]))

     train_loader = DataLoader(train_data, batch_size=1024,
                               shuffle=True, num_workers=6,
                               pin_memory=True)
     test_loader = DataLoader(test_data, batch_size=1024,
                              shuffle=False, num_workers=6,
                              pin_memory=True)

     # instantiate model
     model = vgg16(num_classes=10).to(device)

     # instantiate optimizer and loss
     optimizer = Adam(params=model.parameters())
     criterion = CrossEntropyLoss().to(device)

     # initialize delve
     tracker = SaturationTracker("my_experiment", save_to="plotcsv", modules=model, device=device)

     # begin training
     for epoch in range(10):
       model.train()
       for (images, labels) in tqdm(train_loader):
         images, labels = images.to(device), labels.to(device)
         optimizer.zero_grad(set_to_none=True)
         # mixed-precision forward pass
         with torch.cuda.amp.autocast():
           outputs = model(images)

           loss = criterion(outputs, labels)
         loss.backward()
         optimizer.step()

       total = 0
       test_loss = 0
       correct = 0
       model.eval()
       for (images, labels) in tqdm(test_loader):
         images, labels = images.to(device), labels.to(device)
         outputs = model(images)
         loss = criterion(outputs, labels)
         _, predicted = torch.max(outputs.data, 1)

         total += labels.size(0)
         correct += torch.sum((predicted == labels)).item()
         test_loss += loss.item()

       # add some additional metrics we want to keep track of
       tracker.add_scalar("accuracy", correct / total)
       tracker.add_scalar("loss", test_loss / total)

       # add saturation to the mix
       tracker.add_saturations()

     # close the tracker to finish training
     tracker.close()

Supported Layers
----------------

* Dense/Linear
* LSTM
* Convolutional

Citation
--------

If you use Delve in your publication, please cite:

.. code-block:: txt

   @software{delve,
     author       = {Justin Shenk and
                     Mats L. Richter and
                     Wolf Byttner and
                     Michał Marcinkiewicz},
     title        = {delve-team/delve: Latest},
     month        = aug,
     year         = 2021,
     publisher    = {Zenodo},
     version      = {v0.1.50},
     doi          = {10.5281/zenodo.5233859},
     url          = {https://doi.org/10.5281/zenodo.5233859}
   }


Why this name, Delve?
~~~~~~~~~~~~~~~~~~~~~

**delve** (*verb*):

-  reach inside a receptacle and search for something
-  to carry on intensive and thorough research for data, information, or
   the like

.. |logo| image:: https://github.com/delve-team/delve/blob/master/images/delve_logo.png
.. |PyPI version| image:: https://badge.fury.io/py/delve.svg
   :target: https://badge.fury.io/py/delve
.. |Tests| image:: https://github.com/delve-team/delve/actions/workflows/tests.yaml/badge.svg
   :target: https://github.com/delve-team/delve/actions/workflows/tests.yaml
.. |codecov.io| image:: https://codecov.io/github/delve-team/delve/coverage.svg?branch=master
   :target: https://codecov.io/github/delve-team/delve/?branch=master
.. |License: MIT| image:: https://img.shields.io/badge/License-MIT-blue.svg
   :target: https://opensource.org/licenses/MIT
.. |DOI| image:: https://zenodo.org/badge/136951823.svg
   :target: https://zenodo.org/badge/latestdoi/136951823

Owner

  • Name: Delve
  • Login: delve-team
  • Kind: organization

Delve is a library for visualizing layer saturation during neural network training

JOSS Publication

Delve: Neural Network Feature Variance Analysis
Published
January 29, 2022
Volume 7, Issue 69, Page 3992
Authors
Justin Shenk ORCID
VisioLab, Berlin, Germany, Institute of Cognitive Science, University of Osnabrueck, Osnabrueck, Germany
Mats L. Richter ORCID
Institute of Cognitive Science, University of Osnabrueck, Osnabrueck, Germany
Wolf Byttner ORCID
Rapid Health, London, England, United Kingdom
Editor
Rachel Kurchin ORCID
Tags
deep learning machine learning saturation pytorch AI

Citation (CITATION)

@software{delve,
  author       = {Justin Shenk and
                  Mats L. Richter and
                  Wolf Byttner and
                  Michał Marcinkiewicz},
  title        = {delve-team/delve: v0.1.45},
  month        = aug,
  year         = 2021,
  publisher    = {Zenodo},
  version      = {v0.1.45},
  doi          = {10.5281/zenodo.5233860},
  url          = {https://doi.org/10.5281/zenodo.5233860}
}

Papers & Mentions

Total mentions: 3

Systematic analysis of transcription start sites in avian development
Last synced: 2 months ago
Transcription start site profiling of 15 anatomical regions of the Macaca mulatta central nervous system
Last synced: 2 months ago
Monitoring transcription initiation activities in rat and dog
Last synced: 2 months ago

GitHub Events

Total
  • Watch event: 4
Last Year
  • Watch event: 4

Committers

Last synced: 5 months ago

All Time
  • Total Commits: 468
  • Total Committers: 8
  • Avg Commits per committer: 58.5
  • Development Distribution Score (DDS): 0.344
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
  • Justin Shenk (s****n@g****m): 307 commits
  • MLRichter (m****r@u****e): 146 commits
  • Wolf Byttner (w****f@r****k): 8 commits
  • Sambit Kumar Dash (s****h@g****m): 2 commits
  • Dion Häfner (m****l@d****e): 2 commits
  • mmarcinkiewicz (m****z@g****m): 1 commit
  • Mats Richter (m****r@a****e): 1 commit
  • MLRichter (D****!): 1 commit

Issues and Pull Requests

Last synced: 4 months ago

All Time
  • Total issues: 21
  • Total pull requests: 46
  • Average time to close issues: about 2 months
  • Average time to close pull requests: 3 days
  • Total issue authors: 10
  • Total pull request authors: 7
  • Average comments per issue: 1.76
  • Average comments per pull request: 0.59
  • Merged pull requests: 40
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • justinshenk (7)
  • dionhaefner (5)
  • stared (2)
  • themightyoarfish (1)
  • dawrym (1)
  • Garrus990 (1)
  • liujuncn (1)
  • Saran-nns (1)
  • MLRichter (1)
  • marthinwurer (1)
Pull Request Authors
  • MLRichter (26)
  • justinshenk (12)
  • WolfByttner (3)
  • dionhaefner (2)
  • rivol (1)
  • mmarcinkiewicz (1)
  • sambitdash (1)
Top Labels
Issue Labels
bug (3) good first issue (1)
Pull Request Labels
bug (2) enhancement (1)

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 154 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 1
  • Total versions: 44
  • Total maintainers: 2
pypi.org: delve

Delve lets you monitor PyTorch model layer saturation during training

  • Versions: 44
  • Dependent Packages: 0
  • Dependent Repositories: 1
  • Downloads: 154 Last month
Rankings
Stargazers count: 7.9%
Forks count: 9.8%
Dependent packages count: 10.1%
Average: 15.4%
Dependent repos count: 21.6%
Downloads: 27.8%
Maintainers (2)
Last synced: 4 months ago

Dependencies

requirements/docs.txt pypi
  • codecov *
  • ipython *
  • matplotlib *
  • pandas *
  • pydata-sphinx-theme *
  • pytest *
  • pytest-cov *
  • sphinx *
  • sphinx-gallery *
  • torch *
  • tqdm *
  • yapf *
requirements/requirements.txt pypi
  • matplotlib *
  • numpy *
  • pandas *
  • pytest *
  • tensorboardX *
  • tqdm *
  • yapf *
.github/workflows/draft-pdf.yml actions
  • actions/checkout v2 composite
  • actions/upload-artifact v1 composite
  • openjournals/openjournals-draft-action master composite
.github/workflows/python-app.yml actions
  • actions/checkout v2 composite
  • actions/setup-python v2 composite
.github/workflows/tests.yaml actions
  • actions/checkout v2 composite
  • codecov/codecov-action v2.0.2 composite
  • conda-incubator/setup-miniconda v2 composite
environment.yml pypi
setup.py pypi