sfpk-main

Official implementation of SFPK-pruner in ICLR 2025 "Probabilistic Neural Pruning via Sparsity Evolutionary Fokker-Planck-Kolmogorov Equation"

https://github.com/mzf666/sfpk-main

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (11.1%) to scientific vocabulary

Last synced: 10 months ago · JSON representation ·

Repository

Official implementation of SFPK-pruner in ICLR 2025 "Probabilistic Neural Pruning via Sparsity Evolutionary Fokker-Planck-Kolmogorov Equation"

Basic Info

Host: GitHub
Owner: mzf666
License: apache-2.0
Language: Python
Default Branch: main
Size: 680 KB

Statistics

Stars: 2
Watchers: 1
Forks: 0
Open Issues: 1
Releases: 0

Created over 1 year ago · Last pushed over 1 year ago

Metadata Files

Readme License Citation

Official Implementation of Sparsity Evolutionary Fokker-Planck-Kolmogorov Equation (ICLR 2025)

Overview

What is SFPK? This repo contains the pre-release implementation of Sparsity Evolutionary Fokker-Planck-Kolmogorov Equation pruner (SFPK-pruner), a novel probabilistic neural network pruner proposed in SFPK: Probabilistic Neural Pruning via Sparsity Evolutionary Fokker-Planck-Kolmogorov Equation, ICLR 2025.

Roughly speaking, SFPK explicitly models the evolution of the population (e.g. mask distribution) of the optimal subnetwork as the sparsity progressively increases. Therefore, one can progressively prune a dense model to the target sparsity level, by simulating the SFPK dynamic initialized from the dense model (with $0$-sparsity).

Why is it termed SFPK? Conceptually similar to the traditional Fokker-Planck-Kolmogorov (FPK) equation, which models the time evolution of particle density in thermodynamics, the proposed SFPK theory offers a closed-form, mathematically tractable approach to approximate the evolution of the optimal subnetwork distribution as sparsity gradually increases, serving as an analog to the FPK equation in probabilistic neural pruning.

How does SFPK work? Intuitively, SFPK-pruner can be interpreted as running an interacting ensemble of $n$ mask trajectories with increasing sparsity. While each mask is pulled by an attraction force (blue arrows) from the local optimum of the next sparsity level, the interaction energy introduces repulsive forces (red arrows) between pairs of trajectories, which encourages the trajectories to explore diverse optima. This exploration effect is further enhanced by the gradient noise induced by the mini-batch gradient computation.

In practice, our SFPK-pruner is a model-agnostic pruning method since it does not rely on a specific formulation of the model loss with respect to the model parameters and masks. Additionally, one can implement the mask variable as either filter masks, channel masks, or node masks to conduct structured pruning with desired granularity.

Updates

Currently, we are exploring SFPK on more architectures (i.e., diffusion, GPT) and datasets. We are also working on SFPK acceleration. We welcome any discussions, suggestions, and feedback on SFPK.

02-Feb-2025: SFPK implementation (version 1.0) released.
23-Jan-2025: SFPK is accepted to ICLR 2025.

Installation

To install the dependency:

bash pip install -r requirements.txt

Usage

Our scripts are tested on NVIDIA A100 GPUs with Python 3.8.12 and PyTorch 1.13.0.

By configuring arguments in main.py, one can train target dense model from scratch via Experiments/pretrain.py, apply oneshot pruning to a specified model via Experiments/oneshot.py, or apply iterative pruning to a specified model via Experiments/iterative.py.

Before executing the scripts, one need to fill the placeholders <host_name> <root>, <data_dir> and <model_dir> in main.py, lines 22-36.

To train a ResNet-20 from scratch on CIFAR-100:

bash python main.py \ --exp pretrain \ --seed 0 \ --cuda_idx 0 \ --num_workers 8 \ --data cifar100 \ --model resnet20 \ --use_init

To apply $95\%$-sparsity oneshot pruning to ResNet-20 on CIFAR-100 by:

bash python main.py \ --exp oneshot \ --seed 0 \ --cuda_idx 0 \ --num_workers 8 \ --data cifar100 \ --model resnet20 \ --bsz 64 \ --prn_epochs 1 \ --ft_epochs 100 \ --fixed_lr_epochs 0 \ --lr 1e-2 \ --warmup_epochs 5 \ --cold_start_lr 1e-4 \ --sparsity 0.05 \ # sparsity = remain / total --prn_scope global \ --pruner SFPK \ --N 100 \ --r 1.4 \ --sfpk_n_mask 10 \ --sfpk_repl_mode exp \ --sfpk_repl_lam 0.2 \ --sfpk_repl_weighted \ --sfpk_vote_ratio 1 \ --sfpk_vote_mode soft # --structural # for structured pruning

To apply $95\%$-sparsity $10$-epoch iterative pruning to ResNet-50-V2 on ImageNet:

bash python main.py \ --exp iterative \ --seed 0 \ --cuda_idx 0 \ --num_workers 8 \ --data imagenet \ --model resnet50v2 \ --bsz 256 \ --prn_epochs 10 \ --ft_epochs 50 \ --fixed_lr_epochs 0 \ --lr 1e-2 \ --grad_n_accum 8 \ --warmup_epochs 5 \ --cold_start_lr 1e-4 \ --sparsity 0.05 \ # sparsity = remain / total --prn_scope global \ --pruner SFPK \ --N 150 \ --r 5.5 \ --sfpk_n_mask 10 \ --sfpk_repl_mode exp \ --sfpk_repl_lam 0.2 \ --sfpk_repl_weighted \ --sfpk_vote_ratio 1 \ --sfpk_vote_mode soft

Thanks

This repo is heavily borrowed from SynFlow and SpODE.

Citation

bibtex @inproceedings{ SFPK_iclr2025, title={Probabilistic Neural Pruning via Sparsity Evolutionary Fokker-Planck-Kolmogorov Equation}, author={Zhanfeng Mo, Haosen Shi, Sinno Jialin Pan}, booktitle={The Thirteenth International Conference on Learning Representations}, year={2025}, url={https://openreview.net/forum?id=hJ1BaJ5ELp} }

Owner

Login: mzf666
Kind: user

Repositories: 1
Profile: https://github.com/mzf666

Citation (CITATION.cff)

cff-version: 1.0.0
title: "Probabilistic Neural Pruning via Sparsity Evolutionary Fokker-Planck-Kolmogorov Equation"
version: 1.0.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: "Mo"
    given-names: "Zhanfeng"
year: 2025
repository-code: "https://openreview.net/forum?id=hJ1BaJ5ELp"

GitHub Events

Total

Issues event: 1
Watch event: 5
Push event: 2
Create event: 2

Last Year

Issues event: 1
Watch event: 5
Push event: 2
Create event: 2

Dependencies

requirements.txt pypi

PyPDF2 ==2.11.2
PyYAML ==6.0
easydict ==1.10
matplotlib ==3.5.1
numpy ==1.22.4
pandas ==1.4.3
scipy ==1.7.3
seaborn ==0.12.1
torch ==1.13.0
torchvision ==0.14.0

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science