llmcompressor
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
bimp
Code to reproduce the experiments of ICLR2023-paper: How I Learned to Stop Worrying and Love Retraining
https://github.com/astorfi/attention-guided-sparsity
Attention-Based Guided Structured Sparsity of Deep Neural Networks
compressedsensing.jl
Contains a wide-ranging collection of compressed sensing and feature selection algorithms. Examples include matching pursuit algorithms, forward and backward stepwise regression, sparse Bayesian learning, and basis pursuit.
sms
Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"
https://github.com/danielathome19/engram-neural-network
tensorflow-engram: A Python package for Engram Neural Networks, adding biologically-inspired Hebbian memory and engram layers to TensorFlow/Keras models, supporting memory traces, plasticity, attention, and sparsity for neural sequence learning.