LLM-LNS-Quantization
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (1.9%) to scientific vocabulary
Last synced: 6 months ago
·
JSON representation
·
Repository
Basic Info
- Host: GitHub
- Owner: pouya-haghi
- License: mit
- Language: Python
- Default Branch: main
- Size: 3.34 MB
Statistics
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Created over 2 years ago
· Last pushed over 1 year ago
Metadata Files
Readme
License
Citation
Codeowners
README.md
LLM-LNS-Quantization
This repository updates LMEvalHarness to add new quantization methods to it.
The new formats are: - LNS8, LNS4 - Dynamic LNS with optimizations (per-block quantization) in 8 and 4 bits - FP8, FP4 - MX block floating-point - ZeroQuant - VSQuant - INT8 (W8A8) - SmoothQuant - LLM.int8()
The file is under lm_eval/models/huggingface.py.
Owner
- Login: pouya-haghi
- Kind: user
- Repositories: 2
- Profile: https://github.com/pouya-haghi
Citation (CITATION.bib)
@software{eval-harness,
author = {Gao, Leo and
Tow, Jonathan and
Biderman, Stella and
Black, Sid and
DiPofi, Anthony and
Foster, Charles and
Golding, Laurence and
Hsu, Jeffrey and
McDonell, Kyle and
Muennighoff, Niklas and
Phang, Jason and
Reynolds, Laria and
Tang, Eric and
Thite, Anish and
Wang, Ben and
Wang, Kevin and
Zou, Andy},
title = {A framework for few-shot language model evaluation},
month = sep,
year = 2021,
publisher = {Zenodo},
version = {v0.0.1},
doi = {10.5281/zenodo.5371628},
url = {https://doi.org/10.5281/zenodo.5371628}
}
GitHub Events
Total
- Watch event: 1
Last Year
- Watch event: 1
Dependencies
Dockerfile
docker
- nvidia/cuda 11.2.0-cudnn8-runtime-ubuntu20.04 build
tests/Dockerfile
docker
- nvidia/cuda 11.2.0-cudnn8-runtime-ubuntu20.04 build
requirements.txt
pypi
setup.py
pypi
- accelerate >=0.17.1
- datasets >=2.0.0
- einops *
- jsonlines *
- numexpr *
- omegaconf >=2.2
- openai >=0.6.4
- peft >=0.2.0
- pybind11 >=2.6.2
- pycountry *
- pytablewriter *
- rouge-score >=0.0.4
- sacrebleu ==1.5.0
- scikit-learn >=0.24.1
- sqlitedict *
- torch >=1.7
- tqdm-multiprocess *
- transformers >=4.1
- zstandard *
tests/requirements.txt
pypi
tests/setup.py
pypi
- accelerate >=0.17.1
- datasets >=2.0.0
- einops *
- jsonlines *
- numexpr *
- omegaconf >=2.2
- openai >=0.6.4
- peft >=0.2.0
- pybind11 >=2.6.2
- pycountry *
- pytablewriter *
- rouge-score >=0.0.4
- sacrebleu ==1.5.0
- scikit-learn >=0.24.1
- sqlitedict *
- torch >=1.7
- tqdm-multiprocess *
- transformers >=4.1
- zstandard *