Projects

Updated 10 months ago

lazyllm-llamafactory • Rank 25.6 • Science 77%

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

agent ai deepseek fine-tuning gemma gpt instruction-tuning large-language-models llama llama3 llm lora moe nlp peft qlora quantization qwen rlhf transformers

Updated 10 months ago

qonnx • Rank 18.7 • Science 77%

QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX

deep-learning fpga inference machine-learning onnx quantization quantized-neural-networks

Updated 10 months ago

torchao • Rank 25.9 • Science 64%

PyTorch native quantization and sparsity for training and inference

brrr cuda dtypes float8 inference llama mx offloading optimizer pytorch quantization sparsity training transformer

Updated 10 months ago

onnx2tf • Rank 22.1 • Science 67%

Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massive Transpose extrapolation problem in onnx-tensorflow (onnx-tf). I don't need a Star, but give me a pull request.

android coreml deep-learning docker keras lstm machine-learning model-converter models onnx onnx-tensorflow quantization tensorflow tensorflow-lite tfjs tflite transformer yolov9

Updated 10 months ago

llmcompressor • Rank 22.6 • Science 54%

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

compression quantization sparsity

Updated 10 months ago

@stdlib/ml-incr-kmeans • Rank 5.5 • Science 67%

Incrementally partition data into `k` clusters.

correlation cosine data-mining euclidean javascript k-means kmeans learning lloyds-algorithm machine math mathematics ml node node-js nodejs quantization statistics stats stdlib

Updated 10 months ago

qkeras • Rank 18.2 • Science 46%

QKeras: a quantization deep learning library for Tensorflow Keras

accelerator asic-design deep-learning fpga fpga-accelerator hardware-acceleration keras machine-learning quantization quantized-networks quantized-neural-networks tensorflow

Updated 10 months ago

optimum • Rank 27.2 • Science 36%

🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools

graphcore habana inference intel onnx onnxruntime optimization pytorch quantization tflite training transformers

Updated 10 months ago

q-galore • Rank 5.3 • Science 54%

Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.

large-language-models low-rank memory-efficient-learning quantization

Updated 10 months ago

chinese-llama-alpaca • Rank 12.2 • Science 46%

Chinese LLaMA & Alpaca large language models + local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)

alpaca alpaca-2 large-language-models llama llama-2 llm lora nlp plm pre-trained-language-models quantization

Updated 10 months ago

tf2deepfloorplan • Rank 6.2 • Science 51%

TF2 Deep FloorPlan Recognition using a Multi-task Network with Room-boundary-Guided Attention. Enable tensorboard, quantization, flask, tflite, docker, github actions and google colab.

attention-network curl deep-learning deep-neural-networks docker flask github-actions github-release google-colab image-processing image-recognition jupyter-notebook keras-tensorflow pygame pypi-package python3 quantization tensorboard tensorflow2 tflite

Updated 10 months ago

https://github.com/mobiusml/hqq • Rank 19.5 • Science 36%

Official implementation of Half-Quadratic Quantization (HQQ)

llm machine-learning quantization

Updated 10 months ago

https://github.com/cedrickchee/awesome-ml-model-compression • Rank 7.9 • Science 46%

Awesome machine learning model compression research papers, quantization, tools, and learning material.

awesome-list machine-learning model-compression neural-networks pruning quantization

Updated 10 months ago

https://github.com/aisuko/notebooks • Rank 4.6 • Science 26%

Implementation for the different ML tasks on Kaggle platform with GPUs.

accelerator computer-vision fine-tuning kaggle large-language-models multimodal natural-language-processing neural-network peft pytorch quantization renforcement-learning tensorboard transformers visulization wandb

Updated 10 months ago

https://github.com/beomi/bitnet-transformers • Rank 5.7 • Science 23%

0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch with Llama(2) Architecture

llm quantization quantization-aware-training transformers

Updated 10 months ago

active-time-theory-time-dilation • Science 44%

Re-interpreting Time Dilation Through the Lens of Active Time Theory

bigbang bigbangtheory blackhole gravity gravity-simulation nasa quantization quantum-algorithms quantum-computing quantum-mechanics quantum-physics time time-dilation timedilation

Updated 10 months ago

active-time-theory-atomic-clocks • Science 44%

The Secret Inner Workings of Time Exposed by Atomic Clocks

bigbang bigbangtheory blackhole gravity gravity-simulation nasa quantization quantum-algorithms quantum-computing quantum-mechanics quantum-physics time time-dilation timedilation

Updated 10 months ago

hailo_model_zoo • Science 26%

The Hailo Model Zoo includes pre-trained models and a full building and evaluation environment

ai-accelerators computer-vision deep-learning edge-ai hailo hailo8 quantization quantized-neural-networks

Updated 10 months ago

active-time-theory-quantum-tunneling • Science 44%

Simulation of Quantum Tunneling Dynamics Validates Signatures of Active Time

bigbang bigbangtheory blackhole gravity quantization quantum-algorithms quantum-computing quantum-information quantum-mechanics time

Updated 10 months ago

active-time-theory • Science 44%

The Foundations of Active Time Theory

bigbang bigbangtheory blackhole gravity quantization quantum quantum-algorithms quantum-computing quantum-information quantum-mechanics time

Updated 10 months ago

master-thesis • Science 44%

One Bit at a Time: Impact of Quantisation on Neural Machine Translation

encoder-decoder fully-quantized-transformer nmt pytorch quantization quantization-aware-training seq2seq transformer transformers

Updated 10 months ago

psumsim • Science 44%

PSumSim: A Simulator for Partial-Sum Quantization in Analog Matrix-Vector Multipliers

compute-in-memory histogram histograms matrix-vector-multiplication neural-networks numpy partial-sum quantization stochastic-modeling stochastic-simulation

Updated 10 months ago

https://github.com/autohdw/qublas • Science 26%

Quantized BLAS

blas cpp cpp23 meta-programming quantization template

