Updated 9 months ago

transformers • Rank 38.7 • Science 64%

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Updated 9 months ago

torchao • Rank 25.9 • Science 64%

PyTorch native quantization and sparsity for training and inference

Updated 9 months ago

onnx2tf • Rank 22.1 • Science 67%

Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massive Transpose extrapolation problem in onnx-tensorflow (onnx-tf). I don't need a Star, but give me a pull request.

Updated 9 months ago

lm-eval • Rank 28.4 • Science 59%

A framework for few-shot evaluation of language models.

Updated 9 months ago

smart-transformers • Rank 7.9 • Science 77%

Smart Transformers are a versatile machine learning tool that can be integrated with PyTorch, TensorFlow, and JAX. Smart Transformers provide accurate computations required for cryptographic algorithms. These transformers are independent modules, which makes it efficient to experiment with various research projects related to cryptanalysis.

Updated 9 months ago

uform • Rank 15.6 • Science 64%

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

Updated 9 months ago

en-transformer • Rank 11.6 • Science 67%

Implementation of E(n)-Transformer, which incorporates attention mechanisms into Welling's E(n)-Equivariant Graph Neural Network

Updated 9 months ago

rwkv-lm • Rank 11.2 • Science 67%

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.

Updated 9 months ago

torchdistill • Rank 15.0 • Science 59%

A coding-free framework built on PyTorch for reproducible deep learning studies. PyTorch Ecosystem. 🏆 26 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc. are implemented so far. 🎁 Trained models, training logs and configurations are available for ensuring reproducibility and benchmarking.

Updated 9 months ago

https://github.com/timeseriesai/tsai • Rank 21.4 • Science 46%

Time series Timeseries Deep Learning Machine Learning Python Pytorch fastai | State-of-the-art Deep Learning library for Time Series and Sequences in Pytorch / fastai

Updated 9 months ago

transformer-srl • Rank 10.4 • Science 54%

Reimplementation of a BERT-based model (Shi et al., 2019), currently the state-of-the-art for English SRL. This model also implements predicate disambiguation.

Updated 9 months ago

ultralyticspro • Rank 9.7 • Science 54%

🔥🔥🔥 Focused on improved models for YOLO11, YOLOv8, YOLOv12, YOLOv10, RT-DETR, YOLOv7, and YOLOv5. Supports improving the backbone, neck, head, loss, IoU, NMS and other modules 🚀

Updated 9 months ago

monitors4codegen • Rank 7.0 • Science 54%

Code and data artifact for the NeurIPS 2023 paper "Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context". `multispy` is an LSP client library in Python intended to be used to build applications around language servers.

Updated 9 months ago

sorsa • Rank 6.2 • Science 54%

SORSA: Singular Values and Orthonormal Regularized Singular Vectors Adaptation of Large Language Models

Updated 9 months ago

caspr • Rank 5.5 • Science 54%

CASPR is a deep learning framework applying transformer architecture to learn and predict from tabular data at scale.

Updated 9 months ago

text-case • Rank 14.9 • Science 44%

Convert text between `camelCase`, `PascalCase`, `Capital Case`, `Header-Case`, `Title Case`, `path/case`, `snake_case`, `param-case`, `dot.case`, `CONSTANT_CASE`, `lower case`, `lOWER CASE FIRST`, `UPPER CASE`, `Upper case first` and other

Updated 9 months ago

proteinn-structure-predictor • Rank 1.6 • Science 57%

A transformer network trained to predict end-to-end single sequence protein structure as a set of angles given amino acid sequences.

Updated 9 months ago

stormtrooper • Rank 7.6 • Science 44%

Zero/few shot learning components for scikit-learn pipelines with LLMs and transformers.

Updated 9 months ago

sparql-transformer • Rank 10.0 • Science 39%

A more handy way to use SPARQL data in your web app

Updated 9 months ago

gpt • Rank 1.8 • Science 44%

Generative Pre-trained Transformer in PyTorch from scratch

Updated 9 months ago

computer-vision-in-action • Rank 9.3 • Science 36%

A computer vision closed-loop learning platform where code can be run interactively online. A closed learning loop: the Chinese e-book "Computer Vision in Action: Algorithms and Applications", source code, and a reader community (continuously updated ...) 📘 Online e-book https://charmve.github.io/computer-vision-in-action/ 👇 Project homepage

Updated 9 months ago

gcvit • Rank 9.0 • Science 36%

Tensorflow 2.0 Implementation of GCViT: Global Context Vision Transformer

Updated 9 months ago

relora • Rank 6.1 • Science 26%

Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates

Updated 9 months ago

conformer • Rank 6.1 • Science 23%

Implementation of the convolutional module from the Conformer paper, for use in Transformers

Updated 9 months ago

band • Rank 8.7 • Science 13%

BAND: BERT Application aNd Deployment, a simple and efficient BERT model training and deployment framework.

Updated 9 months ago

https://github.com/bytedance/bytetransformer • Rank 6.9 • Science 13%

Optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052

Updated 9 months ago

text-sim • Rank 7.9 • Science 10%

Text similarity (matching) computation, providing baselines, training, inference, and metric analysis... The code includes both TensorFlow and PyTorch versions.

Updated 9 months ago

https://github.com/baldassarrefe/performer-gat-shapenet • Rank 0.7 • Science 13%

Performer vs. Graph Attention Network on ShapeNet with GradCAM explanations.

Updated 9 months ago

https://github.com/cosbidev/matnet • Rank 2.2 • Science 10%

Multi-Level Fusion and Self-Attention Transformer-Based Model for Multivariate Multi-Step Day-Ahead PV Generation Forecasting

Updated 9 months ago

decompose • Science 67%

Principal Component Analysis (PCA) Algorithm was implemented to determine the Functional Age of the Power Transformer using Return Voltage Measurement (RVM). [submitted]

Updated 9 months ago

arrakis-mi • Science 44%

Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.

Updated 9 months ago

git • Science 54%

[ECCV2024 Oral 🔥] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"

Updated 9 months ago

grte • Science 65%

Graph Receptive Transformer Encoder

Updated 9 months ago

intr • Science 75%

This is an official implementation for [ICLR'24] INTR: Interpretable Transformer for Fine-grained Image Classification.

Updated 9 months ago

ecg_classification • Science 49%

Official and maintained implementation of the paper "Exploring Novel Algorithms for Atrial Fibrillation Detection by Driving Graduate Level Education in Medical Machine Learning" (ECG-DualNet) [Physiological Measurement 2022, EMBC 2023].

Updated 9 months ago

memoria-pytorch • Science 41%

Memoria is a human-inspired memory architecture for neural networks.

Updated 9 months ago

https://github.com/amazon-science/gluonmm • Science 36%

A library of transformer models for computer vision and multi-modality research

Updated 9 months ago

https://github.com/amazon-science/tubelet-transformer • Science 26%

This is an official implementation of TubeR: Tubelet Transformer for Video Action Detection

Updated 9 months ago

hardvs • Science 54%

[AAAI-2024] HARDVS: Revisiting Human Activity Recognition with Dynamic Vision Sensors

Updated 9 months ago

plankassembly • Science 54%

[ICCV 2023] PlankAssembly: Robust 3D Reconstruction from Three Orthographic Views with Learnt Shape Programs

Updated 9 months ago

bitcoin-trader-ml • Science 44%

Automated 24/7 bitcoin trader for Coinbase using Transformer Neural Networks

Updated 9 months ago

community-aware-graph-transformer • Science 67%

Community-aware Graph Transformer (CGT) is a novel Graph Transformer model that utilizes community structures to address node degree biases in the message-passing mechanism. It is developed by NS Lab @ CUK on a pure PyTorch backend.

Updated 9 months ago

https://github.com/christophreich1996/swin-transformer-v2 • Science 10%

PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up Capacity and Resolution" [CVPR 2022].

Updated 9 months ago

spatialfusion-lm • Science 26%

SpatialFusion-LM is a real-time spatial reasoning framework that combines neural depth, 3D reconstruction, and language-driven scene understanding.

Updated 9 months ago

https://github.com/aveek-saha/transformer • Science 23%

A TensorFlow 2.0 Implementation of the Transformer: Attention Is All You Need

Updated 9 months ago

https://github.com/bentoml/transformers-nlp-service • Science 13%

Online Inference API for NLP Transformer models - summarization, text classification, sentiment analysis and more

Updated 9 months ago

final-state-transformer • Science 44%

Machine learning development toolkit built upon Transformer encoder network architectures and tailored for the realm of high-energy physics and particle-collision event analysis.

Updated 9 months ago

badtransformer • Science 44%

Simplified and easy-to-understand transformer

Updated 9 months ago

qaqg • Science 54%

Indonesian Question Answering and Question Generation using Transformers

Updated 9 months ago

network-traffic-transformer • Science 67%

Network Traffic Transformer to learn network dynamics from packet traces. Learn fundamental dynamics with pre-training and fine-tune to multiple applications.

Updated 9 months ago

mlsae • Science 41%

Multi-Layer Sparse Autoencoders (ICLR 2025)

Updated 9 months ago

comprehensive-transformer-tts • Science 36%

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS

Updated 9 months ago

cv-3315-is-all-you-need • Science 54%

CV′3315 Is All You Need – Semantic Segmentation Course Competition @ The University of Adelaide

Updated 9 months ago

open-muse • Science 54%

Open reproduction of MUSE for fast text2image generation.

Updated 9 months ago

spn4cir • Science 54%

[ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives

Updated 9 months ago

saits • Science 67%

The official PyTorch implementation of the paper "SAITS: Self-Attention-based Imputation for Time Series". A fast and state-of-the-art (SOTA) deep-learning neural network model for efficient time-series imputation (impute multivariate incomplete time series containing NaN missing data/values with machine learning). https://arxiv.org/abs/2202.08516

Updated 9 months ago

deno-nodejs-transformer • Science 44%

A Deno module for transforming a Deno package into a Node.js package. This module is a modified edition of the JSR package `dnt`.

Updated 9 months ago

wordart • Science 54%

The official code of CornerTransformer (ECCV 2022, Oral) on top of MMOCR.

Updated 9 months ago

https://github.com/dc-research/tempo • Science 36%

The official code for "TEMPO: Prompt-based Generative Pre-trained Transformer for Time Series Forecasting (ICLR 2024)". TEMPO is one of the very first open-source Time Series Foundation Models for forecasting tasks (version v1.0).

Updated 9 months ago

superpoint_transformer • Science 67%

Official PyTorch implementation of Superpoint Transformer introduced in [ICCV'23] "Efficient 3D Semantic Segmentation with Superpoint Transformer" and SuperCluster introduced in [3DV'24 Oral] "Scalable 3D Panoptic Segmentation As Superpoint Graph Clustering"

Updated 9 months ago

renderformer • Science 54%

Official Code Release for [SIGGRAPH 2025] RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination

Updated 9 months ago

faceformer • Science 54%

[CVPR 2022] Neural Face Identification in a 2D Wireframe Projection of a Manifold Object

Updated 9 months ago

https://github.com/cosmaadrian/strawberry-problem • Science 36%

Official repository for "The Strawberry Problem 🍓: Emergence of Character-level Understanding in Tokenized Language Models"

Updated 9 months ago

intr-paquetlab • Science 49%

Fork of Imageomics/INTR for paquetlab project

Updated 9 months ago

mazegpt • Science 44%

AI model for making mazes that extends OpenAI's GPT-2 model