Spleeter
Spleeter: a fast and efficient music source separation tool with pre-trained models - Published in JOSS (2020)
instances
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
transformers
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
uform
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
chronos-forecasting
Chronos: Pretrained Models for Probabilistic Time Series Forecasting
hugsvision
HugsVision is a easy to use huggingface wrapper for state-of-the-art computer vision
cppe5
Code for our paper CPPE - 5 (Medical Personal Protective Equipment), a new challenging object detection dataset
https://github.com/superduper-io/superduper
Superduper: End-to-end framework for building custom AI applications and agents.
https://github.com/bigscience-workshop/petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
https://github.com/csinva/gan-vae-pretrained-pytorch
Pretrained GANs + VAEs + classifiers for MNIST/CIFAR in pytorch.
https://github.com/deepset-ai/farm
:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
https://github.com/dc-research/tempo
The official code for "TEMPO: Prompt-based Generative Pre-trained Transformer for Time Series Forecasting (ICLR 2024)". TEMPO is one of the very first open source Time Series Foundation Models for forecasting task v1.0 version.
https://github.com/awslabs/gap-text2sql
GAP-text2SQL: Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training
https://github.com/cedrickchee/pytorch-pretrained-bert
PyTorch version of Google AI's BERT model with script to load Google's pre-trained models
https://github.com/cedrickchee/pretrained-models.pytorch
Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.
https://github.com/cvjena/cnn-models
ImageNet pre-trained models with batch normalization for the Caffe framework
https://github.com/awslabs/pptod
Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System (ACL 2022)
fuzi.mingcha
夫子•明察司法大模型是由山东大学、浪潮云、中国政法大学联合研发,以 ChatGLM 为大模型底座,基于海量中文无监督司法语料与有监督司法微调数据训练的中文司法大模型。该模型支持法条检索、案例分析、三段论推理判决以及司法对话等功能,旨在为用户提供全方位、高精准的法律咨询与解答服务。
https://github.com/cyberagentailab/webcolor
Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.
https://github.com/amirzenoozi/models-with-tensorflow
AI Models Implementation on Tensorflow
FMAT
😷 The Fill-Mask Association Test (FMAT): Measuring Propositions in Natural Language.
https://github.com/yinghao-li/muben
MUBen: Benchmarking the Uncertainty of Molecular Representation Models
https://github.com/amazon-science/lc-plm
LC-PLM: long-context protein language model based on BiMamba-S architecture