proteinworkshop
Benchmarking framework for protein representation learning. Includes a large number of pre-training and downstream task datasets, models and training/task utilities. (ICLR 2024)
https://github.com/amazon-science/bigdetection
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
awesome-robotics-3d
A curated list of 3D vision papers related to the robotics domain in the era of large models (i.e., LLMs/VLMs), inspired by awesome-computer-vision, including papers, code, and related websites
https://github.com/amazon-science/wqa-multi-sentence-inference
This repository contains code used for our Multi Sentence Inference NAACL'22 paper.
https://github.com/amazon-science/mix-generation
MixGen: A New Multi-Modal Data Augmentation
https://github.com/bytedance/twist
Official code for Self-Supervised Learning by Estimating Twin Class Distribution
graphg
GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation
transform-emr
This is a decoder-based transformer model that treats event prediction from EMR records as a sequential text generation problem. This project is part of my thesis research.
transformers-bart-pretrain
Script to pre-train Hugging Face Transformers BART with TensorFlow 2
https://github.com/bojarlab/gifflar
Glycan Informed Foundational Framework for Learning Abstract Representations, based on Combinatorial Complexes and Heterogeneous GNNs
https://github.com/buaadreamer/mllm-finetuning-demo
Example code for fine-tuning multimodal large language models with LLaMA-Factory