Projects

Updated 11 months ago

cambrian • Rank 9.0 • Science 54%

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

chatbot clip computer-vision dino instruction-tuning large-language-models llms mllm multimodal-large-language-models representation-learning

Updated 11 months ago

https://github.com/buaadreamer/qwen2-vl-history • Rank 2.4 • Science 13%

Qwen2-VL在文旅领域的LLaMA-Factory微调案例 The case for fine-tuning Qwen2-VL in the field of historical literature and museums

beauty history llama-factory mllm multimodal-large-language-models museum qwen2-vl supervised-finetuning

Updated 11 months ago

https://github.com/buaadreamer/chinese-llava-med • Science 13%

中文医学多模态大模型 Large Chinese Language-and-Vision Assistant for BioMedicine

ai chinese gpt4v huggingface-datasets llama-factory llava medical minigpt4 mllm multimodal qwen1-5 transformers

Updated 11 months ago

awesome-llms-meet-multimodal-generation • Science 67%

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

aigc large-language-models large-vision-language-models llm lvlm mllm multimodal-generation multimodal-large-language-models multimodal-models multimodality text-to-3d text-to-audio text-to-image text-to-music text-to-sound text-to-speech text-to-video

Updated 11 months ago

https://github.com/buaadreamer/mllm-finetuning-demo • Science 13%

使用LLaMA-Factory微调多模态大语言模型的示例代码 Demo of Finetuning Multimodal LLM with LLaMA-Factory

finetune-llm huggingface-datasets llama-factory llava lora mllm paligemma pretraining supervised-finetuning transformers yi-vl

Updated 11 months ago

spatialfusion-lm • Science 26%

SpatialFusion-LM is a real-time spatial reasoning framework that combines neural depth, 3D reconstruction, and language-driven scene understanding.

3d-estimation computer-vision depth-estimation foundation-models mllm multimodal-llm point-clouds robotics scene-understanding spatial-intelligence stereo-vision transformer vision-language-model vision-transformer zero-shot-learning

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

cambrian • Rank 9.0 • Science 54%

https://github.com/buaadreamer/qwen2-vl-history • Rank 2.4 • Science 13%

https://github.com/buaadreamer/chinese-llava-med • Science 13%

awesome-llms-meet-multimodal-generation • Science 67%

https://github.com/buaadreamer/mllm-finetuning-demo • Science 13%

spatialfusion-lm • Science 26%