paddlemix_comfyui
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ○ CITATION.cff file
- ✓ codemeta.json file (found codemeta.json file)
- ✓ .zenodo.json file (found .zenodo.json file)
- ○ DOI references
- ○ Academic publication links
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: low similarity (5.8%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: liuxiao-guan
- License: apache-2.0
- Language: Python
- Default Branch: main
- Size: 177 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
**2025.04.21 FLUX**
* PaddleMIX, FLUX (4.21): https://www.wjx.top/vm/QTuwoyG.aspx?udsid=997416

**2025.03.31 Qwen2.5VL, XPU**
* AI, Qwen2.5VL, PaddleMIX, P800

**2025.03.17 Qwen2.5VL**
* PaddlePaddle, Qwen2.5VL, PaddleMIX, vLLM, 10%-30%

**2025.01.20 (AIStudio)**
* AI
* AI Studio

**2025.01.08 PP-VCtrl**
* PP-VCtrl

**2025.01.02 PP-DocBee**
* PP-DocBee, SOTA
**2024.10.31 [paddlemix_applications.md](paddlemix_applications.md)**
* 96,30,25
* [AI Studio](https://aistudio.baidu.com/aistudio/community/multimodal?from=singlemessage)

**2024.10.11 PaddleMIX v2.1**
* [PaddleNLP 3.0 beta](https://github.com/PaddlePaddle/PaddleNLP/releases/tag/v3.0.0-beta0)
* [Qwen2-VL](./paddlemix/examples/qwen2_vl/), [InternVL2](./paddlemix/examples/internvl2/), [Stable Diffusion 3 (SD3)](https://github.com/PaddlePaddle/PaddleMIX/blob/develop/ppdiffusers/examples/dreambooth/README_sd3.md)
* [PP-InsCapTagger](./paddlemix/datacopilot/example/pp_inscaptagger/): 50%
* InternVL2, LLaVA, SD3, SDXL: 910B

**2024.07.25 PaddleMIX v2.0**
* LLaVA, Qwen-VL, Auto, SFT, mixtoken, SFT 5.6
* [PPDiffusers 0.24.1](./ppdiffusers/README.md): LCM, peft, accelerate, ComfyUI
* [DataCopilot](./paddlemix/datacopilot/)

**2023.10.7 PaddleMIX v1.0**
* BLIP-2
* [AppFlow](./applications/README.md): 11
* [PPDiffusers](./ppdiffusers/README.md) 0.19.3, SDXL
PaddleMIX
[Showcase table: images not preserved. Surviving panel titles: ComfyUI, R1+MIX, AI50+Lora]
PaddleMIX4D``benchmark
PaddleMIX: PP-DocBee, PP-VCtrl, DataCopilot
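1. Clone the PaddleMIX repository

```bash
git clone https://github.com/PaddlePaddle/PaddleMIX
cd PaddleMIX
```

2. Create a virtual environment

```bash
conda create -n paddlemix python=3.10 -y
conda activate paddlemix
```

3. Install PaddlePaddle

Option 1: GPU/CPU

- CUDA 11.x or 12.3
- PaddlePaddle 3.0.0b2

```bash
sh build_paddle_env.sh
```

Option 2: follow the PaddlePaddle Installation guide.

4. Install dependencies

Option 1: run the script:

```bash
sh build_env.sh
```

Option 2: install manually:

```bash
# Install PaddleMIX
pip install -e .
# Install ppdiffusers
cd ppdiffusers
pip install -e .
cd ..
```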
5. Verify the installation

```bash
sh check_env.sh
```

Expected versions:
- paddlepaddle: 3.0.0b2 or develop
- paddlenlp: 3.0.0b2
- ppdiffusers: 0.29.0
- huggingface_hub: 0.23.0

6. Custom operators

- FastLayerNorm, FusedLayerNorm (EVA-CLIP, DIT_LLAMA)
- CUDA

```bash
cd paddlemix/external_ops
python setup.py install
```
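The version list printed after `check_env.sh` can also be checked from Python. The following is a minimal sketch and not the contents of `check_env.sh`: the names `EXPECTED`, `ALIASES`, `installed_version`, and `check_versions` are hypothetical, and it assumes PaddlePaddle may be installed under either the `paddlepaddle-gpu` or the `paddlepaddle` distribution name. It only does an exact string comparison, so a develop build of PaddlePaddle is reported as a mismatch and needs a manual look.

```python
from importlib import metadata

# Versions listed in the README; this mapping is an illustrative assumption.
EXPECTED = {
    "paddlepaddle": "3.0.0b2",
    "paddlenlp": "3.0.0b2",
    "ppdiffusers": "0.29.0",
    "huggingface_hub": "0.23.0",
}

# Distribution names to try for each logical package (assumption).
ALIASES = {"paddlepaddle": ("paddlepaddle-gpu", "paddlepaddle")}


def installed_version(name, get_version=metadata.version):
    """Return the installed version string, or None if not installed."""
    for dist in ALIASES.get(name, (name,)):
        try:
            return get_version(dist)
        except metadata.PackageNotFoundError:
            continue
    return None


def check_versions(expected=EXPECTED, get_version=metadata.version):
    """Map each package to (installed_version_or_None, matches_expected)."""
    report = {}
    for name, wanted in expected.items():
        found = installed_version(name, get_version)
        report[name] = (found, found == wanted)
    return report


if __name__ == "__main__":
    for name, (found, ok) in check_versions().items():
        status = "OK" if ok else "CHECK"
        print(f"{status:5} {name}: installed={found}, expected={EXPECTED[name]}")
```

The `get_version` parameter exists only so the lookup can be replaced with a stub in tests; in normal use the default `importlib.metadata.version` is enough.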
-
- LLaVA
benchmark - benchmark - benchmark
PP-DocBee
PaddleMIXPP-DocBeeSOTA
PP-VCtrl
PaddleMIXPP-VCtrl
DataCopilot
PaddleMIXDataCopilotPaddleMIX``DataCopilot
PP-InsCapTagger (Instance Capability Tagger) is a DataCopilot component built on PaddleMIX. The table below compares LLaVA SFT runs, including runs that use 50% of the SFT data.
| Model | ScienceQA | TextVQA | VQAv2 | GQA | MMMU | MME |
|---|---|---|---|---|---|---|
| llava-1.5-7b (origin) | 66.8 | 58.2 | 78.5 | 62 | - | - |
| llava-1.5-7b (rerun) | 69.01 | 57.6 | 79 | 62.95 | 36.89 | 1521 / 323 |
| llava-1.5-7b (random 50%) | 67.31 | 55.6 | 76.89 | 61.01 | 34.67 | 1421 / 286 |
| **llava-1.5-7b (our 50%)** | **70.24** *(+2.93)* | **57.12** *(+1.52)* | **78.32** *(+1.43)* | **62.14** *(+1.13)* | **37.11** *(+2.44)* | **1476** *(+55)* / **338** *(+52)* |

See [pp_inscaptagger](paddlemix/datacopilot/example/pp_inscaptagger/readme.md).
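The gains in parentheses are not labeled with a baseline. The arithmetic below suggests they are measured against the random 50% row rather than the full rerun. This is a small sketch using the table's own numbers; the names `RANDOM_50`, `OURS_50`, and `gains` are hypothetical, and `MME_1` / `MME_2` simply stand for the first and second MME values in each cell.

```python
# Scores copied from the table; key names MME_1 / MME_2 are placeholders
# for the two MME values reported per row.
RANDOM_50 = {
    "ScienceQA": 67.31, "TextVQA": 55.6, "VQAv2": 76.89,
    "GQA": 61.01, "MMMU": 34.67, "MME_1": 1421, "MME_2": 286,
}
OURS_50 = {
    "ScienceQA": 70.24, "TextVQA": 57.12, "VQAv2": 78.32,
    "GQA": 62.14, "MMMU": 37.11, "MME_1": 1476, "MME_2": 338,
}


def gains(ours, baseline):
    """Per-benchmark difference, rounded to two decimals."""
    return {name: round(ours[name] - baseline[name], 2) for name in ours}


if __name__ == "__main__":
    for name, delta in gains(OURS_50, RANDOM_50).items():
        print(f"{name}: {delta:+}")
```

Each computed difference matches the parenthesized value in the "our 50%" row, which is consistent with the random 50% subset being the comparison baseline.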
FAQ
PaddleMIX Hugging Face Transformers Hugging Face
PaddleMIX contributors: co63oc, CrazyBoyM, KPCOFGS, pkhk-1, 1649759610, DrRyanHuang, zhiboniu, cocoshe, sneaxiy, yangrongxinuser, cheng221, Liyulingyue, zhoutianzi666, Birdylx, FeixLiu, Tsaiyue, fightfat, warrentdrew, swagger-coder, ...
@misc{paddlemix2023,
title={PaddleMIX, Paddle Multimodal Integration and eXploration.},
author={PaddlePaddle Authors},
howpublished = {\url{https://github.com/PaddlePaddle/PaddleMIX}},
year={2023}
}
Owner
- Login: liuxiao-guan
- Kind: user
- Repositories: 3
- Profile: https://github.com/liuxiao-guan
GitHub Events
Total
- Push event: 2
- Public event: 1
- Create event: 1
Last Year
- Push event: 2
- Public event: 1
- Create event: 1
Dependencies
- ppdiffusers *
- requests *
- Pillow *
- av *
- datasets *
- gradio ==4.44.1
- pandas *
- tqdm *
- matplotlib ==3.7.5
- nltk *
- numpy ==1.23.5
- opencc ==1.1.6
- paddlenlp >=3.0.0b2
- scipy ==1.12.0
- megfile *
- natsort *
- paddlenlp ==3.0.0b3
- numpy *
- opencv-python ==4.6.0.66
- openmim *
- paddlepaddle-gpu *
- supervision ==0.18.0
- tokenizers *
- wheel *
- MonkeyType ==23.3.0
- PyYAML *
- click *
- h5py *
- librosa >=0.11.0
- matplotlib *
- numpy ==1.26.4
- praat-parselmouth ==0.4.3
- pyworld ==0.3.4
- resampy *
- scipy >=1.10.0
- tqdm *
- paddlenlp ==3.0.0b2
- pillow *
- tqdm *
- paddlenlp >=3.0.0b2
- paddlenlp >=3.0.0b2
- paddlenlp >=3.0.0b2
- ligo-segments *
- ppdiffusers >=0.16.3
- Pillow ==9.5.0
- av ==11.0.0
- decord ==0.6.0
- einops ==0.7.0
- imageio ==2.33.0
- imageio-ffmpeg ==0.4.9
- numpy ==1.23.5
- omegaconf ==2.2.3
- opencv-contrib-python ==4.8.1.78
- opencv-python ==4.8.1.78
- scikit-image ==0.21.0
- scikit-learn ==1.3.2
- scipy ==1.11.4
- tqdm ==4.66.1
- gradio ==4.16.0
- insightface *
- onnxruntime *
- paddlenlp ==2.7.2
- beartype *
- ftfy *
- gdown *
- imageio *
- imageio-ffmpeg *
- pandarallel *
- pandas *
- pyarrow *
- pyav *
- tqdm *
- gradio >=4.0.0
- paddlenlp ==2.7.2
- Pillow *
- paddlenlp >=2.7.2
- ppdiffusers >=0.24.0
- Pillow *
- albumentations *
- fastcore *
- opencv-python *
- opencv-python-headless <=4.3
- paddlenlp >=2.7.2
- ppdiffusers >=0.24.0
- pyyaml *
- scipy *
- visualdl *
- Pillow *
- fastcore *
- paddlenlp >=2.7.2
- ppdiffusers >=0.19.3
- visualdl *
- Pillow *
- fastcore *
- paddlenlp >=2.7.2
- requests *
- tqdm *
- einops *
- paddlenlp >=2.7.1
- ppdififusers >=0.19.4
- visualdl *
- basicsr ==1.4.2
- cchardet *
- gradio ==3.16.2
- opencv-python *
- paddlehub >=2.3.1
- paddlenlp >=2.7.2
- paddleseg ==2.7.0
- ppdiffusers >=0.24.0
- Pillow *
- paddlenlp >=2.7.2
- ppdiffusers >=0.24.0
- Pillow *
- wandb ==0.19.4
- Pillow *
- paddlenlp >=2.7.2
- ppdiffusers >=0.24.0
- visualdl *
- einops *
- decord *
- gradio ==5.12.0
- hydra-core *
- iopath *
- moviepy ==1.0.3
- paddlex ==3.0.0b2
- pycocoevalcap *
- supervision *
- fastcore *
- safetensors *
- visualdl *
- charset_normalizer >=3.3.2
- einops *
- opencv-python *
- opencv-python *
- paddlehub >=2.3.1
- paddlenlp >=2.5.1
- paddleseg >=2.7.0
- pillow ==9.4.0
- Pillow *
- paddlenlp >=2.7.2
- ppdiffusers >=0.24.0
- Jinja2 *
- Pillow *
- datasets *
- ftfy *
- ppdiffusers >=0.24.0
- wandb *
- Pillow *
- fastcore *
- paddlenlp >=2.4.5
- ppdiffusers >=0.6.3
- visualdl *
- Pillow *
- fastcore *
- paddlenlp >=2.7.2
- paddlepaddle-gpu >=2.6.0
- ppdiffusers >=0.19.4
- visualdl *
- accimage *
- av *
- decord *
- einops >=0.6.1
- omegaconf *
- Pillow *
- paddlenlp >=2.7.2
- ppdiffusers >=0.24.0
- Pillow *
- datasets *
- paddlenlp >=2.7.2
- ppdiffusers >=0.24.0
- visualdl *
- beartype *
- einops *
- academictorrents *
- albumentations ==1.4.4
- einops *
- jsonargparse >=4.27.7
- lpips *
- omegaconf *
- requests *
- Pillow *
- av ==13.1.0
- einops >=0.6.1
- ftfy *
- hf_transfer *
- huggingface_hub *
- ligo-segments *
- note_seq *
- omegaconf *
- opencv-python *
- paddlenlp >=3.0.0b2
- paddlesde *
- parameterized *
- regex *
- requests_mock *
- safetensors >=0.3.1
- urllib3 <=2.0.0
- diffusers *
- omegaconf *
- paddlenlp >=2.5.0
- paddlepaddle-gpu *
- ppdiffusers >0.9.0
- torch *
- transformers *
- Pillow * test
- albumentations * test
- cchardet * test
- datasets * test
- einops * test
- fastcore * test
- ligo-segments * test
- opencv-python * test
- opencv-python-headless <=4.3 test
- paddlenlp >=2.7.2 test
- ppdiffusers >=0.24.0 test
- pyyaml * test
- requests * test
- safetensors * test
- scipy * test
- visualdl * test
- Pillow *
- av *
- brisque *
- decord >=0.6.0
- einops >=0.6.1
- ftfy *
- h5py *
- jsonschema >=4.19.0
- librosa *
- numpy ==1.23.5
- opencv-python *
- paddlenlp >=3.0.0b3
- pyav *
- pycocoevalcap *
- referencing >=0.32.1
- regex *
- soundfile *
- tensorboardX *