finetune-e2e-model-drivelm
Finetune DriveLM to improve generalization ability and enable interaction with human.
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.3%) to scientific vocabulary
Repository
Finetune DriveLM to improve generalization ability and enable interaction with human.
Basic Info
- Host: GitHub
- Owner: allrivertosea
- License: apache-2.0
- Language: Jupyter Notebook
- Default Branch: main
- Size: 7.66 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
Finetune-e2e-model-DriveLM
Finetune DriveLM to improve generalization ability and enable interaction with human.
Getting started
1. Installation
Clone Project
bash
git clone --recursive https://github.com/OpenDriveLab/DriveLM.git
Install
bash
conda create -n drivelm python=3.8
conda activate drivelm
cd ./DriveLM/challenge/llama_adapter_v2_multimodal7b
pip install -i https://mirrors.tuna.tsinghua.edu.cn/pypi/web/simple/ -r requirements.txt
2. Preparing Data
Download subset of datasets from Huggingface and perform data processing.
```bash https://huggingface.co/datasets/OpenDriveLab/DriveLM
cd ./DriveLM/challenge/ python extractdata.py python convertdata.py python convert2llama.py
```
3. Download pretrained models
```bash llama: https://huggingface.co/nyanko7/LLaMA-7B/tree/main
LLaMA-Adapter V2: https://github.com/OpenGVLab/LLaMA-Adapter/releases/tag/v.2.0.0 ```
Fine-tuning DriveLM
1. Fine-tuning
bash
cd ./DriveLM/challenge/llama_adapter_v2_multimodal7b
python main_finetune.py --data_config "/APP/DriveLM/challenge/llama_adapter_v2_multimodal7b/finetune_data_config.yaml" --batch_size 1 --epochs 3 --warmup_epochs 1 --blr 10e-4 --weight_decay 0.02 --llama_path "/APP/DriveLM/challenge/model_weights/llama_model_weights" --output_dir "/APP/DriveLM/outputs" --pretrained_path "/APP/DriveLM/challenge/model_weights/LLaMA-Adapter_V2/7fa55208379faf2dd862565284101b0e4a2a72114d6490a95e432cf9d9b6c813_BIAS-7B.pth" --log_dir "/APP/DriveLM/outputs/output.log"
2. Inference
bash
cd ./DriveLM/challenge/llama_adapter_v2_multimodal7b
python demo.py
3. Visualization
bash
cd ./DriveLM/challenge/llama_adapter_v2_multimodal7b
task_vis.ipynb
License and Citation
All assets and code in this repository are under the Apache 2.0 license unless specified otherwise. The language data is under CC BY-NC-SA 4.0. Other datasets (including nuScenes) inherit their own distribution licenses.
bibtex
@article{sima2023drivelm,
title={DriveLM: Driving with Graph Visual Question Answering},
author={Sima, Chonghao and Renz, Katrin and Chitta, Kashyap and Chen, Li and Zhang, Hanxue and Xie, Chengen and Luo, Ping and Geiger, Andreas and Li, Hongyang},
journal={arXiv preprint arXiv:2312.14150},
year={2023}
}
bibtex
@misc{contributors2023drivelmrepo,
title={DriveLM: Driving with Graph Visual Question Answering},
author={DriveLM contributors},
howpublished={\url{https://github.com/OpenDriveLab/DriveLM}},
year={2023}
}
Owner
- Login: allrivertosea
- Kind: user
- Repositories: 1
- Profile: https://github.com/allrivertosea
Citation (CITATION.cff)
cff-version: 1.2.0 message: "If you use this software, please cite it as below." authors: - name: "DriveLM Contributors" title: "Drive on Language" date-released: 2023-08-25 url: "https://github.com/OpenDriveLab/DriveLM/" license: Apache-2.0
GitHub Events
Total
- Push event: 1
Last Year
- Push event: 1
Dependencies
- Pillow *
- fairscale *
- gradio *
- openai *
- opencv-python *
- sentencepiece *
- tenacity *
- tensorboard *
- torch ==2.0.0
- torchvision ==0.15.1
- tqdm *