hierarchical_planning_decompose_net
Advancing Hierarchical Planning in Multi-Modal Task Decomposition Through Fine-Tuning Open Source LLMs
https://github.com/arash-shahmansoori/hierarchical_planning_decompose_net
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (11.7%) to scientific vocabulary
Repository
Advancing Hierarchical Planning in Multi-Modal Task Decomposition Through Fine-Tuning Open Source LLMs
Basic Info
- Host: GitHub
- Owner: arash-shahmansoori
- License: mit
- Language: Jupyter Notebook
- Default Branch: main
- Size: 10.1 MB
Statistics
- Stars: 7
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
Advancing Hierarchical Planning in Multi-Modal Task Decomposition Through Fine-Tuning Open Source LLMs
https://medium.com/@arash.mansoori65/advancing-hierarchical-planning-in-multi-modal-task-decomposition-through-fine-tuning-open-source-2c93e984d434

Welcome to the repository dedicated to pioneering research in the realm of artificial intelligence, focusing on hierarchical planning and multi-modal task decomposition using an advanced fine-tuning process on the Mistral-7B version 2 Large Language Model (LLM). Here, you will find a comprehensive collection of resources, including the source code, dataset, and methodology used to achieve this groundbreaking work.
Overview
This project introduces a novel approach to equipping LLMs with hierarchical planning capabilities for decomposing composite tasks into multi-layer, actionable subtasks. By leveraging a custom dataset and employing sophisticated fine-tuning processes such as Quantized Low Rank Adapters (QLORA) and a unique TrImE (Trimming, Election, and Merging) process, we have successfully demonstrated a two-layer hierarchical planning mechanism. This enables our fine-tuned LLM to orchestrate complex, multi-modal tasks across different layers of abstraction, a critical advancement for the development of intelligent orchestrator agents.
Main Contributions
- First Open Source LLM with Hierarchical Planning: We present the first publicly available LLM fine-tuned to perform hierarchical planning for multi-modal task decomposition.
- Innovative Dataset for Hierarchical Planning: Our custom-built dataset, tailored for learning two-layer planning for multi-modal task decomposition, sets a new standard in the field.
Key Features
- Two-Layer Planning: Our model adeptly decomposes composite tasks into an abstract layer of keywords and modes, followed by a detailed layer outlining actionable subtasks and their sequences.
- Multi-Modal Task Support: The dataset and resulting model are proficient in handling tasks that span across text, image, video, and audio modalities, showcasing versatility in task decomposition.
- Open Source Access: We provide unrestricted access to all training source code, adapters, and a portion of the dataset, along with comprehensive documentation on the training and fine-tuning processes.
Repository Contents
- Training Source Code: Complete source code for fine-tuning the Mistral-7B v2 LLM, including the implementation of QLORA and the TrImE process.
- Adapters and Datasets: Pre-trained adapters (available in HuggingFace: arashmsn/HierarchicalPlanningDecomposition_Mistral-7B-Instruct-v0.2) and a 1000-row sample of our groundbreaking dataset designed for hierarchical planning of multi-modal task decomposition.
- Evaluation Results: Detailed benchmarks and evaluation results showcasing the effectiveness of the multi-layer planning approach.
- Mermaid Diagram: Visual representation of the fine-tuning and training workflow for enhanced understanding.
Quick Start
To begin exploring and utilizing this revolutionary approach to hierarchical planning in LLMs, please refer to the notebooks.
Mermaid Workflow Diagram
Below is a mermaid diagram illustrating the fine-tuning and training process devised for our project:
mermaid
graph TD;
A[Start: fine-tuning Mistral-7B v2 with QLORA] --> B[Training Adapter 1: Layer-1 Abstract Planning];
A --> C[Training Adapter 2: Layer-2 Detailed Planning];
B --> D[Merging Adapters];
C --> D;
D --> E[TrImE: Trimming, Election, and Merging Process];
E --> F[End: Model with Hierarchical Planning Capability];
Citation
If our work assists or inspires your research, please cite this project using the following:
bibtex
@misc{hierarchical_planning_2024,
title={Advancing Hierarchical Planning in Multi-Modal Task Decomposition Through Fine-Tuning Open Source LLMs},
author={Arash Shahmansoori},
year={2024},
howpublished={\url{https://github.com/arash-shahmansoori/hierarchical_planning_decompose_net.git}},
}
License
This project is open source and available under the MIT License.
We are excited to share our advancements with the community and eagerly anticipate the innovative applications and further research this project will inspire.
Owner
- Login: arash-shahmansoori
- Kind: user
- Location: Ireland
- Repositories: 1
- Profile: https://github.com/arash-shahmansoori
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Shahmansoori"
given-names: "Arash"
orcid: "https://orcid.org/0000-0001-5126-8005"
title: "Advancing Hierarchical Planning in Multi-Modal Task Decomposition Through Fine-Tuning Open Source LLMs"
version: "1.0.0"
date-released: "2024-03-28"
doi: ""
url: "https://github.com/arash-shahmansoori/hierarchical_planning_decompose_net.git"
GitHub Events
Total
- Watch event: 4
Last Year
- Watch event: 4