hierarchical_planning_decompose_net

Advancing Hierarchical Planning in Multi-Modal Task Decomposition Through Fine-Tuning Open Source LLMs

https://github.com/arash-shahmansoori/hierarchical_planning_decompose_net

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.7%) to scientific vocabulary
Last synced: 10 months ago · JSON representation ·

Repository

Advancing Hierarchical Planning in Multi-Modal Task Decomposition Through Fine-Tuning Open Source LLMs

Basic Info
  • Host: GitHub
  • Owner: arash-shahmansoori
  • License: mit
  • Language: Jupyter Notebook
  • Default Branch: main
  • Size: 10.1 MB
Statistics
  • Stars: 7
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created about 2 years ago · Last pushed about 2 years ago
Metadata Files
Readme License Citation

README.md

Advancing Hierarchical Planning in Multi-Modal Task Decomposition Through Fine-Tuning Open Source LLMs

https://medium.com/@arash.mansoori65/advancing-hierarchical-planning-in-multi-modal-task-decomposition-through-fine-tuning-open-source-2c93e984d434

Train-Validation-Loss

Welcome to the repository dedicated to pioneering research in the realm of artificial intelligence, focusing on hierarchical planning and multi-modal task decomposition using an advanced fine-tuning process on the Mistral-7B version 2 Large Language Model (LLM). Here, you will find a comprehensive collection of resources, including the source code, dataset, and methodology used to achieve this groundbreaking work.

Overview

This project introduces a novel approach to equipping LLMs with hierarchical planning capabilities for decomposing composite tasks into multi-layer, actionable subtasks. By leveraging a custom dataset and employing sophisticated fine-tuning processes such as Quantized Low Rank Adapters (QLORA) and a unique TrImE (Trimming, Election, and Merging) process, we have successfully demonstrated a two-layer hierarchical planning mechanism. This enables our fine-tuned LLM to orchestrate complex, multi-modal tasks across different layers of abstraction, a critical advancement for the development of intelligent orchestrator agents.

Main Contributions

  • First Open Source LLM with Hierarchical Planning: We present the first publicly available LLM fine-tuned to perform hierarchical planning for multi-modal task decomposition.
  • Innovative Dataset for Hierarchical Planning: Our custom-built dataset, tailored for learning two-layer planning for multi-modal task decomposition, sets a new standard in the field.

Key Features

  • Two-Layer Planning: Our model adeptly decomposes composite tasks into an abstract layer of keywords and modes, followed by a detailed layer outlining actionable subtasks and their sequences.
  • Multi-Modal Task Support: The dataset and resulting model are proficient in handling tasks that span across text, image, video, and audio modalities, showcasing versatility in task decomposition.
  • Open Source Access: We provide unrestricted access to all training source code, adapters, and a portion of the dataset, along with comprehensive documentation on the training and fine-tuning processes.

Repository Contents

  • Training Source Code: Complete source code for fine-tuning the Mistral-7B v2 LLM, including the implementation of QLORA and the TrImE process.
  • Adapters and Datasets: Pre-trained adapters (available in HuggingFace: arashmsn/HierarchicalPlanningDecomposition_Mistral-7B-Instruct-v0.2) and a 1000-row sample of our groundbreaking dataset designed for hierarchical planning of multi-modal task decomposition.
  • Evaluation Results: Detailed benchmarks and evaluation results showcasing the effectiveness of the multi-layer planning approach.
  • Mermaid Diagram: Visual representation of the fine-tuning and training workflow for enhanced understanding.

Quick Start

To begin exploring and utilizing this revolutionary approach to hierarchical planning in LLMs, please refer to the notebooks.

Mermaid Workflow Diagram

Below is a mermaid diagram illustrating the fine-tuning and training process devised for our project:

mermaid graph TD; A[Start: fine-tuning Mistral-7B v2 with QLORA] --> B[Training Adapter 1: Layer-1 Abstract Planning]; A --> C[Training Adapter 2: Layer-2 Detailed Planning]; B --> D[Merging Adapters]; C --> D; D --> E[TrImE: Trimming, Election, and Merging Process]; E --> F[End: Model with Hierarchical Planning Capability];

Citation

If our work assists or inspires your research, please cite this project using the following:

bibtex @misc{hierarchical_planning_2024, title={Advancing Hierarchical Planning in Multi-Modal Task Decomposition Through Fine-Tuning Open Source LLMs}, author={Arash Shahmansoori}, year={2024}, howpublished={\url{https://github.com/arash-shahmansoori/hierarchical_planning_decompose_net.git}}, }

License

This project is open source and available under the MIT License.


We are excited to share our advancements with the community and eagerly anticipate the innovative applications and further research this project will inspire.

Owner

  • Login: arash-shahmansoori
  • Kind: user
  • Location: Ireland

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: "Shahmansoori"
    given-names: "Arash"
    orcid: "https://orcid.org/0000-0001-5126-8005"
title: "Advancing Hierarchical Planning in Multi-Modal Task Decomposition Through Fine-Tuning Open Source LLMs"
version: "1.0.0"
date-released: "2024-03-28"
doi: ""
url: "https://github.com/arash-shahmansoori/hierarchical_planning_decompose_net.git"

GitHub Events

Total
  • Watch event: 4
Last Year
  • Watch event: 4