categorical_planning

Categorical Planning for Multi-Modal Tasks in Large Language Models

https://github.com/arash-shahmansoori/categorical_planning

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓ CITATION.cff file: found
✓ codemeta.json file: found
✓ .zenodo.json file: found
○ DOI references
○ Academic publication links
○ Academic email domains
○ Institutional organization owner
○ JOSS paper metadata
○ Scientific vocabulary similarity: low similarity (14.3%) to scientific vocabulary

Last synced: 10 months ago

Repository

Categorical Planning for Multi-Modal Tasks in Large Language Models

Basic Info

Host: GitHub
Owner: arash-shahmansoori
License: MIT
Language: Jupyter Notebook
Default Branch: main
Size: 14.2 MB

Statistics

Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Created about 2 years ago · Last pushed about 2 years ago

Metadata Files

Readme License Citation

Categorical Planning for Multi-Modal Tasks in Large Language Models

Figure: training and validation loss

Abstract

This research explores the significance of planning ahead in artificial intelligence, specifically within multi-modal composite tasks. We introduce a novel planning method that incorporates abstraction and category theory to enhance planning efficiency. Our approach uses concepts from category theory—objects and morphisms, redefined as subtasks and transitions—to establish planning categories at varying levels of abstraction. Our main contributions include the development of a specialized dataset for multi-modal planning, the application of category theory for detailed and abstract planning, and the introduction of a structured learning approach for integrating information across different planning categories. This work ultimately aims to improve the quality of multi-modal generated content through effective transition and transformation techniques.
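The mapping from category theory to planning can be made concrete with a small sketch. The code below is an illustration only, not the repository's implementation: the `Transition` class, the subtask names, and the `abstraction` map are all hypothetical. It treats subtasks as objects and transitions as morphisms, composes transitions into a plan, and maps a detailed plan to a coarser blueprint level.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Transition:
    """A morphism: a named transition from one subtask (object) to another."""
    name: str
    source: str
    target: str


def compose(first: Transition, second: Transition) -> Transition:
    """Compose two transitions, running `first` and then `second`."""
    if first.target != second.source:
        raise ValueError(f"cannot compose {first.name} with {second.name}")
    return Transition(f"{first.name};{second.name}", first.source, second.target)


# Hypothetical detailed-level plan for a multi-modal task.
write_script = Transition("write_script", "prompt", "script")
draw_frames = Transition("draw_frames", "script", "frames")
add_audio = Transition("add_audio", "frames", "video")

detailed_plan = compose(compose(write_script, draw_frames), add_audio)

# Hypothetical abstraction map from detailed subtasks to blueprint subtasks.
abstraction = {"prompt": "input", "script": "draft", "frames": "draft", "video": "output"}


def abstract(t: Transition) -> Transition:
    """Map a detailed transition to the blueprint level via `abstraction`."""
    return Transition(t.name, abstraction[t.source], abstraction[t.target])
```

In this sketch, abstracting the composed plan gives the same result as composing the abstracted transitions, which is the composition-preserving property one would expect of a map between planning categories. Identity morphisms are omitted for brevity.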

How to Use

Quick Start

To start exploring the proposed method, refer to the notebooks.

Alternatively, you can run the scripts locally by following the steps below.

    python -m venv .venv
    source .venv/bin/activate

Upgrade pip before installing the requirements.

    pip install --upgrade pip

Installation

Install all the necessary requirements.

    pip install -r requirements.txt

Dataset Generation

To create the dataset from scratch, run the run_dataset_creation.py script. A 1k-row sample of the dataset is already provided in the data directory.

Environment Variables

Create a .env file and set your OpenAI API key, along with any other API keys you need, e.g., for wandb and Hugging Face.
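Before starting a long run, it can help to check that the expected keys are actually set. The following is a minimal sketch using only the standard library. The variable names (`OPENAI_API_KEY`, `WANDB_API_KEY`, `HF_TOKEN`) are assumptions, since the README does not list the exact names the scripts read.

```python
import os
from typing import Mapping, Sequence

# Assumed variable names; the repository does not document the exact keys.
REQUIRED_KEYS = ("OPENAI_API_KEY",)
OPTIONAL_KEYS = ("WANDB_API_KEY", "HF_TOKEN")


def missing_keys(env: Mapping[str, str], keys: Sequence[str]) -> list[str]:
    """Return the keys that are absent or blank in `env`."""
    return [key for key in keys if not env.get(key, "").strip()]


def check_environment(env: Mapping[str, str] = os.environ) -> list[str]:
    """Raise if a required key is missing; return the unset optional keys."""
    missing = missing_keys(env, REQUIRED_KEYS)
    if missing:
        raise RuntimeError(f"Missing required variables: {', '.join(missing)}")
    return missing_keys(env, OPTIONAL_KEYS)
```

Since python-dotenv is listed in requirements.txt, one option is to call `load_dotenv()` from the `dotenv` package first so that the values in .env are visible in `os.environ`, then call `check_environment()`.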

Fine-Tuning

To run fine-tuning and save the fine-tuned model, tokenizer, and evaluation results, use the following commands.

For blueprint category:

    python train_categorical_blueprint_planning_aws.py

For detailed category:

    python train_categorical_detail_planning_aws.py

For merged categorical planning:

    python merge_categorical_blueprint_detail_planning.py

Checkpoints

The checkpoints for blueprint, detailed, and merged adapters have been made available in the following HuggingFace repositories, respectively:

    arashmsn/Blueprint_Planning_Mistral-7B-Instruct-v0.2
    arashmsn/Detailed_Planning_Mistral-7B-Instruct-v0.2
    arashmsn/Merge_Planning_Mistral-7B-Instruct-v0.2
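One possible way to load these checkpoints is sketched below. It rests on several assumptions that the README does not confirm: that the repositories hold PEFT adapters, that the base model is `mistralai/Mistral-7B-Instruct-v0.2` (inferred from the repository names), and that `transformers`, `peft`, and `accelerate` are installed. The `load_planner` helper is hypothetical.

```python
# Adapter repositories named in this README.
ADAPTERS = {
    "blueprint": "arashmsn/Blueprint_Planning_Mistral-7B-Instruct-v0.2",
    "detailed": "arashmsn/Detailed_Planning_Mistral-7B-Instruct-v0.2",
    "merged": "arashmsn/Merge_Planning_Mistral-7B-Instruct-v0.2",
}

# Assumption: the adapters were trained on top of this base model.
BASE_MODEL = "mistralai/Mistral-7B-Instruct-v0.2"


def load_planner(kind: str):
    """Load the base model and attach the chosen adapter (large download)."""
    if kind not in ADAPTERS:
        raise KeyError(f"unknown adapter {kind!r}; choose from {sorted(ADAPTERS)}")

    # Imported lazily so the heavy dependencies are only needed when loading.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL)
    # device_map="auto" relies on the accelerate package being installed.
    base = AutoModelForCausalLM.from_pretrained(BASE_MODEL, device_map="auto")
    model = PeftModel.from_pretrained(base, ADAPTERS[kind])
    return model, tokenizer
```

Calling `load_planner("blueprint")` would then return the adapted model and its tokenizer, provided the assumptions above hold.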

Results

Evaluation results on multi-modal composite tasks for the blueprint, detailed, and merged adapters are stored in the results directory. Other metrics, including training and evaluation losses, are provided in the assets directory.

To plot the categorical planner as a Mermaid graph, use plot_graph.py.
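For readers unfamiliar with Mermaid, the sketch below shows how a plan could be turned into a Mermaid flowchart definition. It does not reproduce plot_graph.py, whose internals are not described here. The `to_mermaid` function and the example plan are hypothetical.

```python
def to_mermaid(edges: list[tuple[str, str, str]], direction: str = "TD") -> str:
    """Render (source, label, target) edges as a Mermaid flowchart definition."""
    lines = [f"graph {direction}"]
    for source, label, target in edges:
        lines.append(f"    {source} -->|{label}| {target}")
    return "\n".join(lines)


# Hypothetical plan: subtasks are nodes, transitions are labelled edges.
plan = [
    ("prompt", "write_script", "script"),
    ("script", "draw_frames", "frames"),
    ("frames", "add_audio", "video"),
]
print(to_mermaid(plan))
```

The printed text can be pasted into any Mermaid renderer to view the plan as a top-down graph.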

Author

Arash Shahmansoori (arash.mansoori65@gmail.com)

License

This project is open-sourced under the MIT License, allowing for widespread use and contribution back to the community. Please refer to the MIT License file for detailed terms and conditions.

Acknowledgements

We extend our gratitude to the AI research community for the open-source tools that have made this work possible. Special thanks to the contributors of the Mistral LLM and the creators of the QLoRA technique for their groundbreaking contributions to the field of AI.

We invite contributors, researchers, and AI enthusiasts to join us in advancing the field of categorical planning for multi-modal AI research. Together, we can build powerful AI systems capable of understanding and executing complex composite tasks.

Owner

Login: arash-shahmansoori
Kind: user
Location: Ireland

Repositories: 1
Profile: https://github.com/arash-shahmansoori

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: "Shahmansoori"
    given-names: "Arash"
    orcid: "https://orcid.org/0000-0001-5126-8005"
title: "Categorical Planning for Multi-Modal Tasks in Large Language Models"
version: "1.0.0"
date-released: "2024-05-03"
doi: ""
url: "https://github.com/arash-shahmansoori/categorical_planning.git"


Dependencies

requirements.txt pypi

datasets *
openai *
protobuf ==3.20.1
python-dotenv *
s3fs *
sagemaker *
setuptools *
tiktoken *
torch *
tqdm *

scripts/requirements.txt pypi

bitsandbytes *
datasets *
pyarrow ==13.0.0
torch *
wandb *
