awesome-llm-agents

A Collection of High Quality research papers and open-source projects about LLM-agents

https://github.com/junhua/awesome-llm-agents

Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
✓
DOI references
Found 1 DOI reference(s) in README
✓
Academic publication links
Links to: arxiv.org
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (9.2%) to scientific vocabulary

Last synced: 10 months ago · JSON representation ·

Repository

A Collection of High Quality research papers and open-source projects about LLM-agents

Basic Info

Host: GitHub
Owner: junhua
License: apache-2.0
Default Branch: main
Size: 105 KB

Statistics

Stars: 47
Watchers: 3
Forks: 6
Open Issues: 1
Releases: 1

Created almost 2 years ago · Last pushed over 1 year ago

Metadata Files

Readme License Citation

Awesome-LLM-Agents: Recent Trends and Advancement in Agentic AI

This Awesome-LLM-Agents contains A hand-picked and carefully categorised reading list. Furthermore, I will conduct review on each paper and project, and (hopefully) put them in anto a survey paper. The detailed thought process of forming this project is documented at this Medium Post. It's put behind a paywall to prevent the evil LLMs' crawling. The full category breakdown.

LLM Core — Foundation Models

Scaling laws for neural language models (OpenAI, 2020, arXiv)
LLaMA: Open and Efficient Foundation Language Models (Meta, Feb 2023, arXiv)
The Llama 3 Herd of Models (Meta, July 2024, arXiv)
Sparks of Artificial General Intelligence: Early experiments with GPT-4 (Microsoft, Apr 2023, arXiv)
Apple Intelligence Foundation Language Models (Apple, Doc)
StarCoder (Dec 2023, arXiv)
Gemma 2B: Improving Open Language Models at a Practical Size (Jul, 2024, arXiv)

LLM Core — Prompt Engineering

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models (Google Brain, 2022, NeurIPS)
Tree of Thoughts: Deliberate Problem Solving with Large Language Models (Princeton & DeepMind, 2023, NeurIPS, Benchmark)
Self-Consistency Improves Chain of Thought Reasoning in Language Models (Google Brain, 2023, ICLR)
ReAct: Synergizing Reasoning and Action in Language Models (Princeton & Google Brain, Mar 2023 ICLR)
Reflexion: Language agents with verbal reinforcement learning (Northeastern, MIT & Princeton, 2023, NeurIPS)
ART: Automatic multi-step reasoning and tool-use for large language models (UW, UCI, Microsoft, Allen AI & Meta, 2023, arXiv)
Directional Stimulus Prompting (UCSB & Microsoft, 2023, NeurIPS)
Active Prompting with Chain-of-Thought for Large Language Models (HKUST etc., Jul 2024, arXiv)
Step-Back Prompting Enables Reasoning Via Abstraction in Large Language Models (DeepMind, Mar 2024, arXiv)

LLM Core — Retrieval-Augmented Generation

Retrieval Augmented Generation or Long-Context LLMs? A Comprehensive Study and Hybrid Approach (DeepMind, Jul 2024, arXiv)
Retrieval-Augmented Generation for Large Language Models: A Survey (Tongji & Fudan, Mar 2024, arXiv)
Improving Retrieval Augmented Language Model with Self-Reasoning (Baidu, Jul 2024, arXiv)

LLM Core — Finetuning / PEFT

Lora: Low-rank adaptation of large language models (Microsoft & CMU, Oct 2021, arXiv)
QLoRA: Efficient Finetuning of Quantized LLMs (UW, 2023, NeurIPS)
A Survey on LoRA of Large Language Models (ZJU, Jul 2024, arXiv)
Distilling System 2 into System 1 (Meta, Jul 2024, arXiv)
Mixture of LoRA Experts (Microsoft & Tsinghua, 2024, ICLR)

LLM Core — Alignments and Safety

Rule Based Rewards for Language Model Safety (OpenAI, Jul 2024, Preprint)
A Comprehensive Survey of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO and More (Salesforce, Jul 2024, arXiv)
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study (Tsinghua, Apr 2024, arXiv)
PERL: Parameter Efficient Reinforcement Learning from Human Feedback (Google, Mar 2024, arXiv)
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback (Google, Dec 2023, arXiv)
Training language models to follow instructions with human feedback (OpenAI, Mar 2022, arXiv)
Constitutional AI: Harmlessness from AI Feedback (Anthropic, Dec 2022, arXiv) ⭐
Self-instruct: Aligning language models with self-generated instructions (Allen AI, May 2023, ACL) ⭐
Direct preference optimization: Your language model is secretly a reward model (Stanford, 2023, NeurIPS)⭐
ULTRAFEEDBACK: Boosting Language Models with Scaled AI Feedback (Tsinghua, UIUC, Tencent, RUC etc., 2024, ICML) ⭐
Camels in a changing climate: Enhancing lm adaptation with tulu 2 (Allen AI & UW, Nov 2023, arXiv)

- Steerlm: Attribute conditioned sft as an (user-steerable) alternative to rlhf (Nvidia, Oct 2023, arXiv)

LLM Core — Datasets, benchmarks, Metrics

GAIA: a benchmark for general AI assistants (Meta, Nov 2023, ICLR)
Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators (Stanford, Apr 2024, arXiv)
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena (UCB, UCSD, CMU & Stanford, Dec 2023, NeurIPS)
FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets (KAIST, Apr 2024, ICLR)
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference (UCB, Stanford & UCSD, Mar 2024, arXiv)
Starling-7B: Improving LLM Helpfulness & Harmlessness with RLAIF (UCB, 2023, HF)
Lmsys-chat-1m: A large-scale real-world llm conversation dataset (UCB, UCSD, CMU & Stanford, Mar 2024, ICLR)

Agent Core — Planning / Reasoning Describe

Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents (PKU, 2024, NIPS)
Large Language Models as Commonsense Knowledge for Large-Scale Task Planning (NUS, 2023 , NIPS)

Agent Core — Memory

A Survey on the Memory Mechanism of Large Language Model based Agents (RUC &Huawei, Apr 2024, arXiv)

Agent Core — Tools

Offline Training of Language Model Agents with Functions as Learnable Weights (PSU, UW, USC & Microsoft, 2024, ICML)
Tool Learning with Foundation Models (Tsinghua, UIUC, CMU, etc., 2023, arXiv)
Toolformer: Language models can teach themselves to use tools (Meta, 2023, NeurIPS)

Agentic Workflow — Paradigms

Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View (ZJU & Deepmind, Oct 2023, arXiv)
Rethinking the Bounds of LLM Reasoning: Are Multi-Agent Discussions the Key? (ZJU, HKUST & UIUC, May 2024, arXiv)
360◦REA: Towards A Reusable Experience Accumulation with 360◦Assessment for Multi-Agent System (Apr 2024, arXiv)
CAMEL: Communicative Agents for “Mind” Exploration of Large Language Model Society (KAUST, 2023, NIPS)
A Survey on Large Language Model based Autonomous Agents (2023, arXiv)
Mixture-of-Agents Enhances Large Language Model Capabilities (Together AI, Jun 2024, arXiv)

Agentic Applications — Simulation

Generative Agents: Interactive Simulacra of Human Behavior (Stanford/Google, Apr 2023, arXiv, Demo)
Deciphering digital detectives: Understanding llm behaviors and capabilities in multi-agent mystery games (Umontreal, Dec 2023, arXiv)
VillagerAgent: A Graph-Based Multi-Agent Framework for Coordinating Complex Task Dependencies in Minecraft (ZJU, Jun 2024, arXiv)

Agentic Applications — Finance

Complete list: Awesome-Finance-AI-Papers

Multi-agent frameworks

LangGraph (GitHub)
AutoGen by Microsoft (GitHub, Paper)
AgentScope by Alibaba Group(GitHub, System Paper, Projects paper)
Translation-agent by Andrew Ng (GitHub)

TODO:

Agentic Workflow — Human-Agent Interactions
Agentic Applications — Dev Tools
Agentic Applications — Content Creation (AIGC)
Agentic Applications — Social Network
Agentic Applications — Education
Production Operations — LLMOps
Production Operations — AI Cloud
Production Operations — Monitoring

Citation

Please cite the repo if you refer to the content of the this repository for your work.

@misc{awesome-llm-agents-jl, author = {Junhua Liu}, title = {Awesome-LLM-Agents: recent trends and advancement in Agentic AI}, year = {2024}, month = {August}, publisher = {Medium}, journal = {AI Advances}, doi = {10.5281/zenodo.14021180}, url = {https://medium.com/junhua/awesome-llm-agents-recent-trends-and-advancement-in-agentic-ai-90bac6249060}, }

Owner

Name: Junhua LIU
Login: junhua
Kind: user
Location: Singapore
Company: Singapore University of Technology and Design

Website: https://qlassroom.ai
Repositories: 11
Profile: https://github.com/junhua

Founder of Qlassroom, PhD@SUTD

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Liu"
  given-names: "Junhua"
  orcid: "https://orcid.org/0000-0003-4477-7439"
title: "Awesome-LLM-Agents: Recent Trends and Advancement in Agentic AI"
version: 1.0.0
doi: 10.5281/zenodo.1234
date-released: 2024-08-03
url: "https://github.com/junhua/awesome-llm-agents"

GitHub Events

Total

Create event: 1
Issues event: 1
Release event: 1
Watch event: 48
Push event: 2
Fork event: 6

Last Year

Create event: 1
Issues event: 1
Release event: 1
Watch event: 48
Push event: 2
Fork event: 6

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science