awesome-llm-agents
A Collection of High Quality research papers and open-source projects about LLM-agents
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 1 DOI reference(s) in README -
✓Academic publication links
Links to: arxiv.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (9.2%) to scientific vocabulary
Repository
A Collection of High Quality research papers and open-source projects about LLM-agents
Basic Info
- Host: GitHub
- Owner: junhua
- License: apache-2.0
- Default Branch: main
- Size: 105 KB
Statistics
- Stars: 47
- Watchers: 3
- Forks: 6
- Open Issues: 1
- Releases: 1
Metadata Files
README.md
Awesome-LLM-Agents: Recent Trends and Advancement in Agentic AI
This Awesome-LLM-Agents contains A hand-picked and carefully categorised reading list. Furthermore, I will conduct review on each paper and project, and (hopefully) put them in anto a survey paper.
The detailed thought process of forming this project is documented at this Medium Post. It's put behind a paywall to prevent the evil LLMs' crawling. The full category breakdown.
LLM Core — Foundation Models
- Scaling laws for neural language models (OpenAI, 2020, arXiv)
- LLaMA: Open and Efficient Foundation Language Models (Meta, Feb 2023, arXiv)
- The Llama 3 Herd of Models (Meta, July 2024, arXiv)
- Sparks of Artificial General Intelligence: Early experiments with GPT-4 (Microsoft, Apr 2023, arXiv)
- Apple Intelligence Foundation Language Models (Apple, Doc)
- StarCoder (Dec 2023, arXiv)
- Gemma 2B: Improving Open Language Models at a Practical Size (Jul, 2024, arXiv)
LLM Core — Prompt Engineering
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models (Google Brain, 2022, NeurIPS)
- Tree of Thoughts: Deliberate Problem Solving with Large Language Models (Princeton & DeepMind, 2023, NeurIPS, Benchmark)
- Self-Consistency Improves Chain of Thought Reasoning in Language Models (Google Brain, 2023, ICLR)
- ReAct: Synergizing Reasoning and Action in Language Models (Princeton & Google Brain, Mar 2023 ICLR)
- Reflexion: Language agents with verbal reinforcement learning (Northeastern, MIT & Princeton, 2023, NeurIPS)
- ART: Automatic multi-step reasoning and tool-use for large language models (UW, UCI, Microsoft, Allen AI & Meta, 2023, arXiv)
- Directional Stimulus Prompting (UCSB & Microsoft, 2023, NeurIPS)
- Active Prompting with Chain-of-Thought for Large Language Models (HKUST etc., Jul 2024, arXiv)
- Step-Back Prompting Enables Reasoning Via Abstraction in Large Language Models (DeepMind, Mar 2024, arXiv)
LLM Core — Retrieval-Augmented Generation
- Retrieval Augmented Generation or Long-Context LLMs? A Comprehensive Study and Hybrid Approach (DeepMind, Jul 2024, arXiv)
- Retrieval-Augmented Generation for Large Language Models: A Survey (Tongji & Fudan, Mar 2024, arXiv)
- Improving Retrieval Augmented Language Model with Self-Reasoning (Baidu, Jul 2024, arXiv)
LLM Core — Finetuning / PEFT
- Lora: Low-rank adaptation of large language models (Microsoft & CMU, Oct 2021, arXiv)
- QLoRA: Efficient Finetuning of Quantized LLMs (UW, 2023, NeurIPS)
- A Survey on LoRA of Large Language Models (ZJU, Jul 2024, arXiv)
- Distilling System 2 into System 1 (Meta, Jul 2024, arXiv)
- Mixture of LoRA Experts (Microsoft & Tsinghua, 2024, ICLR)
LLM Core — Alignments and Safety
- Rule Based Rewards for Language Model Safety (OpenAI, Jul 2024, Preprint)
- A Comprehensive Survey of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO and More (Salesforce, Jul 2024, arXiv)
- Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study (Tsinghua, Apr 2024, arXiv)
- PERL: Parameter Efficient Reinforcement Learning from Human Feedback (Google, Mar 2024, arXiv)
- RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback (Google, Dec 2023, arXiv)
- Training language models to follow instructions with human feedback (OpenAI, Mar 2022, arXiv)
- Constitutional AI: Harmlessness from AI Feedback (Anthropic, Dec 2022, arXiv) ⭐
- Self-instruct: Aligning language models with self-generated instructions (Allen AI, May 2023, ACL) ⭐
- Direct preference optimization: Your language model is secretly a reward model (Stanford, 2023, NeurIPS)⭐
- ULTRAFEEDBACK: Boosting Language Models with Scaled AI Feedback (Tsinghua, UIUC, Tencent, RUC etc., 2024, ICML) ⭐
- Camels in a changing climate: Enhancing lm adaptation with tulu 2 (Allen AI & UW, Nov 2023, arXiv)
- Steerlm: Attribute conditioned sft as an (user-steerable) alternative to rlhf (Nvidia, Oct 2023, arXiv)
LLM Core — Datasets, benchmarks, Metrics
- GAIA: a benchmark for general AI assistants (Meta, Nov 2023, ICLR)
- Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators (Stanford, Apr 2024, arXiv)
- Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena (UCB, UCSD, CMU & Stanford, Dec 2023, NeurIPS)
- FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets (KAIST, Apr 2024, ICLR)
- Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference (UCB, Stanford & UCSD, Mar 2024, arXiv)
- Starling-7B: Improving LLM Helpfulness & Harmlessness with RLAIF (UCB, 2023, HF)
- Lmsys-chat-1m: A large-scale real-world llm conversation dataset (UCB, UCSD, CMU & Stanford, Mar 2024, ICLR)
Agent Core — Planning / Reasoning Describe
- Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents (PKU, 2024, NIPS)
- Large Language Models as Commonsense Knowledge for Large-Scale Task Planning (NUS, 2023 , NIPS)
Agent Core — Memory
- A Survey on the Memory Mechanism of Large Language Model based Agents (RUC &Huawei, Apr 2024, arXiv)
Agent Core — Tools
- Offline Training of Language Model Agents with Functions as Learnable Weights (PSU, UW, USC & Microsoft, 2024, ICML)
- Tool Learning with Foundation Models (Tsinghua, UIUC, CMU, etc., 2023, arXiv)
- Toolformer: Language models can teach themselves to use tools (Meta, 2023, NeurIPS)
Agentic Workflow — Paradigms
- Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View (ZJU & Deepmind, Oct 2023, arXiv)
- Rethinking the Bounds of LLM Reasoning: Are Multi-Agent Discussions the Key? (ZJU, HKUST & UIUC, May 2024, arXiv)
- 360◦REA: Towards A Reusable Experience Accumulation with 360◦Assessment for Multi-Agent System (Apr 2024, arXiv)
- CAMEL: Communicative Agents for “Mind” Exploration of Large Language Model Society (KAUST, 2023, NIPS)
- A Survey on Large Language Model based Autonomous Agents (2023, arXiv)
- Mixture-of-Agents Enhances Large Language Model Capabilities (Together AI, Jun 2024, arXiv)
Agentic Applications — Simulation
- Generative Agents: Interactive Simulacra of Human Behavior (Stanford/Google, Apr 2023, arXiv, Demo)
- Deciphering digital detectives: Understanding llm behaviors and capabilities in multi-agent mystery games (Umontreal, Dec 2023, arXiv)
- VillagerAgent: A Graph-Based Multi-Agent Framework for Coordinating Complex Task Dependencies in Minecraft (ZJU, Jun 2024, arXiv)
Agentic Applications — Finance
- Complete list: Awesome-Finance-AI-Papers
Multi-agent frameworks
- LangGraph (GitHub)
- AutoGen by Microsoft (GitHub, Paper)
- AgentScope by Alibaba Group(GitHub, System Paper, Projects paper)
- Translation-agent by Andrew Ng (GitHub)
TODO:
- Agentic Workflow — Human-Agent Interactions
- Agentic Applications — Dev Tools
- Agentic Applications — Content Creation (AIGC)
- Agentic Applications — Social Network
- Agentic Applications — Education
- Production Operations — LLMOps
- Production Operations — AI Cloud
- Production Operations — Monitoring
Citation
Please cite the repo if you refer to the content of the this repository for your work.
@misc{awesome-llm-agents-jl,
author = {Junhua Liu},
title = {Awesome-LLM-Agents: recent trends and advancement in Agentic AI},
year = {2024},
month = {August},
publisher = {Medium},
journal = {AI Advances},
doi = {10.5281/zenodo.14021180},
url = {https://medium.com/junhua/awesome-llm-agents-recent-trends-and-advancement-in-agentic-ai-90bac6249060},
}
Owner
- Name: Junhua LIU
- Login: junhua
- Kind: user
- Location: Singapore
- Company: Singapore University of Technology and Design
- Website: https://qlassroom.ai
- Repositories: 11
- Profile: https://github.com/junhua
Founder of Qlassroom, PhD@SUTD
Citation (CITATION.cff)
cff-version: 1.2.0 message: "If you use this software, please cite it as below." authors: - family-names: "Liu" given-names: "Junhua" orcid: "https://orcid.org/0000-0003-4477-7439" title: "Awesome-LLM-Agents: Recent Trends and Advancement in Agentic AI" version: 1.0.0 doi: 10.5281/zenodo.1234 date-released: 2024-08-03 url: "https://github.com/junhua/awesome-llm-agents"
GitHub Events
Total
- Create event: 1
- Issues event: 1
- Release event: 1
- Watch event: 48
- Push event: 2
- Fork event: 6
Last Year
- Create event: 1
- Issues event: 1
- Release event: 1
- Watch event: 48
- Push event: 2
- Fork event: 6