stereoplegic 's Collections
Branch-Solve-Merge Improves Large Language Model Evaluation and
Generation
Paper
• 2310.15123
• Published
• 8
ToolChain*: Efficient Action Space Navigation in Large Language Models
with A* Search
Paper
• 2310.13227
• Published
• 15
LASER: LLM Agent with State-Space Exploration for Web Navigation
Paper
• 2309.08172
• Published
• 14
Language Agent Tree Search Unifies Reasoning Acting and Planning in
Language Models
Paper
• 2310.04406
• Published
• 10
Autonomous Tree-search Ability of Large Language Models
Paper
• 2310.10686
• Published
• 2
Tree-Planner: Efficient Close-loop Task Planning with Large Language
Models
Paper
• 2310.08582
• Published
• 3
Reverse Chain: A Generic-Rule for LLMs to Master Multi-API Planning
Paper
• 2310.04474
• Published
• 2
AgentTuning: Enabling Generalized Agent Abilities for LLMs
Paper
• 2310.12823
• Published
• 36
FireAct: Toward Language Agent Fine-tuning
Paper
• 2310.05915
• Published
• 2
Adapting LLM Agents Through Communication
Paper
• 2310.01444
• Published
• 3
MusicAgent: An AI Agent for Music Understanding and Generation with
Large Language Models
Paper
• 2310.11954
• Published
• 25
Promptor: A Conversational and Autonomous Prompt Generation Agent for
Intelligent Text Entry Techniques
Paper
• 2310.08101
• Published
• 2
SAI: Solving AI Tasks with Systematic Artificial Intelligence in
Communication Network
Paper
• 2310.09049
• Published
• 1
SmartPlay : A Benchmark for LLMs as Intelligent Agents
Paper
• 2310.01557
• Published
• 13
Are Human-generated Demonstrations Necessary for In-context Learning?
Paper
• 2309.14681
• Published
• 1
Agent Instructs Large Language Models to be General Zero-Shot Reasoners
Paper
• 2310.03710
• Published
• 2
Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model
Collaboration
Paper
• 2310.00280
• Published
• 3
SteP: Stacked LLM Policies for Web Actions
Paper
• 2310.03720
• Published
• 8
You Only Look at Screens: Multimodal Chain-of-Action Agents
Paper
• 2309.11436
• Published
• 1
MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language
Feedback
Paper
• 2309.10691
• Published
• 4
Lemur: Harmonizing Natural Language and Code for Language Agents
Paper
• 2310.06830
• Published
• 33
EcoAssistant: Using LLM Assistant More Affordably and Accurately
Paper
• 2310.03046
• Published
• 6
SALMON: Self-Alignment with Principle-Following Reward Models
Paper
• 2310.05910
• Published
• 2
SCREWS: A Modular Framework for Reasoning with Revisions
Paper
• 2309.13075
• Published
• 18
DSPy: Compiling Declarative Language Model Calls into Self-Improving
Pipelines
Paper
• 2310.03714
• Published
• 37
LLM Guided Inductive Inference for Solving Compositional Problems
Paper
• 2309.11688
• Published
• 1
AskIt: Unified Programming Interface for Programming with Large Language
Models
Paper
• 2308.15645
• Published
• 2
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving
Paper
• 2309.17452
• Published
• 3
CodeChain: Towards Modular Code Generation Through Chain of
Self-revisions with Representative Sub-modules
Paper
• 2310.08992
• Published
• 12
A Zero-Shot Language Agent for Computer Control with Structured
Reflection
Paper
• 2310.08740
• Published
• 15
CLIN: A Continually Learning Language Agent for Rapid Task Adaptation
and Generalization
Paper
• 2310.10134
• Published
• 1
Multimodal Multi-Hop Question Answering Through a Conversation Between
Tools and Efficiently Finetuned Large Language Models
Paper
• 2309.08922
• Published
• 1
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via
Tool Embeddings
Paper
• 2305.11554
• Published
• 2
GEAR: Augmenting Language Models with Generalizable and Efficient Tool
Resolution
Paper
• 2307.08775
• Published
• 1
ReWOO: Decoupling Reasoning from Observations for Efficient Augmented
Language Models
Paper
• 2305.18323
• Published
• 1
Chameleon: Plug-and-Play Compositional Reasoning with Large Language
Models
Paper
• 2304.09842
• Published
• 2
Visual Programming: Compositional visual reasoning without training
Paper
• 2211.11559
• Published
• 1
Agents: An Open-source Framework for Autonomous Language Agents
Paper
• 2309.07870
• Published
• 43
ControlLLM: Augment Language Models with Tools by Searching on Graphs
Paper
• 2310.17796
• Published
• 18
Reason for Future, Act for Now: A Principled Framework for Autonomous
LLM Agents with Provable Sample Efficiency
Paper
• 2309.17382
• Published
• 6
Enabling Intelligent Interactions between an Agent and an LLM: A
Reinforcement Learning Approach
Paper
• 2306.03604
• Published
• 1
ComputeGPT: A computational chat model for numerical problems
Paper
• 2305.06223
• Published
• 1
Natural Language Embedded Programs for Hybrid Language Symbolic
Reasoning
Paper
• 2309.10814
• Published
• 3
Program of Thoughts Prompting: Disentangling Computation from Reasoning
for Numerical Reasoning Tasks
Paper
• 2211.12588
• Published
• 3
Structured Chain-of-Thought Prompting for Code Generation
Paper
• 2305.06599
• Published
• 1
Of Models and Tin Men: A Behavioural Economics Study of Principal-Agent
Problems in AI Alignment using Large-Language Models
Paper
• 2307.11137
• Published
• 1
ExpeL: LLM Agents Are Experiential Learners
Paper
• 2308.10144
• Published
• 3
i-Code Studio: A Configurable and Composable Framework for Integrative
AI
Paper
• 2305.13738
• Published
• 1
AssistGPT: A General Multi-modal Assistant that can Plan, Execute,
Inspect, and Learn
Paper
• 2306.08640
• Published
• 27
CRAFT: Customizing LLMs by Creating and Retrieving from Specialized
Toolsets
Paper
• 2309.17428
• Published
• 1
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world
APIs
Paper
• 2307.16789
• Published
• 102
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language
Models
Paper
• 2308.00675
• Published
• 37
DocPrompting: Generating Code by Retrieving the Docs
Paper
• 2207.05987
• Published
• 1
Toolformer: Language Models Can Teach Themselves to Use Tools
Paper
• 2302.04761
• Published
• 12
GPT4Tools: Teaching Large Language Model to Use Tools via
Self-instruction
Paper
• 2305.18752
• Published
• 5
ToolCoder: Teach Code Generation Models to use API search tools
Paper
• 2305.04032
• Published
• 1
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring
Emergent Behaviors
Paper
• 2308.10848
• Published
• 1
Syntax Error-Free and Generalizable Tool Use for LLMs via Finite-State
Decoding
Paper
• 2310.07075
• Published
• 1
A Survey on Large Language Model based Autonomous Agents
Paper
• 2308.11432
• Published
• 3
OpenAGI: When LLM Meets Domain Experts
Paper
• 2304.04370
• Published
• 1
Multi-Agent Collaboration: Harnessing the Power of Intelligent LLM
Agents
Paper
• 2306.03314
• Published
• 2
Exploring the Intersection of Large Language Models and Agent-Based
Modeling via Prompt Engineering
Paper
• 2308.07411
• Published
• 2
Cognitive Architectures for Language Agents
Paper
• 2309.02427
• Published
• 8
The Rise and Potential of Large Language Model Based Agents: A Survey
Paper
• 2309.07864
• Published
• 8
Self-driven Grounding: Large Language Model Agents with Automatical
Language-aligned Skill Learning
Paper
• 2309.01352
• Published
• 1
Unleashing Cognitive Synergy in Large Language Models: A Task-Solving
Agent through Multi-Persona Self-Collaboration
Paper
• 2307.05300
• Published
• 20
Communicative Agents for Software Development
Paper
• 2307.07924
• Published
• 6
Lumos: Learning Agents with Unified Data, Modular Design, and
Open-Source LLMs
Paper
• 2311.05657
• Published
• 30
JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal
Language Models
Paper
• 2311.05997
• Published
• 37
Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation
Paper
• 2310.02304
• Published
• 1
Is AI the better programming partner? Human-Human Pair Programming vs.
Human-AI pAIr Programming
Paper
• 2306.05153
• Published
• 1
"Teach AI How to Code": Using Large Language Models as Teachable Agents
for Programming Education
Paper
• 2309.14534
• Published
• 2
Towards Teachable Conversational Agents
Paper
• 2102.10387
• Published
• 1
Dynamic Planning with a LLM
Paper
• 2308.06391
• Published
• 2
LLM Augmented Hierarchical Agents
Paper
• 2311.05596
• Published
• 1
Execution-Based Evaluation for Open-Domain Code Generation
Paper
• 2212.10481
• Published
• 1
ToolTalk: Evaluating Tool-Usage in a Conversational Setting
Paper
• 2311.10775
• Published
• 9
MetaTool Benchmark for Large Language Models: Deciding Whether to Use
Tools and Which to Use
Paper
• 2310.03128
• Published
• 1
TPTU-v2: Boosting Task Planning and Tool Usage of Large Language
Model-based Agents in Real-world Systems
Paper
• 2311.11315
• Published
• 7
Understanding HTML with Large Language Models
Paper
• 2210.03945
• Published
• 1
Responsible Task Automation: Empowering Large Language Models as
Responsible Task Automators
Paper
• 2306.01242
• Published
• 2
Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning
Paper
• 2312.14878
• Published
• 15
GAIA: a benchmark for General AI Assistants
Paper
• 2311.12983
• Published
• 245
Modeling Complex Mathematical Reasoning via Large Language Model based
MathAgent
Paper
• 2312.08926
• Published
• 9
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent
Paper
• 2312.10003
• Published
• 44
CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution
Paper
• 2401.03065
• Published
• 11
AutoAgents: A Framework for Automatic Agent Generation
Paper
• 2309.17288
• Published
• 5
If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code
Empowers Large Language Models to Serve as Intelligent Agents
Paper
• 2401.00812
• Published
• 11
Pop Quiz! Do Pre-trained Code Models Possess Knowledge of Correct API
Names?
Paper
• 2309.07804
• Published
• 2
Prompt2Model: Generating Deployable Models from Natural Language
Instructions
Paper
• 2308.12261
• Published
• 1
EASYTOOL: Enhancing LLM-based Agents with Concise Tool Instruction
Paper
• 2401.06201
• Published
• 2
LEVER: Learning to Verify Language-to-Code Generation with Execution
Paper
• 2302.08468
• Published
• 1
ProTIP: Progressive Tool Retrieval Improves Planning
Paper
• 2312.10332
• Published
• 8
TravelPlanner: A Benchmark for Real-World Planning with Language Agents
Paper
• 2402.01622
• Published
• 38
SymbolicAI: A framework for logic-based approaches combining generative
models and solvers
Paper
• 2402.00854
• Published
• 22
Efficient Exploration for LLMs
Paper
• 2402.00396
• Published
• 22
Efficient Tool Use with Chain-of-Abstraction Reasoning
Paper
• 2401.17464
• Published
• 21
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Paper
• 2311.05437
• Published
• 51
Empowering LLM to use Smartphone for Intelligent Task Automation
Paper
• 2308.15272
• Published
• 1
AgentScope: A Flexible yet Robust Multi-Agent Platform
Paper
• 2402.14034
• Published
• 13
Large Language Model based Multi-Agents: A Survey of Progress and
Challenges
Paper
• 2402.01680
• Published
• 2
LLM Multi-Agent Systems: Challenges and Open Problems
Paper
• 2402.03578
• Published
• 1
Professional Agents -- Evolving Large Language Models into Autonomous
Experts with Human-Level Competencies
Paper
• 2402.03628
• Published
S-Agents: self-organizing agents in open-ended environment
Paper
• 2402.04578
• Published
SpeechAgents: Human-Communication Simulation with Multi-Modal
Multi-Agent Systems
Paper
• 2401.03945
• Published
LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error
Paper
• 2403.04746
• Published
• 24
LLM Agent Operating System
Paper
• 2403.16971
• Published
• 73
MuMath-Code: Combining Tool-Use Large Language Models with
Multi-perspective Data Augmentation for Mathematical Reasoning
Paper
• 2405.07551
• Published
ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of
Large Language Models in Real-world Scenarios
Paper
• 2401.00741
• Published
• 1
AgileCoder: Dynamic Collaborative Agents for Software Development based
on Agile Methodology
Paper
• 2406.11912
• Published
• 27
From MOOC to MAIC: Reshaping Online Teaching and Learning through
LLM-driven Agents
Paper
• 2409.03512
• Published
• 29
How to Build an AI Tutor that Can Adapt to Any Course and Provide
Accurate Answers Using Large Language Model and Retrieval-Augmented
Generation
Paper
• 2311.17696
• Published
MAXS: Meta-Adaptive Exploration with LLM Agents
Paper
• 2601.09259
• Published
• 95