Small LLMs Are Weak Tool Learners: A Multi-LLM Agent

Shen, Weizhou; Li, Chenliang; Chen, Hongzhan; Yan, Ming; Quan, Xiaojun; Chen, Hehong; Zhang, Ji; Huang, Fei

Computer Science > Artificial Intelligence

arXiv:2401.07324 (cs)

[Submitted on 14 Jan 2024 (v1), last revised 16 Feb 2024 (this version, v3)]

Title:Small LLMs Are Weak Tool Learners: A Multi-LLM Agent

Authors:Weizhou Shen, Chenliang Li, Hongzhan Chen, Ming Yan, Xiaojun Quan, Hehong Chen, Ji Zhang, Fei Huang

View PDF

Abstract:Large Language Model (LLM) agents significantly extend the capabilities of standalone LLMs, empowering them to interact with external tools (e.g., APIs, functions) and complete various tasks in a self-directed fashion. The challenge of tool use demands that LLMs not only understand user queries and generate answers accurately but also excel in task planning, tool invocation, and result summarization. While traditional works focus on training a single LLM with all these capabilities, performance limitations become apparent, particularly with smaller models. To overcome these challenges, we propose a novel approach that decomposes the aforementioned capabilities into a planner, caller, and summarizer. Each component is implemented by a single LLM that focuses on a specific capability and collaborates with others to accomplish the task. This modular framework facilitates individual updates and the potential use of smaller LLMs for building each capability. To effectively train this framework, we introduce a two-stage training paradigm. First, we fine-tune a backbone LLM on the entire dataset without discriminating sub-tasks, providing the model with a comprehensive understanding of the task. Second, the fine-tuned LLM is used to instantiate the planner, caller, and summarizer respectively, which are continually fine-tuned on respective sub-tasks. Evaluation across various tool-use benchmarks illustrates that our proposed multi-LLM framework surpasses the traditional single-LLM approach, highlighting its efficacy and advantages in tool learning.

Comments:	On progress, github repo: this https URL
Subjects:	Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2401.07324 [cs.AI]
	(or arXiv:2401.07324v3 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2401.07324

Submission history

From: Weizhou Shen [view email]
[v1] Sun, 14 Jan 2024 16:17:07 UTC (1,568 KB)
[v2] Thu, 1 Feb 2024 04:34:07 UTC (3,495 KB)
[v3] Fri, 16 Feb 2024 12:42:25 UTC (10,664 KB)

Computer Science > Artificial Intelligence

Title:Small LLMs Are Weak Tool Learners: A Multi-LLM Agent

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Small LLMs Are Weak Tool Learners: A Multi-LLM Agent

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators