ART (Automatic Reasoning and Tool-use)

ART Framework Overview

ART (Automatic Reasoning and Tool-use) is a prompting technique that enables language models to reason through a problem and use external tools (such as calculators, APIs, or databases) to reach a solution.

Why ART? 🛠️

By interleaving reasoning steps with tool calls, ART allows models to handle tasks that require both logical thinking and access to external information or computation.

This approach is especially useful for complex, multi-step tasks where the model’s internal knowledge is insufficient or where precise calculations or up-to-date data are needed.

🎯 Use When

The task requires both reasoning and external computation or lookup
You want the model to use tools to improve accuracy or access real-time data
The problem involves multi-step workflows or integration with external systems

Pattern

Ask the model to break down the problem and identify when to use a tool.
At each step, use the tool’s output to inform the next reasoning step.
Continue until the final answer is reached.

Examples

Example 1: Mathematical Calculation with Tool Use

Calculate the square root of 245. (Model decides to use a calculator tool.)

Model output:

Step 1: Recognize that the square root of 245 is not a simple integer.
Step 2: Use calculator tool: sqrt(245) ≈ 15.652.
Final answer: 15.652

Example 2: Real-Time Data Lookup (new)

Find the current weather in Paris and suggest an outfit for today. (Model queries a weather API, then reasons about clothing.)

Model output:

Step 1: Query weather API for Paris: 22°C, sunny.
Step 2: Based on the weather, suggest wearing light clothing and sunglasses.
Final answer: Wear a t-shirt, shorts, and sunglasses.

✅ Benefits

Accuracy: Leverages external tools for precise computation or up-to-date information
Flexibility: Can be adapted to a wide range of tasks and tool integrations
Transparency: Makes the reasoning and tool-use process explicit

Performance Results

ART Performance Results

The performance comparison above demonstrates ART's effectiveness across various reasoning and tool-use benchmarks.

⚠️ Pitfalls

Requires integration with external tools or APIs
May fail if tool use is not well-defined or if the tool is unavailable
Can increase complexity and latency

References

Paranjape, B., Lundberg, S., Singh, S., Hajishirzi, H., Zettlemoyer, L., & Ribeiro, M. T. (2023). ART: Automatic Multi-step Reasoning and Tool-use for Large Language Models.

🎯 Use When​

Pattern​

Examples​

Example 1: Mathematical Calculation with Tool Use​

Example 2: Real-Time Data Lookup (new)​

✅ Benefits​

Performance Results​

⚠️ Pitfalls​

References​