๐ง AI That Knows When to Grab a Calculator โ ToolsRL Teaches Vision Models to Use Tools
What happens when you ask an AI to count pizza slices and calculate percentages from a photo? It guesses โ and often gets it wrong.
That's because today's vision-language models try to do everything "in their head," even math they're terrible at.
Researchers from Fudan University just introduced ToolsRL โ a reinforcement learning framework that teaches AI *when and how* to reach for external tools like calculators and code interpreters.
The key insight: instead of only rewarding correct final answers, ToolsRL provides direct supervision on *tool usage itself*. The AI learns not just to solve problems, but to recognize which problems need tools.
๐ฏ Key results:
- A 7B parameter model outperforms models 10x its size on visual math tasks
- 15%+ improvement on math-heavy visual reasoning benchmarks
- The model learns to selectively use tools โ simple questions get quick answers, complex ones trigger calculator calls
Think of it like teaching an apprentice carpenter: instead of just saying "make it straight," you teach them to pick up the hammer. The right tool at the right time changes everything.
This matters because the smartest AI isn't the one that knows everything โ it's the one that knows when to ask for help.
๐ Source
huggingface-papers