Menu

Post image 1
Post image 2
Post image 3
Post image 4
Post image 5
Post image 6
Post image 7
Post image 8
Post image 9
Post image 10
Post image 11
Post image 12
Post image 13
1 / 13
0

Using AI to click around on a website burns 45x as many tokens as just using APIs

theregister·Thomas Claburn·26 days ago
#eqM7Ej3i
Reading 0:00
15s threshold

AI For AI agents, seeing is expensive Businesses deploying AI agents to automate computer usage may be spending far more money than necessary if those agents try to emulate human visual interaction. Reflex, an enterprise application platform, recently set out to compare vision agents with API agents. A vision agent in this context refers to an AI agent that mimics human interaction by relying on image processing and optical character recognition to operate an application. In this instance, that's Claude Sonnet navigating a web app user interface via browser-use 0.12 , a tool for automated web browser operation. An API agent here refers to Claude Sonnet interacting with a web app via tools and APIs. The agent calls the same handling mechanisms that the UI calls and receives structured data in response, rather than a web page screenshot that must be analyzed.…

Continue reading — create a free account

Join HashtagPLUS to read full articles, follow hashtags, vote, and join the conversation.

Read More