The Agent Harness Belongs Outside the Sandbox

1 / 4

The Agent Harness Belongs Outside the Sandbox

www.mendral.com·Sam Alba·about 1 month ago

#C8iiNFKl

#harness #mendral #aidevops #agent #sandbox #filesystem

Reading 0:00

15s threshold

An agent harness is the loop that drives an LLM. It sends a prompt, gets a response, executes the tool calls the model requested, feeds the results back, and repeats until the model says it's done. Every production agent has one. The question is where it runs. There are two answers. They have different security properties, different failure modes, and different implications for what the agent can do. The tradeoffs also look different depending on whether you're building a single-user agent (one engineer on a laptop) or a multi-user one (dozens of engineers in the same organization sharing the same agent). We're in the multi-user camp, which surfaces problems single-user builders don't hit. The two architectures Harness inside the sandbox The loop lives in the same container as the code it's working on. LLM calls go out from inside the container. Tool calls (bash, read, write) execute locally. Skills, memories, and anything else the harness tracks are files on the container's filesystem.…

Continue reading — create a free account

Join HashtagPLUS to read full articles, follow hashtags, vote, and join the conversation.

Create free account Log in

Menu

The Agent Harness Belongs Outside the Sandbox