Why I Stopped Letting Claude Shell Out for Security Scans

1 / 2

Why I Stopped Letting Claude Shell Out for Security Scans

DEV Community·Grumpy Sage·22 days ago

#NGilDzbT

#security #ai #agent #shell #tools #findings

Reading 0:00

15s threshold

A founder I know spent last Tuesday night debugging what he thought was a Claude bug. He'd wired up Claude Code to his repo with the default shell tool, asked it to "scan this codebase for secrets and SQL injection," and watched it confidently produce a clean report. Zero findings. He shipped to staging. Twelve hours later his Datadog alert fired on a Postgres error trace that exposed a hardcoded service account key in a config file Claude had supposedly scanned. He called me at 11pm. We screen-shared. The problem was almost funny once we saw it. Claude had run cyscan — correctly, with the right flags — against the wrong directory. It had cd 'd into a subfolder earlier in the conversation to read a file, never cd 'd back, and then run the scan from there. The scan completed in 400ms because there were six files in scope. Claude wrote up a confident summary of those six files, called it a codebase audit, and moved on. That's not a Claude failure. That's a tool design failure.…

Continue reading — create a free account

Join HashtagPLUS to read full articles, follow hashtags, vote, and join the conversation.

Create free account Log in

Menu

Why I Stopped Letting Claude Shell Out for Security Scans