Introduction In earlier experiments, we explored a simple but powerful idea: An agent can generate code, run it, observe failures, and iteratively fix itself. That approach worked well for single-file applications across Python, Go, and TypeScript. But real systems are not single files. They are: structured\ layered\ multi-module\ opinionated This raises a deeper question: Can agents generate complete repositories — — not just code? In this article, we push agentic coding one step further: 👉 generating a Clean Architecture notes application 👉 as a multi-file Python project 👉 verified with unit tests (pytest) The Shift: From Code to Systems system = structure + boundaries + contracts + code Target Architecture project_agent_coded_python_clean_arch/ domain/ models.py0 application/ services.py infrastructure/ db.py api/ main.py tests/ test_notes.py Enter fullscreen mode Exit fullscreen mode The Agent Evolution generate → run → validate → test → fix Running the Agent python agent_coder_python_clean_arch.py…