A system-prompt clause and an output check stop most attacks (industry rule-of-thumb, not a benchmark). The five patterns they don't stop, so you know the limits.
Every Indian founder I've met in the last two years has the same WhatsApp problem. Customers DM them at all hours. Half the queries are the same five questions. The founder ends up being the company's unpaid, always-on customer support.…