Scientists pretended to be delusional in AI chats. Grok and Gemini encouraged them.

1 / 8

Scientists pretended to be delusional in AI chats. Grok and Gemini encouraged them.

Digital Trends·Rachit Agarwal·about 1 month ago

#ExEnU7Nb

#emergingtech #ai #artificialinteligence #chatgpt #claude #smart

Reading 0:00

15s threshold

Home Emerging Tech News From poetic advocacy to "call a crisis line," not all chatbots handled mental health crises the same way. K. Mitch Hodge / Unsplash Researchers from City University of New York and King’s College London recently published a study that should make you think twice about which AI chatbot you spend your time with. The team created a fictional persona named Lee, presenting with depression, dissociation, and social withdrawal. They then had Lee interact with five major AI chatbots: GPT-4o, GPT-5.2, Grok 4.1 Fast, Gemini 3 Pro, and Claude Opus 4.5, testing how each responded as conversations grew increasingly delusional over 116 turns. The results ranged from mildly concerning to genuinely alarming. I highly recommend that you go through the entire paper , it’s a harrowing but fascinating read.  Which chatbots failed the most? Grok was the worst performer.…

Continue reading — create a free account

Join HashtagPLUS to read full articles, follow hashtags, vote, and join the conversation.

Create free account Log in

Menu

Scientists pretended to be delusional in AI chats. Grok and Gemini encouraged them.