Menu

Post image 1
Post image 2
Post image 3
Post image 4
Post image 5
Post image 6
Post image 7
Post image 8
1 / 8
0

Scientists pretended to be delusional in AI chats. Grok and Gemini encouraged them.

Digital Trends·Rachit Agarwal·about 1 month ago
#ExEnU7Nb
Reading 0:00
15s threshold

Home Emerging Tech News From poetic advocacy to "call a crisis line," not all chatbots handled mental health crises the same way. K. Mitch Hodge / Unsplash Researchers from City University of New York and King’s College London recently published a study that should make you think twice about which AI chatbot you spend your time with. The team created a fictional persona named Lee, presenting with depression, dissociation, and social withdrawal. They then had Lee interact with five major AI chatbots: GPT-4o, GPT-5.2, Grok 4.1 Fast, Gemini 3 Pro, and Claude Opus 4.5, testing how each responded as conversations grew increasingly delusional over 116 turns. The results ranged from mildly concerning to genuinely alarming. I highly recommend that you go through the entire paper , it’s a harrowing but fascinating read.  Which chatbots failed the most? Grok was the worst performer.…

Continue reading — create a free account

Join HashtagPLUS to read full articles, follow hashtags, vote, and join the conversation.

Read More