📰00Self-fulfilling misalignment? - Marginal REVOLUTIONMarginal REVOLUTION·Tyler Cowen·25 days ago#qrsdnnx8#navbar#comments#print#bedfcadfdcdfccccd1fed9d3cb90dbdacb#self#anthropic+6 more🧰Tag tools✨Add tagFrom Anthropic: We started by investigating why Claude chose to blackmail. We believe the original source of the behavior was internet text that portrays AI as evil and interested in self-preservation.… Read more15s0Read later0Read More