News

Anthropic’s Claude AI chatbot can now end conversations if it is distressed - Testing showed that chatbot had ‘pattern of ...
Claude won't stick around for toxic convos. Anthropic says its AI can now end extreme chats when users push too far.
Anthropic's latest feature for two of its Claude AI models could be the beginning of the end for the AI jailbreaking ...
Anthropic has said that their Claude Opus 4 and 4.1 models will now have the ability to end conversations that are “extreme ...
Anthropic rolled out a feature letting its AI assistant terminate chats with abusive users, citing "AI welfare" concerns and ...
Anthropic empowers Claude AI to end conversations in cases of repeated abuse, prioritizing model welfare and responsible AI ...
Roughly 200 people gathered in San Francisco on Saturday to mourn the loss of Claude 3 Sonnet, an older AI model that ...
Anthropic’s Claude Code now features continuous AI security reviews, spotting vulnerabilities in real time to keep unsafe ...
Data, analysis, and analytics are a major part of safeguards. A research paper describes multistep reasoning and how Claude ...