News

But two new papers from the AI company Anthropic, both published on the preprint server arXiv, provide new insight into how ...
It's August, which means Hot Science Summer is two-thirds over. This week, NASA released an exceptionally pretty photo of ...
A new study reveals that AI models can secretly pass harmful traits to one another, raising concerns about hidden risks in ...
AI is supposed to be helpful, honest, and most importantly, harmless, but we've seen plenty of evidence that its behavior can ...
Researchers are testing new ways to prevent and predict dangerous personality shifts in AI models before they occur in the wild.
First Energy, Meta, Anthropic among other investors. Other projects announced at the Energy and Innovation Summit: $15 billion, First Energy.
A new study from Anthropic introduces "persona vectors," a technique for developers to monitor, predict and control unwanted LLM behaviors.
U.S. state legislatures are where the action is for placing guardrails around artificial intelligence technologies, given the ...
AI is a relatively new tool, and despite its rapid deployment in nearly every aspect of our lives, researchers are still ...
Using two open-source models (Qwen 2.5 and Meta’s Llama 3), Anthropic engineers went deep into the neural networks to find the ...
A new study from Anthropic suggests that traits such as sycophancy or evilness are associated with specific patterns of ...
In the paper, Anthropic explained that it can steer these vectors by instructing models to act in certain ways. For example, if it injects an evil prompt into the model, the model will respond from ...