AI Safety Archives - The Technology Express

OpenAI has introduced GPT-Red, an internal automated red-teaming system that uses self-play to identify prompt…

Anthropic has launched Claude Sonnet 5, making it the new default AI model for Free,…

OpenAI has introduced GPT-5.6, its latest family of frontier AI models, but the company is…

Anthropic has signed a global artificial intelligence alliance agreement aimed at promoting responsible AI development,…

Anthropic is accelerating the rollout of its advanced AI model, Claude Mythos, as demand for…

A team of Microsoft researchers has demonstrated how one unlabeled prompt can disable safety protections…

OpenAI has introduced new controls that allow users to adjust ChatGPT’s warmth, enthusiasm, and emoji…

A new team is being formed to build artificial intelligence capable of outperforming humans in…

OpenAI has announced major changes in how ChatGPT interacts with users under 18, focusing on…

In the fast-moving world of artificial intelligence, innovations arrive almost daily. Yet one recent update…