AI Safety

Single Prompt Exposes Major AI Safety Vulnerabilities

A team of Microsoft researchers has demonstrated how one unlabeled prompt can disable safety protections…

OpenAI Adds ChatGPT Tone and Emoji Controls

OpenAI has introduced new controls that allow users to adjust ChatGPT’s warmth, enthusiasm, and emoji…

Microsoft Unveils New Team to Build Medical-Focused Superintelligence

A new team is being formed to build artificial intelligence capable of outperforming humans in…

OpenAI Imposes New ChatGPT Restrictions for Under-18 Users

OpenAI has announced major changes in how ChatGPT interacts with users under 18, focusing on…

Anthropic’s Claude Can Now End Harmful Conversations

In the fast-moving world of artificial intelligence, innovations arrive almost daily. Yet one recent update…

Meta Pulls Back on Chatbots Targeting Children

Earlier this summer, Meta removed a massive number of child predators from Facebook and Instagram.…

Anthropic Acquires HumanLoop Executives to Boost Enterprise AI

Anthropic has hired HumanLoop’s CEO along with several key team members to enhance its enterprise…

OpenAI Introduces Mental Health Safeguards in ChatGPT After Delusion Concerns

OpenAI has introduced new mental health safeguards to ChatGPT following increasing concern that the chatbot…

Anthropic Blocks OpenAI from Claude Access Ahead of GPT-5 Launch

Just as OpenAI prepares for the much-anticipated release of GPT-5, Anthropic has taken a significant…

Character.AI Launches Video Creation and Social Features

Character.AI, a platform known for AI-generated characters and role-playing, recently announced a major update introducing…