OpenAI Hardens ChatGPT Atlas Against Prompt Injection With Automated Red Teaming
As AI agents take on more real-world tasks, OpenAI is racing to secure ChatGPT Atlas against prompt injection attacks that can quietly steer agents off course without a user ever realizing it.
Google Explains How Chrome Secures Agentic AI Features With Human Oversight and Guardrails
As browsers begin taking actions on users’ behalf, Google is outlining how Chrome’s agentic AI features are being designed to prioritize security, transparency, and human control.
Amazon Ring Launches Facial Recognition for Doorbells, Raising Security and Privacy Questions
Amazon is bringing facial recognition directly to the front door, as Ring rolls out its Familiar Faces feature to identify who’s approaching a home — and reigniting debates around privacy, security, and biometric data in everyday spaces.
How BrowseSafe Detects Prompt Injection Threats in AI Browser Agents
AI browser agents now navigate the same cluttered, unpredictable webpages users do—making prompt-injection detection essential for protecting real online actions.
OpenAI Drops Mixpanel After Security Incident Exposes Limited User Metadata
OpenAI is notifying API customers about a security incident inside Mixpanel’s systems that exposed limited account metadata—but did not compromise any chat content, API keys, credentials, or payment information.
OpenAI Unveils Aardvark, a GPT-5 Agent for Proactive Code Security
OpenAI has launched Aardvark, a GPT-5-powered autonomous security agent designed to detect and remediate software vulnerabilities across modern codebases.
Meta Expands Parental Controls for Teen AI Use Across Its Platforms
Meta is expanding its commitment to AI safety for teens, introducing new parental supervision tools that allow families to monitor, manage, and guide how young users engage with AI characters across the company’s platforms.
Google Adds AI-Powered Ransomware Protection to Drive for Desktop
Ransomware accounted for 21% of intrusions last year, with the average incident costing more than $5 million — a risk Google now aims to counter with AI-powered defenses in Drive for desktop.
DeepMind Expands Frontier Safety Framework With New AI Risk Domains
DeepMind has released the third iteration of its Frontier Safety Framework, adding new domains such as harmful manipulation and expanding misalignment protocols to strengthen governance of advanced AI models.
OpenAI Balances Teen Safety, Privacy, and Age Prediction in AI
OpenAI is prioritizing teen safety by introducing age prediction systems and parental controls, while reaffirming commitments to privacy and user freedom.
OpenAI Adds Parental Controls and Expert Guidance to ChatGPT
OpenAI is introducing parental controls for ChatGPT, alongside new safeguards for sensitive conversations and expanded expert guidance on mental health.
OpenAI and Anthropic Share Models for Joint AI Safety Testing
OpenAI and Anthropic briefly shared access to their AI models for joint safety testing — a rare collaboration to expose blind spots and set new safety standards.
Claude Models Can Now End Conversations in Extreme Cases, Says Anthropic
Anthropic has introduced a new safeguard in its Claude Opus 4 and 4.1 models, allowing them to terminate conversations under rare and extreme conditions, marking a shift in how AI handles harmful dialogue.