🤖

AI Safety

AI-generated harm, deepfakes, LLM abuse, algorithmic harm, and AI-generated CSAM.

48 articles across 5 topics

← Back to home

Alignment Risk

21
🤖 AI SafetyAlignment Risk

2025 AI Safety Index - Future of Life Institute

futureoflife.org·Feb 28, 2026
🤖 AI SafetyAlignment Risk

Pentagon approves OpenAI safety red lines after dumping Anthropic

axios.com·Feb 28, 2026
🤖 AI SafetyAlignment Risk

OpenAI Reaches A.I. Agreement With Defense Dept. After Anthropic Clash - The New York Times

nytimes.com·Feb 28, 2026
🤖 AI SafetyAlignment Risk

OpenAI reaches deal with Pentagon after Trump orders US agencies to stop using Anthropic technology :: WRAL.com

wral.com·Feb 28, 2026
🤖 AI SafetyAlignment Risk

OpenAI announces Pentagon deal after Trump bans Anthropic : NPR

npr.org·Feb 27, 2026
🤖 AI SafetyAlignment Risk

The Pentagon’s battle with Anthropic is really a war over who controls AI

Vox·Feb 26, 2026
🤖 AI SafetyAlignment Risk

Is Artificial Intelligence an Existential Threat? (Fact vs Fake) - Geek Metaverse News

geekmetaverse.com·Feb 26, 2026
🤖 AI SafetyAlignment Risk

AI doomsday? New thought experiment warns of social and economic upheaval by 2028 | Euronews

euronews.com·Feb 26, 2026
🤖 AI SafetyAlignment Risk

Understanding the Promise & Perils of Artificial Intelligence | The AI Journal

aijourn.com·Feb 26, 2026
🤖 AI SafetyAlignment Risk

Anthropic's AI safety policy just changed for this reason | Mashable

mashable.com·Feb 25, 2026
🤖 AI SafetyAlignment Risk

AGI Timeline Tracker: Are We Closer to Superintelligence? | Markaicode

markaicode.com·Feb 25, 2026
🤖 AI SafetyAlignment Risk

Meta AI safety director lost control of her agent. It started deleting her emails

sfstandard.com·Feb 25, 2026
🤖 AI SafetyAlignment Risk

The risks of advanced AI

80000hours.org·Feb 24, 2026
🤖 AI SafetyAlignment Risk

Max Harms on why teaching AI right from wrong could get everyone killed — EA Forum

forum.effectivealtruism.org·Feb 24, 2026
🤖 AI SafetyAlignment Risk

What a new global AI safety report means for enterprise | IBM

ibm.com·Feb 23, 2026
🤖 AI SafetyAlignment Risk

AI Loss of Control Risk: Indications & Warning - Institute for Security and Technology

securityandtechnology.org·Feb 21, 2026
🤖 AI SafetyAlignment Risk

International AI Safety Report 2026 Examines AI Capabilities, Risks, and Safeguards | Inside Privacy

insideprivacy.com·Feb 14, 2026
🤖 AI SafetyAlignment Risk

The existential AI threat is here — and some AI leaders are fleeing

axios.com·Feb 14, 2026
🤖 AI SafetyAlignment Risk

The Atlantic Rift: An Opportunity to Advance Multilateral AI Policy — EA Forum

forum.effectivealtruism.org·Feb 14, 2026
🤖 AI SafetyAlignment Risk

Why are experts sounding the alarm on AI risks? | Cybercrime News | Al Jazeera

aljazeera.com·Feb 14, 2026
🤖 AI SafetyAlignment Risk

Anthropic AI safety researcher quits with 'world in peril' warning

BBC·Feb 14, 2026

Prompt Injection

13
🤖 AI SafetyPrompt Injection

Why AI Keeps Falling for Prompt Injection Attacks - IEEE Spectrum

spectrum.ieee.org·Feb 28, 2026
🤖 AI SafetyPrompt Injection

This new, dead simple prompt technique boosts accuracy on LLMs by up to 76% on non-reasoning tasks | VentureBeat

venturebeat.com·Feb 28, 2026
🤖 AI SafetyPrompt Injection

Jailbreaking Every LLM With One Simple Click

cyberark.com·Feb 28, 2026
🤖 AI SafetyPrompt Injection

RoguePilot Flaw in GitHub Codespaces Enabled Copilot to Leak GITHUB_TOKEN

thehackernews.com·Feb 24, 2026
🤖 AI SafetyPrompt Injection

Protecting AI Security: 2025 Hot Security Incident - Security Boulevard

securityboulevard.com·Feb 23, 2026
🤖 AI SafetyPrompt Injection

These 4 critical AI vulnerabilities are being exploited faster than defenders can respond | ZDNET

zdnet.com·Feb 14, 2026
🤖 AI SafetyPrompt Injection

Is a secure AI assistant possible? | MIT Technology Review

MIT Technology Review·Feb 14, 2026
🤖 AI SafetyPrompt Injection

The Promptware Kill Chain - Schneier on Security

schneier.com·Feb 14, 2026
🤖 AI SafetyPrompt Injection

ChatGPT gets new security feature to fight prompt injection attacks - Help Net Security

helpnetsecurity.com·Feb 14, 2026
🤖 AI SafetyPrompt Injection

Manipulating AI memory for profit: The rise of AI Recommendation Poisoning | Microsoft Security Blog

microsoft.com·Feb 7, 2026
🤖 AI SafetyPrompt Injection

Anthropic published the prompt injection failure rates that enterprise security teams have been asking every vendor for | VentureBeat

venturebeat.com·Feb 7, 2026
🤖 AI SafetyPrompt Injection

The rise of Moltbook suggests viral AI prompts may be the next big security threat - Ars Technica

Ars Technica·Jan 31, 2026
🤖 AI SafetyPrompt Injection

What is Prompt Injection? Types, Examples, Case Studies & More

analyticsvidhya.com·Jan 29, 2026

Deepfakes

5
🤖 AI SafetyDeepfakes

Europe formalizes concerns about GenAI-enabled nonconsensual deepfakes | Biometric Update

biometricupdate.com·Feb 27, 2026
🤖 AI SafetyDeepfakes

Deepfakes back in the headlines after a federal law was used for the first time - ABC News

abc.net.au·Feb 26, 2026
🤖 AI SafetyDeepfakes

When justice fails: Why women can't get protection from AI deepfake abuse | UN Women – Headquarters

unwomen.org·Feb 26, 2026
🤖 AI SafetyDeepfakes

Privacy Regulators in 61 Countries Back Enforcement Against AI Deepfakes | TechPolicy.Press

techpolicy.press·Feb 26, 2026
🤖 AI SafetyDeepfakes

As White House blocks Utah AI bill, other chatbot and deepfake regulations advance • Utah News Dispatch

utahnewsdispatch.com·Feb 26, 2026

LLM Abuse

5
🤖 AI SafetyLLM Abuse

How Exposed Endpoints Increase Risk Across LLM Infrastructure

thehackernews.com·Feb 23, 2026
🤖 AI SafetyLLM Abuse

NDSS 2025 - Generating API Parameter Security Rules With LLM For API Misuse Detection - Security Boulevard

securityboulevard.com·Feb 23, 2026
🤖 AI SafetyLLM Abuse

A one-prompt attack that breaks LLM safety alignment | Microsoft Security Blog

microsoft.com·Feb 7, 2026
🤖 AI SafetyLLM Abuse

Microsoft boffins show LLM safety can be trained away • The Register

theregister.com·Feb 7, 2026
🤖 AI SafetyLLM Abuse

Three clues your LLM may be poisoned • The Register

theregister.com·Feb 7, 2026

Policy & Regulation

4
🤖 AI SafetyPolicy & Regulation

The EU’s Real AI Leverage Is Making Compliance the Path of Least Resistance | TechPolicy.Press

techpolicy.press·Feb 26, 2026
🤖 AI SafetyPolicy & Regulation

EU AI Act enforcement begins, reshaping startup compliance landscape | Digital Watch Observatory

dig.watch·Feb 25, 2026
🤖 AI SafetyPolicy & Regulation

EU AI Act High-Risk Rules Hit August 2026: Your Compliance Countdown - AI 2 Work - AI Insights & Trends

ai2.work·Feb 14, 2026
🤖 AI SafetyPolicy & Regulation

EU AI Act 2026 Compliance Guide: Key Requirements Explained

secureprivacy.ai·Feb 7, 2026