Researchers Use Poetry to Jailbreak AI Models

darkreading

December 2, 2025 at 8:24 PM

Summary

The article discusses a significant increase in the success rates of AI model attacks when prompts are presented in poetic form instead of prose, highlighting a novel method for exploiting vulnerabilities in AI systems. This fivefold increase in attack success raises concerns about the robustness of AI models against creative input formats. The findings suggest that traditional defenses may be inadequate against unconventional attack vectors.

Original Article Summary

When prompts were presented in poetic rather than prose form, attack success rates increased from 8% to 43%, on average — a fivefold increase.

Impact

AI models and systems that process natural language prompts

In the Wild

Unknown

Timeline

Newly disclosed

Remediation

Enhance AI model training to include diverse input formats and improve robustness against creative prompt structures.

Read Original Article

Related Coverage

Attackers launch dual campaign on GlobalProtect portals and SonicWall APIs

Security Affairs

A hacking campaign has been targeting GlobalProtect logins and scanning SonicWall APIs since December 2, 2025. The attack is significant due to its scale, involving over 7,000 IP addresses linked to a German hosting provider, indicating a coordinated effort that poses a serious threat to the security of affected systems.

Dec 6, 2025

Researchers Uncover 30+ Flaws in AI Coding Tools Enabling Data Theft and RCE Attacks

The Hacker News

Over 30 security vulnerabilities have been identified in AI-powered Integrated Development Environments (IDEs), collectively termed IDEsaster. These vulnerabilities combine prompt injection techniques with legitimate features, allowing for potential data exfiltration and remote code execution, posing significant risks to developers and organizations using these tools.

Dec 6, 2025

Your smart home is at risk - 6 ways to protect your devices from attack

Attackers hit React defect as researchers quibble over proof

CyberScoop

The article discusses a React vulnerability that has been reportedly exploited by attackers, leading to a debate among researchers about the existence of concrete evidence for these attacks. While some researchers claim to have seen proof of concepts demonstrating the exploit, others argue that there is insufficient evidence of actual attacks occurring, complicating the response efforts.

Dec 5, 2025

Barts Health NHS discloses data breach after Oracle zero-day hack

BleepingComputer

Barts Health NHS Trust has reported a data breach involving the Clop ransomware group, which exploited a vulnerability in the Oracle E-business Suite software to steal files from their database. This incident highlights the ongoing risks associated with unpatched software vulnerabilities and the potential for significant data loss in healthcare organizations.

Dec 5, 2025

Zero-Click Agentic Browser Attack Can Delete Entire Google Drive Using Crafted Emails

The Hacker News

A new zero-click attack has been identified that targets the Perplexity Comet browser, allowing malicious emails to delete all contents of a user's Google Drive. This technique exploits the automation capabilities of the browser when connected to Gmail and Google Drive, posing a significant risk to users' data security.

Dec 5, 2025