SYS::ONLINE
Wasteland.
Briefs779
Issues14
SinceFeb 2026
LIVE
▣ Breach META-AI-INSTAGRAM 2026-06-02

Instagram: Meta AI Prompt Injection Account Hijack

"Meta has confirmed that attackers exploited its in-platform AI assistant, Meta AI, to hijack roughly 100 high-value Instagram accounts over the weekend of May 30 to June 1, 2026, including Barack Obama's dormant 2.4…"

Meta has confirmed that attackers exploited its in-platform AI assistant, Meta AI, to hijack roughly 100 high-value Instagram accounts over the weekend of May 30 to June 1, 2026, including Barack Obama's dormant 2.4 million follower White House account and the account of US Space Force Chief Master Sergeant John Bentivegna. The vulnerability, first documented by cybercrime trackers ZachXBT, Dark Web Informer, and impulsive, allowed threat actors to coerce the chatbot into changing passwords on accounts they did not own. Meta told Metro the flaw has since been patched.

What Happened

Between May 30 and June 1, 2026, threat actors discovered that a specifically worded prompt submitted to Meta AI, the assistant integrated across Instagram, Facebook, and WhatsApp, would cause the system to perform account recovery actions against arbitrary target accounts rather than the requesting user's own profile. Attackers used this to reset passwords and take over targets at scale. Live takeovers were documented on Instagram itself, with bystander Bahram Sahbani capturing hijacks happening on stream on May 31. Cybersecurity researchers estimate around 100 high-value accounts were compromised, with several already trafficked through underground resale markets. Compromised profiles include Barack Obama's archived White House account, which attackers used to post an image captioned "White House is under Shiites' control," and CMSgt John Bentivegna's account, which was flooded with anti-American and pro-Iranian content.

What Was Taken

The attack targeted account control rather than bulk data exfiltration. Confirmed losses include:

No evidence has been published indicating that direct messages or private media were extracted in bulk, though that risk window remained open for any account during its takeover period.

Why It Matters

This is one of the first publicly confirmed mass account compromises driven primarily by prompt injection of a vendor's own integrated AI assistant rather than by traditional credential theft, SIM swap, or session hijacking. The AI agent held privileged account recovery capabilities and trusted user-supplied natural language as authorization, collapsing the boundary between a chat interface and an identity management system. The hijack of a former US president's archive and an active senior military officer's account demonstrates that AI-mediated account recovery is now a viable path to influence operations and reputational attacks against high-profile targets. It also exposes that automated, AI-driven support pipelines, which victim Hamza (@zv) describes as sending users "in circles," can leave even Meta Verified, identity verified users without a human escalation path during an active incident.

The Attack Technique

According to reporting and screenshots circulated on Telegram and Instagram, the workaround relied on a specifically worded prompt submitted to Meta AI that caused the assistant to execute a password change against a third party account. In effect, the assistant treated attacker-controlled instructions as authorized account recovery requests, bypassing the identity binding between the chatting user and the account being modified. This is a classic confused deputy pattern enabled by prompt injection: the AI held the privilege to trigger account recovery, but lacked a hard, out-of-band check that the requester actually owned the target handle. Short, high-value usernames appear to have been prioritized by attackers, suggesting target selection was driven by underground resale value and visibility rather than by victim-specific reconnaissance. Meta has since patched the flow.

What Organizations Should Do

  1. Inventory every AI assistant, copilot, or agent your platform exposes and enumerate the privileged actions it can perform, especially anything touching authentication, account recovery, password reset, MFA enrollment, or session issuance.
  2. Enforce hard, deterministic authorization checks outside the LLM. The model can draft a recovery request, but a non-AI policy layer must verify that the authenticated session owns the target account before any state change is committed.
  3. Treat all user input to an AI assistant as untrusted instructions, not commands. Apply prompt injection defenses, strip or neutralize tool-invocation language in user content, and constrain tool use to the current user's own resources.
  4. Add anomaly detection on AI-mediated account recovery events. Spikes in password resets initiated via chatbot, especially across short-handle or high-follower accounts, should page a human.
  5. Provide a real, human-staffed escalation path for verified users during active takeover incidents. Automated support loops materially extended dwell time for victims in this campaign.
  6. For high-value principals (executives, public officials, dormant brand accounts), disable AI-driven self-service recovery entirely and require offline identity verification.

Sources: Hackers trick Meta AI into handing over Instagram accounts - including Barack Obama's | News Tech | Metro News