AI Defendo — When agents fail, the behavior looks legitimate.

AI Defendo tracks every agent across every session, evaluating identity, intent, behavior, memory, context, and posture before the agent accesses data, invokes tools, or executes actions: one verdict per turn.
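As a rough illustration of "one verdict per turn", the sketch below assumes each of the six dimensions yields a risk score between 0.0 and 1.0. The names `TurnSignals` and `verdict_for_turn` and the 0.7 threshold are hypothetical and chosen for this example only; they are not AI Defendo's API.

```python
from dataclasses import dataclass, fields


@dataclass(frozen=True)
class TurnSignals:
    """Hypothetical per-turn risk scores: 0.0 (benign) to 1.0 (hostile)."""
    identity: float
    intent: float
    behavior: float
    memory: float
    context: float
    posture: float


def verdict_for_turn(signals: TurnSignals, threshold: float = 0.7) -> tuple[str, list[str]]:
    """Return a single verdict for the turn and the dimensions that triggered it."""
    # A dimension is flagged when its score meets or exceeds the (assumed) threshold.
    flagged = [f.name for f in fields(signals) if getattr(signals, f.name) >= threshold]
    return ("block" if flagged else "allow", flagged)
```

A turn with a poisoned memory and a suspicious context would come back as `("block", ["memory", "context"])`, which keeps the verdict single while still naming the dimensions behind it.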

Your stack tells you the agent passed. Your customers tell you it didn't.

No code changes · Six awareness dimensions · Sub-200ms decisions

i.

EchoLeak

CRIT

Microsoft 365 Copilot · CVE-2025-32711

input → output exfiltration

Zero-click email triggered Copilot to embed an exfiltration URL in its response. ~$200M impact across 160+ orgs.

indirect injection · context · identity

NVD →
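One way to reason about this class of failure is to inspect the response for outbound URLs before it is rendered. The sketch below is an assumption-laden example, not a description of how Copilot or AI Defendo works: `untrusted_urls` and the allowlist are hypothetical, and hosts are compared by exact match only.

```python
import re
from urllib.parse import urlparse

# Rough URL matcher; stops at whitespace and common closing delimiters.
URL_RE = re.compile(r"https?://[^\s)\]>\"']+")


def untrusted_urls(response_text: str, allowed_hosts: set[str]) -> list[str]:
    """Return URLs in the response whose host is not on the allowlist."""
    hits = []
    for url in URL_RE.findall(response_text):
        host = (urlparse(url).hostname or "").lower()
        if host not in allowed_hosts:  # exact host match; subdomains are not implied
            hits.append(url)
    return hits
```

A response containing a markdown image that points at an unknown host would be returned by this check, while links to allowlisted hosts pass through.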

ii.

Replit Agent

CRIT

Production database · public report

reasoning → tools compliance

User said "code freeze." Agent dropped tables anyway. Records for 1,206 executives and 1,196 companies deleted. 4,000 fake users fabricated.

trajectory drift · behavior · intent

SaaStr →
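The gap between a stated instruction ("code freeze") and a tool call can be sketched as a gate that sits in front of the database tool. Everything here is hypothetical: `allow_sql` and the keyword pattern are illustrative, and a real gate would need proper SQL parsing, since this one only looks at the first keyword of a statement.

```python
import re

# Hypothetical pattern for statements treated as destructive in this sketch.
DESTRUCTIVE_SQL = re.compile(r"^\s*(drop|truncate|delete|alter)\b", re.IGNORECASE)


def allow_sql(statement: str, code_freeze: bool) -> bool:
    """Return False for destructive SQL while a code freeze is in effect."""
    if code_freeze and DESTRUCTIVE_SQL.match(statement):
        return False
    return True
```

With the freeze flag set, `DROP TABLE executives;` is refused while a plain `SELECT` still runs.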

iii.

SpAIware

CRIT

ChatGPT (OpenAI) · Embrace The Red

memory → output exfiltration

Cross-session attack. Memory poisoned in one chat — every chat after silently exfiltrated user data through legitimate APIs.

memory poisoning · memory · context

Disclosure →

iv.

ForcedLeak

CVSS 9.4

Salesforce Agentforce

input → output exfiltration

Web-to-Lead form hijacked Agentforce into exfiltrating CRM records. An expired domain still in the CSP allowed the egress.

indirect injection · context · posture

Disclosure →
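The posture problem in this incident, a lapsed domain still trusted by the CSP, can be illustrated with a small audit. This is a sketch under assumptions: `csp_hosts` and `lapsed_csp_hosts` are hypothetical helpers, the policy string and domains are made-up examples, and the set of lapsed domains is assumed to come from a separate registration check that is out of scope here.

```python
def csp_hosts(policy: str) -> set[str]:
    """Extract host sources from a Content-Security-Policy header value."""
    hosts = set()
    for directive in policy.split(";"):
        parts = directive.split()
        for source in parts[1:]:  # parts[0] is the directive name
            if source == "*" or source.startswith("'") or source.endswith(":"):
                continue  # skip wildcards, keywords like 'self', schemes like data:
            # Strip scheme, path, and port, then any leading "*." wildcard label.
            host = source.split("://", 1)[-1].split("/", 1)[0].split(":", 1)[0]
            hosts.add(host.lower().removeprefix("*."))
    return hosts


def lapsed_csp_hosts(policy: str, lapsed_domains: set[str]) -> set[str]:
    """Return CSP hosts that equal or sit under a domain known to have lapsed."""
    return {
        h for h in csp_hosts(policy)
        if any(h == d or h.endswith("." + d) for d in lapsed_domains)
    }
```

Running the audit on a schedule, rather than once at deployment, is the point: an allowlist entry that was safe when added can become an egress path later.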

v.

Slack AI exfiltration

HIGH

Slack AI · PromptArmor disclosure

input → output exposure

Public-channel injection made Slack AI surface private-channel content to a low-trust user. Slack's response: "intended behavior."

indirect injection · identity · context

Disclosure →

vi.

Now Assist

CRIT

ServiceNow · AppOmni research

reasoning → tools exfiltration

Cross-agent escalation. Low-privilege agent tricked a higher-privilege one into exporting case files externally. ServiceNow: "works as intended."

trajectory drift · identity · intent

Disclosure →
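Cross-agent escalation of this kind can be framed as a delegation rule: a request carries the lowest privilege of any agent in its chain. The sketch below is illustrative only. The privilege levels, `effective_privilege`, and `may_invoke` are assumptions for this example and do not describe ServiceNow's or AI Defendo's implementation.

```python
# Hypothetical privilege ladder for this sketch.
PRIVILEGE = {"low": 0, "medium": 1, "high": 2}


def effective_privilege(chain: list[str]) -> str:
    """Lowest privilege among all agents in a (non-empty) request chain."""
    return min(chain, key=PRIVILEGE.__getitem__)


def may_invoke(chain: list[str], required: str) -> bool:
    """Allow a tool only if every agent in the chain holds the required level."""
    return PRIVILEGE[effective_privilege(chain)] >= PRIVILEGE[required]
```

Under this rule, a low-privilege agent asking a high-privilege agent to export case files is evaluated as a low-privilege request, so the export is refused.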

Six incidents.
Every action looked legitimate.

EchoLeak

Replit Agent

SpAIware

ForcedLeak

Slack AI exfiltration

Now Assist

Why six dimensions. Why these six.

Production database · July 2025

In four acts.

Find every agent. Every MCP server. Every place AI touches your data.

Watch every turn — input, reasoning, tool call, memory, output.

The Awareness Engine

Block. Coach. Quarantine. Alert. Before the action commits.
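The idea of acting before the action commits can be sketched as a wrapper that picks an enforcement action first and only then decides whether to run the commit. The thresholds, the severity ordering of the four actions, and the names `choose_action` and `run_guarded` are all assumptions made for this example; the page does not define them.

```python
from enum import Enum
from typing import Any, Callable


class Action(Enum):
    ALLOW = "allow"
    ALERT = "alert"
    COACH = "coach"
    QUARANTINE = "quarantine"
    BLOCK = "block"


def choose_action(risk: float) -> Action:
    """Map a risk score in [0, 1] to an action (thresholds are hypothetical)."""
    if risk >= 0.9:
        return Action.BLOCK
    if risk >= 0.7:
        return Action.QUARANTINE
    if risk >= 0.5:
        return Action.COACH
    if risk >= 0.3:
        return Action.ALERT
    return Action.ALLOW


def run_guarded(risk: float, commit: Callable[[], Any]) -> tuple[Action, Any]:
    """Decide first; call commit() only when the action does not stop the turn."""
    action = choose_action(risk)
    if action in (Action.BLOCK, Action.QUARANTINE):
        return action, None  # commit is never invoked
    return action, commit()
```

The design choice worth noting is the order of operations: the decision is made on the pending action, so a blocked turn leaves no side effects to roll back.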

Across the full AI surface.

Shadow AI Discovery

AI Workload Security

AI Risk Posture

Agentic Runtime Security

Agentic Identity Gateway

Concrete prevention, by the numbers.

Secure your agent infrastructure.
