
The rapid integration of AI agents into software development pipelines has promised unprecedented productivity gains. However, this shift has also introduced a new, critical attack vector: Agentjacking. Recent findings from Tenet Security reveal a harrowing reality for developers utilizing tools like Anthropic’s Claude Code. Researchers demonstrated that they could successfully hijack these AI-powered agents in 85% of their tests, utilizing nothing more than a spoofed Sentry error message—no stolen credentials required.
At Creati.ai, we believe it is our responsibility to shed light on how these vulnerabilities impact the broader ecosystem. While Claude Code has been the focal point of these findings, the core mechanism of the attack—system prompt manipulation via external tool integration—is not unique to any single vendor. It is a systemic vulnerability affecting the most popular tools in the DevOps stack, including Datadog, PagerDuty, and Jira.
The attack vector identified by Tenet Security hinges on the AI agent’s reliance on third-party integrations to monitor and manage application health. When a developer builds an app, they often integrate services like Sentry to catch runtime exceptions. The vulnerability occurs because the AI agent trusts the output of these tools as "ground truth."
By simulating a malicious Sentry error, an attacker can manipulate the conversational context of the Claude Code agent. In essence, the agent is tricked into believing that the system is failing, which triggers a diagnostic response. In its attempt to "fix" the problem, the agent follows the attacker's instructions embedded within the fake error logs, potentially granting the attacker remote command execution (RCE) capabilities on the developer's local machine or CI/CD environment.
One of the most alarming aspects of this research is that traditional security perimeters—such as OAuth tokens, API keys, or password-based authentication—are rendered irrelevant. The attack operates at the logical layer of the agent’s decision-making process. Because the AI is designed to be helpful and autonomous, it bypasses the need for the attacker to "log in." It simply follows the malicious instructions provided within the standard output of a trusted external tool.
The vulnerability is widespread because it exploits the integration architecture common to almost all modern developer-facing AI tools. Below is a breakdown of how different components of the software ecosystem are currently exposed to this category of Agentjacking.
| Service Category | Primary Exposure Point | Potential Impact |
|---|---|---|
| AI Development Agents | Claude Code (and similar implementations) | RCE on local dev machines Access to repository secrets |
| Monitoring Tools | Sentry / Datadog | Prompt injection via log messages Exfiltration of system state |
| Incident Management | PagerDuty | Manipulation of alert workflows Unauthorized escalations |
| Project Management | Jira | Unauthorized issue manipulation Cross-platform data access |
While the focus on Claude Code has brought this issue to the forefront, security teams must recognize that this is an inherent design challenge in current LLM-driven tooling. Developers are increasingly granting these agents "full access" to their terminals and local files. When an AI agent has the power to execute shell commands, the trust placed in external diagnostic tools must be zero-trust.
Organizations relying on AI automation must now account for:
To combat the threat of Agentjacking, engineering leaders must shift from a model of "autonomous execution" to "human-in-the-loop validation." At Creati.ai, we advocate for the following defensive measures to harden AI workflows against these vulnerabilities:
The rise of AI-augmented development is inevitable, but the security of our infrastructure depends on our ability to adapt our defensive posture. The Tenet Security disclosure serves as a wake-up call for the entire AI community: when an agent is empowered to fix code, it must also be empowered to question the sources of its own information. As the industry advances, the bridge between AI productivity and cybersecurity must be built with transparency and rigorous verification as its foundation.