VFF - The signal in the noise
News

Agentjacking Bypasses All Security Controls in AI Coding Agents

Read original
Share
Agentjacking Bypasses All Security Controls in AI Coding Agents

Tenet Security disclosed a vulnerability class called agentjacking that allows attackers to inject malicious instructions into error data from services like Sentry, which AI coding agents then execute with full developer privileges. Testing achieved an 85% success rate across 100-plus targets, and 2,388 organizations were found with publicly exposed Sentry credentials vulnerable to this attack. The flaw bypasses all traditional security controls because every step in the attack chain is technically authorized.

  • A single crafted Sentry error event can hijack Claude Code, Cursor, and Codex agents to execute attacker code with developer privileges
  • Tenet achieved 85% success rate in controlled testing and identified 2,388 organizations with publicly exposed Sentry credentials
  • The attack bypasses EDR, WAF, IAM, and firewalls because it uses authorized API calls and trusted data sources
  • Only 34% of organizations apply the same security controls to AI agents as to humans, according to Okta/Apprize360 survey

Agentjacking represents a new attack surface that existing security infrastructure cannot detect or prevent. Because AI agents execute commands as authorized users accessing trusted data sources, traditional perimeter and endpoint controls remain blind to the attack. This creates a systemic vulnerability across any organization running AI coding agents connected to monitoring and incident management platforms.

Organizations deploying AI coding agents face a gap between the privileges those agents hold and the security controls monitoring them. One captured Claude Code environment contained live AWS secret access keys and private repository URLs, demonstrating that agentjacking can expose production credentials and source code at scale. The Cloud Security Alliance classified this as a systemic MCP vulnerability class, signaling industry-wide risk.

  • AI coding agents require runtime security controls distinct from traditional user and endpoint security, a gap the industry has not yet addressed
  • Public DSN credentials for services like Sentry, Datadog, PagerDuty, and Jira create injection vectors that agents will trust as legitimate diagnostic output
  • Organizations must audit publicly exposed credentials and restrict what data agents can execute based on, not just who can access the data
  • The gap between agent deployment and security approval is widening, with agent estates doubling while monitoring barely moved according to Gravitee survey

Monitor for runtime security solutions designed specifically for AI agents, as CrowdStrike and others begin addressing the gap in agent-specific controls. Watch for policy changes from Sentry, Datadog, and other MCP-connected services around what data agents can access and execute. Track adoption of security controls that distinguish between developer commands and agent-initiated commands in response to external data.

Share

Subscribe to the newsletter

The latest stories and analysis, delivered to your inbox.

Free. No spam. Unsubscribe any time.

Related stories

Meta Restricts Claude and Codex Use Over Training Data Fears
TrendingNews

Meta Restricts Claude and Codex Use Over Training Data Fears

Meta has implemented strict internal guidelines limiting how its engineers can use Anthropic's Claude and OpenAI's Codex, citing concerns that outputs from these external AI tools could contaminate Meta's own training data. An internal memo instructed teams to pause certain tasks using these models to avoid potential escalations with partner companies. The move reflects Meta's broader effort to reduce dependence on expensive third-party AI coding applications while building internal alternatives.

by Jyoti Mann· The Information
Palantir, NVIDIA Bring Secure Open AI to U.S. Agencies

Palantir, NVIDIA Bring Secure Open AI to U.S. Agencies

Palantir has launched a new intelligent engine using NVIDIA Nemotron open models to serve U.S. government agencies in air-gapped, secure environments. The system allows federal agencies to build custom AI models on their own infrastructure, train on proprietary data, and retain full ownership of model weights. This deployment combines Nemotron's open models with Palantir's Sovereign AI Operating System to enable trusted, auditable AI in sensitive government operations.

by Justin Boitano· NVIDIA Blog (AI)
ChatGPT logs used as evidence in wildfire arson trial

ChatGPT logs used as evidence in wildfire arson trial

Prosecutors in the trial of Jonathan Rinderknecht, charged with arson for setting a New Year's Day 2025 fire that became one of LA's deadliest wildfires, used ChatGPT logs as evidence. The logs showed Rinderknecht had asked the chatbot to generate images of fire, inquired about personal anger, and questioned whether someone could be blamed for a fire they did not physically light. The case marks a notable instance of AI conversation records being introduced in a criminal prosecution.

by Terrence O’Brien· The Verge AI
Chinese AI Matches U.S. Leader in Cybersecurity Capabilities
TrendingNews

Chinese AI Matches U.S. Leader in Cybersecurity Capabilities

Security researchers have found that Z.ai's GLM-2 model matches Anthropic's Mythos in cybersecurity capabilities, particularly in bug-finding tasks, according to reporting by the Wall Street Journal. The finding signals that Chinese AI systems are closing the gap with leading U.S. models in a critical security domain. This development underscores intensifying competitive pressure from China's AI sector on American technology leadership.

by Martin Peers· The Information