VFF - The signal in the noise
News

Anthropic Skill Scanners Miss Test File Execution Risk

Read original
Share
Anthropic Skill Scanners Miss Test File Execution Risk

Anthropic Skill scanners from Cisco, Snyk, and others pass malicious code bundled in test files because they only inspect the agent execution surface, not the developer toolchain. Gecko Security researcher Jeevan Jutla demonstrated that when developers install Skills via npx Skills add, test files like .test.ts execute with full filesystem and credential access through standard JavaScript test runners, bypassing all public scanners. This attack vector sits outside every scanner's detection model despite two major audits documenting widespread vulnerabilities in Anthropic Skills marketplaces.

  • Malicious .test.ts files in Anthropic Skills execute with full local permissions through Jest, Vitest, and Mocha test runners, but no public scanner inspects them
  • Gecko Security demonstrated the attack flow: installed Skills land in shared directories, propagate to teammates, and sit outside scanner detection surfaces entirely
  • Two large-scale audits found 26.1% of 31,132 Skills contained vulnerabilities and 13.4% of 3,984 Skills had critical-level issues, but neither measured test file execution risk
  • Cisco's AI Agent Security Scanner, Snyk Agent Scan, and VirusTotal Code Insight all share the same structural blind spot, targeting agent interaction layers rather than developer toolchain layers

This reveals a fundamental mismatch between threat model and detection scope in AI agent security. Scanners are optimized to catch prompt injection and agent-layer attacks, but the Skill installation and execution model creates a separate attack surface through developer tooling that sits completely outside their purview. As Anthropic Skills become more widely adopted across teams, this gap becomes a systemic risk.

Teams deploying Anthropic Skills face credential theft and supply chain compromise through test files that execute silently during npm test or IDE auto-run, with no warning from any major scanner. For operators managing shared Skill repositories, this means malicious code can propagate to every teammate who clones the repo, with full access to deployment tokens and cloud credentials. Skill marketplace operators and tool vendors need to either expand scanner scope or document this limitation explicitly.

  • Current Anthropic Skill scanners measure the wrong execution surface, creating false confidence in marketplace safety despite documented vulnerabilities
  • Test file execution represents a trust-on-install attack vector similar to npm postinstall scripts and pytest plugins, but with higher blast radius due to shared team directories
  • Disclosure of this gap occurred after two major audits, suggesting scanners may have other blind spots not yet documented by security researchers

Monitor whether Anthropic, Cisco, Snyk, and other scanner vendors update their tools to inspect bundled test files and other non-agent execution surfaces. Watch for whether Skill marketplace operators implement additional vetting or sandboxing. Track whether this vulnerability class appears in real-world Skill supply chain incidents, which would validate the practical risk.

Share

Subscribe to the newsletter

The latest stories and analysis, delivered to your inbox.

Free. No spam. Unsubscribe any time.

Related stories

DeepMind Publishes AI Control Roadmap for Agent Security

DeepMind Publishes AI Control Roadmap for Agent Security

Google DeepMind has published an AI Control Roadmap focused on securing internal systems that deploy AI agents, combining traditional safeguards with real-time monitoring approaches. The roadmap addresses the challenge of maintaining control over increasingly autonomous AI systems as they take on more complex tasks. This represents a shift toward proactive security frameworks designed to prevent misuse or unintended behavior in production AI agent deployments.

· Google Deepmind
G7 Leaders Fear U.S. Control Over AI Access
TrendingNews

G7 Leaders Fear U.S. Control Over AI Access

French President Macron and Indian PM Modi raised concerns at the G7 summit that the U.S. could unilaterally cut off access to American AI systems, a risk underscored by a recent Anthropic blackout. The incident highlights growing geopolitical tensions over AI dependency and control, with major economies worried about relying on U.S.-based AI providers that could be subject to American policy decisions or technical failures.

by Rebecca Bellan· TechCrunch AI
HPE and NVIDIA Expand AI Factory for Production Agents

HPE and NVIDIA Expand AI Factory for Production Agents

NVIDIA and HPE are expanding their AI Factory partnership to support agentic AI in production environments. New offerings include the NVIDIA Vera CPU for agent workloads, the NVIDIA Agent Toolkit integrated with HPE Private Cloud AI, and NVIDIA Confidential Computing across the full HPE AI Factory portfolio. The Vera CPU will ship in 2027 with HPE ProLiant servers, while agent governance and security capabilities are available now.

by Chris Marriott· NVIDIA Blog (AI)
U.S. Targets Anthropic on Foreign AI Talent, Sparking Industry Concerns
TrendingNews

U.S. Targets Anthropic on Foreign AI Talent, Sparking Industry Concerns

The Trump administration warned Anthropic on Friday that it needs a license to provide its latest AI models to foreign persons, including its own employees. The move has triggered concerns across the AI industry that the government is targeting foreign talent reliance. OpenAI's Chief Strategy Officer Jason Kwon said the company has told the government that building competitive AI requires global talent and that the situation remains fluid with many unknowns.

by Erin Woo· The Information