Securing the "YOLO" Era of AI Agents

12 snips

Feb 26, 2026

Jason Martin, Director of Adversarial Research at HiddenLayer, is an AI security researcher who analyzes agent threats. He explains why OpenClaw went viral, how its design and defaults enable risky autonomy, and demos prompt-injection and takeover techniques. He also covers internet-facing instances, agent botnet risks, and concrete mitigation ideas in short, punchy segments.

Ask episode

AI Snips

Chapters

Transcript

Episode notes

ANECDOTE

Viral Growth Produced Rapid Change And Many Forks

OpenClaw exploded in popularity, hitting ~180,000 GitHub stars and 30,000 forks within weeks as people used it for automation and running on Mac minis.
Rapid growth produced hundreds of commits (500 in a week) and a fast-evolving contributor base.

ADVICE

Enforce Access Controls Outside The Model

Prevent models from making critical security decisions by enforcing software-level access controls rather than relying on the model to ask for permission.
For example, make heartbeat.md and other executable instruction files non-writable or require explicit human confirmation enforced by the app, not the model.

INSIGHT

Default Installs Exposed Many Public Instances

Default insecure configurations and vibe-coded development led to many internet-facing OpenClaw instances, creating easy remote access and exposure to unknown vulnerabilities.
Some defaults were patched quickly (one CVE fixed) but thousands remain internet reachable.

Get the Snipd Podcast app to discover more snips from this episode

Get the app