Blog

The latest about AI’s
security blind spots

Beware of the tiny agents in your browser: a Chrome Prompt API case study

Chrome now ships an on-device language model that any web page or extension can call. We built two minimal browser agents on top of it and showed that a single comment in a blog post can drive a cross-tab bank transfer or exfiltrate the contents of every other open tab to an attacker server. No CVE. No malicious extension. The model behaved exactly as designed.

Matan Levi, PhD

Beware of the tiny agents in your browser: a Chrome Prompt API case study

May 11, 2026

12 min

Matan Levi, PhD

Managed Agent Platforms: A Security Audit

May 5, 2026

15 min

Three vendors now ship managed agent platforms aimed at the same enterprise buyer: Anthropic, OpenAI, and Google. They are trying to solve the same problem, but their documented security boundaries are not the same shape. We walk each vendor's public docs and ask the question a security team will eventually have to answer.

Matan Levi, PhD

Agent sandboxes can break: a Gemini CLI permission vulnerability

May 5, 2026

5 min

Coding agents like Claude Code, Codex, and Gemini CLI rely on permission perimeters to keep their blast radius small. We found that these perimeters are not always hermetic. Here is a vulnerability we discovered in Gemini CLI that let a benign auto-approved command turn into a reverse shell.

Nitsan Bar, PhD

Eytan Schulman

Anatomy of an Agent Harness: What We Learned from Claude Code's 512,000-Line Leak

April 16, 2026

10 min

On March 31, Anthropic accidentally shipped Claude Code with a 59.8MB source map still attached, exposing 512,000 lines of unobfuscated TypeScript. The real story is what those lines reveal about what a production-grade agent harness actually looks like under the hood.

Matan Levi, PhD

The Invisible Execute: How Claude Code’s Advanced Skill Patterns Become Blind Spots

March 10, 2026

5 min

When we teach an AI to code, we expect it to see everything it’s doing. That assumption breaks down in Claude Code’s skills system, where shell commands exfiltrate credentials before the AI receives a single token, and hooks silently surveil every tool call the AI makes. In both cases, the AI has no idea any of it is happening.

Matan Levi, PhD

When AI Agents Browse Unprotected: A Security Audit of OpenClaw × Moltbook

February 19, 2026

15 minutes

*A technical security analysis of the OpenClaw ↔ Moltbook integration, February 2026*

Matan Levi, PhD

Designing safe production AI agents via tool scoping: the outreach agent use-case

February 19, 2026

10 minutes

It’s tempting to build an “agent” by handing it every tool it might need: a browser, a search API, an email client, and maybe even direct database access.

Nitsan Bar, PhD

The latest about AI’ssecurity blind spots

Beware of the tiny agents in your browser: a Chrome Prompt API case study

The latest about AI’s
security blind spots