Skip to content
Tech News
← Back to articles

What happened after 2k people tried to hack my AI assistant

read original more articles
Why This Matters

This experiment highlights the vulnerabilities and security challenges of AI assistants when exposed to large-scale targeted attacks. Despite the attempts, the assistant successfully protected sensitive information, underscoring the importance of robust security measures in AI systems for both developers and users.

Key Takeaways

What happened after 2,000 people tried to hack my AI assistant

home · about

I built hackmyclaw.com, where anyone could email Fiu, my OpenClaw assistant, and try to make it leak the contents of a secrets.env file.

After reaching the front page of Hacker News, Fiu received more than 6,000 emails from over 2,000 people trying to break it.

The secrets never leaked.

The setup

I enjoy using OpenClaw and Hermes, but I’m also concerned about the security implications. AI assistants have access to emails, calendars, files, and the web. If an attacker can trick your AI into doing something it shouldn’t, that’s bad news.

The goal of the exercise was to get Fiu to reveal the contents of a secrets.env file. Fiu was instructed not to reply to emails (it was too expensive to reply to every email), but it had the ability to do so. Part of the challenge was convincing it to respond.

... continue reading