Tech News

Show HN: Stop AI scrapers from hammering your self-hosted blog (using porn)

Fuzzy Canary

AI companies are scraping everyone's sites for training data. If you're self-hosting your blog, there's not much you can do about it, except maybe make them think your site contains content they won't want. Fuzzy Canary plants invisible links (to porn websites...) in your HTML that trigger scrapers' content safeguards.
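The idea of planting invisible links can be sketched roughly as follows. This is a minimal illustration only, not Fuzzy Canary's actual implementation: the function names (buildCanaryMarkup, injectCanary), the placeholder URLs, and the choice of a display:none wrapper are all assumptions made for the example.

```typescript
// Hypothetical sketch: NOT the package's real code or URL list.
const CANARY_URLS: string[] = [
  "https://example.com/placeholder-a",
  "https://example.com/placeholder-b",
];

// Build a block of links that human visitors never see but that is
// still present in the markup a scraper downloads.
function buildCanaryMarkup(urls: string[]): string {
  const links = urls.map((url) => `<a href="${url}">${url}</a>`).join("");
  return `<div style="display:none" aria-hidden="true">${links}</div>`;
}

// Insert the hidden block right after the opening <body> tag.
// If no <body> tag is found, the HTML is returned unchanged.
function injectCanary(html: string, urls: string[] = CANARY_URLS): string {
  return html.replace(/<body[^>]*>/i, (openTag) => openTag + buildCanaryMarkup(urls));
}

const page = "<html><body><h1>My blog</h1></body></html>";
console.log(injectCanary(page));
```

A visitor's browser renders the page as before, while anything that parses the raw markup encounters the extra links.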

Getting Started

Installation

npm i @fuzzycanary/core
# or
pnpm add @fuzzycanary/core

Usage

There are two ways to use it: client-side or server-side. Use server-side if you can. It works better because the canary is in the HTML from the start, so scrapers that don't run JavaScript will still see it.
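To make the difference concrete, here is a small sketch of what a scraper that never executes JavaScript can find in each case. The markup, the /canary.js path, the URL, and the rawHtmlContainsLink helper are hypothetical placeholders for illustration, not output of the package.

```typescript
// Hypothetical illustration; none of this markup comes from Fuzzy Canary itself.
const CANARY_URL = "https://example.com/placeholder";

// Server-side: the link is already in the HTML the server sends.
const serverRendered =
  `<html><body><a href="${CANARY_URL}" style="display:none"></a><p>Post</p></body></html>`;

// Client-side: the raw HTML only carries a script; the link exists only after it runs.
const clientRendered =
  `<html><body><script src="/canary.js"></script><p>Post</p></body></html>`;

// Models a scraper that downloads HTML and never runs JavaScript:
// it can only find links that are literally present in the response body.
function rawHtmlContainsLink(rawHtml: string, url: string): boolean {
  return rawHtml.includes(`href="${url}"`);
}

console.log(rawHtmlContainsLink(serverRendered, CANARY_URL)); // true
console.log(rawHtmlContainsLink(clientRendered, CANARY_URL)); // false
```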

Server-side (recommended):

If you're using a React-based framework (Next.js, Remix, etc.), add the component to your root layout:

// Next.js App Router: app/layout.tsx
// Remix: app/root.tsx
// Other React frameworks: your root layout file
import { Canary } from '@fuzzycanary/core/react'

export default function RootLayout({ children }) {
  return (
    <html>
      <body>
        {/* Render the canary before the page content */}
        <Canary />
        {children}
      </body>
    </html>
  )
}
