Skip to content
Tech News
← Back to articles

I prompted ChatGPT, Claude, Perplexity, and Gemini and watched my Nginx logs

read original get Nginx Log Analyzer Tool → more articles
Why This Matters

This article highlights the importance of understanding how AI services interact with websites by analyzing nginx logs. It reveals that AI providers often fetch pages directly from the origin, separate from human clickthroughs, which has implications for accurately measuring AI-driven traffic and engagement. Recognizing these differences helps the tech industry better interpret AI traffic data and refine analytics strategies.

Key Takeaways

Last month I wanted a straight answer to a question most AI visibility write-ups dodge. When someone asks ChatGPT, Claude, Perplexity, Gemini, or Google AI Mode about a site I own, does that AI product actually fetch the page, or does it answer from an index it built earlier?

The way to get a straight answer was the unfashionable one. Read the nginx access log.

This post walks through what the logs captured across the five AI products I tested, what they did not capture, and what that difference lets a product safely track. Every claim in the sections that follow is either something the server logged or a structural fact documented by the vendor.

Two signals, not one

A marketer saying “my site got traffic from AI” could mean two different things, and the logs prove they are different things.

Provider-side fetch. The AI provider itself hits my origin. The request usually arrives with a dedicated user-agent token, usually with no referrer, and usually inside a short burst while the model is deciding which page to cite.

The AI provider itself hits my origin. The request usually arrives with a dedicated user-agent token, usually with no referrer, and usually inside a short burst while the model is deciding which page to cite. Real clickthrough visit. A human reads the AI answer, clicks a citation link, and arrives as a normal browser. Chrome-shaped user-agent, normal cookies, the AI product as the referrer.

Collapsing these into one AI-traffic number papers over the most useful distinction in the data. One is the model reaching out to read you. The other is a human reading you because the model pointed. Different lever, different measurement, different copy.

How I instrumented the experiment

Nothing exotic. A custom nginx log format that captures the bits the default combined format compresses out, plus a tail -F next to a browser tab in each AI product.

... continue reading