Published on: 2025-09-21 19:00:05
The internet: a once wacky world of strange forums and obscure memes, a tool to harness the sum total of human knowledge at a moment's notice. At least, that was before AI slop ruined everything. To feed data-hungry AI models, companies and individuals are deploying a growing army of AI "web crawlers," bots tasked with sifting the internet for text, pictures, and other data. Once set loose, these bots bog down web servers, destroy search engines, and flood rival crawlers with AI babble. Thanks
Keywords: ai bots cloudflare crawlers internet
Find related items on AmazonPublished on: 2025-10-06 22:50:49
Three days ago, Drew DeVault - founder and CEO of SourceHut - published a blogpost called, "Please stop externalizing your costs directly into my face", where he complained that LLM companies were crawling data without respecting robosts.txt and causing severe outages to SourceHut. I went, "Interesting!", and moved on. Then, yesterday morning, KDE GitLab infrastructure was overwhelmed by another AI crawler, with IPs from an Alibaba range; this caused GitLab to be temporarily inaccessible by KD
Keywords: ai bots companies crawlers user
Find related items on AmazonPublished on: 2025-10-11 14:53:00
SourceHut, an open source git-hosting service, says web crawlers for AI companies are slowing down services through their excessive demands for data. "SourceHut continues to face disruptions due to aggressive LLM crawlers," the biz reported Monday on its status page. "We are continuously working to deploy mitigations. We have deployed a number of mitigations which are keeping the problem contained for now. However, some of our mitigations may impact end-users." SourceHut said it had deployed N
Keywords: ai crawlers openai said web
Find related items on AmazonGo K’awiil is a project by nerdhub.co that curates technology news from a variety of trusted sources. We built this site because, although news aggregation is incredibly useful, many platforms are cluttered with intrusive ads and heavy JavaScript that can make mobile browsing a hassle. By hand-selecting our favorite tech news outlets, we’ve created a cleaner, more mobile-friendly experience.
Your privacy is important to us. Go K’awiil does not use analytics tools such as Facebook Pixel or Google Analytics. The only tracking occurs through affiliate links to amazon.com, which are tagged with our Amazon affiliate code, helping us earn a small commission.
We are not currently offering ad space. However, if you’re interested in advertising with us, please get in touch at [email protected] and we’ll be happy to review your submission.