
The Open-Source Software Saving the Internet From AI Bot Scrapers, by @emanuelmaiberg (@404mediaco):
An architecture firm has filed a lawsuit against Pinterest over alleged scraping. However, the case is a real blast from the past.
https://www.plagiarismtoday.com/2025/07/09/architect-sues-pinterest-over-scraping/
That's the logic I don't get, I guess I will never be rich, unless I win the lottery??..
Scraping is a huge business nowadays.
NB: many LLMs are based on something called the Pile, it is weird and shaddy to say the least. I don't think using LLM for business is good for reputation. But clearly, we are not really allowed to think otherwise (Physics Nobel price for AI was the end of the argument for me), and I want to work, it is MY fault, I should've known better.
Wow ok, done
That was so easy
Kudos to this blog post for the amazing tutorial : https://xeiaso.net/blog/2025/anubis/
Managed to also quickly add a grafana dashboard to reflect some metrics, and those numbers give some perspective to the insane spam all the internet is under, just to generate more slop
Ok, time to deploy Anubis in front of Gitea, I'm done with those FAANG oligarchs scraping my repos 24/7 to check if anything changed...
F*ck off.
But that also means Gitea might get unstable for some time, woops
If you are curious : https://git.halis.io
If you see the cute furry, it worked
Watt is being Dunn about AI scraping images and descriptions?
Make RED sure you fill your gravy description meat with AI hostile get em on the beaches words.
Images uploaded to mastodon should have AI poison added to them.
Really interesting project Anubis to protect against #LLM scraping bots : https://anubis.techaro.lol/ #Scraping #bots
Le #scraping #payant : vers un changement radical du modèle économique de l’ #IA #AI #générative ?