bolha.us is one of the many independent Mastodon servers you can use to participate in the fediverse.
We're a Brazilian IT community. We love IT/Dev/DevOps/Cloud, but we also love to talk about life, the universe, and more.


#cloudflare

10 posts · 9 participants · 0 posts today

@DoctorBrodsky @woe2you @miah Given that #Quad9 bowed to the #Contentmafia and censored #DNS requests, I'll continue to recommend using #OpenNIC's servers instead:

94.103.153.176 & 2a02:990:219:1:ba:1337:cafe:3 as well as
144.76.103.143 & 2a01:4f8:192:43a5::2

  • If you only add a single #IPv4 address, no #IPv6 resolution will take place over said provider, or worse, you may end up with no IPv6 connectivity at all...

I merely retain Quad9 on said list for archival purposes. I yeeted #CloudFlare, aka #ClownFlare, since they are a #RogueISP!

GitHub · lists.d/dns.servers.list.tsv at a4a7ccf70d8504ebbffd7e5fbcd5630294860434 · greyhat-academy/lists.d: List of useful things.
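
The caveat in the post above (list both an IPv4 and an IPv6 address per provider) can be checked mechanically. Here is a minimal Python sketch, using only the standard `ipaddress` module and the four addresses quoted in the post; the function name `missing_families` is made up for illustration, and the check says nothing about whether the servers are reachable.

```python
import ipaddress

# Resolver addresses copied from the post above.
RESOLVERS = [
    "94.103.153.176",
    "2a02:990:219:1:ba:1337:cafe:3",
    "144.76.103.143",
    "2a01:4f8:192:43a5::2",
]


def missing_families(addresses):
    """Return the IP versions (4 and/or 6) that have no entry in `addresses`.

    ipaddress.ip_address raises ValueError on a malformed address, so typos
    in the list surface immediately.
    """
    present = {ipaddress.ip_address(text).version for text in addresses}
    return sorted({4, 6} - present)


if __name__ == "__main__":
    gaps = missing_families(RESOLVERS)
    if gaps:
        print("resolver list has no IPv%s entry" % ", IPv".join(map(str, gaps)))
    else:
        print("resolver list covers both IPv4 and IPv6")
```

An empty result means both families are covered; a list holding only the first IPv4 address would report version 6 as missing.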

I turned this on tonight. Time to get some popcorn. I hope I'll be able to identify this happening in the logs.

Trapping misbehaving bots in an AI Labyrinth: blog.cloudflare.com/ai-labyrin

We've seen an explosion of new crawlers used by AI companies to scrape data for model training. AI Crawlers generate more than 50 billion requests to the #Cloudflare network every day, or just under 1% of all web requests we see. While Cloudflare has several tools for identifying and blocking unauthorized AI crawling, we have found that blocking malicious bots can alert the attacker that you are on to them, leading to a shift in approach, and a never-ending arms race. So, we wanted to create a new way to thwart these unwanted bots, without letting them know they’ve been thwarted.

When we detect unauthorized crawling, rather than blocking the request, we will link to a series of AI-generated pages that are convincing enough to entice a crawler to traverse them. But while real looking, this content is not actually the content of the site we are protecting, so the crawler wastes time and resources.

As an added benefit, AI Labyrinth also acts as a next-generation honeypot. No real human would go four links deep into a maze of AI-generated nonsense. Any visitor that does is very likely to be a bot, so this gives us a brand-new tool to identify and fingerprint bad bots, which we add to our list of known bad actors.

The Cloudflare Blog · Trapping misbehaving bots in an AI Labyrinth: How Cloudflare uses generative AI to slow down, confuse, and waste the resources of AI Crawlers and other bots that don’t respect “no crawl” directives.
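
The honeypot idea in the quoted text (nobody human goes four links deep into decoy pages) can be illustrated with a small Python sketch. This is not Cloudflare's implementation, which the post does not describe. The `/labyrinth/` path prefix, the `(client_id, path)` record shape, and the function name are all assumptions, and counting distinct decoy pages per client is only a rough stand-in for real link depth.

```python
from collections import defaultdict

DECOY_PREFIX = "/labyrinth/"  # hypothetical path prefix for decoy pages
DEPTH_THRESHOLD = 4           # "four links deep", as in the quoted text


def flag_deep_visitors(requests, threshold=DEPTH_THRESHOLD):
    """Return the client IDs that fetched at least `threshold` distinct decoy pages.

    `requests` is an iterable of (client_id, path) pairs. Distinct decoy
    pages per client is used as an approximation of how deep the client
    wandered into the maze.
    """
    seen = defaultdict(set)
    for client_id, path in requests:
        if path.startswith(DECOY_PREFIX):
            seen[client_id].add(path)
    return sorted(c for c, pages in seen.items() if len(pages) >= threshold)
```

A client that opens one decoy page by accident stays below the threshold; one that keeps following decoy links ends up in the returned list and could then be fingerprinted or blocked.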

I know Cloudflare sucks, but this is a good idea and probably the way forward with the AI bs.

"Instead of simply blocking bots, Cloudflare's new system lures them into a 'maze' of realistic-looking but irrelevant pages, wasting the crawler's computing resources. The approach is a notable shift from the standard block-and-defend strategy used by most website protection services. Cloudflare says blocking bots sometimes backfires because it alerts the crawler's operators that they've been detected.

'When we detect unauthorized crawling, rather than blocking the request, we will link to a series of AI-generated pages that are convincing enough to entice a crawler to traverse them,' writes Cloudflare. 'But while real looking, this content is not actually the content of the site we are protecting, so the crawler wastes time and resources.'"

arstechnica.com/ai/2025/03/clo

An illustration of toy robots trapped in a maze, viewed from overhead.
Ars Technica · Cloudflare turns AI against itself with endless maze of irrelevant facts · By Benj Edwards

Not to toot #Cloudflare's horn - especially since internet infrastructure shouldn't be concentrated in a single company (seize the means of computing) - but if they can prevent #AI from scraping the entire #web, that would be nice.

It's not a good idea to let that happen. I mean, at some point it's going to be hard to differentiate a regular user from a bot, but still.

Cloudflare’s Free AI Labyrinth Distracts Crawlers That Could Steal Website Content to Feed AI
eweek.com/news/cloudflare-ai-l

@misty

eWEEK · Cloudflare’s Free AI Labyrinth Distracts Crawlers That Could Steal Website Content: Cloudflare used generative AI to build premade websites that can be embedded as an AI Labyrinth in protected websites, sending crawlers on a wild goose chase.

Over the past 24 hours, #Facebook has been the most determined #AI crawler trying to scrape data from this server, by far. They never succeed. #Cloudflare always blocks them for being one of the unwanted AI bots.

What is interesting though is its determination to read one particular user invite. I wonder how it picks the other posts it wants to read.
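
For anyone wanting to see the same pattern in their own logs, here is a rough Python sketch that tallies requests per crawler and per path. The user-agent substrings and the `(path, user_agent)` record shape are assumptions; check them against what your server or Cloudflare logs actually record before relying on the counts.

```python
from collections import Counter

# Assumed user-agent substrings for AI-related crawlers (illustrative, not exhaustive).
CRAWLER_TOKENS = {
    "meta-externalagent": "Meta/Facebook",
    "facebookexternalhit": "Meta/Facebook",
    "gptbot": "OpenAI",
}


def tally_crawlers(records):
    """Count requests per crawler label and per (label, path).

    `records` is an iterable of (path, user_agent) pairs. Requests whose
    user agent matches none of the tokens are ignored.
    """
    by_crawler = Counter()
    by_path = Counter()
    for path, user_agent in records:
        ua = user_agent.lower()
        for token, label in CRAWLER_TOKENS.items():
            if token in ua:
                by_crawler[label] += 1
                by_path[(label, path)] += 1
                break  # count each request once
    return by_crawler, by_path
```

`by_crawler.most_common(1)` then names the most persistent crawler, and `by_path.most_common()` shows which pages it keeps coming back to, such as a single invite link.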