AI companies, and AI scrapers in particular, are a cancer destroying what's left of the WWW.
I was hit with a pretty substantial botnet "distributed scraping" attack yesterday.
- About 400,000 different IP addresses over about 3 hours
- Mostly residential IP addresses
- Valid and unique user agents and referrers
- Each IP address would make only a few requests with a long delay in between requests
It would hit the server hard until the server became slow to respond, then back off for about 30 seconds, then hit hard again. I was able to block most of the requests with a combination of user agent and referrer patterns, though some legitimate users may have been blocked as well.
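Pattern-based blocking like that can be sketched as a small request filter. This is a minimal illustration, not the actual rules used; the patterns below are hypothetical placeholders that would have to be tuned to the real attack traffic:

```python
import re

# Hypothetical patterns -- the real ones depend on what the attack logs show.
UA_PATTERNS = [
    re.compile(r"python-requests|curl|scrapy", re.I),
    re.compile(r"HeadlessChrome", re.I),
]
REFERRER_PATTERNS = [
    # Example of a referrer domain you might see repeated across the botnet.
    re.compile(r"^https?://(?:www\.)?referrer-farm\.example/", re.I),
]

def should_block(user_agent: str, referrer: str) -> bool:
    """Return True if the request matches any known-bad UA or referrer pattern."""
    if any(p.search(user_agent) for p in UA_PATTERNS):
        return True
    if any(p.search(referrer) for p in REFERRER_PATTERNS):
        return True
    return False
```

In practice this logic would live in the web server or a reverse proxy rather than application code, so blocked requests are dropped before they touch the backend.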
The attack was annoying, but the even bigger problem is that the data on this website is under license: we have to pay for it, and it's not cheap. We are able to cover that cost (barely) with advertising revenue and some subscriptions.
If everyone gets this data through their "agent" or a scraper, that means no advertising revenue, and soon enough no more website to scrape: jobs lost, nowhere for scrapers to get the data, nowhere for legit users to get it for free, and so on.
Thanks for sharing the perspective here. I think a lot of folks on HN have rightly said that many of the problems with the modern internet are due to the ad-supported business model. I don't think we were ever going to move away from it voluntarily -- too many people support it, even if they grumble about it.
But maybe (and likely for worse) LLMs will finally kill this model.
I would love for the ad-supported model to die. I hate ads, and I hate having to serve ads. We get some subscription users but nowhere near enough to cover costs.
Unfortunately, what I think will happen - and indeed already is happening - is that the AI companies themselves will replace much of the WWW. Sites like the one I am talking about will cease to exist. AI companies, once they can no longer scrape (steal) the data, will end up licensing it themselves and replacing us as the distributor to end users, perhaps as a subscription add-on or with an ad-based model of their own.
Which to some may be fine. Personally, I don't want a few centralized AI companies replacing the hundreds of thousands of independent websites online. Way too much centralized power there.
I much prefer having my thoughts distilled down into easily digestible and agreeable idioms that I can push around with absolute faith that they weren't just lies written by some PERSON on the internet.
Do you not run Anubis or have strict fail2ban rules? I just straight up ban IPs forever if they look up files that will never exist on my servers. That plus Anubis with the strictest settings.
Fail2ban doesn't scale well to these volumes of traffic and request patterns.
It's the same reason fail2ban is not very useful against a DDoS attack where each unique IP makes only a few requests with a large (hour-plus) delay between them: there is no clear "fail" to match on, and the fail2ban database becomes huge and far too slow.
- 400,000 Unique IP addresses
- 1 to 3 requests per hour per IP address, with delays of over 60 minutes between each request
- Legit request URLs, legit UA & referrer
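To make the scaling problem concrete, here is a toy simulation of a fail2ban-style per-IP threshold (using assumed defaults; real fail2ban parses logs and shells out to the firewall). On this traffic pattern, no IP ever trips the threshold, yet the tracker still has to hold state for all 400,000 of them:

```python
from collections import defaultdict

MAXRETRY = 5    # a typical fail2ban default: ban after 5 hits...
FINDTIME = 600  # ...within a 10-minute window

def simulate(events):
    """events: iterable of (timestamp, ip) with timestamps increasing per IP.
    Returns (banned IPs, number of IPs being tracked)."""
    hits = defaultdict(list)
    banned = set()
    for ts, ip in events:
        # Keep only hits inside the window, then record this one.
        hits[ip] = [t for t in hits[ip] if ts - t <= FINDTIME] + [ts]
        if len(hits[ip]) >= MAXRETRY:
            banned.add(ip)
    return banned, len(hits)

# 400,000 IPs, 3 requests each, spaced 65 minutes apart (well outside FINDTIME).
events = ((ip + k * 65 * 60, ip) for ip in range(400_000) for k in range(3))
banned, tracked = simulate(events)
# No IP is ever banned, but 400,000 entries of state accumulate anyway.
```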
Maybe Anubis would help, but it's also a risk for various reasons.
The more sophisticated bots run real headless browsers that Anubis can't touch, and they only follow links that are actually visible on the page, so they wouldn't trip fail2ban either.
They even sell access to proxy servers that automatically evade Cloudflare captchas.
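For context on why headless browsers defeat this class of defense: Anubis-style challenges are essentially client-side proof-of-work, so any client that executes JavaScript simply pays the CPU cost and passes. A simplified sketch of the idea (the real protocol differs in its details):

```python
import hashlib
import itertools

def solve(challenge: str, difficulty: int) -> int:
    """Client side: find a nonce whose SHA-256 digest (with the challenge)
    starts with `difficulty` zero hex digits. Pure brute force."""
    for nonce in itertools.count():
        digest = hashlib.sha256(f"{challenge}{nonce}".encode()).hexdigest()
        if digest.startswith("0" * difficulty):
            return nonce

def verify(challenge: str, nonce: int, difficulty: int) -> bool:
    """Server side: one hash to check the submitted nonce."""
    digest = hashlib.sha256(f"{challenge}{nonce}".encode()).hexdigest()
    return digest.startswith("0" * difficulty)
```

The asymmetry (expensive to solve, cheap to verify) taxes high-volume scraping, but a bot farm that already burns CPU on real browsers just absorbs the cost.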
What I don't understand is why a bot/scraper needs to load every page and image multiple times within the same hour, or whatever session it's running against my site. If I have, say, 10 pages and 100 images, surely 110 requests should be all it needs to load everything.
At some point there needs to be a check for whether it's a real human... But it's a cat-and-mouse game: any barrier we create to keep bots out gets worked around by clever engineers.
Hard disagree: it's very easy for a bot to use a credit card. Card numbers are often stolen, they're handed to teenagers these days, and they can be owned by businesses or exist entirely virtually... so you can't assume the use of a credit card ties back to legitimate use by a single person.
Companies would offer all-you-can-DDoS plans at $20/bot per month if they could. Bots are only a problem to them because they prevent legitimate customers from handing over their credit card.