hhmx.de

Tom Bortels

· Föderation EN Do 23.01.2025 23:50:25

@devnull @clive @jasonkoebler @404mediaco

I felt obligated to disclaim my fantasy well-behaved AI scrapers just in case. The actual headcount there may well be zero.

Bornach

Föderation EN Fr 24.01.2025 08:44:49

@tbortels @devnull @clive @jasonkoebler @404mediaco
There is such a thing as a non-aggressive respectful AI scrapper. It's called asking for permission from the copyright owner and obtaining an appropriate license if their AI system can generate derivative works using your content.
youtu.be/PeKZvUcr0-M

Tom Bortels

Föderation EN Fr 24.01.2025 09:23:41

@bornach @devnull @clive @jasonkoebler @404mediaco

Alas - those scrapers are out of scope because they're not the ones causing problems and driving this conversation. Indeed - if someone licensed content legitimately, the need to scrape the web would be absent - there are far more efficient ways to say "here are all of the new posts in the last N hours".

You can safely assume any automation ignoring your robots.txt is a pest to be ruthlessly crushed in whatever manner amuses you most.

Clive Thompson

Föderation EN Fr 24.01.2025 17:43:35

@tbortels @bornach @devnull @jasonkoebler @404mediaco

yep -- licensing would obviate the hassles of scraping

"here's our API, enjoy"