Google Search rests on a social contract: their bots can crawl our sites, they can index our sites, and they can show excerpts of our sites because

raulmatias@mstdn.social

@algernon @inthehands @joe @ShadSterling Are there any pre-defined templates available except the standard one?

haihappen@social.anoxinon.de

@jedbrown @inthehands I can only go by German/EU law, and here it is not transformative (because duh!). The reproduction is the key thing here: if you reproduce another's work outside of private use, you are violating Urheberrecht (creator's rights): privileges enshrined in law for the creator of a work (some of which can be licensed out). One of these is distributing reproductions.
E.g. any time you upload an image to SM, their ToS say you grant them a license to reproduce it (among others).

algernon@come-from.mad-scientist.club

@raulmatias @inthehands @joe @ShadSterling There's the built-in one, and another - slightly more complex - in Nam-Shub of Enki.

There will be more templates coming in the next few months (and new scripts!).

But a lot of things are doable today, if someone takes the time and creates a suitable template.

blogdiva@mastodon.social

instead of no-index ―because this would affect all search engines, not just Google― isn’t there a way to target Google specifically in robots.txt?

there should be a list of all the major techbros' crawlers ―Google, Microslop, Facebook, Amazon, X, etc.

@inthehands

inthehands@hachyderm.io

@blogdiva
I believe that my various name=“___” values specifically target Google.

Based on what I’ve read, blocking them in robots.txt will only stop them from •updating• their scrape, whereas noindex means “do not use.” (I have long blocked their LLM-specific bots in robots.txt.)
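
A rough sketch of the robots.txt side of this, in Python, for anyone who wants to check that rules can single out one crawler by name. `Googlebot` and `Google-Extended` are user-agent tokens Google documents; the `example.com` URL and the `allowed` helper are made up for illustration. It only tests how the rules match; it says nothing about what a crawler does with pages it has already scraped.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block Google's search crawler and its
# AI-training token, and say nothing about anyone else.
ROBOTS_TXT = """\
User-agent: Googlebot
Disallow: /

User-agent: Google-Extended
Disallow: /
"""


def allowed(agent: str, url: str = "https://example.com/post") -> bool:
    """Return True if `agent` may fetch `url` under ROBOTS_TXT."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    for agent in ("Googlebot", "Google-Extended", "Bingbot"):
        print(agent, allowed(agent))
```

The noindex side lives in the page itself rather than in robots.txt. Google documents `<meta name="googlebot" content="noindex">` as the Google-only form; the name values in the post above are not spelled out, so treat that as an assumption.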

blogdiva@mastodon.social

@inthehands TIL thanks

inthehands@hachyderm.io

@blogdiva

Keep it in pencil. I’m still learning myself, and not sure I understand everything correctly here.
