The bright #LLM future, next part.
-
@mgorny iocaine 3 works against this ok
watch out for false positives
@davidgerard would you have pointers to "how to guides" for less savvy people? I have a shared hosting account on a web hosting service, I feel like I need to protect myself from these bots and I'm totally lost.
-
@villares no, but I went to https://iocaine.madhouse-project.org/ and faffed about a bit. I used iocaine 3 out of the box. I use nginx, so I had to figure out the correct config. I added exceptions for some specific user-agents I wanted to let through.
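For anyone in a similar spot, a minimal sketch of what such an nginx setup might look like. The iocaine listen address, the backend port, and the allow-listed user-agents here are illustrative assumptions, not taken from the post — check your own iocaine config for the real values:

```nginx
# Map user-agents we want to let through (illustrative examples).
map $http_user_agent $bypass_iocaine {
    default             0;
    "~*feedvalidator"   1;   # e.g. keep feed readers working
    "~*archivebot"      1;   # e.g. an archiver you trust
}

server {
    listen 80;
    server_name example.org;          # placeholder hostname

    location / {
        # Assumed iocaine listen address; adjust to your setup.
        set $upstream http://127.0.0.1:42069;
        if ($bypass_iocaine) {
            # Allow-listed agents go straight to the real site.
            set $upstream http://127.0.0.1:8080;
        }
        proxy_pass $upstream;
        proxy_set_header Host $host;
        proxy_set_header X-Real-IP $remote_addr;
    }
}
```

Using a variable in `proxy_pass` with literal IP:port backends avoids needing a `resolver` directive, and keeps the allow-list logic in one `map`.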
-
@davidgerard thank you!
-
@mirabilos @mgorny Wat? It’s stopping LLMs.
-
@mirabilos It does not? Sure deleted a lot of files when I tried it in a container...
Please to "edumacate" me? Or do you refer to the redirection? @mgorny
-
How can we distinguish a legitimate user who hit some URL from a scraper that distributes its operations over thousands of IP addresses?
Three ifs in a trenchcoat will get rid of the majority of those, without any additional software. The crawlers may appear complicated to defeat if you look at the user-agent only, but as soon as you look at some other headers, it turns out they're really, really, really dumb.
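The write-up linked at the end of this post has the details; as a rough sketch (the exact conditions below are my own illustrative guesses, not the author's actual rules), "three ifs in a trenchcoat" in nginx terms looks something like this:

```nginx
# Three illustrative header checks -- NOT the exact rules from the
# linked post. The idea: crawlers claim to be browsers, but get the
# details wrong in ways no real modern browser does.

# 1. Claims an ancient Chrome release (80-109 in this example).
if ($http_user_agent ~ "Chrome/(8[0-9]|9[0-9]|10[0-9])\.") {
    return 403;
}
# 2. Claims to be a browser but omits Sec-Fetch-Mode, which every
#    current mainstream browser sends.
if ($http_sec_fetch_mode = "") {
    return 403;
}
# 3. Omits Accept-Language, which headless scrapers often skip.
if ($http_accept_language = "") {
    return 403;
}
```

Note these naive versions would also block curl, feed readers, and anything else that isn't a browser — in practice you'd scope the checks to user-agents that claim to be one, and add exceptions for clients you care about.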
If you want to do more than that, and do it slightly more efficiently than a reverse proxy can, iocaine can help.
Unlike with Anubis, crawlers cannot get past it by throwing more compute at it, and legit visitors will (usually) remain unaware of its existence. It's in front of my own forge, happily serving ~800 req/sec (where the bottleneck is Caddy & TLS) on a €5/month potato-quality VPS. It can also firewall IPs off, to further reduce load.
It does catch some "legit" crawlers like Googlebot and Bingbot, but you can allow-list those, or keep them blocked because both of those feed into LLM training too.
@algernon @mgorny cc @alderwick re the current bot attacks on our forge
"https://chronicles.mad-scientist.club/tales/surviving-the-crawlers/#three-ifs-in-a-trenchcoat"
-
The bright #LLM future, next part.
git.gentoo.org is now effectively dead, being DDoS-ed by almost a million different IPs every day. Most of them are just performing a single request at a totally random URL. How are people supposed to deal with that? How can we distinguish a legitimate user who hit some URL from a scraper that distributes its operations over thousands of IP addresses?
If you use LLM crap, you're part of the problem. You support these bastards. You should be ashamed of yourself.
@mgorny it does not help to point at people using LLMs for legitimate reasons. It's other people using those same tools, but for nefarious purposes.
I use user-agent filtering and put Anubis in front of the Slackware git infrastructure, and that has helped immensely.
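The user-agent filtering part is easy to express on its own; a minimal nginx sketch (the blocked strings below are common real crawler agents, used as illustrative examples — not the actual Slackware list):

```nginx
# Illustrative deny-list of scraper user-agents.
map $http_user_agent $blocked_ua {
    default          0;
    "~*GPTBot"       1;
    "~*CCBot"        1;
    "~*Bytespider"   1;
}

server {
    listen 80;
    server_name git.example.org;   # placeholder hostname

    if ($blocked_ua) {
        return 403;
    }
    # ... proxy to cgit, with Anubis in front, goes here ...
}
```

This only stops crawlers that identify themselves honestly, which is why it's paired with something like Anubis for the ones that lie.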
I eventually got git.gentoo.org to render and gosh! That's a lot of repositories there. Would it be an idea to distribute the cgit interface over multiple front-end servers? Like, moving all user repos to a different server?
-
@mirabilos I think I get it. It's a bash-ism to redirect stdout and stderr and in (da)sh that doesn't work? @mgorny
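Right — `&>` is a bash extension. In a POSIX shell like dash, `cmd &> file` parses as `cmd &` (run in the background) followed by `> file` (truncate the file), which is a very different result. A small demonstration of the portable spelling that works in bash, dash, and any POSIX sh:

```shell
# bash-only: 'cmd &> file' sends stdout AND stderr to file.
# In dash/POSIX sh the same line means: run 'cmd' in the background,
# then truncate 'file' -- silently destructive if the file mattered.

# Portable equivalent: redirect stdout, then point stderr at it.
{ echo "to stdout"; echo "to stderr" >&2; } > both.log 2>&1
```

The order matters: `> both.log 2>&1` works, while `2>&1 > both.log` duplicates stderr to the terminal *before* stdout is redirected.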
-