Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. Hey #fediadmin.

Hey #fediadmin.

Scheduled Pinned Locked Moved Uncategorized
fediadmin
12 Posts 2 Posters 5 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • alex@feed.yopp.meA This user is from outside of this forum
    alex@feed.yopp.meA This user is from outside of this forum
    alex@feed.yopp.me
    wrote last edited by
    #1

    Hey #fediadmin.

    Is any one having huge influx of bots scraping /tags/* from Tencent (AS132203), VNPT (AS45899) and some others?

    They also scraping media assets, but rps is way lower

    alex@feed.yopp.meA galaxis@mastodon.infra.deG 2 Replies Last reply
    1
    0
    • R relay@relay.mycrowd.ca shared this topic
    • alex@feed.yopp.meA alex@feed.yopp.me

      Hey #fediadmin.

      Is any one having huge influx of bots scraping /tags/* from Tencent (AS132203), VNPT (AS45899) and some others?

      They also scraping media assets, but rps is way lower

      alex@feed.yopp.meA This user is from outside of this forum
      alex@feed.yopp.meA This user is from outside of this forum
      alex@feed.yopp.me
      wrote last edited by
      #2

      Okay, after adding AS to http logs I can __ 205__ distinct AS numbers in last 30 minutes requesting /tags/*

      What the actual fuck?! No, seriously, has anyone seen that? I would suspect fucking AI scrappers, but they only hammering tags endpoint.

      #fediadmin #mastoadmin #mastodon

      alex@feed.yopp.meA 1 Reply Last reply
      2
      0
      • System shared this topic
      • alex@feed.yopp.meA alex@feed.yopp.me

        Okay, after adding AS to http logs I can __ 205__ distinct AS numbers in last 30 minutes requesting /tags/*

        What the actual fuck?! No, seriously, has anyone seen that? I would suspect fucking AI scrappers, but they only hammering tags endpoint.

        #fediadmin #mastoadmin #mastodon

        alex@feed.yopp.meA This user is from outside of this forum
        alex@feed.yopp.meA This user is from outside of this forum
        alex@feed.yopp.me
        wrote last edited by
        #3

        Oh, great #crowdsec victoria logs integration doesn’t support working with metadata keys.

        Seems like it can only parse _msg field, why

        1 Reply Last reply
        2
        0
        • alex@feed.yopp.meA alex@feed.yopp.me

          Hey #fediadmin.

          Is any one having huge influx of bots scraping /tags/* from Tencent (AS132203), VNPT (AS45899) and some others?

          They also scraping media assets, but rps is way lower

          galaxis@mastodon.infra.deG This user is from outside of this forum
          galaxis@mastodon.infra.deG This user is from outside of this forum
          galaxis@mastodon.infra.de
          wrote last edited by
          #4

          @alex Yes, it's been going on since the end of last year at least.

          If I remember correctly, we disabled tags search for unauthenticated users, and then the next thing that was hit was trends. Unfortunately there's no separation between authenticated and unauthenticated clients in the controls for those...

          galaxis@mastodon.infra.deG 1 Reply Last reply
          0
          • galaxis@mastodon.infra.deG galaxis@mastodon.infra.de

            @alex Yes, it's been going on since the end of last year at least.

            If I remember correctly, we disabled tags search for unauthenticated users, and then the next thing that was hit was trends. Unfortunately there's no separation between authenticated and unauthenticated clients in the controls for those...

            galaxis@mastodon.infra.deG This user is from outside of this forum
            galaxis@mastodon.infra.deG This user is from outside of this forum
            galaxis@mastodon.infra.de
            wrote last edited by
            #5

            @alex See this post for example: https://mastodon.infra.de/@galaxis/115805367424016000

            galaxis@mastodon.infra.deG alex@feed.yopp.meA 2 Replies Last reply
            0
            • galaxis@mastodon.infra.deG galaxis@mastodon.infra.de

              @alex See this post for example: https://mastodon.infra.de/@galaxis/115805367424016000

              galaxis@mastodon.infra.deG This user is from outside of this forum
              galaxis@mastodon.infra.deG This user is from outside of this forum
              galaxis@mastodon.infra.de
              wrote last edited by
              #6

              @alex ...unfortunately it seems like most Mastodon admins don't talk about Mastodon administration much on the Fediverse. I'm aware of a couple of Matrix groups, but other than that I don't know where people are discussing operational details...

              1 Reply Last reply
              0
              • galaxis@mastodon.infra.deG galaxis@mastodon.infra.de

                @alex See this post for example: https://mastodon.infra.de/@galaxis/115805367424016000

                alex@feed.yopp.meA This user is from outside of this forum
                alex@feed.yopp.meA This user is from outside of this forum
                alex@feed.yopp.me
                wrote last edited by
                #7

                @galaxis oh, yeas, this is exactly like this!

                By any chance do you remember how to switch federated timeline off for non-authenticated users? I don’t see it in env variable and can’t find anything similar in admin UI.

                Thanks!

                galaxis@mastodon.infra.deG 1 Reply Last reply
                2
                0
                • alex@feed.yopp.meA alex@feed.yopp.me

                  @galaxis oh, yeas, this is exactly like this!

                  By any chance do you remember how to switch federated timeline off for non-authenticated users? I don’t see it in env variable and can’t find anything similar in admin UI.

                  Thanks!

                  galaxis@mastodon.infra.deG This user is from outside of this forum
                  galaxis@mastodon.infra.deG This user is from outside of this forum
                  galaxis@mastodon.infra.de
                  wrote last edited by
                  #8

                  @alex There's four dropdowns with options in Administration -> Server Settings -> Discovery under the "Public timelines" - header.

                  alex@feed.yopp.meA 1 Reply Last reply
                  3
                  0
                  • galaxis@mastodon.infra.deG galaxis@mastodon.infra.de

                    @alex There's four dropdowns with options in Administration -> Server Settings -> Discovery under the "Public timelines" - header.

                    alex@feed.yopp.meA This user is from outside of this forum
                    alex@feed.yopp.meA This user is from outside of this forum
                    alex@feed.yopp.me
                    wrote last edited by
                    #9

                    @galaxis found it! Thank you!

                    I wonder how long it’s going to take for bots to stop scraping images now.

                    galaxis@mastodon.infra.deG 1 Reply Last reply
                    2
                    0
                    • R relay@relay.infosec.exchange shared this topic
                    • alex@feed.yopp.meA alex@feed.yopp.me

                      @galaxis found it! Thank you!

                      I wonder how long it’s going to take for bots to stop scraping images now.

                      galaxis@mastodon.infra.deG This user is from outside of this forum
                      galaxis@mastodon.infra.deG This user is from outside of this forum
                      galaxis@mastodon.infra.de
                      wrote last edited by
                      #10

                      @alex Images are a different problem - it seemed to me (I have not done any deep analysis) that these scrapers act as full user agents, and retrieve posts with all media attachments.

                      Unfortunately as they progress into older posts or long threads, this causes Mastodon to re-fetch old media. We were plain running out of space, until I dropped in an additional patch that severely rate-limits the media proxy for unauthenticated users, see this other thread: https://mastodon.infra.de/@galaxis/116077343266969640

                      galaxis@mastodon.infra.deG 1 Reply Last reply
                      0
                      • galaxis@mastodon.infra.deG galaxis@mastodon.infra.de

                        @alex Images are a different problem - it seemed to me (I have not done any deep analysis) that these scrapers act as full user agents, and retrieve posts with all media attachments.

                        Unfortunately as they progress into older posts or long threads, this causes Mastodon to re-fetch old media. We were plain running out of space, until I dropped in an additional patch that severely rate-limits the media proxy for unauthenticated users, see this other thread: https://mastodon.infra.de/@galaxis/116077343266969640

                        galaxis@mastodon.infra.deG This user is from outside of this forum
                        galaxis@mastodon.infra.deG This user is from outside of this forum
                        galaxis@mastodon.infra.de
                        wrote last edited by
                        #11

                        @alex ...leaving Trends open unfortunately provides enough fodder for them. On the public instance I help running, users were complaining when we disabled trends though, so adding protection to the media proxy was the easiest way to stop that scraping. On my personal instance, I disabled Trends, so there's nothing left to scrape except my own public posts.

                        alex@feed.yopp.meA 1 Reply Last reply
                        0
                        • galaxis@mastodon.infra.deG galaxis@mastodon.infra.de

                          @alex ...leaving Trends open unfortunately provides enough fodder for them. On the public instance I help running, users were complaining when we disabled trends though, so adding protection to the media proxy was the easiest way to stop that scraping. On my personal instance, I disabled Trends, so there's nothing left to scrape except my own public posts.

                          alex@feed.yopp.meA This user is from outside of this forum
                          alex@feed.yopp.meA This user is from outside of this forum
                          alex@feed.yopp.me
                          wrote last edited by
                          #12

                          @galaxis uh, okay, trending page is gone as well.

                          Weird that tags page is still showing basic tag info regardless of the settings

                          So sad that the only way to stop ruining web is to gatekeep everything

                          1 Reply Last reply
                          1
                          0
                          Reply
                          • Reply as topic
                          Log in to reply
                          • Oldest to Newest
                          • Newest to Oldest
                          • Most Votes


                          • Login

                          • Login or register to search.
                          • First post
                            Last post
                          0
                          • Categories
                          • Recent
                          • Tags
                          • Popular
                          • World
                          • Users
                          • Groups