Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. Brew (#forgejo), as usual, is being overloaded by the scrapers.

Brew (#forgejo), as usual, is being overloaded by the scrapers.

Scheduled Pinned Locked Moved Uncategorized
forgejobsdcafebsdcafeservices
13 Posts 5 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • stefano@mastodon.bsd.cafeS This user is from outside of this forum
    stefano@mastodon.bsd.cafeS This user is from outside of this forum
    stefano@mastodon.bsd.cafe
    wrote last edited by stefano@mastodon.bsd.cafe
    #1

    EDIT: done, let me know if you experience problems

    Brew (#forgejo), as usual, is being overloaded by the scrapers.

    I think I'll have to put an Anubis in front of it. I don't love those "blocks", but sometimes you need to.

    #BSDCafe #BSDCafeServices

    oxy@social.bsdlab.auO pertho@mastodon.bsd.cafeP tubsta@social.bsdlab.auT 3 Replies Last reply
    0
    • stefano@mastodon.bsd.cafeS stefano@mastodon.bsd.cafe

      EDIT: done, let me know if you experience problems

      Brew (#forgejo), as usual, is being overloaded by the scrapers.

      I think I'll have to put an Anubis in front of it. I don't love those "blocks", but sometimes you need to.

      #BSDCafe #BSDCafeServices

      oxy@social.bsdlab.auO This user is from outside of this forum
      oxy@social.bsdlab.auO This user is from outside of this forum
      oxy@social.bsdlab.au
      wrote last edited by
      #2
      @stefano I'm surprised more options havent appeared
      1 Reply Last reply
      0
      • stefano@mastodon.bsd.cafeS stefano@mastodon.bsd.cafe

        EDIT: done, let me know if you experience problems

        Brew (#forgejo), as usual, is being overloaded by the scrapers.

        I think I'll have to put an Anubis in front of it. I don't love those "blocks", but sometimes you need to.

        #BSDCafe #BSDCafeServices

        pertho@mastodon.bsd.cafeP This user is from outside of this forum
        pertho@mastodon.bsd.cafeP This user is from outside of this forum
        pertho@mastodon.bsd.cafe
        wrote last edited by
        #3

        @stefano Have you tried blocking the IP ranges in Julian's list?

        Julian Oliver (@JulianOliver@mastodon.social)

        I've done the log analysis and the two biggest contributors that brought the AI crawler hits up to 2 million in a day, a 4x increase on a week prior, are ByteSpider (Singapore networks) and especially AppleBot (used for Siri and other Apple products). The parasites.txt is now >4500 lines long: https://scienceispoetry.net/files/parasites.txt

        favicon

        Mastodon (mastodon.social)

        Or is it all coming from residential proxies? 🤔

        stefano@mastodon.bsd.cafeS oxy@social.bsdlab.auO 2 Replies Last reply
        0
        • pertho@mastodon.bsd.cafeP pertho@mastodon.bsd.cafe

          @stefano Have you tried blocking the IP ranges in Julian's list?

          Julian Oliver (@JulianOliver@mastodon.social)

          I've done the log analysis and the two biggest contributors that brought the AI crawler hits up to 2 million in a day, a 4x increase on a week prior, are ByteSpider (Singapore networks) and especially AppleBot (used for Siri and other Apple products). The parasites.txt is now >4500 lines long: https://scienceispoetry.net/files/parasites.txt

          favicon

          Mastodon (mastodon.social)

          Or is it all coming from residential proxies? 🤔

          stefano@mastodon.bsd.cafeS This user is from outside of this forum
          stefano@mastodon.bsd.cafeS This user is from outside of this forum
          stefano@mastodon.bsd.cafe
          wrote last edited by
          #4

          @pertho residential proxies. I blocked everything I could block, but it wasn't enough.

          1 Reply Last reply
          0
          • pertho@mastodon.bsd.cafeP pertho@mastodon.bsd.cafe

            @stefano Have you tried blocking the IP ranges in Julian's list?

            Julian Oliver (@JulianOliver@mastodon.social)

            I've done the log analysis and the two biggest contributors that brought the AI crawler hits up to 2 million in a day, a 4x increase on a week prior, are ByteSpider (Singapore networks) and especially AppleBot (used for Siri and other Apple products). The parasites.txt is now >4500 lines long: https://scienceispoetry.net/files/parasites.txt

            favicon

            Mastodon (mastodon.social)

            Or is it all coming from residential proxies? 🤔

            oxy@social.bsdlab.auO This user is from outside of this forum
            oxy@social.bsdlab.auO This user is from outside of this forum
            oxy@social.bsdlab.au
            wrote last edited by
            #5
            @pertho @stefano oooh interesting list.

            I've been tinkering with ssh/httpd logs/awk and enriching the data with https://iplocate.io/ and maybe eventually greynoise and spamhaus (to get more residential proxies etc)
            1 Reply Last reply
            0
            • stefano@mastodon.bsd.cafeS stefano@mastodon.bsd.cafe

              EDIT: done, let me know if you experience problems

              Brew (#forgejo), as usual, is being overloaded by the scrapers.

              I think I'll have to put an Anubis in front of it. I don't love those "blocks", but sometimes you need to.

              #BSDCafe #BSDCafeServices

              tubsta@social.bsdlab.auT This user is from outside of this forum
              tubsta@social.bsdlab.auT This user is from outside of this forum
              tubsta@social.bsdlab.au
              wrote last edited by
              #6
              @stefano Stick Bunny in with origin shield to drop the nuffs scraping your site
              stefano@mastodon.bsd.cafeS 1 Reply Last reply
              0
              • tubsta@social.bsdlab.auT tubsta@social.bsdlab.au
                @stefano Stick Bunny in with origin shield to drop the nuffs scraping your site
                stefano@mastodon.bsd.cafeS This user is from outside of this forum
                stefano@mastodon.bsd.cafeS This user is from outside of this forum
                stefano@mastodon.bsd.cafe
                wrote last edited by
                #7

                @tubsta this would work, but I'm trying to avoid using (external) CDNs, at the moment.

                tubsta@social.bsdlab.auT 1 Reply Last reply
                0
                • stefano@mastodon.bsd.cafeS stefano@mastodon.bsd.cafe

                  @tubsta this would work, but I'm trying to avoid using (external) CDNs, at the moment.

                  tubsta@social.bsdlab.auT This user is from outside of this forum
                  tubsta@social.bsdlab.auT This user is from outside of this forum
                  tubsta@social.bsdlab.au
                  wrote last edited by
                  #8
                  @stefano I agree with what you are trying to do as I would rather avoid CDNs but some services need it, just gotta work out the least shit ones and the ones that are Europe focused to assist here.
                  tubsta@social.bsdlab.auT stefano@mastodon.bsd.cafeS 2 Replies Last reply
                  0
                  • tubsta@social.bsdlab.auT tubsta@social.bsdlab.au
                    @stefano I agree with what you are trying to do as I would rather avoid CDNs but some services need it, just gotta work out the least shit ones and the ones that are Europe focused to assist here.
                    tubsta@social.bsdlab.auT This user is from outside of this forum
                    tubsta@social.bsdlab.auT This user is from outside of this forum
                    tubsta@social.bsdlab.au
                    wrote last edited by
                    #9
                    @stefano FWIW I spend about $5 a month with Bunny’s CDN products for bsdlab
                    mwl@io.mwl.ioM 1 Reply Last reply
                    0
                    • tubsta@social.bsdlab.auT tubsta@social.bsdlab.au
                      @stefano FWIW I spend about $5 a month with Bunny’s CDN products for bsdlab
                      mwl@io.mwl.ioM This user is from outside of this forum
                      mwl@io.mwl.ioM This user is from outside of this forum
                      mwl@io.mwl.io
                      wrote last edited by
                      #10

                      @tubsta @stefano

                      If you must CDN, Bunny is 100% the way to go. Contains much less suck.

                      I run cdn.mwl.io specifically to distribute files. It's a CDN composed of one host.

                      1 Reply Last reply
                      0
                      • tubsta@social.bsdlab.auT tubsta@social.bsdlab.au
                        @stefano I agree with what you are trying to do as I would rather avoid CDNs but some services need it, just gotta work out the least shit ones and the ones that are Europe focused to assist here.
                        stefano@mastodon.bsd.cafeS This user is from outside of this forum
                        stefano@mastodon.bsd.cafeS This user is from outside of this forum
                        stefano@mastodon.bsd.cafe
                        wrote last edited by
                        #11

                        @tubsta sure. Bunny is great. I have an account and use it for some services. For some time, some of the BSD Cafe contents were served by them, and it was perfect.

                        tubsta@social.bsdlab.auT 1 Reply Last reply
                        0
                        • stefano@mastodon.bsd.cafeS stefano@mastodon.bsd.cafe

                          @tubsta sure. Bunny is great. I have an account and use it for some services. For some time, some of the BSD Cafe contents were served by them, and it was perfect.

                          tubsta@social.bsdlab.auT This user is from outside of this forum
                          tubsta@social.bsdlab.auT This user is from outside of this forum
                          tubsta@social.bsdlab.au
                          wrote last edited by
                          #12
                          @stefano They now have S3 object access for their storage nodes which has been a long time coming
                          stefano@mastodon.bsd.cafeS 1 Reply Last reply
                          0
                          • tubsta@social.bsdlab.auT tubsta@social.bsdlab.au
                            @stefano They now have S3 object access for their storage nodes which has been a long time coming
                            stefano@mastodon.bsd.cafeS This user is from outside of this forum
                            stefano@mastodon.bsd.cafeS This user is from outside of this forum
                            stefano@mastodon.bsd.cafe
                            wrote last edited by
                            #13

                            @tubsta oh nice! I was curious to see it implemented. I'll have a look.

                            1 Reply Last reply
                            0
                            Reply
                            • Reply as topic
                            Log in to reply
                            • Oldest to Newest
                            • Newest to Oldest
                            • Most Votes


                            • Login

                            • Login or register to search.
                            • First post
                              Last post
                            0
                            • Categories
                            • Recent
                            • Tags
                            • Popular
                            • World
                            • Users
                            • Groups