Pleased to share a page and explainer for the AI tarpit project Science is Poetry, with legal statement, rationale(s), and a few deployment notes:

bigtech
180 Posts 56 Posters 81 Views
  • twilliability@genart.social

    @JulianOliver yes it's at first idea stage 🙂 it is not beyond me to take liberties for an experience that is memorable but maybe not 1:1 with reality.

    screen is easier, lots of pixels

    julianoliver@mastodon.social
    wrote last edited by
    #99

    @twilliability Hehe. Me too. You live coders are good at managing chaos, so perhaps you could find a way to tame it, or pick out certain outlier patterns from these vacuum cleaners. It's true looking at crawler operators with huge swarms that they do shift across IP ranges as they feed, so there's that to play with I guess. OpenAI and Amazon in particular.

    You'd get a lot more perceptible detail if you could slow them down but my exp is that if you try & rate limit too much they lose interest!

    • elithebearded@fed.qaz.red

      @JulianOliver

      Are you still looking for domains?

      Somehow www.qaz.red is pointing at 95.216.76.85. Should I add an AAAA record, too?

      julianoliver@mastodon.social
      wrote last edited by
      #100

      @elithebearded Oh hey thanks! I'll add it today. An AAAA would be great if you have a moment.

        elithebearded@fed.qaz.red
        wrote last edited by
        #101

        @JulianOliver

        Done. Copied from tender.horse, if it matters

          julianoliver@mastodon.social
          wrote last edited by
          #102

          @elithebearded You are live and listed here 🙂

          SEANCE IS POTTERY (scienceispoetry.net)

          • julianoliver@mastodon.social

            Do you have an unused domain that you would be happy to donate to a counter-offensive against unchecked & unregulated AI crawlers that scrape human-made content to simulate & deceive for profit?

            If so, pls reply to this post. Your domain would become an entrypoint to the AI tarpit & Poison-as-a-Service project below, allowing concerned public to choose to use it on their sites, helping make the project more resilient to blacklisting.

            Science is Poetry (julianoliver.com)

            #ai #bigtech #tacticalmedia

            texjoachim@blabber.rocks
            wrote last edited by
            #103

            @JulianOliver I think I might have one. Need to check, though.

              elithebearded@fed.qaz.red
              wrote last edited by
              #104

              @JulianOliver

              What a thing of beauty!

                • julianoliver@mastodon.social

                Even faster now.

                Again, these pages are randomly generated, and each line is a page request from a crawler.

                To think of the energy expended at a global scale, the waste. All the money, water & minerals thrown at this. These AI companies are near DoS'ing the human web as they deep-sea trawl our content.

                Computationally, infrastructurally, & culturally, it's an obscenity.

                julianoliver@mastodon.social
                wrote last edited by
                #105

                - Mum, if you made a chain out of all the endpoint addresses of AI crawlers, how far would it reach?

                - All the way to the moon, darling. All the way to the moon.

                https://scienceispoetry.net/files/parasites.txt

                  themadhatter@mastodon.social
                  wrote last edited by
                  #106

                  @JulianOliver indeed

                    • julianoliver@mastodon.social

                    I've started to harvest a list of AI crawler endpoint addrs for your blacklisting pleasure.

                    I'll try to keep it updated. I've been fastidious about ensuring I only pull those related to the known user agent, so as not to have any false positives.

                    https://scienceispoetry.net/files/parasites.txt

                    It is at the same path for all contributed domains.

                    For instance:

                    https://carrot.mro1.de/files/parasites.txt

                    jasperbuma@mstdn.social
                    wrote last edited by
                    #107

                    @JulianOliver Thanks for this!

                    I added the list to my CrowdSec firewall bouncer. That should block them, right?

                      julianoliver@mastodon.social
                      wrote last edited by
                      #108

                      @jasperbuma It should indeed!

                        aks@scalie.zone
                        wrote last edited by
                        #109

                        @JulianOliver i could dedicate subdomains such as science.akselmo.dev to this. Just let me know how.

                          julianoliver@mastodon.social
                          wrote last edited by
                          #110

                          Here's a thing I did in a couple of mins to ban all IPs in the parasites.txt serverside. You could ofc REJECT rather than DROP to send a message.

                          ---
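                          #!/bin/bash
                          # Ban every address listed in parasites.txt (run as root).

                          while IFS= read -r parasite; do
                              if [[ "$parasite" == *"."* ]]; then
                                  # Contains a dot: treat as IPv4
                                  iptables -I INPUT -s "$parasite" -j DROP
                              elif [[ "$parasite" == *":"* ]]; then
                                  # Contains a colon: treat as IPv6
                                  ip6tables -I INPUT -s "$parasite" -j DROP
                              fi
                          done < /path/to/parasites.txt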
                          ---
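
For anyone who would rather preview the rules before touching the firewall, a dry-run variant of the loop above might look like the following. It is only a sketch: the function name `gen_rules` is made up for illustration, and it assumes parasites.txt holds one address per line. Skipping blank lines and `#` comments is a precaution based on an assumption about the file format, not something stated in the thread. It prints the commands instead of running them, and takes DROP or REJECT as an optional argument.

```shell
#!/bin/bash
# Dry-run sketch: print the firewall commands rather than executing them.
# gen_rules is a hypothetical helper name, not part of the original script.
# Assumes one address (or CIDR range) per line on stdin.

gen_rules() {
    local target="${1:-DROP}"   # DROP by default, or pass REJECT
    local parasite
    while IFS= read -r parasite || [[ -n "$parasite" ]]; do
        parasite="${parasite%%#*}"             # strip comments (assumed format)
        parasite="${parasite//[[:space:]]/}"   # strip whitespace and stray CRs
        [[ -z "$parasite" ]] && continue
        # Colon is checked first so any IPv6 form goes to ip6tables.
        if [[ "$parasite" == *:* ]]; then
            printf 'ip6tables -I INPUT -s %s -j %s\n' "$parasite" "$target"
        elif [[ "$parasite" == *.* ]]; then
            printf 'iptables -I INPUT -s %s -j %s\n' "$parasite" "$target"
        fi
    done
}

# Usage (review the output first, then pipe it to `sh` as root to apply):
#   gen_rules REJECT < /path/to/parasites.txt
```

Printing first makes it easy to spot a malformed line before it reaches the firewall, and the same output can be saved and replayed after a reboot.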

                            dzwiedziu@mastodon.social
                            wrote last edited by
                            #111

                            @tseitr
                            I'm curious about this also.

                            Edit: if all I need to do is add the A and AAAA records, then the answer could be “yes”.

                            @JulianOliver

                              julianoliver@mastodon.social
                              wrote last edited by
                              #112

                              Actual hits are dropping slightly, but more data is pulled from the tarpit day on day. This is reflected in a higher proportion of HTTP 200s, so fewer bad requests. Less reaching for what isn't there; they just want the madness.

                              Unclear why this has changed.

                                julianoliver@mastodon.social
                                wrote last edited by
                                #113

                                @dzwiedziu @tseitr Thanks both! Yes as simple as picking any unused domain (canonical or sub) and setting these records to point to the server:

                                A: 95.216.76.85
                                AAAA: 2a01:4f9:2b:c83::2

                                Then, DM or toot me the domain. Once it's set, I'll let you know, and then it's time to share your tarpit domain liberally: a link in the footer of your site, on the landing page of a friendly wiki you want to protect, in a blog post, etc.

                                Ideally the link should be toward the front of the content.
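
Before sending the domain over, the two records can be checked from a shell. A minimal sketch, assuming `dig` is installed: `check_tarpit_dns` is a hypothetical helper name, the addresses are the ones given above, and it assumes `dig` prints the AAAA answer in the same compressed form.

```shell
#!/bin/bash
# Sketch: confirm a donated domain resolves to the tarpit server.
# check_tarpit_dns is a hypothetical helper; it requires dig to be installed.

TARPIT_A="95.216.76.85"
TARPIT_AAAA="2a01:4f9:2b:c83::2"

check_tarpit_dns() {
    local domain="$1" status=0
    # dig +short prints one answer per line; grep -Fx matches a whole line exactly.
    if dig +short "$domain" A | grep -Fxq "$TARPIT_A"; then
        echo "A    ok: $domain -> $TARPIT_A"
    else
        echo "A    missing or different for $domain"
        status=1
    fi
    if dig +short "$domain" AAAA | grep -Fxq "$TARPIT_AAAA"; then
        echo "AAAA ok: $domain -> $TARPIT_AAAA"
    else
        echo "AAAA missing or different for $domain"
        status=1
    fi
    return "$status"
}

# Usage: check_tarpit_dns science.example.org
```

New records can take a while to show up depending on TTLs and resolver caches, so a failed check right after editing the zone is not necessarily a misconfiguration.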

                                  dzwiedziu@mastodon.social
                                  wrote last edited by
                                  #114

                                  @JulianOliver
                                  Then I'll set up mine either very quickly or in a matter of weeks (WIP moving between countries).

                                  @tseitr

                                    julianoliver@mastodon.social
                                    wrote last edited by
                                    #115

                                    @dzwiedziu @tseitr Been there a few times - no rush!

                                      julianoliver@mastodon.social
                                      wrote last edited by
                                      #116

                                      @retech That's the word for it. Computationally, environmentally, culturally, infrastructurally - an obscenity.

                                        pertho@mastodon.bsd.cafe
                                        wrote last edited by
                                        #117

                                        @JulianOliver
                                        block return on egress from <parasites>

                                        (in pf)

                                        That's what I'm using and:

                                        @32 block drop in log quick on egress from <parasites:2323> to any
                                        [ Evaluations: 125476 Packets: 351 Bytes: 20702 States: 0 ]
                                        [ Inserted: uid 0 pid 75290 State Creations: 0 ]

                                        Not seen much traffic from them on my machine.
                                        👍

                                          pertho@mastodon.bsd.cafe
                                          wrote last edited by
                                          #118

                                          @neoluddite @JulianOliver At work we noticed that when we changed from HTML-generated search links (nofollow was ignored) to JavaScript-generated links, a lot of bots stopped coming back, but some (mainly from residential proxies) appear to have cached the URLs and came back for more.
