Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. Pleased to share a page and explainer for the AI tarpit project Science is Poetry, with legal statement, rationale(s), and a few deployment notes:

Pleased to share a page and explainer for the AI tarpit project Science is Poetry, with legal statement, rationale(s), and a few deployment notes:

Scheduled Pinned Locked Moved Uncategorized
bigtech
180 Posts 56 Posters 81 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • julianoliver@mastodon.socialJ julianoliver@mastodon.social

    @twilliability Yes I was thinking of just the same, dual projection.

    julianoliver@mastodon.socialJ This user is from outside of this forum
    julianoliver@mastodon.socialJ This user is from outside of this forum
    julianoliver@mastodon.social
    wrote last edited by
    #93

    @twilliability Relatedly, I'm working on a means to capture the shell log output to a streaming endpoint while allowing plenty of bandwidth for existing bot traffic. Not as easy at it may seem!

    julianoliver@mastodon.socialJ 1 Reply Last reply
    0
    • julianoliver@mastodon.socialJ julianoliver@mastodon.social

      @twilliability Relatedly, I'm working on a means to capture the shell log output to a streaming endpoint while allowing plenty of bandwidth for existing bot traffic. Not as easy at it may seem!

      julianoliver@mastodon.socialJ This user is from outside of this forum
      julianoliver@mastodon.socialJ This user is from outside of this forum
      julianoliver@mastodon.social
      wrote last edited by
      #94

      @twilliability P.S. I was not considering sonification, rather just a projection piece.

      IMO while plugging into Pure Data, Supercollider etc might seem interesting I honestly think that the rate a properly setup tarpit works you'd practically end up with gabba, or something akin to barely textured noise. If you were monitoring TCP traffic directly, sonifying on Layer 4 or even Layer 3 giving auditory identity to endpoint IPs, it would be pretty intense!

      twilliability@genart.socialT 1 Reply Last reply
      0
      • julianoliver@mastodon.socialJ julianoliver@mastodon.social

        Do you have an unused domain that you would be happy to donate to a counter-offensive against unchecked & unregulated AI crawlers that scrape human-made content to simulate & deceive for profit?

        If so, pls reply to this post. Your domain would become an entrypoint to the AI tarpit & Poison-as-a-Service project below, allowing concerned public to choose to use it on their sites, helping make the project more resilient to blacklisting.

        Link Preview Image
        Science is Poetry

        favicon

        (julianoliver.com)

        #ai #bigtech #tacticalmedia

        n_dimension@infosec.exchangeN This user is from outside of this forum
        n_dimension@infosec.exchangeN This user is from outside of this forum
        n_dimension@infosec.exchange
        wrote last edited by
        #95

        @JulianOliver

        Cute idea.
        Entirely useless.
        Feed a "Ai trap" page to Ai and see what happens...

        1 Reply Last reply
        0
        • julianoliver@mastodon.socialJ julianoliver@mastodon.social

          @twilliability P.S. I was not considering sonification, rather just a projection piece.

          IMO while plugging into Pure Data, Supercollider etc might seem interesting I honestly think that the rate a properly setup tarpit works you'd practically end up with gabba, or something akin to barely textured noise. If you were monitoring TCP traffic directly, sonifying on Layer 4 or even Layer 3 giving auditory identity to endpoint IPs, it would be pretty intense!

          twilliability@genart.socialT This user is from outside of this forum
          twilliability@genart.socialT This user is from outside of this forum
          twilliability@genart.social
          wrote last edited by
          #96

          @JulianOliver yes it's at first idea stage 🙂 it is not beyond me to take liberties for an experience that is memorable but maybe not 1:1 with reality.

          screen is easier, lots of pixels

          hamoid@genart.socialH julianoliver@mastodon.socialJ 2 Replies Last reply
          0
          • julianoliver@mastodon.socialJ julianoliver@mastodon.social

            Even faster now.

            Again, these pages are randomly generated, and each line is a page request from a crawler.

            To think of the energy expended at a global scale, the waste. All the money, water & minerals thrown at this. These AI companies are near DoS'ing the human web as they deep-sea trawl our content.

            Computationally, infrastructurally, & culturally, it's an obscenity,

            elithebearded@fed.qaz.redE This user is from outside of this forum
            elithebearded@fed.qaz.redE This user is from outside of this forum
            elithebearded@fed.qaz.red
            wrote last edited by
            #97

            @JulianOliver

            Are you still looking for domains?

            Somehow www.qaz.red is pointing at 95.216.76.85. Should I add an AAAA record, too?

            julianoliver@mastodon.socialJ 1 Reply Last reply
            0
            • twilliability@genart.socialT twilliability@genart.social

              @JulianOliver yes it's at first idea stage 🙂 it is not beyond me to take liberties for an experience that is memorable but maybe not 1:1 with reality.

              screen is easier, lots of pixels

              hamoid@genart.socialH This user is from outside of this forum
              hamoid@genart.socialH This user is from outside of this forum
              hamoid@genart.social
              wrote last edited by
              #98

              @twilliability @JulianOliver I would still like to hear it. Maybe with headphones, so the room is not unbearable. Also, there are many ways to sonify it. It could sound like cockroaches walking on paper, for instance 😁

              1 Reply Last reply
              0
              • twilliability@genart.socialT twilliability@genart.social

                @JulianOliver yes it's at first idea stage 🙂 it is not beyond me to take liberties for an experience that is memorable but maybe not 1:1 with reality.

                screen is easier, lots of pixels

                julianoliver@mastodon.socialJ This user is from outside of this forum
                julianoliver@mastodon.socialJ This user is from outside of this forum
                julianoliver@mastodon.social
                wrote last edited by
                #99

                @twilliability Hehe. Me too. You live coders are good at managing chaos, so perhaps you could find a way to tame it, or pick out certain outlier patterns from these vacuum cleaners. It's true looking at crawler operators with huge swarms that they do shift across IP ranges as they feed, so there's that to play with I guess. OpenAI and Amazon in particular.

                You'd get a lot more perceptible detail if you could slow them down but my exp is that if you try & rate limit too much they lose interest!

                1 Reply Last reply
                0
                • elithebearded@fed.qaz.redE elithebearded@fed.qaz.red

                  @JulianOliver

                  Are you still looking for domains?

                  Somehow www.qaz.red is pointing at 95.216.76.85. Should I add an AAAA record, too?

                  julianoliver@mastodon.socialJ This user is from outside of this forum
                  julianoliver@mastodon.socialJ This user is from outside of this forum
                  julianoliver@mastodon.social
                  wrote last edited by
                  #100

                  @elithebearded Oh hey thanks! I'll add it today. An AAAA would be great if you have a moment.

                  elithebearded@fed.qaz.redE 1 Reply Last reply
                  0
                  • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                    @elithebearded Oh hey thanks! I'll add it today. An AAAA would be great if you have a moment.

                    elithebearded@fed.qaz.redE This user is from outside of this forum
                    elithebearded@fed.qaz.redE This user is from outside of this forum
                    elithebearded@fed.qaz.red
                    wrote last edited by
                    #101

                    @JulianOliver

                    Done. Copied from tender.horse, if it matters

                    julianoliver@mastodon.socialJ 1 Reply Last reply
                    0
                    • elithebearded@fed.qaz.redE elithebearded@fed.qaz.red

                      @JulianOliver

                      Done. Copied from tender.horse, if it matters

                      julianoliver@mastodon.socialJ This user is from outside of this forum
                      julianoliver@mastodon.socialJ This user is from outside of this forum
                      julianoliver@mastodon.social
                      wrote last edited by
                      #102

                      @elithebearded You are live and listed here 🙂

                      Link Preview Image
                      SEANCE IS POTTERY

                      favicon

                      (scienceispoetry.net)

                      elithebearded@fed.qaz.redE 1 Reply Last reply
                      0
                      • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                        Do you have an unused domain that you would be happy to donate to a counter-offensive against unchecked & unregulated AI crawlers that scrape human-made content to simulate & deceive for profit?

                        If so, pls reply to this post. Your domain would become an entrypoint to the AI tarpit & Poison-as-a-Service project below, allowing concerned public to choose to use it on their sites, helping make the project more resilient to blacklisting.

                        Link Preview Image
                        Science is Poetry

                        favicon

                        (julianoliver.com)

                        #ai #bigtech #tacticalmedia

                        texjoachim@blabber.rocksT This user is from outside of this forum
                        texjoachim@blabber.rocksT This user is from outside of this forum
                        texjoachim@blabber.rocks
                        wrote last edited by
                        #103

                        @JulianOliver I think I might have one. Need to check, though.

                        1 Reply Last reply
                        0
                        • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                          @elithebearded You are live and listed here 🙂

                          Link Preview Image
                          SEANCE IS POTTERY

                          favicon

                          (scienceispoetry.net)

                          elithebearded@fed.qaz.redE This user is from outside of this forum
                          elithebearded@fed.qaz.redE This user is from outside of this forum
                          elithebearded@fed.qaz.red
                          wrote last edited by
                          #104

                          @JulianOliver

                          What a thing of beauty!

                          Link Preview Image
                          1 Reply Last reply
                          0
                          • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                            Even faster now.

                            Again, these pages are randomly generated, and each line is a page request from a crawler.

                            To think of the energy expended at a global scale, the waste. All the money, water & minerals thrown at this. These AI companies are near DoS'ing the human web as they deep-sea trawl our content.

                            Computationally, infrastructurally, & culturally, it's an obscenity,

                            julianoliver@mastodon.socialJ This user is from outside of this forum
                            julianoliver@mastodon.socialJ This user is from outside of this forum
                            julianoliver@mastodon.social
                            wrote last edited by
                            #105

                            - Mum, if you made a chain out of all the endpoint addresses of AI crawlers, how far would it reach?

                            - All the way to the moon, darling. All the way to the moon.

                            https://scienceispoetry.net/files/parasites.txt

                            themadhatter@mastodon.socialT julianoliver@mastodon.socialJ 2 Replies Last reply
                            0
                            • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                              - Mum, if you made a chain out of all the endpoint addresses of AI crawlers, how far would it reach?

                              - All the way to the moon, darling. All the way to the moon.

                              https://scienceispoetry.net/files/parasites.txt

                              themadhatter@mastodon.socialT This user is from outside of this forum
                              themadhatter@mastodon.socialT This user is from outside of this forum
                              themadhatter@mastodon.social
                              wrote last edited by
                              #106

                              @JulianOliver indeed

                              1 Reply Last reply
                              0
                              • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                                I've started to harvest a list of AI crawler endpoint addrs for your blacklisting pleasure.

                                I'll try to keep it updated. I've been fastidious with ensuring I'm only pulling those related to the known user agent, so as not to have any false positives

                                https://scienceispoetry.net/files/parasites.txt

                                It is at the same path for all contributed domains.

                                For instance:

                                https://carrot.mro1.de/files/parasites.txt

                                jasperbuma@mstdn.socialJ This user is from outside of this forum
                                jasperbuma@mstdn.socialJ This user is from outside of this forum
                                jasperbuma@mstdn.social
                                wrote last edited by
                                #107

                                @JulianOliver Thanks is for this!

                                I added the list to my Crowdsec firewall bouncer, that should block them. Right?

                                julianoliver@mastodon.socialJ 1 Reply Last reply
                                0
                                • jasperbuma@mstdn.socialJ jasperbuma@mstdn.social

                                  @JulianOliver Thanks is for this!

                                  I added the list to my Crowdsec firewall bouncer, that should block them. Right?

                                  julianoliver@mastodon.socialJ This user is from outside of this forum
                                  julianoliver@mastodon.socialJ This user is from outside of this forum
                                  julianoliver@mastodon.social
                                  wrote last edited by
                                  #108

                                  @jasperbuma It should indeed!

                                  1 Reply Last reply
                                  0
                                  • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                                    Do you have an unused domain that you would be happy to donate to a counter-offensive against unchecked & unregulated AI crawlers that scrape human-made content to simulate & deceive for profit?

                                    If so, pls reply to this post. Your domain would become an entrypoint to the AI tarpit & Poison-as-a-Service project below, allowing concerned public to choose to use it on their sites, helping make the project more resilient to blacklisting.

                                    Link Preview Image
                                    Science is Poetry

                                    favicon

                                    (julianoliver.com)

                                    #ai #bigtech #tacticalmedia

                                    aks@scalie.zoneA This user is from outside of this forum
                                    aks@scalie.zoneA This user is from outside of this forum
                                    aks@scalie.zone
                                    wrote last edited by
                                    #109

                                    @JulianOliver i could dedicate subdomains such as science.akselmo.dev to this. Just let me know how.

                                    julianoliver@mastodon.socialJ 1 Reply Last reply
                                    0
                                    • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                                      - Mum, if you made a chain out of all the endpoint addresses of AI crawlers, how far would it reach?

                                      - All the way to the moon, darling. All the way to the moon.

                                      https://scienceispoetry.net/files/parasites.txt

                                      julianoliver@mastodon.socialJ This user is from outside of this forum
                                      julianoliver@mastodon.socialJ This user is from outside of this forum
                                      julianoliver@mastodon.social
                                      wrote last edited by
                                      #110

                                      Here's a thing I did in a couple of mins to ban all IPs in the parasites.txt serverside. You could ofc REJECT rather than DROP to send a message.

                                      ---
                                      #!/bin/bash

                                      while read parasite;
                                      do
                                      if [[ "$parasite" == *"."* ]]; then
                                      iptables -I INPUT -s "$parasite" -j DROP
                                      elif [[ "$parasite" == *":"* ]]; then
                                      ip6tables -I INPUT -s "$parasite" -j DROP
                                      fi
                                      done < /path/to/parasites.txt
                                      ---

                                      julianoliver@mastodon.socialJ pertho@mastodon.bsd.cafeP 2 Replies Last reply
                                      0
                                      • dzwiedziu@mastodon.socialD This user is from outside of this forum
                                        dzwiedziu@mastodon.socialD This user is from outside of this forum
                                        dzwiedziu@mastodon.social
                                        wrote last edited by
                                        #111

                                        @tseitr
                                        I'm curious about this also.

                                        Edit: if all I need to do is add the A and AAAA records, then the answer could be “yes”.

                                        @JulianOliver

                                        julianoliver@mastodon.socialJ 1 Reply Last reply
                                        0
                                        • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                                          Here's a thing I did in a couple of mins to ban all IPs in the parasites.txt serverside. You could ofc REJECT rather than DROP to send a message.

                                          ---
                                          #!/bin/bash

                                          while read parasite;
                                          do
                                          if [[ "$parasite" == *"."* ]]; then
                                          iptables -I INPUT -s "$parasite" -j DROP
                                          elif [[ "$parasite" == *":"* ]]; then
                                          ip6tables -I INPUT -s "$parasite" -j DROP
                                          fi
                                          done < /path/to/parasites.txt
                                          ---

                                          julianoliver@mastodon.socialJ This user is from outside of this forum
                                          julianoliver@mastodon.socialJ This user is from outside of this forum
                                          julianoliver@mastodon.social
                                          wrote last edited by
                                          #112

                                          Actual hits dropping slightly, but more data is pulled from the tarpit day on day. This is reflected by a higher proportion of HTTP 200's - so less bad req's. Less reaching for what isn't there, just want the madness.

                                          Unclear why this has changed.

                                          julianoliver@mastodon.socialJ 1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups