Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. Pleased to share a page and explainer for the AI tarpit project Science is Poetry, with legal statement, rationale(s), and a few deployment notes:

Pleased to share a page and explainer for the AI tarpit project Science is Poetry, with legal statement, rationale(s), and a few deployment notes:

Scheduled Pinned Locked Moved Uncategorized
bigtech
180 Posts 56 Posters 81 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • julianoliver@mastodon.socialJ julianoliver@mastodon.social

    @twilliability Relatedly, I'm working on a means to capture the shell log output to a streaming endpoint while allowing plenty of bandwidth for existing bot traffic. Not as easy at it may seem!

    julianoliver@mastodon.socialJ This user is from outside of this forum
    julianoliver@mastodon.socialJ This user is from outside of this forum
    julianoliver@mastodon.social
    wrote last edited by
    #94

    @twilliability P.S. I was not considering sonification, rather just a projection piece.

    IMO while plugging into Pure Data, Supercollider etc might seem interesting I honestly think that the rate a properly setup tarpit works you'd practically end up with gabba, or something akin to barely textured noise. If you were monitoring TCP traffic directly, sonifying on Layer 4 or even Layer 3 giving auditory identity to endpoint IPs, it would be pretty intense!

    twilliability@genart.socialT 1 Reply Last reply
    0
    • julianoliver@mastodon.socialJ julianoliver@mastodon.social

      Do you have an unused domain that you would be happy to donate to a counter-offensive against unchecked & unregulated AI crawlers that scrape human-made content to simulate & deceive for profit?

      If so, pls reply to this post. Your domain would become an entrypoint to the AI tarpit & Poison-as-a-Service project below, allowing concerned public to choose to use it on their sites, helping make the project more resilient to blacklisting.

      Link Preview Image
      Science is Poetry

      favicon

      (julianoliver.com)

      #ai #bigtech #tacticalmedia

      n_dimension@infosec.exchangeN This user is from outside of this forum
      n_dimension@infosec.exchangeN This user is from outside of this forum
      n_dimension@infosec.exchange
      wrote last edited by
      #95

      @JulianOliver

      Cute idea.
      Entirely useless.
      Feed a "Ai trap" page to Ai and see what happens...

      1 Reply Last reply
      0
      • julianoliver@mastodon.socialJ julianoliver@mastodon.social

        @twilliability P.S. I was not considering sonification, rather just a projection piece.

        IMO while plugging into Pure Data, Supercollider etc might seem interesting I honestly think that the rate a properly setup tarpit works you'd practically end up with gabba, or something akin to barely textured noise. If you were monitoring TCP traffic directly, sonifying on Layer 4 or even Layer 3 giving auditory identity to endpoint IPs, it would be pretty intense!

        twilliability@genart.socialT This user is from outside of this forum
        twilliability@genart.socialT This user is from outside of this forum
        twilliability@genart.social
        wrote last edited by
        #96

        @JulianOliver yes it's at first idea stage 🙂 it is not beyond me to take liberties for an experience that is memorable but maybe not 1:1 with reality.

        screen is easier, lots of pixels

        hamoid@genart.socialH julianoliver@mastodon.socialJ 2 Replies Last reply
        0
        • julianoliver@mastodon.socialJ julianoliver@mastodon.social

          Even faster now.

          Again, these pages are randomly generated, and each line is a page request from a crawler.

          To think of the energy expended at a global scale, the waste. All the money, water & minerals thrown at this. These AI companies are near DoS'ing the human web as they deep-sea trawl our content.

          Computationally, infrastructurally, & culturally, it's an obscenity,

          elithebearded@fed.qaz.redE This user is from outside of this forum
          elithebearded@fed.qaz.redE This user is from outside of this forum
          elithebearded@fed.qaz.red
          wrote last edited by
          #97

          @JulianOliver

          Are you still looking for domains?

          Somehow www.qaz.red is pointing at 95.216.76.85. Should I add an AAAA record, too?

          julianoliver@mastodon.socialJ 1 Reply Last reply
          0
          • twilliability@genart.socialT twilliability@genart.social

            @JulianOliver yes it's at first idea stage 🙂 it is not beyond me to take liberties for an experience that is memorable but maybe not 1:1 with reality.

            screen is easier, lots of pixels

            hamoid@genart.socialH This user is from outside of this forum
            hamoid@genart.socialH This user is from outside of this forum
            hamoid@genart.social
            wrote last edited by
            #98

            @twilliability @JulianOliver I would still like to hear it. Maybe with headphones, so the room is not unbearable. Also, there are many ways to sonify it. It could sound like cockroaches walking on paper, for instance 😁

            1 Reply Last reply
            0
            • twilliability@genart.socialT twilliability@genart.social

              @JulianOliver yes it's at first idea stage 🙂 it is not beyond me to take liberties for an experience that is memorable but maybe not 1:1 with reality.

              screen is easier, lots of pixels

              julianoliver@mastodon.socialJ This user is from outside of this forum
              julianoliver@mastodon.socialJ This user is from outside of this forum
              julianoliver@mastodon.social
              wrote last edited by
              #99

              @twilliability Hehe. Me too. You live coders are good at managing chaos, so perhaps you could find a way to tame it, or pick out certain outlier patterns from these vacuum cleaners. It's true looking at crawler operators with huge swarms that they do shift across IP ranges as they feed, so there's that to play with I guess. OpenAI and Amazon in particular.

              You'd get a lot more perceptible detail if you could slow them down but my exp is that if you try & rate limit too much they lose interest!

              1 Reply Last reply
              0
              • elithebearded@fed.qaz.redE elithebearded@fed.qaz.red

                @JulianOliver

                Are you still looking for domains?

                Somehow www.qaz.red is pointing at 95.216.76.85. Should I add an AAAA record, too?

                julianoliver@mastodon.socialJ This user is from outside of this forum
                julianoliver@mastodon.socialJ This user is from outside of this forum
                julianoliver@mastodon.social
                wrote last edited by
                #100

                @elithebearded Oh hey thanks! I'll add it today. An AAAA would be great if you have a moment.

                elithebearded@fed.qaz.redE 1 Reply Last reply
                0
                • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                  @elithebearded Oh hey thanks! I'll add it today. An AAAA would be great if you have a moment.

                  elithebearded@fed.qaz.redE This user is from outside of this forum
                  elithebearded@fed.qaz.redE This user is from outside of this forum
                  elithebearded@fed.qaz.red
                  wrote last edited by
                  #101

                  @JulianOliver

                  Done. Copied from tender.horse, if it matters

                  julianoliver@mastodon.socialJ 1 Reply Last reply
                  0
                  • elithebearded@fed.qaz.redE elithebearded@fed.qaz.red

                    @JulianOliver

                    Done. Copied from tender.horse, if it matters

                    julianoliver@mastodon.socialJ This user is from outside of this forum
                    julianoliver@mastodon.socialJ This user is from outside of this forum
                    julianoliver@mastodon.social
                    wrote last edited by
                    #102

                    @elithebearded You are live and listed here 🙂

                    Link Preview Image
                    SEANCE IS POTTERY

                    favicon

                    (scienceispoetry.net)

                    elithebearded@fed.qaz.redE 1 Reply Last reply
                    0
                    • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                      Do you have an unused domain that you would be happy to donate to a counter-offensive against unchecked & unregulated AI crawlers that scrape human-made content to simulate & deceive for profit?

                      If so, pls reply to this post. Your domain would become an entrypoint to the AI tarpit & Poison-as-a-Service project below, allowing concerned public to choose to use it on their sites, helping make the project more resilient to blacklisting.

                      Link Preview Image
                      Science is Poetry

                      favicon

                      (julianoliver.com)

                      #ai #bigtech #tacticalmedia

                      texjoachim@blabber.rocksT This user is from outside of this forum
                      texjoachim@blabber.rocksT This user is from outside of this forum
                      texjoachim@blabber.rocks
                      wrote last edited by
                      #103

                      @JulianOliver I think I might have one. Need to check, though.

                      1 Reply Last reply
                      0
                      • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                        @elithebearded You are live and listed here 🙂

                        Link Preview Image
                        SEANCE IS POTTERY

                        favicon

                        (scienceispoetry.net)

                        elithebearded@fed.qaz.redE This user is from outside of this forum
                        elithebearded@fed.qaz.redE This user is from outside of this forum
                        elithebearded@fed.qaz.red
                        wrote last edited by
                        #104

                        @JulianOliver

                        What a thing of beauty!

                        Link Preview Image
                        1 Reply Last reply
                        0
                        • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                          Even faster now.

                          Again, these pages are randomly generated, and each line is a page request from a crawler.

                          To think of the energy expended at a global scale, the waste. All the money, water & minerals thrown at this. These AI companies are near DoS'ing the human web as they deep-sea trawl our content.

                          Computationally, infrastructurally, & culturally, it's an obscenity,

                          julianoliver@mastodon.socialJ This user is from outside of this forum
                          julianoliver@mastodon.socialJ This user is from outside of this forum
                          julianoliver@mastodon.social
                          wrote last edited by
                          #105

                          - Mum, if you made a chain out of all the endpoint addresses of AI crawlers, how far would it reach?

                          - All the way to the moon, darling. All the way to the moon.

                          https://scienceispoetry.net/files/parasites.txt

                          themadhatter@mastodon.socialT julianoliver@mastodon.socialJ 2 Replies Last reply
                          0
                          • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                            - Mum, if you made a chain out of all the endpoint addresses of AI crawlers, how far would it reach?

                            - All the way to the moon, darling. All the way to the moon.

                            https://scienceispoetry.net/files/parasites.txt

                            themadhatter@mastodon.socialT This user is from outside of this forum
                            themadhatter@mastodon.socialT This user is from outside of this forum
                            themadhatter@mastodon.social
                            wrote last edited by
                            #106

                            @JulianOliver indeed

                            1 Reply Last reply
                            0
                            • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                              I've started to harvest a list of AI crawler endpoint addrs for your blacklisting pleasure.

                              I'll try to keep it updated. I've been fastidious with ensuring I'm only pulling those related to the known user agent, so as not to have any false positives

                              https://scienceispoetry.net/files/parasites.txt

                              It is at the same path for all contributed domains.

                              For instance:

                              https://carrot.mro1.de/files/parasites.txt

                              jasperbuma@mstdn.socialJ This user is from outside of this forum
                              jasperbuma@mstdn.socialJ This user is from outside of this forum
                              jasperbuma@mstdn.social
                              wrote last edited by
                              #107

                              @JulianOliver Thanks is for this!

                              I added the list to my Crowdsec firewall bouncer, that should block them. Right?

                              julianoliver@mastodon.socialJ 1 Reply Last reply
                              0
                              • jasperbuma@mstdn.socialJ jasperbuma@mstdn.social

                                @JulianOliver Thanks is for this!

                                I added the list to my Crowdsec firewall bouncer, that should block them. Right?

                                julianoliver@mastodon.socialJ This user is from outside of this forum
                                julianoliver@mastodon.socialJ This user is from outside of this forum
                                julianoliver@mastodon.social
                                wrote last edited by
                                #108

                                @jasperbuma It should indeed!

                                1 Reply Last reply
                                0
                                • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                                  Do you have an unused domain that you would be happy to donate to a counter-offensive against unchecked & unregulated AI crawlers that scrape human-made content to simulate & deceive for profit?

                                  If so, pls reply to this post. Your domain would become an entrypoint to the AI tarpit & Poison-as-a-Service project below, allowing concerned public to choose to use it on their sites, helping make the project more resilient to blacklisting.

                                  Link Preview Image
                                  Science is Poetry

                                  favicon

                                  (julianoliver.com)

                                  #ai #bigtech #tacticalmedia

                                  aks@scalie.zoneA This user is from outside of this forum
                                  aks@scalie.zoneA This user is from outside of this forum
                                  aks@scalie.zone
                                  wrote last edited by
                                  #109

                                  @JulianOliver i could dedicate subdomains such as science.akselmo.dev to this. Just let me know how.

                                  julianoliver@mastodon.socialJ 1 Reply Last reply
                                  0
                                  • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                                    - Mum, if you made a chain out of all the endpoint addresses of AI crawlers, how far would it reach?

                                    - All the way to the moon, darling. All the way to the moon.

                                    https://scienceispoetry.net/files/parasites.txt

                                    julianoliver@mastodon.socialJ This user is from outside of this forum
                                    julianoliver@mastodon.socialJ This user is from outside of this forum
                                    julianoliver@mastodon.social
                                    wrote last edited by
                                    #110

                                    Here's a thing I did in a couple of mins to ban all IPs in the parasites.txt serverside. You could ofc REJECT rather than DROP to send a message.

                                    ---
                                    #!/bin/bash

                                    while read parasite;
                                    do
                                    if [[ "$parasite" == *"."* ]]; then
                                    iptables -I INPUT -s "$parasite" -j DROP
                                    elif [[ "$parasite" == *":"* ]]; then
                                    ip6tables -I INPUT -s "$parasite" -j DROP
                                    fi
                                    done < /path/to/parasites.txt
                                    ---

                                    julianoliver@mastodon.socialJ pertho@mastodon.bsd.cafeP 2 Replies Last reply
                                    0
                                    • dzwiedziu@mastodon.socialD This user is from outside of this forum
                                      dzwiedziu@mastodon.socialD This user is from outside of this forum
                                      dzwiedziu@mastodon.social
                                      wrote last edited by
                                      #111

                                      @tseitr
                                      I'm curious about this also.

                                      Edit: if all I need to do is add the A and AAAA records, then the answer could be “yes”.

                                      @JulianOliver

                                      julianoliver@mastodon.socialJ 1 Reply Last reply
                                      0
                                      • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                                        Here's a thing I did in a couple of mins to ban all IPs in the parasites.txt serverside. You could ofc REJECT rather than DROP to send a message.

                                        ---
                                        #!/bin/bash

                                        while read parasite;
                                        do
                                        if [[ "$parasite" == *"."* ]]; then
                                        iptables -I INPUT -s "$parasite" -j DROP
                                        elif [[ "$parasite" == *":"* ]]; then
                                        ip6tables -I INPUT -s "$parasite" -j DROP
                                        fi
                                        done < /path/to/parasites.txt
                                        ---

                                        julianoliver@mastodon.socialJ This user is from outside of this forum
                                        julianoliver@mastodon.socialJ This user is from outside of this forum
                                        julianoliver@mastodon.social
                                        wrote last edited by
                                        #112

                                        Actual hits dropping slightly, but more data is pulled from the tarpit day on day. This is reflected by a higher proportion of HTTP 200's - so less bad req's. Less reaching for what isn't there, just want the madness.

                                        Unclear why this has changed.

                                        Link Preview Image
                                        julianoliver@mastodon.socialJ 1 Reply Last reply
                                        0
                                        • dzwiedziu@mastodon.socialD dzwiedziu@mastodon.social

                                          @tseitr
                                          I'm curious about this also.

                                          Edit: if all I need to do is add the A and AAAA records, then the answer could be “yes”.

                                          @JulianOliver

                                          julianoliver@mastodon.socialJ This user is from outside of this forum
                                          julianoliver@mastodon.socialJ This user is from outside of this forum
                                          julianoliver@mastodon.social
                                          wrote last edited by
                                          #113

                                          @dzwiedziu @tseitr Thanks both! Yes as simple as picking any unused domain (canonical or sub) and setting these records to point to the server:

                                          A: 95.216.76.85
                                          AAAA: 2a01:4f9:2b:c83::2

                                          Then, DM or toot me the domain. Once set, I'll let you know, and then it's time to share your tarpit domain liberally: link in the footer of your site, landing page a friendly wiki you want to protect, blog post etc.

                                          Ideally should be toward the front of the content.

                                          dzwiedziu@mastodon.socialD 1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups