Pleased to share a page and explainer for the AI tarpit project Science is Poetry, with legal statement, rationale(s), and a few deployment notes:

bigtech
180 Posts 56 Posters 81 Views
  • julianoliver@mastodon.socialJ julianoliver@mastodon.social

    Even faster now.

    Again, these pages are randomly generated, and each line is a page request from a crawler.

    To think of the energy expended at a global scale, the waste. All the money, water & minerals thrown at this. These AI companies are near DoS'ing the human web as they deep-sea trawl our content.

    Computationally, infrastructurally, & culturally, it's an obscenity.

    elithebearded@fed.qaz.red wrote (#97):

    @JulianOliver

    Are you still looking for domains?

    Somehow www.qaz.red is pointing at 95.216.76.85. Should I add an AAAA record, too?

    • twilliability@genart.socialT twilliability@genart.social

      @JulianOliver yes it's at first idea stage 🙂 it is not beyond me to take liberties for an experience that is memorable but maybe not 1:1 with reality.

      screen is easier, lots of pixels

      hamoid@genart.social wrote (#98):

      @twilliability @JulianOliver I would still like to hear it. Maybe with headphones, so the room is not unbearable. Also, there are many ways to sonify it. It could sound like cockroaches walking on paper, for instance 😁

        julianoliver@mastodon.social wrote (#99):

        @twilliability Hehe. Me too. You live coders are good at managing chaos, so perhaps you could find a way to tame it, or pick out certain outlier patterns from these vacuum cleaners. It's true looking at crawler operators with huge swarms that they do shift across IP ranges as they feed, so there's that to play with I guess. OpenAI and Amazon in particular.

        You'd get a lot more perceptible detail if you could slow them down, but my experience is that if you try to rate-limit them too much they lose interest!

          julianoliver@mastodon.social wrote (#100):

          @elithebearded Oh hey thanks! I'll add it today. An AAAA would be great if you have a moment.

            elithebearded@fed.qaz.red wrote (#101):

            @JulianOliver

            Done. Copied from tender.horse, if it matters

              julianoliver@mastodon.social wrote (#102):

              @elithebearded You are live and listed here 🙂

              SEANCE IS POTTERY (scienceispoetry.net)

              • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                Do you have an unused domain that you would be happy to donate to a counter-offensive against unchecked & unregulated AI crawlers that scrape human-made content to simulate & deceive for profit?

                If so, please reply to this post. Your domain would become an entrypoint to the AI tarpit & Poison-as-a-Service project below, allowing the concerned public to choose to use it on their sites and helping make the project more resilient to blacklisting.

                Science is Poetry (julianoliver.com)

                #ai #bigtech #tacticalmedia

                texjoachim@blabber.rocks wrote (#103):

                @JulianOliver I think I might have one. Need to check, though.

                  elithebearded@fed.qaz.red wrote (#104):

                  @JulianOliver

                  What a thing of beauty!

                    julianoliver@mastodon.social wrote (#105):

                    - Mum, if you made a chain out of all the endpoint addresses of AI crawlers, how far would it reach?

                    - All the way to the moon, darling. All the way to the moon.

                    https://scienceispoetry.net/files/parasites.txt

                      themadhatter@mastodon.social wrote (#106):

                      @JulianOliver indeed

                      • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                        I've started to harvest a list of AI crawler endpoint addresses for your blacklisting pleasure.

                        I'll try to keep it updated. I've been fastidious about only pulling addresses tied to a known crawler user agent, so as not to have any false positives.

                        https://scienceispoetry.net/files/parasites.txt

                        It is at the same path for all contributed domains.

                        For instance:

                        https://carrot.mro1.de/files/parasites.txt
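
For anyone who wants to build a similar list from their own server logs, here is a rough sketch of the "known user agent only" idea. It assumes an access log in the common "combined" format (client address first, user agent as the last quoted field) and uses GPTBot and Amazonbot purely as example patterns. The function name and `UA_PATTERN` are hypothetical, and this is not how parasites.txt itself is produced.

```shell
# Sketch: print the unique client addresses whose user agent matches a
# known crawler pattern. Assumes the "combined" log format, where the
# user agent is the last double-quoted field on each line.
# UA_PATTERN is an illustrative default, not the project's actual list.
UA_PATTERN="${UA_PATTERN:-GPTBot|Amazonbot}"

crawler_addrs() {
    # Reads log lines on stdin. Splitting on '"' puts the user agent in
    # the second-to-last field; the client address is the first word.
    awk -F'"' -v pat="$UA_PATTERN" '
        NF > 1 && $(NF-1) ~ pat { split($1, head, " "); print head[1] }
    ' | sort -u
}
```

Usage would be something like `crawler_addrs < /path/to/access.log`. Matching on the user agent alone trusts whatever the client claims to be, so it only catches crawlers that identify themselves.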

                        jasperbuma@mstdn.social wrote (#107):

                        @JulianOliver Thanks for this!

                        I added the list to my CrowdSec firewall bouncer. That should block them, right?

                          julianoliver@mastodon.social wrote (#108):

                          @jasperbuma It should indeed!

                            aks@scalie.zone wrote (#109):

                            @JulianOliver I could dedicate subdomains such as science.akselmo.dev to this. Just let me know how.

                              julianoliver@mastodon.social wrote (#110):

                              Here's a thing I did in a couple of minutes to ban all IPs in parasites.txt server-side. You could of course REJECT rather than DROP to send a message.

                              ---
                              #!/bin/bash
                              # Run as root. Inserts one DROP rule per address in parasites.txt.

                              while IFS= read -r parasite; do
                                  # Test for ':' first so IPv4-mapped IPv6 entries go to ip6tables.
                                  if [[ "$parasite" == *":"* ]]; then
                                      ip6tables -I INPUT -s "$parasite" -j DROP
                                  elif [[ "$parasite" == *"."* ]]; then
                                      iptables -I INPUT -s "$parasite" -j DROP
                                  fi
                              done < /path/to/parasites.txt
                              ---
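
If you would rather review the rules before touching the firewall, a dry-run variant is one option. The sketch below only prints the commands and lets you choose REJECT over DROP. `TARGET`, `classify_addr` and `emit_rules` are hypothetical names, and it assumes parasites.txt holds one address or CIDR range per line, possibly with blank lines or `#` comments.

```shell
# Sketch: print (not run) one firewall command per entry so the rules
# can be reviewed first. TARGET and the function names are illustrative.
TARGET="${TARGET:-DROP}"   # set TARGET=REJECT to refuse rather than silently drop

classify_addr() {
    # Prints "v6" for anything containing a colon, "v4" for dotted
    # entries, and "skip" for blanks, comments, and anything else.
    case "$1" in
        ''|'#'*) echo skip ;;
        *:*)     echo v6 ;;
        *.*)     echo v4 ;;
        *)       echo skip ;;
    esac
}

emit_rules() {
    # Reads addresses on stdin, one per line (a missing final newline is fine).
    while IFS= read -r addr || [ -n "$addr" ]; do
        case "$(classify_addr "$addr")" in
            v4) echo "iptables -I INPUT -s $addr -j $TARGET" ;;
            v6) echo "ip6tables -I INPUT -s $addr -j $TARGET" ;;
        esac
    done
}
```

Run `emit_rules < /path/to/parasites.txt` to inspect the output, then pipe it to `sh` as root once it looks right. With a list this long, one rule per address gets slow to evaluate, so a set-based approach such as ipset or nftables sets may be worth a look.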

                              dzwiedziu@mastodon.social wrote (#111):

                                @tseitr
                                I'm curious about this also.

                                Edit: if all I need to do is add the A and AAAA records, then the answer could be “yes”.

                                @JulianOliver

                                  julianoliver@mastodon.social wrote (#112):

                                  Actual hits are dropping slightly, but more data is pulled from the tarpit day on day. This is reflected in a higher proportion of HTTP 200s, so fewer bad requests. Less reaching for what isn't there; they just want the madness.

                                  Unclear why this has changed.
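
One rough way to track that proportion on your own server is sketched below. It assumes the status code is the 9th whitespace-separated field, as in the common and combined access log formats, and `ok_share` is a hypothetical helper name. Malformed request lines can shift the fields, so treat the number as approximate.

```shell
# Sketch: whole-number percentage of requests answered with HTTP 200.
# Assumes the status code is the 9th whitespace-separated field; adjust
# the field number for other log formats.
ok_share() {
    # Reads log lines on stdin and prints a rounded percentage.
    awk '
        { total++ }
        $9 == 200 { ok++ }
        END {
            if (total > 0) printf "%d\n", int(100 * ok / total + 0.5)
            else print "0"
        }
    '
}
```

Running `ok_share < /path/to/access.log` once per daily log file gives a day-on-day series to compare.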

                                    julianoliver@mastodon.social wrote (#113):

                                    @dzwiedziu @tseitr Thanks both! Yes, it's as simple as picking any unused domain (canonical or sub) and setting these records to point to the server:

                                    A: 95.216.76.85
                                    AAAA: 2a01:4f9:2b:c83::2

                                    Then DM or toot me the domain. Once it's set up I'll let you know, and then it's time to share your tarpit domain liberally: a link in the footer of your site, on a landing page, on a friendly wiki you want to protect, in a blog post, etc.

                                    Ideally the link should be toward the front of the content.
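
Before sending the domain over, you could check that both records resolve as intended. This is a minimal sketch using the two addresses quoted above. `points_at_tarpit` and `check_domain` are hypothetical helper names, and `check_domain` assumes the `dig` tool is installed and that it prints the IPv6 address in the same compressed form.

```shell
# Addresses quoted in the post above.
TARPIT_A="95.216.76.85"
TARPIT_AAAA="2a01:4f9:2b:c83::2"

points_at_tarpit() {
    # $1: record type (A or AAAA); $2: resolver output, one address per line.
    # Succeeds if any line exactly equals the expected tarpit address.
    case "$1" in
        A)    want="$TARPIT_A" ;;
        AAAA) want="$TARPIT_AAAA" ;;
        *)    return 2 ;;
    esac
    printf '%s\n' "$2" | grep -qxF -- "$want"
}

check_domain() {
    # Needs network access and dig; $1 is the domain to check.
    for rtype in A AAAA; do
        if points_at_tarpit "$rtype" "$(dig +short "$1" "$rtype")"; then
            echo "$1 $rtype ok"
        else
            echo "$1 $rtype missing or different"
        fi
    done
}
```

For example, `check_domain tarpit.example.org` (a placeholder domain) prints one line per record type. DNS changes can take a while to propagate, so a "missing" result right after editing the zone is not necessarily an error.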

                                      dzwiedziu@mastodon.social wrote (#114):

                                      @JulianOliver
                                      Then I'll set up mine either very quickly or in a matter of weeks (WIP moving between countries).

                                      @tseitr

                                        julianoliver@mastodon.social wrote (#115):

                                        @dzwiedziu @tseitr Been there a few times - no rush!

                                        julianoliver@mastodon.social wrote (#116):

                                          @retech That's the word for it. Computationally, environmentally, culturally, infrastructurally - an obscenity.
