Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. Pleased to share a page and explainer for the AI tarpit project Science is Poetry, with legal statement, rationale(s), and a few deployment notes:

Pleased to share a page and explainer for the AI tarpit project Science is Poetry, with legal statement, rationale(s), and a few deployment notes:

Scheduled Pinned Locked Moved Uncategorized
bigtech
180 Posts 56 Posters 81 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • pertho@mastodon.bsd.cafeP pertho@mastodon.bsd.cafe

    @JulianOliver I think scraper bots and other parasites constantly scan TLS transparency reports to find new domains to probe. As soon as you have a new certificate, they start hitting your web server.

    julianoliver@mastodon.socialJ This user is from outside of this forum
    julianoliver@mastodon.socialJ This user is from outside of this forum
    julianoliver@mastodon.social
    wrote last edited by
    #51

    @pertho Very interesting! I will look into this closely. Thank you.

    quite@mstdn.socialQ 1 Reply Last reply
    0
    • julianoliver@mastodon.socialJ julianoliver@mastodon.social

      Do you have an unused domain that you would be happy to donate to a counter-offensive against unchecked & unregulated AI crawlers that scrape human-made content to simulate & deceive for profit?

      If so, pls reply to this post. Your domain would become an entrypoint to the AI tarpit & Poison-as-a-Service project below, allowing concerned public to choose to use it on their sites, helping make the project more resilient to blacklisting.

      Link Preview Image
      Science is Poetry

      favicon

      (julianoliver.com)

      #ai #bigtech #tacticalmedia

      vortex@tldr.nettime.orgV This user is from outside of this forum
      vortex@tldr.nettime.orgV This user is from outside of this forum
      vortex@tldr.nettime.org
      wrote last edited by
      #52

      @JulianOliver heya. How do I donate and configure domains for your experiment? Do they come with any restrictions, i.e. must the FQDNs be in distinct DNS zones, etc? Can easily spin up and configure a few to begin with e.g.

      poetry.zenr.io
      worst.case.zenr.io
      wurst.case.zenr.io
      et.c.

      1 Reply Last reply
      0
      • julianoliver@mastodon.socialJ julianoliver@mastodon.social

        @futuresprog Great, and yes.. precisely!

        Here you go:

        A: 95.216.76.85
        AAAA: 2a01:4f9:2b:c83::2

        vortex@tldr.nettime.orgV This user is from outside of this forum
        vortex@tldr.nettime.orgV This user is from outside of this forum
        vortex@tldr.nettime.org
        wrote last edited by
        #53

        @JulianOliver @futuresprog ah, cool to know, thanks. can config a few from this ...

        julianoliver@mastodon.socialJ 1 Reply Last reply
        0
        • vortex@tldr.nettime.orgV vortex@tldr.nettime.org

          @JulianOliver @futuresprog ah, cool to know, thanks. can config a few from this ...

          julianoliver@mastodon.socialJ This user is from outside of this forum
          julianoliver@mastodon.socialJ This user is from outside of this forum
          julianoliver@mastodon.social
          wrote last edited by
          #54

          @vortex @futuresprog Thanks a lot Adam! Please let me know when you're done. By DM is also fine too.

          1 Reply Last reply
          0
          • pertho@mastodon.bsd.cafeP pertho@mastodon.bsd.cafe

            @JulianOliver I think scraper bots and other parasites constantly scan TLS transparency reports to find new domains to probe. As soon as you have a new certificate, they start hitting your web server.

            dch@bsd.networkD This user is from outside of this forum
            dch@bsd.networkD This user is from outside of this forum
            dch@bsd.network
            wrote last edited by
            #55

            @pertho @JulianOliver a way around this is to use both DNS and TLS wildcard support - there is no single domain list in the report that can be slopped.

            pertho@mastodon.bsd.cafeP 1 Reply Last reply
            0
            • dch@bsd.networkD dch@bsd.network

              @pertho @JulianOliver a way around this is to use both DNS and TLS wildcard support - there is no single domain list in the report that can be slopped.

              pertho@mastodon.bsd.cafeP This user is from outside of this forum
              pertho@mastodon.bsd.cafeP This user is from outside of this forum
              pertho@mastodon.bsd.cafe
              wrote last edited by
              #56

              @dch I would think they would still try the bare/apex domain and "www".
              @JulianOliver

              dch@bsd.networkD 1 Reply Last reply
              0
              • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                @pertho Very interesting! I will look into this closely. Thank you.

                quite@mstdn.socialQ This user is from outside of this forum
                quite@mstdn.socialQ This user is from outside of this forum
                quite@mstdn.social
                wrote last edited by
                #57

                @JulianOliver @pertho yes this is the thing. you can get hold of live streams of newly created TLS certificates, an example https://bencevans.io/security/certificate-stream

                quite@mstdn.socialQ 1 Reply Last reply
                0
                • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                  I've started to harvest a list of AI crawler endpoint addrs for your blacklisting pleasure.

                  I'll try to keep it updated. I've been fastidious with ensuring I'm only pulling those related to the known user agent, so as not to have any false positives

                  https://scienceispoetry.net/files/parasites.txt

                  It is at the same path for all contributed domains.

                  For instance:

                  https://carrot.mro1.de/files/parasites.txt

                  netopwibby@social.coopN This user is from outside of this forum
                  netopwibby@social.coopN This user is from outside of this forum
                  netopwibby@social.coop
                  wrote last edited by
                  #58

                  @JulianOliver "parasites" is a great name for this

                  joe_vinegar@mastodon.bida.imJ 1 Reply Last reply
                  0
                  • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                    Pleased to share a page and explainer for the AI tarpit project Science is Poetry, with legal statement, rationale(s), and a few deployment notes:

                    Link Preview Image
                    Science is Poetry

                    favicon

                    (julianoliver.com)

                    The page may grow a bit. Just wanted to get it out the door.

                    #AI #bigtech

                    computersandblues@post.lurk.orgC This user is from outside of this forum
                    computersandblues@post.lurk.orgC This user is from outside of this forum
                    computersandblues@post.lurk.org
                    wrote last edited by
                    #59

                    @JulianOliver do you want some subdomains on https://nein.wtf? feel free to pick!

                    julianoliver@mastodon.socialJ 1 Reply Last reply
                    0
                    • computersandblues@post.lurk.orgC computersandblues@post.lurk.org

                      @JulianOliver do you want some subdomains on https://nein.wtf? feel free to pick!

                      julianoliver@mastodon.socialJ This user is from outside of this forum
                      julianoliver@mastodon.socialJ This user is from outside of this forum
                      julianoliver@mastodon.social
                      wrote last edited by
                      #60

                      @computersandblues Amazing!

                      Might these be available?

                      - parasit
                      - karotte
                      - ftw

                      computersandblues@post.lurk.orgC 1 Reply Last reply
                      0
                      • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                        I've started to harvest a list of AI crawler endpoint addrs for your blacklisting pleasure.

                        I'll try to keep it updated. I've been fastidious with ensuring I'm only pulling those related to the known user agent, so as not to have any false positives

                        https://scienceispoetry.net/files/parasites.txt

                        It is at the same path for all contributed domains.

                        For instance:

                        https://carrot.mro1.de/files/parasites.txt

                        julianoliver@mastodon.socialJ This user is from outside of this forum
                        julianoliver@mastodon.socialJ This user is from outside of this forum
                        julianoliver@mastodon.social
                        wrote last edited by
                        #61

                        It's approaching DoS at this point. This just one of the VMs, and just OpenAI's parasite.

                        Threading's holding up but need some more tuning of rate limits and burst. Trying sending 429's now to ask them to play nice.

                        To think the www was built for people.

                        And here we are

                        mro@digitalcourage.socialM julianoliver@mastodon.socialJ twilliability@genart.socialT paulhanrahan@mastodon.socialP bastelwombat@chaos.socialB 5 Replies Last reply
                        0
                        • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                          @computersandblues Amazing!

                          Might these be available?

                          - parasit
                          - karotte
                          - ftw

                          computersandblues@post.lurk.orgC This user is from outside of this forum
                          computersandblues@post.lurk.orgC This user is from outside of this forum
                          computersandblues@post.lurk.org
                          wrote last edited by
                          #62

                          @JulianOliver they are, A and AAAA records already set up

                          julianoliver@mastodon.socialJ 1 Reply Last reply
                          0
                          • computersandblues@post.lurk.orgC computersandblues@post.lurk.org

                            @JulianOliver they are, A and AAAA records already set up

                            julianoliver@mastodon.socialJ This user is from outside of this forum
                            julianoliver@mastodon.socialJ This user is from outside of this forum
                            julianoliver@mastodon.social
                            wrote last edited by
                            #63

                            @computersandblues Ahh, so good. I'll add them first thing tomorrow. It's evening here. Will let you know as soon as I do 🙂

                            1 Reply Last reply
                            0
                            • pertho@mastodon.bsd.cafeP pertho@mastodon.bsd.cafe

                              @dch I would think they would still try the bare/apex domain and "www".
                              @JulianOliver

                              dch@bsd.networkD This user is from outside of this forum
                              dch@bsd.networkD This user is from outside of this forum
                              dch@bsd.network
                              wrote last edited by
                              #64

                              @pertho in my experience it seems they don’t even bother with wildcard domains @JulianOliver

                              pertho@mastodon.bsd.cafeP 1 Reply Last reply
                              0
                              • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                                Do you have an unused domain that you would be happy to donate to a counter-offensive against unchecked & unregulated AI crawlers that scrape human-made content to simulate & deceive for profit?

                                If so, pls reply to this post. Your domain would become an entrypoint to the AI tarpit & Poison-as-a-Service project below, allowing concerned public to choose to use it on their sites, helping make the project more resilient to blacklisting.

                                Link Preview Image
                                Science is Poetry

                                favicon

                                (julianoliver.com)

                                #ai #bigtech #tacticalmedia

                                wtl@mastodon.socialW This user is from outside of this forum
                                wtl@mastodon.socialW This user is from outside of this forum
                                wtl@mastodon.social
                                wrote last edited by
                                #65

                                @JulianOliver I have a slew of currently un -used domains. Where do I point them?

                                julianoliver@mastodon.socialJ 1 Reply Last reply
                                0
                                • wtl@mastodon.socialW wtl@mastodon.social

                                  @JulianOliver I have a slew of currently un -used domains. Where do I point them?

                                  julianoliver@mastodon.socialJ This user is from outside of this forum
                                  julianoliver@mastodon.socialJ This user is from outside of this forum
                                  julianoliver@mastodon.social
                                  wrote last edited by
                                  #66

                                  @WTL Thanks! Just these two records:

                                  A: 95.216.76.85
                                  AAAA: 2a01:4f9:2b:c83::2

                                  1 Reply Last reply
                                  0
                                  • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                                    I've started to harvest a list of AI crawler endpoint addrs for your blacklisting pleasure.

                                    I'll try to keep it updated. I've been fastidious with ensuring I'm only pulling those related to the known user agent, so as not to have any false positives

                                    https://scienceispoetry.net/files/parasites.txt

                                    It is at the same path for all contributed domains.

                                    For instance:

                                    https://carrot.mro1.de/files/parasites.txt

                                    mro@digitalcourage.socialM This user is from outside of this forum
                                    mro@digitalcourage.socialM This user is from outside of this forum
                                    mro@digitalcourage.social
                                    wrote last edited by
                                    #67

                                    Hi @JulianOliver,
                                    mind adding a parallel https://cdb.cr.yp.to file?

                                    1 Reply Last reply
                                    0
                                    • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                                      It's approaching DoS at this point. This just one of the VMs, and just OpenAI's parasite.

                                      Threading's holding up but need some more tuning of rate limits and burst. Trying sending 429's now to ask them to play nice.

                                      To think the www was built for people.

                                      And here we are

                                      mro@digitalcourage.socialM This user is from outside of this forum
                                      mro@digitalcourage.socialM This user is from outside of this forum
                                      mro@digitalcourage.social
                                      wrote last edited by
                                      #68

                                      Hi @JulianOliver,
                                      indeed an act of #hygiene blocking #bot​s: https://doi.org/10.17487/RFC8890 "The Internet is for End Users"

                                      1 Reply Last reply
                                      0
                                      • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                                        Do you have an unused domain that you would be happy to donate to a counter-offensive against unchecked & unregulated AI crawlers that scrape human-made content to simulate & deceive for profit?

                                        If so, pls reply to this post. Your domain would become an entrypoint to the AI tarpit & Poison-as-a-Service project below, allowing concerned public to choose to use it on their sites, helping make the project more resilient to blacklisting.

                                        Link Preview Image
                                        Science is Poetry

                                        favicon

                                        (julianoliver.com)

                                        #ai #bigtech #tacticalmedia

                                        thgie@post.lurk.orgT This user is from outside of this forum
                                        thgie@post.lurk.orgT This user is from outside of this forum
                                        thgie@post.lurk.org
                                        wrote last edited by
                                        #69

                                        You can add `dreckiger.schleimpilz.ch` to the list. Thanks for all your work!

                                        @JulianOliver

                                        julianoliver@mastodon.socialJ 1 Reply Last reply
                                        0
                                        • julianoliver@mastodon.socialJ julianoliver@mastodon.social

                                          Do you have an unused domain that you would be happy to donate to a counter-offensive against unchecked & unregulated AI crawlers that scrape human-made content to simulate & deceive for profit?

                                          If so, pls reply to this post. Your domain would become an entrypoint to the AI tarpit & Poison-as-a-Service project below, allowing concerned public to choose to use it on their sites, helping make the project more resilient to blacklisting.

                                          Link Preview Image
                                          Science is Poetry

                                          favicon

                                          (julianoliver.com)

                                          #ai #bigtech #tacticalmedia

                                          alexandermars@mastodon.socialA This user is from outside of this forum
                                          alexandermars@mastodon.socialA This user is from outside of this forum
                                          alexandermars@mastodon.social
                                          wrote last edited by
                                          #70

                                          @JulianOliver I have a handful of domains that I would love to see used for tarpitting.

                                          julianoliver@mastodon.socialJ 1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups