Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. Today MJ12bot made 2570 request to my blog but it got trapped by garbage so 2487 answers involved random word salad from Moby Dick.

Today MJ12bot made 2570 request to my blog but it got trapped by garbage so 2487 answers involved random word salad from Moby Dick.

Scheduled Pinned Locked Moved Uncategorized
butlerianjihad
1 Posts 1 Posters 4 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • alex@social.alexschroeder.chA This user is from outside of this forum
    alex@social.alexschroeder.chA This user is from outside of this forum
    alex@social.alexschroeder.ch
    wrote last edited by
    #1

    Today MJ12bot made 2570 request to my blog but it got trapped by garbage so 2487 answers involved random word salad from Moby Dick.

    "Majestic is a UK based specialist search engine used by hundreds of thousands of businesses in 13 languages and over 60 countries to paint a map of the Internet independent of the consumer based search engines. Majestic also powers other legitimate technologies that help to understand the continually changing fabric of the web."

    No thanks.

    "MJ12bot adheres to the robots.txt standard. If you want the bot to prevent website from being crawled then add the following text to your robots.txt:"

       User-agent: MJ12bot
       Disallow: /
    

    I guess my robots.txt doesn't match that exactly:

    #  curl https://alexschroeder.ch/robots.txt
    User-agent: Wibybot
    User-agent: Xobaque
    User-agent: search.marginalia.nu
    Allow: /view/
    
    User-agent: *
    Disallow: /
    DisallowAITraining: /
    

    I still feel that this should block most bots.

    #ButlerianJihad

    1 Reply Last reply
    2
    0
    • R relay@relay.infosec.exchange shared this topic
      R relay@relay.mycrowd.ca shared this topic
    Reply
    • Reply as topic
    Log in to reply
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes


    • Login

    • Login or register to search.
    • First post
      Last post
    0
    • Categories
    • Recent
    • Tags
    • Popular
    • World
    • Users
    • Groups