Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Data is Beautiful
  3. Mastodon - Graph of Non-English Instances [17MB]

Mastodon - Graph of Non-English Instances [17MB]

Scheduled Pinned Locked Moved Data is Beautiful
dataisbeautiful
2 Posts 2 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • P This user is from outside of this forum
    P This user is from outside of this forum
    podbrushkin@mander.xyz
    wrote last edited by
    #1

    On Mastodon, if you have an account on instance X, you can follow someone who is on instance Y. It creates a connection: X -> Y. If there are a lot of such follows, weight of this edge will increase, attractive force between points will be higher.

    Original explanation on the page of Kaggle dataset:

    "active users" graphs: For each instance, we consider the set of the 10K most recently active users. Then, for each user of an instance X, we consider the list of the users they follow, and add 1 to the edge from X to Y where Y is the instance the followed users. The weight of the edge from X to Y thus encodes how much the content seen on instance X is generated in instance Y. Note that this graph thus contains self loops.

    I've tried to layout this dataset in Gephi, but it was a classic hairy ball - everyone is connected to everyone, amount of edges is too high comparing to number of nodes. Then, I've filtered out all EN instances and suddenly got a meaningful picture:

    graph

    What can we see? If English-speaking instances are ignored, German, French and Japanese languages are most common across Mastodon. Japan and Korea don't hang around much with other folks, while French, German and Spanish instances are quite interconnected between each other.

    Size of nodes depends on centrality, post about centrality of Peertube instances is here.


    Gephi table

    Same, but Fruchterman-Reingold algorithm instead of ForceAtlas 2:

    FR

    Mastodon active users dataset can be downloaded here: https://www.kaggle.com/datasets/marcdamie/fediverse-graph-dataset-reduced

    Link Preview Image
    santiago@mastodon.uyS 1 Reply Last reply
    0
    • P podbrushkin@mander.xyz

      On Mastodon, if you have an account on instance X, you can follow someone who is on instance Y. It creates a connection: X -> Y. If there are a lot of such follows, weight of this edge will increase, attractive force between points will be higher.

      Original explanation on the page of Kaggle dataset:

      "active users" graphs: For each instance, we consider the set of the 10K most recently active users. Then, for each user of an instance X, we consider the list of the users they follow, and add 1 to the edge from X to Y where Y is the instance the followed users. The weight of the edge from X to Y thus encodes how much the content seen on instance X is generated in instance Y. Note that this graph thus contains self loops.

      I've tried to layout this dataset in Gephi, but it was a classic hairy ball - everyone is connected to everyone, amount of edges is too high comparing to number of nodes. Then, I've filtered out all EN instances and suddenly got a meaningful picture:

      graph

      What can we see? If English-speaking instances are ignored, German, French and Japanese languages are most common across Mastodon. Japan and Korea don't hang around much with other folks, while French, German and Spanish instances are quite interconnected between each other.

      Size of nodes depends on centrality, post about centrality of Peertube instances is here.


      Gephi table

      Same, but Fruchterman-Reingold algorithm instead of ForceAtlas 2:

      FR

      Mastodon active users dataset can be downloaded here: https://www.kaggle.com/datasets/marcdamie/fediverse-graph-dataset-reduced

      Link Preview Image
      santiago@mastodon.uyS This user is from outside of this forum
      santiago@mastodon.uyS This user is from outside of this forum
      santiago@mastodon.uy
      wrote last edited by
      #2

      @podbrushkin Hi, i cannot find mastodon.uy, is it included in the dataset? Cheers!

      1 Reply Last reply
      1
      0
      • R relay@relay.publicsquare.global shared this topic
      Reply
      • Reply as topic
      Log in to reply
      • Oldest to Newest
      • Newest to Oldest
      • Most Votes


      • Login

      • Login or register to search.
      • First post
        Last post
      0
      • Categories
      • Recent
      • Tags
      • Popular
      • World
      • Users
      • Groups