Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. Had a lot of fun with my stats students today.

Had a lot of fun with my stats students today.

Scheduled Pinned Locked Moved Uncategorized
112 Posts 62 Posters 20 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • futurebird@sauropods.winF futurebird@sauropods.win

    @Bumblefish

    Which one is random?
    (data sets are 100 numbers 1 to 6)

    listA=[2,3,5,1,2,2,4,2,4,5,2,3,3,4,5,6,4,2,6,2,2,1,3,4,5,5,6,3,3,6,1,4,2,1,4,5,2,2,3,3,3,5,6,3,2,4,5,5,1,1,1,6,1,4,3,5,5,3,1,1,1,6,1,4,6,6,3,6,6,2,4,4,4,5,1,5,6,2,6,1,1,2,4,2,2,3,4,4,5,6,1,3,3,3,5,4,6,5,1,6]

    listB=[4,2,5,6,3,5,3,1,3,4,2,3,4,3,4,5,5,1,3,3,2,1,1,6,1,3,2,2,2,6,1,5,6,3,6,3,2,3,2,4,6,1,1,6,3,2,4,1,6,1,3,1,5,6,2,3,3,5,1,6,4,5,2,5,1,1,5,3,6,2,3,3,6,5,2,3,3,1,6,3,2,3,2,1,6,6,4,4,6,2,4,5,4,5,3,4,6,5,3,2]

    noplasticshower@infosec.exchangeN This user is from outside of this forum
    noplasticshower@infosec.exchangeN This user is from outside of this forum
    noplasticshower@infosec.exchange
    wrote last edited by
    #24

    @futurebird @Bumblefish that question makes no sense

    1 Reply Last reply
    0
    • futurebird@sauropods.winF futurebird@sauropods.win

      The LLM is like a little box of computer horrors that we peer into from time to time.

      I'm sorry but the whole interface is just so silly.

      You ask for random numbers with sentences and it pretends to give them to you? What are we doooooing?

      gatesvp@mstdn.caG This user is from outside of this forum
      gatesvp@mstdn.caG This user is from outside of this forum
      gatesvp@mstdn.ca
      wrote last edited by
      #25

      @futurebird I am reminded of a Doctor Who episode, where they realize they are in a simulation because they are incapable of generating truly random numbers. One scene has a whole bunch of scientists sitting at a table and they all keep yelling the same number at the same time.

      1 Reply Last reply
      0
      • dpiponi@mathstodon.xyzD dpiponi@mathstodon.xyz

        @burnitdown @futurebird These days if you really want random numbers you can have them. Eg. RDRAND on Intel chips is seeded by analogue circuitry, not by some state updated in RAM. And even if you don't use RDRAND directly its output is still used as a source of entropy for other generators.

        digitalcalibrator@hol.ogra.phD This user is from outside of this forum
        digitalcalibrator@hol.ogra.phD This user is from outside of this forum
        digitalcalibrator@hol.ogra.ph
        wrote last edited by
        #26

        @dpiponi@mathstodon.xyz @burnitdown@beige.party @futurebird@sauropods.win and cloudflare famously uses a camera pointed at a wall of lava lamps because the motion is random

        1 Reply Last reply
        0
        • futurebird@sauropods.winF futurebird@sauropods.win

          @Bumblefish

          Which one is random?
          (data sets are 100 numbers 1 to 6)

          listA=[2,3,5,1,2,2,4,2,4,5,2,3,3,4,5,6,4,2,6,2,2,1,3,4,5,5,6,3,3,6,1,4,2,1,4,5,2,2,3,3,3,5,6,3,2,4,5,5,1,1,1,6,1,4,3,5,5,3,1,1,1,6,1,4,6,6,3,6,6,2,4,4,4,5,1,5,6,2,6,1,1,2,4,2,2,3,4,4,5,6,1,3,3,3,5,4,6,5,1,6]

          listB=[4,2,5,6,3,5,3,1,3,4,2,3,4,3,4,5,5,1,3,3,2,1,1,6,1,3,2,2,2,6,1,5,6,3,6,3,2,3,2,4,6,1,1,6,3,2,4,1,6,1,3,1,5,6,2,3,3,5,1,6,4,5,2,5,1,1,5,3,6,2,3,3,6,5,2,3,3,1,6,3,2,3,2,1,6,6,4,4,6,2,4,5,4,5,3,4,6,5,3,2]

          zalasur@mastodon.surazal.netZ This user is from outside of this forum
          zalasur@mastodon.surazal.netZ This user is from outside of this forum
          zalasur@mastodon.surazal.net
          wrote last edited by
          #27

          @futurebird @Bumblefish There's literally no way to say whether a list of numbers is random or not (1, 1, 1, 1, etc can plausibly be a random sequence for all we know), though you can establish likelihoods by looking at the distribution.

          futurebird@sauropods.winF 1 Reply Last reply
          0
          • futurebird@sauropods.winF futurebird@sauropods.win

            @Bumblefish

            Which one is random?
            (data sets are 100 numbers 1 to 6)

            listA=[2,3,5,1,2,2,4,2,4,5,2,3,3,4,5,6,4,2,6,2,2,1,3,4,5,5,6,3,3,6,1,4,2,1,4,5,2,2,3,3,3,5,6,3,2,4,5,5,1,1,1,6,1,4,3,5,5,3,1,1,1,6,1,4,6,6,3,6,6,2,4,4,4,5,1,5,6,2,6,1,1,2,4,2,2,3,4,4,5,6,1,3,3,3,5,4,6,5,1,6]

            listB=[4,2,5,6,3,5,3,1,3,4,2,3,4,3,4,5,5,1,3,3,2,1,1,6,1,3,2,2,2,6,1,5,6,3,6,3,2,3,2,4,6,1,1,6,3,2,4,1,6,1,3,1,5,6,2,3,3,5,1,6,4,5,2,5,1,1,5,3,6,2,3,3,6,5,2,3,3,1,6,3,2,3,2,1,6,6,4,4,6,2,4,5,4,5,3,4,6,5,3,2]

            ramsey@phpc.socialR This user is from outside of this forum
            ramsey@phpc.socialR This user is from outside of this forum
            ramsey@phpc.social
            wrote last edited by
            #28

            @futurebird @Bumblefish The only way you could determine that something’s not random is if a pattern emerges in the data set. Even still, statistically, it is probable for a CSPRNG with good entropy to produce a random data set that looks like it’s not random—unlikely, but probable.

            futurebird@sauropods.winF ramsey@phpc.socialR 2 Replies Last reply
            0
            • dpiponi@mathstodon.xyzD dpiponi@mathstodon.xyz

              @futurebird It's very weird.

              In principle, if you take an LLM, you should be able to get it to generate random numbers in a way that reflects the numbers that appear in the corpus it was trained on. If you have the raw model you can probably do that.

              But if you ask ChatGPT (or at least if I do) it starts talking about how numbers taken from around us typically follow Benford's law so their first digits have a logarithmic distribution. When it then spits out some random numbers it's no longer sampling random numbers from the entire corpus but a sample that's probably heavily biased towards numbers that appear in articles about Benford's law. I.e. what people have previously said about these numbers, rather than the actual numbers.

              jedbrown@hachyderm.ioJ This user is from outside of this forum
              jedbrown@hachyderm.ioJ This user is from outside of this forum
              jedbrown@hachyderm.io
              wrote last edited by
              #29

              @dpiponi Even with a raw model, I don't see how you would sample from the distribution of numbers in the corpus. Perhaps provide no context and sample one or more tokens (using an independent pseudo-random number generator) from the distribution, and if the returned token parses as a number, return it to the user, otherwise try again. Providing any context/prompt would bias what is returned. This seems too contrived/circular.
              @futurebird

              dpiponi@mathstodon.xyzD 1 Reply Last reply
              0
              • zalasur@mastodon.surazal.netZ zalasur@mastodon.surazal.net

                @futurebird @Bumblefish There's literally no way to say whether a list of numbers is random or not (1, 1, 1, 1, etc can plausibly be a random sequence for all we know), though you can establish likelihoods by looking at the distribution.

                futurebird@sauropods.winF This user is from outside of this forum
                futurebird@sauropods.winF This user is from outside of this forum
                futurebird@sauropods.win
                wrote last edited by
                #30

                @zalasur @Bumblefish

                You *can* make an argument for one of these lists being random like a dice roll and the other being much less likely to be generated in that way.

                zalasur@mastodon.surazal.netZ 1 Reply Last reply
                0
                • ramsey@phpc.socialR ramsey@phpc.social

                  @futurebird @Bumblefish The only way you could determine that something’s not random is if a pattern emerges in the data set. Even still, statistically, it is probable for a CSPRNG with good entropy to produce a random data set that looks like it’s not random—unlikely, but probable.

                  futurebird@sauropods.winF This user is from outside of this forum
                  futurebird@sauropods.winF This user is from outside of this forum
                  futurebird@sauropods.win
                  wrote last edited by
                  #31

                  @ramsey @Bumblefish

                  Only one of these lists could *plausibly* be from rolling dice.

                  ramsey@phpc.socialR ldpm@wandering.shopL 2 Replies Last reply
                  0
                  • ramsey@phpc.socialR ramsey@phpc.social

                    @futurebird @Bumblefish The only way you could determine that something’s not random is if a pattern emerges in the data set. Even still, statistically, it is probable for a CSPRNG with good entropy to produce a random data set that looks like it’s not random—unlikely, but probable.

                    ramsey@phpc.socialR This user is from outside of this forum
                    ramsey@phpc.socialR This user is from outside of this forum
                    ramsey@phpc.social
                    wrote last edited by
                    #32

                    @futurebird @Bumblefish I have a UUID-generating library that, under certain conditions, could generate the same identical UUIDs because the CSPRNG it used ended up reusing the same entropy seed, unless the server was restarted. That was a *fun* bug to investigate and fix. 😉

                    1 Reply Last reply
                    0
                    • futurebird@sauropods.winF futurebird@sauropods.win

                      @ramsey @Bumblefish

                      Only one of these lists could *plausibly* be from rolling dice.

                      ramsey@phpc.socialR This user is from outside of this forum
                      ramsey@phpc.socialR This user is from outside of this forum
                      ramsey@phpc.social
                      wrote last edited by
                      #33

                      @futurebird @Bumblefish Based on the statistical distribution of the dice rolls?

                      1 Reply Last reply
                      0
                      • futurebird@sauropods.winF futurebird@sauropods.win

                        @Bumblefish

                        Which one is random?
                        (data sets are 100 numbers 1 to 6)

                        listA=[2,3,5,1,2,2,4,2,4,5,2,3,3,4,5,6,4,2,6,2,2,1,3,4,5,5,6,3,3,6,1,4,2,1,4,5,2,2,3,3,3,5,6,3,2,4,5,5,1,1,1,6,1,4,3,5,5,3,1,1,1,6,1,4,6,6,3,6,6,2,4,4,4,5,1,5,6,2,6,1,1,2,4,2,2,3,4,4,5,6,1,3,3,3,5,4,6,5,1,6]

                        listB=[4,2,5,6,3,5,3,1,3,4,2,3,4,3,4,5,5,1,3,3,2,1,1,6,1,3,2,2,2,6,1,5,6,3,6,3,2,3,2,4,6,1,1,6,3,2,4,1,6,1,3,1,5,6,2,3,3,5,1,6,4,5,2,5,1,1,5,3,6,2,3,3,6,5,2,3,3,1,6,3,2,3,2,1,6,6,4,4,6,2,4,5,4,5,3,4,6,5,3,2]

                        raederle@masto.nuR This user is from outside of this forum
                        raederle@masto.nuR This user is from outside of this forum
                        raederle@masto.nu
                        wrote last edited by
                        #34

                        @futurebird @Bumblefish I like list A for random and list B for “planned random”.

                        1 Reply Last reply
                        0
                        • futurebird@sauropods.winF futurebird@sauropods.win

                          @Bumblefish

                          Which one is random?
                          (data sets are 100 numbers 1 to 6)

                          listA=[2,3,5,1,2,2,4,2,4,5,2,3,3,4,5,6,4,2,6,2,2,1,3,4,5,5,6,3,3,6,1,4,2,1,4,5,2,2,3,3,3,5,6,3,2,4,5,5,1,1,1,6,1,4,3,5,5,3,1,1,1,6,1,4,6,6,3,6,6,2,4,4,4,5,1,5,6,2,6,1,1,2,4,2,2,3,4,4,5,6,1,3,3,3,5,4,6,5,1,6]

                          listB=[4,2,5,6,3,5,3,1,3,4,2,3,4,3,4,5,5,1,3,3,2,1,1,6,1,3,2,2,2,6,1,5,6,3,6,3,2,3,2,4,6,1,1,6,3,2,4,1,6,1,3,1,5,6,2,3,3,5,1,6,4,5,2,5,1,1,5,3,6,2,3,3,6,5,2,3,3,1,6,3,2,3,2,1,6,6,4,4,6,2,4,5,4,5,3,4,6,5,3,2]

                          dlakelan@mastodon.sdf.orgD This user is from outside of this forum
                          dlakelan@mastodon.sdf.orgD This user is from outside of this forum
                          dlakelan@mastodon.sdf.org
                          wrote last edited by
                          #35

                          @futurebird
                          just to clarify what she means is as if from random unbiased 6 sided die rolls.

                          @Bumblefish

                          dlakelan@mastodon.sdf.orgD 1 Reply Last reply
                          0
                          • futurebird@sauropods.winF futurebird@sauropods.win

                            The LLM is like a little box of computer horrors that we peer into from time to time.

                            I'm sorry but the whole interface is just so silly.

                            You ask for random numbers with sentences and it pretends to give them to you? What are we doooooing?

                            f_dion@mastodon.onlineF This user is from outside of this forum
                            f_dion@mastodon.onlineF This user is from outside of this forum
                            f_dion@mastodon.online
                            wrote last edited by
                            #36

                            @futurebird the first episode of Numb3rs covered the appearance of randomness vs true randomness. I would not have remember that but watched a bunch of episodes to serve as math concept inspiration for the 31 music pieces I wrote and performed (on actual hardware synths) the whole month of January for #jamuary2026 #math #music #synths

                            Link Preview Image
                            Jamuary 2026

                            Listen to Jamuary 2026, a playlist curated by Francois Dion on desktop and mobile.

                            favicon

                            SoundCloud (soundcloud.com)

                            1 Reply Last reply
                            0
                            • dlakelan@mastodon.sdf.orgD dlakelan@mastodon.sdf.org

                              @futurebird
                              just to clarify what she means is as if from random unbiased 6 sided die rolls.

                              @Bumblefish

                              dlakelan@mastodon.sdf.orgD This user is from outside of this forum
                              dlakelan@mastodon.sdf.orgD This user is from outside of this forum
                              dlakelan@mastodon.sdf.org
                              wrote last edited by
                              #37

                              @futurebird
                              things I would check are first the frequency of each number... they should be somewhat uniform but not TOO close to equal as all exactly equal is unlikely... next I'd look at the length of repeat sequences and compare to expected values.

                              the actual definition of random sequences (Per Martin-Löf) is in terms of passing tests actually
                              @Bumblefish

                              alienghic@timeloop.cafeA danpmoore@mathstodon.xyzD vgarzareyna@mstdn.mxV 3 Replies Last reply
                              0
                              • futurebird@sauropods.winF futurebird@sauropods.win

                                @Bumblefish

                                Which one is random?
                                (data sets are 100 numbers 1 to 6)

                                listA=[2,3,5,1,2,2,4,2,4,5,2,3,3,4,5,6,4,2,6,2,2,1,3,4,5,5,6,3,3,6,1,4,2,1,4,5,2,2,3,3,3,5,6,3,2,4,5,5,1,1,1,6,1,4,3,5,5,3,1,1,1,6,1,4,6,6,3,6,6,2,4,4,4,5,1,5,6,2,6,1,1,2,4,2,2,3,4,4,5,6,1,3,3,3,5,4,6,5,1,6]

                                listB=[4,2,5,6,3,5,3,1,3,4,2,3,4,3,4,5,5,1,3,3,2,1,1,6,1,3,2,2,2,6,1,5,6,3,6,3,2,3,2,4,6,1,1,6,3,2,4,1,6,1,3,1,5,6,2,3,3,5,1,6,4,5,2,5,1,1,5,3,6,2,3,3,6,5,2,3,3,1,6,3,2,3,2,1,6,6,4,4,6,2,4,5,4,5,3,4,6,5,3,2]

                                madjohnroberts@mastodon.socialM This user is from outside of this forum
                                madjohnroberts@mastodon.socialM This user is from outside of this forum
                                madjohnroberts@mastodon.social
                                wrote last edited by
                                #38

                                @futurebird @Bumblefish listA has 17 occurrences of 1-4 and 16 of 5-6, where listB has different frequencies for each. I would guess that listB is actually random, listA is too nice.

                                sabrina@fedi01.unicornsparkle.clubS 1 Reply Last reply
                                0
                                • futurebird@sauropods.winF futurebird@sauropods.win

                                  The LLM is like a little box of computer horrors that we peer into from time to time.

                                  I'm sorry but the whole interface is just so silly.

                                  You ask for random numbers with sentences and it pretends to give them to you? What are we doooooing?

                                  apophis@yourwalls.todayA This user is from outside of this forum
                                  apophis@yourwalls.todayA This user is from outside of this forum
                                  apophis@yourwalls.today
                                  wrote last edited by
                                  #39
                                  @futurebird now i'm morbidly curious about what output it gave

                                  ...and, relatedly, whether asking it for random words would net a very high frequency of ninjas, monkeys and sporks...
                                  1 Reply Last reply
                                  0
                                  • futurebird@sauropods.winF futurebird@sauropods.win

                                    @Bumblefish

                                    Which one is random?
                                    (data sets are 100 numbers 1 to 6)

                                    listA=[2,3,5,1,2,2,4,2,4,5,2,3,3,4,5,6,4,2,6,2,2,1,3,4,5,5,6,3,3,6,1,4,2,1,4,5,2,2,3,3,3,5,6,3,2,4,5,5,1,1,1,6,1,4,3,5,5,3,1,1,1,6,1,4,6,6,3,6,6,2,4,4,4,5,1,5,6,2,6,1,1,2,4,2,2,3,4,4,5,6,1,3,3,3,5,4,6,5,1,6]

                                    listB=[4,2,5,6,3,5,3,1,3,4,2,3,4,3,4,5,5,1,3,3,2,1,1,6,1,3,2,2,2,6,1,5,6,3,6,3,2,3,2,4,6,1,1,6,3,2,4,1,6,1,3,1,5,6,2,3,3,5,1,6,4,5,2,5,1,1,5,3,6,2,3,3,6,5,2,3,3,1,6,3,2,3,2,1,6,6,4,4,6,2,4,5,4,5,3,4,6,5,3,2]

                                    apophis@yourwalls.todayA This user is from outside of this forum
                                    apophis@yourwalls.todayA This user is from outside of this forum
                                    apophis@yourwalls.today
                                    wrote last edited by
                                    #40
                                    @futurebird i'm guessing the second one is made up because there aren't enough triples?


                                    @Bumblefish
                                    apophis@yourwalls.todayA futurebird@sauropods.winF 2 Replies Last reply
                                    0
                                    • apophis@yourwalls.todayA apophis@yourwalls.today
                                      @futurebird i'm guessing the second one is made up because there aren't enough triples?


                                      @Bumblefish
                                      apophis@yourwalls.todayA This user is from outside of this forum
                                      apophis@yourwalls.todayA This user is from outside of this forum
                                      apophis@yourwalls.today
                                      wrote last edited by
                                      #41
                                      @futurebird @Bumblefish no, scratch, that, list A has a *lot* of triples, like a disturbing number, and there are so many ascending patterns...
                                      1 Reply Last reply
                                      0
                                      • futurebird@sauropods.winF futurebird@sauropods.win

                                        The LLM is like a little box of computer horrors that we peer into from time to time.

                                        I'm sorry but the whole interface is just so silly.

                                        You ask for random numbers with sentences and it pretends to give them to you? What are we doooooing?

                                        bicebird@toot.walesB This user is from outside of this forum
                                        bicebird@toot.walesB This user is from outside of this forum
                                        bicebird@toot.wales
                                        wrote last edited by
                                        #42

                                        @futurebird 🎶 little box, little box of horrors 🎶

                                        1 Reply Last reply
                                        0
                                        • jedbrown@hachyderm.ioJ jedbrown@hachyderm.io

                                          @dpiponi Even with a raw model, I don't see how you would sample from the distribution of numbers in the corpus. Perhaps provide no context and sample one or more tokens (using an independent pseudo-random number generator) from the distribution, and if the returned token parses as a number, return it to the user, otherwise try again. Providing any context/prompt would bias what is returned. This seems too contrived/circular.
                                          @futurebird

                                          dpiponi@mathstodon.xyzD This user is from outside of this forum
                                          dpiponi@mathstodon.xyzD This user is from outside of this forum
                                          dpiponi@mathstodon.xyz
                                          wrote last edited by
                                          #43

                                          @jedbrown @futurebird You described exactly what I would do. Obviously it would depend on an external PRNG and yes, no prompt. One natural way to use an LLM is to transform draws from a PRNG into draws from a distribution intended to represent some corpus. Picking numbers out of these draws would be expected to have a similar distribution to picking numbers from the original corpus. IIRC I may already have tested to see of the results conform to Benford's law - I did a lot of stuff like that when llama.cpp first became available. You have to select the right parameters to have llama.cpp use the distribution "correctly".

                                          1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups