Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. generative so-called "AI" is now being used to transcribe and translate Latin manuscripts.

generative so-called "AI" is now being used to transcribe and translate Latin manuscripts.

Scheduled Pinned Locked Moved Uncategorized
23 Posts 20 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • elilla@transmom.loveE elilla@transmom.love

    generative so-called "AI" is now being used to transcribe and translate Latin manuscripts.

    guess the results:
    https://blacksky.community/profile/did:plc:vpepa7tkir7iaia3risvyjc6/post/3mhntbkyadc24

    Link Preview Image
    Dan Conway (@magisterconway.bsky.social)

    Update: it gets so much worse. All of Bede's letters have no basis in the original Latin at all. This one reads like a modern email! (Original Latin, real translation, fake "translation")

    favicon

    Blacksky (blacksky.community)

    elilla@transmom.loveE This user is from outside of this forum
    elilla@transmom.loveE This user is from outside of this forum
    elilla@transmom.love
    wrote last edited by
    #3

    "AI" users are like, "I know this is imprecise but as a convenience these transcriptions are better than nothing"

    then 70 years from now we'll still be struggling to debunk these entirely hallucinated transcriptions of thousands of manuscripts that were pissed into the pool of human knowledge.

    some things are worse than nothing. "signal-shaped noise" is worse than nothing.

    gustodon@mas.toG laprice@beige.partyL cairobraga@gts.cairobraga.comC securitywriter@infosec.exchangeS onekind@beige.partyO 16 Replies Last reply
    1
    0
    • elilla@transmom.loveE elilla@transmom.love

      "AI" users are like, "I know this is imprecise but as a convenience these transcriptions are better than nothing"

      then 70 years from now we'll still be struggling to debunk these entirely hallucinated transcriptions of thousands of manuscripts that were pissed into the pool of human knowledge.

      some things are worse than nothing. "signal-shaped noise" is worse than nothing.

      gustodon@mas.toG This user is from outside of this forum
      gustodon@mas.toG This user is from outside of this forum
      gustodon@mas.to
      wrote last edited by
      #4

      @elilla 🏆

      1 Reply Last reply
      0
      • elilla@transmom.loveE elilla@transmom.love

        "AI" users are like, "I know this is imprecise but as a convenience these transcriptions are better than nothing"

        then 70 years from now we'll still be struggling to debunk these entirely hallucinated transcriptions of thousands of manuscripts that were pissed into the pool of human knowledge.

        some things are worse than nothing. "signal-shaped noise" is worse than nothing.

        laprice@beige.partyL This user is from outside of this forum
        laprice@beige.partyL This user is from outside of this forum
        laprice@beige.party
        wrote last edited by
        #5

        @elilla "signal-shaped noise" is a very apt description. Thank you.

        1 Reply Last reply
        0
        • elilla@transmom.loveE elilla@transmom.love

          "AI" users are like, "I know this is imprecise but as a convenience these transcriptions are better than nothing"

          then 70 years from now we'll still be struggling to debunk these entirely hallucinated transcriptions of thousands of manuscripts that were pissed into the pool of human knowledge.

          some things are worse than nothing. "signal-shaped noise" is worse than nothing.

          cairobraga@gts.cairobraga.comC This user is from outside of this forum
          cairobraga@gts.cairobraga.comC This user is from outside of this forum
          cairobraga@gts.cairobraga.com
          wrote last edited by
          #6

          @elilla A BANGER POST

          1 Reply Last reply
          0
          • elilla@transmom.loveE elilla@transmom.love

            "AI" users are like, "I know this is imprecise but as a convenience these transcriptions are better than nothing"

            then 70 years from now we'll still be struggling to debunk these entirely hallucinated transcriptions of thousands of manuscripts that were pissed into the pool of human knowledge.

            some things are worse than nothing. "signal-shaped noise" is worse than nothing.

            securitywriter@infosec.exchangeS This user is from outside of this forum
            securitywriter@infosec.exchangeS This user is from outside of this forum
            securitywriter@infosec.exchange
            wrote last edited by
            #7

            @elilla I am continuously arguing the accuracy debt angle. I have made significant strides in some areas of building ai 'bulkheads'.

            Low background radiation steel is a good analogy I use. There is before, and there is after.

            1 Reply Last reply
            0
            • elilla@transmom.loveE elilla@transmom.love

              "AI" users are like, "I know this is imprecise but as a convenience these transcriptions are better than nothing"

              then 70 years from now we'll still be struggling to debunk these entirely hallucinated transcriptions of thousands of manuscripts that were pissed into the pool of human knowledge.

              some things are worse than nothing. "signal-shaped noise" is worse than nothing.

              onekind@beige.partyO This user is from outside of this forum
              onekind@beige.partyO This user is from outside of this forum
              onekind@beige.party
              wrote last edited by
              #8

              @elilla There's a great book by Stefania Tutino called A Fake Saint and the True Church about the forgery of a saint out of letters between Naples and Rome in the 17th C. No AIs were necessary, just lots and lots of letters. As my favourite linguist points out there's no way to guarantee the veridity of discourse at the level of discourse itself. Never has been. AI didn't change that.

              1 Reply Last reply
              0
              • R relay@relay.infosec.exchange shared this topic
              • elilla@transmom.loveE elilla@transmom.love

                "AI" users are like, "I know this is imprecise but as a convenience these transcriptions are better than nothing"

                then 70 years from now we'll still be struggling to debunk these entirely hallucinated transcriptions of thousands of manuscripts that were pissed into the pool of human knowledge.

                some things are worse than nothing. "signal-shaped noise" is worse than nothing.

                jzb@hachyderm.ioJ This user is from outside of this forum
                jzb@hachyderm.ioJ This user is from outside of this forum
                jzb@hachyderm.io
                wrote last edited by
                #9

                @elilla transcription / translation is one of the areas where I see a good use for LLMs at the moment. But, only as a first-pass.

                I use Speech Note to do a first pass at transcribing audio from talks and such that I will write about. But I also go back and watch the talk and clean up the transcript -- I'm not blindly trusting the output, I'm just trying to speed up the act of typing it out and saving some wear and tear on my hands.

                An LLM-generated translation or transcription that is not verified is, IMO, generally a dangerous thing. It might be fine for local use to try to get the gist of something, but no organization should be publishing those types of things without verification.

                gloriouscow@oldbytes.spaceG 1 Reply Last reply
                0
                • elilla@transmom.loveE elilla@transmom.love

                  "AI" users are like, "I know this is imprecise but as a convenience these transcriptions are better than nothing"

                  then 70 years from now we'll still be struggling to debunk these entirely hallucinated transcriptions of thousands of manuscripts that were pissed into the pool of human knowledge.

                  some things are worse than nothing. "signal-shaped noise" is worse than nothing.

                  gudenau@hachyderm.ioG This user is from outside of this forum
                  gudenau@hachyderm.ioG This user is from outside of this forum
                  gudenau@hachyderm.io
                  wrote last edited by
                  #10

                  @elilla Signal-shaped noise is a great term, thank you for that one.

                  1 Reply Last reply
                  0
                  • elilla@transmom.loveE elilla@transmom.love

                    "AI" users are like, "I know this is imprecise but as a convenience these transcriptions are better than nothing"

                    then 70 years from now we'll still be struggling to debunk these entirely hallucinated transcriptions of thousands of manuscripts that were pissed into the pool of human knowledge.

                    some things are worse than nothing. "signal-shaped noise" is worse than nothing.

                    dalias@hachyderm.ioD This user is from outside of this forum
                    dalias@hachyderm.ioD This user is from outside of this forum
                    dalias@hachyderm.io
                    wrote last edited by
                    #11

                    @elilla "Signal-shaped noise" is an utterly brilliant characterization of what "gen AI" produces.

                    1 Reply Last reply
                    0
                    • elilla@transmom.loveE elilla@transmom.love

                      "AI" users are like, "I know this is imprecise but as a convenience these transcriptions are better than nothing"

                      then 70 years from now we'll still be struggling to debunk these entirely hallucinated transcriptions of thousands of manuscripts that were pissed into the pool of human knowledge.

                      some things are worse than nothing. "signal-shaped noise" is worse than nothing.

                      jmelesky@tinylad.socialJ This user is from outside of this forum
                      jmelesky@tinylad.socialJ This user is from outside of this forum
                      jmelesky@tinylad.social
                      wrote last edited by
                      #12

                      @elilla@transmom.love I'm a data engineer. I've been saying for years if not decades: "Bad data is worse than no data". And, generally, when people hear that, they agree with me.

                      When I point out that genAI produces bad data, the turnaround to "oh, but, so useful", "early days", etc, is quick and disheartening.

                      1 Reply Last reply
                      0
                      • elilla@transmom.loveE elilla@transmom.love

                        "AI" users are like, "I know this is imprecise but as a convenience these transcriptions are better than nothing"

                        then 70 years from now we'll still be struggling to debunk these entirely hallucinated transcriptions of thousands of manuscripts that were pissed into the pool of human knowledge.

                        some things are worse than nothing. "signal-shaped noise" is worse than nothing.

                        lorxus@yiff.lifeL This user is from outside of this forum
                        lorxus@yiff.lifeL This user is from outside of this forum
                        lorxus@yiff.life
                        wrote last edited by
                        #13

                        @elilla @jacel As someone who likes using (but not remotely relying on) automated transcription and notetaking that way... as far as I'm concerned, if anyone's *training* on that stuff, then they deserve exactly what they'll get. And if whatever big corporation is *putting that stuff in training sets*, then they need to quit shitting where they eat.

                        jacel@m.prettyshiny.orgJ 1 Reply Last reply
                        0
                        • lorxus@yiff.lifeL lorxus@yiff.life

                          @elilla @jacel As someone who likes using (but not remotely relying on) automated transcription and notetaking that way... as far as I'm concerned, if anyone's *training* on that stuff, then they deserve exactly what they'll get. And if whatever big corporation is *putting that stuff in training sets*, then they need to quit shitting where they eat.

                          jacel@m.prettyshiny.orgJ This user is from outside of this forum
                          jacel@m.prettyshiny.orgJ This user is from outside of this forum
                          jacel@m.prettyshiny.org
                          wrote last edited by
                          #14

                          @lorxus @elilla yeah, but.

                          You know whose training set all these things that are being indiscriminately vomited into the informational substrate of humanity /do/ end up in?

                          The people's c.c

                          lorxus@yiff.lifeL 1 Reply Last reply
                          0
                          • elilla@transmom.loveE elilla@transmom.love

                            "AI" users are like, "I know this is imprecise but as a convenience these transcriptions are better than nothing"

                            then 70 years from now we'll still be struggling to debunk these entirely hallucinated transcriptions of thousands of manuscripts that were pissed into the pool of human knowledge.

                            some things are worse than nothing. "signal-shaped noise" is worse than nothing.

                            rose_alibi@post.lurk.orgR This user is from outside of this forum
                            rose_alibi@post.lurk.orgR This user is from outside of this forum
                            rose_alibi@post.lurk.org
                            wrote last edited by
                            #15

                            @elilla SIGNAL-SHAPED NOISE

                            1 Reply Last reply
                            0
                            • jzb@hachyderm.ioJ jzb@hachyderm.io

                              @elilla transcription / translation is one of the areas where I see a good use for LLMs at the moment. But, only as a first-pass.

                              I use Speech Note to do a first pass at transcribing audio from talks and such that I will write about. But I also go back and watch the talk and clean up the transcript -- I'm not blindly trusting the output, I'm just trying to speed up the act of typing it out and saving some wear and tear on my hands.

                              An LLM-generated translation or transcription that is not verified is, IMO, generally a dangerous thing. It might be fine for local use to try to get the gist of something, but no organization should be publishing those types of things without verification.

                              gloriouscow@oldbytes.spaceG This user is from outside of this forum
                              gloriouscow@oldbytes.spaceG This user is from outside of this forum
                              gloriouscow@oldbytes.space
                              wrote last edited by
                              #16

                              @elilla I experimented with using ChatGPT to do OCR on old scanned assembly code listings.

                              Columnar text has always been a huge challenge for OCR, and I had already tried Tesseract and given up on it.

                              At first I thought the results from ChatGPT were a revolutionary leap in the state of the art.

                              Then I looked closer - it had reworded the comments and headers. It even changed the code in places, swapping out entire mnemonics and parameters.

                              Like any good sloperator I tried to prompt may way around this, which was met by effusive apologies and assurances that it would, going forward, be sure to never do that again.

                              Which of course, it immediately did.

                              I suspect there's only the most tenuous thread of context between a "multi-modal" LLM's text and image capabilities - they're basically just two models duct-taped together.

                              I find this particularly disturbing as if someone simply doing an editorial pass looking for spelling or grammar errors may not notice that the content appears fundamentally correct, but was actually altered.

                              I would rather wade through a sea of Tesseract's obvious typos than have to take on the much higher cognitive burden of making sure grammatically correct sentences weren't invented wholesale.

                              1 Reply Last reply
                              0
                              • elilla@transmom.loveE elilla@transmom.love

                                "AI" users are like, "I know this is imprecise but as a convenience these transcriptions are better than nothing"

                                then 70 years from now we'll still be struggling to debunk these entirely hallucinated transcriptions of thousands of manuscripts that were pissed into the pool of human knowledge.

                                some things are worse than nothing. "signal-shaped noise" is worse than nothing.

                                moonhouse@social.tchncs.deM This user is from outside of this forum
                                moonhouse@social.tchncs.deM This user is from outside of this forum
                                moonhouse@social.tchncs.de
                                wrote last edited by
                                #17

                                @elilla Earlier today I reflected on how AI generated closed captions on local news here in Sweden are too exact. When a human does them in Sweden they remove filler words and repeat words. When they suddenly are there it takes more cognitive effort to read what people are saying.

                                1 Reply Last reply
                                0
                                • elilla@transmom.loveE elilla@transmom.love

                                  "AI" users are like, "I know this is imprecise but as a convenience these transcriptions are better than nothing"

                                  then 70 years from now we'll still be struggling to debunk these entirely hallucinated transcriptions of thousands of manuscripts that were pissed into the pool of human knowledge.

                                  some things are worse than nothing. "signal-shaped noise" is worse than nothing.

                                  707kat@mastodon.art7 This user is from outside of this forum
                                  707kat@mastodon.art7 This user is from outside of this forum
                                  707kat@mastodon.art
                                  wrote last edited by
                                  #18

                                  @elilla Wrong information is so not better than nothing. 😅

                                  1 Reply Last reply
                                  0
                                  • elilla@transmom.loveE elilla@transmom.love

                                    "AI" users are like, "I know this is imprecise but as a convenience these transcriptions are better than nothing"

                                    then 70 years from now we'll still be struggling to debunk these entirely hallucinated transcriptions of thousands of manuscripts that were pissed into the pool of human knowledge.

                                    some things are worse than nothing. "signal-shaped noise" is worse than nothing.

                                    weekend_editor@mathstodon.xyzW This user is from outside of this forum
                                    weekend_editor@mathstodon.xyzW This user is from outside of this forum
                                    weekend_editor@mathstodon.xyz
                                    wrote last edited by
                                    #19

                                    @elilla

                                    Thing is, our myths and literature have been telling us this for millennia!

                                    *All* the oracle stories involve an oracle saying something ambiguous, which the protagonist dangerously misinterprets. It will always be mushy, you'll always choose the wrong interpretation, and it will always be your fault. In that sense, saying "you have to check the AI result" is a threat, meaning the AI is free to make mistakes, but you will be held liable.

                                    This is not positive information; it is almost *negative* information in that we still don't know the truth, but are tempted into dangerous fantasies of misinterpretation.

                                    We've even turned the whole mess into a cautionary tale with the "ibis redibis" story of the oracle at Dodona, a caution heeded nowadays by almost nobody:

                                    Link Preview Image
                                    Ibis redibis nunquam per bella peribis - Wikipedia

                                    favicon

                                    (en.wikipedia.org)

                                    1 Reply Last reply
                                    0
                                    • elilla@transmom.loveE elilla@transmom.love

                                      "AI" users are like, "I know this is imprecise but as a convenience these transcriptions are better than nothing"

                                      then 70 years from now we'll still be struggling to debunk these entirely hallucinated transcriptions of thousands of manuscripts that were pissed into the pool of human knowledge.

                                      some things are worse than nothing. "signal-shaped noise" is worse than nothing.

                                      rogerparkinson@mastodon.nzR This user is from outside of this forum
                                      rogerparkinson@mastodon.nzR This user is from outside of this forum
                                      rogerparkinson@mastodon.nz
                                      wrote last edited by
                                      #20

                                      @elilla I have direct experience of this. There's a handwritten letter from my grandfather dated around 1914 that turned up in a box of stuff. It's in cursive and younger people are less familiar with cursive so a family member put it through chatgtp. The result was, as you'd expect, vaguely similar to what was written, with some alarming inaccuracies. And it missed the actual point he was writing about.
                                      I'm old enough to read cursive and I've had some recent experience making out other old writing in much worse hand, so I could read it quite well. A couple of words were hard to decipher but not impossible.
                                      So my conclusion was that the AI transcription was worse than useless.

                                      1 Reply Last reply
                                      0
                                      • elilla@transmom.loveE elilla@transmom.love

                                        "AI" users are like, "I know this is imprecise but as a convenience these transcriptions are better than nothing"

                                        then 70 years from now we'll still be struggling to debunk these entirely hallucinated transcriptions of thousands of manuscripts that were pissed into the pool of human knowledge.

                                        some things are worse than nothing. "signal-shaped noise" is worse than nothing.

                                        chemicaleyeguy@mstdn.scienceC This user is from outside of this forum
                                        chemicaleyeguy@mstdn.scienceC This user is from outside of this forum
                                        chemicaleyeguy@mstdn.science
                                        wrote last edited by
                                        #21

                                        @elilla #AI is #clankers all the way down.

                                        #Resist #broligarchs and #wankers and their #AIslop.

                                        1 Reply Last reply
                                        0
                                        • jacel@m.prettyshiny.orgJ jacel@m.prettyshiny.org

                                          @lorxus @elilla yeah, but.

                                          You know whose training set all these things that are being indiscriminately vomited into the informational substrate of humanity /do/ end up in?

                                          The people's c.c

                                          lorxus@yiff.lifeL This user is from outside of this forum
                                          lorxus@yiff.lifeL This user is from outside of this forum
                                          lorxus@yiff.life
                                          wrote last edited by
                                          #22

                                          @jacel @elilla As in, people will read those and uncritically accept it? Or something else?

                                          1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups