Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. Opinion poll:

Opinion poll:

Scheduled Pinned Locked Moved Uncategorized
12 Posts 6 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • veronica@explains.socialV veronica@explains.social

    I have never considered speech-to-text as "generative." I've always thought of it as transitioning information between contexts (aural to written).

    In other words, if I speak words, and then manually wrote them down, I generated them at the "speaking" part, not at the "writing them down" part.

    I've had folks disagree as of late, though, saying that speech-to-text is "generative" in that it literally generates content where content did not exist.

    This is a nuance I hadn't considered.

    jessebot@social.smallhack.orgJ This user is from outside of this forum
    jessebot@social.smallhack.orgJ This user is from outside of this forum
    jessebot@social.smallhack.org
    wrote last edited by
    #3

    @veronica I think this gets muddy given that people often use LLMs to do this these days (whereas previously they may not have).

    Side note: Steno is such a neato art form. I wish I had the time to get into it.

    veronica@explains.socialV 1 Reply Last reply
    0
    • veronica@explains.socialV veronica@explains.social

      I have never considered speech-to-text as "generative." I've always thought of it as transitioning information between contexts (aural to written).

      In other words, if I speak words, and then manually wrote them down, I generated them at the "speaking" part, not at the "writing them down" part.

      I've had folks disagree as of late, though, saying that speech-to-text is "generative" in that it literally generates content where content did not exist.

      This is a nuance I hadn't considered.

      veronica@explains.socialV This user is from outside of this forum
      veronica@explains.socialV This user is from outside of this forum
      veronica@explains.social
      wrote last edited by
      #4

      The reason I'm thinking about this distinction is captions.

      I am a firm believer in captions, and I've gone to great lengths to manually write out captions for every video.

      Speech-to-text has helped immensely with this task over the years. If I've done non-scripted content, using a speech-to-text program has saved me countless hours of retyping.

      But much more importantly, I know the technology has helped folks who are deaf or hard of hearing. Live captioning using speech-to-text is a great tool, particularly when it's locally hosted.

      I haven't considered that to some folks, that usage would count as "generative AI".

      veronica@explains.socialV 1 Reply Last reply
      0
      • veronica@explains.socialV veronica@explains.social

        The reason I'm thinking about this distinction is captions.

        I am a firm believer in captions, and I've gone to great lengths to manually write out captions for every video.

        Speech-to-text has helped immensely with this task over the years. If I've done non-scripted content, using a speech-to-text program has saved me countless hours of retyping.

        But much more importantly, I know the technology has helped folks who are deaf or hard of hearing. Live captioning using speech-to-text is a great tool, particularly when it's locally hosted.

        I haven't considered that to some folks, that usage would count as "generative AI".

        veronica@explains.socialV This user is from outside of this forum
        veronica@explains.socialV This user is from outside of this forum
        veronica@explains.social
        wrote last edited by
        #5

        Now, if the speech-to-text adds context, like changing the following text:

        "I used grep to find these files"

        into this:

        "Veronica said she used grep to find these files"

        I think that part is generative, if that makes sense? Am I making sense?

        veronica@explains.socialV janwilejan@snug.moeJ ciphermonger@infosec.exchangeC 3 Replies Last reply
        0
        • veronica@explains.socialV veronica@explains.social

          Now, if the speech-to-text adds context, like changing the following text:

          "I used grep to find these files"

          into this:

          "Veronica said she used grep to find these files"

          I think that part is generative, if that makes sense? Am I making sense?

          veronica@explains.socialV This user is from outside of this forum
          veronica@explains.socialV This user is from outside of this forum
          veronica@explains.social
          wrote last edited by
          #6

          I'm slowly devolving back into my earliest form, "imagined scenarios for academic purposes"

          1 Reply Last reply
          0
          • jessebot@social.smallhack.orgJ jessebot@social.smallhack.org

            @veronica I think this gets muddy given that people often use LLMs to do this these days (whereas previously they may not have).

            Side note: Steno is such a neato art form. I wish I had the time to get into it.

            veronica@explains.socialV This user is from outside of this forum
            veronica@explains.socialV This user is from outside of this forum
            veronica@explains.social
            wrote last edited by
            #7

            @jessebot I'm with you, but a stenographer isn't going to follow a person who is deaf or hard of hearing around to do live captioning, and I think that's where the nuance really deepens for me.

            jessebot@social.smallhack.orgJ 1 Reply Last reply
            0
            • veronica@explains.socialV veronica@explains.social

              I have never considered speech-to-text as "generative." I've always thought of it as transitioning information between contexts (aural to written).

              In other words, if I speak words, and then manually wrote them down, I generated them at the "speaking" part, not at the "writing them down" part.

              I've had folks disagree as of late, though, saying that speech-to-text is "generative" in that it literally generates content where content did not exist.

              This is a nuance I hadn't considered.

              aeris@firefish.imirhil.frA This user is from outside of this forum
              aeris@firefish.imirhil.frA This user is from outside of this forum
              aeris@firefish.imirhil.fr
              wrote last edited by
              #8

              @veronica@explains.social "generative" for AI is not about the fact it generate content, but about the way it generate the content. Procedural generation vs generative generation for example.

              1 Reply Last reply
              0
              • veronica@explains.socialV veronica@explains.social

                Now, if the speech-to-text adds context, like changing the following text:

                "I used grep to find these files"

                into this:

                "Veronica said she used grep to find these files"

                I think that part is generative, if that makes sense? Am I making sense?

                janwilejan@snug.moeJ This user is from outside of this forum
                janwilejan@snug.moeJ This user is from outside of this forum
                janwilejan@snug.moe
                wrote last edited by
                #9

                @veronica iirc Piper tts will translate silence into "thank you" as a common hallucination.

                so yes, newer speech-to-text models are generative AI.

                1 Reply Last reply
                0
                • veronica@explains.socialV veronica@explains.social

                  @jessebot I'm with you, but a stenographer isn't going to follow a person who is deaf or hard of hearing around to do live captioning, and I think that's where the nuance really deepens for me.

                  jessebot@social.smallhack.orgJ This user is from outside of this forum
                  jessebot@social.smallhack.orgJ This user is from outside of this forum
                  jessebot@social.smallhack.org
                  wrote last edited by
                  #10

                  @veronica And that's fair. I just feel like in as many instances possible, there should be interpreters. In the Netherlands, you are allowed to request an interpreter in many situations, for instance at the hospital. It's also common to have official interpreters at cons that can be requested (This was the case at the most recent Linux Foundation run con I went to - Kubecon Europe - hosted in Amsterdam). I'd like to see more situations where we employ humans to do this work, rather than try to solve more of this with tech, especially where people will lean towards AI, because I don't know that a lot of people with disabilities would feel comfortable using tech that has so many horrible moral consequences behind it.

                  (To be clear: I'm not saying you're advocating for AI or anything, and sorry if it came off that way. Also, not saying you personally have a moral failing or anything. 🙇)

                  1 Reply Last reply
                  0
                  • veronica@explains.socialV veronica@explains.social

                    Now, if the speech-to-text adds context, like changing the following text:

                    "I used grep to find these files"

                    into this:

                    "Veronica said she used grep to find these files"

                    I think that part is generative, if that makes sense? Am I making sense?

                    ciphermonger@infosec.exchangeC This user is from outside of this forum
                    ciphermonger@infosec.exchangeC This user is from outside of this forum
                    ciphermonger@infosec.exchange
                    wrote last edited by
                    #11

                    @veronica Don't worry, it will instead change it to say "I use grape to find these files", leaving everyone wondering how fruit solves anything.

                    1 Reply Last reply
                    0
                    • veronica@explains.socialV veronica@explains.social

                      Opinion poll:

                      In your opinion, is speech-to-text "generative AI?"

                      sortius@infosec.exchangeS This user is from outside of this forum
                      sortius@infosec.exchangeS This user is from outside of this forum
                      sortius@infosec.exchange
                      wrote last edited by
                      #12

                      @veronica speech-to-text has existed longer than LLMs, so I'm not sure why "AI" even comes into it.

                      There are definitely arguments against using LLMs to do the processing, as the systems that existed before worked relatively well, and didn't require fascism or killing the planet

                      1 Reply Last reply
                      1
                      0
                      • R relay@relay.infosec.exchange shared this topic
                      Reply
                      • Reply as topic
                      Log in to reply
                      • Oldest to Newest
                      • Newest to Oldest
                      • Most Votes


                      • Login

                      • Login or register to search.
                      • First post
                        Last post
                      0
                      • Categories
                      • Recent
                      • Tags
                      • Popular
                      • World
                      • Users
                      • Groups