Eloquence 64-bit for NVDA now supports a 44 kHz mode.
-
@x0 For more on that and author discussions check here: https://github.com/fastfinge/eloquence_64/issues/88#issuecomment-4091293709
@amir Oh God, dude rolled his own resampling algorithm with AI? At least 44100 is integer upsampling which is by far easier than fractional upsampling or downsampling, but why not a known open one? R8Brain free, for example. Sinc interpolation giving too ideal upsampling to keep it sounding dull I guess?
-
@amir Oh God, dude rolled his own resampling algorithm with AI? At least 44100 is integer upsampling which is by far easier than fractional upsampling or downsampling, but why not a known open one? R8Brain free, for example. Sinc interpolation giving too ideal upsampling to keep it sounding dull I guess?
@x0 He knows what he's doing and properly tests every step of the way. Honestly I don't see any issues with AI as the code is clearly presented. As for his upsampling approach, he apparently tested a couple of them, but this one produced the best quality without known speech jitters affecting IBMTTS with ViaVoice's 22 kHZ mode.
-
@x0 He knows what he's doing and properly tests every step of the way. Honestly I don't see any issues with AI as the code is clearly presented. As for his upsampling approach, he apparently tested a couple of them, but this one produced the best quality without known speech jitters affecting IBMTTS with ViaVoice's 22 kHZ mode.
@amir Huh. IBM must have implemented that one incorrectly. It's required to use the watson voice, though, and that one doesn't jitter.
-
Eloquence 64-bit for NVDA now supports a 44 kHz mode. It’s currently in preview and still being fine-tuned.
You can use the Sample Rate combo box in the Voices dialog to increase the default rate from 11 kHz to 44 kHz for clearer speech.
Unlike the 22 kHz mode in IBMTTS and ViaVoice, this mode doesn’t introduce pops, jitters, or unexpected voice resets.
Try the 16.1 upsampling preview release:
https://github.com/hozosch/eloquence_64/releases/tag/upsampling_preview
For more on the upsampling preview and author discussions check here:
https://github.com/fastfinge/eloquence_64/issues/88#issuecomment-4091293709@bmoore123 @amir Problem is, you'll never improve audio quality by upsampling.
-
@amir Huh. IBM must have implemented that one incorrectly. It's required to use the watson voice, though, and that one doesn't jitter.
@x0 The ViaVoice jitter with 22 kHZ is a known issue, and affects certain letters, like t, in certain situations. It's actually a speech pop rather than a jitter. Also getting IBMTTS to use 22 kHZ in ViaVoice is quite burdensome whereas Eloquence 64-bit handles it without requiring extra steps.
-
@x0 The ViaVoice jitter with 22 kHZ is a known issue, and affects certain letters, like t, in certain situations. It's actually a speech pop rather than a jitter. Also getting IBMTTS to use 22 kHZ in ViaVoice is quite burdensome whereas Eloquence 64-bit handles it without requiring extra steps.
@amir Oh? What's burdensome about it besides the engine actually trying to predicate on supported versions? If you try to force something that doesn't support it you get fast forward speech.
-
@bmoore123 @amir Problem is, you'll never improve audio quality by upsampling.
@Bruce @bmoore123 But 44 kHZ does improve it whereas the 8 kHZ mode does reduce the audio quality.
-
@bmoore123 @amir Problem is, you'll never improve audio quality by upsampling.
-
@amir Oh? What's burdensome about it besides the engine actually trying to predicate on supported versions? If you try to force something that doesn't support it you get fast forward speech.
@x0 No. The burdensome issue is finding the proper DLL from an older ViaVoice release which does support the 22 kHZ mode properly. The newer DLL which is installed by ViaVoice doesn't support it. Also IBMTTS does have its random voice resets to default with ViaVoice and nothing can be done about it.
-
@x0 No. The burdensome issue is finding the proper DLL from an older ViaVoice release which does support the 22 kHZ mode properly. The newer DLL which is installed by ViaVoice doesn't support it. Also IBMTTS does have its random voice resets to default with ViaVoice and nothing can be done about it.
@amir Huh. The always send current speech settings is supposed to fix that but IBM DLLs typically have that off because the annotations cause pauses, I think? The setting is a fix for the rate bug.
-
@amir Huh. The always send current speech settings is supposed to fix that but IBM DLLs typically have that off because the annotations cause pauses, I think? The setting is a fix for the rate bug.
@x0 Yeah it's supposed to fix that. But here checking or unchecking it doesn't fix the voice parameter resets at all.
-
Eloquence 64-bit for NVDA now supports a 44 kHz mode. It’s currently in preview and still being fine-tuned.
You can use the Sample Rate combo box in the Voices dialog to increase the default rate from 11 kHz to 44 kHz for clearer speech.
Unlike the 22 kHz mode in IBMTTS and ViaVoice, this mode doesn’t introduce pops, jitters, or unexpected voice resets.
Try the 16.1 upsampling preview release:
https://github.com/hozosch/eloquence_64/releases/tag/upsampling_preview
For more on the upsampling preview and author discussions check here:
https://github.com/fastfinge/eloquence_64/issues/88#issuecomment-4091293709@amir actually, it does sounds a lot clearer. better thn I expected.
-
@amir Oh? What's burdensome about it besides the engine actually trying to predicate on supported versions? If you try to force something that doesn't support it you get fast forward speech.
@x0 Also IBMTTS has issues with my own add-on, Typing & Spelling Rate, whereas these have been fixed in Eloquence 64-bit. If you use my add-on and spell something via a higher rate for spelling, or type via a higher rate for typing, IBMTTS's speech rate won't be decreased for other non-spelling and non-typing tasks.
-
@x0 Also IBMTTS has issues with my own add-on, Typing & Spelling Rate, whereas these have been fixed in Eloquence 64-bit. If you use my add-on and spell something via a higher rate for spelling, or type via a higher rate for typing, IBMTTS's speech rate won't be decreased for other non-spelling and non-typing tasks.
@amir Huh. Before the bridge? That's odd, I knew it had issues with indexing but I thought without the bridge embedded commands worked just fine, after all MathML does it all the time.
-
@amir actually, it does sounds a lot clearer. better thn I expected.
@bmoore123 Yeah I was also surprised. And the author wants to add another slider for more fine-tuning.
-
@amir Huh. Before the bridge? That's odd, I knew it had issues with indexing but I thought without the bridge embedded commands worked just fine, after all MathML does it all the time.
@x0 Nope. it doesn't, I tested the latest IBMTTS preview release.
-
@bmoore123 @amir Problem is, you'll never improve audio quality by upsampling.
@Bruce @bmoore123 @amir I think whether it is an improvement or not depends on how you look at it. Objectively speaking, the upsampler can't improve anything because it can't intellligently add missing content. But its aliasing artifacts (which are in the high frequencies) makes it sound better to some people. Not knocking the people who like it btw, I like it as much as anyone else. I just don't want people thinking of it as some modern quality enhancement voodoo, because it really isn't. It's a decades-old artifact that finally got implemented in an Eloquence add-on for people who like it.
-
@Bruce @bmoore123 @amir I think whether it is an improvement or not depends on how you look at it. Objectively speaking, the upsampler can't improve anything because it can't intellligently add missing content. But its aliasing artifacts (which are in the high frequencies) makes it sound better to some people. Not knocking the people who like it btw, I like it as much as anyone else. I just don't want people thinking of it as some modern quality enhancement voodoo, because it really isn't. It's a decades-old artifact that finally got implemented in an Eloquence add-on for people who like it.
@musicalman @Bruce @bmoore123 Right. Of course, no one was talking about modern quality enhancements. It improves speech/ audio quality like the 22 kHZ mode in IBMTTS for ViaVoice, or even better than that I'd say.
-
Eloquence 64-bit for NVDA now supports a 44 kHz mode. It’s currently in preview and still being fine-tuned.
You can use the Sample Rate combo box in the Voices dialog to increase the default rate from 11 kHz to 44 kHz for clearer speech.
Unlike the 22 kHz mode in IBMTTS and ViaVoice, this mode doesn’t introduce pops, jitters, or unexpected voice resets.
Try the 16.1 upsampling preview release:
https://github.com/hozosch/eloquence_64/releases/tag/upsampling_preview
For more on the upsampling preview and author discussions check here:
https://github.com/fastfinge/eloquence_64/issues/88#issuecomment-4091293709@amir This has the hiss the Eloquence on the iPhone has. It is particular Noticeable on words with s in them. I did hear some popping. but the hiss is very noticeable.
-
@amir This has the hiss the Eloquence on the iPhone has. It is particular Noticeable on words with s in them. I did hear some popping. but the hiss is very noticeable.
@Dennislong82 But I don't notice the hiss with Eloquence 64-bit at all, even with headphones. And yes, I do have the hiss on the iPhone. Perhaps it is speaker/ CPU-specific.