Eloquence 64-bit for NVDA now supports a 44 kHz mode.

x0@dragonscave.space

@amir Oh God, dude rolled his own resampling algorithm with AI? At least 44100 is integer upsampling, which is far easier than fractional upsampling or downsampling, but why not a known open one? R8Brain free, for example. I guess sinc interpolation gives upsampling that's too ideal, so it would keep sounding dull?
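
To illustrate the integer vs. fractional point above, here is a minimal sketch, not the add-on's actual code. It assumes an 11025 Hz source rate (the thread never states the synth's native rate) and uses two hypothetical helpers, `resample_ratio` and `zero_stuff`. An integer ratio only needs zeros inserted between samples before filtering, while a fractional one also needs a decimation step.

```python
from fractions import Fraction


def resample_ratio(src_hz, dst_hz):
    """Return (L, M): upsample by L, then decimate by M, to go src -> dst."""
    ratio = Fraction(dst_hz, src_hz)
    return ratio.numerator, ratio.denominator


def zero_stuff(samples, factor):
    """Insert factor-1 zeros after every sample.

    This is only the first stage of integer upsampling; a low-pass
    (interpolation) filter would normally follow it.
    """
    out = []
    for s in samples:
        out.append(s)
        out.extend([0] * (factor - 1))
    return out


# Assumed 11025 Hz source: integer case, no decimation needed.
print(resample_ratio(11025, 44100))
# 8000 Hz source: fractional case, needs both stages.
print(resample_ratio(8000, 44100))
```

In the integer case M is 1, so the whole job is `zero_stuff` plus one filter. That is the sense in which 44100 is the easy target.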

amir@dragonscave.space

@x0 He knows what he's doing and properly tests every step of the way. Honestly I don't see any issues with AI as the code is clearly presented. As for his upsampling approach, he apparently tested a couple of them, but this one produced the best quality without the known speech jitters affecting IBMTTS with ViaVoice's 22 kHz mode.

x0@dragonscave.space

@amir Huh. IBM must have implemented that one incorrectly. It's required to use the Watson voice, though, and that one doesn't jitter.

bruce@allovertheplace.ca

@bmoore123 @amir Problem is, you'll never improve audio quality by upsampling.

amir@dragonscave.space

@x0 The ViaVoice jitter with 22 kHz is a known issue, and affects certain letters, like t, in certain situations. It's actually a speech pop rather than a jitter. Also, getting IBMTTS to use 22 kHz in ViaVoice is quite burdensome, whereas Eloquence 64-bit handles it without requiring extra steps.

x0@dragonscave.space

@amir Oh? What's burdensome about it besides the engine actually trying to predicate on supported versions? If you try to force it on something that doesn't support it, you get fast-forward speech.

amir@dragonscave.space

@Bruce @bmoore123 But 44 kHz does improve it, whereas the 8 kHz mode does reduce the audio quality.

bmoore123@tweesecake.social

@Bruce @amir No, that's true, but 8 k sounds like crap. I don't know if I would use it, but I would try it.

amir@dragonscave.space

@x0 No. The burdensome issue is finding the proper DLL from an older ViaVoice release which does support the 22 kHz mode properly. The newer DLL which is installed by ViaVoice doesn't support it. Also, IBMTTS does have its random voice resets to default with ViaVoice and nothing can be done about it.

x0@dragonscave.space

@amir Huh. The "always send current speech settings" option is supposed to fix that, but IBM DLLs typically have that off because the annotations cause pauses, I think? The setting is a fix for the rate bug.

amir@dragonscave.space

@x0 Yeah it's supposed to fix that. But here checking or unchecking it doesn't fix the voice parameter resets at all.

bmoore123@tweesecake.social

@amir Actually, it does sound a lot clearer. Better than I expected.

amir@dragonscave.space

@x0 Also IBMTTS has issues with my own add-on, Typing & Spelling Rate, whereas these have been fixed in Eloquence 64-bit. If you use my add-on and spell something via a higher rate for spelling, or type via a higher rate for typing, IBMTTS's speech rate won't be decreased for other non-spelling and non-typing tasks.

x0@dragonscave.space

@amir Huh. Before the bridge? That's odd, I knew it had issues with indexing but I thought without the bridge embedded commands worked just fine, after all MathML does it all the time.

amir@dragonscave.space

@bmoore123 Yeah I was also surprised. And the author wants to add another slider for more fine-tuning.

amir@dragonscave.space

@x0 Nope, it doesn't. I tested the latest IBMTTS preview release.

musicalman@dragonscave.space

@Bruce @bmoore123 @amir I think whether it is an improvement or not depends on how you look at it. Objectively speaking, the upsampler can't improve anything because it can't intelligently add missing content. But its aliasing artifacts (which are in the high frequencies) make it sound better to some people. Not knocking the people who like it btw, I like it as much as anyone else. I just don't want people thinking of it as some modern quality enhancement voodoo, because it really isn't. It's a decades-old artifact that finally got implemented in an Eloquence add-on for people who like it.
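
The high-frequency artifact described above can be shown with a small sketch. This is an illustration under stated assumptions, not the add-on's algorithm: it assumes an 11025 Hz source rate, a 1000 Hz test tone, and unfiltered zero-stuffing by 4, and `tone_level` is a hypothetical helper. Upsampling without a low-pass filter leaves a mirror copy of the tone near the old sample rate (DSP texts usually call these copies "images"), while no new content appears anywhere else.

```python
import cmath
import math

SRC_HZ = 11025   # assumed source rate, not stated in the thread
DST_HZ = 44100
FACTOR = DST_HZ // SRC_HZ  # 4
N_SRC = 441      # chosen so every frequency below lands on an exact DFT bin


def tone_level(samples, freq_hz, rate_hz):
    """Magnitude of the DFT of `samples` evaluated at a single frequency."""
    acc = 0j
    for n, x in enumerate(samples):
        acc += x * cmath.exp(-2j * math.pi * freq_hz * n / rate_hz)
    return abs(acc)


# 1000 Hz sine at the source rate.
src = [math.sin(2 * math.pi * 1000 * n / SRC_HZ) for n in range(N_SRC)]

# Unfiltered integer upsampling: keep each sample, insert FACTOR-1 zeros.
up = []
for s in src:
    up.append(s)
    up.extend([0.0] * (FACTOR - 1))

base = tone_level(up, 1000, DST_HZ)            # the original tone
image = tone_level(up, SRC_HZ - 1000, DST_HZ)  # mirror copy at 10025 Hz
empty = tone_level(up, 5000, DST_HZ)           # a frequency with no content

print(base, image, empty)
```

The image at 10025 Hz comes out as strong as the tone itself, which is the extra "brightness" some listeners like. A sinc-style resampler would filter it out and leave only what was in the 11025 Hz signal to begin with.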

amir@dragonscave.space

@musicalman @Bruce @bmoore123 Right. Of course, no one was talking about modern quality enhancements. It improves speech/audio quality like the 22 kHz mode in IBMTTS for ViaVoice, or even better than that I'd say.

dennislong82@tweesecake.social

@amir This has the hiss that Eloquence on the iPhone has. It is particularly noticeable on words with s in them. I did hear some popping, but the hiss is very noticeable.

amir@dragonscave.space

@Dennislong82 But I don't notice the hiss with Eloquence 64-bit at all, even with headphones. And yes, I do have the hiss on the iPhone. Perhaps it is speaker- or CPU-specific.
