dcre parent
Great article. I was of course surprised to learn that it's possible to learn to understand the super-fast TTS, since videos and podcasts start to get very tough to follow around 2.5x and higher. I've been wondering: surely better algorithms for generating high-speed speech are possible, especially as we have more and more compute around to throw at it. It's not easy to search for, since "speed" for most tools is about speed of generation rather than wpm. As normal-speed neural net TTS models get incredibly good, I am hoping to see more attention paid to the high-speed use case.
I've added a section about TTS voices to the post, see https://neurrone.com/posts/software-development-at-800-wpm/#...
Yeah, the options for this are quite limited. The only ones I know of are Espeak (open source) but doesn't sound as good, and Eloquence, which is an abandoned product.
The use case for super high speed TTS are pretty niche though.