Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

If you don’t mind me asking what do you use TTS?


Yes, I want to know too


It's Azure's TTS API — I'm using four of their voices.


Why not use Siri / the native TTS solution on iOS?


The native TTS is not great. It doesn't sound like Siri — it's much robotic.


It can sound significantly better but there’s a couple hoops you have to jump through - and even then it’s decent, but not the same as Siri.

You need the user to download ‘enhanced’ or ‘premium’ voices in the settings app. (Settings -> Accessibility-> Spoken Content -> Voices -> [Language of choice] -> [Voice of choice] -> Enhanced or Premium)

In the app you have to search for the enhanced or premium voices when doing TTS.

Heres an Objective C example, I’m sure there’s an easier way to write it in Swift. https://github.com/osmandapp/OsmAnd-iOS/pull/1156/commits/0b...

I’m not sure if you’ll find this acceptable from a UX point of view but there’s an option to play with if you’d like.


Yeah, I use a premium voice but was still disappointed when we added the feature to my reader app. I decided to leave it in the app since we'd already built it at that point, but it's kind of a bummer since obviously they could use Siri-level TTS if they wanted to.


Did you give any thought to ElevenLabs?


Yes, their quality is great but the cost is astronomical — I pay about $8 in Azure TTS bills alone for TTS-ing a 500-page book (what you can scan per month with a $10 subscription), whereas Eleven Labs would be about $100 for the same length. I found Azure to be the best bang-for-the-buck, although I'm on the lookout for more affordable high-quality TTS, which would also let me drop the price point of the app.


did you try the openAI pricing? how does it look?


Just took a look, their lower-quality model is almost exactly the same price as Azure TTS, and the quality is similar. Thanks for the pointer.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: