Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Really sorry about that, longer running audio (~10m+) is something I'm working on along with handling multiple speakers.

I've been focused on getting functional parity across all OS's since the Android release. This is very close to being done and I just need to reach the milestone of it being available on all platforms before I move forward.

Hopefully you will take another look when the next update is out.





Are you piping the audio buffers straight to the transcriber as they come in? Or capturing the entire recording and then processing it at the end?

I'm capturing the audio first, which was fine for short audio but ended up killing the process when it got too big. The updated fix is out now.

I did a complete overhaul to the pipeline so that it splits and processes at the end, this seems to have sorted it. I'm thinking about doing each transcription segment as it's coming in (with a bit of a buffer / overlap to keep context) much like the live transcription does, but for now performance is ok. Something I'll keep in mind once I've crossed some other things off the list.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: