September 22, 2022

Whisper Speech Recognition from OpenAI

From openai.com:

We’ve trained and are open-sourcing a neural net called Whisper that approaches human level robustness and accuracy on English speech recognition.
[…]
Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web.

Looks amazing. The fact that it has multilingual data makes it specially interesting — at least for those of us that speak with an accent.

Applications for automatic speech recognition (ASR) go way beyond than dictation. But I think the UX of Voice/Keyboard/Pen input still lacks. There’s no mouse pointer” equivalent — yet?.


snippets


Previous post
Framework Laptop Chromebook Edition frame.work: The Chromebook Edition is available for pre-order in the US and Canada today starting at $999 USD, with first shipments starting in
Next post
Changes to reMarkable Connect From remarkable.com: Our new approach is simple: Everything that happens on the paper tablet, comes with the paper tablet. Integrating with Google