Overview of Transcribe
Cohere has introduced Transcribe, its first open-source automatic speech recognition model. This innovative tool is designed for tasks such as note-taking and speech analysis. With just 2 billion parameters, it is lightweight enough for consumer-grade GPUs, making it accessible for self-hosting. Transcribe supports 14 languages, including major ones like English, Spanish, and Chinese, catering to a diverse user base.
Key Features and Performance
- Transcribe outperforms competitors like Zoom Scribe and IBM Granite on the Hugging Face Open ASR leaderboard, achieving an impressive average word error rate (WER) of 5.42.
- In evaluations by human testers, Transcribe had a 61% win rate over other models based on accuracy and usability.
- The model can process 525 minutes of audio in just one minute, showcasing its efficiency.
- Future plans include integrating Transcribe into Cohere’s enterprise platform, North, and making it available via a free API and Model Vault.
Significance in the Market
The rise of speech recognition technology highlights a growing demand for effective note-taking and dictation tools. As applications like Granola and Wispr Flow gain traction, Transcribe positions itself as a strong contender in this competitive space. Cohere’s projected annual revenue of $240 million signals its robust growth, and with potential plans for a public offering, the company is poised for significant impact in the tech industry.











