At Saicom, we know that understanding whatโs really happening in your contact centre is critical to delivering great customer experiences. Whether youโre using automated quality assurance, AI-driven insights, or analytics tools, everything starts with one essential ingredient โ accurate transcription.
The challenge? Real-world call recordings are rarely perfect. Background noise, muffled microphones, and mobile carrier compression can all distort audio long before it reaches your contact centre. This is where Word Error Rate (WER) โ the industryโs standard measure of transcription accuracy โ comes in.
What is Word Error Rate?
WER calculates how many words in a transcription are incorrect. For example:
Customer says: The cat sat on the mat
Transcript says: The rat sat on the mat
Here, only one word is wrong, giving a WER of 16.7% (1 error in 6 words).
WER counts:
- Substitutions (wrong words)
- Deletions (missing words)
- Additions (extra words not spoken)
Itโs not perfect, but itโs an excellent way to compare different transcription models.
Why Traditional Speech Recognition Struggles
For years, the big names โ Microsoft Azure (formerly Dragon Speech) and Google Cloud โ have dominated the market. While they perform well in studio-quality audio, telephone audio is far trickier.
Even Googleโs optimised telephone speech model still averages 14.29% WER, meaning about 14% of a transcript contains errors. Thatโs good โ but in the fast-moving world of AI, โgoodโ can be beaten.
Saicomโs Leap Forward with QContact
Through our partnership with QContact, we now use cutting-edge NVIDIA speech technology (publicly released just this week) to achieve a WER of only 6.05% โ thatโs:
- 58% more accurate than Googleโs telephone model
- 15% more accurate than the latest OpenAI Whisper models
Weโve also expanded automatic transcription to cover 20+ European languages, delivering world-class results like:
- Spanish: 3.72% WER
- German: 4.90% WER
- French: 5.38% WER
- Portuguese: 5.95% WER
According to the Open ASR Leaderboard, this is now the second most accurate and fastest multilingual speech recognition model in the world.
Privacy and Compliance You Can Trust
When choosing a CCaaS provider, itโs worth asking:
- Where is my audio processed?
- Is my data being used to train AI models without consent?
Some providers send recordings to countries without GDPR or POPIA equivalence, or allow training on your data in exchange for discounts from transcription vendors. This could expose you to compliance and privacy risks.
At Saicom, we guarantee:
- Your call recordings stay within South Africa for processing
- Your data will never be used for training without consent
- Call transcription is included free for all customers
And because QContact isnโt tied to a single AI model, we can quickly adopt the latest and best speech recognition technology โ without you lifting a finger.