Transcription Accuracy Benchmark

Accurate, low-cost AI transcription

Salad Transcription Service has the lowest price in the market. Our multi-model approach, built on open-source Whisper, also delivers one of the industry's highest accuracy rates (91.13% average) at up to 90% less cost than other APIs.

We also raised the bar on what accuracy means for the industry in this benchmark, going beyond just words to include punctuation, sentence structure, capitalization, etc. The result? Highly accurate, formatted transcripts at the lowest cost in the industry.

Docu-series accuracy: 94.24%
Phone call accuracy: 94.49%
Podcast accuracy: 92.24%
Tutorial accuracy: 95.59%

Need over 500,000 hours of transcription per year?

Book a 30-minute call with our team of transcription experts.

Transcription Accuracy Benchmark Results

High Accuracy.
Up to 90% less cost.

Salad Transcription Service offers high accuracy across media types at 20%-90% less cost than other transcription APIs in the industry. While other tools just transcribe the words, Salad provides human-readable transcripts with punctuation, capitalization, and sentence and paragraph structure.

Media          Salad     Assembly AI   AWS Transcribe   Google STT
(unlabeled)    90.02%    91.37%        87.48%           85.49%
(unlabeled)    91.56%    92.84%        91.85%           88.95%
(unlabeled)    94.24%    94.31%        92.25%           91.25%

Media          Salad     Assembly AI   AWS Transcribe   Google STT
(unlabeled)    91.28%    92.31%        86.34%           84.26%
(unlabeled)    93.80%    94.41%        91.05%           90.83%
(unlabeled)    91.47%    95.70%        89.36%           89.75%
(unlabeled)    94.49%    95.46%        91.62%           89.72%
(unlabeled)    93.42%    92.70%        87.81%           87.98%

Media type     Salad     Assembly AI   AWS Transcribe   Google STT
(unlabeled)    94.43%    91.49%        87.42%           86.93%
(unlabeled)    95.59%    97.45%        96.18%           96.35%
(unlabeled)    87.15%    89.65%        86.15%           85.22%
(unlabeled)    91.86%    89.70%        84.46%           85.12%
(unlabeled)    91.69%    88.48%        86.43%           84.68%

Media type     Salad     Assembly AI   AWS Transcribe   Google STT
(unlabeled)    92.69%    93.98%        91.56%           90.02%
(unlabeled)    92.24%    90.06%        86.98%           86.60%
(unlabeled)    84.95%    86.07%        82.03%           77.58%
(unlabeled)    89.97%    88.65%        87.26%           80.93%

Highly accurate, human-readable transcripts

Review some of the transcripts from our benchmark below for different media types and see how Salad Transcription Service performs.

Podcast transcription

Salad Transcription Service offers the best cost-performance for transcribing podcasts. Here, we transcribed a podcast episode from Serial, achieving 92.24% accuracy, the highest among all APIs in the benchmark.


Transcription for Podcasts - Benchmark

Tutorial transcription

Transcribing learning materials and tutorials at an affordable cost is key for businesses and e-learning providers. Here, we transcribed a tutorial on how to use Stripe with 95.49% accuracy, the highest among all APIs. Most of the errors were in capitalization; the transcription of the words themselves is highly accurate.

Phone call transcription

Transcribing phone calls is crucial not only to provide a human-readable transcript but also to support post-call analysis for business intelligence. Here, the Salad API scored 94.49% accuracy transcribing the audio of an earnings call.


Process for Transcription Benchmark

Key inputs to this benchmark were taken from Assembly AI's benchmark. The benchmark process is detailed below.

Other transcription APIs

First, we transcribed the files in our dataset automatically through the specified APIs (AssemblyAI, Google, and AWS).

Human transcription

Second, human transcriptionists transcribed the files in our dataset to approximately 100% accuracy.

Salad Transcription

Finally, we compared the Salad API's transcription with our human transcription to calculate Word Error Rate (WER) and Word Accuracy Rate (WAR).
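
For readers who want to reproduce this kind of comparison, here is a minimal sketch of how WER and WAR are commonly computed. It is not Salad's benchmark code. It assumes the standard definitions: WER is the word-level edit distance (substitutions, deletions, and insertions) divided by the number of words in the human reference, and WAR is 1 minus WER. The function names and the keep_formatting flag are hypothetical. The flag is one assumed way to make punctuation and capitalization errors count, since the page does not say how formatting was scored.

```python
import re


def tokenize(text: str, keep_formatting: bool = True) -> list[str]:
    """Split a transcript into whitespace-separated word tokens.

    With keep_formatting=True, case and punctuation stay attached to each
    word, so "today." and "today" are different tokens (assumed scoring rule).
    With keep_formatting=False, text is lowercased and punctuation is removed,
    which is the more common WER convention.
    """
    if not keep_formatting:
        text = re.sub(r"[^\w\s']", "", text.lower())
    return text.split()


def word_error_rate(reference: str, hypothesis: str,
                    keep_formatting: bool = True) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = tokenize(reference, keep_formatting)
    hyp = tokenize(hypothesis, keep_formatting)
    if not ref:
        raise ValueError("reference transcript is empty")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion: reference word is missing
                curr[j - 1] + 1,     # insertion: extra hypothesis word
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


def word_accuracy_rate(reference: str, hypothesis: str,
                       keep_formatting: bool = True) -> float:
    """WAR = 1 - WER. It can go below zero if there are many insertions."""
    return 1.0 - word_error_rate(reference, hypothesis, keep_formatting)


if __name__ == "__main__":
    human = "Thanks for joining the call today."
    machine = "thanks for joining the call today"
    print(f"formatted WAR:  {word_accuracy_rate(human, machine):.2%}")
    print(f"normalized WAR: {word_accuracy_rate(human, machine, False):.2%}")
```

In this example the two transcripts contain the same words, so the normalized score is perfect, while the formatting-sensitive score drops because the capital letter and the final period each count as a word error. This is why a benchmark that scores formatting tends to report lower numbers than a words-only benchmark.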

Frequently Asked Questions

Lowest pricing in the market. Highly accurate transcripts.

How do I switch to Salad from another API provider?

Switching to Salad is designed to be seamless and cost-effective. The investment includes a one-time setup cost, after which customers can enjoy substantial savings and a high return on investment (~533%) from the very first year.

How does Salad have the lowest prices for transcription in the market?

Unlike other API providers that rely on expensive, high-end GPUs from hyperscalers, Salad's transcription service runs on our proprietary distributed cloud, powered by thousands of consumer GPUs at the lowest price in the market. This low-cost compute model allows us to offer transcription at significantly lower prices without compromising on quality. Our tiered pricing model is designed for high-volume needs, with clear cost advantages as usage scales up.

How does Salad maintain high accuracy in its transcriptions?

Salad combines several open-source AI models, including audio enhancement, automatic speech recognition, and large language models, and enhances them with a dedicated knowledge base that accounts for custom vocabulary and contextual nuances.

Can you handle complex transcription needs like diarization and accents?

Yes, Salad's service includes diarization to differentiate between speakers and accent modification in the pre-processing stage to handle diverse accents effectively, ensuring high-quality transcription regardless of complexity.

How does security work on Salad's service?

Every day, hundreds of businesses trust Salad infrastructure with their data, thanks to our multi-step security framework that safeguards data at every step. Our SOC 2 compliant cloud infrastructure uses end-to-end encryption of your data, isolated processing environments, data sanitization, and access controls to protect the confidentiality and integrity of customers' files.