Podcast transcription
Salad Transcription Service offers the best cost-performance for transcribing podcasts. Here, we transcribed a podcast episode from Serial and achieved 92.24% accuracy, the highest among all APIs in the benchmark.
Salad Transcription Service has the lowest price in the market. Our multi-model approach, based on open-source Whisper, also delivers one of the industry's highest accuracy rates (91.13% on average) at up to 90% lower cost than other APIs.
We also raised the bar on what accuracy means for the industry in this benchmark, going beyond just words to include punctuation, sentence structure, capitalization, etc. The result? Highly accurate, formatted transcripts at the lowest cost in the industry.
Need over 500,000 hours of transcription per year?
Salad Transcription Service offers high accuracy for various media types at 20% to 90% lower cost than other transcription APIs in the industry. While other tools just transcribe the words, Salad provides human-readable transcripts with punctuation, capitalization, and sentence and paragraph structure.
Test some of the transcripts from our benchmarks below for different media types and see how Salad Transcription Service performs.
Transcribing learning materials and tutorials at an affordable cost is key to the success of businesses and e-learning. Here, we transcribed a tutorial on how to use Stripe with 95.49% accuracy, the highest among all APIs. Most of the errors were in capitalization; the transcription of the words themselves is highly accurate.
Transcribing phone calls is crucial not only to provide a human-readable transcript but also to perform post-call analysis for business intelligence. Here, Salad API scored 94.49% accuracy transcribing audio from an earnings call.
Key inputs to this benchmark were taken from AssemblyAI's benchmark. The benchmark process is detailed below.
First, we transcribed the files in our dataset automatically through the specified APIs (AssemblyAI, Google, and AWS).
Second, we had human transcriptionists transcribe the files in our dataset to approximately 100% accuracy.
Finally, we compared Salad API's transcription with our human transcription to calculate Word Error Rate (WER) and Word Accuracy Rate (WAR).
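For readers who want to reproduce this kind of comparison, here is a minimal Python sketch of how WER and WAR are commonly computed. It is an illustration under stated assumptions, not the benchmark's actual scoring code: the function names are hypothetical, WAR is taken to be 1 minus WER, and words are split on whitespace with no normalization, so differences in capitalization and punctuation count as errors.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: tokens are whitespace-separated and compared exactly,
    so "Fox" vs "fox" or "jumps." vs "jumps" each count as one error.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


def word_accuracy_rate(reference: str, hypothesis: str) -> float:
    """Assumed definition: WAR = 1 - WER."""
    return 1.0 - word_error_rate(reference, hypothesis)


if __name__ == "__main__":
    human = "The quick brown fox jumps."
    machine = "the quick brown fox jumps"
    print(f"WER: {word_error_rate(human, machine):.2%}")
    print(f"WAR: {word_accuracy_rate(human, machine):.2%}")
```

In this toy example every word is recognized correctly, yet the missing capital letter and final period count as two errors out of five words. Under the same assumed definition, an accuracy of 92.24% corresponds to a WER of 7.76%.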
Frequently Asked Questions
Switching to Salad is designed to be seamless and cost-effective. The investment includes a one-time setup cost, after which customers can enjoy substantial savings and a high return on investment (~533%) from the very first year.
Unlike other API providers that rely on expensive, high-end GPUs from hyperscalers, Salad's transcription service runs on our proprietary distributed cloud, powered by thousands of consumer GPUs at the lowest price in the market. This low-cost compute model allows us to offer transcription services at significantly lower prices without compromising on quality. Our tiered pricing model is designed for high-volume needs, providing clear cost advantages as usage scales up.
Salad employs a combination of open-source AI models, including audio enhancement, automatic speech recognition, and large language models, which are enhanced by a dedicated knowledge base that accounts for custom vocabulary and contextual nuances.
Yes, Salad's service includes diarization to differentiate between speakers and accent modification in the pre-processing stage to handle diverse accents effectively, ensuring high-quality transcription regardless of complexity.
Every day, hundreds of businesses trust Salad infrastructure with their data, thanks to our multi-step security framework, which safeguards data at every step. Our SOC 2-compliant cloud infrastructure uses end-to-end encryption of your data, isolated processing environments, data sanitization, and access controls to protect the confidentiality and integrity of customers' files.