Arm Community
Site
Search
User
Site
Search
User
Groups
Education Hub
Distinguished Ambassadors
Open Source Software and Platforms
Research Collaboration and Enablement
Forums
AI and ML forum
Architectures and Processors forum
Arm Development Platforms forum
Arm Development Studio forum
Arm Virtual Hardware forum
Automotive forum
Compilers and Libraries forum
Graphics, Gaming, and VR forum
High Performance Computing (HPC) forum
Infrastructure Solutions forum
Internet of Things (IoT) forum
Keil forum
Morello forum
Operating Systems forum
SoC Design and Simulation forum
SystemReady Forum
Blogs
AI and ML blog
Announcements
Architectures and Processors blog
Automotive blog
Graphics, Gaming, and VR blog
High Performance Computing (HPC) blog
Infrastructure Solutions blog
Internet of Things (IoT) blog
Operating Systems blog
SoC Design and Simulation blog
Tools, Software and IDEs blog
Support
Arm Support Services
Documentation
Downloads
Training
Arm Approved program
Arm Design Reviews
Community Help
More
Cancel
Arm Community blogs
Internet of Things (IoT) blog
Quick and Easy Speech Transcription with Cloud-Based ML
Blogs
Mentions
Sub-Groups
Tags
Jump...
Cancel
More blogs in Arm Community blogs
AI and ML blog
Announcements
Architectures and Processors blog
Automotive blog
Embedded blog
Graphics, Gaming, and VR blog
High Performance Computing (HPC) blog
Infrastructure Solutions blog
Internet of Things (IoT) blog
Operating Systems blog
SoC Design and Simulation blog
Tools, Software and IDEs blog
Tags
Artificial Intelligence (AI)
Machine Learning (ML)
Actions
RSS
More
Cancel
Related blog posts
Related forum threads
Quick and Easy Speech Transcription with Cloud-Based ML
Hellen Norman
September 26, 2018
2 minute read time.
If you listen to video presentations or podcasts, you’ll probably agree that it’s a diverting pleasure, but it’s not a particularly efficient way to extract information. For speech to be clear enough to be easily understood, broadcasters need to speak at around 150 words per minute. This means that information transferral is relatively slow – especially when you consider that the average adult can read around 250-300 words per minute, and scan large quantities of text in even less time.
That’s why adding transcriptions to your audio or video content can be a game changer: it lets your audience parse your content quickly and easily to identify areas of interest, making it more efficient for them to digest. It also unlocks the potential of your content by making it available to different audiences. If your language isn’t my language, it’s pretty easy for me to run your whole text through a translation app. Translating the audio would take significantly longer (and I might have lost interest by then…).
Once upon a time, the only way to get a reliable transcription was to sit someone down with a recording and ask them to type it up but – thank goodness – machine learning (ML) has revolutionized that painful process. Nowadays, speech transcription tasks can be performed in moments – and we’ve put together a tutorial to show you how simple and swift it can be, even if it’s something you’ve never attempted before.
Getting Started with Speech Transcription
The tutorial shows you how to set up speech transcription deployed as a service, running on cloud-hosted Arm servers. Essentially, you just record an audio file and send it to the server. The server uses an ML-based speech recognition service to transcribe the audio, and sends the text back to your client machine.
No knowledge of machine learning is required, and the step-by-step guide walks you through each stage of the technical deployment. (The tutorial uses Ubuntu 16.04 so some familiarity with the command-line, Linux package managers, and SSH is assumed.) You’ll need a couple of hours to set up the installations and builds, but once you’ve done that, the service will be up and running very quickly – and all on a computing cost of around $3. And with 96 cores available on a Cavium Thunder X server such as the one used here, 24 hours of English or Mandarin speech can be transcribed with state-of-the-art accuracy for just $0.50!
Are you ready to get started?
[CTAToken URL = "https://developer.arm.com/technologies/machine-learning-on-arm/developer-material/how-to-guides/deploying-cloud-based-ml-for-speech-transcription" target="_blank" text="Read tutorial" class ="green"]
Internet of Things (IoT) blog
Software, Tools, and Ecosystem for ML Edge Devices
Reinhard Keil
Learn how Arm and our Partners enable developers and the IoT software ecosystem to deliver smart, energy efficient ML edge devices.
July 17, 2024
Uniting Arm Enthusiasts: Recap of the hands-on Arm Developer Community Meetup at Texas A&M University
Fidel Makatia
Arm enthusiasts gathered in Texas A&M for a hands-on GPIO workshop using STM Nucleo-F410R3 boards from the Arm Developer Program, fostering knowledge sharing and community bonding.
June 13, 2024
Introducing Cortex-M52: Bringing Arm’s AI-optimized Helium architecture to the smallest IoT devices
Tim Menasveta
Today, Arm announces the Cortex-M52, delivering dramatic increases in DSP and ML performance.
November 22, 2023