wav2vec2 for Automatic Speech Recognition In Plain English

by Picture in the Noise (@pictureinthenoise), March 13th, 2024 (7 min read)

Too Long; Didn't Read

wav2vec2 is a leading machine-learning model for building automatic speech recognition (ASR) systems. It has three main components: a Feature Encoder, a Quantization Module, and a Transformer. The model is first pretrained on audio-only data to learn basic speech units. It is then fine-tuned on labeled data, where those speech units are mapped to text.
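
To make the Feature Encoder a little more concrete, here is a minimal sketch of how much it compresses raw audio before anything reaches the Quantization Module or the Transformer. The kernel widths and strides below are assumptions taken from the commonly published wav2vec 2.0 base configuration, not from this summary, and the function name `encoder_output_frames` is purely illustrative.

```python
# Assumed values: kernel widths and strides of the seven convolutional layers
# in the wav2vec 2.0 base Feature Encoder. They are not stated in this article.
CONV_KERNELS = (10, 3, 3, 3, 3, 2, 2)
CONV_STRIDES = (5, 2, 2, 2, 2, 2, 2)


def encoder_output_frames(num_samples: int) -> int:
    """Return how many feature vectors the encoder emits for a raw waveform.

    Hypothetical helper for illustration only; it models each layer as an
    unpadded 1-D convolution and ignores everything except sequence length.
    """
    length = num_samples
    for kernel, stride in zip(CONV_KERNELS, CONV_STRIDES):
        if length < kernel:
            # The clip is too short for this layer to produce any output.
            return 0
        # Output length of an unpadded 1-D convolution.
        length = (length - kernel) // stride + 1
    return length


if __name__ == "__main__":
    one_second = 16_000  # samples in one second of audio at 16 kHz
    print(encoder_output_frames(one_second))  # prints 49
```

Under these assumptions, one second of 16 kHz audio becomes 49 feature vectors, each advancing roughly 20 ms (320 samples) from the previous one. These are the units the rest of the model works with during pretraining and fine-tuning.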