Collections including paper arxiv:2006.11477

UniAudio: An Audio Foundation Model Toward Universal Audio Generation

Paper • 2310.00704 • Published Oct 1, 2023 • 21
Structural Similarities Between Language Models and Neural Response Measurements

Paper • 2306.01930 • Published Jun 2, 2023 • 2
Streaming Transformer ASR with Blockwise Synchronous Beam Search

Paper • 2006.14941 • Published Jun 25, 2020 • 2
NU-GAN: High resolution neural upsampling with GAN

Paper • 2010.11362 • Published Oct 22, 2020 • 2

Automatic Speech Recognition Architectures

Robust Speech Recognition via Large-Scale Weak Supervision

Paper • 2212.04356 • Published Dec 6, 2022 • 25
Conformer: Convolution-augmented Transformer for Speech Recognition

Paper • 2005.08100 • Published May 16, 2020
wav2vec: Unsupervised Pre-training for Speech Recognition

Paper • 1904.05862 • Published Apr 11, 2019
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations

Paper • 2006.11477 • Published Jun 20, 2020 • 5

There are many more on arXiv if you search for CLAP

Large-scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation

Paper • 2211.06687 • Published Nov 12, 2022 • 3
EnCLAP: Combining Neural Audio Codec and Audio-Text Joint Embedding for Automated Audio Captioning

Paper • 2401.17690 • Published Jan 31, 2024 • 5
Amphion: An Open-Source Audio, Music and Speech Generation Toolkit

Paper • 2312.09911 • Published Dec 15, 2023 • 53
Audiobox: Unified Audio Generation with Natural Language Prompts

Paper • 2312.15821 • Published Dec 25, 2023 • 13

A collection for the first release of Wav2Vec 2.0, a speech encoder that learns powerful representations from unlabelled audio data.

facebook/wav2vec2-large-960h-lv60-self

Automatic Speech Recognition • Updated May 23, 2022 • 1.07M • 141
facebook/wav2vec2-large-960h

Automatic Speech Recognition • Updated Apr 5, 2022 • 64.3k • 28
facebook/wav2vec2-base-960h

Automatic Speech Recognition • Updated Nov 14, 2022 • 1.57M • 312
facebook/wav2vec2-base-100h

Automatic Speech Recognition • Updated May 27, 2022 • 1.67k • 6

audio recognition

wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations

Paper • 2006.11477 • Published Jun 20, 2020 • 5
