Collections

Collections including paper arxiv:2006.11477

Papers - Audio - Dataset - LibriVox

wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations

Paper • 2006.11477 • Published Jun 20, 2020 • 5

Papers - Audio - Dataset - Librispeech

wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations

Paper • 2006.11477 • Published Jun 20, 2020 • 5

Papers - Audio - Fine-tuning - Loss - CTC

wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations

Paper • 2006.11477 • Published Jun 20, 2020 • 5

Papers - Audio - Training - Activation - Gumbel Softmax

wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations

Paper • 2006.11477 • Published Jun 20, 2020 • 5

Papers - Audio - Training - Activation - Gelu

wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations

Paper • 2006.11477 • Published Jun 20, 2020 • 5

Papers - Audio - Training - Loss - CTC

wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations

Paper • 2006.11477 • Published Jun 20, 2020 • 5

Papers - Text - Encoders - Bert

Latent Positional Information is in the Self-Attention Variance of Transformer Language Models Without Positional Embeddings

Paper • 2305.13571 • Published May 23, 2023 • 2

BERTs are Generative In-Context Learners

Paper • 2406.04823 • Published Jun 7, 2024 • 1

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published 26 days ago • 121

wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations

Paper • 2006.11477 • Published Jun 20, 2020 • 5

Papers - Audio - Fine-tuning

WavLLM: Towards Robust and Adaptive Speech Large Language Model

Paper • 2404.00656 • Published Mar 31, 2024 • 10

Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization

Paper • 2404.09956 • Published Apr 15, 2024 • 11

Long-form music generation with latent diffusion

Paper • 2404.10301 • Published Apr 16, 2024 • 24

wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations

Paper • 2006.11477 • Published Jun 20, 2020 • 5

Papers - Audio - Speech Transcription

WhisperX: Time-Accurate Speech Transcription of Long-Form Audio

Paper • 2303.00747 • Published Mar 1, 2023 • 4

wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations

Paper • 2006.11477 • Published Jun 20, 2020 • 5

Papers - Audio - STT - ASR

WhisperX: Time-Accurate Speech Transcription of Long-Form Audio

Paper • 2303.00747 • Published Mar 1, 2023 • 4

Custom Data Augmentation for low resource ASR using Bark and Retrieval-Based Voice Conversion

Paper • 2311.14836 • Published Nov 24, 2023 • 2

SONAR: Sentence-Level Multimodal and Language-Agnostic Representations

Paper • 2308.11466 • Published Aug 22, 2023 • 1

W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training

Paper • 2108.06209 • Published Aug 7, 2021 • 1
