General - a eli02 Collection

eli02 's Collections

General

General

updated 4 days ago

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published Dec 19, 2024 • 48
SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator

Paper • 2412.12094 • Published Dec 16, 2024 • 10
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Paper • 2306.07691 • Published Jun 13, 2023 • 6
iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform

Paper • 2203.02395 • Published Mar 4, 2022
Scaling Laws for Floating Point Quantization Training

Paper • 2501.02423 • Published 23 days ago • 25
Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published 19 days ago • 51
MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 13 days ago • 268
Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published 18 days ago • 80
MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

Paper • 2501.06282 • Published 17 days ago • 40
An Empirical Study of Autoregressive Pre-training from Videos

Paper • 2501.05453 • Published 18 days ago • 37
The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 14 days ago • 88
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published 11 days ago • 65
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published 11 days ago • 35
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published 5 days ago • 69