Agentless: Demystifying LLM-based Software Engineering Agents Paper β’ 2407.01489 β’ Published Jul 1, 2024 β’ 54
KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model Paper β’ 2501.01028 β’ Published 11 days ago β’ 10
Proactive Conversational Agents with Inner Thoughts Paper β’ 2501.00383 β’ Published 13 days ago β’ 1
Agent Laboratory: Using LLM Agents as Research Assistants Paper β’ 2501.04227 β’ Published 5 days ago β’ 68
Deepseek V3 (All Versions) Collection Deepseek V3 - available in bf16, original, and GGUF formats, with support for 2, 3, 4, 5, 6 and 8-bit quantized versions. β’ 3 items β’ Updated about 5 hours ago β’ 21
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper β’ 2501.03262 β’ Published 9 days ago β’ 72
Cosmos Tokenizer Collection A suite of image and video tokenizers β’ 13 items β’ Updated 2 days ago β’ 36
view article Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 β’ 10 days ago β’ 29
GAIA release Collection Gather the items of the GAIA release β’ 4 items β’ Updated Nov 23, 2023 β’ 20
view article Article πΊπ¦ββ¬ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram β’ 10 days ago β’ 37
SwiftKV Models Collection SwiftKV reduces prefill compute by up to 50% by combining model rewiring and knowledge-preserving self-distillation. β’ 3 items β’ Updated Dec 5, 2024 β’ 3
view article Article Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 β’ 14 days ago β’ 22