view article Article Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 β’ 14 days ago β’ 22
view article Article πΊπ¦ββ¬ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram β’ 10 days ago β’ 37
view article Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 β’ 10 days ago β’ 29
view article Article Accelerating Language Model Inference with Mixture of Attentions By hba123 β’ 6 days ago β’ 24