Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper β’ 2305.18290 β’ Published May 29, 2023 β’ 51
Enhancing Human-Like Responses in Large Language Models Paper β’ 2501.05032 β’ Published 4 days ago β’ 35
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos Paper β’ 2501.04001 β’ Published 5 days ago β’ 36
view article Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 β’ 9 days ago β’ 29
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper β’ 2412.18925 β’ Published 18 days ago β’ 89
view article Article πΊπ¦ββ¬ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram β’ Dec 4, 2024 β’ 76
view article Article Towards a Fully Arabic Retrieval-Augmented Generation (RAG) Pipeline: By Omartificial-Intelligence-Space β’ Nov 30, 2024 β’ 6
view article Article To what extent are we responsible for our content and how to create safer Spaces? By davidberenstein1957 β’ Aug 30, 2024 β’ 3
view article Article Use Models from the Hugging Face Hub in LM Studio By yagilb β’ Nov 28, 2024 β’ 130
view article Article Letβs make a generation of amazing image generation models By burtenshaw β’ Nov 26, 2024 β’ 34
view article Article Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models By mikelabs β’ Nov 21, 2024 β’ 2
view article Article Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK By davidberenstein1957 β’ Nov 21, 2024 β’ 35
view article Article Releasing the largest multilingual open pretraining dataset By Pclanglais β’ Nov 13, 2024 β’ 98