Exploring Hard Negative Mining with NV-Retriever in Korean Financial Text By Albertmade • about 18 hours ago • 7
🐺🐦⬛ LLM Comparison/Test: Phi-4, Qwen2 VL 72B Instruct, Aya Expanse 32B in my updated MMLU-Pro CS benchmark By wolfram • 2 days ago • 2
TerjamaBench: A Cultural Benchmark for English-Darija Machine Translation By imomayiz • 3 days ago • 17
Beyond Image Preferences - Rich Human Feedback for Text-to-Image Generation By RapidataAI • 3 days ago • 13
Building Effective Agents with Anthropic’s Best Practices and smolagents ❤️ By Sri-Vigneshwar-DJ • 8 days ago • 4
Exploring Hard Negative Mining with NV-Retriever in Korean Financial Text By Albertmade • about 18 hours ago • 7
🐺🐦⬛ LLM Comparison/Test: Phi-4, Qwen2 VL 72B Instruct, Aya Expanse 32B in my updated MMLU-Pro CS benchmark By wolfram • 2 days ago • 2
TerjamaBench: A Cultural Benchmark for English-Darija Machine Translation By imomayiz • 3 days ago • 17
Beyond Image Preferences - Rich Human Feedback for Text-to-Image Generation By RapidataAI • 3 days ago • 13
Building Effective Agents with Anthropic’s Best Practices and smolagents ❤️ By Sri-Vigneshwar-DJ • 8 days ago • 4