Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning Paper • 2502.06060 • Published Feb 9 • 34
OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents Paper • 2306.16527 • Published Jun 21, 2023 • 47
theojolliffe/bart-large-cnn-pubmed1o3-pubmed2o3 Text2Text Generation • Updated May 27, 2022 • 128 • 1