[Ref, Fix] indentation error in answer key selection, longer explanation in demo, exclusion of broken dataset c608f7f Joschka Strueber commited on 15 days ago
[Add] add bbh and gpqa benchmarks again with correct answer_index selection 0a42e99 Joschka Strueber commited on 15 days ago