Models fine-tuned for multiple choice question answering (mc) and mathematical reasoning (gsm8k). https://arxiv.org/abs/2407.07890
Ricardo
ricdomolm
AI & ML interests
LLMs
Recent Activity
liked
a dataset
3 days ago
cat-searcher/minif2f-lean4
updated
a model
13 days ago
ricdomolm/ml4331-reward-model
updated
a model
13 days ago
ricdomolm/ml4331-reward-model2
Organizations
None yet
Collections
1
models
65
ricdomolm/ml4331-reward-model
Text Generation
•
Updated
•
249
ricdomolm/ml4331-reward-model2
Text Generation
•
Updated
•
4
ricdomolm/ml4331-dpo-model
Text Generation
•
Updated
•
201
ricdomolm/ml4331-instruction-model
Text Generation
•
Updated
•
355
ricdomolm/test-model
Updated
ricdomolm/SmolLM2-135M-SFT-Alpaca
Updated
ricdomolm/reward-model-exercise
Updated
ricdomolm/lawma-8b
Text Generation
•
Updated
•
2.02k
•
6
ricdomolm/ttt-mc-ziya2-13b-base
Updated
•
2
ricdomolm/ttt-mc-yi-6b
Updated
•
4
datasets
15
ricdomolm/caselawqa_leaderboard_results
Updated
•
1.07k
ricdomolm/caselawqa_leaderboard_requests
Viewer
•
Updated
•
29
•
1.02k
ricdomolm/lawma-instructions_gemma2_8k
Viewer
•
Updated
•
554k
•
60
ricdomolm/lawma-instructions_llama3_16k
Viewer
•
Updated
•
554k
•
34
ricdomolm/lawma-instructions_llama3_8k
Viewer
•
Updated
•
554k
•
49
ricdomolm/lawma-instructions
Viewer
•
Updated
•
554k
•
35
ricdomolm/lawma-tasks
Viewer
•
Updated
•
692k
•
685
•
2
ricdomolm/lawma-task-files
Updated
•
35
ricdomolm/caselawqa-8k
Viewer
•
Updated
•
16.1k
•
36
•
2
ricdomolm/lawma-all-tasks
Viewer
•
Updated
•
575k
•
60