Steven Goldfeather
treehugg3
AI & ML interests
None yet
Recent Activity
new activity
16 days ago
infly/INF-ORM-Llama3.1-70B: Basic question: How would you reproduce the training of this model?
new activity
20 days ago
Skywork/Skywork-Reward-Gemma-2-27B-v0.2: Model reproducibility
new activity
about 1 month ago
nvidia/Llama-3_1-Nemotron-51B-Instruct: What is the context size this model was trained on?
Organizations
None yet
treehugg3's activity
Basic question: How would you reproduce the training of this model?
2
#1 opened 18 days ago
by
treehugg3
Model reproducibility
#2 opened 20 days ago
by
treehugg3
What is the context size this model was trained on?
2
#23 opened about 1 month ago
by
treehugg3
Model will need to be requantized, rope issues for long context
3
#2 opened about 1 month ago
by
treehugg3
Model will need to be re-quantized, rope issues
#4 opened about 1 month ago
by
treehugg3
Poor long-context performance?
6
#2 opened about 1 month ago
by
treehugg3
Compatible small models for speculative decoding?
#9 opened about 1 month ago
by
treehugg3
Chat template
3
#3 opened 6 months ago
by
sydneyfong