opiyu

owao

AI & ML interests

None yet

Recent Activity

new activity 1 day ago
internlm/OREAL-32B:[42]
liked a model 1 day ago
hexgrad/Kokoro-82M
liked a model 1 day ago
arcee-ai/Arcee-Maestro-7B-Preview
View all activity

Organizations

None yet

owao's activity

New activity in internlm/OREAL-32B 1 day ago

[42]

#4 opened 1 day ago by
owao
reacted to davanstrien's post with ๐Ÿ‘ 5 days ago
view post
Post
2437
Hacked together a way to log trl GRPO training completions to a ๐Ÿค— dataset repo. This allows you to:

- Track rewards from multiple reward functions
- Treat the completion and rewards from training as a "proper" dataset and do EDA
- Share results for open science

The implementation is super hacky, but I'm curious if people would find this useful.

To push completions to the Hub, you just need two extra parameters:

log_completions=True
log_completions_hub_repo='your-username/repo-name'

Example dataset: davanstrien/test-logs
Colab: https://colab.research.google.com/drive/1wzBFPVthRYYTp-mEYlznLg_e_0Za1M3g

New activity in perplexity-ai/r1-1776 6 days ago
New activity in smirki/UIGEN-T1-Qwen-7b 6 days ago

Gguf please.

2
#1 opened 9 days ago by
AlgorithmicKing