Datasets and models used for benchmarking Constitutional Continual Alignment of LLMs
Shahradmz
AI & ML interests
LLMs, Graph Learning, Temporal Graph Learning, RL, Continual RL, Optimization
Recent Activity
updated a model about 16 hours ago: Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_REWARD_0
updated a model about 16 hours ago: Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_REWARD_1
published a model about 16 hours ago: Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_REWARD_1
Collections: 1
Papers: 2
Models: 104

Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_REWARD_0
Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_REWARD_1
Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_PPO_0
Shahradmz/Qwen2-0.5B-Reward-LoRA
Shahradmz/llama8b_SEND_1B-alpaca-5 • Text Generation • 9
Shahradmz/llama8b_SEND_1B-legalbench-5 • Text Generation • 33
Shahradmz/llama8b_SEND_1B-codesearchnet-5 • Text Generation • 12
Shahradmz/llama8b_SEND_1B-helm-5 • Text Generation • 8
Shahradmz/llama8b_SEND_1B-codesearchnet-4 • Text Generation • 9
Shahradmz/llama8b_SEND_1B-alpaca-4 • Text Generation • 6
Datasets: 8
Shahradmz/cppo_continual_dataset_rl_others • 75.7k • 34
Shahradmz/cppo_continual_dataset_rl_relationships • 93.9k • 37
Shahradmz/cppo_continual_dataset_reward_others • 78.5k • 36
Shahradmz/cppo_continual_dataset_reward_relationships • 97.4k • 36
Shahradmz/ca_constitution_1 • 33.7k • 72
Shahradmz/ca_constitution_2 • 35.8k • 82
Shahradmz/assertiveness-corpus • 6k • 87
Shahradmz/2MSampled_OpenWebText • 2