Hugging Face
FAR AI
non-profit
https://far.ai/
FARAIResearch
AlignmentResearch
Followers: 15
AI & ML interests
Frontier alignment research to ensure the safe development and deployment of advanced AI systems.
Recent Activity
skar0 updated a dataset 3 days ago: AlignmentResearch/Llama3Jailbreaks
tomtseng updated a model 5 days ago: AlignmentResearch/robust_llm_clf_imdb_pythia-410m_s-2_adv_tr_gcg_t-2
tomtseng updated a model 5 days ago: AlignmentResearch/robust_llm_clf_imdb_pythia-410m_s-1_adv_tr_gcg_t-1
Team members: 10
spaces: 1
🔎 Tuned Lens • Running • 24
models: 3609
AlignmentResearch/robust_llm_oskar-035b_clf_jailbreaks_pythia-1b_s-2 • Updated 7 days ago • 3
AlignmentResearch/robust_llm_oskar-035b_clf_jailbreaks_pythia-1b_s-1 • Updated 7 days ago • 3
AlignmentResearch/robust_llm_oskar-035b_clf_jailbreaks_pythia-410m_s-2 • Updated 7 days ago • 3
AlignmentResearch/robust_llm_oskar-035a_clf_jailbreaks_pythia-1b_s-0 • Updated 7 days ago • 23
AlignmentResearch/robust_llm_oskar-035b_clf_jailbreaks_pythia-160m_s-2 • Updated 7 days ago • 4
AlignmentResearch/robust_llm_oskar-035b_clf_jailbreaks_pythia-70m_s-2 • Updated 7 days ago • 3
AlignmentResearch/robust_llm_oskar-035b_clf_jailbreaks_pythia-410m_s-1 • Updated 7 days ago • 3
AlignmentResearch/robust_llm_oskar-035b_clf_jailbreaks_pythia-160m_s-1 • Updated 7 days ago • 3
AlignmentResearch/robust_llm_oskar-035b_clf_jailbreaks_pythia-31m_s-2 • Updated 7 days ago • 3
AlignmentResearch/robust_llm_oskar-035b_clf_jailbreaks_pythia-14m_s-2 • Updated 7 days ago • 3
datasets: 15
AlignmentResearch/Llama3Jailbreaks • Updated 3 days ago • 16k • 319
AlignmentResearch/WordLength • Updated Aug 7, 2024 • 100k • 1.19k
AlignmentResearch/Harmless • Updated Jul 29, 2024 • 86.6k • 1.09k
AlignmentResearch/Helpful • Updated Jul 29, 2024 • 88.1k • 1.25k
AlignmentResearch/StrongREJECT • Updated Jul 29, 2024 • 313 • 498
AlignmentResearch/PasswordMatch • Updated Jul 29, 2024 • 100k • 1.8k
AlignmentResearch/IMDB • Updated Jul 29, 2024 • 97.5k • 1.62k
AlignmentResearch/EnronSpam • Updated Jul 29, 2024 • 62.3k • 1.08k
AlignmentResearch/PasswordMatch-test • Updated Jul 26, 2024 • 50k • 38
AlignmentResearch/WordLength-test • Updated Jul 26, 2024 • 100k • 44