FAR AI
non-profit
https://far.ai/
FARAIResearch
AlignmentResearch
AI & ML interests
Frontier alignment research to ensure the safe development and deployment of advanced AI systems.
Recent Activity
taufeeque updated a model about 12 hours ago: AlignmentResearch/Llama-3.3-70B-Instruct-math-lora-reference
sam-far published a model about 16 hours ago: AlignmentResearch/hr_hand_crafted_Llama-3.3-70B_easy_merged_v1
sam-far published a model about 16 hours ago: AlignmentResearch/hr_hand_crafted_Llama-3.3-70B_easy_v1
Team members: 18
AlignmentResearch's datasets (79), sorted by most recently updated
AlignmentResearch/AugmentedJailbreaks • Viewer • Updated Mar 13, 2025 • 20.8k • 78
AlignmentResearch/JailbreakCompletions • Viewer • Updated Mar 13, 2025 • 46.3k • 37
AlignmentResearch/WildChatFiltered • Viewer • Updated Mar 12, 2025 • 24k • 10
AlignmentResearch/JailbreakInputs • Viewer • Updated Mar 11, 2025 • 102k • 43 • 1
AlignmentResearch/Llama3Jailbreaks • Viewer • Updated Feb 12, 2025 • 78.5k • 272
AlignmentResearch/XSTest • Viewer • Updated Jan 30, 2025 • 900 • 26
AlignmentResearch/WordLength • Viewer • Updated Aug 7, 2024 • 100k • 126
AlignmentResearch/Harmless • Viewer • Updated Jul 29, 2024 • 86.6k • 275
AlignmentResearch/Helpful • Viewer • Updated Jul 29, 2024 • 88.1k • 244
AlignmentResearch/PasswordMatch • Viewer • Updated Jul 29, 2024 • 100k • 317
AlignmentResearch/IMDB • Viewer • Updated Jul 29, 2024 • 97.5k • 250 • 1
AlignmentResearch/EnronSpam • Viewer • Updated Jul 29, 2024 • 62.3k • 152
AlignmentResearch/PasswordMatch-test • Viewer • Updated Jul 26, 2024 • 50k • 28
AlignmentResearch/WordLength-test • Viewer • Updated Jul 26, 2024 • 100k • 31
AlignmentResearch/StrongREJECT-test • Viewer • Updated Jul 26, 2024 • 313 • 16
AlignmentResearch/IMDB-test • Viewer • Updated Jul 26, 2024 • 97.5k • 23
AlignmentResearch/EnronSpam-test • Viewer • Updated Jul 26, 2024 • 62.4k • 22
AlignmentResearch/boxoban-astar-solutions • Preview • Updated Jul 25, 2024 • 94
AlignmentResearch/RuLES-Encryption • Viewer • Updated Jul 16, 2024 • 50k • 12 • 1