argilla/distilabel-intel-orca-dpo-pairs
Viewer
•
Updated
•
12.9k
•
3.91k
•
181
Viewer
•
Updated
•
66.4k
•
5.9k
•
214
argilla/ultrafeedback-binarized-preferences-cleaned
Viewer
•
Updated
•
60.9k
•
3.69k
•
154
Viewer
•
Updated
•
15.3k
•
64
•
19
theblackcat102/evol-codealpaca-v1
Viewer
•
Updated
•
111k
•
2.55k
•
171
Viewer
•
Updated
•
395k
•
9.4k
•
429
glaiveai/glaive-code-assistant-v2
Viewer
•
Updated
•
215k
•
375
•
49
Viewer
•
Updated
•
12.9k
•
3.05k
•
317
Viewer
•
Updated
•
183k
•
982
•
294
garage-bAInd/Open-Platypus
Viewer
•
Updated
•
24.9k
•
4.29k
•
412
LLM360/CrystalCoderDatasets
Updated
•
5.21k
•
21
protectai/deberta-v3-base-prompt-injection
Text Classification
•
0.2B
•
Updated
•
20k
•
•
89
nampdn-ai/tiny-orca-textbooks
Viewer
•
Updated
•
147k
•
65
•
43
code-search-net/code_search_net
Updated
•
17k
•
316
WhiteRabbitNeo/WRN-Chapter-1
Viewer
•
Updated
•
7.75k
•
82
•
51
WhiteRabbitNeo/WRN-Chapter-2
Viewer
•
Updated
•
11.1k
•
67
•
21
Text Generation
•
Updated
•
485
•
205
Viewer
•
Updated
•
31.1M
•
49k
•
649
Viewer
•
Updated
•
3.54k
•
132
•
55
NousResearch/json-mode-eval
Viewer
•
Updated
•
100
•
585
•
40
Viewer
•
Updated
•
2.75M
•
9.14k
•
379
Viewer
•
Updated
•
518k
•
21
•
1
laurentiubp/openhermes-scored
Viewer
•
Updated
•
185k
•
19
•
1
Towards Best Practices for Open Datasets for LLM Training
Paper
•
2501.08365
•
Published
•
62