sinagph's Collections
Pretrain Basic
Updated Oct 7, 2023
Skylion007/openwebtext • Viewer • Updated about 19 hours ago • 8.01M • 45.4k • 465
legacy-datasets/c4 • Updated Mar 5, 2024 • 10.3k • 243
legacy-datasets/wikipedia • Updated Mar 11, 2024 • 32.6k • 607
cerebras/SlimPajama-627B • Preview • Updated Jul 7, 2023 • 59.6k • 510
tiiuae/falcon-refinedweb • Viewer • Updated Jun 20, 2023 • 968M • 46.3k • 879
bookcorpus/bookcorpus • Updated May 3, 2024 • 5.9k • 336
EleutherAI/the_pile_deduplicated • Viewer • Updated Dec 2, 2022 • 134M • 17.7k • 106