Multilingual for Translation Corpus
  Helsinki-NLP/opus_books • Updated Mar 29, 2024 • 1.25M • 15.4k • 85
models
  rasa/LaBSE • Feature Extraction • Updated May 20, 2021 • 8.39k • 22
  nomic-ai/nomic-embed-text-v1.5 • Sentence Similarity • 0.1B • Updated Jul 21, 2025 • 3.17M • 749
  NovaSearch/stella_en_1.5B_v5 • Sentence Similarity • 2B • Updated Jul 28, 2025 • 39.1k • 258
  llmware/llama-3.2-1b-gguf • 1B • Updated Feb 8, 2025 • 25 • 1
Vietnamese
  ngtoanrob/vien-translation • Translation • Updated Feb 24, 2023 • 126 • 1
  ngtoanrob/envi-translation • Updated Apr 1, 2023 • 4 • 1
  gozu888/Envit5-tuned • Translation • 0.3B • Updated Jun 28, 2023 • 25 • 3
  IWSLT/mt_eng_vietnamese • Updated Jan 18, 2024 • 374 • 29
Wish list
  HuggingFaceH4/ultrachat_200k • Updated Oct 16, 2024 • 515k • 27.7k • 632
  bookcorpus/bookcorpus • Updated May 3, 2024 • 5.8k • 338
  sentence-transformers/wikipedia-en-sentences • Updated Apr 25, 2024 • 7.87M • 179 • 7
  sentence-transformers/paq • Updated May 1, 2024 • 64.4M • 491 • 2
LLMs
  TheBloke/Llama-2-13B-chat-GGML • Text Generation • Updated Sep 27, 2023 • 124 • 696
  TheBloke/Llama-2-7B-32K-Instruct-GGML • Updated Sep 27, 2023 • 16 • 8
  openchat/openchat-3.6-8b-20240522 • Text Generation • 8B • Updated May 28, 2024 • 7.98k • 156
corpuses
  Skylion007/openwebtext • Updated 7 days ago • 8.01M • 44k • 470
  humarin/chatgpt-paraphrases • Updated Apr 5, 2023 • 419k • 251 • 59
  stanford-oval/ccnews • Updated Aug 31, 2024 • 893M • 6.38k • 32
  stanford-oval/wikipedia • Updated Apr 29, 2025 • 345M • 5.89k • 12