Update README.md
Browse files
README.md
CHANGED
|
@@ -10,7 +10,7 @@ language:
|
|
| 10 |
|
| 11 |
# Model Card for Olmo 3 32B
|
| 12 |
|
| 13 |
-
We introduce Olmo 3, a new family of 7B and 32B models featuring a [TODO: insert gain in performance], among other evaluation improvements, compared to the most recent Olmo 2 7B model. These gains come from training on [dolma3-mix-1025](https://huggingface.co/datasets/allenai/dolma3_mix-6T-1025)
|
| 14 |
|
| 15 |
Olmo is a series of **O**pen **l**anguage **mo**dels designed to enable the science of language models.
|
| 16 |
These models are trained on the Dolma 3 dataset. We are releasing all code, checkpoints, logs (coming soon), and associated training details.
|
|
|
|
| 10 |
|
| 11 |
# Model Card for Olmo 3 32B
|
| 12 |
|
| 13 |
+
We introduce Olmo 3, a new family of 7B and 32B models featuring a [TODO: insert gain in performance], among other evaluation improvements, compared to the most recent Olmo 2 7B model. These gains come from training on [dolma3-mix-1025](https://huggingface.co/datasets/allenai/dolma3_mix-6T-1025) and [dolma3-dolmino-mix-1025](https://huggingface.co/datasets/allenai/dolma3_dolmino_mix-100B-1025) datasets and staged training approach.
|
| 14 |
|
| 15 |
Olmo is a series of **O**pen **l**anguage **mo**dels designed to enable the science of language models.
|
| 16 |
These models are trained on the Dolma 3 dataset. We are releasing all code, checkpoints, logs (coming soon), and associated training details.
|