AMindToThink/ppo_with_value14

Generated from Trainer


AMindToThink committed on Apr 16, 2025

Commit ed722ce · verified · 1 Parent(s): 01cd9f4

End of training

Files changed (1)

README.md +2 -1


@@ -1,5 +1,6 @@
 ---
 base_model: EleutherAI/pythia-160m
+datasets: trl-internal-testing/descriptiveness-sentiment-trl-style
 library_name: transformers
 model_name: ppo_with_value14
 tags:
@@ -9,7 +10,7 @@ licence: license
 # Model Card for ppo_with_value14
-This model is a fine-tuned version of [EleutherAI/pythia-160m](https://huggingface.co/EleutherAI/pythia-160m).
+This model is a fine-tuned version of [EleutherAI/pythia-160m](https://huggingface.co/EleutherAI/pythia-160m) on the [trl-internal-testing/descriptiveness-sentiment-trl-style](https://huggingface.co/datasets/trl-internal-testing/descriptiveness-sentiment-trl-style) dataset.
 It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start