Sentinel-AI/neurips_grpo_model
Tags: Text Generation, PEFT, Safetensors, Transformers, lora, conversational, arxiv:1910.09700
Branch: main
Repository size: 166 MB
2 contributors
History: 3 commits
Latest commit: "Shared the correct model weights" (e5a0911) by yashaektefaie, 28 days ago
File                        Size            Last commit                              Updated
.gitattributes              1.63 kB         Add GRPO LoRA checkpoint with Git LFS    about 1 month ago
README.md                   5.19 kB         Add GRPO LoRA checkpoint with Git LFS    about 1 month ago
adapter_config.json         932 Bytes       Add GRPO LoRA checkpoint with Git LFS    about 1 month ago
adapter_model.safetensors   131 MB (xet)    Shared the correct model weights         28 days ago
chat_template.jinja         1.53 kB         Add GRPO LoRA checkpoint with Git LFS    about 1 month ago
special_tokens_map.json     662 Bytes       Add GRPO LoRA checkpoint with Git LFS    about 1 month ago
tokenizer.json              33.4 MB (xet)   Add GRPO LoRA checkpoint with Git LFS    about 1 month ago
tokenizer_config.json       1.16 MB         Add GRPO LoRA checkpoint with Git LFS    about 1 month ago