BRlkl/GRPO-1_20
Tags: Transformers, Safetensors, unsloth, arxiv:1910.09700
Branch: main
1 contributor
History: 4 commits
Latest commit: 2caed51 (verified) by BRlkl, "checkpoint 20-percent (step 20)", 26 days ago
File                       Size           Last commit                        Updated
.gitattributes             1.57 kB        Upload model trained with Unsloth  27 days ago
README.md                  5.18 kB        Upload model trained with Unsloth  27 days ago
adapter_config.json        1.2 kB         checkpoint 20-percent (step 20)    26 days ago
adapter_model.safetensors  529 MB (xet)   checkpoint 20-percent (step 20)    26 days ago
added_tokens.json          707 Bytes      Upload model trained with Unsloth  27 days ago
chat_template.jinja        4.01 kB        Upload model trained with Unsloth  27 days ago
merges.txt                 1.67 MB        Upload model trained with Unsloth  27 days ago
special_tokens_map.json    496 Bytes      Upload model trained with Unsloth  27 days ago
tokenizer.json             11.4 MB (xet)  Upload model trained with Unsloth  27 days ago
tokenizer_config.json      5.43 kB        Upload model trained with Unsloth  27 days ago
vocab.json                 2.78 MB        Upload model trained with Unsloth  27 days ago