Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Elliott
's Collections
LUFFY-RL
LUFFY-RL
updated
May 30, 2025
Upvote
10
Elliott/LUFFY-Qwen-Math-7B-Zero
Text Generation
•
8B
•
Updated
Apr 23, 2025
•
21
•
1
Elliott/Qwen2.5-Math-7B-16k-think
Text Generation
•
8B
•
Updated
May 28, 2025
•
10.2k
•
•
6
Elliott/Openr1-Math-46k-8192
Viewer
•
Updated
Apr 23, 2025
•
45.8k
•
481
•
9
Learning to Reason under Off-Policy Guidance
Paper
•
2504.14945
•
Published
Apr 21, 2025
•
88
Elliott/LUFFY-Qwen-Math-1.5B-Zero
Text Generation
•
2B
•
Updated
Apr 23, 2025
•
6
•
Elliott/LUFFY-Qwen-Instruct-7B
Text Generation
•
8B
•
Updated
Apr 23, 2025
•
43
•
1
Elliott/Qwen2.5-Math-7B-SFT
Text Generation
•
8B
•
Updated
May 2, 2025
•
2
Elliott/Qwen2.5-Math-7B-SFT-RL
Text Generation
•
8B
•
Updated
May 30, 2025
•
2
Elliott/Openr1-Math-48k-Complement
Viewer
•
Updated
May 30, 2025
•
47.9k
•
15
Upvote
10
+6
Share collection
View history
Collection guide
Browse collections