AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning
Agentic Reasoning for Large Language Models
models
0
None public yet
datasets
0
None public yet