YutaoXie
AndreasX1206
AI & ML interests
None yet
Recent Activity
upvoted a paper 29 days ago
TIPS: Turn-Level Information-Potential Reward Shaping for Search-Augmented LLMs
upvoted a paper 29 days ago
IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL
updated a model 8 months ago
AndreasX1206/test