18 2

YutaoXie

AndreasX1206

Andreas1206

AI & ML interests

None yet

Recent Activity

upvoted a paper 29 days ago

TIPS: Turn-Level Information-Potential Reward Shaping for Search-Augmented LLMs

upvoted a paper 29 days ago

IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL

updated a model 8 months ago

AndreasX1206/test

View all activity

Organizations

New activity in LLM360/guru-RL-92k 11 months ago

Update README.md

#12 opened 11 months ago by

AndreasX1206

Update README.md

#11 opened 11 months ago by

AndreasX1206

New activity in LLM360/guru-RL-92k-extra-info-compressed 11 months ago

Delete offline_eval/math__aime2025_repeated_8x_240.parquet

#7 opened 11 months ago by

AndreasX1206

New activity in LLM360/guru-RL-92k 11 months ago

Delete offline_eval/math__aime2025_repeated_8x_240.parquet

#9 opened 11 months ago by

AndreasX1206

New activity in LLM360/guru-RL-92k-extra-info-compressed 11 months ago

Delete online_eval/math__olympiad_bench_675.parquet

#6 opened 11 months ago by

AndreasX1206

Delete online_eval/codegen__leetcode2k_386.parquet

#5 opened 11 months ago by

AndreasX1206

Rename online_eval/simulation__arcagi1_200.parquet to online_eval/logic__arcagi1_200.parquet

#4 opened 11 months ago by

AndreasX1206

Delete online_eval/math__minerva_272.parquet

#3 opened 11 months ago by

AndreasX1206

New activity in LLM360/guru-RL-92k 11 months ago

Delete online_eval/logic__graph_logical_dataset_77.parquet

#8 opened 11 months ago by

AndreasX1206

Delete online_eval/codegen__leetcode2k_386.parquet

#7 opened 11 months ago by

AndreasX1206

Delete online_eval/math__minerva_272.parquet

#6 opened 11 months ago by

AndreasX1206

Rename online_eval/table__hitab_300.parquet to online_eval/table__hitab_200.parquet

#5 opened 11 months ago by

AndreasX1206

Delete offline_eval/math__aime2025_repeated_8x_240.parquet

#4 opened 11 months ago by

AndreasX1206

New activity in LLM360/guru-RL-92k-extra-info-compressed 11 months ago

Create README.md

#2 opened 11 months ago by

AndreasX1206

New activity in LLM360/guru-RL-92k 12 months ago

Update README.md

#1 opened 12 months ago by

AndreasX1206

New activity in ucsd-wang-lab-lm/bird_execution_correct_data about 1 year ago

Update passrate train set with 8b data

#3 opened about 1 year ago by

nathomas

Upload train set with 7b passrate

#2 opened about 1 year ago by

nathomas

Upload 4 files

#1 opened about 1 year ago by

nathomas

YutaoXie

AI & ML interests

Recent Activity

Organizations

AndreasX1206's activity

Update README.md

Update README.md

Delete offline_eval/math__aime2025_repeated_8x_240.parquet

Delete offline_eval/math__aime2025_repeated_8x_240.parquet

Delete online_eval/math__olympiad_bench_675.parquet

Delete online_eval/codegen__leetcode2k_386.parquet

Rename online_eval/simulation__arcagi1_200.parquet to online_eval/logic__arcagi1_200.parquet

Delete online_eval/math__minerva_272.parquet

Delete online_eval/logic__graph_logical_dataset_77.parquet

Delete online_eval/codegen__leetcode2k_386.parquet

Delete online_eval/math__minerva_272.parquet

Rename online_eval/table__hitab_300.parquet to online_eval/table__hitab_200.parquet

Delete offline_eval/math__aime2025_repeated_8x_240.parquet

Create README.md

Update README.md

Update passrate train set with 8b data

Upload train set with 7b passrate

Upload 4 files