Langlin Huang

shrango

https://shrango.github.io/

AI & ML interests

LLM Reasoning, Machine Translation

Recent Activity

upvoted a paper 1 day ago

Small RL Controller, Large Language Model: RL-Guided Adaptive Sampling for Test-Time Scaling

upvoted a paper 14 days ago

You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories

upvoted a paper 15 days ago

Process Rewards with Learned Reliability

Papers 2

arxiv:2601.05167

arxiv:2506.03566

Models 27

shrango/fake_english_advshape_policyshape_qwen3-1.7b-base

2B • Updated May 3 • 7

shrango/ascii_advshape_policyshape_qwen3-1.7b-base

2B • Updated May 2 • 6

shrango/markovify_advshape_policy_shape_qwen3-1.7b-base

2B • Updated May 1 • 8

shrango/random_la_advshape_policyshape_qwen3-1.7b-base

2B • Updated Apr 30 • 6

shrango/lorem_advshape_qwen3-1.7b-base

2B • Updated Apr 26 • 6

shrango/lorem_policy_shape_adv_shape_qwen2.5-math_7b

8B • Updated Apr 25 • 2

shrango/lorem_advshape_policyshape_qwen2.5_math_7b_170

shrango/lorem_advshape_policyshape_qwen2.5_math_7b_150

shrango/lorem_advshape_qwen2.5-math-7b

8B • Updated Apr 24 • 2

shrango/lorem_octothinker3b-base_wokl_luffy

3B • Updated Apr 17 • 3

Datasets 5

shrango/LoPE-train-openr1.parquet

Viewer • Updated Apr 23 • 500 • 26

shrango/spatial-minecraft-stitched

Viewer • Updated Dec 1, 2025 • 508 • 6 • 1

shrango/spatial-minecraft

Viewer • Updated Dec 1, 2025 • 1.02k • 56

shrango/eagle3_training_data

Updated Sep 12, 2025 • 4

shrango/gsm8k-first20-llama3.1-8B-instruct-repeat16

Viewer • Updated Mar 5, 2025 • 16 • 16