G$^2$RPO-A: Guided Group Relative Policy Optimization with Adaptive Guidance Paper โข 2508.13023 โข Published Aug 18 โข 1
TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows Paper โข 2512.05150 โข Published 27 days ago โข 74