Norris Wheeler's picture

Norris Wheeler

wheeler404

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models

upvoted a paper 1 day ago

Mobile GUI Agent Privacy Personalization with Trajectory Induced Preference Optimization

upvoted a paper 1 day ago

TRACE: Capability-Targeted Agentic Training

View all activity

Organizations

None yet

upvoted 14 papers 1 day ago

From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models

Paper • 2604.09459 • Published 4 days ago • 11

Mobile GUI Agent Privacy Personalization with Trajectory Induced Preference Optimization

Paper • 2604.11259 • Published 4 days ago • 11

TRACE: Capability-Targeted Agentic Training

Paper • 2604.05336 • Published 10 days ago • 13

Agentic Aggregation for Parallel Scaling of Long-Horizon Agentic Tasks

Paper • 2604.11753 • Published 4 days ago • 14

AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents

Paper • 2603.27490 • Published 19 days ago • 17

Efficient RL Training for LLMs with Experience Replay

Paper • 2604.08706 • Published 8 days ago • 17

SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks

Paper • 2604.08865 • Published 7 days ago • 27

Multi-User Large Language Model Agents

Paper • 2604.08567 • Published 29 days ago • 25

CodeTracer: Towards Traceable Agent States

Paper • 2604.11641 • Published 4 days ago • 37

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

Paper • 2604.12627 • Published 3 days ago • 92

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 3 days ago • 71

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Paper • 2604.10098 • Published 6 days ago • 72

The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping

Paper • 2604.11297 • Published 4 days ago • 132

ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents

Paper • 2604.11784 • Published 4 days ago • 129

upvoted a collection 8 months ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.75k

upvoted an article 8 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

+10

Aug 5, 2025

•

513