HumanAgencyBench: Scalable Evaluation of Human Agency Support in AI Assistants Paper • 2509.08494 • Published Sep 10, 2025 • 3
Towards Enterprise-Ready Computer Using Generalist Agent Paper • 2503.01861 • Published Feb 24, 2025 • 2
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published Aug 8, 2025 • 206
gx-ai-architect/ultrafeedback-dice-iter1-sft-drsow-first-half-vanilla-router Viewer • Updated Apr 5, 2025 • 60.9k • 4
gx-ai-architect/ultrafeedback-qwen-32b-instruct-vanilla-router-alpha-normalize-0.04-bo32-correct-long Viewer • Updated Mar 31, 2025 • 52k • 1
gx-ai-architect/ultrafeedback-qwen-32b-instruct-vanilla-router-length-normalize-bo32-correct Viewer • Updated Mar 31, 2025 • 52k • 4
gx-ai-architect/ultrafeedback-qwen-32b-instruct-vanilla-router-length-normalize-bo32 Viewer • Updated Mar 31, 2025 • 60.9k • 2
gx-ai-architect/ultrafeedback-qwen-32b-instruct-vanilla-router-alpha-normalize-0.04-bo32 Viewer • Updated Mar 31, 2025 • 60.9k • 1
gx-ai-architect/ultrafeedback-eurus-7b-classifier-annotation-bo32 Viewer • Updated Mar 30, 2025 • 60.8k • 3
gx-ai-architect/ultrafeedback-qwen32b-instruct-vs-base-vanilla-router-filter-minus50-bo32 Viewer • Updated Mar 30, 2025 • 57.9k • 1