BizFinBench.v2: A Unified Dual-Mode Bilingual Benchmark for Expert-Level Financial Capability Alignment Paper • 2601.06401 • Published 21 days ago • 10
BizFinBench.v2: A Unified Dual-Mode Bilingual Benchmark for Expert-Level Financial Capability Alignment Paper • 2601.06401 • Published 21 days ago • 10
PuzzleClone: An SMT-Powered Framework for Synthesizing Verifiable Data Paper • 2508.15180 • Published Aug 21, 2025 • 1
CARFT: Boosting LLM Reasoning via Contrastive Learning with Annotated Chain-of-Thought-based Reinforced Fine-Tuning Paper • 2508.15868 • Published Aug 21, 2025 • 3
RETuning: Upgrading Inference-Time Scaling for Stock Movement Prediction with Large Language Models Paper • 2510.21604 • Published Oct 24, 2025