DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI Paper • 2512.16676 • Published 10 days ago • 190
Multilingual Encoder Knows more than You Realize: Shared Weights Pretraining for Extremely Low-Resource Languages Paper • 2502.10852 • Published Feb 15 • 2
CMHG: A Dataset and Benchmark for Headline Generation of Minority Languages in China Paper • 2509.09990 • Published Sep 12 • 2