Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric Paper • 2502.17184 • Published Feb 24, 2025 • 1
DocFusion: A Unified Framework for Document Parsing Tasks Paper • 2412.12505 • Published Dec 17, 2024
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published Aug 8, 2025 • 195
Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision Paper • 2411.16579 • Published Nov 25, 2024 • 3
Aligning Large Language Models from Self-Reference AI Feedback with one General Principle Paper • 2406.11190 • Published Jun 17, 2024
SafeAligner: Safety Alignment against Jailbreak Attacks via Response Disparity Guidance Paper • 2406.18118 • Published Jun 26, 2024
Beyond Boundaries: Learning a Universal Entity Taxonomy across Datasets and Languages for Open Named Entity Recognition Paper • 2406.11192 • Published Jun 17, 2024
EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models Paper • 2403.12171 • Published Mar 18, 2024
RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions Paper • 2402.16431 • Published Feb 26, 2024
The Rise and Potential of Large Language Model Based Agents: A Survey Paper • 2309.07864 • Published Sep 14, 2023 • 7
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback Paper • 2401.11458 • Published Jan 21, 2024 • 2
Secrets of RLHF in Large Language Models Part II: Reward Modeling Paper • 2401.06080 • Published Jan 11, 2024 • 28
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment Paper • 2312.09979 • Published Dec 15, 2023 • 2
Orthogonal Subspace Learning for Language Model Continual Learning Paper • 2310.14152 • Published Oct 22, 2023 • 2
TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models Paper • 2310.06762 • Published Oct 10, 2023 • 2
Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models Paper • 2310.02949 • Published Oct 4, 2023 • 3
InstructUIE: Multi-task Instruction Tuning for Unified Information Extraction Paper • 2304.08085 • Published Apr 17, 2023