4 3 9

Xiao Wang

CherryDurian

https://xiaowangnlp.github.io/

AI & ML interests

LLM,Reasoning,RL

Recent Activity

new activity about 1 month ago

moonshotai/Kimi-K2-Thinking:Any plan to open source the search agent framework?

new activity about 1 month ago

moonshotai/Kimi-K2-Thinking:K2 Thinking Browsecomp/HLE Reproducibility | 结果复现

upvoted a paper about 2 months ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

View all activity

Organizations

None yet

authored 17 papers 5 months ago

Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric

Paper • 2502.17184 • Published Feb 24 • 1

DocFusion: A Unified Framework for Document Parsing Tasks

Paper • 2412.12505 • Published Dec 17, 2024

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8 • 195

Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision

Paper • 2411.16579 • Published Nov 25, 2024 • 3

Aligning Large Language Models from Self-Reference AI Feedback with one General Principle

Paper • 2406.11190 • Published Jun 17, 2024

SafeAligner: Safety Alignment against Jailbreak Attacks via Response Disparity Guidance

Paper • 2406.18118 • Published Jun 26, 2024

Beyond Boundaries: Learning a Universal Entity Taxonomy across Datasets and Languages for Open Named Entity Recognition

Paper • 2406.11192 • Published Jun 17, 2024

EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models

Paper • 2403.12171 • Published Mar 18, 2024

RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions

Paper • 2402.16431 • Published Feb 26, 2024

The Rise and Potential of Large Language Model Based Agents: A Survey

Paper • 2309.07864 • Published Sep 14, 2023 • 7

Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback

Paper • 2401.11458 • Published Jan 21, 2024 • 2

Secrets of RLHF in Large Language Models Part II: Reward Modeling

Paper • 2401.06080 • Published Jan 11, 2024 • 28

LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment

Paper • 2312.09979 • Published Dec 15, 2023 • 2

Orthogonal Subspace Learning for Language Model Continual Learning

Paper • 2310.14152 • Published Oct 22, 2023 • 2

TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models

Paper • 2310.06762 • Published Oct 10, 2023 • 2

Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models

Paper • 2310.02949 • Published Oct 4, 2023 • 3

InstructUIE: Multi-task Instruction Tuning for Unified Information Extraction

Paper • 2304.08085 • Published Apr 17, 2023

Xiao Wang

AI & ML interests

Recent Activity

Organizations

CherryDurian's activity