Reading list
updated
LoFT: Parameter-Efficient Fine-Tuning for Long-tailed Semi-Supervised
Learning in Open-World Scenarios
Paper
•
2509.09926
•
Published
•
13
What Breaks Knowledge Graph based RAG? Empirical Insights into Reasoning
under Incomplete Knowledge
Paper
•
2508.08344
•
Published
MemMamba: Rethinking Memory Patterns in State Space Model
Paper
•
2510.03279
•
Published
•
72
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs
Paper
•
2510.07499
•
Published
•
48
TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar
Paper
•
2510.14972
•
Published
•
34
ReCode: Unify Plan and Action for Universal Granularity Control
Paper
•
2510.23564
•
Published
•
121
Code Aesthetics with Agentic Reward Feedback
Paper
•
2510.23272
•
Published
•
8
BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via
Balanced Policy Optimization with Adaptive Clipping
Paper
•
2510.18927
•
Published
•
83
ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool
Use
Paper
•
2510.27363
•
Published
•
22
Unlocking the conversion of Web Screenshots into HTML Code with the
WebSight Dataset
Paper
•
2403.09029
•
Published
•
56
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers
Paper
•
2512.17351
•
Published
•
24
Memory in the Age of AI Agents
Paper
•
2512.13564
•
Published
•
124