Morning Papers - a fishfillets Collection

fishfillets 's Collections

Morning Papers

updated about 19 hours ago

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30, 2025 • 549
mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 320
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos

Paper • 2601.00393 • Published Jan 1 • 133
LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published Jan 6 • 172
Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization

Paper • 2601.05432 • Published Jan 8 • 169
BabyVision: Visual Reasoning Beyond Language

Paper • 2601.06521 • Published Jan 10 • 200
InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields

Paper • 2601.03252 • Published Jan 6 • 104
Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling

Paper • 2512.23959 • Published Dec 30, 2025 • 111
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens

Paper • 2603.02138 • Published 25 days ago • 149
Moonshine: Speech Recognition for Live Transcription and Voice Commands

Paper • 2410.15608 • Published Oct 21, 2024 • 12
Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence

Paper • 2603.07660 • Published 20 days ago • 84
Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing

Paper • 2603.03143 • Published 24 days ago • 144
OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 17 days ago • 143
Utonia: Toward One Encoder for All Point Clouds

Paper • 2603.03283 • Published 24 days ago • 184
Helios: Real Real-Time Long Video Generation Model

Paper • 2603.04379 • Published 23 days ago • 177
Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published 25 days ago • 189
AI Can Learn Scientific Taste

Paper • 2603.14473 • Published 12 days ago • 408
InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published 10 days ago • 302
Demystifing Video Reasoning

Paper • 2603.16870 • Published 10 days ago • 361
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models

Paper • 2603.16859 • Published 10 days ago • 245
Attention Residuals

Paper • 2603.15031 • Published 12 days ago • 162
MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Paper • 2603.22458 • Published 4 days ago • 124
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

Paper • 2603.21986 • Published 5 days ago • 113