Shao qi's picture

23 10

Shao qi

OmniTN

·

AI & ML interests

None yet

Recent Activity

liked a dataset 7 days ago

bigai/TongSIM-Asset

liked a model 21 days ago

FutureMa/Qwen3-8B-Drama-Thinking

upvoted a paper about 1 month ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

View all activity

Organizations

None yet

upvoted 2 papers about 1 month ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published Dec 4, 2025 • 167

The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment

Paper • 2511.20614 • Published Nov 25, 2025 • 37

upvoted 4 papers 3 months ago

TrajSelector: Harnessing Latent Representations for Efficient and Effective Best-of-N in Large Reasoning Model

Paper • 2510.16449 • Published Oct 18, 2025 • 34

LongCodeZip: Compress Long Context for Code Language Models

Paper • 2510.00446 • Published Oct 1, 2025 • 106

StableToken: A Noise-Robust Semantic Speech Tokenizer for Resilient SpeechLLMs

Paper • 2509.22220 • Published Sep 26, 2025 • 65

RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation

Paper • 2509.16198 • Published Sep 19, 2025 • 126

upvoted 5 papers 4 months ago

MachineLearningLM: Continued Pretraining Language Models on Millions of Synthetic Tabular Prediction Tasks Scales In-Context ML

Paper • 2509.06806 • Published Sep 8, 2025 • 63

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19, 2025 • 118

MV-RAG: Retrieval Augmented Multiview Diffusion

Paper • 2508.16577 • Published Aug 22, 2025 • 38

Visual-CoG: Stage-Aware Reinforcement Learning with Chain of Guidance for Text-to-Image Generation

Paper • 2508.18032 • Published Aug 25, 2025 • 42

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 211

upvoted 9 papers 11 months ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published Feb 13, 2025 • 191

Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning

Paper • 2502.06533 • Published Feb 10, 2025 • 17

SliderSpace: Decomposing the Visual Capabilities of Diffusion Models

Paper • 2502.01639 • Published Feb 3, 2025 • 26

SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model

Paper • 2501.18636 • Published Jan 28, 2025 • 31

Temporal Preference Optimization for Long-Form Video Understanding

Paper • 2501.13919 • Published Jan 23, 2025 • 23

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

Paper • 2501.14492 • Published Jan 24, 2025 • 27

Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models

Paper • 2501.12370 • Published Jan 21, 2025 • 11

iFormer: Integrating ConvNet and Transformer for Mobile Application

Paper • 2501.15369 • Published Jan 26, 2025 • 13

ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer

Paper • 2501.15570 • Published Jan 26, 2025 • 25