25 4

Zeyu Zhang

SteveZeyuZhang

https://steve-zeyu-zhang.github.io/

steve-zeyu-zhang

AI & ML interests

Geometric Learning, Generative AI, Computer Vision, Robotics, AI for Health

Recent Activity

published a dataset about 1 hour ago

AIGeeksGroup/Code4D

upvoted a paper 8 days ago

V-Retrver: Evidence-Driven Agentic Reasoning for Universal Multimodal Retrieval

upvoted a paper 13 days ago

HyperAlign: Hypernetwork for Efficient Test-Time Alignment of Diffusion Models

View all activity

Organizations

published a dataset about 1 hour ago

AIGeeksGroup/Code4D

Updated about 1 hour ago

upvoted a paper 8 days ago

V-Retrver: Evidence-Driven Agentic Reasoning for Universal Multimodal Retrieval

Paper • 2602.06034 • Published 9 days ago • 8

upvoted a paper 13 days ago

HyperAlign: Hypernetwork for Efficient Test-Time Alignment of Diffusion Models

Paper • 2601.15968 • Published 23 days ago • 7

authored 2 papers about 1 month ago

VaseVQA: Multimodal Agent and Benchmark for Ancient Greek Pottery

Paper • 2509.17191 • Published Sep 21, 2025 • 1

3D CoCa v2: Contrastive Learners with Test-Time Search for Generalizable Spatial Intelligence

Paper • 2601.06496 • Published Jan 10 • 1

submitted a paper to Daily Papers about 1 month ago

3D CoCa v2: Contrastive Learners with Test-Time Search for Generalizable Spatial Intelligence

Paper • 2601.06496 • Published Jan 10 • 1

authored 2 papers about 1 month ago

CoV: Chain-of-View Prompting for Spatial Reasoning

Paper • 2601.05172 • Published Jan 8 • 10

AnyDepth: Depth Estimation Made Easy

Paper • 2601.02760 • Published Jan 6 • 10

submitted a paper to Daily Papers about 1 month ago

AnyDepth: Depth Estimation Made Easy

Paper • 2601.02760 • Published Jan 6 • 10

authored a paper about 2 months ago

DriveGen3D: Boosting Feed-Forward Driving Scene Generation with Efficient Video Diffusion

Paper • 2510.15264 • Published Oct 17, 2025 • 4

submitted a paper to Daily Papers 2 months ago

DragMesh: Interactive 3D Generation Made Easy

Paper • 2512.06424 • Published Dec 6, 2025 • 1

published a model 2 months ago

AIGeeksGroup/DragMesh

Robotics • Updated Dec 28, 2025

authored a paper 2 months ago

EgoLCD: Egocentric Video Generation with Long Context Diffusion

Paper • 2512.04515 • Published Dec 4, 2025 • 6

commented a paper 2 months ago

EgoLCD: Egocentric Video Generation with Long Context Diffusion

Paper • 2512.04515 • Published Dec 4, 2025 • 6 •

authored a paper 2 months ago

BlockVid: Block Diffusion for High-Quality and Consistent Minute-Long Video Generation

Paper • 2511.22973 • Published Nov 28, 2025 • 6

commented a paper 2 months ago

BlockVid: Block Diffusion for High-Quality and Consistent Minute-Long Video Generation

Paper • 2511.22973 • Published Nov 28, 2025 • 6 •

New activity in heyuanyu/LV-Bench 3 months ago

Update README.md

#1 opened 3 months ago by

SteveZeyuZhang

authored 3 papers 3 months ago

MobileVLA-R1: Reinforcing Vision-Language-Action for Mobile Robots

Paper • 2511.17889 • Published Nov 22, 2025 • 5

Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation

Paper • 2511.20714 • Published Nov 25, 2025 • 49

EvoVLA: Self-Evolving Vision-Language-Action Model

Paper • 2511.16166 • Published Nov 20, 2025 • 6

Zeyu Zhang

AI & ML interests

Recent Activity

Organizations

SteveZeyuZhang's activity

Update README.md