LIU Shih-yang
sliuau
6 followers · 7 following
nbasyl_tw
nbasyl
AI & ML interests
None yet
Organizations
sliuau's activity
liked a Space 2 months ago
Sleeping • 1 • Reward Policy Intuition 🍃 • 1
GRPO vs GDPO: Understanding Multi-Reward Policy Optimization
liked a dataset 3 months ago
allenai/Dolci-RL-Zero-Math-7B • Viewer • Updated Jan 5 • 13.3k • 397 • 10
liked 2 models 3 months ago
Qwen/Qwen3-4B-Instruct-2507 • Text Generation • 4B • Updated Sep 17, 2025 • 6.06M • 778
EssentialAI/rnj-1-instruct • Text Generation • 8B • Updated Dec 24, 2025 • 260k • 312
liked 2 models 4 months ago
mistralai/Ministral-3-3B-Reasoning-2512 • 4B • Updated Jan 15 • 15.4k • 108
allenai/Olmo-3-7B-Think • Text Generation • 528k • Updated Jan 5 • 53.1k • 90
liked 3 models 5 months ago
nvidia/DLER-Llama-Nemotron-8B-Merge-Research • 8B • Updated Oct 25, 2025 • 32 • 16
nvidia/DLER-R1-1.5B-Research • 2B • Updated Oct 25, 2025 • 616 • 17
nvidia/DLER-R1-7B-Research • 8B • Updated Oct 25, 2025 • 149 • 15
liked a dataset 6 months ago
SynthLabsAI/Big-Math-RL-Verified • Viewer • Updated Mar 25, 2025 • 251k • 8.88k • 223
liked a model about 1 year ago
nvidia/Hymba-1.5B-Base • Text Generation • 2B • Updated Nov 26, 2025 • 409 • 156
liked a dataset over 1 year ago
Post-training-Data-Flywheel/flywheel-v2 • Updated Aug 29, 2024 • 4 • 1
liked a model over 1 year ago
nvidia/Mistral-NeMo-Minitron-8B-Base • Text Generation • 8B • Updated Aug 22, 2024 • 3.32k • 177