artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin
Nathan Lambert
natolambert
AI & ML interests
Reinforcement learning, Ethics, Robotics, Dynamics Models
Recent Activity
upvoted a collection about 2 hours ago
Gemma 4 upvoted a collection 17 days ago
NVIDIA Nemotron v3 upvoted a collection 17 days ago
Nemotron-Post-Training-v3Organizations
[lecture artifacts] aligning open language models
artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin
2024 Interconnects Artifacts
Models & datasets mentioned in the bottom section of posts!
Reward models on the hub
UNMAINTAINED: See RewardBench... A place to collect reward models, an often not released artifact of RLHF.
2023 Interconnects Artifacts
Models & datasets mentioned in the bottom section of posts!