Nathan Lambert
natolambert
AI & ML interests
Reinforcement learning, Ethics, Robotics, Dynamics Models
Recent Activity
upvoted a collection 13 days ago: NVIDIA Nemotron v3
upvoted a collection 13 days ago: Nemotron-Post-Training-v3
liked a model 13 days ago: nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8
[lecture artifacts] aligning open language models
artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin
2024 Interconnects Artifacts
Models & datasets mentioned in the bottom section of posts!
Reward models on the hub
UNMAINTAINED: See RewardBench... A place to collect reward models, an artifact of RLHF that often goes unreleased.
2023 Interconnects Artifacts
Models & datasets mentioned in the bottom section of posts!