artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin
Nathan Lambert
natolambert
AI & ML interests
Reinforcement learning, Ethics, Robotics, Dynamics Models
Recent Activity
liked a dataset 4 days ago
open-thoughts/OpenThoughts-Agent-SFT-100K upvoted a collection 6 days ago
Tmax liked a model about 1 month ago
openbmb/BitCPM-CANN-3B-unquantizedOrganizations
[lecture artifacts] aligning open language models
artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin
2024 Interconnects Artifacts
Models & datasets mentioned in the bottom section of posts!
Reward models on the hub
UNMAINTAINED: See RewardBench... A place to collect reward models, an often not released artifact of RLHF.
2023 Interconnects Artifacts
Models & datasets mentioned in the bottom section of posts!