Nathan Lambert's picture

Nathan Lambert

natolambert

·

https://www.natolambert.com/

AI & ML interests

Reinforcement learning, Ethics, Robotics, Dynamics Models

Recent Activity

updated a dataset about 3 hours ago

allenai/Dolci-Instruct-RL

liked a model about 3 hours ago

Motif-Technologies/Motif-2-12.7B-Instruct

updated a dataset about 9 hours ago

allenai/Dolci-RL-Zero-General-7B

View all activity

Organizations

upvoted a paper 1 day ago

olmOCR 2: Unit Test Rewards for Document OCR

Paper • 2510.19817 • Published Oct 22 • 15

upvoted a collection 20 days ago

Olmo 3 Post-training

All artifacts for post-training Olmo 3. Datasets follow the model that resulted from training on them. • 32 items • Updated about 7 hours ago • 38

upvoted a collection about 2 months ago

Olmo 3

Artifacts for the Olmo 3 release. • 9 items • Updated about 7 hours ago • 144

upvoted an article 4 months ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

+3

Jul 29

•

203

upvoted a paper 5 months ago

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published Jul 2 • 56

upvoted a collection 5 months ago

Reward Models 06-2025

Nemotron reward models. For use in RLHF pipelines and LLM-as-a-Judge • 8 items • Updated 6 days ago • 22

upvoted 2 collections 6 months ago

Reward Bench 2

Datasets, spaces, and models for Reward Bench 2 benchmark and paper! • 11 items • Updated about 7 hours ago • 16

Common Pile v0.1

All resources related to Common Pile v0.1, an 8TB dataset of public domain and openly licensed text • 4 items • Updated Jun 6 • 38

upvoted a collection 7 months ago

OpenVision

27 items • Updated Aug 15 • 32

upvoted a collection 8 months ago

Qwen3

84 items • Updated Aug 6 • 1.48k

upvoted a paper 8 months ago

Reinforcement Learning from Human Feedback

Paper • 2504.12501 • Published Apr 16 • 4

upvoted a collection 10 months ago

OLMoE (January 2025)

Improved OLMoE for iOS app. Read more: https://allenai.org/blog/olmoe-app • 10 items • Updated about 7 hours ago • 16

upvoted an article 11 months ago

Article

Putting RL back in RLHF

Jun 12, 2024

•

109

upvoted a collection 11 months ago

2024 Interconnects Artifacts

Models & datasets mentioned in the bottom section of posts! • 280 items • Updated Jan 2 • 6

upvoted a paper 12 months ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 147

upvoted 5 collections about 1 year ago

PixMo

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 10 items • Updated about 7 hours ago • 81

OLMo 2

Artifacts for the OLMo 2 release. • 35 items • Updated about 7 hours ago • 150

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated about 7 hours ago • 103

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated about 7 hours ago • 96

Molmo

Artifacts for open multimodal language models. • 5 items • Updated about 7 hours ago • 308