reasoning - a kevpan Collection

kevpan 's Collections

vlm

reasoning

updated 5 days ago

Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs

Paper • 2511.16664 • Published 20 days ago • 25
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 10 days ago • 83