Consolidating Reinforcement Learning for Multimodal Discrete Diffusion Models Paper • 2510.02880 • Published Oct 3, 2025