SpatialTree: How Spatial Abilities Branch Out in MLLMs Paper • 2512.20617 • Published 3 days ago • 41
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling Paper • 2512.14614 • Published 10 days ago • 64
QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management Paper • 2512.12967 • Published 12 days ago • 100
Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics Paper • 2512.12602 • Published 13 days ago • 39
LEO-RobotAgent: A General-purpose Robotic Agent for Language-driven Embodied Operator Paper • 2512.10605 • Published 15 days ago • 6
Task adaptation of Vision-Language-Action model: 1st Place Solution for the 2025 BEHAVIOR Challenge Paper • 2512.06951 • Published 19 days ago • 3
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing Paper • 2509.22186 • Published Sep 26 • 139
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published 29 days ago • 213