Xu Weiwen's picture

Xu Weiwen

xww033

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

updated a dataset 28 days ago

xww033/train_math_dapo_qwen3-4b_rollout_acc0.8-g2.5pro_solution

published a dataset 28 days ago

xww033/train_math_dapo_qwen3-4b_rollout_acc0.8-g2.5pro_solution

View all activity

Organizations

authored a paper 4 months ago

VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning

Paper • 2507.22607 • Published Jul 30 • 46

authored 6 papers 6 months ago

ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning

Paper • 2506.09513 • Published Jun 11 • 100

Exploiting Reasoning Chains for Multi-hop Science Question Answering

Paper • 2109.02905 • Published Sep 7, 2021 • 1

From Cloze to Comprehension: Retrofitting Pre-trained Masked Language Model to Pre-trained Machine Reader

Paper • 2212.04755 • Published Dec 9, 2022

Can We Further Elicit Reasoning in LLMs? Critic-Guided Planning with Retrieval-Augmentation for Solving Challenging Tasks

Paper • 2410.01428 • Published Oct 2, 2024 • 1

Reasoning Paths Optimization: Learning to Reason and Explore From Diverse Paths

Paper • 2410.10858 • Published Oct 7, 2024

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Paper • 2506.07044 • Published Jun 8 • 114

authored 2 papers over 1 year ago

SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages

Paper • 2407.19672 • Published Jul 29, 2024 • 58

On the Transformations across Reward Model, Parameter Update, and In-Context Prompt

Paper • 2406.16377 • Published Jun 24, 2024 • 13

authored a paper almost 2 years ago

Reasons to Reject? Aligning Language Models with Judgments

Paper • 2312.14591 • Published Dec 22, 2023 • 19