Cong Wei PRO

CongWei1230

https://congwei1230.github.io/

AI & ML interests

Generative Models; Reasoning

Recent Activity

upvoted a paper 8 days ago

MultiShotMaster: A Controllable Multi-Shot Video Generation Framework

Paper • 2512.03041 • Published 9 days ago • 62

upvoted 2 papers 9 days ago

Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout

Paper • 2511.20649 • Published 16 days ago • 45

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published 10 days ago • 61

upvoted a paper about 1 month ago

VisCoder2: Building Multi-Language Visualization Coding Agents

Paper • 2510.23642 • Published Oct 24 • 21

upvoted 2 papers about 2 months ago

Latent Diffusion Model without Variational Autoencoder

Paper • 2510.15301 • Published Oct 17 • 48

BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions

Paper • 2510.10666 • Published Oct 12 • 27

upvoted 2 papers 2 months ago

UniVideo: Unified Understanding, Generation, and Editing for Videos

Paper • 2510.08377 • Published Oct 9 • 70

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9 • 266

upvoted 2 papers 3 months ago

OpenVision 2: A Family of Generative Pretrained Visual Encoders for Multimodal Learning

Paper • 2509.01644 • Published Sep 1 • 33

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published Sep 1 • 75

upvoted a paper 5 months ago

First Return, Entropy-Eliciting Explore

Paper • 2507.07017 • Published Jul 9 • 23

upvoted 5 papers 7 months ago

ACECODER: Acing Coder RL via Automated Test-Case Synthesis

Paper • 2502.01718 • Published Feb 3 • 29

QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design

Paper • 2505.16175 • Published May 22 • 41

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Paper • 2505.15966 • Published May 21 • 53

VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation

Paper • 2505.14640 • Published May 20 • 16

General-Reasoner: Advancing LLM Reasoning Across All Domains

Paper • 2505.14652 • Published May 20 • 24

upvoted 2 collections 7 months ago

MoCha

Collection

The pioneering work in Dialogue-driven Movie Shot Generation • 3 items • Updated May 6 • 1

MoCha

Collection

The pioneering work in Dialogue-driven Movie Shot Generation • 3 items • Updated May 6 • 2

upvoted 2 papers 8 months ago

VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

Paper • 2504.08837 • Published Apr 10 • 43

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published Mar 30 • 138
