Zhe Cao

MichaelCaoo

AI & ML interests

None yet

Recent Activity

new activity 2 days ago

NJU-LINK/T2AV-Compass:Update README.md

new activity 2 days ago

NJU-LINK/T2AV-Compass:Upload 0000.parquet

upvoted a paper 2 days ago

T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation

Organizations

New activity in NJU-LINK/T2AV-Compass 2 days ago

Update README.md

#5 opened 2 days ago by MichaelCaoo

Upload 0000.parquet

#4 opened 2 days ago by MichaelCaoo

upvoted a paper 2 days ago

T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation

Paper • 2512.21094 • Published 3 days ago • 24

New activity in NJU-LINK/T2AV-Compass 5 days ago

Upload prompts_with_checklist.json

#1 opened 5 days ago by MichaelCaoo

upvoted a paper 13 days ago

ViDiC: Video Difference Captioning

Paper • 2512.03405 • Published 24 days ago • 27

upvoted 2 papers 25 days ago

How Far Are We from Genuinely Useful Deep Research Agents?

Paper • 2512.01948 • Published 26 days ago • 53

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23 • 276

upvoted a paper 29 days ago

Video Generation Models Are Good Latent Reward Models

Paper • 2511.21541 • Published about 1 month ago • 45

authored a paper 30 days ago

OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs

Paper • 2510.10689 • Published Oct 12 • 46

updated a model about 2 months ago

MichaelCaoo/RoboTwin_DP3_ckpt

Updated Nov 12

upvoted 2 papers about 2 months ago

MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs

Paper • 2511.07250 • Published Nov 10 • 17

UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions

Paper • 2511.03334 • Published Nov 5 • 52

published a model about 2 months ago

MichaelCaoo/RoboTwin_DP3_ckpt

Updated Nov 12

upvoted 2 papers about 2 months ago

EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling

Paper • 2509.23909 • Published Sep 28 • 32

Robot Learning: A Tutorial

Paper • 2510.12403 • Published Oct 14 • 118

upvoted 5 papers 2 months ago

A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning

Paper • 2510.15444 • Published Oct 17 • 147

IF-VidCap: Can Video Caption Models Follow Instructions?

Paper • 2510.18726 • Published Oct 21 • 24

MT-Video-Bench: A Holistic Video Understanding Benchmark for Evaluating Multimodal LLMs in Multi-Turn Dialogues

Paper • 2510.17722 • Published Oct 20 • 19

AI for Service: Proactive Assistance with AI Glasses

Paper • 2510.14359 • Published Oct 16 • 74

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13 • 176
