arxiv:2303.17144
Chenyang Li
MorningsunLee
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
MME-CC: A Challenging Multi-Modal Evaluation Benchmark of Cognitive Capacity
upvoted a paper about 1 month ago
Benchmark Designers Should "Train on the Test Set" to Expose Exploitable Non-Visual Shortcuts
upvoted a paper about 1 month ago
Contamination Detection for VLMs using Multi-Modal Semantic Perturbation
Organizations
None yet