SliderEdit: Continuous Image Editing with Fine-Grained Instruction Control Paper • 2511.09715 • Published Nov 12 • 8
Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model Paper • 2312.12423 • Published Dec 19, 2023 • 13
EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone Paper • 2307.05463 • Published Jul 11, 2023 • 11