view article Article ColPali: Efficient Document Retrieval with Vision Language Models ๐ Jul 5, 2024 โข 306
view article Article Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick Oct 24, 2024 โข 14
view article Article Assisted Generation: a new direction toward low-latency text generation May 11, 2023 โข 74