Empirical Recipes for Efficient and Compact Vision-Language Models

Published as a preprint on arXiv, 2026