Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models

Publication
In Tech Report