Linli Yao, Yicheng Li, Yuancheng Wei, Lei Li, Shuhuai Ren, Yuanxin Liu, Kun Ouyang, Lean Wang, Shicheng Li, Sida Li, Lingpeng Kong, Qi Liu, Yuanxing Zhang, Xu Sun
(2025).
TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos.
In
ACM Multimedia.
Lei Li, Yuancheng Wei, Zhihui Xie, Xuqing Yang, Yifan Song, Peiyi Wang, Chenxin An, Tianyu Liu, Sujian Li, Bill Yuchen Lin, Lingpeng Kong, Qi Liu
(2025).
VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models.
In
CVPR.
Lei Li, Yuwei Yin, Shicheng Li, Liang Chen, Peiyi Wang, Shuhuai Ren, Mukai Li, Yazheng Yang, Jingjing Xu, Xu Sun, Lingpeng Kong, Qi Liu
(2023).
M3IT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning.
In
arXiv.