Z. Li, B. Yang et al. Monkey: Image Resolution and Text Label are important things for Large Multi-modal Models
发布时间:2024-04-01
点击次数:
- 发表刊物:
- IEEE/CVF Conference on Computer Vision and Pattern recognition (CVPR), 2024