arxiv:2412.20070
Zhenyang Cai
Eric3200
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
2 days ago
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One
Vision Token
authored
a paper
10 days ago
On the Compositional Generalization of Multimodal LLMs for Medical
Imaging
commented
a paper
11 days ago
On the Compositional Generalization of Multimodal LLMs for Medical
Imaging