Baichuan Zhou

bczhou

https://baichuanzhou.github.io/

baichuanzhou

AI & ML interests

Computer Vision

Recent Activity

liked a Space 25 days ago

HuggingFaceH4/blogpost-scaling-test-time-compute

upvoted a paper about 1 month ago

Perception Tokens Enhance Visual Reasoning in Multimodal Language Models

upvoted a paper about 1 month ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Collections 1

Papers 3

arxiv:2410.09732

arxiv:2408.17267

arxiv:2402.14289

Spaces 1

Clip Gpt2

Models 8

bczhou/tiny-llava-v1-hf

Image-Text-to-Text • Updated Aug 17, 2024 • 1.27k • 56

bczhou/TinyLLaVA-2.0B

Image-Text-to-Text • Updated Jul 26, 2024 • 341 • 5

bczhou/TinyLLaVA-1.5B

Image-Text-to-Text • Updated Jun 14, 2024 • 153 • 16

bczhou/TinyLLaVA-3.1B-Pretrain

Text Generation • Updated Mar 25, 2024 • 136

bczhou/TinyLLaVA-3.1B

Text Generation • Updated Mar 25, 2024 • 1.87k • 25

bczhou/TinyLLaVA-2.0B-SigLIP

Updated Feb 26, 2024 • 335 • 1

bczhou/TinyLLaVA-1.5B-SigLIP

Updated Feb 26, 2024 • 42 • 1

bczhou/TinyLLaVA-3.1B-SigLIP

Updated Feb 26, 2024 • 1.78k • 4

Datasets 7

bczhou/LOKI

Preview • Updated Nov 5, 2024 • 41

bczhou/UrBench

Updated Aug 17, 2024 • 37 • 2

bczhou/CityBench-SubTasks

Viewer • Updated Aug 1, 2024 • 12.8k • 40

bczhou/SyntheticBench-Videos

Viewer • Updated Jul 30, 2024 • 264 • 30

bczhou/CityBench-v0.3

Viewer • Updated Jul 10, 2024 • 9.71k • 30

bczhou/CityBench-v0.2

Viewer • Updated Jul 10, 2024 • 9.71k • 35

bczhou/CityVQA-v0.2

Viewer • Updated May 30, 2024 • 2.5k • 31 • 1