Update README.md (#13)
- Update README.md (e8502659e3d5a90124a1294331de3b1bc8a9d5d4)
Co-authored-by: Ervin Castelino <[email protected]>
README.md CHANGED
@@ -16,7 +16,7 @@ Disclaimer: The team releasing LLaVa-NeXT-Video did not write a model card for t
 ## 📄 Model details
 
 **Model type:**
-LLaVA-Next-Video is an open-source chatbot trained by fine-tuning LLM on multimodal instruction-following data. The model is buit on top of LLaVa-NeXT by tuning on a mix of video and image data to
+LLaVA-Next-Video is an open-source chatbot trained by fine-tuning an LLM on multimodal instruction-following data. The model is built on top of LLaVa-NeXT by tuning on a mix of video and image data to achieve better video understanding capabilities. The videos were sampled uniformly to be 32 frames per clip.
 The model is a current SOTA among open-source models on [VideoMME bench](https://arxiv.org/abs/2405.21075).
 Base LLM: [lmsys/vicuna-7b-v1.5](https://huggingface.co/lmsys/vicuna-7b-v1.5)
 
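For context on the "32 frames sampled uniformly per clip" detail added by this change, below is a minimal sketch of what uniform frame sampling looks like. The helper name `sample_uniform_frames` and the use of PyAV and NumPy are illustrative assumptions, not part of this commit or the model's preprocessing code.

```python
import av
import numpy as np

def sample_uniform_frames(video_path: str, num_frames: int = 32) -> np.ndarray:
    """Return `num_frames` uniformly spaced RGB frames from a clip.

    Sketch only: clips shorter than `num_frames` would need padding,
    which is omitted here.
    """
    container = av.open(video_path)
    total = container.streams.video[0].frames  # frame count from container metadata
    # Evenly spaced frame indices spanning the whole clip.
    indices = set(np.linspace(0, total - 1, num_frames).astype(int).tolist())
    frames = [
        frame.to_ndarray(format="rgb24")
        for i, frame in enumerate(container.decode(video=0))
        if i in indices
    ]
    container.close()
    return np.stack(frames)  # shape: (num_frames, height, width, 3)
```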
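And since the README describes the model as an instruction-following chatbot, a minimal inference sketch may help readers connect the description to usage. The checkpoint id `llava-hf/LLaVA-NeXT-Video-7B-hf` and the `LlavaNextVideo*` classes come from the transformers integration of this model, not from this commit, so treat the snippet as an assumption-laden illustration rather than the official example.

```python
# Sketch: running the model on a pre-sampled 32-frame clip.
# Assumptions: the llava-hf/LLaVA-NeXT-Video-7B-hf checkpoint id and the
# transformers LlavaNextVideo* classes; not part of this README change.
import torch
from transformers import LlavaNextVideoProcessor, LlavaNextVideoForConditionalGeneration

model_id = "llava-hf/LLaVA-NeXT-Video-7B-hf"  # assumed checkpoint id
processor = LlavaNextVideoProcessor.from_pretrained(model_id)
model = LlavaNextVideoForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

clip = sample_uniform_frames("example.mp4")  # (32, H, W, 3), from the sketch above

conversation = [
    {
        "role": "user",
        "content": [
            {"type": "text", "text": "Why is this video funny?"},
            {"type": "video"},
        ],
    }
]
prompt = processor.apply_chat_template(conversation, add_generation_prompt=True)
inputs = processor(text=prompt, videos=clip, return_tensors="pt").to(model.device)

out = model.generate(**inputs, max_new_tokens=60)
print(processor.decode(out[0], skip_special_tokens=True))
```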