Optimized VideoLLaMA with improved spatial-temporal modeling and better audio understanding capability
Language Technology Lab at Alibaba DAMO Academy
company
Collections 5
spaces 3
models 42
DAMO-NLP-SG/VideoRefer-7B-stage2.5 • Visual Question Answering • 32 • 2
DAMO-NLP-SG/VideoRefer-7B-stage2 • Visual Question Answering • 13 • 1
DAMO-NLP-SG/VideoRefer-7B • Visual Question Answering • 38 • 2
DAMO-NLP-SG/DiGIT • Unconditional Image Generation • 4
DAMO-NLP-SG/VideoLLaMA2.1-7B-AV • Visual Question Answering • 1.43k • 14
DAMO-NLP-SG/VideoLLaMA2.1-7B-16F • Visual Question Answering • 1.9k • 8
DAMO-NLP-SG/VideoLLaMA2.1-7B-16F-Base • Visual Question Answering • 153 • 1
DAMO-NLP-SG/LiT-B-32_CC12M • 1
DAMO-NLP-SG/VideoLLaMA2-72B • Visual Question Answering • 88 • 10
DAMO-NLP-SG/VideoLLaMA2-72B-Base • Visual Question Answering • 23 • 1
datasets 9
DAMO-NLP-SG/multimodal_textbook • 2.09k • 60
DAMO-NLP-SG/VideoRefer-Bench • 21
DAMO-NLP-SG/CMM • 37 • 5
DAMO-NLP-SG/Multi-Source-Video-Captioning • Viewer • 1.5k • 59 • 6
DAMO-NLP-SG/LongCorpus-2.5B • Preview • 36 • 8
DAMO-NLP-SG/SOUL • Viewer • 15k • 44
DAMO-NLP-SG/MultiJail • Viewer • 315 • 54 • 6
DAMO-NLP-SG/HyperlinkMRC • 37 • 2
DAMO-NLP-SG/SSTuning-datasets • 31