
- Zhongguancun Academy & Institute of Automation, Chinese Academy of Sciences
- Beijing, China
- https://xuchen-li.github.io
Highlights
- MyArxiv: Automatically updates arXiv papers on cs.CV, eess.IV, cs.MM, cs.CL and cs.HC using GitHub Actions.
- cv-arxiv-daily: Automatically updates arXiv papers on SOT & VLT, Multi-modal Learning, LLM and Video Understanding using GitHub Actions.
- llm-arxiv-daily: Automatically updates arXiv papers on LLM Reasoning, LLM Evaluation, LLM & MLLM and Video Understanding using GitHub Actions.
- OvO-R1: Explores how end-to-end reinforcement learning and various reward functions affect the reasoning capabilities of different 1.5B base models.
- Awesome-Multimodal-Object-Tracking (forked from 983632847/Awesome-Multimodal-Object-Tracking): A personal investigative project to track the latest progress in the field of multi-modal object tracking.
- CTVLT (forked from XiaokunFeng/CTVLT): [ICASSP'25] Enhancing Vision-Language Tracking by Effectively Converting Textual Cues into Visual Cues
- A vision-language tracking paper list documenting related articles.
- A visual object tracking paper list documenting related articles.
39 Updated Nov 6, 2024
- DTLLM-VLT: [CVPRW'24 Best Paper Honorable Mention Award] DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM
7 Updated Oct 7, 2024
- MemVLT (forked from XiaokunFeng/MemVLT): [NeurIPS'24] MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts
1 Updated Oct 7, 2024
- CPDTrack (forked from ZhangDailing8/CPDTrack): [NeurIPS'24] Beyond accuracy: Tracking more like Human via Visual Search
1 Updated Oct 6, 2024
- MGIT (forked from huuuuusy/videocube-toolkit): [NeurIPS'23] A Multi-modal Global Instance Tracking Benchmark (MGIT): Better Locating Target in Complex Spatio-temporal and Causal Relationship