
PDrop Method #11

Closed
jun0wanan opened this issue Jan 22, 2025 · 5 comments

Comments

@jun0wanan

I see that your config contains:

```json
"llm_compress_layer_list": [
  24
],
"llm_compress_type": "attention",
"llm_image_token_ratio_list": [
  1.0,
  0.5
],
```

However, your paper says:

> At the shallow layers of the LLM, we uniformly drop a small number of video tokens (i.e. uniform drop)

Can you tell me about that difference? Thanks!
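For context, here is one possible reading of the config fields quoted above. The field semantics are my assumption from the names, not taken from the repo's documentation:

```python
# Hypothetical interpretation of the config fields (assumption, not repo docs):
config = {
    "llm_compress_layer_list": [24],           # LLM layer index where compression happens
    "llm_compress_type": "attention",          # how tokens are scored for dropping
    "llm_image_token_ratio_list": [1.0, 0.5],  # keep-ratio before / after each stage
}

num_video_tokens = 1024  # example starting count
for layer, ratio in zip(config["llm_compress_layer_list"],
                        config["llm_image_token_ratio_list"][1:]):
    num_video_tokens = int(num_video_tokens * ratio)
    print(f"after layer {layer}: {num_video_tokens} video tokens kept")
```

Under this reading, half of the video tokens would be dropped once, at layer 24, using attention scores.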

@hello-bluedog

https://huggingface.co/OpenGVLab/VideoChat-Flash-Qwen2-7B_res224/blob/0018d8199ed96cae61adde768c73bea2e2cf4fbd/config.json#L167

An additional question: was PDrop not used during training?

@leexinhao
Collaborator

Sorry for the confusion; we set the drop_type at inference time:

[Image]

@hello-bluedog It was not used when training our final model, to stay compatible with techniques such as data packing and sequence parallelism (there is in fact no conflict; it is purely an engineering issue), and enabling drop during training had little impact on our ablation study.
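As an illustration only, here is a minimal sketch of what switching drop_type between "uniform" and "attention" at one layer might look like. This is written under assumed semantics and is not the repository's actual code; the function name and signature are hypothetical:

```python
import numpy as np

def drop_video_tokens(tokens, scores, keep_ratio, drop_type="attention"):
    """Keep a fraction of video tokens at one LLM layer (illustrative sketch).

    tokens: (num_tokens, dim) array of video-token hidden states
    scores: (num_tokens,) importance scores, e.g. text-to-video attention
    """
    num_keep = max(1, int(len(tokens) * keep_ratio))
    if drop_type == "uniform":
        # uniform drop: keep evenly spaced tokens, regardless of content
        idx = np.linspace(0, len(tokens) - 1, num_keep).astype(int)
    else:
        # attention drop: keep the highest-scoring tokens, preserving order
        idx = np.sort(np.argsort(scores)[-num_keep:])
    return tokens[idx], idx
```

With keep_ratio=0.5, the uniform variant thins the sequence evenly, while the attention variant keeps the tokens the text attends to most, which matches the distinction the paper draws between shallow-layer uniform drop and deeper attention-based drop.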

@jun0wanan
Copy link
Author

> Sorry for the confusion; we set the drop_type at inference time:
>
> [Image]
>
> @hello-bluedog It was not used when training our final model, to stay compatible with techniques such as data packing and sequence parallelism (there is in fact no conflict; it is purely an engineering issue), and enabling drop during training had little impact on our ablation study.

But there is only one drop_type; in the code it amounts to choosing a single method at a given layer. From your config, I see that the attention operation is performed at layer 24.

@hello-bluedog

I think the author means that this config is modified at inference time, and the parameters actually used are these?

[Image]

@leexinhao
Collaborator

> I think the author means that this config is modified at inference time, and the parameters actually used are these?
>
> [Image]

Yes.
