
Is it possible to further reduce the RAM? #395

Open
ForcewithMe66 opened this issue Jun 17, 2023 · 5 comments

Comments

@ForcewithMe66

ForcewithMe66 commented Jun 17, 2023

I use multiple A6000 cards for pretraining. Each card has 49140 MiB of RAM.

I tried to pretrain LLaMA-7B with bf16-mixed:

batch_size = 60  # down from the default 125
micro_batch_size = 1  # 1 × 4 = 4 per iteration

It works well before backpropagation, taking 47+ of the 48 GB. But it goes OOM when it reaches the 15th step (when backpropagation runs).
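Reading these numbers with lit-llama-style gradient accumulation in mind (an assumption: 4 devices, as the "1 × 4 = 4" comment suggests), the 15th step would be exactly when the first optimizer.step() fires, and AdamW allocates its state buffers lazily on that first step:

```python
# Hypothetical reconstruction of the accumulation schedule; variable names
# mirror the pretraining script, and the device count is assumed.
batch_size = 60
micro_batch_size = 1
devices = 4

grad_accum_iters = batch_size // (micro_batch_size * devices)
print(grad_accum_iters)  # 15 -> first optimizer.step() on iteration 15,
                         # where AdamW lazily allocates its exp_avg and
                         # exp_avg_sq buffers: a large one-time memory jump
```

That would explain why memory sits at 47–48 GB for 14 iterations and only then tips over.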

Is there a way to make this work? I can come up with the following ideas, both of which would work, but I don't think they are the best choices:

  1. Change the precision from bf16-mixed to bf16-true. But as the BLOOM team reported, bfloat16 mixed-precision training can solve the instability problem.
  2. Reduce the context length (block_size). A sketch of both options follows this list.
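For concreteness, a minimal sketch of both options, assuming the lit-llama-style setup where precision is passed to lightning.Fabric and the context length lives in the model config's block_size field (names follow lit-llama; adjust to your checkout):

```python
import lightning as L

from lit_llama.model import LLaMA, LLaMAConfig

# Option 1: bf16-true keeps the weights themselves in bfloat16 (no fp32
# master copy), cutting weight and optimizer memory vs bf16-mixed, at the
# stability cost mentioned above.
fabric = L.Fabric(devices=4, precision="bf16-true")  # was "bf16-mixed"
fabric.launch()

# Option 2: a shorter context; activation memory grows with block_size
# (quadratically in the attention matrix unless flash attention is used).
config = LLaMAConfig.from_name("7B")
config.block_size = 1024  # reduced from the default

with fabric.device:
    model = LLaMA(config)
```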
@carmocca
Contributor

Yes, those are both ways to reduce the memory requirement. I will also push a fix soon that re-enables flash attention: Lightning-AI/litgpt#171
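For context on what that buys: in PyTorch 2.0 the usual route to flash attention is torch.nn.functional.scaled_dot_product_attention, which dispatches to a fused FlashAttention kernel when eligible and avoids materializing the full T × T attention matrix. A rough before/after sketch (illustrative shapes, not the PR's exact diff):

```python
import torch
import torch.nn.functional as F

# q, k, v: (batch, n_heads, seq_len, head_dim)
q = torch.randn(1, 32, 2048, 128, device="cuda", dtype=torch.bfloat16)
k, v = torch.randn_like(q), torch.randn_like(q)

# Before: a manual implementation materializes a (seq_len x seq_len)
# score matrix per head, which dominates activation memory at T = 2048:
#   att = (q @ k.transpose(-2, -1)) / math.sqrt(k.size(-1))
#   att = att.masked_fill(mask, float("-inf")).softmax(dim=-1)
#   y = att @ v

# After: one fused kernel, no full attention matrix in memory.
y = F.scaled_dot_product_attention(q, k, v, is_causal=True)
```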

@ForcewithMe66
Author

Hi @carmocca, can Lightning-AI/litgpt#171 save some RAM during pretraining and fine-tuning?

@carmocca
Contributor

Yes. Would you like to port the changes from that PR here? Otherwise I can do it.

@ForcewithMe66
Author

Hi @carmocca, I am not familiar with that, so I am afraid I can't port the changes from lit-parrot to lit-llama here.

@ruoyu61

ruoyu61 commented Jul 23, 2023

@carmocca I made a PR for lit-llama by following your PR. However, after I ran finetune/adapter.py on an A100, the memory increased from 19.5 GB to 20.3 GB. Any idea what I did wrong?
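One thing worth ruling out before assuming the port is wrong: a single nvidia-smi reading includes allocator caching, so comparing torch.cuda.max_memory_allocated() across runs is a cleaner signal, and restricting SDPA to the flash kernel makes a silent fallback to the memory-hungry math path fail loudly instead. A sketch with the PyTorch 2.0-era API (the q/k/v shapes are illustrative stand-ins for the adapter's attention):

```python
import torch
import torch.nn.functional as F
from torch.backends.cuda import sdp_kernel  # PyTorch 2.0 API

q = torch.randn(1, 32, 2048, 128, device="cuda",
                dtype=torch.bfloat16, requires_grad=True)
k, v = torch.randn_like(q), torch.randn_like(q)

torch.cuda.reset_peak_memory_stats()

# Allow only the flash kernel: if the dtype, shapes, or an explicit
# attn_mask make flash ineligible, this raises instead of quietly
# running the math path.
with sdp_kernel(enable_flash=True, enable_math=False, enable_mem_efficient=False):
    y = F.scaled_dot_product_attention(q, k, v, is_causal=True)
    y.sum().backward()

print(f"peak allocated: {torch.cuda.max_memory_allocated() / 1e9:.2f} GB")
```

If the ported code passes an explicit attention mask rather than is_causal=True, the flash kernel may not be selected at all, which could explain seeing no savings.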
