-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
About Training Costs #4
Comments
Sorry! My fault! I just saw your issue here... It's not recommended to use this repo to train it from scratch. Some developers tried it but the performance is not as good as the original version, though I try my best to recover every training details they used. By contrast, you can train from the pretrained checkpoint provided by GR-MG. For faster response you can send me an email...sorry again...I am working on video generation model and my own MimicTest policy toolbox in these days Best, |
For your original question, please refer to this issue They use 32 V100 32GB. But no worry, in my experience you can achieve roughly the same speed with 8*4090 GPU. If you open torch compiling option in my repo, it can even be 50% faster! |
Hello, I would like to know whether the pretrained checkpoint provided by GR-MG is "pretrained.pt". I used that checkpoint for training, but the results were very poor, far from the performance achieved with ByteDance's "snapshot_ABC.pt". I also want to know if your GR-Chunk is trained based on "snapshot_ABC.pt" or "pretrained.pt". I’m wondering if my training method is causing the issue. Thank you. |
Hello @1786707378, I haven't tried the pretrained checkpoint provided by GR-MG...Can you load it easily to my code? My GR-Chunk is based on "snapshot_ABC.pt" Could you please let me know which training method you are using? We can have a phone call on wechat if you have time (my ID: StarRingSpace) |
I think I made a mistake. I just made a simple attempt with your code and "pretrained. pt", and it seems that the model is incorrect. Thank you for your reply |
Emm I dont fully understand but good luck! |
Hi, Thanks for you great work to reproduce the training code for GR-1! I wonder how long it takes to complete the training process for GR-1 from scratch?[ABC->D setting]
The text was updated successfully, but these errors were encountered: