-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Issue] Tensor mismatch at Runtime #10
Comments
Hi Ratra, Thanks for bringing this up. I have not made any changes to the clip or llm part since last December. Thanks you |
Hi! Thank you for your prompt response! Could you please share the versions of the dependencies you used to debvelop your code? Because the demo colab notebook keeps throwing the same tensor shape mismatch error, even when replicated locally. I am suspecting that one or more of the dependencies has been updated to a never version, and the current code is not compatible with it. Thank you in advance, |
Here is my package info! Package Version Editable project location accelerate 0.21.0 |
Hi! Thank you so much for your reply! Just to update you on this issue, Huggingface updated the Transformers library to v4.39.1, and incrementally above. The updated versions are leading to the tensor mismatch issues in the code. The presented Colab notebook works as expected when used with transformers v4.39.0. Hope that helps anyone who comes across this same issue. :) Best, |
@RatraSahil can you run this github locally, I install the command but get different verision mention in issue, thx! |
RuntimeError: The expanded size of the tensor (628) must match the existing size (129) at non-singleton dimension 3. Target sizes: [1, 32, 1, 628]. Tensor sizes: [1, 1, 1, 129]
Issue: Upon running the demo colab notebook here, up until yesterday, I was able to get the expected results, but somehow since today the code throws the below mentioned error :
RuntimeError: The expanded size of the tensor (628) must match the existing size (129) at non-singleton dimension 3. Target sizes: [1, 32, 1, 628]. Tensor sizes: [1, 1, 1, 129]
The error gets triggered upon calling the following piece of code :
generate_ids = model.generate(**inputs, max_new_tokens=200)
Could you please have a look and possibly explain what causes the sudden mismatch?
Please note that the example instances used remain unchanged, i.e, the code is trhowing the tensor mismatch error even with the example included in the demo notebook.
The text was updated successfully, but these errors were encountered: