You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
init model cost:5.800448417663574
apply chat template cost:0.014918088912963867
diffusion generate cost:33.67275953292847
Hello! How can I assist you today?
decode cost:0.002490997314453125
full cost:39.49076247215271
The text was updated successfully, but these errors were encountered:
Hi there, the speed is related to max_new_tokens and steps. I just ran a test on one H800 GPU, and it costs 16s when setting max_new_tokens=512 and steps=512. So, I guess your speed seems reasonable considering the hardware difference.
I make a test for the sample code which supported on main page on A800.
I find the speed of diffusion was very slow, am I something wrong?
the prompt is
here is the time cost information:
The text was updated successfully, but these errors were encountered: