Skip to content

Empty responses for complex queries when running demo_multiturn_chat.py #5

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
tanhuajie opened this issue Apr 5, 2025 · 3 comments

Comments

@tanhuajie
Copy link

Hi team! Thanks for such an amazing work on this project!

I'm encountering an issue when running the provided demo_multiturn_chat.py script. The model seems to work fine with only simple inputs like "Hi" or "What's your name?".

Image

But when I try more complex questions such as:

  • "Can you recommend me some places to visit in Beijing?"
  • "How can I learn to play the violin?"

After running through the default 256 steps of forward diffusion, the model returns empty responses. This behavior seems inconsistent since simpler queries work as expected.

Image

Have you encountered this issue before? Any suggestions on how to resolve it or what might be causing this behavior? Thanks again!!

@zihaoli0629
Copy link

zihaoli0629 commented Apr 6, 2025

Same issue here. I also notice that when alg_temp tends to 0, the model behaves like an AR model and the accuracy improves significantly for math problems.

@jiacheng-ye
Copy link
Contributor

Hi there, the empty response is mostly because the model generates a sentence with all tokens, which may be because of the lower temperature and randomness. You can consider trying a slightly higher temperature.

Hi team! Thanks for such an amazing work on this project!

I'm encountering an issue when running the provided demo_multiturn_chat.py script. The model seems to work fine with only simple inputs like "Hi" or "What's your name?".

Image

But when I try more complex questions such as:

  • "Can you recommend me some places to visit in Beijing?"
  • "How can I learn to play the violin?"

After running through the default 256 steps of forward diffusion, the model returns empty responses. This behavior seems inconsistent since simpler queries work as expected.

Image

Have you encountered this issue before? Any suggestions on how to resolve it or what might be causing this behavior? Thanks again!!

@jiacheng-ye
Copy link
Contributor

Same issue here. I also notice that when alg_temp tends to 0, the model behaves like an AR model and the accuracy improves significantly for math problems.

Good observation!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants