Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parameter for number of maximum topics #9

Open
bvyshali opened this issue Jan 9, 2021 · 0 comments
Open

Parameter for number of maximum topics #9

bvyshali opened this issue Jan 9, 2021 · 0 comments

Comments

@bvyshali
Copy link

bvyshali commented Jan 9, 2021

Hi,

I was experimenting with the code you guys have here, particularly with the parameter 'K', which I think means the maximum number of topics expected. I am encountering a problem owing to inconsistencies between the number of topics discovered in the topic-word distribution and doc-topic distribution outputs.

  1. I see that with K=40, I get 40 unique topics in the doc_topic distribution but with K=100, I get only 2 topics in the doc_topic distribution with a very high probability >98%.

  2. But for both K, in the topic_word distribution output, the number of topics reaches the limit we specify and have 40 and 100 unique topics respectively.

So, it looks like for the K we specify, the number of topics in the topic_word distribution always reaches this upper bound but the same is not the case with the output from doc_topic distribution.

Perhaps there are some parameters that I need to set or wrong assumptions that I am making? Please let me know.

Thank you for the great work!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant