You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I was experimenting with the code you guys have here, particularly with the parameter 'K', which I think means the maximum number of topics expected. I am encountering a problem owing to inconsistencies between the number of topics discovered in the topic-word distribution and doc-topic distribution outputs.
I see that with K=40, I get 40 unique topics in the doc_topic distribution but with K=100, I get only 2 topics in the doc_topic distribution with a very high probability >98%.
But for both K, in the topic_word distribution output, the number of topics reaches the limit we specify and have 40 and 100 unique topics respectively.
So, it looks like for the K we specify, the number of topics in the topic_word distribution always reaches this upper bound but the same is not the case with the output from doc_topic distribution.
Perhaps there are some parameters that I need to set or wrong assumptions that I am making? Please let me know.
Thank you for the great work!
The text was updated successfully, but these errors were encountered:
Hi,
I was experimenting with the code you guys have here, particularly with the parameter 'K', which I think means the maximum number of topics expected. I am encountering a problem owing to inconsistencies between the number of topics discovered in the topic-word distribution and doc-topic distribution outputs.
I see that with K=40, I get 40 unique topics in the doc_topic distribution but with K=100, I get only 2 topics in the doc_topic distribution with a very high probability >98%.
But for both K, in the topic_word distribution output, the number of topics reaches the limit we specify and have 40 and 100 unique topics respectively.
So, it looks like for the K we specify, the number of topics in the topic_word distribution always reaches this upper bound but the same is not the case with the output from doc_topic distribution.
Perhaps there are some parameters that I need to set or wrong assumptions that I am making? Please let me know.
Thank you for the great work!
The text was updated successfully, but these errors were encountered: