I tried to run this code from github (following the 1-2-3 steps) which identifies 30 topics in Sarah Palin's 14,500 emails. The topics discovered by the author are here. However, Stanford Topic Modeling Toolbox is not producing lda-output directory for me. It produced the lda-86a58136-30-2b1a90a6, but the summary.txt in this folder only shows the initial assignment of topics, not the final one. Any idea how to produce lda-output directory with the final summary of topics discovered? Thanks in advance!
相关问题
- How to get a list of antonyms lemmas using Python,
- How to conditionally scale values in Keras Lambda
- Trying to understand Pytorch's implementation
- How to match dependency patterns with spaCy?
- LUIS - Can we use phrases list for new values in t
相关文章
- How to use cross_val_score with random_state
- How to measure overfitting when train and validati
- McNemar's test in Python and comparison of cla
- How to disable keras warnings?
- Invert MinMaxScaler from scikit_learn
- What's the difference between WordNet 3.1 and
- How should I vectorize the following list of lists
- ValueError: Unknown metric function when using cus
Have you tried the instructions posted here?
Note that I see the original investigator trained the model with Sarah Palin's emails, and then used that trained model to analyze Sarah Palin's emails. While I am not an LDA expert, this typically smacks of "finding what you have".
In most disciplines, training would be done over a known set of items which had been classified according to discriminant by experts. This means that the training would consist of feeding a set of data in known likely topics from other sources, and then would use the LDA library to determine distance from the topics in the "learned" database.
In any event, good luck.
In the event you encounter a specific issue, please post the error, and the steps you took to arrive at that error. Few people invest the time to attempt to reproduce an issue (a typical prerequisite for correcting an issue) without direction, or even the ability to determine if their encountered issue is similar to yours.