-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
NaiveBayesClassifier taking too long #63
Comments
It can take a while to Train the classifier. Once the classifier is trained, It seems to return polarity fairly quickly. On Thu, May 29, 2014 at 10:28 AM, canivel [email protected] wrote:
|
Also, is you time issue when Training, or When using it to classify a new On Thu, May 29, 2014 at 10:28 AM, canivel [email protected] wrote:
|
when using the training, look: def save_classifier(classifier): def load_classifier(): if os.path.isfile('semtiment_classifier.pickle'): c = cl.classify("This is a fantastic api!") it saves the classifier, but when I try to run again returns: Thanks for the help |
What you put above is virtually identical to what I have. Are you doing I use this method with NLTK classifiers and with SkLearn classifiers, and note: You might try to train classifier C1, then pickle it and load it as C2. On Thu, May 29, 2014 at 12:29 PM, canivel [email protected] wrote:
|
Thanks I got it working now... have to work out a lot of unicode characters in the dataset... just to let you now for a 942 tweets (pos/neg) it takes 31s to classify in a i7 new imac: Is there anything else to improve the execution time that you recommend... tks again! |
I'm experiencing this, but I cannot get rid of the training set because is in French, which means every string is UTF-8 encoded. I actually created a training set encoded as ASCII, ignoring the characters outside that encoding (which means losing a lot of data), and the training phase was still taking too much time. What can I do? |
Hi, I also noticed the TextBlob classifier to take very long in comparison to sklearn and the NLTK-classifier. But I don't understand why there is a difference, especially for NLTK. Isn't TextBlob using the NLTK version? |
Can I know how to get the accuracy and most informative features of the naive bayes classifier which is already trained using movie corpus. Thanks |
Experiencing similar performance issues. Using custom NaiveBayes classifier to train on 1500 article titles (pos, neg, NA). Takes about 10 minutes to train on core i7 and not much faster to classify the same number of titles with a pickled classifier (loading is fast, but classification is slow). Any ideas how to speed this up? I can understand that training is slow, but why is classification slow too? |
any update on this issue? |
#136 is now merged and released to PyPI. |
Hi, I've a small dataset of 1000 tweets which I've classify in pos/neg for training. When I tried to use it at the NaiveBayesClassifier() it tooks like 10-15min to return a result...
Is there a way to save the result of the classifier like a dump and reuse that for further classifications ?
Thanks
The text was updated successfully, but these errors were encountered: