How to output RandomForest Classifier from python?

2019-02-08 20:09发布

I have trained a RandomForestClassifier from Python Sckit Learn Module with very big dataset, but question is how can I possibly save this model and let other people apply it on their end. Thank you!

2条回答
老娘就宠你
2楼-- · 2019-02-08 20:47

The recommended method is to use joblib, this will result in a much smaller file than a pickle:

from sklearn.externals import joblib
joblib.dump(clf, 'filename.pkl') 

#then your colleagues can load it

clf = joblib.load('filename.pk1')

See the online docs

查看更多
别忘想泡老子
3楼-- · 2019-02-08 21:08

Have you tried pickling the RandomForestClassifier using the Pickle module and then saving it to the disk?

Here’s an example based on the pickle docs:

import pickle

classifier = RandomForestClassifier(etc)
output = open('classifier.pkl', 'wb')
pickle.dump(classifier, output)
output.close()

The “other people” could then reload the pickled object as follows:

import pickle

f = open('classifier.pkl', 'rb')
classifier = pickle.load(f)
f.close()
查看更多
登录 后发表回答