There is an extremely helpful class, `GridSearchCV`, in scikit-learn that does grid search and cross-validation, but I don't want to do cross-validation. I want to do grid search without cross-validation and use the whole dataset for training. To be more specific, I need to evaluate my model, built with `RandomForestClassifier`, by its "oob score" during grid search. Is there an easy way to do this, or should I write a class myself?
The points are:
- I'd like to do the grid search in an easy way.
- I don't want to do cross-validation.
- I need to train on the whole dataset (I don't want to split it into train and test sets).
- I need to use the oob score to evaluate each candidate during the grid search.
One method is to use `ParameterGrid` to make an iterator of the parameter combinations you want and loop over it.
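For example, a quick sketch of what `ParameterGrid` yields (the grid values here are made up for illustration):

```python
from sklearn.model_selection import ParameterGrid

# Hypothetical grid, just to show the iteration.
grid = {"n_estimators": [100, 300], "max_depth": [4, 8]}

for params in ParameterGrid(grid):
    print(params)  # e.g. {'max_depth': 4, 'n_estimators': 100}
```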
Another thing you could do is configure `GridSearchCV` itself to do what you want. I wouldn't recommend this, because it's unnecessarily complicated. What you would need to do is:
- take the `cv` arg from the docs and give it a generator which yields a tuple with all indices (so that train and test are the same);
- set the `scoring` arg to use the oob score given out by the random forest.

See this link: https://stackoverflow.com/a/44682305/2202107
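A minimal sketch of that configuration (the `oob_scorer` function, the toy data, and the grid values are my own illustration, not from the linked answer):

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

X, y = make_classification(n_samples=200, random_state=0)  # toy data
grid = {"n_estimators": [100, 300], "max_depth": [4, 8]}   # hypothetical grid

# Scorer that ignores the (identical) "test" split and returns the
# fitted estimator's out-of-bag score instead.
def oob_scorer(estimator, X, y):
    return estimator.oob_score_

# A single "split" in which both train and test are the whole dataset.
cv = [(slice(None), slice(None))]

rf = RandomForestClassifier(oob_score=True)  # oob_score=True is required
search = GridSearchCV(rf, param_grid=grid, scoring=oob_scorer, cv=cv)
search.fit(X, y)
print(search.best_params_, search.best_score_)
```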
He used `cv=[(slice(None), slice(None))]`, which is NOT recommended by sklearn's authors.

I would really advise against using OOB to evaluate a model, but it is useful to know how to run a grid search outside of `GridSearchCV()` (I frequently do this so I can save the CV predictions from the best grid for easy model stacking). I think the easiest way is to create your grid of parameters via `ParameterGrid()` and then just loop through every set of params. For example, assuming you have a grid dict named "grid" and an RF model object named "rf", then you can do something like this:
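A minimal sketch of that loop (it assumes `rf` was constructed with `oob_score=True` and that `X` and `y` hold the full training data):

```python
import numpy as np
from sklearn.model_selection import ParameterGrid

best_score = -np.inf
best_grid = None

# Refit on the whole dataset for every parameter combination and keep
# the combination with the highest out-of-bag score.
for g in ParameterGrid(grid):
    rf.set_params(**g)
    rf.fit(X, y)
    if rf.oob_score_ > best_score:
        best_score = rf.oob_score_
        best_grid = g

print("OOB: %0.5f" % best_score)
print("Grid:", best_grid)
```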