Random forest is a robust algorithm. It trains an ensemble of trees on bootstrap samples and comes with an out-of-bag (OOB) accuracy estimate built in. Given that, is it still necessary to run cross-validation with a random forest as well?
If you just want to use the model and don't care about quantifying the risk of overfitting, you do not need to perform any kind of validation at all.
For scientific publishing (or anything else where you compare the quality of different classifiers), you should validate your results, and cross-validation is best practice here.
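For example, here is a minimal scikit-learn sketch (with a synthetic dataset standing in for your real `X, y`) that reports both the OOB estimate and a cross-validated estimate for the same model family:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

# Toy data standing in for your real X, y.
X, y = make_classification(n_samples=1000, n_features=20, random_state=0)

# oob_score=True asks the forest to score each sample using only the
# trees that did NOT see it in their bootstrap draw.
rf = RandomForestClassifier(n_estimators=200, oob_score=True, random_state=0)
rf.fit(X, y)
print("OOB accuracy:", rf.oob_score_)

# The same model class evaluated with 5-fold cross-validation.
cv_scores = cross_val_score(
    RandomForestClassifier(n_estimators=200, random_state=0), X, y, cv=5
)
print("CV accuracy: ", cv_scores.mean())
```

On a reasonably sized dataset the two numbers are usually close; when they are, the OOB score alone is often enough for everyday use.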
OOB error is an approximately unbiased estimate of the generalization error for random forests, so that's great on its own. But what are you using the cross-validation for? If you are comparing the RF against some other algorithm that doesn't use bagging in the same way, you want a low-variance way to compare them. You have to use cross-validation anyway to evaluate the other algorithm, and reusing the same cross-validation splits for both the RF and the competitor is still a good idea: it removes the variance caused by the split selection.
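A sketch of that idea in scikit-learn, with a logistic regression as a hypothetical competitor: pass one fixed splitter object to both evaluations so every fold compares the two classifiers on identical train/test data.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score

X, y = make_classification(n_samples=1000, n_features=20, random_state=0)

# One fixed splitter, reused for both models, so the comparison is
# not confounded by different train/test splits.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)

rf_scores = cross_val_score(RandomForestClassifier(random_state=0), X, y, cv=cv)
lr_scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=cv)

print("RF per-fold accuracy:", rf_scores)
print("LR per-fold accuracy:", lr_scores)
```

Because the per-fold scores are paired, they also support a paired significance test between the two classifiers.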
If you are comparing one RF against another RF with a different feature set, then comparing OOB errors is reasonable. This is especially true if you make sure both RFs use the same bagging sets during training.
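A hedged sketch of that comparison in scikit-learn (the feature subsets here are made up for illustration). Note that fixing the forests' `random_state` is only an approximation to "same bagging sets": scikit-learn does not expose the bootstrap indices directly, so exact control would need a custom bagging loop.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

X, y = make_classification(n_samples=1000, n_features=20, random_state=0)

# Hypothetical feature subsets; substitute your own column selections.
feature_sets = {"A": list(range(0, 10)), "B": list(range(5, 20))}

for name, cols in feature_sets.items():
    # Same random_state and n_estimators is an attempt to keep the
    # bootstrap draws aligned across the two forests.
    rf = RandomForestClassifier(n_estimators=200, oob_score=True, random_state=0)
    rf.fit(X[:, cols], y)
    print(f"feature set {name}: OOB accuracy = {rf.oob_score_:.3f}")
```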