Using scikit-learn, how do I learn an SVM over a small data set?

Posted 2019-07-17 04:12

With scikit-learn, I have built a support vector machine, for a basic handwritten digit detection problem.

My total data set consists of 235 observations, each with 1025 features. I know that one of the advantages of a support vector machine is in situations like this, where the number of observations is modest and the number of features is large.

After my SVM is created, I look at my confusion matrix (below)...

Confusion Matrix:
[[ 6  0]
 [ 0 30]]

...and realize that holding out 15% of my data for testing (i.e., 36 observations) is not enough.

My problem is this: How can I work around this small data issue, using cross validation?

1 Answer
放我归山
#2 · 2019-07-17 04:24

This is exactly what cross-validation (and its generalizations, such as the 0.632 bootstrap estimator, Err^0.632) is for. A held-out test set is reasonable only when you have a large amount of data.
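A minimal sketch of what that looks like in scikit-learn. The original 235×1025 data set isn't available, so the bundled digits data stands in as a placeholder, and the `SVC` hyperparameters shown are illustrative defaults, not a tuned model:

```python
import numpy as np
from sklearn.datasets import load_digits
from sklearn.metrics import confusion_matrix
from sklearn.model_selection import StratifiedKFold, cross_val_predict, cross_val_score
from sklearn.svm import SVC

# Stand-in data: scikit-learn's bundled digits set acts as a placeholder
# for the question's own 235-observation data set.
X, y = load_digits(return_X_y=True)

clf = SVC(kernel="rbf", gamma="scale")

# Stratified 5-fold CV: each observation is held out exactly once, so the
# accuracy estimate uses all of the data instead of a single 15% split.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
scores = cross_val_score(clf, X, y, cv=cv)
print("mean accuracy: %.3f +/- %.3f" % (scores.mean(), scores.std()))

# cross_val_predict stitches the per-fold held-out predictions together,
# yielding a confusion matrix over the whole data set rather than over
# just 36 test observations.
y_pred = cross_val_predict(clf, X, y, cv=cv)
print(confusion_matrix(y, y_pred))
```

With only 235 observations, a larger number of folds (or leave-one-out, `LeaveOneOut`) trades more computation for an estimate that wastes even less data per fit.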
