CSV format for OpenCV machine learning algorithms

Posted 2019-07-01 19:02

Machine learning algorithms in OpenCV appear to use data read in CSV format. See for example this cpp file. The data is read into an OpenCV machine learning class CvMLData using the following code:

CvMLData data;
data.read_csv( filename );

However, there does not appear to be any readily available documentation on the required format for the CSV file. Does anyone know how the CSV file should be arranged?

Other (non-OpenCV) programs tend to have one line per training example, beginning with an integer or string that indicates the class label.

Tags: csv opencv machine-learning computer-vision

1 answer

来，给爷笑一个

Reply #2 · 2019-07-01 19:45

Reading the source for that class, particularly the str_to_flt_elem function, together with the class documentation, I conclude that the valid formats for individual items in the file are:

1. Anything that can be parsed to a double by strtod
2. A question mark (?) or the empty string to represent missing values
3. Any string that doesn't parse to a double
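
The three cases can be sketched as a small standalone classifier. This is only an illustration of the rules as listed here, not OpenCV's own code: CellKind and classify_cell are hypothetical names, and the '?' default for the missing-value marker is taken from the list above.

```cpp
#include <cstdlib>
#include <string>

// Hypothetical names for illustration only; none of this is OpenCV API.
enum class CellKind { Number, Missing, Label };

// Classify one CSV cell according to the three cases listed above.
// miss_ch is the missing-value marker (assumed to be '?').
CellKind classify_cell(const std::string& cell, char miss_ch = '?') {
    // Case 2: an empty cell or a lone missing-value marker.
    if (cell.empty() || (cell.size() == 1 && cell[0] == miss_ch))
        return CellKind::Missing;

    // Case 1: strtod must consume the whole cell for it to count as a number.
    const char* begin = cell.c_str();
    char* end = nullptr;
    std::strtod(begin, &end);
    if (end != begin && *end == '\0')
        return CellKind::Number;

    // Case 3: anything else is treated as a text (class) label.
    return CellKind::Label;
}
```

Note that strtod also accepts spellings such as "inf" and "nan", so under this sketch those would count as numbers rather than labels.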

Items 1 and 2 are only valid for features. Anything matched by item 3 is assumed to be a class label, and as far as I can deduce the order of the items doesn't matter. The read_csv function automatically assigns each column in the CSV file the correct type, and (if you want) you can override which column holds the labels with set_response_idx. As for the delimiter, you can use the default (,) or set it to whatever you like with set_delimiter before calling read_csv (as long as you don't use the decimal point).

So, for example, this should work for 6 data points in 3 classes with 3 features per point:

A,1.2,3.2e-2,+4.1
A,3.2,?,3.1
B,4.2,,+0.2
B,4.3,2.0e3,.1
C,2.3,-2.1e+3,-.1
C,9.3,-9e2,10.4

You can move your text label to any column you want, or even have multiple text labels.
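
To check a file against these rules before handing it to read_csv, a rough standalone sketch along the following lines may help. It does not call OpenCV and is not how CvMLData is implemented; split_row, is_number and categorical_columns are hypothetical helpers, and the rule "a column is categorical if any cell is neither numeric nor missing" is an assumption based on the description above.

```cpp
#include <cstddef>
#include <cstdlib>
#include <string>
#include <vector>

// Hypothetical helpers for illustration; none of this is OpenCV API.

// Split one line on a single-character delimiter, keeping empty fields.
std::vector<std::string> split_row(const std::string& line, char delim = ',') {
    std::vector<std::string> cells;
    std::size_t start = 0;
    while (true) {
        std::size_t pos = line.find(delim, start);
        if (pos == std::string::npos) {
            cells.push_back(line.substr(start));
            break;
        }
        cells.push_back(line.substr(start, pos - start));
        start = pos + 1;
    }
    return cells;
}

// True if the cell is non-empty and strtod consumes all of it.
bool is_number(const std::string& cell) {
    const char* begin = cell.c_str();
    char* end = nullptr;
    std::strtod(begin, &end);
    return end != begin && *end == '\0';
}

// For each column, report whether it holds text labels (true) or
// numeric features (false). Empty cells and the missing-value marker
// do not affect the decision.
std::vector<bool> categorical_columns(const std::vector<std::string>& lines,
                                      char delim = ',', char miss_ch = '?') {
    std::vector<bool> categorical;
    for (const std::string& line : lines) {
        std::vector<std::string> cells = split_row(line, delim);
        if (cells.size() > categorical.size())
            categorical.resize(cells.size(), false);
        for (std::size_t i = 0; i < cells.size(); ++i) {
            const std::string& c = cells[i];
            bool missing = c.empty() || (c.size() == 1 && c[0] == miss_ch);
            if (!missing && !is_number(c))
                categorical[i] = true;
        }
    }
    return categorical;
}
```

Feeding the six example rows to categorical_columns should mark only the first column as categorical; moving the label to another column only changes which index is flagged.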
