Data Augmentation for Object Detection using Deep

I have a question regarding data augmentation for training the deep neural network for object detection.

I have quite limited data set (nearly 300 images). I augmented the data by rotating each image from 0-360 degrees with stepsize of 15 degree. Consequently I got 24 rotated images out of just one. So in total, I got around 7200 images. Then I drew bounding box around the object of interest in each augmented image.

Does it seem to be a reasonable approach to enhance the data?

Best Regards

标签： machine-learning computer-vision neural-network deep-learning object-detection

4条回答

放我归山

2楼-- · 2019-08-01 04:19

It seems like you are on the right track, rotation is usually a very useful transformation for augmenting the training data. I would suggest to try other transformations like shift (you most probably want to detect partially present objects), zoom (makes your model invariant to the scale), shear, flip, etc. By combining different transformations you can introduce additional diversity in your training data. Training set of 300 images is a very small number, so you would definitely need more than one transformation to augment so tiny training set.

0人赞添加讨论(0) 举报

Luminary・发光体

3楼-- · 2019-08-01 04:20

This is a good approach as long as you don't implicitly change the labels when you do rotation. E.g. An image containing the digit 6 will become digit 9 on rotation of 180 deg. So, you've to pay some attention in such scenarios.

But, you could also do other geometric transformations like scaling, translation

Other augmentation that you can consider is using the pre-trained model such as ImageNet, if your problem domain has some resemblance to the ImageNet data. This will allow you to train deeper models even for your data scarce situation.

0人赞添加讨论(0) 举报

劳资没心，怎么记你

4楼-- · 2019-08-01 04:25

Even though rotation increases the representational complexity of your image, it might be not enough. Instead you probably need to add other types of augmentation as well.

Color augmentations are useful if they still represent the real distribution of your data.

Spatial augmentations work very good. Keep in mind that most modern systems use a lot of cropping, so that might help.

Actually I have a few scripts that I am trying to turn into a library that might work for you. Check them https://github.com/lozuwa/impy if you would like to.

0人赞添加讨论(0) 举报

Root（大扎）

5楼-- · 2019-08-01 04:29

In order to train a good model you need lots of representative data. Your augmentation is representative only for rotations, so yes, it is a good method, if you are concerned about having not enough object rotations. However, it will not help in any sense with generalization to other objects/transformations.

0人赞添加讨论(0) 举报

Data Augmentation for Object Detection using Deep

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间