
Question:

My problem statement is as follows:

"Object Detection and Localization using TensorFlow and a convolutional neural network"

What I did:

I have finished cat detection from images using the tflearn library. I successfully trained a model on 25000 images of cats, and it is working fine with good accuracy.

Current Result :

What I want to do:

If an image contains two or more objects, for example a cat and a dog together, the result should be 'cat and dog'. Apart from this, I have to find the exact location of each of these objects in the image (a bounding box).

I came across many high-level libraries like darknet and SSD, but I was not able to understand the concepts behind them.

Please guide me about the approach to solve the problem.

Note: I am using supervised learning techniques.

Expected Result :

Answer 1:

You have several ways to go about it.

The most straightforward way is to get candidate bounding boxes from a region proposal algorithm such as selective search, and then run the classification net you have already trained on each of the proposals. This is the approach taken by R-CNN.
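The proposal-then-classify loop can be sketched roughly as below. This is only a minimal illustration, not R-CNN itself: a fixed sliding-window grid stands in for selective search, `classify` is a hypothetical callable wrapping your trained net (assumed here to return a label and a score for the image region inside a box, with a "background" label for regions that contain no object), and the thresholds are arbitrary. Overlapping detections of the same class are merged with non-maximum suppression.

```python
from typing import Callable, List, Tuple

Box = Tuple[int, int, int, int]        # (x1, y1, x2, y2) in pixels
Detection = Tuple[Box, str, float]     # (box, label, score)


def sliding_window_proposals(width: int, height: int,
                             size: int, stride: int) -> List[Box]:
    """Stand-in for selective search: square windows on a regular grid."""
    return [
        (x, y, x + size, y + size)
        for y in range(0, height - size + 1, stride)
        for x in range(0, width - size + 1, stride)
    ]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two boxes."""
    inter_w = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def non_max_suppression(dets: List[Detection],
                        iou_threshold: float = 0.5) -> List[Detection]:
    """Keep the highest-scoring box among same-class boxes that overlap."""
    kept: List[Detection] = []
    for det in sorted(dets, key=lambda d: d[2], reverse=True):
        overlaps_kept = any(
            det[1] == k[1] and iou(det[0], k[0]) >= iou_threshold
            for k in kept
        )
        if not overlaps_kept:
            kept.append(det)
    return kept


def detect(proposals: List[Box],
           classify: Callable[[Box], Tuple[str, float]],
           score_threshold: float = 0.5,
           iou_threshold: float = 0.5) -> List[Detection]:
    """Classify every proposal, drop weak/background ones, then apply NMS.

    `classify` is hypothetical: in practice it would crop the image to the
    box, resize the crop to the net's input size, and run the trained model.
    """
    dets: List[Detection] = []
    for box in proposals:
        label, score = classify(box)
        if label != "background" and score >= score_threshold:
            dets.append((box, label, score))
    return non_max_suppression(dets, iou_threshold)
```

Note that this only works if the classifier can reject regions that contain no object, so a net trained only on tightly cropped cats and dogs would likely need an extra "background" class (or a confidence threshold) before being used this way.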

For more advanced algorithms based on the above approach, I suggest you read about Fast R-CNN and Faster R-CNN.

See the question "Object detection with R-CNN?" for a basic explanation.

Darknet (YOLO) and SSD are based on a different approach. If you want to understand them, you can read the papers:

http://www.cs.unc.edu/~wliu/papers/ssd.pdf

https://pjreddie.com/media/files/papers/yolo.pdf