Fully Convolutional Network Training Image Size

I'm trying to replicate the results of Fully Convolutional Network (FCN) for Semantic Segmentation using TensorFlow.

I'm stuck on feeding training images into the computation graph. The fully convolutional network used VOC PASCAL dataset for training. However, the training images in the dataset are of varied sizes.

I just want to ask if they preprocessed the training images to make them have the same size and how they preprocessed the images. If not, did they just feed batches of images of different sizes into the FCN? Is it possible to feed images of different sizes in one batch into a computation graph in TensorFlow? Is it possible to do that using queue input rather than placeholder?

标签： machine-learning computer-vision tensorflow deep-learning caffe

2条回答

爱情/是我丢掉的垃圾

2楼-- · 2020-02-11 06:20

It's not possible to feed images of different size into a single input batch. Every batch can have an undefined number of samples (that's the batch size usually, below noted with None) but every sample must have the same dimensions.

When you train a fully convolutional network you have to train it like a network with fully connected layers at the end. So, every input image in the input batch must have the same widht, height and depth. Resize them.

The only difference is that while fully connected layers output a single output vector for every sample in the input batch (shape [None, num_classes]) the fully convolutional outputs a probability map of classes.

During train, when the input images dimensions are equals to the network input dimensions, the output will be a probability map with shape [None, 1, 1, num_classes].

You can remove the dimensions of size 1 from the output tensor using tf.squeeze and then calculate the loss and accuracy just like you do with a fully connected network.

At test time, when you feed the network images with dimensions greater than the input, the output will be a probability map with size [None, n, n, num_classes].

0人赞添加讨论(0) 举报

在下西门庆

3楼-- · 2020-02-11 06:31

You can use batch size = 1
You can resize images to fixed size like 256,256
You can resize each batch to median image size of it's content.

0人赞添加讨论(0) 举报

Fully Convolutional Network Training Image Size

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间