Introduction

TensorFlow provides several functions for resizing images.

When I looked them up, I found six:

tf.image.resize_images
tf.image.resize_area
tf.image.resize_bicubic
tf.image.resize_bilinear
tf.image.resize_nearest_neighbor
tf.image.resize_image_with_crop_or_pad

The documentation explains each of them, but the explanations did not really sink in for me.

So I tried them out to understand visually how they behave.

To state the conclusion first, tf.image.resize_images covers the behavior of the following four functions:

tf.image.resize_area
tf.image.resize_bicubic
tf.image.resize_bilinear
tf.image.resize_nearest_neighbor

That leaves only the following two to try:

tf.image.resize_images
tf.image.resize_image_with_crop_or_pad

The test images are Lena (256x256) and a cat (256x170). [Figure: the two input images]

Let's try them one by one.

tf.image.resize_images(images, new_height, new_width, method=0, align_corners=False)

resize_images resizes images to new_height x new_width using the specified method.

The input can be a 4D tensor [batch, height, width, channels] or a 3D tensor [height, width, channels]. With a 4D input, a whole batch of images is resized at once.

The return value is a 4D tensor [batch, new_height, new_width, channels] or a 3D tensor [new_height, new_width, channels], matching the rank of the input.
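As a quick check of the shape behavior described above, here is a minimal sketch. It assumes TensorFlow 2.x, where the same operation is exposed as tf.image.resize and the target size is passed as a single [new_height, new_width] argument rather than as the two separate arguments used in this article. The zero-filled tensors are placeholders, not real images.

```python
import tensorflow as tf

# Placeholder inputs (all zeros), standing in for real images.
single = tf.zeros([170, 256, 3])      # 3D: [height, width, channels]
batch = tf.zeros([4, 170, 256, 3])    # 4D: [batch, height, width, channels]

# Assumption: TensorFlow 2.x API, size given as [new_height, new_width].
resized_single = tf.image.resize(single, [128, 128])
resized_batch = tf.image.resize(batch, [128, 128])

print(resized_single.shape)  # rank 3 in, rank 3 out
print(resized_batch.shape)   # rank 4 in, rank 4 out
```

The rank of the result follows the rank of the input, so the same call works for one image or for a batch.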

First, try shrinking

Shrink the image to 128x128 with the following call.

tf.image.resize_images(image, 128, 128)

The result is as follows. [Figure: both images resized to 128x128]

Lena shrinks without problems, but the cat is distorted. In other words, **if the aspect ratio of new_width and new_height differs from that of the original, the resized image is distorted.** To avoid this, use resize_image_with_crop_or_pad. I'll try this later.
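The distortion follows directly from the arithmetic. The sketch below uses plain Python and a hypothetical helper, scale_factors, which is not part of TensorFlow, to compare the horizontal and vertical scale factors for the two test images.

```python
def scale_factors(width, height, new_width, new_height):
    """Return the (horizontal, vertical) scale factors of a plain resize."""
    return new_width / width, new_height / height


lena = scale_factors(256, 256, 128, 128)
cat = scale_factors(256, 170, 128, 128)

print(lena)  # both factors are equal, so the proportions are preserved
print(cat)   # the factors differ, so the cat is stretched vertically
```

Whenever the two factors differ, the image is squeezed along one axis relative to the other.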

Try changing the method

The resize_images function can take four methods:

ResizeMethod.BILINEAR: Bilinear interpolation (default)
ResizeMethod.NEAREST_NEIGHBOR: Nearest neighbor interpolation
ResizeMethod.BICUBIC: Bicubic interpolation
ResizeMethod.AREA: Area interpolation

The results of trying each one are as follows: [Figure: results for each of the four methods]

When the results are enlarged, you can see that each method blurs the original slightly differently. Each seems to have its own uses.
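To get a feel for why the methods look different, here is a one-dimensional sketch in plain Python comparing nearest neighbor and bilinear (linear) interpolation. It is only an illustration, not TensorFlow's implementation. The helper resize_row and its coordinate mapping (output index i samples source position i * len(row) / new_len) are assumptions made for this example.

```python
import math


def resize_row(row, new_len, method):
    """Resize a 1D list of pixel values with a simple coordinate mapping.

    Assumption for illustration: output index i samples the source
    position i * len(row) / new_len.
    """
    scale = len(row) / new_len
    out = []
    for i in range(new_len):
        src = i * scale
        # Index of the source pixel that the position falls in.
        lo = min(int(math.floor(src)), len(row) - 1)
        if method == "nearest":
            # Copy that pixel as-is: sharp edges, blocky when enlarged.
            out.append(row[lo])
        elif method == "bilinear":
            # Blend the pixel with its right-hand neighbor: smoother edges.
            hi = min(lo + 1, len(row) - 1)
            frac = src - lo
            out.append(row[lo] * (1 - frac) + row[hi] * frac)
        else:
            raise ValueError("unknown method: " + method)
    return out


row = [0, 100]
nearest = resize_row(row, 4, "nearest")
bilinear = resize_row(row, 4, "bilinear")
print(nearest)   # [0, 0, 100, 100]
print(bilinear)  # [0.0, 50.0, 100.0, 100.0]
```

Nearest neighbor only repeats existing values, while bilinear introduces an intermediate value at the edge, which is what shows up as blur in the enlarged images.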

align_corners

Below are the images with align_corners set to False and True: [Figure: results with align_corners False and True]

To be honest, I cannot really see the difference, but it seems that True scales the input by (new_height - 1) / (height - 1), while False scales it by new_height / height. Setting it to True apparently aligns the four corner pixels of the input and output exactly. I'm not sure when you would need it.
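The effect of the two scale factors is easier to see along a single axis. The sketch below is plain Python, not TensorFlow code. It assumes that output index i samples the input at position i * scale, where scale is the inverse of the ratio given above because it maps from output back to input. The helper source_positions is hypothetical.

```python
def source_positions(size, new_size, align_corners):
    """Map each output index to a position along one input axis.

    Assumption: output index i samples position i * scale, where scale is
    (size - 1) / (new_size - 1) with align_corners and size / new_size
    without it.
    """
    if align_corners and new_size > 1:
        scale = (size - 1) / (new_size - 1)
    else:
        scale = size / new_size
    return [i * scale for i in range(new_size)]


# Shrink an axis of 5 pixels (indices 0..4) to 3 pixels.
aligned = source_positions(5, 3, align_corners=True)
unaligned = source_positions(5, 3, align_corners=False)
print(aligned)    # first and last outputs land on the first and last inputs
print(unaligned)  # the last output falls short of the last input (index 4)
```

Under this assumption, True pins the first and last output pixels to the first and last input pixels, while False does not reach the last input pixel. That is a small shift, which may be why the two results are hard to tell apart by eye.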

tf.image.resize_image_with_crop_or_pad(image, target_height, target_width)

The resize_image_with_crop_or_pad function crops and/or pads the image to the specified size (target_height x target_width).

It brings the image to target_height x target_width by cropping around the center of the image or by padding it with black.

If the width or height is greater than target_width or target_height, respectively, the image is cropped around its center. Below, the 256x170 cat image is resized with target_height and target_width both set to 128. You can see that the center of the image has been cropped out. [Figure: cat image center-cropped to 128x128]

If the width or height is less than target_width or target_height, the image is padded with black. Below, the 256x170 cat image is resized with target_height and target_width both set to 196. The width is cropped around the center, while the top and bottom are padded. [Figure: cat image at 196x196, padded at top and bottom]
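The two cases above can be worked out per axis. The sketch below is plain Python with a hypothetical helper, crop_or_pad_axis. It only illustrates the arithmetic; how an odd difference is split between the two sides is a choice made here, not taken from TensorFlow.

```python
def crop_or_pad_axis(size, target):
    """Return (crop_before, crop_after, pad_before, pad_after) for one axis.

    Hypothetical helper: the split of an odd difference between the two
    sides is a choice made here, not taken from TensorFlow.
    """
    if size >= target:
        extra = size - target
        return extra // 2, extra - extra // 2, 0, 0
    missing = target - size
    return 0, 0, missing // 2, missing - missing // 2


# Cat image: width 256, height 170.
for target in (128, 196):
    print(target, "width:", crop_or_pad_axis(256, target),
          "height:", crop_or_pad_axis(170, target))
```

For 128x128 both axes are cropped (64 pixels from each side of the width, 21 from the top and bottom). For 196x196 the width loses 30 pixels on each side while the height gains 13 black rows at the top and bottom.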

Bonus

By combining resize_image_with_crop_or_pad and resize_images, you can shrink an image using only padding, without cropping.

Procedure

Get the size of the image.
Pass the length of the longer side as both target_height and target_width to resize_image_with_crop_or_pad.
Shrink the padded image with resize_images.
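The three steps above can be sketched as follows. This assumes TensorFlow 2.x, where the two functions are named tf.image.resize_with_crop_or_pad and tf.image.resize. The helper shrink_with_padding and the all-ones placeholder image are hypothetical and only for illustration.

```python
import tensorflow as tf


def shrink_with_padding(image, out_size):
    """Pad a [height, width, channels] image to a square, then shrink it."""
    # Step 1: get the size of the image.
    height, width = image.shape[0], image.shape[1]
    long_side = max(height, width)
    # Step 2: pad (never crop) up to long_side x long_side.
    padded = tf.image.resize_with_crop_or_pad(image, long_side, long_side)
    # Step 3: shrink the square image; the aspect ratio is preserved.
    return tf.image.resize(padded, [out_size, out_size])


cat = tf.ones([170, 256, 3])  # placeholder standing in for the cat image
small = shrink_with_padding(cat, 128)
print(small.shape)
```

Because the target of the padding step is the longer side, neither axis exceeds the target and nothing is cropped. The shorter axis is filled with black before the uniform shrink.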

Reference

TensorFlow/image

[PYTHON] I didn't understand the Resize of TensorFlow so I tried to summarize it visually.
