By applying filters and analyzing frequencies, images can be processed and combined in interesting ways. In the first part of this project, edge detection is conducted by applying the Finite Difference Filter. Gaussian Filter is applied to get rid of the unnecessary wrinkles. Then, images are sharpened by stacking its edges onto itself. The second part of this project consists of two image binding tasks. The first task generates Hybrid Image by adding the high frequency of one image to the low frequency of another. Both successful and failing attempts are introduced. The second task blends images by applying the Gaussian Stack and the Laplacian Stack.
I detected the edges by applying the Finite Difference Filters dx and dy, denoted by dx = [1, -1], dy = [1, -1]^T respectively. The horizontal and vertical edges are calculated by conducting convolution on the image using the filters, and are combined into a single edge image using Euclidean Distance. To erase the wrinkles, I conducted binaryzation on the edge image. Figure 1 shows the original Cameraman image and its horizontal, vertical and combined edge image before and after their binaryzation.
dx |
dy |
combined |
The edge image shown in Figure 1f contains unwanted noise. To remove the noise, I smoothed the original image by applying Gaussian Filter with a kernel size of 9 and a sigma of 1.5 before edge detection. The edge image produced this time is shown in Figure 2a. In another attempt, I generated a Blurred Finite Difference Filter by conducting convolution on the Gaussian Filter and the two Finite Difference Filters, and applied the combined filters directly on the original image. The result of this second attempt is shown in Figure 2b.
The noiseless edge image obtained by the two attempts are almost identical.
Image sharpening can emphasize the details of the image, making the image "clearer" when observed by human eyes. I sharpened the images using the Unsharp Masking technic. First, a Gaussian Filter is applied to the target image to generate a blurred image, which contains the low frequency features of the image. Sigma in the Gaussian Filter is always set as 1/6 of the kernel size. Then, the high-frequency details are obtained by subtracting the low frequency features from the original image. These details are emphasized in the final image by \[ IMG_{sharpened} = IMG_{original} + \alpha \times IMG_{high-frequency}. \] Figure 3, 4 shows the sharpening result of the taj and forest image.
In order to evaluate my image sharpening approach, I took the tiger image, blurred it, and then sharpened it. The high frequency details lost in the blurring process should be restored. Figure 5 shows the result.
The tiger in the sharpened image seems more vivid than the tiger in the blurred image. This indicates that a lower resolution image can show similar visual effects on human eyes using image sharpening.
Human eyes can only identify color changes in a range of frequencies. When the colors change too swift or too slow on an image, humans can't identify the change. I combined two images by adding the low frequency features of one image and the high frequency details of the other. In this way, when looked afar, only the low frequency features from the first image would be noticed. When looked closely, only the high frequency details of the second image would be noticed. The following are three attempts of generating Hybrid Image.
By combining the high frequency details of Spiderman with mask on and the low frequency features of the face Peter Parker casted by Tobey Maguire, Spiderman should have mask on when looked closely. However, when look afar (or simply take off your nearsighted glasses like Peter did), Spiderman will reveal his true identity!
I analyzed the two Spiderman images in frequency domain before and after their filtering using Fast Fourier Transform, Figure 7 shows the result.
I combined the high frequency details of the Wednesday cartoon version and the low frequency features of Wednesday casted by Jenna Ortega. The cartoon Wednesday should appear when looked closely, and the Netfilx Wednesday will appear when looked afar.
I combined the high frequency details of Golden Gate Bridge image filmed last century and the low frequency features of the bridge filmed recently. Figure 9 shows the result.
The results are not satisfying as the older Golden Gate Bridge image can barely be seen even when looked closely. I believe this is because one of the photos are grayscale while the other is colored. It can be hard for a grayscale image to gain human eyes' attention when it is mixed with a colored image.
When blending two images together, it would seem more natural if the high-frequency patterns are not stacked together while the low-frequency background color change gradually between the images. I implemented Laplacian Stack to obtain the image features under different frequencies. Laplacian Stack is calculated using the Gaussian Stack. Each layer in the Gaussian Stack blurs the image from the last layer using the Gaussian Filter, with an increasing larger kernel size as the layers go deep. The first layer in the Gaussian Stack is the original image. Each layer in the Laplacian Stack is denoted by \[ lstack[i] = gstack[i] - gstack[i + 1]; \] \[ lstack[-1] = gstack[-1]. \] Each layer in the Laplacian Stack contains a certain range of frequency, and should be blended in different ways.
The images are blended under the instruction of a mask image. Each layer is assigned with a blurred version of the mask image. The deeper the layer, the more blurred (using larger Gaussian Filter) the mask image is. First, a blend image for each layer is generated, denoted by \[ blend[i] = lstack1[i] * mask[i] + lstack2[i] * (1 - mask[i]). \] Then, the blended layers are added together, and the result is the blending output.
Figure 10 shows my attempt of blending the apple image (Figure 10a) and the orange image (Figure 10c) using mask image (Figure 10b). Figure 11 shows the Laplacian Stack of this process, where the left column is the Laplacian Stack of the apple image, the right column is the Laplacian Stack of the orange image, and the middle column is the blended layers. The blending output is shown in Figure 10d.
I also blended earth and moon images to generate moonearth (imagine when human inhabits half of the moon), and blended my friend Mike and a kangaroo to generate Mikaroo (a scornful kangaroo). The results are shown in Figure 12 and 13 respectively.
When trying different ways of recoloring the moonearth image, I accidentally used the moonearth image generated last time as the new mask image. The blending result is amazing. It looks like lunar surface, but has clouds and oceans. I cannot explain this phenomenon, but here is the image.
* Finished on Sep 23, 2024.