Image Segmentation
So far we have learned to analyze ideal images of shapes, which we have set up and know exactly what they are. Usually, these are also black and white, which makes it easy for our programs to distinguish between “something and nothing”. These idealized cases are far from reality. Images which we have to process are usually colored, and we have no idea how to get what we need. An example would be trying to perform some analysis on the red part of this Gundam model kit.
How do we do that??? Well, we first need to teach our program to separate the red from the rest of the image. The image contains RGB values for each pixel. We can guess that the red parts of the gundam satisfy certain values of R, G and B. We could have a sample Region of Interest (ROI), however, that would probably just be a finite range of shades. Since we want to automate getting all of the red, it would be pointless to have our ROI as the whole image.
The solution comes in the representation of RGB in the Normalized Chromacity Coordinates. The idea is that each pixel will have,
\[\begin{aligned} q &= \frac{Q}{(R+G+B)} \text{ where q,Q are rgb and RGB}\\ r + g + b &= 1 \end{aligned}\]Because of this, we can represent the color of a pixel in terms of only two variables, \(r\) and \(g\). This greatly simplifies how we search for color. Our ROI will now satisfy only a small region in the NCC.
From the NCC representation of the ROI, we can use 2 methods to segment the image.
Parametric
We can take the distributions of the r ang g values in the ROI. This represents the distribution of the r and g values that represent the ROI, and other parts of our image. We fit Gaussian functions to these histograms, and we will use the probibilities of the Gaussian functions in deciding if a pixel is part of our target color. Of course, the closer the r ang g value of the pixel is, the more chances that it is going to be used. The result is shown below; you can see that it is quite noisy. This is because being a probabilistic approach, some pixels are not included even if they are red.
Histogram Backprojection
An alternative approach is histogram backprojection. Instead of taking probabilities, we instead look at the 2D histogram in the NCC.
All of the non black points represent viable colors that should match our desired color. In the parametric method, we assume Gaussian functions for both r and g. This is the same as saying that the 2D histogram is an ellipse. This isn’t the case as we can see now. The 2D histogram is more of a slanted ellipse.
In backprojection, we simply take all of the r and g values that are not zero, and use the pixels that have the same r and g values. The result is a much more acceptable segmentation shown below.
This is a fun activity, especially after seeing the color separation (by Image segmentation.) I give myself 10/10
Enjoy Reading This Article?
Here are some more articles you might like to read next: