top of page

An In-Depth Guide to Image Processing with scikit-image and NumPy



I. Introduction to Image Processing


Definition of Image Processing


Image processing refers to the manipulation of images using algorithms to obtain an enhanced image or extract some meaningful information. It's like adjusting the brightness of a photograph using software, but on a more complex and insightful level.


Computer Vision


Image processing is a fundamental subset of computer vision, the broader field that teaches computers to interpret and make decisions based on visual data. If computer vision is the eyes, image processing is the process that makes sense of what the eyes see.


Applications


Image processing has a wide range of applications. From medical imaging to detect diseases to AI-powered surveillance systems for security, the field is as diverse as it is revolutionary.


Purposes


Image processing serves several main purposes, including:

  • Visualization: Making unseen aspects of images visible.

  • Sharpening: Enhancing images to reveal hidden details.

  • Retrieval: Finding images from a database that match a pattern.

  • Measurement: Measuring various patterns within an image.

  • Recognition: Recognizing different objects, features, or patterns within the image.


II. Working with scikit-image


Introduction to scikit-image


scikit-image is a collection of algorithms for image processing. It's built on top of SciPy and is easy to use. Think of it as a toolbox filled with all the instruments needed to work on an image.

from skimage import data, io

image = data.camera()
io.imshow(image)
io.show()


Understanding Digital Images


Digital images are essentially matrices of pixels. Each pixel represents a color, and together they form an image. Imagine a mosaic, where each tiny tile is a color, and together they create a beautiful picture.


Loading Images in scikit-image


With scikit-image, loading images is as easy as reading a book.

from skimage import data

image = data.camera()


RGB Channels


RGB channels are the building blocks of a colored image, representing Red, Green, and Blue colors. Think of them as the primary colors used to paint an image on a blank canvas.

import numpy as np

red_channel = image[:, :, 0]
green_channel = image[:, :, 1]
blue_channel = image[:, :, 2]


Grayscale Images


Grayscale images are like black-and-white photographs, containing only shades of gray. They are used when color information is not essential.

from skimage.color import rgb2gray

grayscale_image = rgb2gray(image)


RGB vs Grayscale


While RGB images use three channels to represent colors, grayscale images use only one. It's like comparing a colorful painting to a pencil sketch, both beautiful but serving different purposes.

# Convert a grayscale image back to RGB
from skimage.color import gray2rgb

rgb_image = gray2rgb(grayscale_image)


These are the introductory parts of the tutorial covering fundamental concepts. Please let me know when you are ready to continue with the rest of the content. The upcoming sections will dive deeper into visualizing images, manipulating them with NumPy, understanding histograms, thresholding, and more.


III. Visualizing Images


Visualization Methods


Visualizing images is an essential part of image processing. It's like previewing a photograph before printing it.

import matplotlib.pyplot as plt

plt.imshow(image, cmap='gray')
plt.axis('off')  # to turn off axes
plt.show()


Displaying Grayscale Images


Displaying a grayscale image is similar to viewing a black and white photograph.

plt.imshow(grayscale_image, cmap='gray')
plt.axis('off')
plt.show()


IV. Image Manipulation with NumPy


Introduction to NumPy for Images


NumPy is a powerful library for numerical computing in Python, including working with images. It's like having a calculator that can handle entire photographs.


Images as NdArrays


Images can be treated as multi-dimensional arrays. It's like a grid of numbers where each number represents a color or shade.

import numpy as np

image_array = np.array(image)
print(image_array.shape)


Working with Colors


You can slice multi-dimensional arrays to obtain specific color channels. It's like isolating the red hues in a painting.

red_channel = image_array[:, :, 0]


Shapes and Sizes


Understanding the shape and total number of pixels is akin to measuring the dimensions of a photograph.

height, width, channels = image_array.shape
total_pixels = height * width


Flipping Images


You can flip images both vertically and horizontally, like turning a picture upside down.

flipped_image = np.flipud(image_array)
plt.imshow(flipped_image)
plt.axis('off')
plt.show()


V. Understanding Histograms


Introduction to Histograms


A histogram is a graphical representation of the intensity distribution of an image. It's like a bar chart showing the popularity of different colors.

plt.hist(image.ravel(), bins=256, range=(0, 256))
plt.show()


Color Histograms


Creating histograms for colored images is like analyzing a painting's color palette.

plt.hist(red_channel.ravel(), bins=256, color='red', alpha=0.5, range=(0, 256))
plt.hist(green_channel.ravel(), bins=256, color='green', alpha=0.5, range=(0, 256))
plt.hist(blue_channel.ravel(), bins=256, color='blue', alpha=0.5, range=(0, 256))
plt.show()


Applications of Histograms


Histograms are used for various purposes in image processing, like enhancing contrast.


Visualization of Histograms


Visualizing histograms provides insights into color distribution.


VI. Getting Started with Thresholding


Introduction to Thresholding


Thresholding is a way to separate an object from its background, like cutting a figure from a piece of paper.

from skimage.filters import threshold_otsu

thresh_value = threshold_otsu(image)
binary_image = image > thresh_value
plt.imshow(binary_image, cmap='gray')
plt.axis('off')
plt.show()


How Thresholding Works


Thresholding is like drawing a line on a grayscale intensity scale. Pixels with intensity values above the line are converted to white, and those below to black.

threshold_value = 128
binary_image = grayscale_image > threshold_value
plt.imshow(binary_image, cmap='gray')
plt.axis('off')
plt.show()


Thresholding Categories


Thresholding can be broadly classified into global and local types, like different tools for cutting different shapes from a piece of paper.


Exploring Thresholding Algorithms


We can explore various algorithms to find the optimal thresholding value.

from skimage.filters import try_all_threshold

fig, ax = try_all_threshold(grayscale_image, figsize=(10, 8), verbose=False)
plt.show()


VII. Specific Techniques


Image processing has a plethora of techniques. Here we will delve into some

specific ones.


Edge Detection


Detecting edges in an image is akin to outlining shapes in a drawing.

from skimage.filters import sobel


edge_image = sobel(grayscale_image)
plt.imshow(edge_image, cmap='gray')
plt.axis('off')
plt.show()


Image Filtering


Filtering helps in noise reduction or feature enhancement. It's like adjusting the focus of a camera lens.

from skimage.filters import gaussian

filtered_image = gaussian(grayscale_image, sigma=1)
plt.imshow(filtered_image, cmap='gray')
plt.axis('off')
plt.show()


Morphological Operations


Morphological operations are like sculpting an image. You can erode or dilate features.

from skimage.morphology import erosion, dilation

eroded_image = erosion(binary_image)
dilated_image = dilation(binary_image)
plt.subplot(1, 2, 1)
plt.imshow(eroded_image, cmap='gray')
plt.title('Eroded')
plt.axis('off')
plt.subplot(1, 2, 2)
plt.imshow(dilated_image, cmap='gray')
plt.title('Dilated')
plt.axis('off')
plt.show()


Segmentation


Segmentation is the process of dividing an image into different parts, like cutting a cake into slices.

from skimage.segmentation import slic
from skimage.color import label2rgb

segments = slic(image)
segmented_image = label2rgb(segments, image, kind='avg')
plt.imshow(segmented_image)
plt.axis('off')
plt.show()


Conclusion


Image processing is an exciting field with a wide array of applications. From basic visualizations to complex segmentation, the tools and techniques explored in this tutorial open up endless possibilities for analysis and manipulation of images. Whether it's adjusting the focus with filters, carving shapes with morphological operations, or outlining objects through edge detection, the Python ecosystem provides powerful tools for image processing. Like a painter with a palette of colors, a data scientist with these techniques can create, transform, and understand the visual world in new and profound ways.

bottom of page