After Width: | Height: | Size: 6.5 KiB |
After Width: | Height: | Size: 7.5 KiB |
After Width: | Height: | Size: 3.2 KiB |
After Width: | Height: | Size: 14 KiB |
After Width: | Height: | Size: 767 B |
After Width: | Height: | Size: 1.2 KiB |
After Width: | Height: | Size: 1.5 KiB |
After Width: | Height: | Size: 2.0 KiB |
After Width: | Height: | Size: 2.4 KiB |
After Width: | Height: | Size: 3.3 KiB |
After Width: | Height: | Size: 3.9 KiB |
After Width: | Height: | Size: 5.9 KiB |
After Width: | Height: | Size: 4.1 KiB |
After Width: | Height: | Size: 14 KiB |
After Width: | Height: | Size: 2.3 KiB |
@ -0,0 +1,86 @@ |
||||
Extract horizontal and vertical lines by using morphological operations {#tutorial_moprh_lines_detection} |
||||
============= |
||||
|
||||
Goal |
||||
---- |
||||
|
||||
In this tutorial you will learn how to: |
||||
|
||||
- Apply two very common morphology operators (i.e. Dilation and Erosion), with the creation of custom kernels, in order to extract straight lines on the horizontal and vertical axes. For this purpose, you will use the following OpenCV functions: |
||||
- @ref cv::erode |
||||
- @ref cv::dilate |
||||
- @ref cv::getStructuringElement |
||||
|
||||
in an example where your goal will be to extract the music notes from a music sheet. |
||||
|
||||
Theory |
||||
------ |
||||
|
||||
### Morphology Operations |
||||
Morphology is a set of image processing operations that process images based on predefined *structuring elements* known also as kernels. The value of each pixel in the output image is based on a comparison of the corresponding pixel in the input image with its neighbors. By choosing the size and shape of the kernel, you can construct a morphological operation that is sensitive to specific shapes regarding the input image. |
||||
|
||||
Two of the most basic morphological operations are dilation and erosion. Dilation adds pixels to the boundaries of the object in an image, while erosion does exactly the opposite. The amount of pixels added or removed, respectively depends on the size and shape of the structuring element used to process the image. In general the rules followed from these two operations have as follows: |
||||
|
||||
- __Dilation__: The value of the output pixel is the <b><em>maximum</em></b> value of all the pixels that fall within the structuring element's size and shape. For example in a binary image, if any of the pixels of the input image falling within the range of the kernel is set to the value 1, the corresponding pixel of the output image will be set to 1 as well. The latter applies to any type of image (e.g. grayscale, rgb, etc). |
||||
|
||||
![Dilation on a Binary Image](images/morph21.gif) |
||||
|
||||
![Dilation on a Grayscale Image](images/morph6.gif) |
||||
|
||||
- __Erosion__: The vise versa applies for the erosion operation. The value of the output pixel is the <b><em>minimum</em></b> value of all the pixels that fall within the structuring element's size and shape. Look the at the example figures below: |
||||
|
||||
![Erosion on a Binary Image](images/morph211.png) |
||||
|
||||
![Erosion on a Grayscale Image](images/morph61.png) |
||||
|
||||
### Structuring Elements |
||||
|
||||
As it can be seen above and in general in any morphological operation the structuring element used to probe the input image, is the most important part. |
||||
|
||||
A structuring element is a matrix consisting of only 0's and 1's that can have any arbitrary shape and size. Typically are much smaller than the image being processed, while the pixels with values of 1 define the neighborhood. The center pixel of the structuring element, called the origin, identifies the pixel of interest -- the pixel being processed. |
||||
|
||||
For example, the following illustrates a diamond-shaped structuring element of 7x7 size. |
||||
|
||||
![A Diamond-Shaped Structuring Element and its Origin](images/morph12.gif) |
||||
|
||||
A structuring element can have many common shapes, such as lines, diamonds, disks, periodic lines, and circles and sizes. You typically choose a structuring element the same size and shape as the objects you want to process/extract in the input image. For example, to find lines in an image, create a linear structuring element as you will see later. |
||||
|
||||
Code |
||||
---- |
||||
|
||||
This tutorial code's is shown lines below. You can also download it from [here](https://github.com/Itseez/opencv/tree/master/samples/cpp/tutorial_code/ImgProc/Morphology_3.cpp). |
||||
@includelineno samples/cpp/tutorial_code/ImgProc/Morphology_3.cpp |
||||
|
||||
Explanation / Result |
||||
-------------------- |
||||
|
||||
-# Load the source image and check if it is loaded without any problem, then show it: |
||||
@snippet samples/cpp/tutorial_code/ImgProc/Morphology_3.cpp load_image |
||||
![](images/src.png) |
||||
|
||||
-# Then transform image to grayscale if it not already: |
||||
@snippet samples/cpp/tutorial_code/ImgProc/Morphology_3.cpp gray |
||||
![](images/gray.png) |
||||
|
||||
-# Afterwards transform grayscale image to binary. Notice the ~ symbol which indicates that we use the inverse (i.e. bitwise_not) version of it: |
||||
@snippet samples/cpp/tutorial_code/ImgProc/Morphology_3.cpp bin |
||||
![](images/binary.png) |
||||
|
||||
-# Now we are ready to apply morphological operations in order to extract the horizontal and vertical lines and as a consequence to separate the the music notes from the music sheet, but first let's initialize the output images that we will use for that reason: |
||||
@snippet samples/cpp/tutorial_code/ImgProc/Morphology_3.cpp init |
||||
|
||||
-# As we specified in the theory in order to extract the object that we desire, we need to create the corresponding structure element. Since here we want to extract the horizontal lines, a corresponding structure element for that purpose will have the following shape: |
||||
![](images/linear_horiz.png) |
||||
and in the source code this is represented by the following code snippet: |
||||
@snippet samples/cpp/tutorial_code/ImgProc/Morphology_3.cpp horiz |
||||
![](images/horiz.png) |
||||
|
||||
-# The same applies for the vertical lines, with the corresponding structure element: |
||||
![](images/linear_vert.png) |
||||
and again this is represented as follows: |
||||
@snippet samples/cpp/tutorial_code/ImgProc/Morphology_3.cpp vert |
||||
![](images/vert.png) |
||||
|
||||
-# As you can see we are almost there. However, at that point you will notice that the edges of the notes are a bit rough. For that reason we need to refine the edges in order to obtain a smoother result: |
||||
@snippet samples/cpp/tutorial_code/ImgProc/Morphology_3.cpp smooth |
||||
![](images/smooth.png) |
@ -0,0 +1,127 @@ |
||||
/**
|
||||
* @file Morphology_3(Extract_Lines).cpp |
||||
* @brief Use morphology transformations for extracting horizontal and vertical lines sample code |
||||
* @author OpenCV team |
||||
*/ |
||||
|
||||
#include <iostream> |
||||
#include <opencv2/opencv.hpp> |
||||
|
||||
using namespace std; |
||||
using namespace cv; |
||||
|
||||
int main(int, char** argv) |
||||
{ |
||||
//! [load_image]
|
||||
// Load the image
|
||||
Mat src = imread(argv[1]); |
||||
|
||||
// Check if image is loaded fine
|
||||
if(!src.data) |
||||
cerr << "Problem loading image!!!" << endl; |
||||
|
||||
// Show source image
|
||||
imshow("src", src); |
||||
//! [load_image]
|
||||
|
||||
//! [gray]
|
||||
// Transform source image to gray if it is not
|
||||
Mat gray; |
||||
|
||||
if (src.channels() == 3) |
||||
{ |
||||
cvtColor(src, gray, CV_BGR2GRAY); |
||||
} |
||||
else |
||||
{ |
||||
gray = src; |
||||
} |
||||
|
||||
// Show gray image
|
||||
imshow("gray", gray); |
||||
//! [gray]
|
||||
|
||||
//! [bin]
|
||||
// Apply adaptiveThreshold at the bitwise_not of gray, notice the ~ symbol
|
||||
Mat bw; |
||||
adaptiveThreshold(~gray, bw, 255, CV_ADAPTIVE_THRESH_MEAN_C, THRESH_BINARY, 15, -2); |
||||
|
||||
// Show binary image
|
||||
imshow("binary", bw); |
||||
//! [bin]
|
||||
|
||||
//! [init]
|
||||
// Create the images that will use to extract the horizontal and vertical lines
|
||||
Mat horizontal = bw.clone(); |
||||
Mat vertical = bw.clone(); |
||||
//! [init]
|
||||
|
||||
//! [horiz]
|
||||
// Specify size on horizontal axis
|
||||
int horizontalsize = horizontal.cols / 30; |
||||
|
||||
// Create structure element for extracting horizontal lines through morphology operations
|
||||
Mat horizontalStructure = getStructuringElement(MORPH_RECT, Size(horizontalsize,1)); |
||||
|
||||
// Apply morphology operations
|
||||
erode(horizontal, horizontal, horizontalStructure, Point(-1, -1)); |
||||
dilate(horizontal, horizontal, horizontalStructure, Point(-1, -1)); |
||||
|
||||
// Show extracted horizontal lines
|
||||
imshow("horizontal", horizontal); |
||||
//! [horiz]
|
||||
|
||||
//! [vert]
|
||||
// Specify size on vertical axis
|
||||
int verticalsize = vertical.rows / 30; |
||||
|
||||
// Create structure element for extracting vertical lines through morphology operations
|
||||
Mat verticalStructure = getStructuringElement(MORPH_RECT, Size( 1,verticalsize)); |
||||
|
||||
// Apply morphology operations
|
||||
erode(vertical, vertical, verticalStructure, Point(-1, -1)); |
||||
dilate(vertical, vertical, verticalStructure, Point(-1, -1)); |
||||
|
||||
// Show extracted vertical lines
|
||||
imshow("vertical", vertical); |
||||
//! [vert]
|
||||
|
||||
//! [smooth]
|
||||
// Inverse vertical image
|
||||
bitwise_not(vertical, vertical); |
||||
imshow("vertical_bit", vertical); |
||||
|
||||
// Extract edges and smooth image according to the logic
|
||||
// 1. extract edges
|
||||
// 2. dilate(edges)
|
||||
// 3. src.copyTo(smooth)
|
||||
// 4. blur smooth img
|
||||
// 5. smooth.copyTo(src, edges)
|
||||
|
||||
// Step 1
|
||||
Mat edges; |
||||
adaptiveThreshold(vertical, edges, 255, CV_ADAPTIVE_THRESH_MEAN_C, THRESH_BINARY, 3, -2); |
||||
imshow("edges", edges); |
||||
|
||||
// Step 2
|
||||
Mat kernel = Mat::ones(2, 2, CV_8UC1); |
||||
dilate(edges, edges, kernel); |
||||
imshow("dilate", edges); |
||||
|
||||
// Step 3
|
||||
Mat smooth; |
||||
vertical.copyTo(smooth); |
||||
|
||||
// Step 4
|
||||
blur(smooth, smooth, Size(2, 2)); |
||||
|
||||
// Step 5
|
||||
smooth.copyTo(vertical, edges); |
||||
|
||||
// Show final result
|
||||
imshow("smooth", vertical); |
||||
//! [smooth]
|
||||
|
||||
waitKey(0); |
||||
return 0; |
||||
} |
After Width: | Height: | Size: 14 KiB |