Merge pull request #510 from andreamerello:master

9 years ago · 06ecd05e29
parent 6f64ad6892 6bd381cf4b
commit 06ecd05e29
12 changed files with 208 additions and 8 deletions
--- a/modules/bioinspired/tutorials/retina_illusion/images/checkershadow_illusion4med.jpg
+++ b/modules/bioinspired/tutorials/retina_illusion/images/checkershadow_illusion4med.jpg
--- a/modules/bioinspired/tutorials/retina_illusion/images/checkershadow_illusion4med_proof.jpg
+++ b/modules/bioinspired/tutorials/retina_illusion/images/checkershadow_illusion4med_proof.jpg
--- a/modules/bioinspired/tutorials/retina_illusion/images/checkershadow_parvo.png
+++ b/modules/bioinspired/tutorials/retina_illusion/images/checkershadow_parvo.png
--- a/modules/bioinspired/tutorials/retina_illusion/images/checkershadow_parvo_proof.png
+++ b/modules/bioinspired/tutorials/retina_illusion/images/checkershadow_parvo_proof.png
--- a/modules/bioinspired/tutorials/retina_illusion/retina_illusion.markdown
+++ b/modules/bioinspired/tutorials/retina_illusion/retina_illusion.markdown
@ -0,0 +1,187 @@
+Processing images causing optical illusions {#tutorial_bioinspired_retina_illusion}
+=============================================================
+
+Goal
+----
+
+I will show here how the bioinspired module can reproduce a well-known optical illusion that
+our eyes perceive in certain light condition: The Adelson checkerboard.
+
+The Adelson checkerboard
+------------------------
+
+Looking at the checkerboard image below, human eyes perceive the "B" square lighter than the
+"A" square, although they are pictured in the very same RGB color.
+Of course in the physical world, checkerboard has a "B" square which is lighter than "A", but in this image the
+shadow of the green cylinder casting over the "B" square ends up in making the "A" and "B"
+squares actually having the same luminance.
+
+![Adelson checkerboard](images/checkershadow_illusion4med.jpg)
+
+Our visual system does "compensate" for the shadow, making us perceive the "B" square lighter,
+as the shadow wouldn't be there. This is due to local adaptation process that is performed in the
+foveal area.
+
+You may find the original Adelson's explanation [here](http://web.mit.edu/persci/people/adelson/checkershadow_description.html).
+
+Proof: You can convince yourself by using an image manipulation program, cutting out a portion
+of the two squares, and looking at them without any background. You can also measure the RGB
+values of the two squares with the picker tool.
+
+In this image I've cropped a little piece of the A and B squares and I've put them side-by-side.
+It should be quite evident they have the same luminance.
+![Adelson checkerboard proof](images/checkershadow_illusion4med_proof.jpg)
+
+It's worth to know that this illusion works because the checkerboard image, as you may see it
+on your laptop, casts on your retina with dimensions that cause the retina local adaptation to take
+into account both the two squares at the same time.
+
+The foveal vision area is something like one inch at one meter (and because your eye moves
+continuously, with the so called "saccades", your brain is able to reconstruct the entire
+color scene in real time). This means that one single letter, either A or B, can hit
+your fovea at any time.
+
+The point is that, even if you can't see both letters at the same time in a single eye fixation,
+when looking at one letter your fovea also takes into account light information from what is around it.
+This means that the fovea actually perceives also the neighboring cells.
+
+The net effect is that when looking at one area, your eye locally adapts to luminance, filters noise,
+enforces contours, etc. considering what *surrounds* this area, and this makes the illusion work. We
+say that *the retina works in a "center surround" manner*.
+
+So, the "A" cell being surrounded by lighter cells can be perceived darker. As a comparison, cell "B" 's
+neighborhood is darker and the cell "B" is then perceived lighter.
+
+Finally, since shadow edges are soft, retina eliminates this information. Then shadows do not disrupt the overall chessboard observation making possible to "confidently being fooled" by the perceived cells luminance.
+
+Reproducing the illusion
+------------------------
+The bioinspired module does mimic (also) the parvocellular retina process, that is our foveal
+vision, and it does reproduce our eyes' local adaptation.
+
+This means we can expect the parvo channel output to really contain luminance values
+similar to those we perceive with our eyes. Specifically, in this case we expect the "B" square
+RGB values to be actually lighter than the "A" ones.
+
+To correctly mimic what our eye does we need opencv to do the local adaptation on the right
+image portion. This means we have to ensure that the opencv's notion of "local" does match with our
+image's dimensions, otherwise the local adaptation wouldn't work as expected.
+
+For this reason we may have to adjust the **hcellsSpatialConstant** parameter (that technically
+specifies the low spatial cut frequency, or slow luminance changes sensitivity) depending by
+the image resolution.
+
+For the image in this tutorial, the default retina parameters should be fine.
+
+In order to feed the image to the bioinspired module, you can use either your own code or
+the *example_bioinspired_retinaDemo* example that comes with the bioinspired module.
+
+Running
+@code{.sh}
+example_bioinspired_retinaDemo -image checkershadow_illusion4med.jpg
+@endcode
+
+will cause our image to be processed in both parvocellular and magnocellular channels (we are interested
+just in the first one).
+
+If you choose to use your own code, please note that the parvocellular (and magnocellular)
+channel does require some iterations (frames to be processed) before actually getting steady.
+
+Actually parvo (and magno) channel do cares about temporal information. That is, when you start
+feeding frames, it is similar to you with closed eyes; then you open them and you see the chessboard.
+
+This is a static image but your retina just starts moving to a new context (eyes opening) and
+has to adapt.
+
+While in this transient state the luminance information do matters, and you see more or less
+the absolute luminance values. Absolute luminance is exactly what you need **not** to look at in
+order to reproduce the illusion..
+
+As soon as steady state is reached, you receive more contextual luminance information. Your eyes work
+in a center-surround manner and take into account the neighborhood luminance to evaluate the
+region of interest luminance level. And that's when our illusion comes out !
+
+This is something that you don't need to worry about when you process videos, because you are
+naturally feeding the virtual retina with several frames, but you have to take care of it in
+order to process a single frame.
+
+What you will actually need to do when processing a single frame, and you only need steady state response,
+is to repeatedly feed the retina with the same frame (this is what the example code does), as you
+would do with a still video. Alternatively you can set retina temporal parameters to 0 to get steady state immediately
+(**photoreceptorsTemporalConstant** and **hcellsTemporalConstant** parameters of the xml file); however
+in this case you should be aware that you are now making experiments with something that is
+deliberately less accurate in reproducing the behaviour of a real retina!
+
+Here there is a small fragment of python code we used to process the image. It does 20
+iterations. This is an arbitrary number that we found experimentally to be (more than)
+enough
+
+@code{.py}
+import cv2
+
+inputImage = cv2.imread('checkershadow_illusion4med.jpg', 1)
+retina = cv2.bioinspired.createRetina((inputImage.shape[1], inputImage.shape[0]))
+
+# the retina object is created with default parameters. If you want to read
+# the parameters from an external XML file, uncomment the next line
+#retina.setup('MyRetinaParameters.xml')
+
+# feed the retina with several frames, in order to reach 'steady' state
+for i in range(20):
+    retina.run(inputImage)
+
+# get our processed image :)
+retinaOut_parvo = retina.getParvo()
+
+# show both the original image and the processed one
+cv2.imshow('image', inputImage)
+cv2.imshow('retina parvo out', retinaOut_parvo)
+
+# wait for a key to be pressed and exit
+cv2.waitKey(0)
+cv2.destroyAllWindows()
+
+# write the output image on a file
+cv2.imwrite('checkershadow_parvo.png', retinaOut_parvo)
+@endcode
+
+Whatever method you used to process the image, you should end up
+with something like this:
+
+![Parvo output for adelson checkerboard](images/checkershadow_parvo.png)
+
+Analyzing the results
+----------------------
+
+We expected that the "B" pixels in the parvo channel output are lighter than "A" ones.
+
+.. And in fact that is!
+
+Looking at the resulting image might not tell us so much at a first glance: the "B" square looks
+lighter than "A" to our eyes, as it did in the input image. The difference is that, contrarily to
+the input image, now the RGB values of the pixels are actually lighter; note that when looking at
+the output image, we are actually  applying the parvocellular process
+two times: first in the bioinspired module, then in our eyes.
+We can convince ourselves that the illusion appeared
+in the computed image by measuring the squares' luminance with the image manipulation program
+and the picker tool, or by cropping pieces of the squares and putting them side-by-side.
+
+In the following image I cropped a portion of square "A" and a portion of square "B", and I placed
+them side-by-side, as I did for the original Adelson image.
+
+![Illusion reproduced](images/checkershadow_parvo_proof.png)
+
+It should be quite evident that the "B" square is really lighter than the "A" square! Congratulations: you have
+just reproduced the Adelson illusion with the Bioinspired module!
+
+Credits
+-------
+
+I want to thank:
+
+**Alexandre Benoit** - for being so kind of explaining me how this whole thing works, for giving me the
+opportunity of writing this tutorial, and for reviewing it.
+
+**Edward Adelson** - for allowing me to freely use his checkerboard image.
+
+**Antonio Cuni**  - for reviewing this tutorial and for writing the Python code.
--- a/modules/bioinspired/tutorials/retina_model/images/retina_TreeHdr_retina.jpg
+++ b/modules/bioinspired/tutorials/retina_model/images/retina_TreeHdr_retina.jpg
--- a/modules/bioinspired/tutorials/retina_model/images/retina_TreeHdr_small.jpg
+++ b/modules/bioinspired/tutorials/retina_model/images/retina_TreeHdr_small.jpg
--- a/modules/bioinspired/tutorials/retina_model/images/studentsSample_input.jpg
+++ b/modules/bioinspired/tutorials/retina_model/images/studentsSample_input.jpg
--- a/modules/bioinspired/tutorials/retina_model/images/studentsSample_magno.jpg
+++ b/modules/bioinspired/tutorials/retina_model/images/studentsSample_magno.jpg
--- a/modules/bioinspired/tutorials/retina_model/images/studentsSample_parvo.jpg
+++ b/modules/bioinspired/tutorials/retina_model/images/studentsSample_parvo.jpg
--- a/modules/bioinspired/tutorials/retina_model/retina_model.markdown
+++ b/modules/bioinspired/tutorials/retina_model/retina_model.markdown
@ -1,4 +1,4 @@
-Discovering the human retina and its use for image processing {#tutorial_bioinspired_retina_model}
+Retina and real-world vision {#tutorial_bioinspired_retina_model}
 =============================================================

 Goal
@ -414,9 +414,9 @@ and high frequency noise canceling.
    objects are blurred and can disappear while static object are favored. But when starting the
    retina processing, stable state is reached later.
 -   **photoreceptorsSpatialConstant** specifies the spatial constant related to photo-receptors' low
-    pass filter's effect. Those parameters specify the minimum value of the spatial signal period allowed 
-    in what follows. Typically, this filter should cut high frequency noise. On the other hand, a 0 value 
-    cuts none of the noise while higher values start to cut high spatial frequencies, and progressively 
+    pass filter's effect. Those parameters specify the minimum value of the spatial signal period allowed
+    in what follows. Typically, this filter should cut high frequency noise. On the other hand, a 0 value
+    cuts none of the noise while higher values start to cut high spatial frequencies, and progressively
    lower frequencies... Be aware to not go to high levels if you want to see some details of the input images !
    A good compromise for color images is a 0.53 value since such choice won't affect too much the color spectrum.
    Higher values would lead to gray and blurred output images.
@ -428,7 +428,7 @@ It modulates photo-receptors sensitivity and completes the processing for final
 (part of the spatial band pass effect thus favoring visual details enhancement).

 -   **horizontalCellsGain** here is a critical parameter ! If you are not interested with the mean
-    luminance and want just to focus on details enhancement, then, set this parameterto zero. However, if 
+    luminance and want just to focus on details enhancement, then, set this parameterto zero. However, if
    you want to keep some environment luminance's data, let some low spatial frequencies pass into the system and set a
    higher value (\<1).
 -   **hcellsTemporalConstant** similar to photo-receptors, this parameter acts on the temporal constant of a
@ -457,7 +457,7 @@ the visual scene, even in bright areas.

 #### IPL Magno (motion/transient channel) parameters

-Once image's information are cleaned, this channel acts as a high pass temporal filter that  
+Once image's information are cleaned, this channel acts as a high pass temporal filter that
 selects only the signals related to transient signals (events, motion, etc.). A low pass spatial filter
 smoothes extracted transient data while a final logarithmic compression enhances low transient events
 thus enhancing event sensitivity.
@ -471,8 +471,7 @@ thus enhancing event sensitivity.
    High values let slow transient events to be selected.
 -   **V0CompressionParameter** specifies the strength of the log compression. Similar behaviors to
    previous description but here  enforces sensitivity of transient events.
-   **localAdaptintegration_tau** generally set to 0, has no real use actually in here. 
+-   **localAdaptintegration_tau** generally set to 0, has no real use actually in here.
 -   **localAdaptintegration_k** specifies the size of the area on which local adaptation is
    performed. Low values lead to short range local adaptation (higher sensitivity to noise), high
    values secure log compression.
-
--- a/modules/bioinspired/tutorials/table_of_content_retina.markdown
+++ b/modules/bioinspired/tutorials/table_of_content_retina.markdown
@ -0,0 +1,14 @@
+Discovering the human retina and its use for image processing {#tutorial_table_of_content_retina}
+============================
+
+-   @subpage tutorial_bioinspired_retina_model
+
+    *Author:* Alexandre Benoit
+
+    Processing regular images
+
+-   @subpage tutorial_bioinspired_retina_illusion
+
+    *Author:* Andrea Merello
+
+    See how to reproduce human eye optical illusions