OpenCV deep learning module samples
Model Zoo
See the wiki for a list of tested models.
If OpenCV is built with Intel's Inference Engine support, you can also use Intel's pre-trained models.
Different models require different preprocessing parameters, such as mean subtraction or scale factors. The most popular models and their parameters are listed in the models.yml configuration file. The same file can also be used to alias sample parameters. For example,
python object_detection.py opencv_fd --model /path/to/caffemodel --config /path/to/prototxt
Use the -h option to see which values are used by default:
python object_detection.py opencv_fd -h
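The aliasing idea can be sketched as follows: an alias such as opencv_fd selects a set of default preprocessing values, and any flag given explicitly on the command line overrides them. This is only an illustration, not the samples' actual code. The MODEL_ZOO dictionary, the flag names other than --model and --config, and the helper functions are hypothetical, and the values attached to opencv_fd are assumed from the face detection parameters described in this README rather than read from models.yml.

```python
import argparse

# Hypothetical stand-in for one models.yml entry. The real file may use
# different keys and values; these are assumed from the README text.
MODEL_ZOO = {
    "opencv_fd": {"mean": [104, 177, 123], "width": 300, "height": 300},
}


def build_parser(alias=None):
    """Build a parser whose defaults come from the aliased entry, if any."""
    defaults = MODEL_ZOO.get(alias, {})
    parser = argparse.ArgumentParser(description="alias-aware sample (sketch)")
    parser.add_argument("alias", nargs="?", choices=sorted(MODEL_ZOO))
    parser.add_argument("--model", help="path to the weights file")
    parser.add_argument("--config", help="path to the network description")
    parser.add_argument("--mean", nargs="+", type=float,
                        default=defaults.get("mean", [0, 0, 0]))
    parser.add_argument("--width", type=int, default=defaults.get("width", -1))
    parser.add_argument("--height", type=int, default=defaults.get("height", -1))
    return parser


def parse(argv):
    """Peek at the first argument to pick an alias, then parse everything."""
    alias = argv[0] if argv and argv[0] in MODEL_ZOO else None
    return build_parser(alias).parse_args(argv)


if __name__ == "__main__":
    args = parse(["opencv_fd", "--model", "/path/to/caffemodel",
                  "--config", "/path/to/prototxt"])
    print(args.mean, args.width, args.height)
```

Because the defaults are injected before parsing, running such a script with -h would show the alias-specific values, which is the behaviour the -h hint above relies on.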
Face detection
The original model, with single-precision floating-point weights, has been quantized using the TensorFlow framework.
For the best accuracy, run the model on BGR images resized to 300x300, subtracting the mean values (104, 177, 123) from the blue, green and red channels respectively.
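The preprocessing described above can be sketched in NumPy. The function name to_blob is hypothetical, and the sketch assumes the image has already been resized to 300x300 and is stored in BGR channel order. In OpenCV itself, a call along the lines of cv2.dnn.blobFromImage(image, 1.0, (300, 300), (104, 177, 123), swapRB=False, crop=False) would typically perform the resize and these steps in one go.

```python
import numpy as np

# Per-channel means from the text, in blue, green, red order.
MEAN_BGR = (104.0, 177.0, 123.0)


def to_blob(image_bgr):
    """Turn a 300x300 BGR uint8 image (H, W, 3) into a 1x3x300x300 float32 blob.

    Assumes the caller has already resized the image; no scaling is applied.
    """
    if image_bgr.shape != (300, 300, 3):
        raise ValueError("expected an image already resized to 300x300x3")
    # Subtract the channel means (broadcast over height and width).
    centered = image_bgr.astype(np.float32) - np.array(MEAN_BGR, dtype=np.float32)
    # Reorder HWC -> CHW and add a leading batch dimension.
    return centered.transpose(2, 0, 1)[np.newaxis, ...]


if __name__ == "__main__":
    gray = np.full((300, 300, 3), 128, dtype=np.uint8)
    print(to_blob(gray).shape)
```

Keeping the channels in BGR order matters here: swapping to RGB would pair each mean value with the wrong channel.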
The following accuracy metrics were obtained with the COCO object detection evaluation tool on the FDDB dataset (see script), both with images resized to 300x300 and with the original image sizes kept.
AP - Average Precision | FP32/FP16 | UINT8 | FP32/FP16 | UINT8 |
AR - Average Recall | 300x300 | 300x300 | any size | any size |
--------------------------------------------------|-----------|----------------|-----------|----------------|
AP @[ IoU=0.50:0.95 | area= all | maxDets=100 ] | 0.408 | 0.408 | 0.378 | 0.328 (-0.050) |
AP @[ IoU=0.50 | area= all | maxDets=100 ] | 0.849 | 0.849 | 0.797 | 0.790 (-0.007) |
AP @[ IoU=0.75 | area= all | maxDets=100 ] | 0.251 | 0.251 | 0.208 | 0.140 (-0.068) |
AP @[ IoU=0.50:0.95 | area= small | maxDets=100 ] | 0.050 | 0.051 (+0.001) | 0.107 | 0.070 (-0.037) |
AP @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] | 0.381 | 0.379 (-0.002) | 0.380 | 0.368 (-0.012) |
AP @[ IoU=0.50:0.95 | area= large | maxDets=100 ] | 0.455 | 0.455 | 0.412 | 0.337 (-0.075) |
AR @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] | 0.299 | 0.299 | 0.279 | 0.246 (-0.033) |
AR @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] | 0.482 | 0.482 | 0.476 | 0.436 (-0.040) |
AR @[ IoU=0.50:0.95 | area= all | maxDets=100 ] | 0.496 | 0.496 | 0.491 | 0.451 (-0.040) |
AR @[ IoU=0.50:0.95 | area= small | maxDets=100 ] | 0.189 | 0.193 (+0.004) | 0.284 | 0.232 (-0.052) |
AR @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] | 0.481 | 0.480 (-0.001) | 0.470 | 0.458 (-0.012) |
AR @[ IoU=0.50:0.95 | area= large | maxDets=100 ] | 0.528 | 0.528 | 0.520 | 0.462 (-0.058) |
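The values in parentheses appear to be the UINT8 result minus the FP32/FP16 result in the same input-size group. A minimal sketch that reproduces a few of them from the table (the row labels and the delta helper are illustrative only):

```python
# (metric, FP32/FP16, UINT8) copied from the "any size" columns of the table.
ANY_SIZE_ROWS = [
    ("AP IoU=0.50:0.95 area=all", 0.378, 0.328),
    ("AP IoU=0.50      area=all", 0.797, 0.790),
    ("AP IoU=0.75      area=all", 0.208, 0.140),
]


def delta(fp32, uint8):
    """Signed difference of the quantized score relative to the float score."""
    return f"{uint8 - fp32:+.3f}"


if __name__ == "__main__":
    for name, fp32, uint8 in ANY_SIZE_ROWS:
        print(f"{name}: {uint8:.3f} ({delta(fp32, uint8)})")
```

Read this way, quantization costs almost nothing at 300x300, while the drop is more noticeable when the original image sizes are kept.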