Hello
The API VisionAPI in Project Oxford also gives us the ability to perform optical character recognition in an image. What we usually known as OCR.
The result of the OCR process, shows us information with
- the language of the detected language
- the area where the text has been detected
- the angle of the text
- a collection of lines within each area of detected text
- a collection of words per line
So a McDonalds sign returns the following information
Language: en
Orientation: Up
Text Angle: 5.89999999999997
RegionRectangle:
Left 75, Top 74,
Height 132, Width 210Lines
please do not
eat the
billboard
all white meat chicken
The About Windows form returns
Language: en
Orientation: Up
Text Angle: 0
RegionRectangle:
Left 9, Top 11,
Height 311, Width 402Lines
About Windows
Windows 10
Microso ft Mndows
Version 1511 (osBui1d 10586.11)
@ 2015 Microsoft Corporaton. Al rights reserved.
The Windows 10 Enterprise operating system and its user interface are
protected by trademark and other pending or existing intellectual property
rights in the United States and other countries/regions.
This product is licensed under the Microsoft Software License
Terms to:
brunocapuano@msn.com
And a live action demo is like this one where PADDINGTON is quickly added into the image information
The source code is avilable in GitHub https://github.com/elbruno/ProjectOxford
Greetings @ Madrid !
El Bruno
References
- Project Oxford
- My Azure ML Vision APIs series
- VisionAPI implementation for image analysis
- Adding Vision API capabilities in our apps
- My Azure ML Emotion APIs series
- My Azure ML Face APIs series
4 thoughts on “#AZURE – #VisionAPI and optical character recognition #OCR”