What is Computer Vision API Version 2.0?

The cloud-based Computer Vision API gives developers access to advanced algorithms that process images and return information about them. After you upload an image or specify an image URL, Microsoft Computer Vision algorithms analyze the visual content in different ways, depending on the inputs and options you choose. With the Computer Vision API, you can analyze images to:

Tag images based on content
Categorize images
Identify the type and quality of images
Detect human faces and return their coordinates
Recognize domain-specific content
Generate descriptions of the content
Use optical character recognition to identify printed text found in images
Recognize text
Distinguish color schemes
Flag adult content
Crop photos to be used as thumbnails
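
As a rough illustration of how several of these capabilities can be requested in a single call, the sketch below builds an analyze URL. It assumes the v2.0 REST convention of a comma-separated visualFeatures query parameter; the westus region, the ENDPOINT constant, and the build_analyze_url helper are assumptions made for this example, not names taken from the text above.

```python
from urllib.parse import urlencode

# Assumption: the region ("westus") and the path follow the v2.0 REST layout.
ENDPOINT = "https://westus.api.cognitive.microsoft.com/vision/v2.0/analyze"


def build_analyze_url(features, endpoint=ENDPOINT):
    """Return an analyze URL that asks for the given visual features.

    build_analyze_url is a hypothetical helper written for this example.
    """
    if not features:
        raise ValueError("at least one visual feature is required")
    # Keep commas unescaped so the feature list stays readable in the URL.
    query = urlencode({"visualFeatures": ",".join(features)}, safe=",")
    return f"{endpoint}?{query}"


if __name__ == "__main__":
    # Feature names such as "Tags" and "Adult" are assumed to match the service.
    print(build_analyze_url(["Tags", "Categories", "Adult"]))
```

Only the URL is built here; no request is sent, so the sketch runs without a subscription key.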

Requirements

Supported input methods: Raw image binary sent as application/octet-stream, or an image URL.
Supported image formats: JPEG, PNG, GIF, BMP.
Image file size: Less than 4 MB.
Image dimensions: Greater than 50 x 50 pixels.
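
The requirements above can be checked on the client before an image is sent. The following is a minimal sketch, not part of the API: check_image and detect_format are hypothetical helpers, the format test only looks at well-known file signatures, "4 MB" is assumed to mean 4 * 1024 * 1024 bytes, and the caller is assumed to already know the pixel dimensions.

```python
MAX_BYTES = 4 * 1024 * 1024  # assumption: "4 MB" read as 4 * 1024 * 1024 bytes
MIN_SIDE = 50                # each side must be greater than this many pixels

# Leading bytes (file signatures) of the four supported formats.
SIGNATURES = {
    "JPEG": (b"\xff\xd8\xff",),
    "PNG": (b"\x89PNG\r\n\x1a\n",),
    "GIF": (b"GIF87a", b"GIF89a"),
    "BMP": (b"BM",),
}


def detect_format(data):
    """Return the format name suggested by the leading bytes, or None."""
    for name, prefixes in SIGNATURES.items():
        if data.startswith(prefixes):
            return name
    return None


def check_image(data, width, height):
    """Return a list of problems; an empty list means the checks passed."""
    problems = []
    if detect_format(data) is None:
        problems.append("unsupported format (expected JPEG, PNG, GIF, or BMP)")
    if len(data) >= MAX_BYTES:
        problems.append("file size must be less than 4 MB")
    if width <= MIN_SIDE or height <= MIN_SIDE:
        problems.append("dimensions must be greater than 50 x 50 pixels")
    return problems


if __name__ == "__main__":
    fake_png = b"\x89PNG\r\n\x1a\n" + b"\x00" * 100
    print(check_image(fake_png, 640, 480))
    print(check_image(b"not an image", 10, 10))
```

A signature check is only a quick filter; it does not prove that the file decodes correctly.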

Tagging images

The Computer Vision API returns tags based on more than 2,000 recognizable objects, living beings, scenery, and actions. When a tag is ambiguous or not common knowledge, the API response provides a 'hint' that clarifies the meaning of the tag in the context of a known setting. Tags are not organized as a taxonomy, and no inheritance hierarchies exist. A collection of content tags forms the foundation for an image 'description', which is displayed as human-readable language formatted in complete sentences. Note that, at this point, English is the only supported language for image descriptions.

After you upload an image or specify an image URL, the Computer Vision API's algorithms output tags based on the objects, living beings, and actions identified in the image. Tagging is not limited to the main subject, such as a person in the foreground; it also covers the setting (indoor or outdoor), furniture, tools, plants, animals, accessories, gadgets, and so on.
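
The tagging flow described above might look like the following sketch. It assumes a v2.0 /tag endpoint in the westus region, a subscription key passed in the Ocp-Apim-Subscription-Key header, and a JSON response holding a "tags" list whose entries carry "name", "confidence", and an optional "hint". The helper names, the placeholder key, and the SAMPLE_RESPONSE text are illustrative assumptions, not real service output.

```python
import json
from urllib.request import Request

# Assumption: region and path follow the v2.0 REST layout.
TAG_ENDPOINT = "https://westus.api.cognitive.microsoft.com/vision/v2.0/tag"


def build_tag_request(image_url, key, endpoint=TAG_ENDPOINT):
    """Build (but do not send) a POST request that submits an image URL."""
    body = json.dumps({"url": image_url}).encode("utf-8")
    headers = {
        "Ocp-Apim-Subscription-Key": key,
        "Content-Type": "application/json",
    }
    return Request(endpoint, data=body, headers=headers, method="POST")


def summarize_tags(response_text, min_confidence=0.5):
    """Return tag names at or above min_confidence, with hints in parentheses."""
    payload = json.loads(response_text)
    labels = []
    for tag in payload.get("tags", []):
        if tag.get("confidence", 0.0) >= min_confidence:
            label = tag["name"]
            if "hint" in tag:
                label += f" ({tag['hint']})"
            labels.append(label)
    return labels


# Hand-written sample for illustration only; not actual service output.
SAMPLE_RESPONSE = json.dumps({
    "tags": [
        {"name": "grass", "confidence": 0.99},
        {"name": "pot", "hint": "container", "confidence": 0.72},
        {"name": "blurry", "confidence": 0.2},
    ]
})

if __name__ == "__main__":
    request = build_tag_request("https://example.com/garden.jpg", "YOUR_KEY")
    print(request.get_method(), request.full_url)
    # Sending would be urllib.request.urlopen(request); skipped here.
    print(summarize_tags(SAMPLE_RESPONSE))
```

Building the request separately from sending it keeps the example runnable offline and makes the response handling easy to test against saved JSON.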

Source: https://azure.microsoft.com/en-us/services/cognitive-services/computer-vision/
