Programmable Media

Google Auto Tagging

Last updated: Nov-28-2024

Cloudinary is a cloud-based service that provides an end-to-end asset management solution including uploads, storage, transformations, optimizations and delivery. Cloudinary offers a very rich set of image transformation and analysis capabilities and allows you to extract the semantic data from uploaded images: photo metadata (Exif & IPTC) including location and camera details, coordinates of automatically detected faces, color histogram and predominant colors. In addition, Cloudinary allows you to assign multiple tags to images for listing and managing your media assets.

Google Cloud enables developers to understand the content of an image by utilizing powerful machine learning models and quickly classify images into thousands of categories. Cloudinary provides an add-on for Google's automatic image tagging capabilities, fully integrated into Cloudinary's image management and transformation pipeline. Google analyzes image data and automatically identifies categories and suggests tags, a process that would take huge amounts of time and resources if performed manually. The analysis also improves over time as new concepts are introduced and accuracy is improved.

With the Google auto tagging add-on, you can extend Cloudinary's powerful semantic data extraction and image tagging features. When using the Google auto tagging add-on, your images are automatically tagged according to the categories detected in each image.

Tips
  • You can optionally request the category and tag results in another language or in multiple languages by registering for the Google Translation Add-on in addition to the tagging add-on.
  • This page describes how to use the Google Auto Tagging add-on programmatically in Programmable Media, but that you can also use the add-on for DAM use cases in Assets. For more information, see Google Auto Tagging in the Assets user guide.

Getting started

Before you can use the Google Auto Tagging add-on:

  • You must have a Cloudinary account. If you don't already have one, you can sign up for a free account.

  • Register for the add-on: make sure you're logged in to your account and then go to the Add-ons page. For more information about add-on registrations, see Registering for add-ons.

  • Keep in mind that many of the examples on this page use our SDKs. For SDK installation and configuration details, see the relevant SDK guide.

  • If you're new to Cloudinary, you may want to take a look at the Developer Kickstart for a hands-on, step-by-step introduction to Programmable Media features.

Tip
You can also request tagging using the Analyze API (Beta) which also accepts external assets to analyze.

Image recognition and categorization

Take a look at the following photo of children ice skating:

ice_skating.jpg

By setting the categorization parameter to google_tagging when calling Cloudinary's upload or update methods, Google is used to automatically classify the scenes of the uploaded or specified existing image. For example:

Tips
  • You can use upload presets to centrally define a set of upload options including add-on operations to apply, instead of specifying them in each upload call. You can define multiple upload presets, and apply different presets in different upload scenarios. You can create new upload presets in the Upload Presets page of the Console Settings or using the upload_presets Admin API method. From the Upload page of the Console Settings, you can also select default upload presets to use for image, video, and raw API uploads (respectively) as well as default presets for image, video, and raw uploads performed via the Media Library UI.

    Learn more: Upload presets

  • You can run multiple categorization add-ons on the resource. The categorization parameter accepts a comma-separated list of all the Cloudinary categorization add-ons to run on the resource.

The upload API response includes the automatic categorization identified by the Google auto tagging add-on. As can be seen in the response snippet below, various categories were automatically detected in the uploaded photo. The confidence score is a numerical value that represents the confidence level of the detected category, where 1.0 means 100% confidence.

Adding resource tags to images

Automatically categorizing your images is a useful way to organize your Cloudinary media assets, and by also providing the auto_tagging parameter to an upload or update call, images are automatically assigned resource tags based on the detected scene categories. The value of the auto_tagging parameter is the minimum confidence score of a detected category that should be automatically used as an assigned resource tag. Assigning these resource tags allows you to list and search images using Cloudinary's API and Web interface. The following code example automatically tags an uploaded image with all detected categories that have a confidence score higher than 0.6.

The response of the upload API call above returns the detected categories as well as the assigned tags for categories meeting the minimum confidence score of 0.6:

You can also use the update method to apply Google auto tagging to already uploaded images, based on their public IDs, and then automatically tag them according to the detected categories. For example, the following image was uploaded to Cloudinary with the 'puppy' public ID.

sample.jpg

The following code sample uses Cloudinary's update method to apply Google's automatic image tagging and categorization to the puppy uploaded image, and then automatically assign resource tags based on the categories detected with over a 90% confidence level.

The response of the update API call includes the detected categories, and automatically assigned tags. As you can see in the response snippet below, tags were only added for the scene categories with a confidence score of over 90%.

You can use the Admin API's resource method to return the details of a resource, including the scene categories that you already extracted using the upload or update methods.

✔️ Feedback sent!

Rate this page: