DD image tagging tool

An interface to the Clarifai API

With

DensityDesign Research Lab

Year

2018

Role

design and development

Technology

Clarifai API

densitydesign.github.io/dd-image-tagging

The image tagging tool is a javascript interface built on top of the Clarifai API that let users tag a series of images with one of the image content recognition models developed by Clarifai.

Process

For information designers and social scientists working with Digital Methods, content recognition algorithms can be very helpful in categorizing large corpuses of images. They can be employed, for example, as a way to place images in a space based on their content similarity, or to roughly categorize and filter only the images relevant to the research.

Header screenshot

This small project comes from the need of having an easy interface where researchers could use a content recognition algorithm without the burden of having to learn how to code or how to call an API. While inside the lab we had a routine set up to perform the various operations needed to label a corpus of images, this was not ideal in our teaching activities. It implied a series of scripts that took time and know-how to learn and made the passage of knowledge, especially during workshops, challenging and often not very effective.

The tool is specifically designed to work in combination with spreadsheet softwares as they are, most of the time, what comes before and after in this kind of research processes. This is why the interface accepts only a list of image urls in the form of a csv file and outputs another csv file with the results of the algorithm.

Clarifai was chosen for various reasons:

  • it is one of the leading artificial intelligence companies specialized in computer vision;
  • their API is easy to setup and is free within reasonable limits;
  • it has a pool of categorization models that other companies lack.

Thoughts

The image tagging tool works well for its intended use, but it could be greatly improved. Its aim is really focused on the specific task it was created for, making it less flexible to change. For example, a research question that requires an output formatted differently would render the tool useless.

As future developments we could also improve the inputs it handles by also excepting images directly instead of just csvs.