Screenshots and Image Recognition Software

All of the images from the clip ‘Transformation’ that you just saw originated somewhere else. They have been copied and pasted in a new context thanks to technologies that allow you to ‘transform’ and retrieve images. This assignment revolves around Image Recognition Software, also known as Computer Vision. You will first learn how to create a screenshot of an image and inspect its corresponding metadata. Then, you will learn how to use software to find where an image on the web originally comes from. Finally, you will analyse the image using digital source criticism tools. This will allow you to trace the origin of the image to its publication in a digital representation in an online environment, documenting the transformations it has undergone in order to reach its most recent stage.

card

1.a Inspect screen shots | 35 min

A screenshot (sometimes referred to as a screencap, screengrab, or in the earlier days of the internet, as a print screen), can be seen as revolutionary. The ease with which you can copy and paste an image, use it in another context, and share it with others, encourages the use and spread of images on the web. This is especially the case when the images are copyright free. The first step in exploring image recognition is using your laptop to create two screenshots of images of your choice that have been used for the animation ‘Transformation.’

  • Start by re-watching the video clip ‘Transformation.’ At any point in the video, pause it (for example, at 0:28 or 2:03), and take a screenshot of a clear image of an object or a person in the paused clip. (Click here for instructions on how to take a screenshot on a Mac or a Windows computer.

For Mac users, the screenshots will automatically appear on your desktop. For Windows users, the screenshots will appear in a folder called “Screenshots” in your “Pictures” folder. Copy those screenshots in the two fields of your answerform (this will be added to this lesson to download asap).

Taking the screenshots automatically generates information about them that we call metadata. Metadata, which literally means “data about data”, is information about the object in question. It is a concept that is key for the management of digital content (including photos, texts, audio or video-files, tweets, or 3d objects). In the case of digitally produced images, this type of metadata is called Exif data, which stands for: Exchangeable Image File Format. These data provide information about the image such as the amount of pixels, the number of bytes, the colour scheme, the date of creation, and the format of the image.

  • Watch the video The Metadata Librarian Explains Metadata by metadata librarian Mary Rose of the Southern Illinois University Edwardsville Libraries that gives you an introduction to the concept of metadata by using the analogy of how we label leftovers in a fridge.

You are now going to inspect the metadata of the first of the two screenshots that you created.

  • Click here for access to a PDF with an overview of all the metadata fields, how to access them, and what they mean Using the PDF, you will see all the major fields for metadata information in the tables below.
  • Fill out only the metadata fields that are highlighted for the first of the two screenshots you took, either in the Mac or the Windows version, while taking note of all the other types of common metadata
Mac OS metadata Finding
Kind  
Size  
Where  
Created  
Modified  
Last opened  
Dimensions  
Colour space  
Colour profile  
Alpha channel  
Name and extension  
Comments  
Open with  
Preview (smaller version of screenshot)  
Sharing permissions  
Windows metadata Finding
Type of file  
Opens with  
Location  
Size  
Size on disk  
Created  
Modified  
Accessed  
Attributes  
Object Name  
Permissions  
Date taken  
Dimensions  
Width  
Height  
Bit Depth  
Name  
Folder path  
Attributes  
Offline status  
Shared with  
Owner  
  • Check whether or not the metadata that you looked up was correct by using the online tool https://www.metadata2go.com, which helps you find and examine the Exif and metadata of an image, in addition to providing explanations for a number of metadata categories or fields that you might not be familiar with.

1.b Explore Image Recognition Tools | 50 min

The purpose of Image Recognition software is to teach systems to recognize patterns in images in the same way humans do. The software is trained with data that enables it to create a kind of prediction model. This model defines a set of conditions that have to be met and translates those into pixel values, which allows the software to recognize different images by looking for the closest possible resemblances to what it has learned and stored. We are now going to explore two online image recognition services: a well known one, Google Search by Image, and a less-known service, Tineye.

For this assignment, you will choose the second of the two screenshots you took in Assignment 1a (that is, not the one that you analyzed the metadata of) to search for its origin via two image recognition services, Google Search by Image, and Tineye.

Image search results Google Image Image search results Tineye

Image search results Tineye Image search results Google Image

  • Complete the table with your findings. Put a copy of your screenshot in the first field (if you have chosen a portrait page setup, you will need to make it smaller to fit in the field)
  • Describe the differences in options to select between the two services
  • Describe the metadata components of the summary of the search in both services, paying attention to the differences between the two services. This includes things such as the dimensions of the image, possible related search terms, the file type, and any other information relating to the image itself.
  • Describe the metadata components of the search results in both services, paying attention to the differences between the two services. This includes things such as the titles and text of the websites that appear, the number of search results, and any other information relating to the search results generated by the different services.
  • Describe the first two search results, first the result of the search and then what you find when you open the link, assess the quality of the image recognition in both services.

  • Evaluate your findings in a short summary (100 words max.) in which you assess the reason for the differences in performance. (Hint: Think about what was said in the TED Talk)

1.c Applying Digital Source Criticism to screenshots | 50 min

Source criticism is the practice of critically engaging with a source. Historians, for example, have always questioned the origin, meaning, and credibility of the historical sources that they use. However, when a source is digitised, published online, and reproduced by many people, it can become more difficult to trace the origin of the source. In this assignment, you are going to document the characteristics of the image from its original creation to its digital representation online, tracking the changes that happen each time the image is transformed into something else. It is important to remember that while the transformed image communicates the same message as the original in most instances, its artifactual value has profoundly changed.

  • Open this visual aid from the Ranke.2 website. You will see the heading ‘Questions to ask about a digitised manuscript published on the web.’ Below, three contexts are identified in which the identity, appearance, and value of a source changes. In each context, different questions should be asked. In this assignment, you will pose similar questions about two images that you will choose below.

The goal here is to understand the difference between ‘the’ image and ‘an’ image. ‘The’ image is the original, that is, the very first of its kind that was ever made. ‘An’ image is a reproduction of the original image. Sometimes the reproduction is in the same format as the original and sometimes it is in a completely different format. Here, you will be tracing that process of transformation from the original image (context of creation) to its digital reproduction (context of digitisation).

NOTE: You may need to do some additional web research to answer all the questions. Some good resources include Encyclopedia Britannica, Google Scholar, and Khan Academy. Keep in mind that some websites do not share the information about the digitization process, so it is OK if you are not able to find all the necessary information.

  • Look at the images below, choose two, and take screenshots of them.

Charles Darwin portrait by John Collier Morel Sandro Botticelli - La nascita di Venere Barack Obama hope poster Boy children Christmas Man mask Postmortem children Iris Woman labor Boy in blue long sleeve Oscar Wild Postmortem man Dora Maar Picasso

  • Fill out the tables below using two of the images selected above. For the first section, ‘Context of creation’, we suggest that you repeat the process of Assignment 1b of uploading the screenshot in the image recognition service that you prefer. The list of results that appears will guide you in answering the questions in the first section.

To give you an idea of how to respond to the questions, an example is given below.

1. Context of creation example answer
What is its name and who created it? Portrait de Dora Maar. It was created by the painter Pablo Picasso.
When and where was it created? 1937 and potentially in Paris.
For which purpose? to create a work of art
What is portrayed in the image? Dora Maar, a pseudonym for the french photographer, painter and poet Henriette Theodora Markovitch, Picasso’s muse
With which technology/craft/medium? Oil on linen
2. Context of preservation example answer
Where is the original? At the Musée Picasso in Paris
What does this say about the value of the source? It is valued as an important work of art
Is it accessible to an audience? It is not clear whether it is in the museum’s archive or currently being exhibited to the public
Has its value/ appearance//identity changed in this different context? Yes, it has become part of shared cultural heritage as it is in a publicly funded museum.
3. Context of digitisation and online publishing example answer
On which website did you find it? I found it both on commercial images selling companies websites like Alamy, and in websites of collections, like wikiart. An interesting thing to note is that you can end up on a website where you have to pay to receive a copy, whereas the image is licensed under ‘fair use’ and can be downloaded without costs, in a lower resolution
Who published it online, when and where? This is not possible to find out online and would require an investigation based on contact with the owner of the painting, that is, the museum
Why and how was it digitised Same as above
Was information lost or added? Yes, its materiality was lost, the colors are different, the dimensions are different, and it has changed into digits and automatically created metadata when being photographed, being converted to a lower resolution to make it suitable for upload. When uploaded, the structure of the bits changed again, and new metadata was added.
Who is the owner of the digital copy? Probably the Picasso Museum, but it is being shared under the principle of ‘fair use’ and for educational and research purposes a lower resolution is freely available.

Reading/viewing suggestions

  • How to find and remove Exif data on a Mac and on a Windows computer: Klein, Matt. “What Is EXIF Data, and How Can I Remove It from My Photos”. How-to Geek, July 10, 2017. Magazine article
  • Neatrour, Anna, and Martin Greer. “Using Metadata to Describe Digital Content.” September 30, 2015. Digpublib. Video, 35:51. Youtube Video
  • Jagdish, Deepak, and Daniel Smilkov. “The Power of Metadata.” September 25, 2013 at TEDxCambridge. Video, 9:57. Youtube Video
  • Read the original article from the team that produced the Google Books Ngram Viewer: Michel, Jean-Baptiste, Yuan Kui Shen, Aviva P. Aiden, Adrian Veres, Matthew K. Gray, et al. “Quantitative analysis of cutlure using millions of digitized books.” Science 331, (6014): 176-182. Harvard Library
  • If you want to go more in-depth about the field of Image Recognition software and Computer Vision: Szeliski, Richard. “Computer Vision: Algorithms and Applications by Richard.” Last modified May 23, 2022. Website