Presentations and other scholarship

Using Human Observer Eye Movements in Automatic Image Classifiers

Alejandro Jaimes, Columbia University
Jeff Pelz, Rochester Institute of TechnologyFollow
Timothy Grabowski, Rochester Institute of Technology
Jason Babcock, Rochester Institute of Technology
Shih-Fu Chang, Columbia University

Description

We explore the way in which people look at images of different semantic categories (e.g., handshake, landscape), and directly relate those results to computational approaches for automatic image classification. Our hypothesis is that the eye movements of human observers differ for images of different semantic categories, and that this information can be effectively used in automatic content-based classifiers. First, we present eye tracking experiments that show the variations in eye movements (i.e., fixations and saccades) across different individuals for images of 5 different categories: handshakes (two people shaking hands), crowd (cluttered scenes with many people), landscapes (nature scenes without people), main object in uncluttered background (e.g., an airplane flying), and miscellaneous (people and still lives). The eye tracking results suggest that similar viewing patterns occur when different subjects view different images in the same semantic category. Using these results, we examine how empirical data obtained from eye tracking experiments across different semantic categories can be integrated with existing computational frameworks, or used to construct new ones. In particular, we examine the Visual Apprentice, a system in which image classifiers are learned (using machine learning) from user input as the user defines a multiple level object definition hierarchy based on an object and its parts (scene, object, object-part, perceptual area, region), and labels examples for specific classes (e.g., handshake). The resulting classifiers are applied to automatically classify new images (e.g., as handshake/non-handshake). Although many eye tracking experiments have been performed, to our knowledge, this is the first study that specifically compares eye movements across categories, and that links categoryspecific eye tracking results to automatic image classification techniques.

Date of creation, presentation, or exhibit

6-8-2001

Comments

Copyright 2001 Society of Photo-Optical Instrumentation Engineers. One print or electronic copy may be made for personal use only. Systematic reproduction and distribution, duplication of any material in this paper for a fee or for commercial purposes, or modification of the content of the paper are prohibited.

The authors wish to thank Diane Kucharczyk, Amy Silver, and the persons that participated in the experiments.

Note: imported from RIT’s Digital Media Library running on DSpace to RIT Scholar Works in February 2014.

Document Type

Conference Paper

Department, Program, or Center

Chester F. Carlson Center for Imaging Science (COS)

Recommended Citation

Alejandro Jaimes, Jeff B. Pelz, Tim Grabowski, Jason S. Babcock, Shih-Fu Chang, "Using human observer eye movements in automatic image classifiers", Proc. SPIE 4299, Human Vision and Electronic Imaging VI, (8 June 2001); doi: 10.1117/12.429507; https://doi.org/10.1117/12.429507

Campus

RIT – Main Campus

Download

COinS

Presentations and other scholarship

Using Human Observer Eye Movements in Automatic Image Classifiers

Description

Date of creation, presentation, or exhibit

Comments

Document Type

Department, Program, or Center

Recommended Citation

Campus

Search

Browse

Author Corner

RIT Links

Presentations and other scholarship

Using Human Observer Eye Movements in Automatic Image Classifiers

Authors

Description

Date of creation, presentation, or exhibit

Comments

Document Type

Department, Program, or Center

Recommended Citation

Campus

Share

Search

Browse

Author Corner

RIT Links