Home ] Up ] Research ] Publications ] People ] History of Cog ] Job Opportunities ]

Head/Eyes/Visual Routines

 

  Head/Eyes/Visual Routines

  • Head/Eye Orientation
  • Saccadic Eye Movement
  • Saccade to Motion
  • Smooth Pursuit Tracking
  • Vestibulo-Occular Reflex
  • Face Detection
  • Eye Finding
  • Imitation of head nods

  • The Cog Shop
    MIT Artificial Intelligence Laboratory
    545 Technology Square, #920
    Cambridge, MA 02139

    write to the Cog Documentation Project: cdp@ai.mit.edu

    Head/Eye Orientation

    These clips show Cog orienting its head and neck to a visual stimulus. The eyes are moving to look at moving objects in the field of view. Whenever the eyes move to their new target, the neck moves to point the head toward that same target. The first clip is from the new revision of Cog's head.

     

  • Title: Head/Eye Orientation
  • Authors: Scassellati
  • Length: approximately 25 seconds
  • Quicktime (15 fps) -- (5.1 Meg)
  • Quicktime (30 fps) -- (10.3 Meg)
  • MPEG (15 fps) -- (3.2 Meg)
  • MPEG (30 fps) -- (6.4 Meg)

  • This orientation clip is from Cog's old head.

  • Title: Head/Eye Orientation
  • Authors: Ferrell
  • Length: approximately 16 seconds
  • Quicktime (15 fps) -- (3.4 Meg)
  • Quicktime (30 fps) -- (6.7 Meg)
  • MPEG (15 fps) -- (2.9 Meg)
  • MPEG (30 fps) -- (5.8 Meg)
  •  

     

    Saccadic Eye Movement

    Human eyes move as the result of one of four mechanisms. Two of these mechanisms are under voluntary control (saccadic movements, and smooth pursuit movements), while two are under involuntary control (the vestibulo-occular reflex and non-stabilizing micromovements). The saccadic movements are high-speed movements that cause the eye to jump to a new location approximately three times per second This short clip shows the first tests of the new active vision system as it saccades to random positions. Three of these heads were developed, one for Cog, and two to serve as desktop development platforms. The eyes are travelling at about one-half their maximum velocity.


  • Title: Saccadic Eye Movements
  • Authors: Scassellati
  • Length: approximately 5 seconds
  • Quicktime (15 fps) -- (1.2 Meg)
  • Quicktime (30 fps) -- (2.3 Meg)
  • MPEG (15 fps) -- (0.9 Meg)
  • MPEG (30 fps) -- (1.8 Meg)

  •  

    Saccade to Motion

    In this clip, Cog has been programmed to attend to moving objects. This motion detection operates by subtracting consecutive images, and the using region growing to identify boundaries of moving objects. In this video clip, you can see the eyes saccade to the moving stuffed animal.

  • Title: Saccade to Motion
  • Authors: Scassellati
  • Length: approximately 6 seconds
  • Quicktime (15 fps) -- (1.2 Meg)
  • Quicktime (30 fps) -- (2.5 Meg)
  • MPEG (15 fps) -- (0.8 Meg)
  • MPEG (30 fps) -- (1.5 Meg)

  •  

    Smooth Pursuit Tracking

    The second type of voluntary eye motion is smooth pursuit tracking. This clip shows Cog smoothly tracking a moving object that was placed in front of it. The tracking uses a correlation based metric to determine where the desired object has moved in the visual field.

  • Title: Smooth Pursuit Tracking
  • Authors: Scassellati
  • Length: approximately 9 seconds
  • Quicktime (15 fps) -- (1.9 Meg)
  • Quicktime (30 fps) -- (3.8 Meg)
  • MPEG (15 fps) -- (1.3 Meg)
  • MPEG (30 fps) -- (2.5 Meg)

  •  

    Vestibulo-Occular Reflex

    One of the involuntary eye movements is the vestibulo-occular reflex. This reflex serves to keep the eyes fixed on a target while the head moves (or is moved). In humans, this relfex is accomplished by two systems:a very tight feedback loop from the vestibular system to the eye muscles, which is active at high velocities, and a measurement of visual slip which is active at slow velocities. We have implemented the high velocity vestibular reflex on Cog. Using two rate gyroscopes, we can measure the angular velocity of the head and move the eyes to compensate for that motion. The clip below first shows the head being moved without the vestibular reflex. Notice that the eyes move back and forth with the head as it moves. The second part of the clip shows the head being moved with the vestibular reflex intact. Notice that instead of moving with the head, the eyes continue to point straight ahead regardless of how the head is moved.

  • Title: Vestibulo-Occular Reflex
  • Authors: Peskin, Scassellati
  • Length: approximately 17 seconds
  • Quicktime (15 fps) -- (3.5 Meg)
  • Quicktime (30 fps) -- (7.1 Meg)
  • MPEG (15 fps) -- (1.8 Meg)
  • MPEG (30 fps) -- (3.6 Meg)

  •  

    Face Detection

    One visual task that infants are very good at is face detection. The face detection routine shown here was developed based on the ratio template work of Pawan Sinha. This clip shows the output of the face detection module. On the right side of the clip is the live video stream. On the left side is the same image, but with detected faces outlined. A red outline indicates a better match than a green outline. Notice that the face detection software is not sensitive to face motion, as can be seen when the face is occluded by the circular mountain picture shown in this clip.

  • Title: Face Detection
  • Authors: Scassellati
  • Length: approximately 16 seconds
  • Quicktime (15 fps) -- (3.3 Meg)
  • Quicktime (30 fps) -- (6.5 Meg)
  • MPEG (15 fps) -- (1.2 Meg)
  • MPEG (30 fps) -- (2.3 Meg)

  •  

    Eye Finding

    We would like Cog to be able to tell if someone is making eye contact with it. Using the face detection routines described above, we first locate a face in the peripheral camera. Using a learned sensory-motor mapping, Cog moves its eyes to look at that person. We then can use a second learned sensory-motor mapping to extract an image of that person's eyes. This clip shows the image processing steps used to find eyes. The upper right shows the raw video image and the upper left shows the outlined face images. The lower left is the prefilter results, and the lower right shows the extracted image of the eye. The image of the eye is not stable because the person in the video is moving almost continuously.

  • Title: Eye Finding
  • Authors: Scassellati
  • Length: approximately 19 seconds
  • Quicktime (15 fps) -- (3.4 Meg)
  • Quicktime (30 fps) -- (6.7 Meg)
  • MPEG (15 fps) -- (1.2 Meg)
  • MPEG (30 fps) -- (2.3 Meg)

  •  

    Imitating Head Nods

    One of the long range tasks that we would like the robot to be able to perform is to imitate gestures and motion. The following clips show a very simple example of imitation of head motion. The output from the face detection module is passed to a tracking module, which then characterizes head motions as being either horiztonal "no" motions or vertical "yes" motions. The first clip shows one of the small active vision development platforms imitating head motions. Notice that the head responds only to a head nodding; similar motions with non-face stimuli do not provoke a response.

  • Title: Imitating head nods (development platform)
  • Authors: Scassellati
  • Length: approximately 10 seconds
  • Quicktime (15 fps) -- (1.9 Meg)
  • Quicktime (30 fps) -- (3.9 Meg)
  • MPEG (15 fps) -- (1.4 Meg)
  • MPEG (30 fps) -- (2.8 Meg)

  • The second clip shows Cog imitating the head motions of a person.

  • Title: Imitating head nods of a human
  • Authors: Scassellati
  • Length: approximately 17 seconds
  • Quicktime (15 fps) -- (5.5 Meg)
  • Quicktime (30 fps) -- (11.0 Meg)
  • MPEG (15 fps) -- (1.7 Meg)
  • MPEG (30 fps) -- (3.5 Meg)

  • The third clip shows Cog imitating the head motions of a toy cow. The stuffed animal is detected as a face, and the robot responds to it in the same way that it responds to a person.

  • Title: Imitating head nods of a toy cow
  • Authors: Scassellati
  • Length: approximately 12 seconds
  • Quicktime (15 fps) -- (4.1 Meg)
  • Quicktime (30 fps) -- (5.0 Meg)
  • MPEG (15 fps) -- (1.6 Meg)
  • MPEG (30 fps) -- (3.1 Meg)

  • The fourth clip shows Cog imitating a second stuffed animal (Mickey Mouse). The robot only responds to the face of the toy, not to the motion of the toy.

  • Title: Imitating head nods of Mickey Mouse
  • Authors: Scassellati
  • Length: approximately 28 seconds
  • Quicktime (15 fps) -- (5.7 Meg)
  • Quicktime (30 fps) -- (11.3 Meg)
  • MPEG (15 fps) -- (3.5 Meg)
  • MPEG (30 fps) -- (6.9 Meg)


  •  


    Representatives of the press who are interested in acquiring further information about the Cog project should contact Elizabeth Thomson, thomson@mit.edu, from the MIT News Office,  http://web.mit.edu/newsoffice/www/ .

     

    Home ] Up ] Research ] Publications ] People ] History of Cog ] Job Opportunities ]