Head Nodding
The Cog Shop
MIT Artificial Intelligence Laboratory
545 Technology Square, #920
Cambridge, MA 02139
|
|
By adding a tracking mechanism to the output
of the face detector and then classifying these outputs, we have been able to have the
system mimic yes/no head nods of the caregiver (that is, when the caretaker nods yes, the
robot responds by nodding yes). The face detection module produces a stream of face
locations at 20Hz. An attentional marker is attached to the most salient face stimulus,
and the location of that marker is tracked from frame to frame. If the position of the
marker changes drastically, or if no face is determined to be salient, then the tracking
routine resets and waits for a new face to be acquired. Otherwise, the motion of the
attentional marker for a fixed-duration window is classified into one of three static
classes: the yes class, the no class, or the no-motion class. Two metrics are used to
classify the motion, the cumulative sum of the displacements between frames (the relative
displacement over the time window) and the cumulative sum of the absolute values of the
displacements (the total distance traveled by the marker). If the horizontal total trip
distance exceeds a threshold (indicating some motion), and if the horizontal cumulative
displacement is below a threshold (indicating that the motion was back and forth around a
mean), and if the horizontal total distance exceeds the vertical total distance, then we
classify the motion as part of the no class. Otherwise, if the vertical cumulative total
trip distance exceeds a threshold (indicating some motion), and if the vertical cumulative
displacement is below a threshold (indicating that the motion was up and down around a
mean), then we classify the motion as part of the yes class. All other motion types
default to the no-motion class. These simple classes then drive fixed-action patterns for
moving the head and eyes in a yes or no nodding motion. While this is a very simple form
of imitation, it is highly selective. Merely producing horizontal or vertical movement is
not sufficient for the head to mimic the actionthe movement must come from a
face-like object.
|