This is a quick experiment applying computer logic, in which the computer loses track of a person when it can’t detect the face properly, to a human interaction, sans the graphics indicating what is going on. Applying the same concept to a larger situation, or doing it live, would be really interesting to see as a next step.
This compilation of computer vision is really interesting to watch if you imagine seeing it without the source footage behind the computer graphics. I really like the description Kyle McDonald gives in the comments of the aesthetic qualities of computer vision graphics and will add to it myself:
“color choices tend toward high contrast saturated primaries (easy colors to code), almost no text (it’s never descriptive, only enumerative), trails are used to show history, ellipses and rectangle/bounding boxes are used as placeholders for complex shapes…”