Recent Advances in Computer Vision

By Roland Piquepaille

Computer vision technology is a branch of artificial intelligence that focuses on providing computers with the functions typical of human vision.

According to researchers at the University of Sydney, Australia, human-centered applications for human-computer interfaces (HCIs), augmented perception, automatic media interpretation, and video surveillance are emerging. Here is one example.

The vOICe, developed at Philips Research Laboratories, provides a simple yet effective means of augmented perception for people with partially impaired vision. In the virtual demonstration, the camera accompanies you in your wanderings. The camera periodically scans the scene in front of you and turns images into sounds, using different pitches and lengths to encode objects' position and size.

Now let's look at the Face Detection Project from Carnegie Mellon University (CMU).

The use of computer vision for automatic media interpretation assists users in searching for specific scenes and shots otherwise not annotated in the video-scene indexes. For example, images containing faces can be automatically distinguished from other images, as the results of the Face Detection Projectat CMU prove. The CMU face detector is considered the most accurate for frontal face detection and is also reliable for facial profiles and three-quarter images.

And of course, there is video surveillance. These researchers designed a system to track suspicious pedestrian behavior in parking lots. Here is a summary of their approach.

The first step consists of detecting all the moving objects in the scene by subtracting an estimated "background image" -- one that represents only the static objects in the scene -- from the current frame. The next step is to distinguish people from moving vehicles on the basis of a form factor, such as the height: width ratio, and to locate their heads as the top region in their silhouette. In this way, the head's speed at each frame is automatically determined. Then, a series of speed samples are repeatedly measured for each person in the scene. Each series covers an interval of about 10 s, which is enough to detect suspicious behavior patterns.
Finally, a neural network classifier, trained to recognize the suspicious behaviors, provides the behavior classification. In the experiments we performed, the system achieved good accuracy, with a reasonably limited number of false dismissals and false alarms -- 4% and 2%, respectively, among more than 100 test samples.

For more information, figures and references, please read this excellent article.

Source: Massimo Picardi and Tony Jan, in The Industrial Physicist, February/March 2003, Volume 9, Number 1


Famous quotes containing the words vision, advances and/or computer:

    We might hypothetically possess ourselves of every technological resource on the North American continent, but as long as our language is inadequate, our vision remains formless, our thinking and feeling are still running in the old cycles, our process may be “revolutionary” but not transformative.
    Adrienne Rich (b. 1929)

    The Church disowned, the tower overthrown, the bells upturned, what have we to do
    But stand with empty hands and palms turned upwards
    In an age which advances progressively backwards?
    —T.S. (Thomas Stearns)

    The computer takes up where psychoanalysis left off. It takes the ideas of a decentered self and makes it more concrete by modeling mind as a multiprocessing machine.
    Sherry Turkle (b. 1948)