Choose 1 of the 2 projects. If you choose the first
project, you must do one of the optional parts. The second project is
more fixed. Groups of up to 2 are allowed. Due April 11.
Secure ftp site:
to download images from crunchy. sftp: crunchy.cs.wustl.edu username: cse559 Password: Pl3ss559 (as in Pless559, in 'leet speak) lands you in a directory with one subfolder, named webcams, which has folders names 1 through 39. avi files are also available for camera camera 04 and camera 33.
In the project you are going to write a pca algorithm that identifies the principle components and the coefficients for a set of images.
Base Option. Take a set of images, show the first 10 principle component images. Challenges that you need to solve and then write about:
What features are highlighted by these principle components? Why?
For the 3 most important components, plot the coefficients as a function of time. Do these coefficient plots have the same pattern? Why or why not?
Find a data set at least as interesting as: 40 people times 10 images each.
For each image in the test set, use a nearest neighbor classifier in order to classify it. (that is, project it onto your basis set, compute the distance to the coefficients of the training images, and give it the label of the closest image). Define a threshold so that if the new coefficients are not "close enough" to any training image coefficients, then you label that image as not in the training set. Show the error rates including (a) images in database correctly defined, (b) images in the database mis-recognized, (c) images in the database not recognized (believed to not be in the database), and (d) images of people not in the training set incorrectly matched to a person in the database. Show this chart as a function of the number of principle components used (perhaps using 2,4,6,8,10 pc's).
The principle components captured from a fixed camera highlight related areas of a scene. This option is open ended and asks you to explore ways of visualizing these principle components in a single image, rather than showing several images. One way I've done this is to take three principle components, and make a "false color" image using one of the PC's to define the red component, one to define the blue component and so on. Here you can present a "diary" format of alternative approaches that you explore to visualize several basis images at once, and discuss how each addresses the goal of "visualizing the variation in the scene".
For any interesting collection of images with at least 4 categories, such as: http://www.paaonline.net/benchmarks/minerva/