CSE 515A - Fall 2005 - Intelligent Data Analysis

Instructor:  Weixiong Zhang

Location: Crow 205
Time: Monday and Wednesday, 1:00 - 2:30 pm

Prereq:  CS241 (or CS514A or CS501 and CS502) and SSM 326A (or Math 320), or their equivalent, or permission of the instructor

Text book:
    R.E. Neapolitan, Learning Bayesian Networks, Prentice Hall, 2003, ISBN: 0130125342
Reference book:
   
C. Borgelt and R. Kruse, Graphical Models: Methods for Data Analysis and Mining, John Wiley & Sons, 2002, ISBN: 0470843373

Newsgroup: tba (course related announcement and discussions will be posted there)

Instructor office hours: Friday 3-4pm, Jolley Hall 506, or by appointment


Because we will cover many topics that are not in the textbook nor the reference book, it is critical to attend every lecture. 


Brief description (in course listings)
Lecture and topic schedule
Additional information (project and final grade)
Collaboration policy


Brief description:

We very often cry for knowledge while immersed with huge amount of data.  Finding models intrinsic to the production of data we collect and patterns characteristic to the nature of observations we make is of fundamental and practical importance.  In this course, we study various advanced techniques (e.g., graphical models and spectral graph theory) from computer science, artificial intelligence and statistics for analyzing large quantity of data.  We consider applications in selected domains, such as computational biology and text mining on the web. 

Note: 1) In this semester, we will consider problems in genomics and biology. 2) This course can be considered as a continuation and extension of CSE 514A Datamining.  It will be very helpful if you have taken that course first, although that is not a prerequisite.


Lecture and topic schedule (Note: this is just an outline; details to be added later.)

******** Uncertainty and independency ********

******** Graphical models of Uncertainty and independency ********

******** Learning graphical models ********

******** Feature extraction, feature selections and model building ********

******** Time series analysis ********


Information on project and grading

In addition to the presentation slides, every student must submit a final report of his/her project.  The following items must be covered in detail: Problem description, data and method used, detail of existing algorithms, design and implementation of your own algorithm, algorithm analysis and comparison, result analysis and future work.
The following scale will be applied to compute the final grade from your total points earned:

Policy on collaboration

When solving your homework problems and working on your project, you may discuss HIGH-LEVEL approaches to the homework problems with your classmates, HOWEVER, you are to work out all details of any solutions discussed and write up the solution completely on your own. In particular, when working with a student on an assigned homework problem you should do so verbally -- Nothing should be written. Remember to keep your discussion at a high-level so that everyone can work out the details on their own. Also you must clearly acknowledge anyone (except the instructor) with whom you discussed any problem and say briefly what you discussed.

Please keep any discussions you have with other students to a small group of no more than 3 students and be sure that each of you are equally involved. If you just listen in and are then able to understand and write up the solution you have missed at least half of the benefit of the homework. It is really important to work through the process of recognizing when you are heading the wrong way and learning how to work through the problem solving process.

Violations of any of the above rules will be dealt with harshly! The homework problems and projects are designed to help you learn the material being taught. Being told the solution and understanding it is VERY different from working through the process of actually finding a solution. If you do not take an active role in the process of solving the homework problems and project, then you won't get much out of it, hence you won't learn the material.


Created by Weixiong Zhang, August, 2005.