|
CSCI 567 Home Syllabus Schedule Resources Blackboard |
Machine Learning (CSCI-567)Fall 2008
General Information
News
Course DescriptionThis course will present an introduction to algorithms for machine learning and data mining. These algorithms lie at the heart of many leading edge computer applications including optical character recognition, speech recognition, text mining, document classification, pattern recognition, computer intrusion detection, and information extraction from web pages. Every machine learning algorithm has both a computational aspect (how to compute the answer) and a statistical aspect (how to ensure that future predictions are accurate). Algorithms covered include linear classifiers (Gaussian maximum likelihood, Naive Bayes, and logistic regression) and non-linear classifiers (neural networks, decision trees, support-vector machines, nearest neighbor methods). The class will also introduce techniques for learning from sequential data and advanced ensemble methods such as bagging and boosting.Prerequisites: basic knowledge of search algorithms, probability, statistics, calculus, data structures, search algorithms (gradient descent, depth-first search, greedy algorithms), linear algebra. Some AI background is recommended, but not required. Textbook:
Course HandoutsSoftwareIn this class, we will be using the WEKA of Waikato (Hamilton, New Zealand). This is a package of machine learning algorithms and data sets that is very easy to use and easy to extend.
http://www.cs.waikato.ac.nz/ml/weka/. You will need java 1.4+ installed.
Homework Assignments
Please turn in all homework in two forms: (i) as hardcopy at the start of class and if applicable: (ii) electronically to TA and instructor.
Written Homework and Programs are due at the beginning of class.
Each student is responsible for his/her own work. The standard departmental rules for academic dishonesty apply to all assignments in this course. Collaboration on homeworks and programs should be limited only to answering questions that can be asked and answered without using any written medium (e.g., no pencils, instant messages, or email). |
||||||||||||||||||||||||||||||||||||||||||