COSC 388: Machine Learning

Project 4
Fall 2003

Due: Nov 17 @ 5 P.M.
13 points

  1. Implement Mitchell's perceptron, either version. Train and test your implementation using the data set iris-p4.arff, which I'll describe in lecture.

    The implementation must be general, meaning it must work with any data set with two classes and binary attributes.

    Everything must compile and run on gusun, cssun, or daruma.

    Include in the archive everything needed to compile, run the programs, and reproduce the results.

    In a text file named README, include the results of the evaluation and instructions about how to execute your program and reproduce the results.

  2. Download Weka. Get it running. Let me know if you have any problems.

    Run J48 and naive Bayes on the iris data set. (J48 is a Java version of C4.5.) Once you get Weka configured, you should be able to type on the command line

    java weka.classifiers.NaiveBayes -t iris-p4.arff
    
    and
    java weka.classifiers.j48.J48 -t iris-p4.arff
    
    This will automatically conduct a ten-fold cross-validation.

Instructions for Submission: In the header comments, provide the following information:

//
// Name
// E-mail Address
// Platform: Windows, OS X, Redhat, Solaris (cssun/gusun/daruma), etc.
// Development Environment: gcc, g++, java, g77, etc.
// Mail Client: mailx, pine, GUMail, Netscape, Yahoo!, etc.
//
When you are ready to submit your project for grading, create a compressed archive of a directory containing the files of project and send it to me by e-mail as an attachment, as you did for Project 1.

Submit your project before 5:00 P.M. on the due date.

Once submitted, it is important to keep an electronic copy of your project on either cssun or gusun. These systems are regularly backed-up, and if we lose your project or the e-mail system breaks, then we will need to look at the modification date and time of your project to ensure that you submitted it before it was due. If you developed your code on a Windows machine, then use a secure ftp client to transfer your files or the archive to cssun or gusun.

Finally, when storing source code on university machines, it is important to set file permissions so others cannot read the file. To turn off such read/write permissions, type at the UNIX prompt chmod og-rw <file>, where <file> is the name of your source file.