COSC 270: Artificial Intelligence

Project 5
Fall 2015

Due: Mon, Apr 27 @ 11:59 P.M.
13 points

Implement IREPc as discussed in Sections 2.2 and 2.3 of Cohen's paper entitled Fast Effective Rule Induction. You can use Lisp, Java, C++, Python, or Ruby. You can use only the standard libraries included with the programming language. If you want to use something else, check with me first. I should be able to compile and run your implementation from the command line.

We will talk about rule-selection heuristics in lecture. To make the project tractable, your implementation does not need to handle numeric attributes, it does not need to prune, and it needs to work only with the 1984 Congressional Voting Record. See also the names file. I created a version of this data set in a simplified format: votes.dta. I also put this file on cs-class, which you can retrieve using the command:

cs-class% cp ~maloofm/cosc270/votes.dta ./
Feel free to remove the comments and read the data set into your program, or you can hard-code the data set into your program. You can remove comments and reorganize the data, but you cannot modify the data in any other way.

To evaluate the learned rules, implement the hold-out method, which involves selecting a random set of the original examples as a training set and using the remaining examples as a testing set. Use 75% of the original examples as the training set. The training set serves as input to IREP. Once the program produces a set of rules, then it evaluates the rules on the examples of the test set. Seed the random number generator with the system clock.

The implementation should print the rules it learns from the training examples and the accuracy of those rules on the testing set for both classes (i.e., the overall accuracy) and for each class (i.e., the true-positive and true-negative rates).

Instructions for Electronic Submission

In the header comments of the primary file, provide the following information:
;;;;
;;;; COSC 270 Project 5
;;;; Name
;;;; E-mail Address
;;;; Platform: Windows, Linux (cs-class), etc.
;;;; Language: C, C++, Lisp, Python, PL/I
;;;;
;;;; In accordance with the class policies and Georgetown's Honor Code,
;;;; I certify that, with the exceptions of the course materials and those
;;;; items noted below, I have neither given nor received any assistance
;;;; on this project.
;;;;

If you use Lisp, make sure that I can execute your project from one file; for example, (load "p5.lisp") or even better:

$ clisp p5.lisp >| output

If you use C++, provide a Makefile.

If you use Java, don't use packages or jar files. Just give me the source.

Use Blackboard to submit your assignment. Keep in mind that Blackboard lets you submit only once. Put the files for your project in a single-level directory with the same name as your Net ID. Zip up the folder, and upload the zip file for assignment p5 on Blackboard. If you need to include a message with your submission, put it in a README file in your directory.

Copyright © 2019 Mark Maloof. All Rights Reserved. This material may not be published, broadcast, rewritten, or redistributed.