edu.georgetown.gucs.experiment
Class DuplicateDataSet

java.lang.Object
  extended by edu.georgetown.gucs.experiment.DuplicateDataSet

public class DuplicateDataSet
extends java.lang.Object

Creates copies of files in a dataset to insure known matches to find and compare against. Running this class as a stand alone will create a sub-directory in the dataset called copies and will create entries in the database so that each copy is shown as matching the original.

Author:
Clay Shields

Constructor Summary
DuplicateDataSet(java.lang.String dataset, java.lang.String directory)
          Constructor that takes the dataset and directory names and performs the copy and database operations
 
Method Summary
static void main(java.lang.String[] args)
          Creates a sub-directory called copies, copies all files there, and adds the copies to the matches in the database
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

DuplicateDataSet

public DuplicateDataSet(java.lang.String dataset,
                        java.lang.String directory)
Constructor that takes the dataset and directory names and performs the copy and database operations

Parameters:
dataset - the name of the dataset (and database name)
directory - the directory containing the dataset files
Method Detail

main

public static void main(java.lang.String[] args)
Creates a sub-directory called copies, copies all files there, and adds the copies to the matches in the database

Parameters:
args - array of string command line arguments
args[0] the name of the dataset (and database name)
args[1] the directory containing the dataset files