public class BitVectorFingerprinter extends Fingerprinter
Fields inherited from class Fingerprinter: base64Fingerprint, byteRun, creatingProgram, creator, dictionary, diskImage, fingerprintName, GUID, showDataSource, showDictionary, showDigest, systemID, targetFile, tokenizers, unknownTokens, version, volume
Constructor and Description |
---|
BitVectorFingerprinter(): Constructor that generates the fingerprint name, version, unique identifier (GUID), system identifier, and creating program. |
BitVectorFingerprinter(java.lang.String dictionaryFilename): Constructor that loads a dictionary and its tokenizers. |
Modifier and Type | Method and Description |
---|---|
byte[] | computeFingerprint(java.lang.String filename): Computes a byte array fingerprint indicating the presence or absence of each token in this dictionary. |
static void | main(java.lang.String[] args): Creates a bit vector fingerprint representing the presence or absence of terms in a particular dictionary. |
Methods inherited from class Fingerprinter: computeFingerprintXML, generateCreatingProgram, generateXML, getBase64Fingerprint, getCreatingProgram, getDictionary, getFingerprintName, loadFingerprintXML, loadFingerprintXML, outputFields, setCreator, setDictionary, setDictionary, setMangler, setMangler, setManglerRNG, setOutput, setTerse, setVerbose
public BitVectorFingerprinter()
public BitVectorFingerprinter(java.lang.String dictionaryFilename)
dictionaryFilename - the filename containing the dictionary to use for this fingerprinter
public byte[] computeFingerprint(java.lang.String filename)
computeFingerprint in class Fingerprinter
filename - the string filename of the document to fingerprint
public static void main(java.lang.String[] args)
args - array of string command line arguments
args[0] - the filename containing the dictionary to use for this fingerprinter
args[1] - the filename of the document to fingerprint
args[2] - the filename to write the XML output
args[3] - the filename containing the output specifications
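
The computeFingerprint description says the returned byte array indicates the presence or absence of each dictionary token. The following is a minimal, self-contained sketch of that idea, not the library's implementation. The class BitVectorSketch, its in-memory dictionary list, the tokenization rule, and the bit ordering (token 0 in the most significant bit of byte 0) are all assumptions made for illustration; the real class reads its dictionary and tokenizers from a file and may pack bits differently.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical stand-in for illustration only; NOT the library's BitVectorFingerprinter.
final class BitVectorSketch {

    // Returns ceil(n / 8) bytes; bit i is set when dictionary.get(i) occurs in text.
    // Assumption: bit 0 maps to the most significant bit of byte 0.
    static byte[] computeFingerprint(List<String> dictionary, String text) {
        // Assumption: tokens are lowercase runs of ASCII letters and digits.
        Set<String> seen = new HashSet<>(
                Arrays.asList(text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")));
        byte[] bits = new byte[(dictionary.size() + 7) / 8];
        for (int i = 0; i < dictionary.size(); i++) {
            if (seen.contains(dictionary.get(i).toLowerCase(Locale.ROOT))) {
                bits[i / 8] |= 0x80 >>> (i % 8);
            }
        }
        return bits;
    }

    // Reads back bit i using the same ordering as computeFingerprint.
    static boolean isSet(byte[] bits, int i) {
        return (bits[i / 8] & (0x80 >>> (i % 8))) != 0;
    }

    public static void main(String[] args) {
        List<String> dictionary = Arrays.asList("alpha", "beta", "gamma");
        byte[] fp = computeFingerprint(dictionary, "Gamma rays and alpha particles");
        for (int i = 0; i < dictionary.size(); i++) {
            System.out.println(dictionary.get(i) + " -> " + isSet(fp, i));
        }
    }
}
```

With the three-term dictionary in main, the sketch reports alpha and gamma as present and beta as absent, packed into a single byte. A fixed-length bit vector of this kind keeps fingerprints built from the same dictionary directly comparable position by position.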