Package | Description |
---|---|
edu.georgetown.gucs.dictionary | |
edu.georgetown.gucs.fingerprinter |
Modifier and Type | Method and Description |
---|---|
TokenizerList |
Dictionary.getTokenizers()
Provides the tokenizerList object used to create this dictionary
|
Modifier and Type | Method and Description |
---|---|
void |
Dictionary.setTokenizers(TokenizerList tokenizerList)
Loads the tokenizers to use for this dictionary from a TokenizerList object.
|
Modifier and Type | Method and Description |
---|---|
java.util.List<Fingerprint> |
Fingerprinter.computeFingerprint(TokenizerList list,
java.lang.String str)
Computes the fingerprint of this document as a byte array; indicates the presence or absence of each token in this
dictionary
|
java.util.List<Fingerprint> |
BitVectorFingerprinter.computeFingerprint(TokenizerList tokenizer,
java.lang.String str)
Computes a byte array fingerprint indicating the presence or absence of each token in this dictionary
does not read from a file -- creates fingerprints from a string and the tokenizers passed
|
void |
SdhashFingerprinter.setTokenizers(TokenizerList newTokenizers)
Tokenizes a file based on a list of tokenizers that are passed to the function
|