Package | Description |
---|---|
edu.georgetown.gucs.dictionary | |
edu.georgetown.gucs.fingerprinter | |
edu.georgetown.gucs.matcher | |
sdtext |
Class and Description |
---|
Dictionary
Creates a list of unique tokens extracted from a collection of documents that can be trimmed by removing tokens based
on various different attributes; used for creating fingerprints of documents that are based on words that appear in a
document collection
|
DictionaryEntry
Statistics kept on a per-token basis within a Dictionary
|
Class and Description |
---|
Dictionary
Creates a list of unique tokens extracted from a collection of documents that can be trimmed by removing tokens based
on various different attributes; used for creating fingerprints of documents that are based on words that appear in a
document collection
|
Class and Description |
---|
Dictionary
Creates a list of unique tokens extracted from a collection of documents that can be trimmed by removing tokens based
on various different attributes; used for creating fingerprints of documents that are based on words that appear in a
document collection
|
Class and Description |
---|
Dictionary
Creates a list of unique tokens extracted from a collection of documents that can be trimmed by removing tokens based
on various different attributes; used for creating fingerprints of documents that are based on words that appear in a
document collection
|