public class API
extends java.lang.Object
Constructor and Description |
---|
API() |
Modifier and Type | Method and Description |
---|---|
java.util.Map<java.lang.Integer,java.lang.String> |
cmpFingerprintsToText(int threshold,
java.lang.String match,
java.util.Optional<Dictionary> dictionary,
java.lang.String documentContent,
java.util.Vector<java.lang.String> fingerprintFileNames)
1F_REQ1
Compare list of fingerprint files to a text document
|
void |
cmpFingerprintToDirectory(java.lang.Integer threshold,
java.lang.String match,
java.lang.String fingerprintFile,
java.lang.String directory)
1F_REQ2
Compare a fingerprint file to a directory
|
void |
createDictAndTrim(java.lang.Double minIDF,
java.lang.Double maxIDF,
java.lang.String config,
java.lang.String directory,
java.lang.String dictionary)
1F_REQ3
Create a dictionary and normalize (trim) its IDF range (0-1]
|
void |
createFingerprint(java.lang.String fingerprintFile,
java.lang.String dictionary,
java.lang.String outputFile)
1F_REQ4
Create fingerprint for file
NOTE --- The fingerprinter is [default] BitVectorFingerprinter
|
public java.util.Map<java.lang.Integer,java.lang.String> cmpFingerprintsToText(int threshold, java.lang.String match, java.util.Optional<Dictionary> dictionary, java.lang.String documentContent, java.util.Vector<java.lang.String> fingerprintFileNames)
threshold
- the minimum score required for a matchmatch
- name of the matcher to use for this comparisondictionary
- an optional dictionary that can be set if the fingerprint does not contain a dictionarydocumentContent
- text content of the document to be comparedfingerprintFileNames
- list of the fingerprint file names to compare with the given text documentpublic void cmpFingerprintToDirectory(java.lang.Integer threshold, java.lang.String match, java.lang.String fingerprintFile, java.lang.String directory)
threshold
- minimum score required to be considered a matchmatch
- name of the matcher to use for this comparisonfingerprintFile
- filename of the fingerprint file to compare with this directorydirectory
- directory path to compare with this fingerprintpublic void createDictAndTrim(java.lang.Double minIDF, java.lang.Double maxIDF, java.lang.String config, java.lang.String directory, java.lang.String dictionary)
minIDF
- minimum normalized IDF to keep in trimmed dictionary 0.0 (default 0.0)maxIDF
- maximum normalized IDF to keep in trimmed dictionary (default 1.0)config
- filename of the tokenizer configuration filedirectory
- directory containing a set of sample filesdictionary
- output dictionary filenamepublic void createFingerprint(java.lang.String fingerprintFile, java.lang.String dictionary, java.lang.String outputFile)
fingerprintFile
- the file to fingerprintdictionary
- the dictionary filenameoutputFile
- output fingerprint filename