- call() - Method in class edu.georgetown.gucs.fingerprinter.BulkFingerprinterWorker
-
- call() - Method in class edu.georgetown.gucs.matcher.BulkMatcherWorker
-
- changePuncHelper(Vector<Character>, int, int) - Method in class edu.georgetown.gucs.tokenizers.FileManglerTokenizer
-
- changeToChar(Vector<Token>) - Method in class edu.georgetown.gucs.tokenizers.FileManglerTokenizer
-
- changeToken(Vector<Token>) - Method in class edu.georgetown.gucs.tokenizers.FileManglerTokenizer
-
- changeToPunc(Vector<Token>) - Method in class edu.georgetown.gucs.tokenizers.FileManglerTokenizer
-
- checkIn(T) - Method in class edu.georgetown.gucs.bloomfilter.ReusuableObjectPool
-
- checkOut() - Method in class edu.georgetown.gucs.bloomfilter.ReusuableObjectPool
-
- ChineseFileTokenizer - Class in edu.georgetown.gucs.tokenizers
-
Splits contents of a Chinese or Chinese-English text file into tokens using Apache Lucene's
ChineseAnalyzer or SmartChineseAnalyzer.
- ChineseFileTokenizer() - Constructor for class edu.georgetown.gucs.tokenizers.ChineseFileTokenizer
-
Constructor that sets the token creation mode to split by smart tokenization using probabilistic word segmentation
- ChineseFileTokenizer(String) - Constructor for class edu.georgetown.gucs.tokenizers.ChineseFileTokenizer
-
Constructor that sets the token creation mode.
- clear(long) - Method in class edu.georgetown.gucs.bloomfilter.LongBitSet
-
Sets the bit specified by the index to false.
- clear() - Method in class edu.georgetown.gucs.bloomfilter.LongBitSet
-
Sets all of the bits in this BitSet to false.
- clear() - Method in class edu.georgetown.gucs.bloomfilter.LongFastBloomFilter
-
- clear() - Method in class edu.georgetown.gucs.utility.AntlrBitSet
-
Clears all elements in this bitset
- clear(int) - Method in class edu.georgetown.gucs.utility.AntlrBitSet
-
Clears the given element in this bitset
- clearDict() - Method in class edu.georgetown.gucs.fingerprinter.Fingerprint
-
Sets the fingerprint dictionary to null
- clearTokenList() - Method in class edu.georgetown.gucs.fingerprinter.Fingerprint
-
Clears the tokens that were used to create the fingerprint
- CLIOptions - Class in sdtext
-
- CLIOptions() - Constructor for class sdtext.CLIOptions
-
- CLIOptions(String[]) - Constructor for class sdtext.CLIOptions
-
Retrieves the first option, the sdtext program to run, and passes the
- clone() - Method in class edu.georgetown.gucs.bloomfilter.LongBitSet
-
Create a clone of this bit set, that is an instance of the same
class and contains the same elements.
- clone(Dictionary) - Method in class edu.georgetown.gucs.dictionary.Dictionary
-
Creates a deep copy of the given dictionary object and stores it in this dictionary
- clone() - Method in class edu.georgetown.gucs.utility.AntlrBitSet
-
Creates and returns a copy of this AntlrBitSet
- cmpFingerprintsToText(int, String, Optional<Dictionary>, String, Vector<String>) - Method in class sdtext.API
-
1F_REQ1
Compare a list of fingerprint files to a text document
- cmpFingerprintToDirectory(Integer, String, String, String) - Method in class sdtext.API
-
1F_REQ2
Compare a fingerprint file to a directory
- cmpFingToDirectory(Fingerprint, List<Fingerprint>) - Method in class edu.georgetown.gucs.matcher.BulkMatcher
-
- CommandLine - Class in sdtext
-
- CommandLine() - Constructor for class sdtext.CommandLine
-
- CommandLineHelp - Enum in enums
-
Provides the different programs available within SDText (Similarity Digest Text)
- compareAllFingerprints(List<Fingerprint>) - Method in class edu.georgetown.gucs.matcher.BulkMatcher
-
- comparedFile - Variable in class edu.georgetown.gucs.matcher.FingerprintMatcher
-
- compareFile - Variable in class edu.georgetown.gucs.matcher.FingerprintMatcher
-
- CompareFingerprint - Class in edu.georgetown.gucs.matcher
-
Outputs a list of files and their scores, comparing a given fingerprint to a directory of files.
- CompareFingerprint(String, String, String, int, String) - Constructor for class edu.georgetown.gucs.matcher.CompareFingerprint
-
Constructor that sets the matcher, fingerprint file, directory, minimum score and dictionary to use for this
comparison
- CompareFingerprint(String, String, Vector<Fingerprint>, int, String) - Constructor for class edu.georgetown.gucs.matcher.CompareFingerprint
-
Constructor that sets the matcher, fingerprint file, directory, minimum score and dictionary to use for this
comparison
- compareFingerprintDirectory() - Method in class edu.georgetown.gucs.matcher.CompareFingerprint
-
Compute a fingerprint for each file in this directory and compare the fingerprints to one another using this matcher; if the
matcher returns boolean values, true values are given a score of 100 and false values are given a score of 0
- compute() - Method in class edu.georgetown.gucs.utility.FileListerWorker
-
- computeBloomFilterSpec(long, double) - Static method in class edu.georgetown.gucs.bloomfilter.BloomFilterCalculations
-
- computeFingerprint(String) - Method in class edu.georgetown.gucs.fingerprinter.BitVectorFingerprinter
-
Computes a byte array fingerprint indicating the presence or absence of each token in this dictionary
- computeFingerprint(TokenizerList, String) - Method in class edu.georgetown.gucs.fingerprinter.BitVectorFingerprinter
-
Computes a byte array fingerprint indicating the presence or absence of each token in this dictionary.
Does not read from a file; creates the fingerprint from a string and the tokenizers passed
- computeFingerprint(String) - Method in class edu.georgetown.gucs.fingerprinter.Fingerprinter
-
Computes the fingerprint of this document as a byte array; indicates the presence or absence of each token in this
dictionary
- computeFingerprint(TokenizerList, String) - Method in class edu.georgetown.gucs.fingerprinter.Fingerprinter
-
Computes the fingerprint of this document as a byte array; indicates the presence or absence of each token in this
dictionary
- computeFingerprint(String) - Method in class edu.georgetown.gucs.fingerprinter.SdhashFingerprinter
-
Tokenizes a file in order to create a fingerprint
- computeFingerprintXML(String) - Method in class edu.georgetown.gucs.fingerprinter.Fingerprinter
-
Computes the fingerprint of a document as a Base64 encoded string; indicates the presence or absence of each token
in this dictionary
- Config - Class in edu.georgetown.gucs.configurations
-
- Config(String) - Constructor for class edu.georgetown.gucs.configurations.Config
-
- contains(byte[]) - Method in class edu.georgetown.gucs.bloomfilter.LongFastBloomFilter
-
- containsToken(Token) - Method in class edu.georgetown.gucs.dictionary.Dictionary
-
Determines if this dictionary contains a particular token
- CosineSimilarityFingerprintMatcher - Class in edu.georgetown.gucs.matcher
-
- CosineSimilarityFingerprintMatcher() - Constructor for class edu.georgetown.gucs.matcher.CosineSimilarityFingerprintMatcher
-
Constructor that sets the minimum score to use for matching two fingerprints to zero
- CosineSimilarityFingerprintMatcher(int) - Constructor for class edu.georgetown.gucs.matcher.CosineSimilarityFingerprintMatcher
-
Constructor that sets the minimum score to use for matching two fingerprints
- CosineSimilarityFingerprintMatcher(String) - Constructor for class edu.georgetown.gucs.matcher.CosineSimilarityFingerprintMatcher
-
Constructor that sets the minimum score to use for matching two fingerprints.
- COUNT - Static variable in class edu.georgetown.gucs.utility.Global
-
Location of mangler count in array of experiment/trial results
- create() - Method in interface edu.georgetown.gucs.bloomfilter.Factory
-
- createDictAndTrim(Double, Double, String, String, String) - Method in class sdtext.API
-
1F_REQ3
Create a dictionary and normalize (trim) its IDF range (0, 1]
- createFingerprint(String, String, String) - Method in class sdtext.API
-
1F_REQ4
Create a fingerprint for a file
NOTE: The default fingerprinter is BitVectorFingerprinter
- createPartialDictionary(List<File>, File) - Method in class edu.georgetown.gucs.dictionary.Dictionary
-
Creates this dictionary using the optional count or percent in the given fileLister; if no count or percent is
specified, uses the entire fileLister to create the dictionary
- createTokens(int) - Method in class edu.georgetown.gucs.tokenizers.ParseTokenizers
-
- creatingProgram - Variable in class edu.georgetown.gucs.configurations.FingerprintConfiguration
-
The name of the program that created the fingerprint
- creatingProgram - Variable in class edu.georgetown.gucs.fingerprinter.Fingerprinter
-
The name of the program that created the fingerprint
- creator - Variable in class edu.georgetown.gucs.configurations.FingerprintConfiguration
-
The person or organization that created this fingerprint
- creator - Variable in class edu.georgetown.gucs.fingerprinter.Fingerprinter
-
The person or organization that created this fingerprint
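
Illustrative sketch (not part of the SDText API). Several entries above describe a fingerprint as a byte array marking the presence or absence of each dictionary token, with matchers producing scores between 0 and 100. The Java below is a minimal, self-contained sketch of that idea under stated assumptions: the class FingerprintSketch, its method bodies, the whitespace tokenization, the bit layout, and the rounding of the cosine score are all hypothetical and are not taken from BitVectorFingerprinter or CosineSimilarityFingerprintMatcher. Only the mapping of boolean results to 100 and 0 comes from the compareFingerprintDirectory() description.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration only; none of these names exist in SDText.
class FingerprintSketch {

    // Builds a bit vector in which bit i is set when dictionary token i
    // occurs in the text. Assumption: tokens are lower-cased, whitespace-separated words.
    static byte[] computeFingerprint(List<String> dictionary, String text) {
        Set<String> present = new HashSet<>(Arrays.asList(text.toLowerCase().split("\\s+")));
        byte[] bits = new byte[(dictionary.size() + 7) / 8];
        for (int i = 0; i < dictionary.size(); i++) {
            if (present.contains(dictionary.get(i))) {
                bits[i / 8] |= (byte) (1 << (i % 8));
            }
        }
        return bits;
    }

    // Cosine similarity of two binary vectors, scaled to an integer score in 0..100.
    // For binary vectors the dot product is the number of bits set in both.
    static int cosineScore(byte[] a, byte[] b) {
        if (a.length != b.length) {
            throw new IllegalArgumentException("fingerprints must have the same length");
        }
        int both = 0, onesA = 0, onesB = 0;
        for (int i = 0; i < a.length; i++) {
            both += Integer.bitCount(a[i] & b[i] & 0xFF);
            onesA += Integer.bitCount(a[i] & 0xFF);
            onesB += Integer.bitCount(b[i] & 0xFF);
        }
        if (onesA == 0 || onesB == 0) {
            return 0; // an empty fingerprint matches nothing
        }
        return (int) Math.round(100.0 * both / Math.sqrt((double) onesA * onesB));
    }

    // Boolean matcher results mapped to scores, as the compareFingerprintDirectory() entry describes.
    static int booleanToScore(boolean matched) {
        return matched ? 100 : 0;
    }

    public static void main(String[] args) {
        List<String> dictionary = Arrays.asList("alpha", "beta", "gamma", "delta");
        byte[] first = computeFingerprint(dictionary, "alpha beta");
        byte[] second = computeFingerprint(dictionary, "beta gamma");
        System.out.println(cosineScore(first, second)); // prints 50
    }
}
```

In this sketch a caller would keep only those results whose score meets a chosen minimum, which is one plausible reading of the "minimum score" constructor arguments listed above; the actual thresholding behavior of the SDText matchers is not documented in this index.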