public class StripPunctuationTokenizer extends Tokenizer
positions, tokenVector
Constructor and Description |
---|
StripPunctuationTokenizer() |
Modifier and Type | Method and Description |
---|---|
void |
tokenize(java.util.Iterator<java.lang.String> iterator)
Separates tokens based on punctuation and removes punctuation from tokens
|
void |
tokenize(java.util.Iterator<java.lang.String> tokensIterator,
java.util.Iterator<Pair<java.lang.Integer,java.lang.Integer>> positionsIterator)
Separates tokens based on punctuation and removes punctuation from tokens
|
getPositionsVector, getTokenVector, iterator, position_iterator, printTokens, tokenize
public void tokenize(java.util.Iterator<java.lang.String> iterator)
public void tokenize(java.util.Iterator<java.lang.String> tokensIterator, java.util.Iterator<Pair<java.lang.Integer,java.lang.Integer>> positionsIterator)