public static class WhitespaceTokenizer.WhitespaceTokenizerFactory<T extends HasWord> extends Object implements TokenizerFactory<T>
Constructor and Description |
---|
WhitespaceTokenizerFactory(LexedTokenFactory<T> factory) |
WhitespaceTokenizerFactory(LexedTokenFactory<T> factory, boolean tokenizeNLs) |
WhitespaceTokenizerFactory(LexedTokenFactory<T> factory, String options) |
Modifier and Type | Method and Description |
---|---|
Iterator<T> | getIterator(Reader r): Return an iterator over the contents read from r. |
Tokenizer<T> | getTokenizer(Reader r) |
Tokenizer<T> | getTokenizer(Reader r, String extraOptions) |
static TokenizerFactory<Word> | newTokenizerFactory(): Constructs a new TokenizerFactory that returns Word objects and treats carriage returns as normal whitespace. |
void | setOptions(String options) |
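The factory contract summarized above (hand in a Reader, get back an iterator of whitespace-delimited tokens) can be illustrated with a small standard-library sketch. This is not the library's implementation: `WhitespaceFactorySketch` and its `tokenize` helper are hypothetical names, tokens are plain `String`s rather than `HasWord` objects, and the meaning given to the `tokenizeNLs` flag (each line feed is emitted as a token of its own) is an assumption, since the page above does not describe it.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;

// Hypothetical stand-in, NOT the library class: it only mirrors the
// getIterator(Reader) shape and a tokenizeNLs-style flag.
class WhitespaceFactorySketch {
    // Assumption: true means each '\n' is emitted as its own token.
    private final boolean tokenizeNLs;

    WhitespaceFactorySketch(boolean tokenizeNLs) {
        this.tokenizeNLs = tokenizeNLs;
    }

    // Reads r to the end and returns an iterator over the tokens found.
    Iterator<String> getIterator(Reader r) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        try {
            int c;
            while ((c = r.read()) != -1) {
                char ch = (char) c;
                if (Character.isWhitespace(ch)) {
                    // Any whitespace, including '\r', ends the current token.
                    flush(current, tokens);
                    if (tokenizeNLs && ch == '\n') {
                        tokens.add("\n");
                    }
                } else {
                    current.append(ch);
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        flush(current, tokens); // last token, if the input did not end in whitespace
        return tokens.iterator();
    }

    // Moves a pending non-empty token into the list and resets the buffer.
    private static void flush(StringBuilder current, List<String> tokens) {
        if (current.length() > 0) {
            tokens.add(current.toString());
            current.setLength(0);
        }
    }

    // Demo helper: tokenizes a string and collects the result in a list.
    static List<String> tokenize(String text, boolean tokenizeNLs) {
        List<String> out = new ArrayList<>();
        new WhitespaceFactorySketch(tokenizeNLs)
            .getIterator(new StringReader(text))
            .forEachRemaining(out::add);
        return out;
    }

    public static void main(String[] args) {
        String text = "the cat\r\nsat  down";
        System.out.println(tokenize(text, false).size()); // line break treated as plain whitespace
        System.out.println(tokenize(text, true).size());  // line feed kept as an extra token
    }
}
```

With the flag off, the carriage return and line feed behave like any other separator, which matches the "treats carriage returns as normal whitespace" wording of newTokenizerFactory(). With the flag on, the sketch yields one additional token per line feed.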
public WhitespaceTokenizerFactory(LexedTokenFactory<T> factory)
public WhitespaceTokenizerFactory(LexedTokenFactory<T> factory, String options)
public WhitespaceTokenizerFactory(LexedTokenFactory<T> factory, boolean tokenizeNLs)
public static TokenizerFactory<Word> newTokenizerFactory()
public Iterator<T> getIterator(Reader r)
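Description copied from interface: IteratorFromReaderFactory
Return an iterator over the contents read from r.
Specified by: getIterator in interface IteratorFromReaderFactory<T extends HasWord>
Parameters: r - Where to read objects from
public Tokenizer<T> getTokenizer(Reader r)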
Specified by: getTokenizer in interface TokenizerFactory<T extends HasWord>
public Tokenizer<T> getTokenizer(Reader r, String extraOptions)
Specified by: getTokenizer in interface TokenizerFactory<T extends HasWord>
public void setOptions(String options)
Specified by: setOptions in interface TokenizerFactory<T extends HasWord>