edu.stanford.nlp.process
Class SentenceRetokenizingProcessor
java.lang.Object
edu.stanford.nlp.process.SentenceRetokenizingProcessor
- public class SentenceRetokenizingProcessor
- extends Object
Transforms a Document of Words into a Document of Sentences by grouping the
Words.
- Author:
- Teg Grenager (grenager@stanford.edu)
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
SentenceRetokenizingProcessor
public SentenceRetokenizingProcessor(TokenizerFactory tokenizerFactory)
- Creates a SentenceRetokenizingProcessor that uses the TokenizerFactory tokenizerFactory.
process
public List process(List input)
- Returns a new List in which each element is a Sentence from the input List, retokenized with the tokenizer provided at construction. The input must be a List of Sentences. Specifically, the words of each old Sentence are concatenated with whitespace, and the tokenizer is then applied anew to the resulting text.
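The join-then-retokenize behavior described for process can be illustrated with a minimal, self-contained sketch. It does not use the Stanford classes: RetokenizeSketch, toyTokenize, and the use of java.util.function.Function as a stand-in for a tokenizer are all assumptions made for illustration, and a sentence is modeled as a plain List of Strings rather than a Sentence.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.Function;

// Hypothetical stand-in for illustration only; this is not the Stanford class.
class RetokenizeSketch {
    // Stand-in for a tokenizer: maps raw text to a list of tokens.
    private final Function<String, List<String>> tokenizer;

    RetokenizeSketch(Function<String, List<String>> tokenizer) {
        this.tokenizer = tokenizer;
    }

    // Joins the old tokens with single spaces, then tokenizes the joined text anew.
    List<String> retokenize(List<String> oldSentence) {
        return tokenizer.apply(String.join(" ", oldSentence));
    }

    // Builds a new list of retokenized sentences; the input list is not modified.
    List<List<String>> process(List<List<String>> input) {
        List<List<String>> output = new ArrayList<>();
        for (List<String> sentence : input) {
            output.add(retokenize(sentence));
        }
        return output;
    }

    // Toy tokenizer (an assumption of this sketch): splits on whitespace and
    // separates a trailing period from each chunk into its own token.
    static List<String> toyTokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String chunk : text.trim().split("\\s+")) {
            if (chunk.isEmpty()) {
                continue; // empty input yields no tokens
            }
            if (chunk.length() > 1 && chunk.endsWith(".")) {
                tokens.add(chunk.substring(0, chunk.length() - 1));
                tokens.add(".");
            } else {
                tokens.add(chunk);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        RetokenizeSketch processor = new RetokenizeSketch(RetokenizeSketch::toyTokenize);
        List<List<String>> input = Arrays.asList(
                Arrays.asList("Hello", "world."),
                Arrays.asList("Bye."));
        // prints [[Hello, world, .], [Bye, .]]
        System.out.println(processor.process(input));
    }
}
```

In this sketch the old token boundaries only matter through the whitespace inserted between them, so the output depends entirely on the tokenizer supplied at construction.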
retokenize
public Sentence retokenize(Sentence oldS)
Stanford NLP Group