edu.stanford.nlp.process
Class SentenceRetokenizingProcessor

java.lang.Object
  extended byedu.stanford.nlp.process.SentenceRetokenizingProcessor

public class SentenceRetokenizingProcessor
extends Object

Transforms a Document of Words into a Document of Sentences by grouping the Words.

Author:
Teg Grenager (grenager@stanford.edu)

Constructor Summary
SentenceRetokenizingProcessor(TokenizerFactory tokenizerFactory)
          Create a SentenceRetokenizingProcessor that uses the TokenizerFactory tokenizerFactory.
 
Method Summary
 List process(List input)
          Returns a new List where each element is a Sentence from the List input, retokenized with the tokenizer provided at construction.
 Sentence retokenize(Sentence oldS)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

SentenceRetokenizingProcessor

public SentenceRetokenizingProcessor(TokenizerFactory tokenizerFactory)
Create a SentenceRetokenizingProcessor that uses the TokenizerFactory tokenizerFactory.

Method Detail

process

public List process(List input)
Returns a new List where each element is a Sentence from the List input, retokenized with the tokenizer provided at construction. Input must be of class List. Specifically, concatenates the old Sentences with whitespace, and then applies the tokenizer anew.


retokenize

public Sentence retokenize(Sentence oldS)


Stanford NLP Group