java.lang.Object
  +--edu.stanford.nlp.process.WordToSentenceProcessor
Transforms a Document of Words into a Document of Sentences by grouping the Words.
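The grouping described above can be illustrated with a minimal, self-contained sketch. It is not the Stanford implementation: it uses plain `List<String>` values in place of `Document`, `Word`, and `Sentence`, and the class name `SentenceGroupingSketch`, the method `group`, and the default boundary set `.`, `?`, `!` are all assumptions made for illustration (this page does not list the actual defaults).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

/** Hypothetical illustration of word-to-sentence grouping; not the Stanford class. */
final class SentenceGroupingSketch {

    /** Assumed default boundary tokens; the real default list is not given on this page. */
    static final Set<String> DEFAULT_BOUNDARIES =
            new HashSet<>(Arrays.asList(".", "?", "!"));

    /** Groups words into sentences, closing a sentence after each boundary token. */
    static List<List<String>> group(List<String> words, Set<String> boundaryTokens) {
        List<List<String>> sentences = new ArrayList<>();
        List<String> current = new ArrayList<>();
        for (String word : words) {
            current.add(word);
            if (boundaryTokens.contains(word)) {
                // The boundary token stays in the sentence it terminates.
                sentences.add(current);
                current = new ArrayList<>();
            }
        }
        if (!current.isEmpty()) {
            // Trailing words with no final boundary token form a last sentence.
            sentences.add(current);
        }
        return sentences;
    }

    public static void main(String[] args) {
        List<String> words =
                Arrays.asList("It", "rained", ".", "Did", "you", "see", "?", "No");
        // prints [[It, rained, .], [Did, you, see, ?], [No]]
        System.out.println(group(words, DEFAULT_BOUNDARIES));
    }
}
```

The sketch assumes the input is already tokenized, so punctuation arrives as separate tokens; that is why it only compares whole tokens against the boundary set.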
Constructor Summary
WordToSentenceProcessor()
Create a WordToSentenceProcessor using a sensible default list of tokens to split on.
WordToSentenceProcessor(Set boundaryTokens)
Specify the set of acceptable sentence boundary tokens.
WordToSentenceProcessor(Set boundaryTokens, Set boundaryFollowers)
Specify the set of acceptable sentence boundary tokens and the set of tokens that commonly follow sentence boundaries.
WordToSentenceProcessor(Set boundaryTokens, Set boundaryFollowers, Set boundaryToDiscard)
Specify the set of acceptable sentence boundary tokens, the set of tokens that commonly follow sentence boundaries, and the set of tokens that are sentence boundaries but should be discarded.
Method Summary
static void main(String[] args)
Prints the text of a file or URL as sentences.
Document process(Document words)
Returns a new Document where each element is a Sentence built from a run of Words in the input Document.
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Constructor Detail
public WordToSentenceProcessor()
Create a WordToSentenceProcessor using a sensible default list of tokens to split on.
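public WordToSentenceProcessor(Set boundaryTokens)
Specify the set of acceptable sentence boundary tokens.
public WordToSentenceProcessor(Set boundaryTokens, Set boundaryFollowers)
Specify the set of acceptable sentence boundary tokens and the set of tokens that commonly follow sentence boundaries.
public WordToSentenceProcessor(Set boundaryTokens, Set boundaryFollowers, Set boundaryToDiscard)
Specify the set of acceptable sentence boundary tokens, the set of tokens that commonly follow sentence boundaries, and the set of tokens that are sentence boundaries but should be discarded.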
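One plausible reading of how the three sets might interact is sketched below. This is an interpretation, not the library's code: it assumes that a boundary token ends a sentence and is kept, that a follower token appearing directly after a boundary is attached to the sentence that just ended, and that a discard token ends the current sentence but is itself dropped. The class name `BoundarySetsSketch`, the method `group`, and the example sets are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

/** Hypothetical illustration of the three boundary sets; not the Stanford class. */
final class BoundarySetsSketch {

    static List<List<String>> group(List<String> words,
                                    Set<String> boundaryTokens,
                                    Set<String> boundaryFollowers,
                                    Set<String> boundaryToDiscard) {
        List<List<String>> sentences = new ArrayList<>();
        List<String> current = new ArrayList<>();
        boolean justClosed = false; // true right after a boundary token ended a sentence
        for (String word : words) {
            if (justClosed && boundaryFollowers.contains(word)) {
                // Assumption: a follower (e.g. a closing bracket) joins the previous sentence.
                sentences.get(sentences.size() - 1).add(word);
                continue;
            }
            justClosed = false;
            if (boundaryToDiscard.contains(word)) {
                // Assumption: a discard token ends the sentence but is not emitted.
                if (!current.isEmpty()) {
                    sentences.add(current);
                    current = new ArrayList<>();
                }
                continue;
            }
            current.add(word);
            if (boundaryTokens.contains(word)) {
                sentences.add(current);
                current = new ArrayList<>();
                justClosed = true;
            }
        }
        if (!current.isEmpty()) {
            sentences.add(current);
        }
        return sentences;
    }

    public static void main(String[] args) {
        Set<String> boundaries = new HashSet<>(Arrays.asList(".", "?", "!"));
        Set<String> followers = new HashSet<>(Arrays.asList(")", "\""));
        Set<String> discard = new HashSet<>(Arrays.asList("\n"));
        List<String> words =
                Arrays.asList("Title", "\n", "(", "Stop", "!", ")", "Go", "on");
        // prints [[Title], [(, Stop, !, )], [Go, on]]
        System.out.println(group(words, boundaries, followers, discard));
    }
}
```

In this sketch the newline token splits `Title` from what follows without appearing in the output, while the closing parenthesis is kept with the sentence ended by `!`.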
Method Detail
public Document process(Document words)
Specified by: process in interface Processor
See Also: #sentenceBoundaryTokens, Sentence, PTBTokenizer
public static void main(String[] args)
Parameters: args - Command line argument: a file or URL