Class | Description |
---|---|
AnCoraPOSStats |
A utility to build unigram part-of-speech tagging data from XML
corpus files from the AnCora corpus.
|
AnCoraProcessor |
A tool which accepts raw AnCora-3.0 Spanish XML files and produces
normalized / pre-processed PTB-style treebanks for use with CoreNLP
tools.
|
MultiWordPreprocessor |
Clean up an AnCora treebank which has been processed to expand multi-word
tokens into separate leaves.
|
MultiWordTreeExpander |
Provides routines for "decompressing" further the expanded trees
formed by multiword token splitting.
|