| Class | Description |
|---|---|
| AnCoraPOSStats | A utility to build unigram part-of-speech tagging data from the XML files of the AnCora corpus. |
| AnCoraProcessor | A tool which accepts raw AnCora-3.0 Spanish XML files and produces normalized / pre-processed PTB-style treebanks for use with CoreNLP tools. |
| MultiWordPreprocessor | Clean up an AnCora treebank which has been processed to expand multi-word tokens into separate leaves. |
| MultiWordTreeExpander | Provides routines for further "decompressing" the expanded trees formed by multiword token splitting. |