edu.stanford.nlp.international.spanish.pipeline (Stanford JavaNLP API)

Class Summary
Class	Description
AnCoraPOSStats	A utility to build unigram part-of-speech tagging data from XML corpus files from the AnCora corpus.
AnCoraProcessor	A tool which accepts raw AnCora-3.0 Spanish XML files and produces normalized / pre-processed PTB-style treebanks for use with CoreNLP tools.
MultiWordPreprocessor	Clean up an AnCora treebank which has been processed to expand multi-word tokens into separate leaves.
MultiWordTreeExpander	Provides routines for "decompressing" further the expanded trees formed by multiword token splitting.

Package edu.stanford.nlp.international.spanish.pipeline