public class SpanishXMLTreeReader extends java.lang.Object implements TreeReader
getPOS(Element)
Constructor and Description |
---|
`SpanishXMLTreeReader(java.lang.String filename, java.io.Reader in, boolean simplifiedTagset, boolean aggressiveNormalization, boolean retainNER, boolean detailedAnnotations, boolean expandElisions, boolean expandConmigo)` Read parse trees from a Reader. |
Modifier and Type | Method and Description |
---|---|
void | `close()` Close the Reader behind this TreeReader. |
static void | `main(java.lang.String[] args)` |
static void | `process(java.io.File file, TreeReader tr, java.util.regex.Pattern posPattern, java.util.regex.Pattern wordPattern, boolean plainPrint)` Read trees from the given file and output their processed forms to standard output. |
Tree | `readTree()` Reads a single tree. |
public SpanishXMLTreeReader(java.lang.String filename, java.io.Reader in, boolean simplifiedTagset, boolean aggressiveNormalization, boolean retainNER, boolean detailedAnnotations, boolean expandElisions, boolean expandConmigo)
filename
-
in
- The Reader
simplifiedTagset
- If `true`, convert part-of-speech labels to a simplified version of the EAGLES tagset, where the tags do not include extensive morphological analysis
aggressiveNormalization
- Perform aggressive "normalization" on the trees read from the provided corpus documents: split multi-word tokens into their constituent words (and infer parts of speech of the constituent words).
retainNER
- Retain NER information in preterminals (for later use in `MultiWordPreprocessor`) and add NER-specific parents to single-word NE tokens
detailedAnnotations
- Retain detailed tree node annotations. These annotations on parse tree constituents may be useful for e.g. training a parser.
expandElisions
- MWT: expand words like del, al
expandConmigo
- MWT: expand words like conmigo, contigo
public void close()
Description copied from interface: TreeReader
Close the Reader behind this TreeReader.
Specified by:
close in interface TreeReader
close in interface java.io.Closeable
close in interface java.lang.AutoCloseable
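Because `close()` is specified by `java.lang.AutoCloseable` and `readTree()` returns `null` at end of file, a reader of this kind fits a try-with-resources block wrapped around a read-until-null loop. The sketch below shows only that pattern. `SimpleTreeReader` and `ListTreeReader` are hypothetical stand-ins invented so the example compiles without the library on the classpath; trees are represented as plain strings, and nothing here is the real `SpanishXMLTreeReader`.

```java
import java.io.Closeable;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

public class ReadLoopSketch {

    /** Hypothetical stand-in for the TreeReader contract: read until null, then close. */
    interface SimpleTreeReader extends Closeable {
        String readTree();          // returns null at end of input

        @Override
        void close();               // narrowed: this stand-in never throws
    }

    /** Toy reader over an in-memory list (assumption, for illustration only). */
    static class ListTreeReader implements SimpleTreeReader {
        private final Iterator<String> it;
        boolean closed = false;

        ListTreeReader(List<String> trees) {
            this.it = trees.iterator();
        }

        @Override
        public String readTree() {
            return it.hasNext() ? it.next() : null;
        }

        @Override
        public void close() {
            closed = true;
        }
    }

    /** Drains the reader and closes it, even if reading fails part-way. */
    static List<String> readAll(SimpleTreeReader reader) {
        List<String> out = new ArrayList<>();
        try (SimpleTreeReader tr = reader) {
            String tree;
            while ((tree = tr.readTree()) != null) {
                out.add(tree);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        ListTreeReader reader = new ListTreeReader(Arrays.asList("(S (N casa))", "(S (V corre))"));
        for (String tree : readAll(reader)) {
            System.out.println(tree);
        }
        System.out.println("closed: " + reader.closed);
    }
}
```

With the real class, the same loop shape would apply to a reader built from the constructor documented above, with `Tree` in place of `String`.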
public Tree readTree()
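Description copied from interface: TreeReader
Reads a single tree.
Specified by:
readTree in interface TreeReader
Returns:
A single tree, or `null` at end of file.
public static void process(java.io.File file, TreeReader tr, java.util.regex.Pattern posPattern, java.util.regex.Pattern wordPattern, boolean plainPrint) throws java.io.IOException
Read trees from the given file and output their processed forms to standard output.
Throws:
java.io.IOException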
public static void main(java.lang.String[] args)
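To make the `expandElisions` and `expandConmigo` constructor flags more concrete, the following sketch shows the kind of token expansion they refer to, using the standard Spanish decompositions (del = de + el, al = a + el, conmigo = con + mí, contigo = con + ti). This is an illustration of the idea only: `ElisionSketch`, its lookup table, and the exact output forms are assumptions, and the class's actual output may differ.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

public class ElisionSketch {

    // Hypothetical lookup table; the real reader's rules are not shown in this page.
    private static final Map<String, String[]> EXPANSIONS = new HashMap<>();
    static {
        EXPANSIONS.put("del", new String[] {"de", "el"});
        EXPANSIONS.put("al", new String[] {"a", "el"});
        EXPANSIONS.put("conmigo", new String[] {"con", "m\u00ed"});  // "mí"
        EXPANSIONS.put("contigo", new String[] {"con", "ti"});
    }

    /** Replaces each contracted token with its parts; other tokens pass through unchanged. */
    static List<String> expand(List<String> tokens) {
        List<String> out = new ArrayList<>();
        for (String tok : tokens) {
            // Lookup is case-insensitive; expanded parts are emitted in lower case.
            String[] parts = EXPANSIONS.get(tok.toLowerCase(Locale.ROOT));
            if (parts == null) {
                out.add(tok);
            } else {
                out.addAll(Arrays.asList(parts));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(expand(Arrays.asList("voy", "al", "cine")));  // [voy, a, el, cine]
    }
}
```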