edu.stanford.nlp.trees
Class PennTreeReader

java.lang.Object
  extended by edu.stanford.nlp.trees.PennTreeReader
All Implemented Interfaces:
TreeReader

public class PennTreeReader
extends Object
implements TreeReader

A PennTreeReader is a TreeReader that reads in Penn Treebank-style files. Example usage:
TreeReader tr = new PennTreeReader(new BufferedReader(new InputStreamReader(new FileInputStream(file), "UTF-8")),
                                   myTreeFactory);

Author:
Christopher Manning (mod. Roger Levy 2003/01)

Constructor Summary
PennTreeReader(Reader in)
          Read parse trees from a Reader.
PennTreeReader(Reader in, StreamTokenizer st)
          Read parse trees from a Reader.
PennTreeReader(Reader in, TreeFactory tf)
          Read parse trees from a Reader.
PennTreeReader(Reader in, TreeFactory tf, TreeNormalizer tn)
          Read parse trees from a Reader.
PennTreeReader(Reader in, TreeFactory tf, TreeNormalizer tn, StreamTokenizer st)
          Read parse trees from a Reader.
 
Method Summary
 void close()
          Close the Reader behind this TreeReader.
static void main(String[] args)
          Loads treebank data from first argument and prints it.
 Tree readTree()
          Reads a single tree in standard Penn Treebank format, with or without an additional set of parens around it (an unnamed ROOT node).
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

PennTreeReader

public PennTreeReader(Reader in)
Read parse trees from a Reader. For the defaulted arguments, you get a SimpleTreeFactory, no TreeNormalizer, and a PennTreebankStreamTokenizer.

Parameters:
in - The Reader

PennTreeReader

public PennTreeReader(Reader in,
                      TreeFactory tf)
Read parse trees from a Reader.

Parameters:
in - the Reader
tf - the TreeFactory used to create the trees; it determines which kind of Tree is returned

PennTreeReader

public PennTreeReader(Reader in,
                      StreamTokenizer st)
Read parse trees from a Reader.

Parameters:
in - the Reader
st - the StreamTokenizer

PennTreeReader

public PennTreeReader(Reader in,
                      TreeFactory tf,
                      TreeNormalizer tn)
Read parse trees from a Reader.

Parameters:
in - the Reader
tf - the TreeFactory used to create the trees; it determines which kind of Tree is returned
tn - the method of normalizing trees

PennTreeReader

public PennTreeReader(Reader in,
                      TreeFactory tf,
                      TreeNormalizer tn,
                      StreamTokenizer st)
Read parse trees from a Reader.

Parameters:
in - the Reader
tf - the TreeFactory used to create the trees; it determines which kind of Tree is returned
tn - the method of normalizing trees
st - edu.stanford.nlp.io.StreamTokenizer that divides up the input from the Reader

Method Detail

readTree

public Tree readTree()
              throws IOException
Reads a single tree in standard Penn Treebank format, with or without an additional set of parens around it (an unnamed ROOT node).

Specified by:
readTree in interface TreeReader
Returns:
A single tree, or null at end of file.
Throws:
IOException
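
The bracketed format that readTree consumes can be illustrated with a small standalone sketch. This is not the PennTreeReader implementation and uses none of the edu.stanford.nlp classes: BracketTreeSketch, Node, and readAll are hypothetical names, the tokenizer is a plain java.io.StreamTokenizer, and labelling the unlabelled outer bracket "ROOT" is an assumption made only for this example. It shows a reader that accepts trees with or without the extra set of parens and returns null at end of input.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StreamTokenizer;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;

/** Illustrative only: a tiny bracketed-tree reader, not the PennTreeReader code. */
class BracketTreeSketch {

    /** Hypothetical minimal tree node: a label plus ordered children. */
    static final class Node {
        final String label;
        final List<Node> children = new ArrayList<>();

        Node(String label) { this.label = label; }

        @Override
        public String toString() {
            if (children.isEmpty()) return label;
            StringBuilder sb = new StringBuilder("(").append(label);
            for (Node c : children) sb.append(' ').append(c);
            return sb.append(')').toString();
        }
    }

    private final StreamTokenizer st;

    BracketTreeSketch(Reader in) {
        st = new StreamTokenizer(in);
        st.resetSyntax();            // drop default number, quote and comment handling
        st.wordChars(33, 255);       // every printable character is part of a word...
        st.whitespaceChars(0, ' ');  // ...whitespace separates tokens...
        st.ordinaryChar('(');        // ...and each paren is a token of its own
        st.ordinaryChar(')');
    }

    /** Returns the next tree, or null at end of input. */
    Node readTree() throws IOException {
        int t = st.nextToken();
        if (t == StreamTokenizer.TT_EOF) return null;
        if (t != '(') throw new IOException("Expected '(' at start of tree");
        return readNode();
    }

    /** Reads one node; the opening paren has already been consumed. */
    private Node readNode() throws IOException {
        Node node;
        if (st.nextToken() == StreamTokenizer.TT_WORD) {
            node = new Node(st.sval);
        } else {
            node = new Node("ROOT"); // unlabelled bracket (assumed label for this sketch)
            st.pushBack();           // re-read the token as the first child or the close
        }
        while (true) {
            int t = st.nextToken();
            if (t == '(') {
                node.children.add(readNode());
            } else if (t == StreamTokenizer.TT_WORD) {
                node.children.add(new Node(st.sval)); // a leaf, i.e. a word
            } else if (t == ')') {
                return node;
            } else {
                throw new IOException("Unbalanced parentheses");
            }
        }
    }

    /** Reads every tree in the text and returns each one's bracketed form. */
    static List<String> readAll(String text) {
        List<String> out = new ArrayList<>();
        try {
            BracketTreeSketch reader = new BracketTreeSketch(new StringReader(text));
            for (Node n = reader.readTree(); n != null; n = reader.readTree()) {
                out.add(n.toString());
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return out;
    }

    public static void main(String[] args) {
        String text = "( (S (NP (DT the) (NN cat)) (VP (VBD sat))) )\n"
                    + "(S (NP (PRP it)) (VP (VBD ran)))";
        for (String tree : readAll(text)) System.out.println(tree);
    }
}
```

The read loop in readAll mirrors the contract documented above: keep calling readTree until it returns null.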

close

public void close()
           throws IOException
Close the Reader behind this TreeReader.

Specified by:
close in interface TreeReader
Throws:
IOException

main

public static void main(String[] args)
Loads treebank data from first argument and prints it.

Parameters:
args - array of command-line arguments; the first specifies a filename


Stanford NLP Group