edu.stanford.nlp.parser.lexparser
Class AbstractTreebankParserParams

java.lang.Object
  extended byedu.stanford.nlp.parser.lexparser.AbstractTreebankParserParams
All Implemented Interfaces:
Serializable, TreebankLangParserParams
Direct Known Subclasses:
ChineseTreebankParserParams, NegraPennTreebankParserParams

public abstract class AbstractTreebankParserParams
extends Object
implements TreebankLangParserParams

An abstract class providing a common method base from which to complete a TreebankLangParserParams implementing class.

With some extending classes you'll want to have access to special attributes of the corresponding TreebankLanguagePack while taking advantage of this class's code for making the TreebankLanguagePack accessible. A good way to do this is to pass a new instance of the appropriate TreebankLanguagePack into this class's constructor, then get it back later on by casting a call to treebankLanguagePack(). See ChineseTreebankParserParams for an example.

Author:
Roger Levy
See Also:
Serialized Form

Field Summary
protected  String inputEncoding
           
protected  String outputEncoding
           
protected  TreebankLanguagePack tlp
           
 
Constructor Summary
protected AbstractTreebankParserParams(TreebankLanguagePack tlp)
          Stores the passed-in TreebankLanguagePack.
 
Method Summary
abstract  TreeTransformer collinizer()
          the tree transformer used to produce trees for evaluation.
abstract  TreeTransformer collinizerEvalb()
          the tree transformer used to produce trees for evaluation.
abstract  void display()
          display language-specific settings
abstract  HeadFinder headFinder()
          the HeadFinder to use for your treebank.
abstract  Lexicon lex()
          reads in trees from the source you want.
abstract  MemoryTreebank memoryTreebank()
          returns a MemoryTreebank appropriate to the treebank source
 PrintWriter pw()
          the PrintWriter used to print output.
 PrintWriter pw(OutputStream o)
          the PrintWriter used to print output.
 void setInputEncoding(String encoding)
          sets the input encoding
abstract  int setOptionFlag(String[] args, int i)
          Set language-specific options according to flags.
 void setOutputEncoding(String encoding)
          sets the output encoding
abstract  String[] sisterSplitters()
          Returns the splitting strings used for selective splits.
abstract  String[] splitters()
          Returns the splitting strings used for selective splits.
 MemoryTreebank testMemoryTreebank()
          You can often return the same thing for testMemoryTreebank as for memoryTreebank
abstract  edu.stanford.nlp.parser.lexparser.TreeHeadPair transformTree(Tree t, Tree root, edu.stanford.nlp.parser.lexparser.TreeHeadPair thp)
          transformTree does language-specific tree transformations such as splicing.
 TreebankLanguagePack treebankLanguagePack()
          Returns an appropriate treebankLanguagePack
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

inputEncoding

protected String inputEncoding

outputEncoding

protected String outputEncoding

tlp

protected TreebankLanguagePack tlp
Constructor Detail

AbstractTreebankParserParams

protected AbstractTreebankParserParams(TreebankLanguagePack tlp)
Stores the passed-in TreebankLanguagePack.

Method Detail

setInputEncoding

public void setInputEncoding(String encoding)
sets the input encoding

Specified by:
setInputEncoding in interface TreebankLangParserParams

setOutputEncoding

public void setOutputEncoding(String encoding)
sets the output encoding

Specified by:
setOutputEncoding in interface TreebankLangParserParams

memoryTreebank

public abstract MemoryTreebank memoryTreebank()
returns a MemoryTreebank appropriate to the treebank source

Specified by:
memoryTreebank in interface TreebankLangParserParams

testMemoryTreebank

public MemoryTreebank testMemoryTreebank()
You can often return the same thing for testMemoryTreebank as for memoryTreebank

Specified by:
testMemoryTreebank in interface TreebankLangParserParams

pw

public PrintWriter pw()
the PrintWriter used to print output. It's the responsibility of pw to deal properly with character encodings for the relevant treebank

Specified by:
pw in interface TreebankLangParserParams

pw

public PrintWriter pw(OutputStream o)
the PrintWriter used to print output. It's the responsibility of pw to deal properly with character encodings for the relevant treebank

Specified by:
pw in interface TreebankLangParserParams

treebankLanguagePack

public TreebankLanguagePack treebankLanguagePack()
Returns an appropriate treebankLanguagePack

Specified by:
treebankLanguagePack in interface TreebankLangParserParams

headFinder

public abstract HeadFinder headFinder()
the HeadFinder to use for your treebank.

Specified by:
headFinder in interface TreebankLangParserParams

lex

public abstract Lexicon lex()
Description copied from interface: TreebankLangParserParams
reads in trees from the source you want. It's the responsibility of tr to deal properly with character-set encoding of the input. It also is the responsibility of tr to properly normalize trees

Specified by:
lex in interface TreebankLangParserParams

collinizer

public abstract TreeTransformer collinizer()
the tree transformer used to produce trees for evaluation. Will be applied both to the parse output tree and to the gold tree. Should strip punctuation and maybe do some other things.

Specified by:
collinizer in interface TreebankLangParserParams

collinizerEvalb

public abstract TreeTransformer collinizerEvalb()
the tree transformer used to produce trees for evaluation. Will be applied both to the parse output tree and to the gold tree. Should strip punctuation and maybe do some other things. The evalb version should strip some more stuff off. (finish this doc!)

Specified by:
collinizerEvalb in interface TreebankLangParserParams

splitters

public abstract String[] splitters()
Returns the splitting strings used for selective splits.

Specified by:
splitters in interface TreebankLangParserParams
Returns:
An array containing ancestor-annotated Strings: categories should be split according to these ancestor annotations.

sisterSplitters

public abstract String[] sisterSplitters()
Returns the splitting strings used for selective splits.

Specified by:
sisterSplitters in interface TreebankLangParserParams
Returns:
An array containing ancestor-annotated Strings: categories should be split according to these ancestor annotations.

transformTree

public abstract edu.stanford.nlp.parser.lexparser.TreeHeadPair transformTree(Tree t,
                                                                             Tree root,
                                                                             edu.stanford.nlp.parser.lexparser.TreeHeadPair thp)
transformTree does language-specific tree transformations such as splicing. Any parameterizations should be inside the specific TreebankLangParserParams class

Specified by:
transformTree in interface TreebankLangParserParams

display

public abstract void display()
display language-specific settings

Specified by:
display in interface TreebankLangParserParams

setOptionFlag

public abstract int setOptionFlag(String[] args,
                                  int i)
Set language-specific options according to flags. This routine should process the option starting in args[i] (which might potentially be several arguments long if it takes arguments). It should return the index after the last index it consumed in processing. In particular, if it cannot process the current option, the return value should be i.

Specified by:
setOptionFlag in interface TreebankLangParserParams


Stanford NLP Group