edu.stanford.nlp.trees.international.tuebadz
Class TueBaDZLanguagePack

java.lang.Object
  extended by edu.stanford.nlp.trees.AbstractTreebankLanguagePack
      extended by edu.stanford.nlp.trees.international.tuebadz.TueBaDZLanguagePack
All Implemented Interfaces:
TreebankLanguagePack, Serializable

public class TueBaDZLanguagePack
extends AbstractTreebankLanguagePack

Language pack for the Tuebingen Treebank of Written German (TueBa-D/Z): http://www.sfs.nphil.uni-tuebingen.de/en_tuebadz.shtml

Author:
Roger Levy (rog@stanford.edu)
See Also:
Serialized Form

Field Summary
 
Fields inherited from class edu.stanford.nlp.trees.AbstractTreebankLanguagePack
DEFAULT_ENCODING
 
Constructor Summary
TueBaDZLanguagePack()
           
 
Method Summary
 char[] labelAnnotationIntroducingCharacters()
          Return an array of characters at which a String should be truncated to give the basic syntactic category of a label.
 String[] punctuationTags()
          Returns a String array of punctuation tags for this treebank/language.
 String[] punctuationWords()
          Returns a String array of punctuation words for this treebank/language.
 String[] sentenceFinalPunctuationTags()
          Returns a String array of sentence final punctuation tags for this treebank/language.
 String[] sentenceFinalPunctuationWords()
          Returns a String array of sentence final punctuation words for this treebank/language.
 String[] startSymbols()
          Returns a String array of treebank start symbols.
 String treebankFileExtension()
          Returns the extension of treebank files for this treebank.
 
Methods inherited from class edu.stanford.nlp.trees.AbstractTreebankLanguagePack
basicCategory, categoryAndFunction, evalBIgnoredPunctuationTagAcceptFilter, evalBIgnoredPunctuationTagRejectFilter, evalBIgnoredPunctuationTags, getBasicCategoryFunction, getCategoryAndFunctionFunction, getEncoding, getTokenizerFactory, grammaticalStructureFactory, grammaticalStructureFactory, isEvalBIgnoredPunctuationTag, isLabelAnnotationIntroducingCharacter, isPunctuationTag, isPunctuationWord, isSentenceFinalPunctuationTag, isStartSymbol, punctuationTagAcceptFilter, punctuationTagRejectFilter, punctuationWordAcceptFilter, punctuationWordRejectFilter, sentenceFinalPunctuationTagAcceptFilter, startSymbol, startSymbolAcceptFilter
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

TueBaDZLanguagePack

public TueBaDZLanguagePack()
Method Detail

labelAnnotationIntroducingCharacters

public char[] labelAnnotationIntroducingCharacters()
Return an array of characters at which a String should be truncated to give the basic syntactic category of a label. The idea here is that Penn Treebank style labels follow a syntactic category with various functional and cross-referencing information introduced by special characters (such as "NP-SBJ=1"). This label would be truncated to "NP" by an array containing '-' and '='.

Specified by:
labelAnnotationIntroducingCharacters in interface TreebankLanguagePack
Overrides:
labelAnnotationIntroducingCharacters in class AbstractTreebankLanguagePack
Returns:
An array of characters that set off label name suffixes
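
The truncation described above can be sketched in plain Java. This is a minimal stand-alone illustration, not the library's implementation: the class name LabelTruncationSketch and its basicCategory helper are hypothetical, and skipping index 0 (so that a label beginning with one of these characters is not reduced to an empty string) is an assumption of this sketch.

```java
/** Hypothetical stand-alone sketch; not part of the Stanford NLP API. */
final class LabelTruncationSketch {

    /**
     * Truncates a label at the first annotation-introducing character.
     * Index 0 is skipped (an assumption of this sketch) so that a label
     * starting with such a character does not become the empty string.
     */
    static String basicCategory(String label, char[] introducers) {
        if (label == null) {
            return null;
        }
        for (int i = 1; i < label.length(); i++) {
            char c = label.charAt(i);
            for (char intro : introducers) {
                if (c == intro) {
                    return label.substring(0, i);
                }
            }
        }
        // No introducing character found: the label is already basic.
        return label;
    }

    public static void main(String[] args) {
        char[] introducers = {'-', '='};
        System.out.println(basicCategory("NP-SBJ=1", introducers)); // prints "NP"
    }
}
```

In real code the character array would come from labelAnnotationIntroducingCharacters(), and the inherited basicCategory method would normally be called instead of a hand-written helper.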

punctuationTags

public String[] punctuationTags()
Description copied from class: AbstractTreebankLanguagePack
Returns a String array of punctuation tags for this treebank/language.

Specified by:
punctuationTags in interface TreebankLanguagePack
Specified by:
punctuationTags in class AbstractTreebankLanguagePack
Returns:
The punctuation tags
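
As a rough illustration of how such an array might be consumed, the sketch below tests tag membership in the way an inherited check like isPunctuationTag presumably does. The class PunctuationTagSketch is hypothetical, and the tag values in EXAMPLE_TAGS are placeholders chosen for this example, not the values this language pack actually returns.

```java
import java.util.Arrays;

/** Hypothetical stand-alone sketch; not part of the Stanford NLP API. */
final class PunctuationTagSketch {

    // Placeholder values for illustration only; the real array would
    // come from punctuationTags().
    static final String[] EXAMPLE_TAGS = {"$.", "$,"};

    /** Returns true if the given tag occurs in the supplied tag array. */
    static boolean isPunctuationTag(String tag, String[] punctuationTags) {
        return Arrays.asList(punctuationTags).contains(tag);
    }
}
```

A caller would pass the array returned by punctuationTags() rather than the placeholder constant.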

punctuationWords

public String[] punctuationWords()
Description copied from class: AbstractTreebankLanguagePack
Returns a String array of punctuation words for this treebank/language.

Specified by:
punctuationWords in interface TreebankLanguagePack
Specified by:
punctuationWords in class AbstractTreebankLanguagePack
Returns:
The punctuation words

sentenceFinalPunctuationTags

public String[] sentenceFinalPunctuationTags()
Description copied from class: AbstractTreebankLanguagePack
Returns a String array of sentence final punctuation tags for this treebank/language.

Specified by:
sentenceFinalPunctuationTags in interface TreebankLanguagePack
Specified by:
sentenceFinalPunctuationTags in class AbstractTreebankLanguagePack
Returns:
The sentence final punctuation tags

startSymbols

public String[] startSymbols()
Description copied from class: AbstractTreebankLanguagePack
Returns a String array of treebank start symbols.

Specified by:
startSymbols in interface TreebankLanguagePack
Specified by:
startSymbols in class AbstractTreebankLanguagePack
Returns:
The start symbols

sentenceFinalPunctuationWords

public String[] sentenceFinalPunctuationWords()
Description copied from interface: TreebankLanguagePack
Returns a String array of sentence final punctuation words for this treebank/language.

Returns:
The sentence final punctuation words

treebankFileExtension

public String treebankFileExtension()
Description copied from interface: TreebankLanguagePack
Returns the extension of treebank files for this treebank. This should be passed as an argument to Treebank loading classes. It might be "mrg" or "fid", for example. Do not include the period.
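
Because the extension is returned without the period, a caller that filters file names has to add it. The following is a minimal sketch of that convention; ExtensionFilterSketch and withExtension are hypothetical names, and the file names are made up for the example. It does not show which extension this language pack actually returns.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-alone sketch; not part of the Stanford NLP API. */
final class ExtensionFilterSketch {

    /**
     * Keeps only the file names ending in "." + extension.
     * The extension is expected without a leading period, matching
     * the convention described for treebankFileExtension().
     */
    static List<String> withExtension(List<String> fileNames, String extension) {
        String suffix = "." + extension; // the period is added here, not by the caller
        List<String> matches = new ArrayList<>();
        for (String name : fileNames) {
            if (name.endsWith(suffix)) {
                matches.add(name);
            }
        }
        return matches;
    }
}
```

In practice the second argument would be the value of treebankFileExtension() rather than a literal.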



Stanford NLP Group