edu.stanford.nlp.trees.international.icegb
Class ICEGBLanguagePack

java.lang.Object
  extended by edu.stanford.nlp.trees.AbstractTreebankLanguagePack
      extended by edu.stanford.nlp.trees.international.icegb.ICEGBLanguagePack
All Implemented Interfaces:
TreebankLanguagePack, Serializable

public class ICEGBLanguagePack
extends AbstractTreebankLanguagePack
implements Serializable

This is a language pack from the ICE-GB corpus. UNDER CONSTRUCTION!

Author:
Pi-Chuan Chang
See Also:
Serialized Form

Field Summary
 
Fields inherited from class edu.stanford.nlp.trees.AbstractTreebankLanguagePack
DEFAULT_ENCODING, DEFAULT_GF_CHAR, gfCharacter
 
Constructor Summary
ICEGBLanguagePack()
           
 
Method Summary
 char[] labelAnnotationIntroducingCharacters()
          Return an array of characters at which a String should be truncated to give the basic syntactic category of a label.
 String[] punctuationTags()
          Returns a String array of punctuation tags for the ICE-GB corpus.
 String[] punctuationWords()
          Returns a String array of punctuation words for the ICE-GB corpus.
 String[] sentenceFinalPunctuationTags()
          Returns a String array of sentence final punctuation tags for the ICE-GB corpus.
 String[] sentenceFinalPunctuationWords()
          Returns a String array of sentence final punctuation words for the ICE-GB corpus.
 String[] startSymbols()
          Returns a String array of treebank start symbols.
 String treebankFileExtension()
          Returns the extension of treebank files for this treebank.
 
Methods inherited from class edu.stanford.nlp.trees.AbstractTreebankLanguagePack
basicCategory, categoryAndFunction, evalBIgnoredPunctuationTagAcceptFilter, evalBIgnoredPunctuationTagRejectFilter, evalBIgnoredPunctuationTags, getBasicCategoryFunction, getCategoryAndFunctionFunction, getEncoding, getGfCharacter, getTokenizerFactory, grammaticalStructureFactory, grammaticalStructureFactory, isEvalBIgnoredPunctuationTag, isLabelAnnotationIntroducingCharacter, isPunctuationTag, isPunctuationWord, isSentenceFinalPunctuationTag, isStartSymbol, punctuationTagAcceptFilter, punctuationTagRejectFilter, punctuationWordAcceptFilter, punctuationWordRejectFilter, sentenceFinalPunctuationTagAcceptFilter, setGfCharacter, startSymbol, startSymbolAcceptFilter, stripGF
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

ICEGBLanguagePack

public ICEGBLanguagePack()
Method Detail

startSymbols

public String[] startSymbols()
Returns a String array of treebank start symbols.

Specified by:
startSymbols in interface TreebankLanguagePack
Specified by:
startSymbols in class AbstractTreebankLanguagePack
Returns:
The start symbols

punctuationTags

public String[] punctuationTags()
Returns a String array of punctuation tags for the ICE-GB corpus.

Specified by:
punctuationTags in interface TreebankLanguagePack
Specified by:
punctuationTags in class AbstractTreebankLanguagePack
Returns:
The punctuation tags

punctuationWords

public String[] punctuationWords()
Returns a String array of punctuation words for the ICE-GB corpus.

Specified by:
punctuationWords in interface TreebankLanguagePack
Specified by:
punctuationWords in class AbstractTreebankLanguagePack
Returns:
The punctuation words

sentenceFinalPunctuationTags

public String[] sentenceFinalPunctuationTags()
Returns a String array of sentence final punctuation tags for the ICE-GB corpus.

Specified by:
sentenceFinalPunctuationTags in interface TreebankLanguagePack
Specified by:
sentenceFinalPunctuationTags in class AbstractTreebankLanguagePack
Returns:
The sentence final punctuation tags

sentenceFinalPunctuationWords

public String[] sentenceFinalPunctuationWords()
Returns a String array of sentence final punctuation words for the ICE-GB corpus.

Specified by:
sentenceFinalPunctuationWords in interface TreebankLanguagePack
Returns:
The sentence final punctuation words

labelAnnotationIntroducingCharacters

public char[] labelAnnotationIntroducingCharacters()
Return an array of characters at which a String should be truncated to give the basic syntactic category of a label.

Specified by:
labelAnnotationIntroducingCharacters in interface TreebankLanguagePack
Overrides:
labelAnnotationIntroducingCharacters in class AbstractTreebankLanguagePack
Returns:
An array of characters that set off label name suffixes

treebankFileExtension

public String treebankFileExtension()
Returns the extension of treebank files for this treebank. This is "COR".

Specified by:
treebankFileExtension in interface TreebankLanguagePack
Returns:
the extension on files for this treebank


Stanford NLP Group