public class PennTreebankLanguagePack extends AbstractTreebankLanguagePack
| Modifier and Type | Field and Description |
|---|---|
| static java.lang.String[] | pennPunctTags |

Inherited fields: DEFAULT_ENCODING, DEFAULT_GF_CHAR, generateOriginalDependencies, gfCharacter

| Constructor and Description |
|---|
| PennTreebankLanguagePack() - Gives a handle to the TreebankLanguagePack. |

| Modifier and Type | Method and Description |
|---|---|
| java.lang.String[] | evalBIgnoredPunctuationTags() - Returns a String array of punctuation tags that EVALB-style evaluation should ignore for this treebank/language. |
| TokenizerFactory<CoreLabel> | getTokenizerFactory() - Returns a factory for PTBTokenizer. |
| GrammaticalStructureFactory | grammaticalStructureFactory() - Returns a GrammaticalStructureFactory suitable for this language/treebank. |
| GrammaticalStructureFactory | grammaticalStructureFactory(java.util.function.Predicate<java.lang.String> puncFilter) - Returns a GrammaticalStructureFactory suitable for this language/treebank. |
| GrammaticalStructureFactory | grammaticalStructureFactory(java.util.function.Predicate<java.lang.String> puncFilter, HeadFinder hf) - Returns a GrammaticalStructureFactory suitable for this language/treebank. |
| HeadFinder | headFinder() - The HeadFinder to use for your treebank. |
| char[] | labelAnnotationIntroducingCharacters() - Returns an array of characters at which a String should be truncated to give the basic syntactic category of a label. |
| static void | main(java.lang.String[] args) - Prints a few aspects of the TreebankLanguagePack, just for debugging. |
| java.lang.String[] | punctuationTags() - Returns a String array of punctuation tags for this treebank/language. |
| java.lang.String[] | punctuationWords() - Returns a String array of punctuation words for this treebank/language. |
| java.lang.String[] | sentenceFinalPunctuationTags() - Returns a String array of sentence-final punctuation tags for this treebank/language. |
| java.lang.String[] | sentenceFinalPunctuationWords() - Returns a String array of sentence-final punctuation words for this treebank/language. |
| java.lang.String[] | startSymbols() - Returns a String array of treebank start symbols. |
| boolean | supportsGrammaticalStructures() - Whether or not we have typed dependencies for this language. |
| java.lang.String | treebankFileExtension() - Returns the extension of treebank files for this treebank. |
| HeadFinder | typedDependencyHeadFinder() - The HeadFinder to use when making typed dependencies. |

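The truncation described for labelAnnotationIntroducingCharacters() can be illustrated without the library. The following is a minimal, self-contained sketch, not this class's implementation: the class name BasicCategorySketch, the helper basicCategory, and the character set {'-', '=', '#'} are assumptions chosen for the example, not the values this language pack returns. The sketch also assumes that a character at position 0 is never treated as a cut point, so a label is never reduced to the empty string; the library's own handling of labels that begin with such a character may differ.

```java
final class BasicCategorySketch {

    // Hypothetical annotation-introducing characters, for illustration only.
    static final char[] ANNOTATION_CHARS = {'-', '=', '#'};

    // True if ch is one of the given annotation-introducing characters.
    static boolean isAnnotationChar(char ch, char[] introducers) {
        for (char c : introducers) {
            if (c == ch) {
                return true;
            }
        }
        return false;
    }

    // Cuts the label at the first annotation-introducing character found
    // after position 0; returns the label unchanged if there is none.
    static String basicCategory(String label, char[] introducers) {
        if (label == null) {
            return null;
        }
        for (int i = 1; i < label.length(); i++) {
            if (isAnnotationChar(label.charAt(i), introducers)) {
                return label.substring(0, i);
            }
        }
        return label;
    }

    public static void main(String[] args) {
        for (String label : new String[] {"NP-SBJ", "NP=2", "VP"}) {
            System.out.println(label + " -> " + basicCategory(label, ANNOTATION_CHARS));
        }
    }
}
```

With the assumed character set, NP-SBJ and NP=2 both reduce to NP, while VP is returned unchanged.
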
Inherited methods: basicCategory, categoryAndFunction, evalBIgnoredPunctuationTagAcceptFilter, evalBIgnoredPunctuationTagRejectFilter, generateOriginalDependencies, getBasicCategoryFunction, getCategoryAndFunctionFunction, getEncoding, getGfCharacter, isEvalBIgnoredPunctuationTag, isLabelAnnotationIntroducingCharacter, isPunctuationTag, isPunctuationWord, isSentenceFinalPunctuationTag, isStartSymbol, morphFeatureSpec, punctuationTagAcceptFilter, punctuationTagRejectFilter, punctuationWordAcceptFilter, punctuationWordRejectFilter, sentenceFinalPunctuationTagAcceptFilter, setGenerateOriginalDependencies, setGfCharacter, startSymbol, startSymbolAcceptFilter, stripGF, treeReaderFactory, treeTokenizerFactory

public PennTreebankLanguagePack()

public java.lang.String[] punctuationTags()
punctuationTags in interface TreebankLanguagePack
punctuationTags in class AbstractTreebankLanguagePack

public java.lang.String[] punctuationWords()
punctuationWords in interface TreebankLanguagePack
punctuationWords in class AbstractTreebankLanguagePack

public java.lang.String[] sentenceFinalPunctuationTags()
sentenceFinalPunctuationTags in interface TreebankLanguagePack
sentenceFinalPunctuationTags in class AbstractTreebankLanguagePack

public java.lang.String[] sentenceFinalPunctuationWords()
sentenceFinalPunctuationWords in interface TreebankLanguagePack
sentenceFinalPunctuationWords in class AbstractTreebankLanguagePack

public java.lang.String[] evalBIgnoredPunctuationTags()
evalBIgnoredPunctuationTags in interface TreebankLanguagePack
evalBIgnoredPunctuationTags in class AbstractTreebankLanguagePack

public char[] labelAnnotationIntroducingCharacters()
labelAnnotationIntroducingCharacters in interface TreebankLanguagePack
labelAnnotationIntroducingCharacters in class AbstractTreebankLanguagePack

public java.lang.String[] startSymbols()
startSymbols in interface TreebankLanguagePack
startSymbols in class AbstractTreebankLanguagePack

public TokenizerFactory<CoreLabel> getTokenizerFactory()
Returns a factory for PTBTokenizer.
getTokenizerFactory in interface TreebankLanguagePack
getTokenizerFactory in class AbstractTreebankLanguagePack

public java.lang.String treebankFileExtension()
treebankFileExtension in interface TreebankLanguagePack
treebankFileExtension in class AbstractTreebankLanguagePack

public GrammaticalStructureFactory grammaticalStructureFactory()
grammaticalStructureFactory in interface TreebankLanguagePack
grammaticalStructureFactory in class AbstractTreebankLanguagePack

public GrammaticalStructureFactory grammaticalStructureFactory(java.util.function.Predicate<java.lang.String> puncFilter)
Note: This is loaded by reflection so basic treebank use does not require all the Stanford Dependencies code. TODO: ?????
grammaticalStructureFactory in interface TreebankLanguagePack
grammaticalStructureFactory in class AbstractTreebankLanguagePack
Parameters:
puncFilter - A filter which should reject punctuation words (as Strings)

public GrammaticalStructureFactory grammaticalStructureFactory(java.util.function.Predicate<java.lang.String> puncFilter, HeadFinder hf)
Description copied from class AbstractTreebankLanguagePack: Return a GrammaticalStructureFactory suitable for this language/treebank.
grammaticalStructureFactory in interface TreebankLanguagePack
grammaticalStructureFactory in class AbstractTreebankLanguagePack
Parameters:
puncFilter - A filter which should reject punctuation words (as Strings)
hf - A HeadFinder which finds heads for typed dependencies

public boolean supportsGrammaticalStructures()
Description copied from interface TreebankLanguagePack: Whether or not we have typed dependencies for this language.
supportsGrammaticalStructures in interface TreebankLanguagePack
supportsGrammaticalStructures in class AbstractTreebankLanguagePack

public HeadFinder headFinder()
headFinder in interface TreebankLanguagePack
headFinder in class AbstractTreebankLanguagePack

public HeadFinder typedDependencyHeadFinder()
typedDependencyHeadFinder in interface TreebankLanguagePack
typedDependencyHeadFinder in class AbstractTreebankLanguagePack

public static void main(java.lang.String[] args)
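
The puncFilter parameter of grammaticalStructureFactory is documented as a java.util.function.Predicate<java.lang.String> that should reject punctuation words. As a rough, library-free sketch of what such a filter could look like: the class name PuncFilterSketch, the helper rejectWords, and the sample word list are hypothetical and are not taken from this class; the actual punctuation words are whatever punctuationWords() returns.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;
import java.util.function.Predicate;

final class PuncFilterSketch {

    // Builds a predicate that returns false ("reject") for every listed
    // punctuation word and true ("accept") for any other string.
    static Predicate<String> rejectWords(String... punctuationWords) {
        Set<String> words = new HashSet<>(Arrays.asList(punctuationWords));
        return token -> !words.contains(token);
    }

    public static void main(String[] args) {
        // Hypothetical punctuation words, for illustration only.
        Predicate<String> puncFilter = rejectWords(".", ",", "?");
        System.out.println("parser accepted: " + puncFilter.test("parser"));
        System.out.println("comma accepted: " + puncFilter.test(","));
    }
}
```

A predicate built this way has the type that the puncFilter parameter expects. The inherited punctuationWordRejectFilter listed above appears, from its name, to serve the same purpose.
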