|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
public interface TreebankLangParserParams
Contains language-specific methods necessary to get the parser to parse an arbitrary treebank.
Method Summary | |
---|---|
TreeTransformer |
collinizer()
The tree transformer applied to trees prior to evaluation. |
TreeTransformer |
collinizerEvalb()
the tree transformer used to produce trees for evaluation. |
java.util.List<? extends HasWord> |
defaultTestSentence()
Return a default sentence of the language (for testing). |
Extractor<DependencyGrammar> |
dependencyGrammarExtractor(Options op)
|
DiskTreebank |
diskTreebank()
returns a DiskTreebank appropriate to the treebank source |
void |
display()
display language-specific settings |
java.lang.String |
getInputEncoding()
Returns the input encoding being used. |
java.lang.String |
getOutputEncoding()
Returns the output encoding being used. |
HeadFinder |
headFinder()
|
Lexicon |
lex(Options.LexOptions op)
Vends a Lexicon object suitable to the particular language/treebank combination of interest. |
MemoryTreebank |
memoryTreebank()
returns a MemoryTreebank appropriate to the treebank source |
double[] |
MLEDependencyGrammarSmoothingParams()
Give the parameters for smoothing in the MLEDependencyGrammar. |
AbstractEval |
ppAttachmentEval()
Returns a language specific object for evaluating PP attachment |
Label |
processHeadWord(Label headWord)
Allows language specific processing (e.g., stemming) of head words. |
java.io.PrintWriter |
pw()
returns a PrintWriter used to print output. |
java.io.PrintWriter |
pw(java.io.OutputStream o)
returns a PrintWriter used to print output to the OutputStream o. |
void |
setEvaluateGrammaticalFunctions(boolean evalGFs)
If evalGFs = true, then the evaluation of parse trees will include evaluation on grammatical functions. |
void |
setInputEncoding(java.lang.String encoding)
|
int |
setOptionFlag(java.lang.String[] args,
int i)
Set a language-specific option according to command-line flags. |
void |
setOutputEncoding(java.lang.String encoding)
|
void |
setupForEval()
Convenience method for setting state parameters specific to evaluation. |
java.lang.String[] |
sisterSplitters()
Returns the splitting strings used for selective splits. |
TreeTransformer |
subcategoryStripper()
Returns a TreeTransformer appropriate to the Treebank which can be used to remove functional tags (such as "-TMP") from categories. |
MemoryTreebank |
testMemoryTreebank()
returns a MemoryTreebank appropriate to the testing treebank source |
Tree |
transformTree(Tree t,
Tree root)
This method does language-specific tree transformations such as annotating particular nodes with language-relevant features. |
Treebank |
treebank()
Required to extend TreebankFactory |
TreebankLanguagePack |
treebankLanguagePack()
returns a TreebankLanguagePack containing Treebank-specific (but not parser-specific) info such as what is punctuation, and also information about the structure of labels |
TreeReaderFactory |
treeReaderFactory()
Returns a factory for reading in trees from the source you want. |
TokenizerFactory<Tree> |
treeTokenizerFactory()
|
Method Detail |
---|
HeadFinder headFinder()
Label processHeadWord(Label headWord)
headWord
- An Label
that minimally implements the
HasWord
and HasTag
interfaces.
Label
void setupForEval()
void setInputEncoding(java.lang.String encoding)
void setOutputEncoding(java.lang.String encoding)
void setEvaluateGrammaticalFunctions(boolean evalGFs)
java.lang.String getOutputEncoding()
java.lang.String getInputEncoding()
TreeReaderFactory treeReaderFactory()
Lexicon lex(Options.LexOptions op)
Lexicon
object suitable to the particular language/treebank combination of interest.
op
- Options as to how the Lexicon behaves
TreeTransformer collinizer()
TreeTransformer collinizerEvalb()
MemoryTreebank memoryTreebank()
DiskTreebank diskTreebank()
MemoryTreebank testMemoryTreebank()
Treebank treebank()
treebank
in interface TreebankFactory
TreebankLanguagePack treebankLanguagePack()
java.io.PrintWriter pw()
java.io.PrintWriter pw(java.io.OutputStream o)
java.lang.String[] sisterSplitters()
TreeTransformer subcategoryStripper()
Tree transformTree(Tree t, Tree root)
t
. It changes both
labels and the tree shape.
t
- The input tree (with non-language specific annotation already
done, so you need to strip back to basic categories)root
- The root of the current tree (can be null for words)
void display()
int setOptionFlag(java.lang.String[] args, int i)
args
- Array of command line argumentsi
- Index in command line arguments to try to process as an option
java.util.List<? extends HasWord> defaultTestSentence()
TokenizerFactory<Tree> treeTokenizerFactory()
Extractor<DependencyGrammar> dependencyGrammarExtractor(Options op)
double[] MLEDependencyGrammarSmoothingParams()
AbstractEval ppAttachmentEval()
AbstractEval
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |