|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectedu.stanford.nlp.parser.lexparser.TestOptions
public class TestOptions
Options to the parser which affect performance only at testing (parsing)
time.
The Options class that stores the TestOptions stores the
TestOptions as a transient object. This means that whatever
options get set at creation time are forgotten when the parser is
serialized. If you want an option to be remembered when the parser
is reloaded, put it in either TrainOptions or in Options itself.
Field Summary | |
---|---|
boolean |
addMissingFinalPunctuation
If a token list does not have sentence final punctuation near the end, then automatically add the default one. |
double |
depWeight
Weighting on dependency log probs. |
boolean |
doRecovery
If true, then failure of the PCFG factor to parse a sentence will trigger parse recovery mode. |
boolean |
evalb
Write EvalB-readable output files. |
Properties |
evals
What evaluations to report and how to report them (using LexicalizedParser). |
boolean |
exhaustiveTest
|
int |
fastFactoredCandidateAddend
This variable says to find k good factored parses, how many added on best PCFG parses should be examined. |
int |
fastFactoredCandidateMultiplier
This variable says to find k good fast factored parses, how many times k of the best PCFG parses should be examined. |
boolean |
forceTagBeginnings
|
boolean |
forceTags
Parse using only tags given from correct answer or the POS tagger |
boolean |
increasingLength
Parse trees in test treebank in order of increasing length. |
boolean |
iterativeCKY
If true, use faster iterative deepening CKY algorithm. |
boolean |
lengthNormalization
Turns on normalizing scores for sentence length. |
int |
MAX_ITEMS
The maximum number of edges and hooks combined that the factored parser will build before giving up. |
int |
maxLength
The maximum sentence length (including punctuation, etc.) to parse. |
int |
maxSpanForTags
The largest span to consider for word-hood. |
boolean |
noFunctionalForcing
Only valid with force tags - strips away functionals when forcing the tags, meaning tags have to start appropriately but the parser will assign the functional part. |
boolean |
noRecoveryTagging
If false, then failure of the PCFG parser to parse a sentence will trigger allowing all tags for words in parse recovery mode, with a log probability of -1000. |
String |
outputFilesDirectory
If the writeOutputFiles option is true, then output files appear in this directory. |
String |
outputFilesExtension
If the writeOutputFiles option is true, then output files appear with this extension. |
String |
outputFilesPrefix
If the writeOutputFiles option is true, then output files appear with this prefix. |
String |
outputFormat
Determines format of output trees: choose among penn, oneline |
String |
outputFormatOptions
|
String |
outputkBestEquivocation
If this option is not null, output the k-best equivocation. |
boolean |
pcfgThreshold
If this variable is true, and the sum of the inside and outside score for a constituent is worse than the best known score for a sentence by more than pcfgThresholdValue , then -Inf is returned as the
outside Score by oScore() (while otherwise the true
outside score is returned). |
double |
pcfgThresholdValue
|
boolean |
preTag
Tag the sentences first, then parse given those (coarse) tags. |
boolean |
printAllBestParses
Print out all best PCFG parses. |
int |
printFactoredKGood
Printing k-best parses from PCFG, when k > 0. |
int |
printPCFGkBest
Printing k-best parses from PCFG, when k > 0. |
boolean |
prunePunc
|
boolean |
sample
Used when you want to generate sample parses instead of finding the best parse. |
String |
taggerSerializedFile
POS tagger model used when preTag is enabled. |
double |
unseenSmooth
The amount of smoothing put in (as an m-estimate) for unknown words. |
boolean |
useFastFactored
If true, use approximate factored algorithm, which just rescores PCFG k best, rather than exact factored algorithm. |
boolean |
useLexiconToScoreDependencyPwGt
If this is true, the Lexicon is used to score P(w|t) in the backoff inside the dependency grammar. |
boolean |
useN5
If true, the n^4 "speed-up" is not used with the Factored Parser. |
boolean |
useNonProjectiveDependencyParser
If this is true, perform non-projective dependency parsing. |
boolean |
verbose
Print a lot of extra output as you parse. |
boolean |
writeOutputFiles
If true, write files parsed to a new file with the same name except for an added ".stp" extension. |
Constructor Summary | |
---|---|
TestOptions()
|
Method Summary | |
---|---|
void |
display()
|
TreePrint |
treePrint(TreebankLangParserParams tlpParams)
Determines method for print trees on output. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public boolean noRecoveryTagging
public boolean doRecovery
public boolean useN5
public boolean useFastFactored
public boolean iterativeCKY
public int maxLength
public int MAX_ITEMS
public double unseenSmooth
public boolean increasingLength
public boolean preTag
public boolean forceTags
public boolean forceTagBeginnings
public String taggerSerializedFile
public boolean noFunctionalForcing
public boolean evalb
public boolean verbose
public final boolean exhaustiveTest
public final boolean pcfgThreshold
pcfgThresholdValue
, then -Inf is returned as the
outside Score by oScore()
(while otherwise the true
outside score is returned).
public final double pcfgThresholdValue
public boolean printAllBestParses
public double depWeight
public boolean prunePunc
public boolean addMissingFinalPunctuation
public String outputFormat
public String outputFormatOptions
public boolean writeOutputFiles
public String outputFilesDirectory
null
) means to use
the directory of the source files. Use ""
or .
for the current directory.
public String outputFilesExtension
""
for no extension.
public String outputFilesPrefix
public String outputkBestEquivocation
public int maxSpanForTags
public boolean lengthNormalization
public boolean sample
public int printPCFGkBest
public int printFactoredKGood
public Properties evals
-evals "depDA=false"
) if you don't want
them.
LB = ParseEval labeled bracketing,
CB = crossing brackets and zero crossing bracket rate,
DA = dependency accuracy, TA = tagging accuracy,
LL = log likelihood score,
RUO/CUO = rules/categories under and over proposed,
CatE = evaluation by phrasal category.
Known styles are: runningAverages, summary, tsv.
The default style is summary.
You need to negate it out if you don't want it.
Invalid names in the argument to this option are not reported!
public int fastFactoredCandidateMultiplier
public int fastFactoredCandidateAddend
public boolean useLexiconToScoreDependencyPwGt
public boolean useNonProjectiveDependencyParser
Constructor Detail |
---|
public TestOptions()
Method Detail |
---|
public TreePrint treePrint(TreebankLangParserParams tlpParams)
tlpParams
- The treebank parser params
public void display()
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |