|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectedu.stanford.nlp.parser.lexparser.Test
public class Test
Options to the parser which affect performance only at testing (parsing) time.
Nested Class Summary | |
---|---|
static class |
Test.Constraint
A Constraint represents a restriction on possible parse trees to consider. |
Field Summary | |
---|---|
static boolean |
addMissingFinalPunctuation
If a token list does not have sentence final punctuation near the end, then automatically add the default one. |
static List<Test.Constraint> |
constraints
When you want to force the parser to parse a particular subsequence into a particular state. |
static double |
depWeight
Weighting on dependency log probs. |
static boolean |
doRecovery
If true, then failure of the PCFG factor to parse a sentence will trigger parse recovery mode. |
static boolean |
evalb
Write EvalB-readable output files. |
static Properties |
evals
What evaluations to report and how to report them (using LexicalizedParser). |
static boolean |
exhaustiveTest
|
static int |
fastFactoredCandidateAddend
This variable says to find k good factored parses, how many added on best PCFG parses should be examined. |
static int |
fastFactoredCandidateMultiplier
This variable says to find k good fast factored parses, how many times k of the best PCFG parses should be examined. |
static boolean |
forceTagBeginnings
|
static boolean |
forceTags
Parse using only tags given from correct answer or the POS tagger |
static boolean |
increasingLength
Parse trees in test treebank in order of increasing length. |
static boolean |
iterativeCKY
If true, use faster iterative deepening CKY algorithm. |
static boolean |
lengthNormalization
Turns on normalizing scores for sentence length. |
static int |
MAX_ITEMS
The maximum number of edges and hooks combined that the factored parser will build before giving up. |
static int |
maxLength
The maximum sentence length (including punctuation, etc.) to parse. |
static int |
maxSpanForTags
The largest span to consider for word-hood. |
static boolean |
noFunctionalForcing
Only valid with force tags - strips away functionals when forcing the tags, meaning tags have to start appropriately but the parser will assign the functional part. |
static boolean |
noRecoveryTagging
If false, then failure of the PCFG parser to parse a sentence will trigger allowing all tags for words in parse recovery mode, with a log probability of -1000. |
static String |
outputFilesDirectory
If the writeOutputFiles option is true, then output files appear in this directory. |
static String |
outputFilesExtension
If the writeOutputFiles option is true, then output files appear with this extension. |
static String |
outputFilesPrefix
If the writeOutputFiles option is true, then output files appear with this prefix. |
static String |
outputFormat
Determines format of output trees: choose among penn, oneline |
static String |
outputFormatOptions
|
static String |
outputkBestEquivocation
If this option is not null, output the k-best equivocation. |
static boolean |
pcfgThreshold
If this variable is true, and the sum of the inside and outside score for a constituent is worse than the best known score for a sentence by more than pcfgThresholdValue , then -Inf is returned as the
outside Score by oScore() (while otherwise the true
outside score is returned). |
static double |
pcfgThresholdValue
|
static boolean |
preTag
Tag the sentences first, then parse given those (coarse) tags. |
static boolean |
printAllBestParses
Print out all best PCFG parses. |
static int |
printFactoredKGood
Printing k-best parses from PCFG, when k > 0. |
static int |
printPCFGkBest
Printing k-best parses from PCFG, when k > 0. |
static boolean |
prunePunc
|
static boolean |
sample
Used when you want to generate sample parses instead of finding the best parse. |
static String |
taggerSerializedFile
POS tagger model used when preTag is enabled. |
static double |
unseenSmooth
The amount of smoothing put in (as an m-estimate) for unknown words. |
static boolean |
useFastFactored
If true, use approximate factored algorithm, which just rescores PCFG k best, rather than exact factored algorithm. |
static boolean |
useLexiconToScoreDependencyPwGt
If this is true, the Lexicon is used to score P(w|t) in the backoff inside the dependency grammar. |
static boolean |
useN5
If true, the n^4 "speed-up" is not used with the Factored Parser. |
static boolean |
useNonProjectiveDependencyParser
If this is true, perform non-projective dependency parsing. |
static boolean |
verbose
Print a lot of extra output as you parse. |
static boolean |
writeOutputFiles
If true, write files parsed to a new file with the same name except for an added ".stp" extension. |
Method Summary | |
---|---|
static void |
display()
|
static TreePrint |
treePrint(TreebankLangParserParams tlpParams)
Determines method for print trees on output. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public static boolean noRecoveryTagging
public static boolean doRecovery
public static boolean useN5
public static boolean useFastFactored
public static boolean iterativeCKY
public static int maxLength
public static int MAX_ITEMS
public static double unseenSmooth
public static boolean increasingLength
public static boolean preTag
public static boolean forceTags
public static boolean forceTagBeginnings
public static String taggerSerializedFile
public static boolean noFunctionalForcing
public static boolean evalb
public static boolean verbose
public static final boolean exhaustiveTest
public static final boolean pcfgThreshold
pcfgThresholdValue
, then -Inf is returned as the
outside Score by oScore()
(while otherwise the true
outside score is returned).
public static final double pcfgThresholdValue
public static boolean printAllBestParses
public static double depWeight
public static boolean prunePunc
public static boolean addMissingFinalPunctuation
public static String outputFormat
public static String outputFormatOptions
public static boolean writeOutputFiles
public static String outputFilesDirectory
null
) means to use
the directory of the source files. Use ""
or .
for the current directory.
public static String outputFilesExtension
""
for no extension.
public static String outputFilesPrefix
public static String outputkBestEquivocation
public static int maxSpanForTags
public static boolean lengthNormalization
public static List<Test.Constraint> constraints
public static boolean sample
public static int printPCFGkBest
public static int printFactoredKGood
public static Properties evals
-evals "depDA=false"
) if you don't want
them.
LB = ParseEval labeled bracketing,
CB = crossing brackets and zero crossing bracket rate,
DA = dependency accuracy, TA = tagging accuracy,
LL = log likelihood score,
RUO/CUO = rules/categories under and over proposed,
CatE = evaluation by phrasal category.
Known styles are: runningAverages, summary, tsv.
The default style is summary.
You need to negate it out if you don't want it.
Invalid names in the argument to this option are not reported!
public static int fastFactoredCandidateMultiplier
public static int fastFactoredCandidateAddend
public static boolean useLexiconToScoreDependencyPwGt
public static boolean useNonProjectiveDependencyParser
Method Detail |
---|
public static TreePrint treePrint(TreebankLangParserParams tlpParams)
tlpParams
- The treebank parser params
public static void display()
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |