java.lang.Object
  edu.stanford.nlp.parser.lexparser.Train
public class Train
Non-language-specific options for training a grammar from a treebank. These options are not used at parsing time. Because they are all static, it is not currently possible to train multiple parsers in separate threads with different options.
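Since every option is a static field, one way a caller might keep training runs from interfering with each other is to serialize the runs behind a single lock and restore the previous values afterwards. The following is a minimal sketch under stated assumptions, not part of the parser: `TrainStandIn` is a hypothetical stand-in that copies two field names (`PA`, `markovOrder`) from this class so the example compiles on its own, its initial values are arbitrary, and `SerializedTraining.withOptions` is an invented helper.

```java
import java.util.function.Supplier;

// Hypothetical stand-in for edu.stanford.nlp.parser.lexparser.Train, reduced to two
// of its public static fields. The initial values here are arbitrary, not the
// parser's real defaults.
class TrainStandIn {
    static boolean PA = false;
    static int markovOrder = 0;
}

// Hypothetical helper (not part of the parser API): runs one training job at a
// time with the requested static options, then restores the previous values.
class SerializedTraining {
    private static final Object LOCK = new Object();

    static <T> T withOptions(boolean pa, int markovOrder, Supplier<T> trainingRun) {
        synchronized (LOCK) {
            // Remember the current global settings.
            boolean oldPA = TrainStandIn.PA;
            int oldOrder = TrainStandIn.markovOrder;
            TrainStandIn.PA = pa;
            TrainStandIn.markovOrder = markovOrder;
            try {
                // Only code inside this call sees the requested options.
                return trainingRun.get();
            } finally {
                // Restore the settings even if training throws.
                TrainStandIn.PA = oldPA;
                TrainStandIn.markovOrder = oldOrder;
            }
        }
    }
}
```

This only gives mutual exclusion: runs with different options still cannot overlap in time, which is the limitation the class description points out.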
Field Summary

Modifier and Type | Field | Description |
---|---|---|
static boolean | basicCategoryTagsInDependencyGrammar | Whether to use the basic or split tags in the dependency grammar. |
static boolean | cheatPCFG | Add all test set trees to training data for PCFG. |
static boolean | collinsPunc | Promote/delete punctuation like Collins. |
static int | compactGrammar | How to compact grammars as FSMs. |
static java.util.Set<java.lang.String> | deleteSplitters | |
static double | fractionBeforeUnseenCounting | Start to aggregate signature-tag pairs only for words unseen in the first fraction of the data given by this value. |
static boolean | gPA | This variable controls doing 2 levels of parent annotation. |
static int | HSEL_CUT | |
static boolean | hSelSplit | |
static int | leaveItAll | If true, leave all PTB (functional tag) annotations (bad). |
static boolean | leftRec | Left edge is right-recursive (X << X) Bad. |
static boolean | leftToRight | |
static boolean | markFinalStates | Whether or not to mark final states in binarized grammar. |
static boolean | markovFactor | |
static int | markovOrder | |
static int | markUnary | Mark all unary nodes specially. |
static boolean | markUnaryTags | Mark POS tags which are the sole member of their phrasal constituent. |
static boolean | noTagSplit | |
static int | openClassTypesThreshold | A POS tag has to have been attributed to more than this number of word types before it is regarded as an open-class tag. |
static boolean | PA | This variable controls doing parent annotation of phrasal nodes. |
static boolean | postGPA | |
static boolean | postPA | |
static java.util.Set | postSplitters | |
static boolean | postSplitWithBaseCategory | Whether, in post-splitting of categories, nodes are annotated with the (grand)parent's base category or with its complete subcategorized category. |
static java.io.PrintWriter | printAnnotatedPW | |
static boolean | printAnnotatedRuleCounts | |
static boolean | printAnnotatedStateCounts | |
static java.io.PrintWriter | printBinarizedPW | |
static boolean | printStates | |
static int | printTreeTransformations | Just for debugging: check that your tree transforms work correctly. |
static boolean | rightRec | Right edge is right-recursive (X << X) Bad. |
static double | ruleDiscount | Discounts the counts of BinaryRules (only, apparently) in training data. |
static boolean | selectivePostSplit | |
static double | selectivePostSplitCutOff | |
static boolean | selectiveSplit | Only split the "common high KL divergence" parent categories.... |
static double | selectiveSplitCutOff | |
static boolean | sisterAnnotate | Selective Sister annotation. |
static java.util.Set<java.lang.String> | sisterSplitters | |
static boolean | smoothedBound | |
static boolean | smoothing | CHANGE ANYTHING BELOW HERE AT YOUR OWN RISK |
static boolean | splitPrePreT | Mark all pre-preterminals (also does splitBaseNP: don't need both). |
static java.util.Set<java.lang.String> | splitters | Set the splitter strings. |
static boolean | tagPA | Parent annotation on tags. |
static boolean | tagSelectivePostSplit | |
static double | tagSelectivePostSplitCutOff | |
static boolean | tagSelectiveSplit | Do parent annotation on tags selectively. |
static double | tagSelectiveSplitCutOff | |
static java.lang.String | trainTreeFile | |
static boolean | xOverX | X over X is marked (subsumes baseNP marking) Bad. |
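The `openClassTypesThreshold` description amounts to a counting rule: a tag is open-class once it has been seen with more than the threshold number of distinct word types. A minimal sketch of that rule follows. `OpenClassTagSketch` and its input format (an array of word/tag pairs) are assumptions made for illustration; this is not the parser's lexicon code.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical helper illustrating the threshold test described for
// openClassTypesThreshold; the real lexicon implementation may differ.
class OpenClassTagSketch {

    // taggedWords: each element is a two-element array {word, tag}.
    static Set<String> openClassTags(String[][] taggedWords, int openClassTypesThreshold) {
        // Collect the distinct word types observed with each tag.
        Map<String, Set<String>> typesPerTag = new HashMap<>();
        for (String[] wordAndTag : taggedWords) {
            typesPerTag.computeIfAbsent(wordAndTag[1], k -> new HashSet<>()).add(wordAndTag[0]);
        }
        // A tag is open-class only if it has MORE than the threshold number of types.
        Set<String> open = new TreeSet<>();
        for (Map.Entry<String, Set<String>> entry : typesPerTag.entrySet()) {
            if (entry.getValue().size() > openClassTypesThreshold) {
                open.add(entry.getKey());
            }
        }
        return open;
    }
}
```

Note that the comparison is strict, matching the "more than this number" wording: a tag seen with exactly the threshold number of word types is still treated as closed-class.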
Method Summary

Modifier and Type | Method | Description |
---|---|---|
static int | compactGrammar() | |
static void | display() | |
static boolean | outsideFactor() | If true, declare early -- leave this on except maybe with markov on. |
static void | printTrainTree(java.io.PrintWriter pw, java.lang.String message, Tree t) | |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public static java.lang.String trainTreeFile
public static int leaveItAll
public static boolean cheatPCFG
public static boolean markovFactor
public static int markovOrder
public static boolean hSelSplit
public static int HSEL_CUT
public static boolean markFinalStates
public static int openClassTypesThreshold
public static double fractionBeforeUnseenCounting
public static boolean PA
public static boolean gPA
public static boolean postPA
public static boolean postGPA
public static boolean selectiveSplit
public static double selectiveSplitCutOff
public static boolean selectivePostSplit
public static double selectivePostSplitCutOff
public static boolean postSplitWithBaseCategory
public static boolean sisterAnnotate
public static java.util.Set<java.lang.String> sisterSplitters
public static int markUnary
public static boolean markUnaryTags
public static boolean splitPrePreT
public static boolean tagPA
public static boolean tagSelectiveSplit
public static double tagSelectiveSplitCutOff
public static boolean tagSelectivePostSplit
public static double tagSelectivePostSplitCutOff
public static boolean rightRec
public static boolean leftRec
public static boolean xOverX
public static boolean collinsPunc
public static java.util.Set<java.lang.String> splitters
public static java.util.Set postSplitters
public static java.util.Set<java.lang.String> deleteSplitters
public static int printTreeTransformations
public static java.io.PrintWriter printAnnotatedPW
public static java.io.PrintWriter printBinarizedPW
public static boolean printStates
public static int compactGrammar
public static boolean leftToRight
public static boolean noTagSplit
public static boolean smoothing
public static boolean smoothedBound
public static double ruleDiscount
public static boolean printAnnotatedRuleCounts
public static boolean printAnnotatedStateCounts
public static boolean basicCategoryTagsInDependencyGrammar
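For readers unfamiliar with the technique behind `PA`, parent annotation relabels each phrasal node with its parent's category. The sketch below illustrates the general idea only. The `Node` class, the `^` separator, and the choice to leave preterminal (POS tag) nodes untouched (on the assumption that tags are handled separately by `tagPA`) are assumptions for this example, not a description of how the parser implements it.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration of parent annotation on a toy tree type.
class ParentAnnotationSketch {

    static final class Node {
        final String label;
        final List<Node> children = new ArrayList<>();

        Node(String label, Node... kids) {
            this.label = label;
            for (Node kid : kids) {
                children.add(kid);
            }
        }

        boolean isLeaf() { return children.isEmpty(); }

        // A preterminal is a POS tag node: exactly one child, and that child is a word.
        boolean isPreterminal() { return children.size() == 1 && children.get(0).isLeaf(); }

        @Override
        public String toString() {
            if (isLeaf()) return label;
            StringBuilder sb = new StringBuilder("(").append(label);
            for (Node child : children) sb.append(' ').append(child);
            return sb.append(')').toString();
        }
    }

    // Returns an annotated copy; pass null as parentLabel for the root.
    static Node annotate(Node node, String parentLabel) {
        if (node.isLeaf()) return new Node(node.label);
        String newLabel = node.label;
        // Only phrasal nodes with a parent are relabeled; tags and the root are kept.
        if (!node.isPreterminal() && parentLabel != null) {
            newLabel = node.label + "^" + parentLabel;
        }
        Node copy = new Node(newLabel);
        for (Node child : node.children) {
            // Children see the original (unannotated) label of this node.
            copy.children.add(annotate(child, node.label));
        }
        return copy;
    }
}
```

Under these assumptions, annotating `(S (NP (DT the) (NN dog)) (VP (VBZ barks)))` yields `(S (NP^S (DT the) (NN dog)) (VP^S (VBZ barks)))`. Two levels of annotation, as `gPA` describes, would additionally pass the grandparent's label down the recursion.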
Method Detail |
---|
public static boolean outsideFactor()
public static int compactGrammar()
public static void display()
public static void printTrainTree(java.io.PrintWriter pw, java.lang.String message, Tree t)