java.lang.Object
  edu.stanford.nlp.parser.lexparser.Train
public class Train
Non-language-specific options for training a grammar from a treebank.
Field Summary | | |
---|---|---|
static boolean | cheatPCFG | |
static boolean | collinsPunc | Promote/delete punctuation like Collins. |
static int | compactGrammar | |
static Set<String> | deleteSplitters | |
static double | fractionBeforeUnseenCounting | Aggregate signature-tag pairs only for words that were unseen in the initial part of the data; this value is the fraction of the data that makes up that initial part. |
static boolean | gPA | This variable controls doing 2 levels of parent annotation. |
static int | HSEL_CUT | |
static boolean | hSelSplit | |
static boolean | leaveItAll | If true, leave all PTB (functional tag) annotations (bad). |
static boolean | leftRec | Left edge is right-recursive (X << X) Bad. |
static boolean | leftToRight | |
static boolean | markFinalStates | |
static boolean | markovFactor | |
static int | markovOrder | |
static boolean | markUnary | Mark all unary nodes specially. |
static boolean | markUnary2 | |
static boolean | markUnaryTags | |
static boolean | noTagSplit | |
static int | openClassTypesThreshold | A POS tag has to have been attributed to more than this number of word types before it is regarded as an open-class tag. |
static boolean | PA | This variable controls doing parent annotation of phrasal nodes. |
static boolean | postGPA | |
static boolean | postPA | |
static Set | postSplitters | |
static boolean | postSplitWithBaseCategory | Whether, in post-splitting of categories, nodes are annotated with the (grand)parent's base category or with its complete subcategorized category. |
static PrintWriter | printAnnotatedPW | |
static PrintWriter | printBinarizedPW | |
static boolean | printStates | |
static boolean | printTreeTransformations | Just for debugging: check that your tree transforms work right or states |
static boolean | rightRec | Right edge is right-recursive (X << X) Bad. |
static double | ruleDiscount | Discounts the count of BinaryRules (only, apparently) in training data. |
static boolean | selectivePostSplit | |
static double | selectivePostSplitCutOff | |
static boolean | selectiveSplit | Only split the "common high KL divergence" parent categories.... |
static double | selectiveSplitCutOff | |
static boolean | sisterAnnotate | Selective sister annotation. |
static Set | sisterSplitters | |
static boolean | smoothedBound | |
static boolean | smoothing | CHANGE ANYTHING BELOW HERE AT YOUR OWN RISK |
static boolean | splitPrePreT | Mark all pre-preterminals (also does splitBaseNP: don't need both) |
static Set | splitters | Set the splitter strings. |
static boolean | tagPA | Parent annotation on tags. |
static boolean | tagSelectivePostSplit | |
static double | tagSelectivePostSplitCutOff | |
static boolean | tagSelectiveSplit | Do parent annotation on tags selectively. |
static double | tagSelectiveSplitCutOff | |
static boolean | xOverX | X over X is marked (subsumes baseNP marking) Bad. |
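
The PA and gPA fields describe one and two levels of parent annotation of phrasal nodes. The following is a minimal sketch of that idea, not the parser's own code: the `ParentAnnotationSketch` class, its `Node` type, and the `^` and `~` delimiters are assumptions made for illustration, and the two static flags are stand-ins for the real `Train.PA` and `Train.gPA` fields. It also assumes that grandparent annotation is only applied when parent annotation is on.

```java
import java.util.ArrayList;
import java.util.List;

public class ParentAnnotationSketch {
    // Stand-ins for Train.PA and Train.gPA (hypothetical; not the real fields).
    public static boolean PA = true;
    public static boolean gPA = false;

    /** Minimal tree node: a label plus children. A node with no children is a word. */
    public static final class Node {
        public final String label;
        public final List<Node> children = new ArrayList<>();

        public Node(String label, Node... kids) {
            this.label = label;
            for (Node k : kids) {
                children.add(k);
            }
        }

        boolean isLeaf() {
            return children.isEmpty();
        }

        boolean isPreterminal() {
            return children.size() == 1 && children.get(0).isLeaf();
        }
    }

    /**
     * Returns a copy of the tree in which each phrasal node is annotated with
     * its parent's base category and, if gPA is set, its grandparent's too.
     * Words and preterminals (POS tags) are copied unchanged.
     */
    public static Node annotate(Node n, String parent, String grandparent) {
        if (n.isLeaf()) {
            return new Node(n.label);
        }
        String label = n.label;
        boolean phrasal = !n.isPreterminal();
        if (phrasal && PA && parent != null) {
            label += "^" + parent;                 // assumed delimiter
            if (gPA && grandparent != null) {
                label += "~" + grandparent;        // assumed delimiter
            }
        }
        Node out = new Node(label);
        for (Node c : n.children) {
            // Children see the unannotated (base) category of this node.
            out.children.add(annotate(c, n.label, parent));
        }
        return out;
    }

    /** Renders the tree in bracketed form. */
    public static String show(Node n) {
        if (n.isLeaf()) {
            return n.label;
        }
        StringBuilder sb = new StringBuilder("(").append(n.label);
        for (Node c : n.children) {
            sb.append(' ').append(show(c));
        }
        return sb.append(')').toString();
    }

    public static void main(String[] args) {
        Node tree = new Node("ROOT",
            new Node("S",
                new Node("NP", new Node("NN", new Node("dogs"))),
                new Node("VP", new Node("VBP", new Node("bark")))));

        System.out.println(show(annotate(tree, null, null)));
        // prints (ROOT (S^ROOT (NP^S (NN dogs)) (VP^S (VBP bark))))

        gPA = true;
        System.out.println(show(annotate(tree, null, null)));
        // prints (ROOT (S^ROOT (NP^S~ROOT (NN dogs)) (VP^S~ROOT (VBP bark))))
    }
}
```

In this sketch children are annotated with the parent's base category rather than its already annotated label, which keeps the number of distinct states smaller.
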
Method Summary | | |
---|---|---|
static int | compactGrammar() | |
static void | display() | |
static boolean | outsideFactor() | If true, declare early -- leave this on except maybe with markov on. |
static void | printTrainTree(PrintWriter pw, String message, Tree t) | |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public static boolean leaveItAll
public static boolean cheatPCFG
public static boolean markovFactor
public static int markovOrder
public static boolean hSelSplit
public static int HSEL_CUT
public static boolean markFinalStates
public static int openClassTypesThreshold
public static double fractionBeforeUnseenCounting
public static boolean PA
public static boolean gPA
public static boolean postPA
public static boolean postGPA
public static boolean selectiveSplit
public static double selectiveSplitCutOff
public static boolean selectivePostSplit
public static double selectivePostSplitCutOff
public static boolean postSplitWithBaseCategory
public static boolean sisterAnnotate
public static Set sisterSplitters
public static boolean markUnary
public static boolean markUnary2
public static boolean markUnaryTags
public static boolean splitPrePreT
public static boolean tagPA
public static boolean tagSelectiveSplit
public static double tagSelectiveSplitCutOff
public static boolean tagSelectivePostSplit
public static double tagSelectivePostSplitCutOff
public static boolean rightRec
public static boolean leftRec
public static boolean xOverX
public static boolean collinsPunc
public static Set splitters
public static Set postSplitters
public static Set<String> deleteSplitters
public static boolean printTreeTransformations
public static PrintWriter printAnnotatedPW
public static PrintWriter printBinarizedPW
public static boolean printStates
public static int compactGrammar
public static boolean leftToRight
public static boolean noTagSplit
public static boolean smoothing
public static boolean smoothedBound
public static double ruleDiscount
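
The rule stated for openClassTypesThreshold (a POS tag counts as open-class once it has been seen with more than that number of distinct word types) can be sketched as follows. This is an illustration under stated assumptions, not the parser's implementation: the `OpenClassTagSketch` class, the `openClassTags` helper, the word/tag array input format, and the threshold value of 2 are all hypothetical.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

public class OpenClassTagSketch {
    // Stand-in for Train.openClassTypesThreshold; the value is illustrative only.
    public static int openClassTypesThreshold = 2;

    /**
     * Returns the tags that were seen with MORE than openClassTypesThreshold
     * distinct word types. Each input row is a {word, tag} pair.
     */
    public static Set<String> openClassTags(String[][] taggedWords) {
        Map<String, Set<String>> typesByTag = new HashMap<>();
        for (String[] wt : taggedWords) {
            // Count word types, not tokens: repeated words are stored once per tag.
            typesByTag.computeIfAbsent(wt[1], k -> new HashSet<>()).add(wt[0]);
        }
        Set<String> open = new TreeSet<>();
        for (Map.Entry<String, Set<String>> e : typesByTag.entrySet()) {
            if (e.getValue().size() > openClassTypesThreshold) {
                open.add(e.getKey());
            }
        }
        return open;
    }

    public static void main(String[] args) {
        String[][] data = {
            {"dogs", "NNS"}, {"cats", "NNS"}, {"birds", "NNS"}, {"dogs", "NNS"},
            {"the", "DT"}, {"a", "DT"}, {"the", "DT"}
        };
        // NNS has 3 word types (> 2); DT has only 2 (not > 2).
        System.out.println(openClassTags(data)); // prints [NNS]
    }
}
```

The comparison is strict, matching the "more than this number" wording: a tag seen with exactly the threshold number of word types stays closed-class.
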
Method Detail |
---|
public static boolean outsideFactor()
public static int compactGrammar()
public static void display()
public static void printTrainTree(PrintWriter pw, String message, Tree t)