|
|||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||
java.lang.Objectedu.stanford.nlp.parser.lexparser.ParentAnnotationStats
public class ParentAnnotationStats
See what parent annotation helps in treebank, based on support and KL divergence.
| Field Summary | |
|---|---|
static double[] |
CUTOFFS
Minimum support * KL to be included in output and as feature |
static double |
SUPPCUTOFF
Minimum support of parent annotated node for grandparent to be studied. |
| Method Summary | |
|---|---|
static java.util.Set<java.lang.String> |
getEnglishSplitCategories(java.lang.String treebankRoot)
This is hardwired to calculate the split categories from English Penn Treebank sections 2-21 with a default cutoff of 300 (as used in ACL03PCFG). |
static java.util.Set<java.lang.String> |
getSplitCategories(Treebank t,
boolean doTags,
int algorithm,
double phrasalCutOff,
double tagCutOff,
TreebankLanguagePack tlp)
Call this method to get a String array of categories to split on. |
static java.util.Set<java.lang.String> |
getSplitCategories(Treebank t,
double cutOff,
TreebankLanguagePack tlp)
Call this method to get a String array of categories to split on. |
static java.util.List<java.lang.String> |
kidLabels(Tree t)
|
static void |
main(java.lang.String[] args)
Calculate parent annotation statistics suitable for doing selective parent splitting in the PCFGParser inside FactoredParser. |
void |
printStats()
|
void |
processTreeHelper(java.lang.String gP,
java.lang.String p,
Tree t)
|
void |
visitTree(Tree t)
Does whatever one needs to do to a particular parse tree |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
public static final double[] CUTOFFS
public static final double SUPPCUTOFF
| Method Detail |
|---|
public void visitTree(Tree t)
visitTree in interface TreeVisitort - A tree. Classes implementing this interface can assume
that the tree passed in is not null.public static java.util.List<java.lang.String> kidLabels(Tree t)
public void processTreeHelper(java.lang.String gP,
java.lang.String p,
Tree t)
public void printStats()
public static void main(java.lang.String[] args)
Usage: java edu.stanford.nlp.parser.lexparser.ParentAnnotationStats [-tags] treebankPath
args - One argument: path to the Treebank
public static java.util.Set<java.lang.String> getSplitCategories(Treebank t,
double cutOff,
TreebankLanguagePack tlp)
If tlp is non-null tlp.basicCategory() will be called on parent and grandparent nodes.
This version just defaults some parameters. Implementation note: This method is not designed for concurrent invocation: it uses static state variables.
public static java.util.Set<java.lang.String> getSplitCategories(Treebank t,
boolean doTags,
int algorithm,
double phrasalCutOff,
double tagCutOff,
TreebankLanguagePack tlp)
If tlp is non-null tlp.basicCategory() will be called on parent and grandparent nodes.
Implementation note: This method is not designed for concurrent invocation: it uses static state variables.
public static java.util.Set<java.lang.String> getEnglishSplitCategories(java.lang.String treebankRoot)
|
|||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||