|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectedu.stanford.nlp.parser.lexparser.ParentAnnotationStats
public class ParentAnnotationStats
See what parent annotation helps in treebank, based on support and KL divergence.
Field Summary | |
---|---|
static double[] |
CUTOFFS
Minimum support * KL to be included in output and as feature |
static double |
SUPPCUTOFF
Minimum support of parent annotated node for grandparent to be studied. |
Constructor Summary | |
---|---|
ParentAnnotationStats()
|
Method Summary | |
---|---|
static Set<String> |
getEnglishSplitCategories(String treebankRoot)
This is hardwired to calculate the split categories from English Penn Treebank sections 2-21 with a default cutoff of 300 (as used in ACL03PCFG). |
static Set<String> |
getSplitCategories(Treebank t,
boolean doTags,
int algorithm,
double phrasalCutOff,
double tagCutOff,
TreebankLanguagePack tlp)
Call this method to get a String array of categories to split on. |
static Set<String> |
getSplitCategories(Treebank t,
double cutOff,
TreebankLanguagePack tlp)
Call this method to get a String array of categories to split on. |
static List<String> |
kidLabels(Tree t)
|
static void |
main(String[] args)
Calculate parent annotation statistics suitable for doing selective parent splitting in the PCFGParser inside FactoredParser. |
void |
printStats()
|
void |
processTreeHelper(String gP,
String p,
Tree t)
|
void |
visitTree(Tree t)
Does whatever one needs to do to a particular parse tree |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public static final double[] CUTOFFS
public static final double SUPPCUTOFF
Constructor Detail |
---|
public ParentAnnotationStats()
Method Detail |
---|
public void visitTree(Tree t)
visitTree
in interface TreeVisitor
t
- A tree. Classes implementing this interface can assume
that the tree passed in is not null
.public static List<String> kidLabels(Tree t)
public void processTreeHelper(String gP, String p, Tree t)
public void printStats()
public static void main(String[] args)
Usage: java edu.stanford.nlp.parser.lexparser.ParentAnnotationStats [-tags] treebankPath
args
- One argument: path to the Treebankpublic static Set<String> getSplitCategories(Treebank t, double cutOff, TreebankLanguagePack tlp)
If tlp is non-null tlp.basicCategory() will be called on parent and grandparent nodes.
This version just defaults some parameters. Implementation note: This method is not designed for concurrent invocation: it uses static state variables.
public static Set<String> getSplitCategories(Treebank t, boolean doTags, int algorithm, double phrasalCutOff, double tagCutOff, TreebankLanguagePack tlp)
If tlp is non-null tlp.basicCategory() will be called on parent and grandparent nodes.
Implementation note: This method is not designed for concurrent invocation: it uses static state variables.
public static Set<String> getEnglishSplitCategories(String treebankRoot)
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |