|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectedu.stanford.nlp.trees.TreeGraph
edu.stanford.nlp.trees.GrammaticalStructure
edu.stanford.nlp.trees.EnglishGrammaticalStructure
public class EnglishGrammaticalStructure
A GrammaticalStructure for English.
The Stanford parser should be run with the "-retainNPTmpSubcategories" option! Caveat emptor! This is a work in progress. Suggestions welcome.
Field Summary | |
---|---|
static int |
CoNLLX_FieldCount
|
static int |
CoNLLX_GovField
|
static int |
CoNLLX_POSField
|
static int |
CoNLLX_RelnField
|
static int |
CoNLLX_WordField
|
static String |
DEFAULT_PARSER_FILE
|
static Map<String,GrammaticalRelation> |
shortNameToGRel
|
Fields inherited from class edu.stanford.nlp.trees.GrammaticalStructure |
---|
allTypedDependencies, dependencies, typedDependencies |
Fields inherited from class edu.stanford.nlp.trees.TreeGraph |
---|
root |
Constructor Summary | |
---|---|
EnglishGrammaticalStructure(List<TypedDependency> projectiveDependencies,
TreeGraphNode root)
Used for postprocessing CoNLL X dependencies |
|
EnglishGrammaticalStructure(Tree t)
Construct a new GrammaticalStructure from an existing parse
tree. |
|
EnglishGrammaticalStructure(Tree t,
Filter<String> puncFilter)
This gets used by GrammaticalStructureFactory (by reflection). |
|
EnglishGrammaticalStructure(Tree t,
Filter<String> puncFilter,
HeadFinder hf)
This gets used by GrammaticalStructureFactory (by reflection). |
|
EnglishGrammaticalStructure(Tree t,
Filter<String> puncFilter,
HeadFinder hf,
boolean threadSafe)
Construct a new GrammaticalStructure from an existing parse
tree. |
Method Summary | |
---|---|
static EnglishGrammaticalStructure |
buildCoNNLXGrammaticStructure(List<List<String>> tokenFields)
|
protected void |
collapseDependencies(List<TypedDependency> list,
boolean CCprocess)
Destructively modifies this Collection<TypedDependency>
by collapsing several types of transitive pairs of dependencies. |
protected void |
collapseDependenciesTree(List<TypedDependency> list)
Destructively modifies this Collection<TypedDependency>
by collapsing several types of transitive pairs of dependencies, but
keeping the tree structure. |
protected static GrammaticalRelation |
conjValue(String conj)
Does some hard coding to deal with relation in CONJP. |
protected void |
correctDependencies(Collection<TypedDependency> list)
Destructively modify the TypedDependencyGraph to correct
language-dependent dependencies. |
static String |
dependenciesToString(GrammaticalStructure gs,
Collection<TypedDependency> deps,
Tree tree,
boolean conllx,
boolean extraSep)
|
static TreeGraphNode |
getSubject(TreeGraphNode t)
Tries to return a node representing the SUBJECT (whether
nominal or clausal) of the given node t . |
static void |
main(String[] args)
Given sentences or trees, output the typed dependencies. |
static void |
printDependencies(GrammaticalStructure gs,
Collection<TypedDependency> deps,
Tree tree,
boolean conllx,
boolean extraSep)
Print typed dependencies in either the Stanford dependency representation or in the conllx format. |
static List<GrammaticalStructure> |
readCoNLLXGrammaticStructureCollection(String fileName)
Read in a file containing a CoNLL-X dependency treebank and return a corresponding list of GrammaticalStructures. |
Methods inherited from class edu.stanford.nlp.trees.TreeGraph |
---|
addNodeToIndexMap, getNodeByIndex, getNodes, root, toString |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
Field Detail |
---|
public static final String DEFAULT_PARSER_FILE
public static final int CoNLLX_WordField
public static final int CoNLLX_POSField
public static final int CoNLLX_GovField
public static final int CoNLLX_RelnField
public static final int CoNLLX_FieldCount
public static final Map<String,GrammaticalRelation> shortNameToGRel
Constructor Detail |
---|
public EnglishGrammaticalStructure(Tree t)
GrammaticalStructure
from an existing parse
tree. The new GrammaticalStructure
has the same tree structure
and label values as the given tree (but no shared storage). As part of
construction, the parse tree is analyzed using definitions from
GrammaticalRelation
to populate
the new GrammaticalStructure
with as many labeled grammatical
relations as it can.
t
- Parse tree to make grammatical structure frompublic EnglishGrammaticalStructure(Tree t, Filter<String> puncFilter)
t
- Parse tree to make grammatical structure frompuncFilter
- Filter to remove punctuation dependenciespublic EnglishGrammaticalStructure(Tree t, Filter<String> puncFilter, HeadFinder hf)
t
- Parse tree to make grammatical structure frompuncFilter
- Filter to remove punctuation dependencieshf
- HeadFinder to use when building itpublic EnglishGrammaticalStructure(Tree t, Filter<String> puncFilter, HeadFinder hf, boolean threadSafe)
GrammaticalStructure
from an existing parse
tree. The new GrammaticalStructure
has the same tree structure
and label values as the given tree (but no shared storage). As part of
construction, the parse tree is analyzed using definitions from
GrammaticalRelation
to populate
the new GrammaticalStructure
with as many labeled grammatical
relations as it can.
t
- Parse tree to make grammatical structure frompuncFilter
- Filter for punctuation wordshf
- HeadFinder to use when building itthreadSafe
- Whether or not to support simultaneous instances among multiple
threadspublic EnglishGrammaticalStructure(List<TypedDependency> projectiveDependencies, TreeGraphNode root)
Method Detail |
---|
public static TreeGraphNode getSubject(TreeGraphNode t)
SUBJECT
(whether
nominal or clausal) of the given node t
. Probably, node
t
should represent a clause or verb phrase.
t
- a node in this GrammaticalStructure
t
, or else
null
protected void correctDependencies(Collection<TypedDependency> list)
GrammaticalStructure
TypedDependencyGraph
to correct
language-dependent dependencies. (e.g., nsubjpass in a relative clause)
Default is no-op; to be over-ridden in subclasses.
correctDependencies
in class GrammaticalStructure
protected void collapseDependencies(List<TypedDependency> list, boolean CCprocess)
Collection<TypedDependency>
by collapsing several types of transitive pairs of dependencies.
prep(cat, in)
and pobj(in, hat)
are collapsed to
prep_in(cat, hat)
prep(heard, of)
and pcomp(of, attacking)
are
collapsed to prepc_of(heard, attacking)
cc(investors, and)
and
conj(investors, regulators)
are collapsed to
conj_and(investors,regulators)
possessive(Montezuma, 's)
will be erased. This is like a collapsing, but
due to the flatness of NPs, two dependencies are not actually composed.ref(man, that)
and dobj(love, that)
are collapsed
to dobj(love, man)
collapseDependencies
in class GrammaticalStructure
list
- A list of dependencies to process for possible collapsingCCprocess
- apply CC process?protected void collapseDependenciesTree(List<TypedDependency> list)
Collection<TypedDependency>
by collapsing several types of transitive pairs of dependencies, but
keeping the tree structure.
prep(cat, in)
and pobj(in, hat)
are collapsed to
prep_in(cat, hat)
prep(heard, of)
and pcomp(of, attacking)
are
collapsed to prepc_of(heard, attacking)
cc(investors, and)
and
conj(investors, regulators)
are collapsed to
conj_and(investors,regulators)
possessive(Montezuma, 's)
will be erased. This is like a collapsing, but
due to the flatness of NPs, two dependencies are not actually composed.
collapseDependenciesTree
in class GrammaticalStructure
list
- A list of dependencies to process for possible collapsingprotected static GrammaticalRelation conjValue(String conj)
conj
- The head dependency of the conjunction marker
public static void printDependencies(GrammaticalStructure gs, Collection<TypedDependency> deps, Tree tree, boolean conllx, boolean extraSep)
deps
- Typed dependencies to printtree
- Tree corresponding to typed dependencies (only necessary if conllx
== true)conllx
- If true use conllx format, otherwise use Stanford representationextraSep
- If true, in the Stanford representation, the extra dependencies
(which do not preserve the tree structure) are printed after the
basic dependenciespublic static String dependenciesToString(GrammaticalStructure gs, Collection<TypedDependency> deps, Tree tree, boolean conllx, boolean extraSep)
public static List<GrammaticalStructure> readCoNLLXGrammaticStructureCollection(String fileName) throws IOException
IOException
public static EnglishGrammaticalStructure buildCoNNLXGrammaticStructure(List<List<String>> tokenFields)
public static void main(String[] args)
By default, the method outputs the collapsed typed dependencies with processing of conjuncts. The input can be given as plain text (one sentence by line) using the option -sentFile, or as trees using the option -treeFile. For -sentFile, the input has to be strictly one sentence per line. You can specify where to find a parser with -parserFile serializedParserPath. See LexicalizedParser for more flexible processing of text files (including with Stanford Dependencies output). The above options assume a file as input. You can also feed trees (only) via stdin by using the option -filter.
The following options can be used to specify the types of dependencies
wanted:
-collapsed collapsed dependencies
-basic non-collapsed dependencies that preserve a tree structure
-nonCollapsed non-collapsed dependencies that do not preserve a tree
structure (the basic dependencies plus the extra ones) -CCprocessed
collapsed dependencies and conjunctions processed (dependencies are added
for each conjunct) -- this is the default if no option are passed
-collapsedTree collapsed dependencies retaining a tree structure
-makeCopulaHead Contrary to the approach argued for in the SD papers,
nevertheless make the verb 'to be' the head, not the predicate noun, adjective,
etc.
java edu.stanford.nlp.trees.EnglishGrammaticalStructure [-treeFile FILE | -sentFile FILE | -conllxFile FILE | -filter]
[-collapsed -basic -CCprocessed -test]
args
- Command-line arguments, as above
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |