public class TokensRegexAnnotator extends java.lang.Object implements Annotator
Configuration:
rules
- Name of file containing extraction rules
(see CoreMapExpressionExtractor
and SequenceMatchRules
options.xxx = ...
)
setTokenOffsets
- whether to explicit set the token offsets of individual tokens (needed to token sequence matches to work)extractWithTokens
- whether to return unmatched tokens as wellflatten
- whether to flatten matched expressions into individual tokensmatchedExpressionsAnnotationKey
- Annotation key where matched expressions are stored as a listMultiple TokensRegexAnnotator
can be configured using the same properties file by specifying
difference prefix for the TokensRegexAnnotator
Modifier and Type | Class and Description |
---|---|
static class |
TokensRegexAnnotator.Options |
DEFAULT_REQUIREMENTS, STANFORD_CDC_TOKENIZE, STANFORD_CLEAN_XML, STANFORD_COLUMN_DATA_CLASSIFIER, STANFORD_COREF, STANFORD_COREF_MENTION, STANFORD_DEPENDENCIES, STANFORD_DETERMINISTIC_COREF, STANFORD_DOCDATE, STANFORD_ENTITY_MENTIONS, STANFORD_GENDER, STANFORD_KBP, STANFORD_LEMMA, STANFORD_LINK, STANFORD_MWT, STANFORD_NATLOG, STANFORD_NER, STANFORD_OPENIE, STANFORD_PARSE, STANFORD_POS, STANFORD_QUOTE, STANFORD_QUOTE_ATTRIBUTION, STANFORD_REGEXNER, STANFORD_RELATION, STANFORD_SENTIMENT, STANFORD_SSPLIT, STANFORD_TOKENIZE, STANFORD_TOKENSREGEX, STANFORD_TRUECASE, STANFORD_UD_FEATURES
Constructor and Description |
---|
TokensRegexAnnotator(java.util.Properties props) |
TokensRegexAnnotator(java.lang.String... files) |
TokensRegexAnnotator(java.lang.String name,
java.util.Properties props) |
Modifier and Type | Method and Description |
---|---|
void |
annotate(Annotation annotation)
Given an Annotation, perform a task on this Annotation.
|
java.util.Set<java.lang.Class<? extends CoreAnnotation>> |
requirementsSatisfied()
Returns a set of requirements for which tasks this annotator can
provide.
|
java.util.Set<java.lang.Class<? extends CoreAnnotation>> |
requires()
Returns the set of tasks which this annotator requires in order
to perform.
|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
exactRequirements, unmount
public TokensRegexAnnotator(java.lang.String... files)
public TokensRegexAnnotator(java.lang.String name, java.util.Properties props)
public TokensRegexAnnotator(java.util.Properties props)
public void annotate(Annotation annotation)
Annotator
public java.util.Set<java.lang.Class<? extends CoreAnnotation>> requires()
Annotator
public java.util.Set<java.lang.Class<? extends CoreAnnotation>> requirementsSatisfied()
Annotator
requirementsSatisfied
in interface Annotator