Annotator (Stanford JavaNLP API)

All Known Implementing Classes:

AnnotationPipeline, ArabicSegmenterAnnotator, BinarizerAnnotator, ChapterAnnotator, CharniakParserAnnotator, ChineseSegmenterAnnotator, CleanXmlAnnotator, ColumnDataClassifierAnnotator, CorefAnnotator, CorefMentionAnnotator, DependencyParseAnnotator, DeterministicCorefAnnotator, DocDateAnnotator, EntityMentionsAnnotator, GenderAnnotator, GenericWebServiceAnnotator, GUTimeAnnotator, HeidelTimeAnnotator, HeidelTimeKBPAnnotator, HybridCorefAnnotator, KBPAnnotator, MorphaAnnotator, MWTAnnotator, NaturalLogicAnnotator, NERCombinerAnnotator, NumberAnnotator, OpenIE, ParagraphAnnotator, ParserAnnotator, POSTaggerAnnotator, QuantifiableEntityNormalizingAnnotator, QuoteAnnotator, QuoteAttributionAnnotator, RegexNERAnnotator, RelationExtractorAnnotator, SentenceAnnotator, SentimentAnnotator, StanfordCoreNLP, StanfordCoreNLPClient, StatTokSentAnnotator, TimeAnnotator, TimexTreeAnnotator, TokenizerAnnotator, TokensRegexAnnotator, TokensRegexNERAnnotator, TrueCaseAnnotator, UDFeatureAnnotator, WebServiceAnnotator, WikidictAnnotator, WordsToSentencesAnnotator
```
public interface Annotator
```
This is an interface for adding annotations to a partially annotated Annotation. In some ways, it is just a glorified function, except that it explicitly operates in-place on Annotation objects. Annotators should be given to an AnnotationPipeline in order to make annotation pipelines (the whole motivation of this package), and therefore implementers of this interface should be designed to play well with other Annotators and in their javadocs they should explicitly state what annotations they are assuming already exist in the annotation (like parse, POS tag, etc), what keys they are expecting them under (see, for instance, the ones in CoreAnnotations), and what annotations they will add (or modify) and the keys for them as well. If you would like to look at the code for a relatively simple Annotator, I recommend NERAnnotator. For a lot of code you could just add the implements directly, but I recommend wrapping instead because I believe that it will help to keep the pipeline code more manageable. An Annotator should also provide a description of what it produces and a description of what it requires to have been produced by using Sets of requirements. The StanfordCoreNLP version of the AnnotationPipeline can enforce requirements, throwing an exception if an annotator does not have all of its prerequisites met. An Annotator which does not participate in this system can simply return Collections.emptySet() for both requires() and requirementsSatisfied().
Properties
We extensively use Properties objects to configure each Annotator. In particular, CoreNLP has most of its properties in an informal namespace with properties names like "parse.maxlen" to specify that a property only applies to a parser annotator. There can also be global properties; they should not have any periods in their names. Each Annotator knows its own name; we assume these don't collide badly, though possibly two parsers could share the "parse.*" namespace. An Annotator should have a constructor that simply takes a Properties object. At this point, the Annotator should expect to be getting properties in namespaces. The classes that annotators call (like a concrete parser, tagger, or whatever) mainly expect properties not in namespaces. In general the annotator should subset the passed in properties to keep only global properties and ones in its own namespace, and then strip the namespace prefix from the latter properties.

Author:

Jenny Finkel

Field Summary

Fields
Modifier and Type	Field and Description
`static java.util.Map<java.lang.String,java.util.Set<java.lang.String>>`	`DEFAULT_REQUIREMENTS` A mapping from an annotator to a its default transitive dependencies.
`static java.lang.String`	`STANFORD_CDC_TOKENIZE`
`static java.lang.String`	`STANFORD_CLEAN_XML`
`static java.lang.String`	`STANFORD_COLUMN_DATA_CLASSIFIER`
`static java.lang.String`	`STANFORD_COREF`
`static java.lang.String`	`STANFORD_COREF_MENTION`
`static java.lang.String`	`STANFORD_DEPENDENCIES`
`static java.lang.String`	`STANFORD_DETERMINISTIC_COREF`
`static java.lang.String`	`STANFORD_DOCDATE`
`static java.lang.String`	`STANFORD_ENTITY_MENTIONS`
`static java.lang.String`	`STANFORD_GENDER`
`static java.lang.String`	`STANFORD_KBP`
`static java.lang.String`	`STANFORD_LEMMA`
`static java.lang.String`	`STANFORD_LINK`
`static java.lang.String`	`STANFORD_MWT`
`static java.lang.String`	`STANFORD_NATLOG`
`static java.lang.String`	`STANFORD_NER`
`static java.lang.String`	`STANFORD_OPENIE`
`static java.lang.String`	`STANFORD_PARSE`
`static java.lang.String`	`STANFORD_POS`
`static java.lang.String`	`STANFORD_QUOTE`
`static java.lang.String`	`STANFORD_QUOTE_ATTRIBUTION`
`static java.lang.String`	`STANFORD_REGEXNER`
`static java.lang.String`	`STANFORD_RELATION`
`static java.lang.String`	`STANFORD_SENTIMENT`
`static java.lang.String`	`STANFORD_SSPLIT`
`static java.lang.String`	`STANFORD_TOKENIZE` These are annotators which StanfordCoreNLP knows how to create.
`static java.lang.String`	`STANFORD_TOKENSREGEX`
`static java.lang.String`	`STANFORD_TRUECASE`
`static java.lang.String`	`STANFORD_UD_FEATURES`

Method Summary

All Methods Instance Methods Abstract Methods Default Methods
Modifier and Type	Method and Description
`void`	`annotate(Annotation annotation)` Given an Annotation, perform a task on this Annotation.
`default java.util.Collection<java.lang.String>`	`exactRequirements()`
`java.util.Set<java.lang.Class<? extends CoreAnnotation>>`	`requirementsSatisfied()` Returns a set of requirements for which tasks this annotator can provide.
`java.util.Set<java.lang.Class<? extends CoreAnnotation>>`	`requires()` Returns the set of tasks which this annotator requires in order to perform.
`default void`	`unmount()` A block of code called when this annotator unmounts from the `AnnotatorPool`.

- Field Detail
  - STANFORD_TOKENIZE
```
static final java.lang.String STANFORD_TOKENIZE
```
    These are annotators which StanfordCoreNLP knows how to create. Add new annotators and/or annotators from other groups here!
    
    See Also:
    
    Constant Field Values
  - STANFORD_CDC_TOKENIZE
```
static final java.lang.String STANFORD_CDC_TOKENIZE
```
    See Also:
    
    Constant Field Values
  - STANFORD_CLEAN_XML
```
static final java.lang.String STANFORD_CLEAN_XML
```
    See Also:
    
    Constant Field Values
  - STANFORD_SSPLIT
```
static final java.lang.String STANFORD_SSPLIT
```
    See Also:
    
    Constant Field Values
  - STANFORD_MWT
```
static final java.lang.String STANFORD_MWT
```
    See Also:
    
    Constant Field Values
  - STANFORD_DOCDATE
```
static final java.lang.String STANFORD_DOCDATE
```
    See Also:
    
    Constant Field Values
  - STANFORD_POS
```
static final java.lang.String STANFORD_POS
```
    See Also:
    
    Constant Field Values
  - STANFORD_LEMMA
```
static final java.lang.String STANFORD_LEMMA
```
    See Also:
    
    Constant Field Values
  - STANFORD_NER
```
static final java.lang.String STANFORD_NER
```
    See Also:
    
    Constant Field Values
  - STANFORD_REGEXNER
```
static final java.lang.String STANFORD_REGEXNER
```
    See Also:
    
    Constant Field Values
  - STANFORD_TOKENSREGEX
```
static final java.lang.String STANFORD_TOKENSREGEX
```
    See Also:
    
    Constant Field Values
  - STANFORD_ENTITY_MENTIONS
```
static final java.lang.String STANFORD_ENTITY_MENTIONS
```
    See Also:
    
    Constant Field Values
  - STANFORD_GENDER
```
static final java.lang.String STANFORD_GENDER
```
    See Also:
    
    Constant Field Values
  - STANFORD_TRUECASE
```
static final java.lang.String STANFORD_TRUECASE
```
    See Also:
    
    Constant Field Values
  - STANFORD_PARSE
```
static final java.lang.String STANFORD_PARSE
```
    See Also:
    
    Constant Field Values
  - STANFORD_DETERMINISTIC_COREF
```
static final java.lang.String STANFORD_DETERMINISTIC_COREF
```
    See Also:
    
    Constant Field Values
  - STANFORD_COREF
```
static final java.lang.String STANFORD_COREF
```
    See Also:
    
    Constant Field Values
  - STANFORD_COREF_MENTION
```
static final java.lang.String STANFORD_COREF_MENTION
```
    See Also:
    
    Constant Field Values
  - STANFORD_RELATION
```
static final java.lang.String STANFORD_RELATION
```
    See Also:
    
    Constant Field Values
  - STANFORD_SENTIMENT
```
static final java.lang.String STANFORD_SENTIMENT
```
    See Also:
    
    Constant Field Values
  - STANFORD_COLUMN_DATA_CLASSIFIER
```
static final java.lang.String STANFORD_COLUMN_DATA_CLASSIFIER
```
    See Also:
    
    Constant Field Values
  - STANFORD_DEPENDENCIES
```
static final java.lang.String STANFORD_DEPENDENCIES
```
    See Also:
    
    Constant Field Values
  - STANFORD_NATLOG
```
static final java.lang.String STANFORD_NATLOG
```
    See Also:
    
    Constant Field Values
  - STANFORD_OPENIE
```
static final java.lang.String STANFORD_OPENIE
```
    See Also:
    
    Constant Field Values
  - STANFORD_QUOTE
```
static final java.lang.String STANFORD_QUOTE
```
    See Also:
    
    Constant Field Values
  - STANFORD_QUOTE_ATTRIBUTION
```
static final java.lang.String STANFORD_QUOTE_ATTRIBUTION
```
    See Also:
    
    Constant Field Values
  - STANFORD_UD_FEATURES
```
static final java.lang.String STANFORD_UD_FEATURES
```
    See Also:
    
    Constant Field Values
  - STANFORD_LINK
```
static final java.lang.String STANFORD_LINK
```
    See Also:
    
    Constant Field Values
  - STANFORD_KBP
```
static final java.lang.String STANFORD_KBP
```
    See Also:
    
    Constant Field Values
  - DEFAULT_REQUIREMENTS
```
static final java.util.Map<java.lang.String,java.util.Set<java.lang.String>> DEFAULT_REQUIREMENTS
```
    A mapping from an annotator to a its default transitive dependencies. Note that this is not guaranteed to be accurate, as properties set in the annotator can change the annotator's dependencies; but, it's a reasonable guess if you're using things out-of-the-box.
- Method Detail
  - annotate
```
void annotate(Annotation annotation)
```
    Given an Annotation, perform a task on this Annotation.
  - unmount
```
default void unmount()
```
    A block of code called when this annotator unmounts from the AnnotatorPool. By default, nothing is done.
  - requirementsSatisfied
```
java.util.Set<java.lang.Class<? extends CoreAnnotation>> requirementsSatisfied()
```
    Returns a set of requirements for which tasks this annotator can provide. For example, the POS annotator will return "pos".
  - requires
```
java.util.Set<java.lang.Class<? extends CoreAnnotation>> requires()
```
    Returns the set of tasks which this annotator requires in order to perform. For example, the POS annotator will return "tokenize", "ssplit".
  - exactRequirements
```
default java.util.Collection<java.lang.String> exactRequirements()
```

Interface Annotator

Properties

Field Summary

Method Summary

Field Detail

STANFORD_TOKENIZE

STANFORD_CDC_TOKENIZE

STANFORD_CLEAN_XML

STANFORD_SSPLIT

STANFORD_MWT

STANFORD_DOCDATE

STANFORD_POS

STANFORD_LEMMA

STANFORD_NER

STANFORD_REGEXNER

STANFORD_TOKENSREGEX

STANFORD_ENTITY_MENTIONS

STANFORD_GENDER

STANFORD_TRUECASE

STANFORD_PARSE

STANFORD_DETERMINISTIC_COREF

STANFORD_COREF

STANFORD_COREF_MENTION

STANFORD_RELATION

STANFORD_SENTIMENT

STANFORD_COLUMN_DATA_CLASSIFIER

STANFORD_DEPENDENCIES

STANFORD_NATLOG

STANFORD_OPENIE

STANFORD_QUOTE

STANFORD_QUOTE_ATTRIBUTION

STANFORD_UD_FEATURES

STANFORD_LINK

STANFORD_KBP

DEFAULT_REQUIREMENTS

Method Detail

annotate

unmount

requirementsSatisfied

requires

exactRequirements