java.lang.Object
  edu.stanford.nlp.process.CoreLabelTokenFactory
public class CoreLabelTokenFactory
Constructs CoreLabels from Strings, optionally with beginning and ending
(character after the end) offset positions in an original text. The
makeToken method puts the token text under both the OriginalTextAnnotation
and the TextAnnotation keys (2 places!), and optionally records the begin
offset and the position-after-end offset in BeginPositionAnnotation and
EndPositionAnnotation. If the tokens are built in PTBTokenizer with an
"invertible" tokenizer, you will also get a BeforeAnnotation and, for the
last token, an AfterAnnotation. You can also get an empty CoreLabel token.
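The behaviour described above can be sketched without the library on the classpath. The class below is a hypothetical stand-in, not part of CoreNLP: `TokenFactorySketch` and the plain `Map` with String keys are assumptions used only to show which keys the description says get filled in, and how the end offset is derived as the position after the last character.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical stand-in for the factory described above; not the CoreNLP class. */
final class TokenFactorySketch {
    private final boolean addIndices;

    TokenFactorySketch(boolean addIndices) {
        this.addIndices = addIndices;
    }

    /** Mirrors the documented effect of makeToken(tokenText, begin, length). */
    Map<String, Object> makeToken(String tokenText, int begin, int length) {
        Map<String, Object> label = new LinkedHashMap<>();
        // The token text is stored in two places, as the description states.
        label.put("TextAnnotation", tokenText);
        label.put("OriginalTextAnnotation", tokenText);
        if (addIndices) {
            label.put("BeginPositionAnnotation", begin);
            // The end offset is the position after the last character.
            label.put("EndPositionAnnotation", begin + length);
        }
        return label;
    }

    public static void main(String[] args) {
        System.out.println(new TokenFactorySketch(true).makeToken("quick", 4, 5));
        System.out.println(new TokenFactorySketch(false).makeToken("quick", 4, 5));
    }
}
```

The String keys only name the annotation classes mentioned in the description; the real CoreLabel is keyed by annotation classes rather than Strings.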
Constructor Summary | |
---|---|
CoreLabelTokenFactory() | Constructor for a new token factory which will add in the word, the "current" annotation, and the begin/end position annotations. |
CoreLabelTokenFactory(boolean addIndices) | Constructor that allows one to choose whether the index annotations indicating begin/end position will be included in the label. |
Method Summary | | |
---|---|---|
CoreLabel | makeToken() | |
CoreLabel | makeToken(CoreLabel labelToBeCopied) | |
CoreLabel | makeToken(String[] keys, String[] values) | |
CoreLabel | makeToken(String tokenText, int begin, int length) | Constructs a CoreLabel from a String with corresponding BEGIN and END positions. |
CoreLabel | makeToken(String tokenText, String originalText, int begin, int length) | Constructs a CoreLabel from a String with corresponding BEGIN and END positions, for the case where the OriginalTextAnnotation is different from the TextAnnotation (does not take a substring). |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public CoreLabelTokenFactory()

public CoreLabelTokenFactory(boolean addIndices)
Parameters:
addIndices - if true, begin and end position annotations will be included (this is the default)

Method Detail |
---|
public CoreLabel makeToken(String tokenText, int begin, int length)
Specified by: makeToken in interface LexedTokenFactory<CoreLabel>
Parameters:
tokenText - The String extracted by the lexer.
begin - The offset in the document of the first character in this string.
length - The number of characters the string takes up in the document.
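
To make the begin and length parameters concrete, here is a small self-contained sketch of how a caller might compute them. `OffsetSketch` and its whitespace splitting are assumptions for illustration only; the real lexer (for example PTBTokenizer) segments text quite differently. The point is just that `begin` is a zero-based character offset and that `begin + length` is the exclusive end.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper showing the begin/length convention; not part of CoreNLP. */
final class OffsetSketch {

    /** Returns one {begin, length} pair per whitespace-delimited token. */
    static List<int[]> spans(String document) {
        List<int[]> out = new ArrayList<>();
        int i = 0;
        while (i < document.length()) {
            if (Character.isWhitespace(document.charAt(i))) {
                i++;
                continue;
            }
            int begin = i;
            while (i < document.length() && !Character.isWhitespace(document.charAt(i))) {
                i++;
            }
            out.add(new int[] {begin, i - begin});
        }
        return out;
    }

    /** Recovers the token text from its offsets; the end index is exclusive. */
    static String tokenText(String document, int begin, int length) {
        return document.substring(begin, begin + length);
    }

    public static void main(String[] args) {
        String document = "The quick fox";
        for (int[] span : spans(document)) {
            System.out.println(span[0] + "," + span[1] + " -> " + tokenText(document, span[0], span[1]));
        }
    }
}
```

Each pair produced this way is the kind of (begin, length) argument the method above expects alongside the token text.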
public CoreLabel makeToken(String tokenText, String originalText, int begin, int length)
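
The four-argument overload covers the case where the stored token text differs from the text in the document. The sketch below is a hypothetical stand-in (`OriginalTextSketch` and the String-keyed `Map` are not CoreNLP API), and it assumes that `length` has the same meaning as in the three-argument overload. It uses the Penn Treebank convention of writing "(" as "-LRB-" as an example of a normalized token.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical stand-in for the four-argument makeToken; not the CoreNLP class. */
final class OriginalTextSketch {

    static Map<String, Object> makeToken(String tokenText, String originalText,
                                         int begin, int length) {
        Map<String, Object> label = new LinkedHashMap<>();
        label.put("TextAnnotation", tokenText);            // normalized form
        label.put("OriginalTextAnnotation", originalText); // form found in the document
        label.put("BeginPositionAnnotation", begin);
        label.put("EndPositionAnnotation", begin + length); // assumed: document span
        return label;
    }

    public static void main(String[] args) {
        String document = "see (here)";
        // The caller supplies the original text; no substring is taken by makeToken.
        String original = document.substring(4, 5);
        System.out.println(makeToken("-LRB-", original, 4, 1));
    }
}
```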

public CoreLabel makeToken()
Specified by: makeToken in interface CoreTokenFactory<CoreLabel>

public CoreLabel makeToken(String[] keys, String[] values)
Specified by: makeToken in interface CoreTokenFactory<CoreLabel>

public CoreLabel makeToken(CoreLabel labelToBeCopied)
Specified by: makeToken in interface CoreTokenFactory<CoreLabel>