ChunkAnnotationUtils (Stanford JavaNLP API)

java.lang.Object
- edu.stanford.nlp.pipeline.ChunkAnnotationUtils

public class ChunkAnnotationUtils
extends java.lang.Object

Utility functions for annotating chunks

Author:: Angel Chang

Method Summary

All Methods Static Methods Concrete Methods
Modifier and Type	Method and Description
`static void`	`annotateChunk(CoreMap annotation, java.lang.Class newAnnotationKey, java.lang.Class aggrKey, CoreMapAttributeAggregator aggregator)`
`static void`	`annotateChunk(CoreMap chunk, java.util.List<CoreLabel> tokens, int tokenStartIndex, int tokenEndIndex, int totalTokenOffset)` Annotates a CoreMap representing a chunk with basic chunk information.
`static void`	`annotateChunk(CoreMap chunk, java.util.Map<java.lang.String,java.lang.String> attributes)`
`static void`	`annotateChunks(java.util.List<? extends CoreMap> chunks, int start, int end, java.util.Map<java.lang.String,java.lang.String> attributes)`
`static void`	`annotateChunks(java.util.List<? extends CoreMap> chunks, java.util.Map<java.lang.String,java.lang.String> attributes)`
`static void`	`annotateChunkText(CoreMap chunk, java.lang.Class tokenTextKey)` Annotates a CoreMap representing a chunk with text information TextAnnotation - String representing tokens in this chunks (token text separated by space)
`static boolean`	`annotateChunkText(CoreMap chunk, CoreMap origAnnotation)` Annotates a CoreMap representing a chunk with text information TextAnnotation - String extracted from the origAnnotation using character offset information for this chunk
`static void`	`annotateChunkTokens(CoreMap chunk, java.lang.Class tokenChunkKey, java.lang.Class tokenLabelKey)` Annotates tokens in chunk.
`static <T extends CoreMap> void`	`appendCoreMap(java.util.List<T> res, CoreMap cm, java.lang.String text, int start, int end, CoreTokenFactory<T> factory)`
`static boolean`	`checkOffsets(CoreMap docAnnotation)` Checks if offsets of doc and sentence matches.
`static void`	`copyUnsetAnnotations(CoreMap src, CoreMap dest)` Copies annotation over to this CoreMap if not already set.
`static <T extends CoreMap> T`	`createCoreMap(CoreMap cm, java.lang.String text, int start, int end, CoreTokenFactory<T> factory)`
`static boolean`	`fixChunkSentenceBoundaries(CoreMap docAnnotation, java.util.List<IntPair> chunkCharOffsets)` Give an list of character offsets for chunk, fix sentence splitting so sentences doesn't break the chunks.
`static boolean`	`fixChunkSentenceBoundaries(CoreMap docAnnotation, java.util.List<IntPair> chunkCharOffsets, boolean offsetsAreNotSorted, boolean extendedFixSentence, boolean moreExtendedFixSentence)` Give an list of character offsets for chunk, fix sentence splitting so sentences doesn't break the chunks.
`static boolean`	`fixChunkTokenBoundaries(CoreMap docAnnotation, java.util.List<IntPair> chunkCharOffsets)` Give an list of character offsets for chunk, fix tokenization so tokenization occurs at boundary of chunks.
`static boolean`	`fixTokenOffsets(CoreMap docAnnotation)` Fix token offsets of sentences to match those in the document (assumes tokens are shared) sentence token indices may not match document token list if certain html elements are ignored.
`static Annotation`	`getAnnotatedChunk(CoreMap annotation, int tokenStartIndex, int tokenEndIndex)` Create a new chunk Annotation with basic chunk information CharacterOffsetBeginAnnotation - set to CharacterOffsetBeginAnnotation of first token in chunk CharacterOffsetEndAnnotation - set to CharacterOffsetEndAnnotation of last token in chunk TokensAnnotation - List of tokens in this chunk TokenBeginAnnotation - Index of first token in chunk (index in original list of tokens) tokenStartIndex + annotation's TokenBeginAnnotation TokenEndAnnotation - Index of last token in chunk (index in original list of tokens) tokenEndIndex + annotation's TokenBeginAnnotation TextAnnotation - String extracted from the origAnnotation using character offset information for this chunk
`static Annotation`	`getAnnotatedChunk(CoreMap annotation, int tokenStartIndex, int tokenEndIndex, java.lang.Class tokenChunkKey, java.lang.Class tokenLabelKey)` Create a new chunk Annotation with basic chunk information CharacterOffsetBeginAnnotation - set to CharacterOffsetBeginAnnotation of first token in chunk CharacterOffsetEndAnnotation - set to CharacterOffsetEndAnnotation of last token in chunk TokensAnnotation - List of tokens in this chunk TokenBeginAnnotation - Index of first token in chunk (index in original list of tokens) tokenStartIndex + annotation's TokenBeginAnnotation TokenEndAnnotation - Index of last token in chunk (index in original list of tokens) tokenEndIndex + annotation's TokenBeginAnnotation TextAnnotation - String extracted from the origAnnotation using character offset information for this chunk
`static Annotation`	`getAnnotatedChunk(java.util.List<CoreLabel> tokens, int tokenStartIndex, int tokenEndIndex, int totalTokenOffset)` Create a new chunk Annotation with basic chunk information.
`static Annotation`	`getAnnotatedChunk(java.util.List<CoreLabel> tokens, int tokenStartIndex, int tokenEndIndex, int totalTokenOffset, java.lang.Class tokenChunkKey, java.lang.Class tokenTextKey, java.lang.Class tokenLabelKey)` Create a new chunk Annotation with basic chunk information.
`static java.util.List<CoreMap>`	`getAnnotatedChunksUsingSortedCharOffsets(CoreMap annotation, java.util.List<IntPair> charOffsets)`
`static java.util.List<CoreMap>`	`getAnnotatedChunksUsingSortedCharOffsets(CoreMap annotation, java.util.List<IntPair> charOffsets, boolean charOffsetIsRelative, java.lang.Class tokenChunkKey, java.lang.Class tokenLabelKey, boolean allowPartialTokens)` Create a list of new chunk Annotation with basic chunk information.
`static CoreMap`	`getAnnotatedChunkUsingCharOffsets(CoreMap annotation, int charOffsetStart, int charOffsetEnd)` Returns a chunk annotation based on char offsets.
`static Interval<java.lang.Integer>`	`getChunkOffsetsUsingCharOffsets(java.util.List<? extends CoreMap> chunkList, int charStart, int charEnd)` Return chunk offsets
`static CoreMap`	`getMergedChunk(java.util.List<? extends CoreMap> chunkList, int chunkIndexStart, int chunkIndexEnd, java.util.Map<java.lang.Class,CoreMapAttributeAggregator> aggregators, CoreLabelTokenFactory tokenFactory)` Create chunk that is merged from chunkIndexStart to chunkIndexEnd (exclusive)
`static CoreMap`	`getMergedChunk(java.util.List<? extends CoreMap> chunkList, java.lang.String origText, int chunkIndexStart, int chunkIndexEnd, CoreLabelTokenFactory tokenFactory)` Create chunk that is merged from chunkIndexStart to chunkIndexEnd (exclusive).
`static java.lang.String`	`getTokenText(java.util.List<? extends CoreMap> tokens, java.lang.Class tokenTextKey)`
`static java.lang.String`	`getTokenText(java.util.List<? extends CoreMap> tokens, java.lang.Class tokenTextKey, java.lang.String delimiter)`
`static boolean`	`hasCharacterOffsets(CoreMap chunk)`
`static void`	`mergeChunks(java.util.List<CoreMap> chunkList, java.lang.String origText, int chunkIndexStart, int chunkIndexEnd)` Merge chunks from chunkIndexStart to chunkIndexEnd (exclusive) and replace them in the list.
`static <T extends CoreMap> java.util.List<T>`	`splitCoreMap(java.util.regex.Pattern p, boolean includeMatched, CoreMap cm, CoreTokenFactory<T> factory)`

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

- Method Detail
  - checkOffsets
```
public static boolean checkOffsets(CoreMap docAnnotation)
```
    Checks if offsets of doc and sentence matches.
    
    Parameters:
    
    docAnnotation - The document Annotation to analyze
    
    Returns:
    
    true if the offsets match, false otherwise
  - fixTokenOffsets
```
public static boolean fixTokenOffsets(CoreMap docAnnotation)
```
    Fix token offsets of sentences to match those in the document (assumes tokens are shared) sentence token indices may not match document token list if certain html elements are ignored.
    
    Parameters:
    
    docAnnotation - The document Annotation to analyze
    
    Returns:
    
    true if fix was okay, false otherwise
  - copyUnsetAnnotations
```
public static void copyUnsetAnnotations(CoreMap src,
                                        CoreMap dest)
```
    Copies annotation over to this CoreMap if not already set.
  - fixChunkTokenBoundaries
```
public static boolean fixChunkTokenBoundaries(CoreMap docAnnotation,
                                              java.util.List<IntPair> chunkCharOffsets)
```
    Give an list of character offsets for chunk, fix tokenization so tokenization occurs at boundary of chunks.
    
    Parameters:
    
    docAnnotation -
    
    chunkCharOffsets -
  - getMergedChunk
```
public static CoreMap getMergedChunk(java.util.List<? extends CoreMap> chunkList,
                                     java.lang.String origText,
                                     int chunkIndexStart,
                                     int chunkIndexEnd,
                                     CoreLabelTokenFactory tokenFactory)
```
    Create chunk that is merged from chunkIndexStart to chunkIndexEnd (exclusive).
    
    Parameters:
    
    chunkList - - List of chunks
    
    origText - - Text from which to extract chunk text
    
    chunkIndexStart - - Index of first chunk to merge
    
    chunkIndexEnd - - Index of last chunk to merge (exclusive)
    
    tokenFactory - - factory for creating tokens (if we want to get a merged corelabel instead of something random)
    
    Returns:
    
    new merged chunk
  - getMergedChunk
```
public static CoreMap getMergedChunk(java.util.List<? extends CoreMap> chunkList,
                                     int chunkIndexStart,
                                     int chunkIndexEnd,
                                     java.util.Map<java.lang.Class,CoreMapAttributeAggregator> aggregators,
                                     CoreLabelTokenFactory tokenFactory)
```
    Create chunk that is merged from chunkIndexStart to chunkIndexEnd (exclusive)
    
    Parameters:
    
    chunkList - - List of chunks
    
    chunkIndexStart - - Index of first chunk to merge
    
    chunkIndexEnd - - Index of last chunk to merge (exclusive)
    
    aggregators - - Aggregators
    
    tokenFactory - - factory for creating tokens (if we want to get a merged corelabel instead of something random)
    
    Returns:
    
    new merged chunk
  - getChunkOffsetsUsingCharOffsets
```
public static Interval<java.lang.Integer> getChunkOffsetsUsingCharOffsets(java.util.List<? extends CoreMap> chunkList,
                                                                          int charStart,
                                                                          int charEnd)
```
    Return chunk offsets
    
    Parameters:
    
    chunkList - - List of chunks
    
    charStart - - character begin offset
    
    charEnd - - character end offset
    
    Returns:
    
    chunk offsets
  - mergeChunks
```
public static void mergeChunks(java.util.List<CoreMap> chunkList,
                               java.lang.String origText,
                               int chunkIndexStart,
                               int chunkIndexEnd)
```
    Merge chunks from chunkIndexStart to chunkIndexEnd (exclusive) and replace them in the list.
    
    Parameters:
    
    chunkList - - List of chunks
    
    origText - - Text from which to extract chunk text
    
    chunkIndexStart - - Index of first chunk to merge
    
    chunkIndexEnd - - Index of last chunk to merge (exclusive)
  - fixChunkSentenceBoundaries
```
public static boolean fixChunkSentenceBoundaries(CoreMap docAnnotation,
                                                 java.util.List<IntPair> chunkCharOffsets)
```
    Give an list of character offsets for chunk, fix sentence splitting so sentences doesn't break the chunks.
    
    Parameters:
    
    docAnnotation - Document with sentences
    
    chunkCharOffsets - ordered pairs of different chunks that should appear in sentences
    
    Returns:
    
    true if fix was okay (chunks are in all sentences), false otherwise
  - fixChunkSentenceBoundaries
```
public static boolean fixChunkSentenceBoundaries(CoreMap docAnnotation,
                                                 java.util.List<IntPair> chunkCharOffsets,
                                                 boolean offsetsAreNotSorted,
                                                 boolean extendedFixSentence,
                                                 boolean moreExtendedFixSentence)
```
    Give an list of character offsets for chunk, fix sentence splitting so sentences doesn't break the chunks.
    
    Parameters:
    
    docAnnotation - Document with sentences
    
    chunkCharOffsets - ordered pairs of different chunks that should appear in sentences
    
    offsetsAreNotSorted - Treat each pair of offsets as independent (look through all sentences again)
    
    extendedFixSentence - Do extended sentence fixing based on some heuristics
    
    moreExtendedFixSentence - Do even more extended sentence fixing based on some heuristics
    
    Returns:
    
    true if fix was okay (chunks are in all sentences), false otherwise
  - annotateChunk
```
public static void annotateChunk(CoreMap chunk,
                                 java.util.List<CoreLabel> tokens,
                                 int tokenStartIndex,
                                 int tokenEndIndex,
                                 int totalTokenOffset)
```
    Annotates a CoreMap representing a chunk with basic chunk information. CharacterOffsetBeginAnnotation - set to CharacterOffsetBeginAnnotation of first token in chunk CharacterOffsetEndAnnotation - set to CharacterOffsetEndAnnotation of last token in chunk TokensAnnotation - List of tokens in this chunk TokenBeginAnnotation - Index of first token in chunk (index in original list of tokens) tokenStartIndex + totalTokenOffset TokenEndAnnotation - Index of last token in chunk (index in original list of tokens) tokenEndIndex + totalTokenOffset
    
    Parameters:
    
    chunk - - CoreMap to be annotated
    
    tokens - - List of tokens to look for chunks
    
    tokenStartIndex - - Index (relative to current list of tokens) at which this chunk starts
    
    tokenEndIndex - - Index (relative to current list of tokens) at which this chunk ends (not inclusive)
    
    totalTokenOffset - - Index of tokens to offset by
  - getTokenText
```
public static java.lang.String getTokenText(java.util.List<? extends CoreMap> tokens,
                                            java.lang.Class tokenTextKey)
```
  - getTokenText
```
public static java.lang.String getTokenText(java.util.List<? extends CoreMap> tokens,
                                            java.lang.Class tokenTextKey,
                                            java.lang.String delimiter)
```
  - annotateChunkText
```
public static void annotateChunkText(CoreMap chunk,
                                     java.lang.Class tokenTextKey)
```
    Annotates a CoreMap representing a chunk with text information TextAnnotation - String representing tokens in this chunks (token text separated by space)
    
    Parameters:
    
    chunk - - CoreMap to be annotated
    
    tokenTextKey - - Key to use to find the token text
  - hasCharacterOffsets
```
public static boolean hasCharacterOffsets(CoreMap chunk)
```
  - annotateChunkText
```
public static boolean annotateChunkText(CoreMap chunk,
                                        CoreMap origAnnotation)
```
    Annotates a CoreMap representing a chunk with text information TextAnnotation - String extracted from the origAnnotation using character offset information for this chunk
    
    Parameters:
    
    chunk - - CoreMap to be annotated
    
    origAnnotation - - Annotation from which to extract the text for this chunk
  - annotateChunkTokens
```
public static void annotateChunkTokens(CoreMap chunk,
                                       java.lang.Class tokenChunkKey,
                                       java.lang.Class tokenLabelKey)
```
    Annotates tokens in chunk.
    
    Parameters:
    
    chunk - - CoreMap representing chunk (should have TextAnnotation and TokensAnnotation)
    
    tokenChunkKey - - If not null, each token is annotated with the chunk using this key
    
    tokenLabelKey - - If not null, each token is annotated with the text associated with the chunk using this key
  - getAnnotatedChunk
```
public static Annotation getAnnotatedChunk(java.util.List<CoreLabel> tokens,
                                           int tokenStartIndex,
                                           int tokenEndIndex,
                                           int totalTokenOffset)
```
    Create a new chunk Annotation with basic chunk information. CharacterOffsetBeginAnnotation - set to CharacterOffsetBeginAnnotation of first token in chunk CharacterOffsetEndAnnotation - set to CharacterOffsetEndAnnotation of last token in chunk TokensAnnotation - List of tokens in this chunk TokenBeginAnnotation - Index of first token in chunk (index in original list of tokens) tokenStartIndex + totalTokenOffset TokenEndAnnotation - Index of last token in chunk (index in original list of tokens) tokenEndIndex + totalTokenOffset
    
    Parameters:
    
    tokens - - List of tokens to look for chunks
    
    tokenStartIndex - - Index (relative to current list of tokens) at which this chunk starts
    
    tokenEndIndex - - Index (relative to current list of tokens) at which this chunk ends (not inclusive)
    
    totalTokenOffset - - Index of tokens to offset by
    
    Returns:
    
    Annotation representing new chunk
  - getAnnotatedChunk
```
public static Annotation getAnnotatedChunk(java.util.List<CoreLabel> tokens,
                                           int tokenStartIndex,
                                           int tokenEndIndex,
                                           int totalTokenOffset,
                                           java.lang.Class tokenChunkKey,
                                           java.lang.Class tokenTextKey,
                                           java.lang.Class tokenLabelKey)
```
    Create a new chunk Annotation with basic chunk information. CharacterOffsetBeginAnnotation - set to CharacterOffsetBeginAnnotation of first token in chunk CharacterOffsetEndAnnotation - set to CharacterOffsetEndAnnotation of last token in chunk TokensAnnotation - List of tokens in this chunk TokenBeginAnnotation - Index of first token in chunk (index in original list of tokens) tokenStartIndex + totalTokenOffset TokenEndAnnotation - Index of last token in chunk (index in original list of tokens) tokenEndIndex + totalTokenOffset TextAnnotation - String extracted from the origAnnotation using character offset information for this chunk
    
    Parameters:
    
    tokens - - List of tokens to look for chunks
    
    tokenStartIndex - - Index (relative to current list of tokens) at which this chunk starts
    
    tokenEndIndex - - Index (relative to current list of tokens) at which this chunk ends (not inclusive)
    
    totalTokenOffset - - Index of tokens to offset by
    
    tokenChunkKey - - If not null, each token is annotated with the chunk using this key
    
    tokenTextKey - - Key to use to find the token text
    
    tokenLabelKey - - If not null, each token is annotated with the text associated with the chunk using this key
    
    Returns:
    
    Annotation representing new chunk
  - getAnnotatedChunk
```
public static Annotation getAnnotatedChunk(CoreMap annotation,
                                           int tokenStartIndex,
                                           int tokenEndIndex)
```
    Create a new chunk Annotation with basic chunk information CharacterOffsetBeginAnnotation - set to CharacterOffsetBeginAnnotation of first token in chunk CharacterOffsetEndAnnotation - set to CharacterOffsetEndAnnotation of last token in chunk TokensAnnotation - List of tokens in this chunk TokenBeginAnnotation - Index of first token in chunk (index in original list of tokens) tokenStartIndex + annotation's TokenBeginAnnotation TokenEndAnnotation - Index of last token in chunk (index in original list of tokens) tokenEndIndex + annotation's TokenBeginAnnotation TextAnnotation - String extracted from the origAnnotation using character offset information for this chunk
    
    Parameters:
    
    annotation - - Annotation from which to extract the text for this chunk
    
    tokenStartIndex - - Index (relative to current list of tokens) at which this chunk starts
    
    tokenEndIndex - - Index (relative to current list of tokens) at which this chunk ends (not inclusive)
    
    Returns:
    
    Annotation representing new chunk
  - getAnnotatedChunk
```
public static Annotation getAnnotatedChunk(CoreMap annotation,
                                           int tokenStartIndex,
                                           int tokenEndIndex,
                                           java.lang.Class tokenChunkKey,
                                           java.lang.Class tokenLabelKey)
```
    Create a new chunk Annotation with basic chunk information CharacterOffsetBeginAnnotation - set to CharacterOffsetBeginAnnotation of first token in chunk CharacterOffsetEndAnnotation - set to CharacterOffsetEndAnnotation of last token in chunk TokensAnnotation - List of tokens in this chunk TokenBeginAnnotation - Index of first token in chunk (index in original list of tokens) tokenStartIndex + annotation's TokenBeginAnnotation TokenEndAnnotation - Index of last token in chunk (index in original list of tokens) tokenEndIndex + annotation's TokenBeginAnnotation TextAnnotation - String extracted from the origAnnotation using character offset information for this chunk
    
    Parameters:
    
    annotation - - Annotation from which to extract the text for this chunk
    
    tokenStartIndex - - Index (relative to current list of tokens) at which this chunk starts
    
    tokenEndIndex - - Index (relative to current list of tokens) at which this chunk ends (not inclusive)
    
    tokenChunkKey - - If not null, each token is annotated with the chunk using this key
    
    tokenLabelKey - - If not null, each token is annotated with the text associated with the chunk using this key
    
    Returns:
    
    Annotation representing new chunk
  - getAnnotatedChunkUsingCharOffsets
```
public static CoreMap getAnnotatedChunkUsingCharOffsets(CoreMap annotation,
                                                        int charOffsetStart,
                                                        int charOffsetEnd)
```
    Returns a chunk annotation based on char offsets.
    
    Parameters:
    
    annotation - Annotation from which to extract the text for this chunk
    
    charOffsetStart - Start character offset
    
    charOffsetEnd - End (not inclusive) character offset
    
    Returns:
    
    An Annotation representing the new chunk. Or null if no chunk matches offsets.
  - getAnnotatedChunksUsingSortedCharOffsets
```
public static java.util.List<CoreMap> getAnnotatedChunksUsingSortedCharOffsets(CoreMap annotation,
                                                                               java.util.List<IntPair> charOffsets)
```
  - getAnnotatedChunksUsingSortedCharOffsets
```
public static java.util.List<CoreMap> getAnnotatedChunksUsingSortedCharOffsets(CoreMap annotation,
                                                                               java.util.List<IntPair> charOffsets,
                                                                               boolean charOffsetIsRelative,
                                                                               java.lang.Class tokenChunkKey,
                                                                               java.lang.Class tokenLabelKey,
                                                                               boolean allowPartialTokens)
```
    Create a list of new chunk Annotation with basic chunk information. CharacterOffsetBeginAnnotation - set to CharacterOffsetBeginAnnotation of first token in chunk CharacterOffsetEndAnnotation - set to CharacterOffsetEndAnnotation of last token in chunk TokensAnnotation - List of tokens in this chunk TokenBeginAnnotation - Index of first token in chunk (index in original list of tokens) tokenStartIndex + annotation's TokenBeginAnnotation TokenEndAnnotation - Index of last token in chunk (index in original list of tokens) tokenEndIndex + annotation's TokenBeginAnnotation TextAnnotation - String extracted from the origAnnotation using character offset information for this chunk
    
    Parameters:
    
    annotation - Annotation from which to extract the text for this chunk
    
    charOffsets - - List of start and end (not inclusive) character offsets Note: assume char offsets are sorted and non-overlapping!!!
    
    charOffsetIsRelative - - Whether the character offsets are relative to the current annotation or absolute offsets
    
    tokenChunkKey - - If not null, each token is annotated with the chunk using this key
    
    tokenLabelKey - - If not null, each token is annotated with the text associated with the chunk using this key
    
    allowPartialTokens - - Whether to allow partial tokens or not
    
    Returns:
    
    List of Annotation representing new chunks; may be empty never null
  - annotateChunk
```
public static void annotateChunk(CoreMap annotation,
                                 java.lang.Class newAnnotationKey,
                                 java.lang.Class aggrKey,
                                 CoreMapAttributeAggregator aggregator)
```
  - annotateChunk
```
public static void annotateChunk(CoreMap chunk,
                                 java.util.Map<java.lang.String,java.lang.String> attributes)
```
  - annotateChunks
```
public static void annotateChunks(java.util.List<? extends CoreMap> chunks,
                                  int start,
                                  int end,
                                  java.util.Map<java.lang.String,java.lang.String> attributes)
```
  - annotateChunks
```
public static void annotateChunks(java.util.List<? extends CoreMap> chunks,
                                  java.util.Map<java.lang.String,java.lang.String> attributes)
```
  - createCoreMap
```
public static <T extends CoreMap> T createCoreMap(CoreMap cm,
                                                  java.lang.String text,
                                                  int start,
                                                  int end,
                                                  CoreTokenFactory<T> factory)
```
  - appendCoreMap
```
public static <T extends CoreMap> void appendCoreMap(java.util.List<T> res,
                                                     CoreMap cm,
                                                     java.lang.String text,
                                                     int start,
                                                     int end,
                                                     CoreTokenFactory<T> factory)
```
  - splitCoreMap
```
public static <T extends CoreMap> java.util.List<T> splitCoreMap(java.util.regex.Pattern p,
                                                                 boolean includeMatched,
                                                                 CoreMap cm,
                                                                 CoreTokenFactory<T> factory)
```

Class ChunkAnnotationUtils

Method Summary

Methods inherited from class java.lang.Object

Method Detail

checkOffsets

fixTokenOffsets

copyUnsetAnnotations

fixChunkTokenBoundaries

getMergedChunk

getMergedChunk

getChunkOffsetsUsingCharOffsets

mergeChunks

fixChunkSentenceBoundaries

fixChunkSentenceBoundaries

annotateChunk

getTokenText

getTokenText

annotateChunkText

hasCharacterOffsets

annotateChunkText

annotateChunkTokens

getAnnotatedChunk

getAnnotatedChunk

getAnnotatedChunk

getAnnotatedChunk

getAnnotatedChunkUsingCharOffsets

getAnnotatedChunksUsingSortedCharOffsets

getAnnotatedChunksUsingSortedCharOffsets

annotateChunk

annotateChunk

annotateChunks

annotateChunks

createCoreMap

appendCoreMap

splitCoreMap