edu.stanford.nlp.trees
Class PennSentenceMrgNormalizer

java.lang.Object
  extended by edu.stanford.nlp.trees.SentenceNormalizer
      extended by edu.stanford.nlp.trees.PennSentenceNormalizer
          extended by edu.stanford.nlp.trees.PennSentenceMrgNormalizer

public class PennSentenceMrgNormalizer
extends PennSentenceNormalizer

A sentence normalizer that handles the quirks of Penn Treebank POS files, such as the runs of equals signs and the square brackets. It also recodes brackets as strings like -LRB-, so that they match the encoding used in the parsed files. A Singleton.

Author:
Christopher Manning

Constructor Summary
PennSentenceMrgNormalizer()
           
 
Method Summary
 String normalizeString(String word)
          Normalizes a word string that has been read (and may intern it).
 
Methods inherited from class edu.stanford.nlp.trees.PennSentenceNormalizer
endSentenceToken, normalizeSentence
 
Methods inherited from class edu.stanford.nlp.trees.SentenceNormalizer
eolIsSentenceEnd
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

PennSentenceMrgNormalizer

public PennSentenceMrgNormalizer()
Method Detail

normalizeString

public String normalizeString(String word)
Normalizes a word string that has been read (and may intern it).

Overrides:
normalizeString in class PennSentenceNormalizer
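
Example (illustrative sketch only):
The bracket recoding described in the class comment can be pictured with the standalone sketch below. It is not the code of this class. The class name BracketRecodeSketch and the method recode are hypothetical, and the exact set of tokens this class rewrites is an assumption: only -LRB- is named in the description above, while -RRB-, -LCB- and -RCB- are the usual Penn Treebank counterparts. Square brackets are left alone here, on the assumption that in POS files they are markup, not words.

```java
public class BracketRecodeSketch {

    // Hypothetical helper, not part of edu.stanford.nlp.trees.
    // Maps a literal bracket token to its Penn Treebank style code;
    // any other word is returned interned, mirroring the
    // "may intern it" note in the method description.
    static String recode(String word) {
        switch (word) {
            case "(": return "-LRB-";
            case ")": return "-RRB-";
            case "{": return "-LCB-";   // assumed counterpart code
            case "}": return "-RCB-";   // assumed counterpart code
            default:  return word.intern();
        }
    }

    public static void main(String[] args) {
        String[] tokens = {"(", "see", "Figure", "2", ")"};
        StringBuilder out = new StringBuilder();
        for (String t : tokens) {
            if (out.length() > 0) {
                out.append(' ');
            }
            out.append(recode(t));
        }
        // prints: -LRB- see Figure 2 -RRB-
        System.out.println(out);
    }
}
```

With the real class, the equivalent call would be normalizeString(word) on a PennSentenceMrgNormalizer instance; the result for any particular token should be checked against the library itself.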


Stanford NLP Group