edu.stanford.nlp.parser.lexparser
Class ChineseMarkovWordSegmenter
java.lang.Object
edu.stanford.nlp.parser.lexparser.ChineseMarkovWordSegmenter
- All Implemented Interfaces:
- WordSegmenter, java.io.Serializable
public class ChineseMarkovWordSegmenter
- extends java.lang.Object
- implements WordSegmenter
Performs word segmentation with a hierarchical markov model over POS
and over characters given POS.
- Author:
- Galen Andrew
- See Also:
- Serialized Form
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
ChineseMarkovWordSegmenter
public ChineseMarkovWordSegmenter(ChineseCharacterBasedLexicon lex,
Index<java.lang.String> wordIndex,
Index<java.lang.String> tagIndex)
ChineseMarkovWordSegmenter
public ChineseMarkovWordSegmenter(ChineseTreebankParserParams params,
Index<java.lang.String> wordIndex,
Index<java.lang.String> tagIndex)
initializeTraining
public void initializeTraining(double numTrees)
- Specified by:
initializeTraining
in interface WordSegmenter
train
public void train(java.util.Collection<Tree> trees)
- Specified by:
train
in interface WordSegmenter
train
public void train(Tree tree)
- Specified by:
train
in interface WordSegmenter
train
public void train(java.util.List<TaggedWord> sentence)
- Specified by:
train
in interface WordSegmenter
finishTraining
public void finishTraining()
- Specified by:
finishTraining
in interface WordSegmenter
segment
public java.util.List<HasWord> segment(java.lang.String s)
- Specified by:
segment
in interface WordSegmenter
loadSegmenter
public void loadSegmenter(java.lang.String filename)
- Specified by:
loadSegmenter
in interface WordSegmenter
Stanford NLP Group