edu.stanford.nlp.parser.lexparser
Class ChineseMarkovWordSegmenter

java.lang.Object
  extended by edu.stanford.nlp.parser.lexparser.ChineseMarkovWordSegmenter
All Implemented Interfaces:
WordSegmenter, java.io.Serializable

public class ChineseMarkovWordSegmenter
extends java.lang.Object
implements WordSegmenter

Performs word segmentation with a hierarchical markov model over POS and over characters given POS.

Author:
Galen Andrew
See Also:
Serialized Form

Constructor Summary
ChineseMarkovWordSegmenter(ChineseCharacterBasedLexicon lex, Index<java.lang.String> wordIndex, Index<java.lang.String> tagIndex)
           
ChineseMarkovWordSegmenter(ChineseTreebankParserParams params, Index<java.lang.String> wordIndex, Index<java.lang.String> tagIndex)
           
 
Method Summary
 void finishTraining()
           
 void initializeTraining(double numTrees)
           
 void loadSegmenter(java.lang.String filename)
           
 java.util.List<HasWord> segment(java.lang.String s)
           
 void train(java.util.Collection<Tree> trees)
           
 void train(java.util.List<TaggedWord> sentence)
           
 void train(Tree tree)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

ChineseMarkovWordSegmenter

public ChineseMarkovWordSegmenter(ChineseCharacterBasedLexicon lex,
                                  Index<java.lang.String> wordIndex,
                                  Index<java.lang.String> tagIndex)

ChineseMarkovWordSegmenter

public ChineseMarkovWordSegmenter(ChineseTreebankParserParams params,
                                  Index<java.lang.String> wordIndex,
                                  Index<java.lang.String> tagIndex)
Method Detail

initializeTraining

public void initializeTraining(double numTrees)
Specified by:
initializeTraining in interface WordSegmenter

train

public void train(java.util.Collection<Tree> trees)
Specified by:
train in interface WordSegmenter

train

public void train(Tree tree)
Specified by:
train in interface WordSegmenter

train

public void train(java.util.List<TaggedWord> sentence)
Specified by:
train in interface WordSegmenter

finishTraining

public void finishTraining()
Specified by:
finishTraining in interface WordSegmenter

segment

public java.util.List<HasWord> segment(java.lang.String s)
Specified by:
segment in interface WordSegmenter

loadSegmenter

public void loadSegmenter(java.lang.String filename)
Specified by:
loadSegmenter in interface WordSegmenter


Stanford NLP Group