This page gives instructions to run the bilingual NER experiments described in papers listed on the Chinese NER page.
To carry out the up-training experiments, use the bilingual NER model described below to label unannotated bitext, and include the tagged output as additional training data to retrain the CRF NER tagger.
$JAVANLP_HOME
to point to the javanlp
directory.
-writePosteriors
to true
to get alignment posterior probabilities. Store the resulting alignment file for test portion as test.align
A O European I-LOC official O in O the O Egyptian I-LOC capital I-LOC
en.ser.gz
and the Chinese model cn.ser.gz
.
$LP_HOME
to point to lp_solve_5.5
directory.
java -cp $JAVANLP_HOME/projects/core/classes:$JAVANLP_HOME/projects/core/lib/*
edu.stanford.nlp.ie.crf.CRFClassifier -testFile en.test -loadClassifier en.ser.gz -printProbs > en.test.probsjava -cp $JAVANLP_HOME/projects/core/classes:$JAVANLP_HOME/projects/core/lib/*
edu.stanford.nlp.ie.crf.CRFClassifier -testFile cn.test -loadClassifier cn.ser.gz -printProbs > cn.test.probs
export PYTHONPATH=$PYTHONPATH:$LP_HOME/extra/Python/build/lib.linux-x86_64-2.6/ export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$LP_HOME/lib/liblpsolve55.so python ilp-soft.py cn.test.probs en.test.probs test.align autostat.penalty > cn.test.out 2> en.test.out
cn.test.out
using conlleval
javanlp/more
from here.
java -cp $JAVANLP_HOME/projects/core/classes:$JAVANLP_HOME/projects/core/lib/*:$JAVANLP_HOME/projects/more/classes
edu.stanford.nlp.ie.crf.BisequenceCRFClassifier -prop gibbs.prop
(NOTE: for BIO tagging, useautostat.penalty
)
cn.test.out
using conlleval
java -cp $JAVANLP_HOME/projects/core/classes:$JAVANLP_HOME/projects/core/lib/*:$JAVANLP_HOME/projects/more/classes
edu.stanford.nlp.ie.crf.BilingualCRFClassifier -prop cl-proj-unsup.propjava -cp $JAVANLP_HOME/projects/core/classes:$JAVANLP_HOME/projects/core/lib/*
edu.stanford.nlp.ie.crf.CRFClassifier -testFile cn.test -loadClassifier cn.bilingual.ser.gz > cn.test.out
java -cp $JAVANLP_HOME/projects/core/classes:$JAVANLP_HOME/projects/core/lib/*:$JAVANLP_HOME/projects/more/classes
edu.stanford.nlp.ie.crf.BilingualCRFClassifier -prop cl-proj-semisup.propjava -cp $JAVANLP_HOME/projects/core/classes:$JAVANLP_HOME/projects/core/lib/*
edu.stanford.nlp.ie.crf.CRFClassifier -testFile cn.test -loadClassifier cn.bilingual.ser.gz > cn.test.out- Evaluate
cn.test.out
usingconlleval