java.lang.Object
  edu.stanford.nlp.tagger.maxent.ExtractorFramesRare
public class ExtractorFramesRare
Maintains an array of ExtractorFrames for rare words. This file also defines all the rare word extractors as non-public classes.
Field Summary | | |
---|---|---|
static Extractor | cAllCap | "1" if token has no lower case letters |
static Extractor | cAllCapitalized | "1" if token has only upper case letters |
static Extractor | cCapDist | Distance to lowercase word |
static Extractor | cCompany | "1" if capitalized and one of following 3 words is Inc., Co., or Corp. |
static Extractor | cLetterDigitDash | "1" if word contains letter, digit, and dash, in any position and case |
static Extractor | cMidSentence | "1" if not first word of sentence and _some_ letter is uppercase |
static int[] | countFeats | |
static Extractor | cPluralAcronym | "1" if a plural acronym: capital letters followed by 's' |
static Extractor | cUCaseLCase | "0" if first letter isn't uppercase or if lowercase version isn't in dictionary. |
static Extractor | cUpperDigitDash | "1" if word contains uppercase letter, digit, and dash |
static Extractor | cWordDash | "1" iff word contains 1 or more dash characters (somewhere) |
static Extractor | cWordMidUCase | "0" if first word of sentence, or first letter is not uppercase, or lowercase version isn't in dictionary. |
static Extractor | cWordNumber | "1" iff word contains 1 or more digit characters (somewhere) |
static Extractor | cWordPref1 | First 1-4 characters of word (description shared by cWordPref1 through cWordPref4) |
static Extractor | cWordPref2 | |
static Extractor | cWordPref3 | |
static Extractor | cWordPref4 | |
static Extractor | cWordStartUCase | "0" if not first word of sentence, or not upper case, or lowercased version not in dictionary. |
static Extractor | cWordSuff1 | Last 1-4 characters of word (description shared by cWordSuff1 through cWordSuff4) |
static Extractor | cWordSuff2 | |
static Extractor | cWordSuff3 | |
static Extractor | cWordSuff4 | |
static Extractor | cWordUppCase | "1" iff word contains 1 or more upper case characters (somewhere) |
static Extractor[] | eFrames | |
static int[] | nums | |
static int | size | |
static java.lang.String | zeroSt | |
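
The word-shape descriptions in the field summary can be illustrated with a small standalone sketch. The class `RareWordShapeSketch` and all of its method names are hypothetical and are not part of the tagger; the sketch also assumes that a failed test yields "0" and that a word shorter than the requested prefix or suffix length is returned whole, neither of which is stated on this page.

```java
/** Hypothetical sketch of word-shape features; not the library's Extractor classes. */
final class RareWordShapeSketch {

    // Assumption: "0" is the value emitted when a test does not hold.
    private static String flag(boolean condition) {
        return condition ? "1" : "0";
    }

    /** "1" if the token has no lower case letters (compare cAllCap). */
    static String noLowerCase(String w) {
        return flag(w.chars().noneMatch(Character::isLowerCase));
    }

    /** "1" if the token consists only of upper case letters (compare cAllCapitalized). */
    static String onlyUpperCase(String w) {
        return flag(!w.isEmpty() && w.chars().allMatch(Character::isUpperCase));
    }

    /** "1" if the word contains at least one dash (compare cWordDash). */
    static String hasDash(String w) {
        return flag(w.indexOf('-') >= 0);
    }

    /** "1" if the word contains at least one digit (compare cWordNumber). */
    static String hasDigit(String w) {
        return flag(w.chars().anyMatch(Character::isDigit));
    }

    /** "1" if the word contains a letter, a digit, and a dash (compare cLetterDigitDash). */
    static String letterDigitDash(String w) {
        boolean letter = w.chars().anyMatch(Character::isLetter);
        boolean digit = w.chars().anyMatch(Character::isDigit);
        return flag(letter && digit && w.indexOf('-') >= 0);
    }

    /** "1" if capital letters are followed by a single 's' (compare cPluralAcronym). */
    static String pluralAcronym(String w) {
        return flag(w.matches("[A-Z]+s"));
    }

    /** First n characters; the whole word if it is shorter (assumption). */
    static String prefix(String w, int n) {
        return w.substring(0, Math.min(n, w.length()));
    }

    /** Last n characters; the whole word if it is shorter (assumption). */
    static String suffix(String w, int n) {
        return w.substring(Math.max(0, w.length() - n));
    }

    public static void main(String[] args) {
        System.out.println(noLowerCase("IBM-360"));     // 1
        System.out.println(onlyUpperCase("IBM-360"));   // 0
        System.out.println(letterDigitDash("IBM-360")); // 1
        System.out.println(pluralAcronym("CDs"));       // 1
        System.out.println(prefix("running", 3));       // run
        System.out.println(suffix("running", 3));       // ing
    }
}
```

Returning strings rather than booleans mirrors the "1"/"0" values quoted in the descriptions, so each result could serve directly as a feature value.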
Constructor Summary |
---|
ExtractorFramesRare() |
Method Summary | | |
---|---|---|
static void | addConjunctions() | Adds conjunctions of various rare-word features. |
static void | addFeatures() | Expands the extractors list. |
static void | addLCTagFeatures() | |
static void | clearEmpty() | |
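
This page does not say how addConjunctions() combines features, so the following is only a generic sketch of what a feature conjunction can look like: two feature functions are paired into one whose value records both outputs. The class `ConjunctionSketch`, the `conjoin` helper, and the "|" separator are all hypothetical and are not taken from the tagger.

```java
import java.util.function.Function;

/** Hypothetical sketch of a feature conjunction; not the tagger's implementation. */
final class ConjunctionSketch {

    /** "1" if the word contains a dash, else "0" (the "0" value is an assumption). */
    static final Function<String, String> HAS_DASH =
            w -> w.indexOf('-') >= 0 ? "1" : "0";

    /** "1" if the word contains a digit, else "0" (the "0" value is an assumption). */
    static final Function<String, String> HAS_DIGIT =
            w -> w.chars().anyMatch(Character::isDigit) ? "1" : "0";

    /** Builds a feature whose value joins the values of both inputs. */
    static Function<String, String> conjoin(Function<String, String> a,
                                            Function<String, String> b) {
        return w -> a.apply(w) + "|" + b.apply(w);
    }

    public static void main(String[] args) {
        Function<String, String> dashAndDigit = conjoin(HAS_DASH, HAS_DIGIT);
        System.out.println(dashAndDigit.apply("F-16"));       // 1|1
        System.out.println(dashAndDigit.apply("well-known")); // 1|0
    }
}
```

A conjoined value such as "1|1" lets a model weight the combination separately from either feature alone.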
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public static Extractor cWordSuff1
public static Extractor cWordSuff2
public static Extractor cWordSuff3
public static Extractor cWordSuff4
public static Extractor cWordPref1
public static Extractor cWordPref2
public static Extractor cWordPref3
public static Extractor cWordPref4
public static Extractor cWordUppCase
public static Extractor cWordNumber
public static Extractor cWordDash
public static Extractor cWordStartUCase
public static Extractor cWordMidUCase
public static Extractor cUCaseLCase
public static Extractor cMidSentence
public static Extractor cAllCap
public static Extractor cAllCapitalized
public static Extractor cCompany
public static Extractor cPluralAcronym
public static Extractor cLetterDigitDash
public static Extractor cUpperDigitDash
public static Extractor cCapDist
public static Extractor[] eFrames
public static int size
public static int[] countFeats
public static int[] nums
public static java.lang.String zeroSt
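
Some of the fields above, such as cCompany and cMidSentence, are described in terms of sentence context rather than the token alone. The sketch below illustrates those two descriptions on a plain list of tokens. The class `ContextFeatureSketch` is hypothetical; it assumes "capitalized" means the first character is upper case, that matching against Inc., Co., and Corp. is exact, and that "0" is returned otherwise.

```java
import java.util.List;
import java.util.Set;

/** Hypothetical sketch of context-dependent features; not the tagger's classes. */
final class ContextFeatureSketch {

    private static final Set<String> COMPANY_MARKERS = Set.of("Inc.", "Co.", "Corp.");

    /** "1" if token i is capitalized and one of the next 3 tokens is a company marker. */
    static String company(List<String> sentence, int i) {
        String w = sentence.get(i);
        // Assumption: "capitalized" means the first character is upper case.
        if (w.isEmpty() || !Character.isUpperCase(w.charAt(0))) {
            return "0";
        }
        int end = Math.min(sentence.size(), i + 4); // look at positions i+1 .. i+3
        for (int j = i + 1; j < end; j++) {
            if (COMPANY_MARKERS.contains(sentence.get(j))) {
                return "1";
            }
        }
        return "0";
    }

    /** "1" if token i is not the first word and contains some upper case letter. */
    static String midSentence(List<String> sentence, int i) {
        boolean anyUpper = sentence.get(i).chars().anyMatch(Character::isUpperCase);
        return (i > 0 && anyUpper) ? "1" : "0";
    }

    public static void main(String[] args) {
        List<String> sentence = List.of("Acme", "Widget", "Co.", "said");
        System.out.println(company(sentence, 0));     // 1
        System.out.println(company(sentence, 3));     // 0
        System.out.println(midSentence(sentence, 0)); // 0
        System.out.println(midSentence(sentence, 1)); // 1
    }
}
```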
Constructor Detail |
---|
public ExtractorFramesRare()
Method Detail |
---|
public static void addFeatures()
public static void addConjunctions()
public static void addLCTagFeatures()
public static void clearEmpty()