Given a filename and a collection, performs a number of tests on the
file, and if the file passes all of the tests, adds a java.io.File
java.io.File object corresponding to the filename to the collection.
Constructs a new string from the given arg string as follows:
capitalizes the second character and inserts the prefix string
between the first and second characters.
beginLine and beginColumn describe the position of the first character
of this token; endLine and endColumn describe the position of the
last character of this token.
beginLine and beginColumn describe the position of the first character
of this token; endLine and endColumn describe the position of the
last character of this token.
A ContainsFilter is a java.io.FileFilter that either: (1) accepts
files that contain a certain string or (2) accepts files that do
not contain a certain string.
beginLine and beginColumn describe the position of the first character
of this token; endLine and endColumn describe the position of the
last character of this token.
beginLine and beginColumn describe the position of the first character
of this token; endLine and endColumn describe the position of the
last character of this token.
Extracts the argument that matches the given String and the following
argument, converts the second extracted argument into a parameter list,
and returns the parameter list.
A TwoLevelOrderer that combines the first level score with the second
level scores according to the following formula:
firstLevelScore / Sum_c_in_categories (seconedLevelScore_c)
This orderer ignores its parameter.
A TwoLevelOrderer that combines the first level score with the second
level scores according to the following formula:
<>
This orderer uses its double parameter in its calculation.
A SimpleOrderer that orders according to the following formula:
score (w) = sum_c_in_C_and_w_in_W [p(c,w) * log [p(c,w) / p(c) / p(w)]]
where C is the set of categories and V is {^w,w}.
A SimpleOrderer that orders according to the following formula:
score (w) =
-1 * sum_c_in_C_and_w_in_W [p(c,w) * log [p(c,w) / p(c) / p(w)]]
where C is the set of categories and V is {w,^w}.
Given corpus counters for a data set and a set of potential features,
orders the potential features according to how useful they will be during
classification.
A parser that parses the following language:
termPlusEOF ::=
termList ::= ()+
Term ::= [~ \t\n\r]+
Term ::= "(" ")"
To parse, call termPlusEOF (), which returns a parameter.
A SimpleOrderer that orders according to the following formula:
score (w) = max_c_in_C_and_w_in_W [log [p(c,w)/p(c)/p(w)]]
where C is the set of categories and W is {^w, w}.
The following Comparator orders TwoLevelScores in reverse order
of their first level scores (not their combined scores as the comparator
in DoubleWrap would).