public abstract class AbstractDataset extends Object implements Dataset
| Modifier and Type | Class and Description |
|---|---|
protected static class |
AbstractDataset.SplitFilter |
Dataset.Encoding| Modifier and Type | Field and Description |
|---|---|
protected boolean |
addDeterminer |
protected boolean |
addRoot |
protected Set<String> |
configuredOptions |
protected TreeVisitor |
customTreeVisitor |
protected Dataset.Encoding |
encoding |
protected Pattern |
fileNameNormalizer |
protected String |
flatFileName |
protected String |
lexMapOptions |
protected Mapper |
lexMapper |
protected boolean |
makeFlatFile |
protected int |
maxLen |
protected String |
morphDelim |
protected Properties |
options
Provides access for sub-classes to the data set parameters
|
protected String |
outFileName |
protected List<String> |
outputFileList |
protected List<File> |
pathsToData |
protected List<File> |
pathsToMappings |
protected String |
posMapOptions |
protected Mapper |
posMapper |
protected boolean |
removeDashTags |
protected boolean |
removeEscapeTokens |
protected Set<String> |
requiredOptions |
protected FileFilter |
splitFilter |
protected StringBuilder |
toStringBuffer |
protected Treebank |
treebank |
protected String |
treeFileExtension |
| Constructor and Description |
|---|
AbstractDataset() |
| Modifier and Type | Method and Description |
|---|---|
abstract void |
build()
Generic method for loading, processing, and writing a dataset.
|
protected Set<String> |
buildSplitMap(String path) |
List<String> |
getFilenames()
Returns the filenames written by
Dataset.build(). |
boolean |
setOptions(Properties opts)
Sets options for a dataset.
|
String |
toString() |
protected Mapper posMapper
protected String posMapOptions
protected Mapper lexMapper
protected String lexMapOptions
protected Dataset.Encoding encoding
protected FileFilter splitFilter
protected boolean addDeterminer
protected boolean removeDashTags
protected boolean addRoot
protected boolean removeEscapeTokens
protected int maxLen
protected String morphDelim
protected TreeVisitor customTreeVisitor
protected String outFileName
protected String flatFileName
protected boolean makeFlatFile
protected final Pattern fileNameNormalizer
protected Treebank treebank
protected final StringBuilder toStringBuffer
protected String treeFileExtension
protected Properties options
public abstract void build()
Datasetpublic boolean setOptions(Properties opts)
DatasetsetOptions in interface Datasetopts - A map from parameter types defined in ConfigParser to
valuespublic List<String> getFilenames()
DatasetDataset.build().getFilenames in interface Dataset