public class DBWordListPreparerImpl extends AbstractInput
deleteTempFiles, inputReader, intSentenceNumbers, intSource, knownNumbers2, longTimestamp, MWU_FILE_NAME, objTokenizer, sentenceNumbers, STATUS, words, writeFrequency
BOOLEAN_DEFAULT, DOUBLE_DEFAULT, FORMAT_IMPL, INT_DEFAULT, LONG_DEFAULT, objConf, objConfCategory, objFormat, strBackupConfFile, STRING_DEFAULT
SENTENCE_COLUMN, SENTENCE_ID_COLUMN, SF_COMPLETE, SF_COMPLETE_WITHOUT_SENTENCE_NUMBER, SF_SIMPLE, SF_UNKNOWN, SF_UNSET, SF_WITH_SENTENCE_NUMBERS, SOURCE_COLUMN, SOURCE_NOT_SET, SOURCE_UNKNOWN, TIMESTAMP_COLUMN, TIMESTAMP_NOT_SET
Constructor and Description |
---|
DBWordListPreparerImpl()
Creates a new instance of DBWordListPreparerImpl
|
Modifier and Type | Method and Description |
---|---|
void |
init() |
static void |
main(java.lang.String[] args) |
void |
prepare() |
checkMapStatus, countWhitespaces, getMWUTokens, getTokens, getTokens, getWhitespacePositions, guessFormat, ignoreSentence, loadMWU, loadSourceMapping, loadTokenizer, makeWord, mergeFiles, processLine, startReading, throwNewInputException
config, getConfiguration, getGlobalProperty, getProperty, getProperty, getStatisticsProperty, loadFormatImpl, selfconfig, setGlobalProperty, setProperty, setProperty, setStatisticsProperty, setStatisticsProperty
public DBWordListPreparerImpl()
public void init() throws ConfigurationException
ConfigurationException
public void prepare() throws java.io.IOException, java.io.FileNotFoundException
java.io.IOException
java.io.FileNotFoundException
public static void main(java.lang.String[] args)
2005-2013 Marco Büchler, Natural Language Processing Group, University of Leipzig, Germany. 2013-2016 Marco Büchler, Georg August University Göttingen, Göttingen, Germany