Package opennlp.morfologik.builder
Class MorfologikDictionaryBuilder
- java.lang.Object
-
- opennlp.morfologik.builder.MorfologikDictionaryBuilder
-
public class MorfologikDictionaryBuilder extends Object
Utility class to build Morfologik dictionaries from a tab separated values file.The first column is the word, the second its lemma and the third a POS tag (base,inflected,tag). If there is no lemma information leave the second column empty.
-
-
Constructor Summary
Constructors Constructor Description MorfologikDictionaryBuilder()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description Path
build(Path input)
Helper to compile a morphological dictionary automaton using default parameters.Path
build(Path input, boolean overwrite, boolean validate, boolean acceptBom, boolean acceptCr, boolean ignoreEmpty)
Helper to compile a morphological dictionary automaton.
-
-
-
Method Detail
-
build
public Path build(Path input, boolean overwrite, boolean validate, boolean acceptBom, boolean acceptCr, boolean ignoreEmpty) throws Exception
Helper to compile a morphological dictionary automaton.- Parameters:
input
- Theinput file
(base,inflected,tag). An associated metadata (*.info
) file must exist.overwrite
- Whether to overwrite the output file if it exists, or not.validate
- Whether to validate input to make sure it makes sense.acceptBom
- Whether to accept leading BOM bytes (UTF-8), or not.acceptCr
- Whether to accept CR bytes in input sequences (\r
), or not.ignoreEmpty
- Whether to ignore empty lines in the input, or not.- Returns:
- The resulting dictionary
Path
. - Throws:
Exception
- Thrown if errors occurred during dictionary compilation.
-
build
public Path build(Path input) throws Exception
Helper to compile a morphological dictionary automaton using default parameters.- Parameters:
input
- Theinput file
(base,inflected,tag). An associated metadata (*.info
) file must exist.- Returns:
- The resulting dictionary
Path
. - Throws:
Exception
- Thrown if errors occurred during dictionary compilation.
-
-