Package opennlp.tools.dictionary
Class Dictionary
- java.lang.Object
-
- opennlp.tools.dictionary.Dictionary
-
- All Implemented Interfaces:
Iterable<StringList>,SerializableArtifact
public class Dictionary extends Object implements Iterable<StringList>, SerializableArtifact
This class is a dictionary.
-
-
Constructor Summary
Constructors Constructor Description Dictionary()Initializes an emptyDictionary.Dictionary(boolean caseSensitive)Dictionary(InputStream in)Initializes theDictionaryfrom an existing dictionary resource.
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description Set<String>asStringSet()Gets this dictionary as aSet<String>.booleancontains(StringList tokens)Checks if this dictionary has the given entry.booleanequals(Object obj)Class<?>getArtifactSerializerClass()Gets the Serializer Class forDictionaryintgetMaxTokenCount()intgetMinTokenCount()inthashCode()Iterator<StringList>iterator()Retrieves an Iterator over all tokens.static DictionaryparseOneEntryPerLine(Reader in)Reads a dictionary which has one entry per line.voidput(StringList tokens)Adds the tokens to the dictionary as one new entry.voidremove(StringList tokens)Removes the given tokens form the current instance.voidserialize(OutputStream out)Writes the current instance to the givenOutputStream.intsize()Retrieves the number of tokens in the current instance.StringtoString()-
Methods inherited from interface java.lang.Iterable
forEach, spliterator
-
-
-
-
Constructor Detail
-
Dictionary
public Dictionary()
Initializes an emptyDictionary.
-
Dictionary
public Dictionary(boolean caseSensitive)
-
Dictionary
public Dictionary(InputStream in) throws IOException
Initializes theDictionaryfrom an existing dictionary resource.- Parameters:
in-InputStream- Throws:
IOException
-
-
Method Detail
-
put
public void put(StringList tokens)
Adds the tokens to the dictionary as one new entry.- Parameters:
tokens- the new entry
-
getMinTokenCount
public int getMinTokenCount()
- Returns:
- minimum token count in the dictionary
-
getMaxTokenCount
public int getMaxTokenCount()
- Returns:
- maximum token count in the dictionary
-
contains
public boolean contains(StringList tokens)
Checks if this dictionary has the given entry.- Parameters:
tokens- query- Returns:
- true if it contains the entry otherwise false
-
remove
public void remove(StringList tokens)
Removes the given tokens form the current instance.- Parameters:
tokens- filter tokens
-
iterator
public Iterator<StringList> iterator()
Retrieves an Iterator over all tokens.- Specified by:
iteratorin interfaceIterable<StringList>- Returns:
- token-
Iterator
-
size
public int size()
Retrieves the number of tokens in the current instance.- Returns:
- number of tokens
-
serialize
public void serialize(OutputStream out) throws IOException
Writes the current instance to the givenOutputStream.- Parameters:
out-OutputStream- Throws:
IOException
-
parseOneEntryPerLine
public static Dictionary parseOneEntryPerLine(Reader in) throws IOException
Reads a dictionary which has one entry per line. The tokens inside an entry are whitespace delimited.- Parameters:
in-Reader- Returns:
- the parsed dictionary
- Throws:
IOException
-
asStringSet
public Set<String> asStringSet()
Gets this dictionary as aSet<String>. Onlyiterator(),size()andcontains(Object)methods are implemented. If this dictionary entries are multi tokens only the first token of the entry will be part of the Set.- Returns:
- a Set containing the entries of this dictionary
-
getArtifactSerializerClass
public Class<?> getArtifactSerializerClass()
Gets the Serializer Class forDictionary- Specified by:
getArtifactSerializerClassin interfaceSerializableArtifact- Returns:
DictionarySerializer
-
-