Package opennlp.tools.dictionary
Class Dictionary
- java.lang.Object
  - opennlp.tools.dictionary.Dictionary
- All Implemented Interfaces: Iterable<StringList>, SerializableArtifact
public class Dictionary extends Object implements Iterable<StringList>, SerializableArtifact
This class is a dictionary of entries, where each entry is a StringList of one or more tokens.
-
-
Constructor Summary
Constructors (constructor, then description where one is given):
- Dictionary(): Initializes an empty Dictionary.
- Dictionary(boolean caseSensitive)
- Dictionary(InputStream in): Initializes the Dictionary from an existing dictionary resource.
-
Method Summary
Methods (modifier and type, method, then description where one is given):
- Set<String> asStringSet(): Gets this dictionary as a Set<String>.
- boolean contains(StringList tokens): Checks if this dictionary has the given entry.
- boolean equals(Object obj)
- Class<?> getArtifactSerializerClass(): Gets the Serializer Class for Dictionary.
- int getMaxTokenCount()
- int getMinTokenCount()
- int hashCode()
- Iterator<StringList> iterator(): Retrieves an Iterator over all tokens.
- static Dictionary parseOneEntryPerLine(Reader in): Reads a dictionary which has one entry per line.
- void put(StringList tokens): Adds the tokens to the dictionary as one new entry.
- void remove(StringList tokens): Removes the given tokens from the current instance.
- void serialize(OutputStream out): Writes the current instance to the given OutputStream.
- int size(): Retrieves the number of tokens in the current instance.
- String toString()
-
Methods inherited from interface java.lang.Iterable
forEach, spliterator
-
Constructor Detail
-
Dictionary
public Dictionary()
Initializes an empty Dictionary.
-
Dictionary
public Dictionary(boolean caseSensitive)
-
Dictionary
public Dictionary(InputStream in) throws IOException
Initializes the Dictionary from an existing dictionary resource.
- Parameters:
in - InputStream
- Throws:
IOException
-
-
Method Detail
-
put
public void put(StringList tokens)
Adds the tokens to the dictionary as one new entry.
- Parameters:
tokens - the new entry
-
getMinTokenCount
public int getMinTokenCount()
- Returns:
minimum token count in the dictionary
-
getMaxTokenCount
public int getMaxTokenCount()
- Returns:
maximum token count in the dictionary
-
contains
public boolean contains(StringList tokens)
Checks if this dictionary has the given entry.
- Parameters:
tokens - query
- Returns:
true if it contains the entry, otherwise false
-
remove
public void remove(StringList tokens)
Removes the given tokens from the current instance.
- Parameters:
tokens - filter tokens
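The put, contains and remove methods above can be exercised together. The following is a minimal sketch, assuming the opennlp-tools library is on the classpath and that StringList (from opennlp.tools.util) offers a varargs String constructor; the class name DictionaryBasicsSketch and its helper method are hypothetical and exist only for illustration.

```java
import opennlp.tools.dictionary.Dictionary;
import opennlp.tools.util.StringList;

public class DictionaryBasicsSketch {

    // Adds two entries, then checks one entry before and after removing it.
    // Returns {presentBeforeRemove, presentAfterRemove}.
    static boolean[] putContainsRemove() {
        // Assumption: passing true requests case-sensitive matching.
        Dictionary dictionary = new Dictionary(true);

        // Each StringList is one entry; an entry may span several tokens.
        StringList newYork = new StringList("New", "York");
        dictionary.put(newYork);
        dictionary.put(new StringList("Boston"));

        boolean presentBefore = dictionary.contains(newYork);
        dictionary.remove(newYork);
        boolean presentAfter = dictionary.contains(newYork);

        return new boolean[] {presentBefore, presentAfter};
    }

    public static void main(String[] args) {
        boolean[] result = putContainsRemove();
        System.out.println("contains before remove: " + result[0]);
        System.out.println("contains after remove:  " + result[1]);
    }
}
```

The same StringList instance is reused for the lookup and the removal so that the example does not depend on how case is handled.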
-
iterator
public Iterator<StringList> iterator()
Retrieves an Iterator over all tokens.
- Specified by:
iterator in interface Iterable<StringList>
- Returns:
token-Iterator
-
size
public int size()
Retrieves the number of tokens in the current instance.
- Returns:
number of tokens
-
serialize
public void serialize(OutputStream out) throws IOException
Writes the current instance to the given OutputStream.
- Parameters:
out - OutputStream
- Throws:
IOException
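Since serialize(OutputStream) writes a dictionary and the Dictionary(InputStream) constructor reads one from an existing resource, the two can plausibly be paired for a round trip. The sketch below assumes that the constructor accepts exactly what serialize produces and that opennlp-tools is on the classpath; the class DictionaryRoundTripSketch and its helper are hypothetical names.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

import opennlp.tools.dictionary.Dictionary;
import opennlp.tools.util.StringList;

public class DictionaryRoundTripSketch {

    // Serializes a one-entry dictionary to memory, reads it back,
    // and reports whether the entry is present in the copy.
    static boolean survivesRoundTrip() {
        StringList entry = new StringList("San", "Francisco");
        Dictionary original = new Dictionary();
        original.put(entry);

        try {
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            original.serialize(out);

            // Assumption: the constructor reads the format written by serialize.
            try (ByteArrayInputStream in = new ByteArrayInputStream(out.toByteArray())) {
                Dictionary copy = new Dictionary(in);
                return copy.contains(entry);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println("entry survives round trip: " + survivesRoundTrip());
    }
}
```

An in-memory stream keeps the example free of file paths; a FileOutputStream and FileInputStream pair would follow the same pattern.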
-
parseOneEntryPerLine
public static Dictionary parseOneEntryPerLine(Reader in) throws IOException
Reads a dictionary which has one entry per line. The tokens inside an entry are whitespace delimited.
- Parameters:
in - Reader
- Returns:
the parsed dictionary
- Throws:
IOException
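The one-entry-per-line format described above can be illustrated with an in-memory Reader. This is a minimal sketch, assuming opennlp-tools is on the classpath and that tokens are separated by single spaces; the class OneEntryPerLineSketch and its parse helper are hypothetical names.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;

import opennlp.tools.dictionary.Dictionary;
import opennlp.tools.util.StringList;

public class OneEntryPerLineSketch {

    // Parses text in which every line is one entry and
    // the tokens of an entry are separated by spaces.
    static Dictionary parse(String text) {
        try (StringReader reader = new StringReader(text)) {
            return Dictionary.parseOneEntryPerLine(reader);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // Two lines: a two-token entry followed by a one-token entry.
        Dictionary dictionary = parse("New York\nBoston\n");

        System.out.println(dictionary.contains(new StringList("New", "York")));
        System.out.println(dictionary.contains(new StringList("Boston")));
    }
}
```

For a dictionary stored on disk, a FileReader or Files.newBufferedReader result could be passed in place of the StringReader.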
-
asStringSet
public Set<String> asStringSet()
Gets this dictionary as a Set<String>. Only the iterator(), size() and contains(Object) methods are implemented. If the entries of this dictionary consist of multiple tokens, only the first token of each entry will be part of the Set.
- Returns:
a Set containing the entries of this dictionary
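A short sketch of the Set view follows, restricted to the three methods the description lists as implemented. It assumes opennlp-tools is on the classpath and deliberately uses single-token entries only, so it does not rely on how multi-token entries are exposed; the class StringSetViewSketch and its helper are hypothetical names.

```java
import java.util.Set;

import opennlp.tools.dictionary.Dictionary;
import opennlp.tools.util.StringList;

public class StringSetViewSketch {

    // Builds a dictionary of single-token entries and returns its Set<String> view.
    static Set<String> singleTokenView() {
        // Assumption: passing true requests case-sensitive matching.
        Dictionary dictionary = new Dictionary(true);
        dictionary.put(new StringList("Boston"));
        dictionary.put(new StringList("Berlin"));
        return dictionary.asStringSet();
    }

    public static void main(String[] args) {
        Set<String> view = singleTokenView();

        // Only iterator(), size() and contains(Object) are documented as
        // implemented, so the example restricts itself to those.
        System.out.println("size: " + view.size());
        System.out.println("contains Boston: " + view.contains("Boston"));
        for (String token : view) {
            System.out.println("entry: " + token);
        }
    }
}
```

Mutating calls such as add or remove are avoided here because the description does not list them among the implemented methods.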
-
getArtifactSerializerClass
public Class<?> getArtifactSerializerClass()
Gets the Serializer Class for Dictionary.
- Specified by:
getArtifactSerializerClass in interface SerializableArtifact
- Returns:
DictionarySerializer
-
-