public class DefaultSDContextGenerator extends Object implements SDContextGenerator
Constructor and Description |
---|
DefaultSDContextGenerator(char[] eosCharacters)
Creates a new
SDContextGenerator instance with
no induced abbreviations. |
DefaultSDContextGenerator(Set<String> inducedAbbreviations,
char[] eosCharacters)
Creates a new
SDContextGenerator instance which uses
the set of induced abbreviations. |
Modifier and Type | Method and Description |
---|---|
String[] |
getContext(CharSequence sb,
int position)
Returns an array of contextual features for the potential sentence boundary at the
specified position within the specified string buffer.
|
public DefaultSDContextGenerator(char[] eosCharacters)
SDContextGenerator
instance with
no induced abbreviations.eosCharacters
- public DefaultSDContextGenerator(Set<String> inducedAbbreviations, char[] eosCharacters)
SDContextGenerator
instance which uses
the set of induced abbreviations.inducedAbbreviations
- a Set
of Strings
representing induced abbreviations in the training data.
Example: "Mr."eosCharacters
- public String[] getContext(CharSequence sb, int position)
SDContextGenerator
getContext
in interface SDContextGenerator
sb
- The String
for which sentences are being determined.position
- An index into the specified string buffer when a sentence boundary may occur.Copyright © 2015 The Apache Software Foundation. All rights reserved.