public interface EndOfSentenceScanner
Implementations of this interface can use regular expressions, hand-coded DFAs, and other scanning techniques to locate end of sentence offsets.
Modifier and Type | Method and Description |
---|---|
char[] |
getEndOfSentenceCharacters()
Deprecated.
|
Set<Character> |
getEOSCharacters()
Returns a set of character which can indicate the end of a sentence.
|
List<Integer> |
getPositions(char[] cbuf)
The receiver scans `cbuf' for sentence ending characters and
returns their offsets.
|
List<Integer> |
getPositions(String s)
The receiver scans the specified string for sentence ending characters and
returns their offsets.
|
List<Integer> |
getPositions(StringBuffer buf)
The receiver scans `buf' for sentence ending characters and
returns their offsets.
|
@Deprecated char[] getEndOfSentenceCharacters()
Set<Character> getEOSCharacters()
List<Integer> getPositions(String s)
s
- a String
valueList
of Integer objects.List<Integer> getPositions(StringBuffer buf)
buf
- a StringBuffer
valueList
of Integer objects.Copyright © 2020 The Apache Software Foundation. All rights reserved.