Package opennlp.tools.util.normalizer
-
Interface Summary Interface Description CharSequenceNormalizer A char sequence normalizer, used to adjusting (prune, substitute, add, etc.) characters in order to remove noise from text -
Class Summary Class Description AggregateCharSequenceNormalizer ACharSequenceNormalizerimplementation that aggregates the functionality of other normalizers.EmojiCharSequenceNormalizer AEmojiCharSequenceNormalizerimplementation that normalizes text in terms of emojis.NumberCharSequenceNormalizer ANumberCharSequenceNormalizerimplementation that normalizes text in terms of numbers.ShrinkCharSequenceNormalizer AShrinkCharSequenceNormalizerimplementation that shrinks repeated spaces / chars in text.TwitterCharSequenceNormalizer ATwitterCharSequenceNormalizerimplementation that normalizes text in terms of Twitter character patterns.UrlCharSequenceNormalizer AUrlCharSequenceNormalizerimplementation that normalizes text in terms of URls and email addresses.