public class CrimeanTatarWordTokenizer extends WordTokenizer

| Constructor and Description |
|---|
| CrimeanTatarWordTokenizer() |

| Modifier and Type | Method and Description |
|---|---|
| String | getTokenizingCharacters() |
| List<String> | tokenize(String text) Tokenizes text. |

Methods inherited from class WordTokenizer: getProtocols, isCurrencyExpression, isEMail, isUrl, joinEMails, joinEMailsAndUrls, joinUrls, replaceEmojis, restoreEmojis, splitCurrencyExpression

public String getTokenizingCharacters()
Overrides: getTokenizingCharacters in class WordTokenizer

public List<String> tokenize(String text)
Specified by: tokenize in interface Tokenizer
Overrides: tokenize in class WordTokenizer
Parameters: text - String of words to tokenize.
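
The summary above shows a subclass that overrides both getTokenizingCharacters() and tokenize(String). A minimal, self-contained sketch of that override pattern follows. It does not use the library: `TokenizerSketch`, `BaseTokenizer`, and `HyphenKeepingTokenizer` are hypothetical stand-ins, the delimiter set is assumed, and the hyphen rule is only an illustration, not the documented behaviour of CrimeanTatarWordTokenizer.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.StringTokenizer;

public class TokenizerSketch {

  // Hypothetical stand-in for a word tokenizer base class; not the library's code.
  static class BaseTokenizer {
    // Characters treated as token boundaries (assumed set, for illustration only).
    public String getTokenizingCharacters() {
      return " \t\n.,;:!?()-";
    }

    // Splits text at the tokenizing characters and keeps each delimiter as its own token.
    public List<String> tokenize(String text) {
      List<String> tokens = new ArrayList<>();
      StringTokenizer st = new StringTokenizer(text, getTokenizingCharacters(), true);
      while (st.hasMoreTokens()) {
        tokens.add(st.nextToken());
      }
      return tokens;
    }
  }

  // Hypothetical subclass: narrows the delimiter set so hyphenated words stay whole.
  static class HyphenKeepingTokenizer extends BaseTokenizer {
    @Override
    public String getTokenizingCharacters() {
      return super.getTokenizingCharacters().replace("-", "");
    }
  }

  public static void main(String[] args) {
    String text = "well-known word.";
    System.out.println(new BaseTokenizer().tokenize(text));
    System.out.println(new HyphenKeepingTokenizer().tokenize(text));
  }
}
```

In this sketch tokenize(String) calls getTokenizingCharacters(), so a subclass can change where tokens split by overriding that method alone. Whether the real class relies on the same mechanism is not stated in this page.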