public class PunctuationSeparator extends java.lang.Object implements TextProcessor
PunctuationSeparator separates the given punctuation marks into distinct tokens in
the text.

| Constructor and Description |
|---|
| PunctuationSeparator()<br>Creates a TextProcessor that separates punctuation marks into distinct tokens in the text. |
| PunctuationSeparator(java.lang.String punctuations)<br>Creates a TextProcessor that separates the given punctuation marks into distinct tokens in the text. |

| Modifier and Type | Method and Description |
|---|---|
| java.util.List<java.lang.String> | preprocess(java.util.List<java.lang.String> tokens)<br>Applies the preprocessing defined to the given input tokens. |

public PunctuationSeparator(java.lang.String punctuations)
Creates a TextProcessor that separates the given punctuation marks into distinct tokens in
the text.
Parameters: punctuations - the punctuations to be separated

public PunctuationSeparator()
Creates a TextProcessor that separates punctuation marks into distinct tokens in
the text.

public java.util.List<java.lang.String> preprocess(java.util.List<java.lang.String> tokens)
Applies the preprocessing defined to the given input tokens.
Specified by: preprocess in interface TextProcessor
Parameters: tokens - the tokens created after the input text is tokenized
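
The behaviour described above can be illustrated with a minimal, self-contained sketch. It is not the library's implementation: the `TextProcessor` interface below is a hypothetical stand-in that copies only the documented `preprocess` signature, `PunctuationSeparatorSketch` and `PunctuationSeparatorDemo` are illustrative names, and the default punctuation set `.,;:!?` is an assumption, since the documentation does not state which characters the no-argument constructor uses. The sketch also assumes that `punctuations` is read as a set of single characters.

```java
import java.util.ArrayList;
import java.util.List;

// Small demo entry point (hypothetical name, not part of the library).
class PunctuationSeparatorDemo {
    public static void main(String[] args) {
        TextProcessor separator = new PunctuationSeparatorSketch(",!");
        List<String> tokens = separator.preprocess(List.of("Hello,", "world!"));
        System.out.println(String.join(" ", tokens)); // prints "Hello , world !"
    }
}

// Hypothetical stand-in for the library's TextProcessor interface; only the
// preprocess signature is taken from the documentation.
interface TextProcessor {
    List<String> preprocess(List<String> tokens);
}

// Illustrative sketch only; NOT the library's PunctuationSeparator.
class PunctuationSeparatorSketch implements TextProcessor {
    // Assumed default set; the real default is not documented here.
    private static final String DEFAULT_PUNCTUATIONS = ".,;:!?";

    private final String punctuations;

    public PunctuationSeparatorSketch() {
        this(DEFAULT_PUNCTUATIONS);
    }

    public PunctuationSeparatorSketch(String punctuations) {
        this.punctuations = punctuations;
    }

    @Override
    public List<String> preprocess(List<String> tokens) {
        List<String> result = new ArrayList<>();
        for (String token : tokens) {
            StringBuilder current = new StringBuilder();
            for (int i = 0; i < token.length(); i++) {
                char c = token.charAt(i);
                if (punctuations.indexOf(c) >= 0) {
                    // Flush the text collected so far, then emit the
                    // punctuation character as its own token.
                    if (current.length() > 0) {
                        result.add(current.toString());
                        current.setLength(0);
                    }
                    result.add(String.valueOf(c));
                } else {
                    current.append(c);
                }
            }
            // Emit any text that follows the last punctuation character.
            if (current.length() > 0) {
                result.add(current.toString());
            }
        }
        return result;
    }
}
```

In this sketch, tokens that contain none of the configured characters pass through unchanged, and consecutive punctuation characters each become a separate token. Whether the real class behaves the same way in those edge cases is not stated in the documentation.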