Class WhitespaceTokenizer

java.lang.Object
org.tribuo.util.tokens.impl.SplitFunctionTokenizer
org.tribuo.util.tokens.impl.WhitespaceTokenizer
All Implemented Interfaces:
com.oracle.labs.mlrg.olcut.config.Configurable, com.oracle.labs.mlrg.olcut.provenance.Provenancable<com.oracle.labs.mlrg.olcut.provenance.ConfiguredObjectProvenance>, Cloneable, Tokenizer

A simple tokenizer that splits on whitespace. This tokenizer does not create tokens that correspond to whitespace - only those spans of text delimited by whitespace. For example, the text "a b" will result in two tokens "a" and "b".