public class SimpleTextEncodingVectorizer extends Object implements Vectorizer
Runs a Map/Reduce job that encodes FeatureVectorEncoder the
input and writes it to the output as a sequence file.
Only works on basic text, where the value in the SequenceFile is a blob of text.
| Constructor and Description |
|---|
SimpleTextEncodingVectorizer() |
| Modifier and Type | Method and Description |
|---|---|
void |
createVectors(org.apache.hadoop.fs.Path input,
org.apache.hadoop.fs.Path output,
VectorizerConfig config) |
public void createVectors(org.apache.hadoop.fs.Path input,
org.apache.hadoop.fs.Path output,
VectorizerConfig config)
throws IOException,
ClassNotFoundException,
InterruptedException
createVectors in interface VectorizerIOExceptionClassNotFoundExceptionInterruptedExceptionCopyright © 2008–2017 The Apache Software Foundation. All rights reserved.