Class PythonDataframeWordCount
- java.lang.Object
  - org.apache.beam.examples.multilanguage.PythonDataframeWordCount
public class PythonDataframeWordCount extends java.lang.Object

An example that counts words in Shakespeare and uses a Python external transform.

This class, PythonDataframeWordCount, uses the Python DataframeTransform to count words from the input text file.

The example command below shows how to run this pipeline on the Dataflow runner:

./gradlew :examples:multi-language:pythonDataframeWordCount --args=" \
  --runner=DataflowRunner \
  --output=gs://{$OUTPUT_BUCKET}/count \
  --sdkHarnessContainerImageOverrides=.*python.*,gcr.io/apache-beam-testing/beam-sdk/beam_python{$PYTHON_VERSION}_sdk:latest"
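The --sdkHarnessContainerImageOverrides value in the command above pairs a regular expression with a replacement container image, separated by a comma. The sketch below illustrates that pairing in plain Java. It is not the runner's code: the class name ImageOverrideSketch, the helper applyOverride, the whole-name matching rule, the sample image names, and the 3.9 substituted for {$PYTHON_VERSION} are all assumptions made for illustration.

```java
import java.util.regex.Pattern;

public class ImageOverrideSketch {

    // Hypothetical helper: splits "regex,replacement" at the first comma and
    // returns the replacement when the regex matches the whole image name.
    // Whole-name matching is an assumption; the runner's actual rule may differ.
    static String applyOverride(String image, String override) {
        int comma = override.indexOf(',');
        String regex = override.substring(0, comma);
        String replacement = override.substring(comma + 1);
        return Pattern.matches(regex, image) ? replacement : image;
    }

    public static void main(String[] args) {
        // Same shape as the flag value in the command, with an assumed Python version.
        String override =
            ".*python.*,gcr.io/apache-beam-testing/beam-sdk/beam_python3.9_sdk:latest";

        // Hypothetical default image names, used only to exercise the helper.
        System.out.println(applyOverride("apache/beam_python3.9_sdk:2.50.0", override));
        System.out.println(applyOverride("apache/beam_java11_sdk:2.50.0", override));
    }
}
```

Under these assumptions, only image names containing "python" are swapped, so the Java SDK harness image is left untouched.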
Nested Class Summary
Nested Classes
Modifier and Type | Class | Description
static class | PythonDataframeWordCount.FormatAsTextFn | A SimpleFunction that converts a counted row into a printable string.
static interface | PythonDataframeWordCount.WordCountOptions | Options supported by PythonDataframeWordCount.
Field Summary
Fields
Modifier and Type | Field | Description
static java.lang.String | TOKENIZER_PATTERN |
Constructor Summary
Constructors
Constructor | Description
PythonDataframeWordCount() |
Method Summary
Methods
Modifier and Type | Method | Description
static void | main(java.lang.String[] args) |
Field Detail
TOKENIZER_PATTERN
public static final java.lang.String TOKENIZER_PATTERN
- See Also:
- Constant Field Values
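As a rough illustration of how a tokenizer pattern such as TOKENIZER_PATTERN can drive a word count, the plain-Java sketch below splits a line on a regular expression, counts the resulting words, and formats each count as a printable string. The pattern value "[^\\p{L}]+" is an assumption (the actual value is listed under Constant Field Values), and the class TokenizerSketch with its helpers countWords and format is hypothetical; none of this is the example's pipeline code, which performs the counting in a Python DataframeTransform.

```java
import java.util.Map;
import java.util.TreeMap;

public class TokenizerSketch {

    // Assumed value: one or more characters that are not Unicode letters.
    // Check "Constant Field Values" for the real constant.
    static final String TOKENIZER_PATTERN = "[^\\p{L}]+";

    // Hypothetical helper: splits a line on the pattern and counts each
    // non-empty token. A TreeMap keeps the words in sorted order.
    static Map<String, Long> countWords(String line) {
        Map<String, Long> counts = new TreeMap<>();
        for (String word : line.split(TOKENIZER_PATTERN, -1)) {
            if (!word.isEmpty()) {
                counts.merge(word, 1L, Long::sum);
            }
        }
        return counts;
    }

    // Hypothetical formatter, in the spirit of FormatAsTextFn: "word: count".
    static String format(String word, long count) {
        return word + ": " + count;
    }

    public static void main(String[] args) {
        countWords("To be, or not to be")
            .forEach((word, count) -> System.out.println(format(word, count)));
    }
}
```

Splitting on non-letter runs means punctuation and whitespace both act as separators, and matching is case-sensitive, so "To" and "to" are counted as different words.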