public class PythonDataframeWordCount
extends java.lang.Object
PythonDataframeWordCount is a multi-language example pipeline that counts the words in an
input text file by applying the Python DataframeTransform.
The example command below shows how to run this pipeline on the Dataflow runner:
./gradlew :examples:multi-language:pythonDataframeWordCount --args=" \
--runner=DataflowRunner \
--output=gs://${OUTPUT_BUCKET}/count \
--sdkHarnessContainerImageOverrides=.*python.*,gcr.io/apache-beam-testing/beam-sdk/beam_python${PYTHON_VERSION}_sdk:latest"
| Modifier and Type | Class and Description |
|---|---|
| static class | PythonDataframeWordCount.FormatAsTextFn: A SimpleFunction that converts a counted row into a printable string. |
| static interface | PythonDataframeWordCount.WordCountOptions: Options supported by PythonDataframeWordCount. |
| Modifier and Type | Field and Description |
|---|---|
| static java.lang.String | TOKENIZER_PATTERN |
| Constructor and Description |
|---|
| PythonDataframeWordCount() |
| Modifier and Type | Method and Description |
|---|---|
| static void | main(java.lang.String[] args) |
public static final java.lang.String TOKENIZER_PATTERN
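
As a rough illustration of how a tokenizer pattern and a text formatter fit into a word count, the plain-Java sketch below splits lines into words, tallies them, and renders each tally as a printable string. It does not use Beam or DataframeTransform. The class name WordCountSketch, the constant ASSUMED_TOKENIZER_PATTERN and its regex value, and the "word: count" output format are all assumptions made for this example, since this page shows neither the value of TOKENIZER_PATTERN nor the body of FormatAsTextFn.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

class WordCountSketch {
    // Assumption: words are separated by runs of non-letter characters.
    // The actual value of TOKENIZER_PATTERN is not shown on this page.
    static final String ASSUMED_TOKENIZER_PATTERN = "[^\\p{L}]+";

    // Splits every line on the assumed pattern and counts each non-empty token.
    static Map<String, Integer> countWords(List<String> lines) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String line : lines) {
            for (String word : line.split(ASSUMED_TOKENIZER_PATTERN, -1)) {
                if (!word.isEmpty()) {
                    counts.merge(word, 1, Integer::sum);
                }
            }
        }
        return counts;
    }

    // Hypothetical stand-in for a formatter that turns a counted word
    // into a printable string; the real output format may differ.
    static String formatAsText(String word, int count) {
        return word + ": " + count;
    }

    public static void main(String[] args) {
        List<String> lines = Arrays.asList("to be, or not to be", "that is the question");
        countWords(lines).forEach((word, count) ->
                System.out.println(formatAsText(word, count)));
    }
}
```

In the actual pipeline the counting step is delegated to the Python DataframeTransform rather than done in Java; the sketch only mirrors the tokenize, count, and format stages locally.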