Class PythonDataframeWordCount


  • public class PythonDataframeWordCount
    extends java.lang.Object
    An example that counts words in Shakespeare and utilizes a Python external transform.

    This class, PythonDataframeWordCount, uses Python DataframeTransform to count words from the input text file.

    The example command below shows how to run this pipeline on Dataflow runner:

    
     ./gradlew :examples:multi-language:pythonDataframeWordCount --args=" \
     --runner=DataflowRunner \
     --output=gs://{$OUTPUT_BUCKET}/count \
     --sdkHarnessContainerImageOverrides=.*python.*,gcr.io/apache-beam-testing/beam-sdk/beam_python{$PYTHON_VERSION}_sdk:latest"
     
    • Field Detail

      • TOKENIZER_PATTERN

        public static final java.lang.String TOKENIZER_PATTERN
        See Also:
        Constant Field Values
    • Constructor Detail

      • PythonDataframeWordCount

        public PythonDataframeWordCount()
    • Method Detail

      • main

        public static void main​(java.lang.String[] args)