Class TopWikipediaSessions


  • public class TopWikipediaSessions
    extends java.lang.Object
    An example that reads Wikipedia edit data from Cloud Storage and computes the user with the longest string of edits separated by no more than an hour within each month.

    Concepts: Using Windowing to perform time-based aggregations of data.

    It is not recommended to execute this pipeline locally, given the size of the default input data.

    To execute this pipeline using a selected runner and an output prefix on GCS, specify:

    
     --runner=YOUR_SELECTED_RUNNER
     --output=gs://YOUR_OUTPUT_PREFIX
     
    See examples/java/README.md for instructions about how to configure different runners.

    The default input is gs://apache-beam-samples/wikipedia_edits/*.json and can be overridden with --input.

    • Nested Class Summary

      Nested Classes 
      Modifier and Type Class Description
      static interface  TopWikipediaSessions.Options
      Options supported by this class.
    • Method Summary

      All Methods Static Methods Concrete Methods 
      Modifier and Type Method Description
      static void main​(java.lang.String[] args)  
      static void run​(TopWikipediaSessions.Options options)  
      • Methods inherited from class java.lang.Object

        clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
    • Constructor Detail

      • TopWikipediaSessions

        public TopWikipediaSessions()