Class TopWikipediaSessions
- java.lang.Object
-
- org.apache.beam.examples.complete.TopWikipediaSessions
-
public class TopWikipediaSessions extends java.lang.ObjectAn example that reads Wikipedia edit data from Cloud Storage and computes the user with the longest string of edits separated by no more than an hour within each month.Concepts: Using Windowing to perform time-based aggregations of data.
It is not recommended to execute this pipeline locally, given the size of the default input data.
To execute this pipeline using a selected runner and an output prefix on GCS, specify:
See examples/java/README.md for instructions about how to configure different runners.--runner=YOUR_SELECTED_RUNNER --output=gs://YOUR_OUTPUT_PREFIXThe default input is
gs://apache-beam-samples/wikipedia_edits/*.jsonand can be overridden with--input.
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description static interfaceTopWikipediaSessions.OptionsOptions supported by this class.
-
Constructor Summary
Constructors Constructor Description TopWikipediaSessions()
-
Method Summary
All Methods Static Methods Concrete Methods Modifier and Type Method Description static voidmain(java.lang.String[] args)static voidrun(TopWikipediaSessions.Options options)
-
-
-
Method Detail
-
run
public static void run(TopWikipediaSessions.Options options)
-
main
public static void main(java.lang.String[] args)
-
-