Class BoundedDataset<T>

  • All Implemented Interfaces:
    java.io.Serializable, Dataset

    public class BoundedDataset<T>
    extends java.lang.Object
    implements Dataset
    Holds an RDD, or the values from which an RDD can be created later if one is needed. PCollections are sometimes created from a collection of objects (using RDD parallelize) and then used only to create View objects. In that case the values do not need to be converted to bytes, since they are not transferred across the network until they are broadcast.
    See Also:
    Serialized Form
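
    The deferred conversion described above can be pictured with a small, self-contained sketch. It does not use Spark or Beam and is not the runner's actual implementation: the names DeferredEncodingSketch and LazyEncoded are hypothetical, and a plain Function<T, byte[]> stands in for a Beam Coder. It only shows the general idea of keeping values in memory and paying the encoding cost the first time bytes are actually requested.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.function.Function;

/** Hypothetical illustration only; not the Beam Spark runner's code. */
public class DeferredEncodingSketch {

    /** Holds plain values and encodes them to bytes only on first request. */
    static final class LazyEncoded<T> {
        private final List<T> values;               // values kept in memory as objects
        private final Function<T, byte[]> encoder;  // assumption: stand-in for a Coder
        private List<byte[]> encoded;               // stays null until bytes are needed

        LazyEncoded(List<T> values, Function<T, byte[]> encoder) {
            this.values = Collections.unmodifiableList(new ArrayList<>(values));
            this.encoder = encoder;
        }

        /** Returns the in-memory values; never triggers encoding. */
        List<T> getValues() {
            return values;
        }

        /** Reports whether the byte form has been materialized yet. */
        boolean isEncoded() {
            return encoded != null;
        }

        /** Encodes every value on the first call and reuses the result afterwards. */
        List<byte[]> getEncoded() {
            if (encoded == null) {
                List<byte[]> out = new ArrayList<>(values.size());
                for (T value : values) {
                    out.add(encoder.apply(value));
                }
                encoded = out;
            }
            return encoded;
        }
    }

    public static void main(String[] args) {
        LazyEncoded<String> holder = new LazyEncoded<>(
                Arrays.asList("a", "bc"),
                s -> s.getBytes(StandardCharsets.UTF_8));

        System.out.println(holder.isEncoded());         // false: nothing encoded yet
        holder.getValues();                             // object access only
        System.out.println(holder.isEncoded());         // false: still no bytes
        System.out.println(holder.getEncoded().size()); // 2: encoding happens here
        System.out.println(holder.isEncoded());         // true
    }
}
```

    In this sketch a consumer that only reads getValues() never pays for serialization, which mirrors the case where a PCollection is used only to build View objects.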
    • Method Summary

      Modifier and Type Method Description
      void action()
      void cache(java.lang.String storageLevel, org.apache.beam.sdk.coders.Coder<?> coder)
      org.apache.spark.api.java.JavaRDD<org.apache.beam.sdk.util.WindowedValue<T>> getRDD()
      void setName(java.lang.String name)
      • Methods inherited from class java.lang.Object

        clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
    • Method Detail

      • getRDD

        public org.apache.spark.api.java.JavaRDD<org.apache.beam.sdk.util.WindowedValue<T>> getRDD()
      • cache

        public void cache(java.lang.String storageLevel,
                          org.apache.beam.sdk.coders.Coder<?> coder)
        Specified by:
        cache in interface Dataset
      • action

        public void action()
        Specified by:
        action in interface Dataset
      • setName

        public void setName(java.lang.String name)
        Specified by:
        setName in interface Dataset