Packages

c

org.apache.spark.sql.delta.commands

ClusteringStrategy

case class ClusteringStrategy(sparkSession: SparkSession, clusteringColumns: Seq[String], optimizeContext: DeltaOptimizeContext) extends OptimizeTableStrategy with Product with Serializable

Implements clustering strategy for clustered tables

Linear Supertypes
Serializable, Serializable, Product, Equals, OptimizeTableStrategy, AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. ClusteringStrategy
  2. Serializable
  3. Serializable
  4. Product
  5. Equals
  6. OptimizeTableStrategy
  7. AnyRef
  8. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Instance Constructors

  1. new ClusteringStrategy(sparkSession: SparkSession, clusteringColumns: Seq[String], optimizeContext: DeltaOptimizeContext)

Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  5. def clone(): AnyRef
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()
  6. val clusteringColumns: Seq[String]
  7. val curve: String

    The clustering algorithm to be used by either by ZORDER or Liquid CLUSTERING.

    The clustering algorithm to be used by either by ZORDER or Liquid CLUSTERING. An error is thrown for COMPACTION.

    Definition Classes
    ClusteringStrategyOptimizeTableStrategy
  8. final def eq(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  9. def finalize(): Unit
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  10. final def getClass(): Class[_]
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  11. def initNewBin: BinInfo

    Prepare a new Bin and returns its initialized BinInfo.

    Prepare a new Bin and returns its initialized BinInfo.

    This function is expected to be called once for each bin before tagAddFile is called.

    Definition Classes
    ClusteringStrategyOptimizeTableStrategy
  12. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  13. val maxBinSize: Long

    In clustering, the bin size corresponds to a ZCube size that can be adjusted through configurations.

    In clustering, the bin size corresponds to a ZCube size that can be adjusted through configurations.

    Definition Classes
    ClusteringStrategyOptimizeTableStrategy
  14. final def ne(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  15. final def notify(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  16. final def notifyAll(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  17. val optimizeContext: DeltaOptimizeContext
  18. val optimizeTableMode: OptimizeTableMode.Value

    The optimize mode the strategy instance is created for.

    The optimize mode the strategy instance is created for.

    Definition Classes
    ClusteringStrategyOptimizeTableStrategy
  19. def prepareFilesPerPartition(inputFiles: Seq[AddFile]): Seq[AddFile]

    Utility method to prepare files in a partition for optimization.

    Utility method to prepare files in a partition for optimization.

    By default it sorts files on the size for the binpack.

    returns

    Prepared files for the subsequent optimization.

    Definition Classes
    ClusteringStrategyOptimizeTableStrategy
  20. val sparkSession: SparkSession
  21. final def synchronized[T0](arg0: ⇒ T0): T0
    Definition Classes
    AnyRef
  22. def tagAddFile(file: AddFile, binInfo: BinInfo): AddFile

    Incorporate essential tags for optimized files based on the OptimizeTableMode.

    Incorporate essential tags for optimized files based on the OptimizeTableMode.

    Definition Classes
    ClusteringStrategyOptimizeTableStrategy
  23. def updateOptimizeStats(optimizeStats: OptimizeStats, removedFiles: Seq[RemoveFile], bins: Seq[Bin]): Unit

    Utility to update additional metrics after optimization.

    Utility to update additional metrics after optimization.

    optimizeStats

    The input stats to update on.

    removedFiles

    Removed files.

    bins

    Sequence of bin-packed file groups, where each group consists of a partition value and its associated files.

    Definition Classes
    ClusteringStrategyOptimizeTableStrategy
  24. final def wait(): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  25. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  26. final def wait(arg0: Long): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()

Inherited from Serializable

Inherited from Serializable

Inherited from Product

Inherited from Equals

Inherited from OptimizeTableStrategy

Inherited from AnyRef

Inherited from Any

Ungrouped