package files
- Alphabetic
- Public
- Protected
Type Members
- class CdcAddFileIndex extends TahoeBatchFileIndex
A TahoeFileIndex for scanning a sequence of added files as CDC.
A TahoeFileIndex for scanning a sequence of added files as CDC. Similar to TahoeBatchFileIndex, with a bit of special handling to attach the log version and CDC type on a per-file basis.
- class DelayedCommitProtocol extends FileCommitProtocol with Serializable with Logging
Writes out the files to
pathand returns a list of them inaddedStatuses.Writes out the files to
pathand returns a list of them inaddedStatuses. Includes special handling for partitioning on CDC_PARTITION_COL for compatibility between enabled and disabled CDC; partitions with a value of false in this column produce no corresponding partitioning directory. - case class DeltaFileListingResult(partitions: Seq[(InternalRow, Seq[AddFile])], addFiles: Seq[AddFile], sortTime: Long = 0L) extends Product with Serializable
Similar to FileListingResult, but maintains the partitions as AddFile.
- class DeltaSourceSnapshot extends StateCache
Converts a
Snapshotinto the initial set of files read when starting a new streaming query.Converts a
Snapshotinto the initial set of files read when starting a new streaming query. The list of files that represent the table at the time the query starts are selected by: - Addingversionandindexto each file to enable splitting of the initial state into multiple batches. - Filtering files that don't match partition predicates, while preserving the aforementioned indexing. - trait SQLMetricsReporting extends AnyRef
This trait is used to register SQL metrics for a Delta Operation.
This trait is used to register SQL metrics for a Delta Operation. Registering will allow the metrics to be instrumented via the CommitInfo and is accessible via DescribeHistory
- trait SupportsRowIndexFilters extends AnyRef
- class TahoeBatchFileIndex extends TahoeFileIndexWithSnapshotDescriptor
A TahoeFileIndex that generates the list of files from a given list of files that are within a version range of DeltaLog.
- class TahoeChangeFileIndex extends TahoeFileIndexWithSnapshotDescriptor
A TahoeFileIndex for scanning a sequence of CDC files.
A TahoeFileIndex for scanning a sequence of CDC files. Similar to TahoeBatchFileIndex, the equivalent for reading AddFile actions.
- abstract class TahoeFileIndex extends FileIndex with SupportsRowIndexFilters with SnapshotDescriptor
A FileIndex that generates the list of files managed by the Tahoe protocol.
- abstract class TahoeFileIndexWithSnapshotDescriptor extends TahoeFileIndex
A TahoeFileIndex that works with a specific SnapshotDescriptor.
- case class TahoeLogFileIndex(spark: SparkSession, deltaLog: DeltaLog, path: Path, snapshotAtAnalysis: Snapshot, partitionFilters: Seq[Expression] = Nil, isTimeTravelQuery: Boolean = false) extends TahoeFileIndex with Product with Serializable
A TahoeFileIndex that generates the list of files from DeltaLog with given partition filters.
A TahoeFileIndex that generates the list of files from DeltaLog with given partition filters.
NOTE: This is NOT a TahoeFileIndexWithSnapshotDescriptor because we only use snapshotAtAnalysis for actual data skipping if this is a time travel query.
- class TahoeRemoveFileIndex extends TahoeFileIndexWithSnapshotDescriptor
A TahoeFileIndex for scanning a sequence of removed files as CDC.
A TahoeFileIndex for scanning a sequence of removed files as CDC. Similar to TahoeBatchFileIndex, the equivalent for reading AddFile actions.
- trait TransactionalWrite extends DeltaLogging
Adds the ability to write files out as part of a transaction.
Adds the ability to write files out as part of a transaction. Checks are performed to ensure that the data being written matches either the current metadata or the new metadata being set by this transaction.
Value Members
- object DeltaFileFormatWriter extends Logging
A helper object for writing FileFormat data out to a location.
A helper object for writing FileFormat data out to a location. Logic is copied from FileFormatWriter from Spark 3.5 with added functionality to write partition values to data files. Specifically L123-126, L132, and L140 where it adds option WRITE_PARTITION_COLUMNS
- object TahoeLogFileIndex extends Serializable
- object TransactionalWrite