case class TahoeLogFileIndex(spark: SparkSession, deltaLog: DeltaLog, path: Path, snapshotAtAnalysis: SnapshotDescriptor, partitionFilters: Seq[Expression], isTimeTravelQuery: Boolean) extends TahoeFileIndex with Product with Serializable

A TahoeFileIndex that generates the list of files from DeltaLog with given partition filters.

NOTE: This is NOT a TahoeFileIndexWithSnapshotDescriptor because we only use snapshotAtAnalysis for actual data skipping if this is a time travel query.
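
The note above can be read as a simple selection rule. The following is a minimal sketch of that rule, not the actual Delta implementation: `Snap`, `SnapshotSelection`, and the `latest` parameter are hypothetical stand-ins, and it assumes that a query without time travel scans the most recent snapshot available rather than the one captured at analysis.

```scala
// Hypothetical stand-in for a snapshot; only the version matters here.
final case class Snap(version: Long)

object SnapshotSelection {
  // A time travel query pins the scan to the snapshot resolved at analysis time.
  // Otherwise (assumption) the scan uses the most recent snapshot available.
  def snapshotToScan(atAnalysis: Snap, latest: => Snap, isTimeTravelQuery: Boolean): Snap =
    if (isTimeTravelQuery) atAnalysis else latest

  // Follows the documented contract of versionToUse:
  // a version is reported only for a time travel query.
  def versionToUse(atAnalysis: Snap, isTimeTravelQuery: Boolean): Option[Long] =
    if (isTimeTravelQuery) Some(atAnalysis.version) else None
}
```

Passing `latest` by name means the newer snapshot is only looked up when the query is not a time travel query.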

Linear Supertypes
Serializable, Serializable, Product, Equals, TahoeFileIndex, SnapshotDescriptor, SupportsRowIndexFilters, FileIndex, AnyRef, Any

Instance Constructors

  1. new TahoeLogFileIndex(spark: SparkSession, deltaLog: DeltaLog, path: Path, snapshotAtAnalysis: Snapshot, partitionFilters: Seq[Expression] = Nil, isTimeTravelQuery: Boolean = false)
  2. new TahoeLogFileIndex(spark: SparkSession, deltaLog: DeltaLog, path: Path, snapshotAtAnalysis: SnapshotDescriptor, partitionFilters: Seq[Expression], isTimeTravelQuery: Boolean)

Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  4. def absolutePath(child: String): Path
    Definition Classes
    TahoeFileIndex
  5. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  6. def clone(): AnyRef
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()
  7. val deltaLog: DeltaLog
  8. final def eq(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  9. def equals(that: Any): Boolean
    Definition Classes
    TahoeLogFileIndex → Equals → AnyRef → Any
  10. def fileStatusWithMetadataFromAddFile(addFile: AddFile): FileStatusWithMetadata

    Generates a FileStatusWithMetadata using data extracted from a given AddFile.

    Definition Classes
    TahoeFileIndex
  11. def finalize(): Unit
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  12. def getBasePath(filePath: Path): Option[Path]

    Returns the path of the base directory of the given file path (i.e. its parent directory with all the partition directories stripped off).

    Definition Classes
    TahoeFileIndex
  13. final def getClass(): Class[_]
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  14. def getPartitionValuesRow(partitionValues: Map[String, String]): GenericInternalRow
    Attributes
    protected
    Definition Classes
    TahoeFileIndex
  15. def getSnapshot: Snapshot
  16. def getSnapshotToScan: Snapshot
    Attributes
    protected
  17. def hashCode(): Int
    Definition Classes
    TahoeLogFileIndex → AnyRef → Any
  18. def inputFiles: Array[String]
    Definition Classes
    TahoeLogFileIndex → FileIndex
  19. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  20. val isTimeTravelQuery: Boolean
  21. def listFiles(partitionFilters: Seq[Expression], dataFilters: Seq[Expression]): Seq[PartitionDirectory]
    Definition Classes
    TahoeFileIndex → FileIndex
  22. def listPartitionsAsAddFiles(partitionFilters: Seq[Expression], dataFilters: Seq[Expression]): (Seq[(InternalRow, Seq[AddFile])], Seq[AddFile])

    Returns (i) tuples of partition directories to their respective AddFile actions and (ii) a collection of matched AddFiles. The matched AddFiles are those that meet the criteria set by the partition and data filters. Essentially, this is a collection of all the files associated with the identified partitions.

    Definition Classes
    TahoeFileIndex
  23. def makePartitionDirectories(partitionValuesToFiles: Seq[(InternalRow, Seq[AddFile])]): Seq[PartitionDirectory]
    Definition Classes
    TahoeFileIndex
  24. def matchingFiles(partitionFilters: Seq[Expression], dataFilters: Seq[Expression]): Seq[AddFile]

    Returns all matching/valid files by the given partitionFilters and dataFilters. Implementations may avoid evaluating data filters when doing so would be expensive, but *must* evaluate the partition filters; wrong results will be produced if AddFile entries which don't match the partition filters are returned.

    Definition Classes
    TahoeLogFileIndex → TahoeFileIndex
  25. def metadata: Metadata
    Definition Classes
    TahoeLogFileIndex → SnapshotDescriptor
  26. def metadataOpsTimeNs: Option[Long]
    Definition Classes
    FileIndex
  27. final def ne(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  28. final def notify(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  29. final def notifyAll(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  30. def numOfFilesIfKnown: Option[Long]
    Attributes
    protected[delta]
    Definition Classes
    TahoeLogFileIndex → SnapshotDescriptor
  31. val partitionFilters: Seq[Expression]
  32. def partitionSchema: StructType
    Definition Classes
    TahoeFileIndex → FileIndex
  33. val path: Path
    Definition Classes
    TahoeLogFileIndex → TahoeFileIndex
  34. def protocol: Protocol
    Definition Classes
    TahoeLogFileIndex → SnapshotDescriptor
  35. def refresh(): Unit
    Definition Classes
    TahoeLogFileIndex → FileIndex
  36. def rootPaths: Seq[Path]
    Definition Classes
    TahoeFileIndex → FileIndex
  37. def rowIndexFilters: Option[Map[String, RowIndexFilterType]]

    If the exact rows to read are known a priori (e.g., from a previous scan), this provides the per-file filter, which must be passed down to the appropriate reader.

    Returns a mapping from file names to the row index filter for that file.

    Definition Classes
    SupportsRowIndexFilters
  38. def schema: StructType
    Definition Classes
    SnapshotDescriptor
  39. def sizeInBytes: Long
    Definition Classes
    TahoeLogFileIndex → FileIndex
  40. def sizeInBytesIfKnown: Option[Long]
    Attributes
    protected[delta]
    Definition Classes
    TahoeLogFileIndex → SnapshotDescriptor
  41. val snapshotAtAnalysis: SnapshotDescriptor
  42. val spark: SparkSession
    Definition Classes
    TahoeLogFileIndex → TahoeFileIndex
  43. final def synchronized[T0](arg0: ⇒ T0): T0
    Definition Classes
    AnyRef
  44. def toString(): String
    Definition Classes
    TahoeFileIndex → FileIndex → AnyRef → Any
  45. def version: Long
    Definition Classes
    TahoeLogFileIndex → SnapshotDescriptor
  46. def versionToUse: Option[Long]

    Provides the version that's being used as part of the scan if this is a time travel query.

  47. final def wait(): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  48. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  49. final def wait(arg0: Long): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()
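
The description of getBasePath above (the parent directory with all partition directories stripped off) can be illustrated with plain strings. This is a minimal sketch under stated assumptions, not the library's implementation: `BasePathSketch` and `basePath` are hypothetical names, paths are `/`-separated strings rather than Hadoop `Path` objects, and a directory counts as a partition directory only when it has the form `<column>=<value>` for a known partition column. Root and relative paths get no special handling.

```scala
object BasePathSketch {
  // Hypothetical helper: starts from the file's parent directory and walks up
  // while the last path segment looks like `<partitionColumn>=<value>`.
  def basePath(filePath: String, partitionColumns: Set[String]): Option[String] = {
    val segments = filePath.split('/').toList
    if (segments.length < 2) None // no parent directory to return
    else {
      val parent = segments.init // drop the file name
      val stripped = parent.reverse.dropWhile { segment =>
        segment.split("=", 2) match {
          case Array(column, _) => partitionColumns.contains(column)
          case _                => false
        }
      }.reverse
      if (stripped.isEmpty) None else Some(stripped.mkString("/"))
    }
  }
}
```

For example, with partition columns `date` and `country`, the path `/data/tbl/date=2024-01-01/country=US/part-0.parquet` yields `Some("/data/tbl")`, while a bare file name with no directory yields `None`.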
