class DeltaSourceSplit extends FileSourceSplit
A SourceSplit that represents a Parquet file, or a region of a file.
The split additionally has an offset and an end, which defines the region of the file represented by the split. For splits representing the while file, the offset is zero and the length is the file size.
The split may furthermore have a "reader position", which is the checkpointed position from a reader previously reading this split. This position is null when the split is assigned from the enumerator to the readers, and is non-null when the reader's checkpoint their state in a file source split.
This implementation extends a FileSourceSplit with Delta table partition
information
- Alphabetic
- By Inheritance
- DeltaSourceSplit
- FileSourceSplit
- Serializable
- SourceSplit
- AnyRef
- Any
- Hide All
- Show All
- Public
- All
Instance Constructors
-
new
DeltaSourceSplit(partitionValues: Map[String, String], id: String, filePath: Path, offset: Long, length: Long, hostnames: Array[String], readerPosition: CheckpointedPosition)
Constructs a split with host information and reader position restored from checkpoint.
Constructs a split with host information and reader position restored from checkpoint.
The
hostnamesparameter provides information about the names of the hosts storing this range of the file. Empty, if no host information is available. Host information is typically only available on a specific file systems, like HDFS.- partitionValues
The Delta partition column to partition value map that should be used for underlying Parquet File.
- id
The unique ID of this source split.
- filePath
The path to the Parquet file that this splits represents.
- offset
The start (inclusive) of the split's rage in the Parquet file, in bytes.
- length
The number of bytes in the split (starting from the offset)
- hostnames
The hostnames of the nodes storing the split's file range.
- readerPosition
The reader position in bytes recovered from a checkpoint.
-
new
DeltaSourceSplit(partitionValues: Map[String, String], id: String, filePath: Path, offset: Long, length: Long, hostnames: <repeated...>[String])
Constructs a split with host information and no reader position.
Constructs a split with host information and no reader position.
The
hostnamesprovides information about the names of the hosts is storing this range of the file. Empty, if no host information is available. Host information is typically only available on a specific file systems, like HDFS.- partitionValues
The Delta partition column to partition value map that should be used for underlying Parquet File.
- id
The unique ID of this source split.
- filePath
The path to the Parquet file that this splits represents.
- offset
The start (inclusive) of the split's rage in the Parquet file, in bytes.
- length
The number of bytes in the split (starting from the offset)
- hostnames
The hostnames of the nodes storing the split's file range.
- new DeltaSourceSplit(partitionValues: Map[String, String], id: String, filePath: Path, offset: Long, length: Long)
Value Members
-
final
def
!=(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
final
def
##(): Int
- Definition Classes
- AnyRef → Any
-
final
def
==(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
final
def
asInstanceOf[T0]: T0
- Definition Classes
- Any
-
def
clone(): AnyRef
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws( ... ) @native()
-
final
def
eq(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
-
def
equals(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
def
fileModificationTime(): Long
- Definition Classes
- FileSourceSplit
-
def
fileSize(): Long
- Definition Classes
- FileSourceSplit
-
def
finalize(): Unit
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws( classOf[java.lang.Throwable] )
-
final
def
getClass(): Class[_]
- Definition Classes
- AnyRef → Any
- Annotations
- @native()
-
def
getPartitionValues(): Map[String, String]
- returns
an unmodifiable Map of Delta Table Partition columns and values.
-
def
getReaderPosition(): Optional[CheckpointedPosition]
- Definition Classes
- FileSourceSplit
-
def
hashCode(): Int
- Definition Classes
- AnyRef → Any
- Annotations
- @native()
-
def
hostnames(): Array[String]
- Definition Classes
- FileSourceSplit
-
final
def
isInstanceOf[T0]: Boolean
- Definition Classes
- Any
-
def
length(): Long
- Definition Classes
- FileSourceSplit
-
final
def
ne(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
-
final
def
notify(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
-
final
def
notifyAll(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
-
def
offset(): Long
- Definition Classes
- FileSourceSplit
-
def
path(): Path
- Definition Classes
- FileSourceSplit
-
def
splitId(): String
- Definition Classes
- FileSourceSplit → SourceSplit
-
final
def
synchronized[T0](arg0: ⇒ T0): T0
- Definition Classes
- AnyRef
-
def
toString(): String
- Definition Classes
- FileSourceSplit → AnyRef → Any
-
def
updateWithCheckpointedPosition(position: CheckpointedPosition): DeltaSourceSplit
- Definition Classes
- DeltaSourceSplit → FileSourceSplit
- Annotations
- @Override()
-
final
def
wait(): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... )
-
final
def
wait(arg0: Long, arg1: Int): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... )
-
final
def
wait(arg0: Long): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... ) @native()