package util
- Alphabetic
- Public
- All
Type Members
- trait AnalysisHelper extends AnyRef
-
class
DatasetRefCache[T] extends AnyRef
A Dataset reference cache to automatically create new Dataset objects when the active SparkSession changes.
A Dataset reference cache to automatically create new Dataset objects when the active SparkSession changes. This is useful when sharing objects holding Dataset references cross multiple sessions. Without this, using a Dataset that holds a stale session may change the active session and cause multiple issues (e.g., if we switch to a stale session coming from a notebook that has been detached, we may not be able to use built-in functions because those are cleaned up).
The
creatorfunction will be called to create a new Dataset object when the old one has a different session than the current active session.Unlike StateCache, this class only caches the Dataset reference and doesn't cache the underlying
RDD. -
sealed
trait
DateFormatter extends Serializable
Forked from org.apache.spark.sql.catalyst.util.DateFormatter
-
trait
DateTimeFormatterHelper extends AnyRef
Forked from org.apache.spark.sql.catalyst.util.DateTimeFormatterHelper
- trait DeltaProgressReporter extends Logging
-
class
FractionTimestampFormatter extends Iso8601TimestampFormatter
The formatter parses/formats timestamps according to the pattern
yyyy-MM-dd HH:mm:ss.[..fff..]where[..fff..]is a fraction of second up to microsecond resolution.The formatter parses/formats timestamps according to the pattern
yyyy-MM-dd HH:mm:ss.[..fff..]where[..fff..]is a fraction of second up to microsecond resolution. The formatter does not output trailing zeros in the fraction. For example, the timestamp2019-03-05 15:00:01.123400is formatted as the string2019-03-05 15:00:01.1234. - class Iso8601DateFormatter extends DateFormatter with DateTimeFormatterHelper
- class Iso8601TimestampFormatter extends TimestampFormatter with DateTimeFormatterHelper
-
class
PartitionPath extends AnyRef
Holds a directory in a partitioned collection of files as well as the partition values in the form of a Row.
Holds a directory in a partitioned collection of files as well as the partition values in the form of a Row. Before scanning, the files at
pathneed to be enumerated. - case class PartitionSpec(partitionColumns: StructType, partitions: Seq[PartitionPath]) extends Product with Serializable
-
case class
SerializableFileStatus(path: String, length: Long, isDir: Boolean, modificationTime: Long) extends Product with Serializable
A serializable variant of HDFS's FileStatus.
-
class
SetAccumulator[T] extends AccumulatorV2[T, Set[T]]
Accumulator to collect distinct elements as a set.
-
trait
StateCache extends AnyRef
Machinary that caches the reconstructed state of a Delta table using the RDD cache.
Machinary that caches the reconstructed state of a Delta table using the RDD cache. The cache is designed so that the first access will materialize the results. However once uncache is called, all data will be flushed and will not be cached again.
-
sealed
trait
TimestampFormatter extends Serializable
Forked from org.apache.spark.sql.catalyst.util.TimestampFormatter
Value Members
- object AnalysisHelper
- object DateFormatter extends Serializable
-
object
DateTimeUtils
Helper functions for converting between internal and external date and time representations.
Helper functions for converting between internal and external date and time representations. Dates are exposed externally as java.sql.Date and are represented internally as the number of dates since the Unix epoch (1970-01-01). Timestamps are exposed externally as java.sql.Timestamp and are stored internally as longs, which are capable of storing timestamps with microsecond precision.
-
object
DeltaFileOperations extends DeltaLogging
Some utility methods on files, directories, and paths.
-
object
FileNames
Helper for creating file names for specific commits / checkpoints.
-
object
JsonUtils
Useful json functions used around the Delta codebase.
-
object
PartitionPath
This file is forked from org.apache.spark.sql.execution.datasources.PartitioningUtils.
- object PartitionSpec extends Serializable
- object SerializableFileStatus extends Serializable
- object TimestampFormatter extends Serializable