com.nvidia.spark.rapids

GpuOrcMultiFilePartitionReaderFactory

case class GpuOrcMultiFilePartitionReaderFactory(sqlConf: SQLConf, broadcastedConf: Broadcast[SerializableConfiguration], dataSchema: StructType, readDataSchema: StructType, partitionSchema: StructType, filters: Array[Filter], rapidsConf: RapidsConf, metrics: Map[String, GpuMetric], queryUsesInputFile: Boolean) extends MultiFilePartitionReaderFactoryBase with Product with Serializable

The multi-file partition reader factory that creates cloud or coalescing partition readers for the ORC file format.

sqlConf

the SQLConf

broadcastedConf

the Hadoop configuration

dataSchema

schema of the data

readDataSchema

the Spark schema describing what will be read

partitionSchema

schema of partitions.

filters

filters on non-partition columns

rapidsConf

the Rapids configuration

metrics

the metrics

queryUsesInputFile

a flag that lets GpuTransitionOverrides turn off coalescing reading when InputFileName, InputFileBlockStart, or InputFileBlockLength are used
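
As a rough illustration of how the cloud and coalescing paths and the queryUsesInputFile flag might interact, the following self-contained Scala sketch uses stand-in types instead of the Spark and RAPIDS classes. FileRef, ReaderKind, SketchConf, OrcFactorySketch, and the scheme-based dispatch rule are all assumptions made for illustration; the plugin's real selection logic is not described on this page and may differ.

```scala
// Hypothetical stand-ins so the sketch runs without Spark or the RAPIDS plugin.
final case class FileRef(path: String)

sealed trait ReaderKind
case object CloudReading extends ReaderKind
case object CoalescingReading extends ReaderKind

// Assumed settings; the field names are illustrative, not RapidsConf keys.
final case class SketchConf(coalesceEnabled: Boolean, cloudSchemes: Set[String])

final class OrcFactorySketch(conf: SketchConf, queryUsesInputFile: Boolean) {
  // Assumption: coalescing merges several files into one buffer, so it is
  // switched off when the query needs per-row input file information.
  val canUseCoalesceFilesReader: Boolean =
    conf.coalesceEnabled && !queryUsesInputFile

  // A file counts as "cloud" when its URI scheme is in the configured set.
  private def onCloud(f: FileRef): Boolean = {
    val idx = f.path.indexOf("://")
    idx > 0 && conf.cloudSchemes.contains(f.path.substring(0, idx))
  }

  // Assumed dispatch rule: coalesce only when allowed and no file sits on a
  // cloud scheme; otherwise use cloud reading.
  def chooseReader(files: Seq[FileRef]): ReaderKind =
    if (canUseCoalesceFilesReader && !files.exists(onCloud)) CoalescingReading
    else CloudReading
}

object OrcFactorySketchDemo {
  def main(args: Array[String]): Unit = {
    val conf = SketchConf(coalesceEnabled = true, cloudSchemes = Set("s3a", "gs"))
    val onHdfs = Seq(FileRef("hdfs://nn/warehouse/part-0.orc"))
    val onS3 = Seq(FileRef("s3a://bucket/part-0.orc"))
    println(new OrcFactorySketch(conf, queryUsesInputFile = false).chooseReader(onHdfs))
    println(new OrcFactorySketch(conf, queryUsesInputFile = false).chooseReader(onS3))
    println(new OrcFactorySketch(conf, queryUsesInputFile = true).chooseReader(onHdfs))
  }
}
```

In this sketch only the first call selects coalescing reading; a cloud path or a query that uses input file information sends the other two to cloud reading.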

Linear Supertypes

Serializable, Product, Equals, MultiFilePartitionReaderFactoryBase, Logging, PartitionReaderFactory, Serializable, AnyRef, Any

Instance Constructors

  1. new GpuOrcMultiFilePartitionReaderFactory(sqlConf: SQLConf, broadcastedConf: Broadcast[SerializableConfiguration], dataSchema: StructType, readDataSchema: StructType, partitionSchema: StructType, filters: Array[Filter], rapidsConf: RapidsConf, metrics: Map[String, GpuMetric], queryUsesInputFile: Boolean)

Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  5. val broadcastedConf: Broadcast[SerializableConfiguration]
  6. def buildBaseColumnarReaderForCloud(files: Array[PartitionedFile], conf: Configuration): PartitionReader[ColumnarBatch]

    Build the PartitionReader for cloud reading

    files

    files to be read

    conf

    configuration

    returns

    cloud reading PartitionReader

    Definition Classes
    GpuOrcMultiFilePartitionReaderFactory → MultiFilePartitionReaderFactoryBase
  7. def buildBaseColumnarReaderForCoalescing(files: Array[PartitionedFile], conf: Configuration): PartitionReader[ColumnarBatch]

    Build the PartitionReader for coalescing reading

    files

    files to be read

    conf

    the configuration

    returns

    coalescing reading PartitionReader

    Definition Classes
    GpuOrcMultiFilePartitionReaderFactory → MultiFilePartitionReaderFactoryBase
  8. val canUseCoalesceFilesReader: Boolean

    Indicates whether coalescing reading can be used. Declared abstract in MultiFilePartitionReaderFactoryBase and implemented here.

    Definition Classes
    GpuOrcMultiFilePartitionReaderFactory → MultiFilePartitionReaderFactoryBase
  9. val canUseMultiThreadReader: Boolean

    Indicates whether cloud reading can be used. Declared abstract in MultiFilePartitionReaderFactoryBase and implemented here.

    Definition Classes
    GpuOrcMultiFilePartitionReaderFactory → MultiFilePartitionReaderFactoryBase
  10. def clone(): AnyRef
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()
  11. def createColumnarReader(partition: InputPartition): PartitionReader[ColumnarBatch]
    Definition Classes
    MultiFilePartitionReaderFactoryBase → PartitionReaderFactory
  12. def createReader(partition: InputPartition): PartitionReader[InternalRow]
    Definition Classes
    MultiFilePartitionReaderFactoryBase → PartitionReaderFactory
  13. val dataSchema: StructType
  14. final def eq(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  15. val filters: Array[Filter]
  16. def finalize(): Unit
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  17. final def getClass(): Class[_]
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  18. final def getFileFormatShortName: String

    File format short name, used for logging and elsewhere to uniquely identify which file format is being used.

    returns

    the file format short name

    Definition Classes
    GpuOrcMultiFilePartitionReaderFactory → MultiFilePartitionReaderFactoryBase
  19. def initializeLogIfNecessary(isInterpreter: Boolean, silent: Boolean): Boolean
    Attributes
    protected
    Definition Classes
    Logging
  20. def initializeLogIfNecessary(isInterpreter: Boolean): Unit
    Attributes
    protected
    Definition Classes
    Logging
  21. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  22. def isTraceEnabled(): Boolean
    Attributes
    protected
    Definition Classes
    Logging
  23. def log: Logger
    Attributes
    protected
    Definition Classes
    Logging
  24. def logDebug(msg: ⇒ String, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  25. def logDebug(msg: ⇒ String): Unit
    Attributes
    protected
    Definition Classes
    Logging
  26. def logError(msg: ⇒ String, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  27. def logError(msg: ⇒ String): Unit
    Attributes
    protected
    Definition Classes
    Logging
  28. def logInfo(msg: ⇒ String, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  29. def logInfo(msg: ⇒ String): Unit
    Attributes
    protected
    Definition Classes
    Logging
  30. def logName: String
    Attributes
    protected
    Definition Classes
    Logging
  31. def logTrace(msg: ⇒ String, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  32. def logTrace(msg: ⇒ String): Unit
    Attributes
    protected
    Definition Classes
    Logging
  33. def logWarning(msg: ⇒ String, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  34. def logWarning(msg: ⇒ String): Unit
    Attributes
    protected
    Definition Classes
    Logging
  35. val maxChunkedReaderMemoryUsageSizeBytes: Long
    Attributes
    protected
    Definition Classes
    MultiFilePartitionReaderFactoryBase
  36. val maxGpuColumnSizeBytes: Long
    Attributes
    protected
    Definition Classes
    MultiFilePartitionReaderFactoryBase
  37. val maxReadBatchSizeBytes: Long
    Attributes
    protected
    Definition Classes
    MultiFilePartitionReaderFactoryBase
  38. val maxReadBatchSizeRows: Int
    Attributes
    protected
    Definition Classes
    MultiFilePartitionReaderFactoryBase
  39. val metrics: Map[String, GpuMetric]
  40. final def ne(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  41. final def notify(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  42. final def notifyAll(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  43. val partitionSchema: StructType
  44. val queryUsesInputFile: Boolean
  45. val rapidsConf: RapidsConf
  46. val readDataSchema: StructType
  47. val sqlConf: SQLConf
  48. def supportColumnarReads(partition: InputPartition): Boolean
    Definition Classes
    MultiFilePartitionReaderFactoryBase → PartitionReaderFactory
  49. final def synchronized[T0](arg0: ⇒ T0): T0
    Definition Classes
    AnyRef
  50. val targetBatchSizeBytes: Long
    Attributes
    protected
    Definition Classes
    MultiFilePartitionReaderFactoryBase
  51. val useChunkedReader: Boolean
    Attributes
    protected
    Definition Classes
    MultiFilePartitionReaderFactoryBase
  52. final def wait(): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  53. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  54. final def wait(arg0: Long): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()
