class OrcColumnarBatchReader extends RecordReader[Void, ColumnarBatch]
To support vectorization in WholeStageCodeGen, this reader returns ColumnarBatch.
After creating, initialize and initBatch should be called sequentially.
- Alphabetic
- By Inheritance
- OrcColumnarBatchReader
- RecordReader
- Closeable
- AutoCloseable
- AnyRef
- Any
- Hide All
- Show All
- Public
- Protected
Instance Constructors
- new OrcColumnarBatchReader(capacity: Int)
Value Members
- final def !=(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
- final def ##: Int
- Definition Classes
- AnyRef → Any
- final def ==(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
- final def asInstanceOf[T0]: T0
- Definition Classes
- Any
- def clone(): AnyRef
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.CloneNotSupportedException]) @native()
- def close(): Unit
- Definition Classes
- OrcColumnarBatchReader → RecordReader → Closeable → AutoCloseable
- Annotations
- @Override()
- final def eq(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
- def equals(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef → Any
- def finalize(): Unit
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.Throwable])
- final def getClass(): Class[_ <: AnyRef]
- Definition Classes
- AnyRef → Any
- Annotations
- @native()
- def getCurrentKey(): Void
- Definition Classes
- OrcColumnarBatchReader → RecordReader
- Annotations
- @Override()
- def getCurrentValue(): ColumnarBatch
- Definition Classes
- OrcColumnarBatchReader → RecordReader
- Annotations
- @Override()
- def getProgress(): Float
- Definition Classes
- OrcColumnarBatchReader → RecordReader
- Annotations
- @Override()
- def hashCode(): Int
- Definition Classes
- AnyRef → Any
- Annotations
- @native()
- def initBatch(orcSchema: TypeDescription, requiredFields: Array[StructField], requestedDataColIds: Array[Int], requestedPartitionColIds: Array[Int], partitionValues: InternalRow): Unit
Initialize columnar batch by setting required schema and partition information.
Initialize columnar batch by setting required schema and partition information. With this information, this creates ColumnarBatch with the full schema.
- orcSchema
Schema from ORC file reader.
- requiredFields
All the fields that are required to return, including partition fields.
- requestedDataColIds
Requested column ids from orcSchema. -1 if not existed.
- requestedPartitionColIds
Requested column ids from partition schema. -1 if not existed.
- partitionValues
Values of partition columns.
- def initialize(inputSplit: InputSplit, taskAttemptContext: TaskAttemptContext): Unit
Initialize ORC file reader and batch record reader.
Initialize ORC file reader and batch record reader. Please note that
initBatchis needed to be called after this.- Definition Classes
- OrcColumnarBatchReader → RecordReader
- Annotations
- @Override()
- final def isInstanceOf[T0]: Boolean
- Definition Classes
- Any
- final def ne(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
- def nextKeyValue(): Boolean
- Definition Classes
- OrcColumnarBatchReader → RecordReader
- Annotations
- @Override()
- final def notify(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
- final def notifyAll(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
- final def synchronized[T0](arg0: => T0): T0
- Definition Classes
- AnyRef
- def toString(): String
- Definition Classes
- AnyRef → Any
- final def wait(): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.InterruptedException])
- final def wait(arg0: Long, arg1: Int): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.InterruptedException])
- final def wait(arg0: Long): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.InterruptedException]) @native()