public class SparkOrcReader
extends java.lang.Object
implements org.apache.iceberg.orc.OrcRowReader<org.apache.spark.sql.catalyst.InternalRow>
This reader minimizes allocations by reusing most of the objects in its implementation.
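The allocation note above can be illustrated with a generic object-reuse pattern. The following is a minimal sketch only: `ReusingRowReader` and the `long[][]` "batch" are hypothetical stand-ins invented for this example, not Iceberg, ORC, or Spark types, and the sketch does not claim to mirror the internals of `SparkOrcReader`.

```java
// Hypothetical stand-in; not an Iceberg, ORC, or Spark class.
final class ReusingRowReader {
  // Allocated once in the constructor and overwritten on every read() call.
  private final Object[] reusedRow;

  ReusingRowReader(int numFields) {
    this.reusedRow = new Object[numFields];
  }

  // columns[c][row] plays the role of one column vector in a batch.
  // Returns the same array instance on every call, with fresh contents.
  Object[] read(long[][] columns, int row) {
    for (int c = 0; c < reusedRow.length; c++) {
      reusedRow[c] = columns[c][row];
    }
    return reusedRow;
  }
}
```

With a reader written in this style, a caller that needs to keep a row past the next `read` call would have to copy it first (for example with `clone()`), because the next call overwrites the shared instance.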
| Constructor and Description |
|---|
| SparkOrcReader(org.apache.iceberg.Schema expectedSchema, org.apache.orc.TypeDescription readSchema) |
| SparkOrcReader(org.apache.iceberg.Schema expectedSchema, org.apache.orc.TypeDescription readOrcSchema, java.util.Map<java.lang.Integer,?> idToConstant) |
| Modifier and Type | Method and Description |
|---|---|
| org.apache.spark.sql.catalyst.InternalRow | read(org.apache.orc.storage.ql.exec.vector.VectorizedRowBatch batch, int row) |
| void | setBatchContext(long batchOffsetInFile) |
public SparkOrcReader(org.apache.iceberg.Schema expectedSchema,
org.apache.orc.TypeDescription readSchema)
public SparkOrcReader(org.apache.iceberg.Schema expectedSchema,
org.apache.orc.TypeDescription readOrcSchema,
java.util.Map<java.lang.Integer,?> idToConstant)
public org.apache.spark.sql.catalyst.InternalRow read(org.apache.orc.storage.ql.exec.vector.VectorizedRowBatch batch,
int row)
read in interface org.apache.iceberg.orc.OrcRowReader<org.apache.spark.sql.catalyst.InternalRow>
public void setBatchContext(long batchOffsetInFile)
setBatchContext in interface org.apache.iceberg.orc.OrcRowReader<org.apache.spark.sql.catalyst.InternalRow>
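The two interface methods suggest a per-batch, then per-row calling pattern. The sketch below shows one plausible driving loop. It rests on assumptions: `RowReaderSketch`, `PositionTaggingReader`, and `BatchLoopSketch` are hypothetical names, a `long[]` stands in for a `VectorizedRowBatch`, and `batchOffsetInFile` is assumed (from its name) to be the file position of the first row in the batch.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in that mirrors the two method shapes listed above.
interface RowReaderSketch<T> {
  T read(long[] batch, int row);

  void setBatchContext(long batchOffsetInFile);
}

// Toy reader that tags each value with its assumed absolute row position.
final class PositionTaggingReader implements RowReaderSketch<String> {
  private long batchOffsetInFile;

  @Override
  public void setBatchContext(long batchOffsetInFile) {
    this.batchOffsetInFile = batchOffsetInFile;
  }

  @Override
  public String read(long[] batch, int row) {
    return (batchOffsetInFile + row) + ":" + batch[row];
  }
}

final class BatchLoopSketch {
  // Calls setBatchContext once per batch, then read once per row in that batch.
  static List<String> readAll(RowReaderSketch<String> reader, long[][] batches) {
    List<String> out = new ArrayList<>();
    long offset = 0;
    for (long[] batch : batches) {
      reader.setBatchContext(offset);
      for (int row = 0; row < batch.length; row++) {
        out.add(reader.read(batch, row));
      }
      offset += batch.length;
    }
    return out;
  }
}
```

In this sketch the batch context is set before any row of that batch is read, so a reader that depends on the offset sees a consistent value for every row in the batch.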