public class ColumnarBatchReader
extends org.apache.iceberg.arrow.vectorized.BaseBatchReader<org.apache.spark.sql.vectorized.ColumnarBatch>
VectorizedReader that returns Spark's ColumnarBatch to support Spark's vectorized
read path. The ColumnarBatch returned is created by passing in the Arrow vectors
populated via delegated read calls to VectorReader(s).

| Constructor and Description |
|---|
| ColumnarBatchReader(java.util.List<org.apache.iceberg.parquet.VectorizedReader<?>> readers) |

| Modifier and Type | Method and Description |
|---|---|
| org.apache.spark.sql.vectorized.ColumnarBatch | read(org.apache.spark.sql.vectorized.ColumnarBatch reuse, int numRowsToRead) |
| void | setDeleteFilter(org.apache.iceberg.data.DeleteFilter<org.apache.spark.sql.catalyst.InternalRow> deleteFilter) |
| void | setRowGroupInfo(org.apache.parquet.column.page.PageReadStore pageStore, java.util.Map<org.apache.parquet.hadoop.metadata.ColumnPath,org.apache.parquet.hadoop.metadata.ColumnChunkMetaData> metaData, long rowPosition) |
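
The class description says the returned batch wraps vectors filled by delegated reads on the per-column readers. The sketch below models only that delegation pattern in plain Java so it runs without Iceberg, Spark, or Arrow on the classpath. Every type in it (`ColumnReaderSketch`, `BatchSketch`, `BatchReaderSketch`, `CountingColumn`) is hypothetical and stands in for the real classes; it is not the library's implementation.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a per-column vectorized reader (not an Iceberg type).
interface ColumnReaderSketch {
  // Returns the next numRows values of this column.
  int[] read(int numRows);
}

// Hypothetical stand-in for a columnar batch: one vector per column plus a row count.
final class BatchSketch {
  final List<int[]> columns;
  final int numRows;

  BatchSketch(List<int[]> columns, int numRows) {
    this.columns = columns;
    this.numRows = numRows;
  }
}

// Models a batch reader that delegates to its column readers and wraps their vectors.
final class BatchReaderSketch {
  private final List<ColumnReaderSketch> readers;

  BatchReaderSketch(List<ColumnReaderSketch> readers) {
    this.readers = readers;
  }

  BatchSketch read(int numRowsToRead) {
    List<int[]> vectors = new ArrayList<>();
    for (ColumnReaderSketch reader : readers) {
      vectors.add(reader.read(numRowsToRead)); // delegated read, one vector per column
    }
    return new BatchSketch(vectors, numRowsToRead);
  }
}

// Demo-only column that emits start, start + 1, start + 2, ...
final class CountingColumn implements ColumnReaderSketch {
  private int next;

  CountingColumn(int start) {
    this.next = start;
  }

  @Override
  public int[] read(int numRows) {
    int[] out = new int[numRows];
    for (int i = 0; i < numRows; i++) {
      out[i] = next++;
    }
    return out;
  }
}

final class DelegationDemo {
  public static void main(String[] args) {
    BatchReaderSketch reader =
        new BatchReaderSketch(List.of(new CountingColumn(0), new CountingColumn(100)));
    BatchSketch batch = reader.read(3);
    System.out.println(batch.numRows + " rows, " + batch.columns.size() + " columns");
  }
}
```

In this model the batch reader holds no column data of its own. Each call asks every column reader for the same number of rows, so the vectors in one batch stay aligned.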
public ColumnarBatchReader(java.util.List<org.apache.iceberg.parquet.VectorizedReader<?>> readers)
public void setRowGroupInfo(org.apache.parquet.column.page.PageReadStore pageStore,
java.util.Map<org.apache.parquet.hadoop.metadata.ColumnPath,org.apache.parquet.hadoop.metadata.ColumnChunkMetaData> metaData,
long rowPosition)
Specified by: setRowGroupInfo in interface org.apache.iceberg.parquet.VectorizedReader<org.apache.spark.sql.vectorized.ColumnarBatch>
Overrides: setRowGroupInfo in class org.apache.iceberg.arrow.vectorized.BaseBatchReader<org.apache.spark.sql.vectorized.ColumnarBatch>
public void setDeleteFilter(org.apache.iceberg.data.DeleteFilter<org.apache.spark.sql.catalyst.InternalRow> deleteFilter)
public final org.apache.spark.sql.vectorized.ColumnarBatch read(org.apache.spark.sql.vectorized.ColumnarBatch reuse,
int numRowsToRead)
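
The signatures above suggest a calling order: `setRowGroupInfo` once per row group, then repeated `read` calls that pass the previous batch as `reuse`. The sketch below illustrates that order with a dependency-free model. `RowGroupReaderSketch` and `ReadLoopSketch` are hypothetical names, the Parquet arguments are reduced to a bare `rowPosition`, and capping each batch at the row-group boundary is an assumption about the caller rather than documented behavior of this class.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical reader with the same call shape as the methods above, minus the Parquet types.
final class RowGroupReaderSketch {
  private long nextRow;

  // Stand-in for setRowGroupInfo: remember the file position of the row group's first row.
  void setRowGroupInfo(long rowPosition) {
    this.nextRow = rowPosition;
  }

  // Stand-in for read(reuse, numRowsToRead): fill the offered buffer when its size matches.
  long[] read(long[] reuse, int numRowsToRead) {
    long[] batch =
        (reuse != null && reuse.length == numRowsToRead) ? reuse : new long[numRowsToRead];
    for (int i = 0; i < numRowsToRead; i++) {
      batch[i] = nextRow++;
    }
    return batch;
  }
}

final class ReadLoopSketch {
  // Reads every row group in batches of at most batchSize rows and returns the batch sizes.
  static List<Integer> readAll(RowGroupReaderSketch reader, long[] rowGroupSizes, int batchSize) {
    List<Integer> batchSizes = new ArrayList<>();
    long rowPosition = 0;
    long[] last = null;
    for (long groupSize : rowGroupSizes) {
      reader.setRowGroupInfo(rowPosition); // once per row group, before any read
      long remaining = groupSize;
      while (remaining > 0) {
        int numRowsToRead = (int) Math.min(remaining, batchSize);
        last = reader.read(last, numRowsToRead); // offer the previous batch for reuse
        batchSizes.add(numRowsToRead);
        remaining -= numRowsToRead;
      }
      rowPosition += groupSize;
    }
    return batchSizes;
  }

  public static void main(String[] args) {
    // Two row groups of 5 and 3 rows, read with a batch size of 4.
    System.out.println(readAll(new RowGroupReaderSketch(), new long[] {5, 3}, 4));
  }
}
```

Passing the previous batch back in lets the reader avoid allocating a new container on every call, which is the usual reason for a `reuse` parameter of this shape.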