Package org.apache.iceberg.parquet
Interface VectorizedReader<T>
-
public interface VectorizedReader<T>
Interface for vectorized Iceberg readers.
-
Method Summary
Modifier and Type, Method, Description
void close()
    Release any resources allocated.
T read(T reuse, int numRows)
    Reads a batch of type T and of size numRows.
void setBatchSize(int batchSize)
default void setRowGroupInfo(org.apache.parquet.column.page.PageReadStore pages, java.util.Map<org.apache.parquet.hadoop.metadata.ColumnPath,org.apache.parquet.hadoop.metadata.ColumnChunkMetaData> metadata)
    Sets the row group information to be used with this reader.
void setRowGroupInfo(org.apache.parquet.column.page.PageReadStore pages, java.util.Map<org.apache.parquet.hadoop.metadata.ColumnPath,org.apache.parquet.hadoop.metadata.ColumnChunkMetaData> metadata, long rowPosition)
    Deprecated. since 1.8.0, will be removed in 1.9.0; use setRowGroupInfo(PageReadStore, Map) instead.
-
Method Detail
-
read
T read(T reuse, int numRows)
Reads a batch of type T and of size numRows.
Parameters:
reuse - container for the last batch to be reused for next batch
numRows - number of rows to read
Returns:
batch of records of type T
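The reuse parameter suggests a read loop in which the caller passes the previous batch back in so that its storage can be recycled. The following is a minimal sketch of that pattern, not Iceberg code: BatchReader, IntBatch, ArrayReader and sumAll are hypothetical stand-ins that only mirror the read/setBatchSize/close shape of this interface, and the rule "reuse the container when it is large enough" is an assumption about how an implementation might treat reuse.

```java
public class ReuseBatchSketch {
  // Hypothetical stand-in that mirrors read/setBatchSize/close.
  // It is NOT org.apache.iceberg.parquet.VectorizedReader and omits setRowGroupInfo.
  interface BatchReader<T> {
    T read(T reuse, int numRows);

    void setBatchSize(int batchSize);

    void close();
  }

  // Hypothetical batch container: a fixed-capacity array plus the count of valid rows.
  static final class IntBatch {
    final int[] values;
    int size;

    IntBatch(int capacity) {
      this.values = new int[capacity];
    }
  }

  // Toy reader that serves rows from an in-memory array.
  static final class ArrayReader implements BatchReader<IntBatch> {
    private final int[] source;
    private int position = 0;
    private int batchSize = 0;
    private boolean closed = false;

    ArrayReader(int[] source) {
      this.source = source;
    }

    @Override
    public void setBatchSize(int batchSize) {
      this.batchSize = batchSize;
    }

    @Override
    public IntBatch read(IntBatch reuse, int numRows) {
      // Assumption: recycle the caller's container when it can hold numRows rows,
      // otherwise allocate a new one sized to the configured batch size.
      IntBatch batch =
          (reuse != null && reuse.values.length >= numRows)
              ? reuse
              : new IntBatch(Math.max(numRows, batchSize));
      System.arraycopy(source, position, batch.values, 0, numRows);
      batch.size = numRows;
      position += numRows;
      return batch;
    }

    @Override
    public void close() {
      closed = true;
    }

    boolean isClosed() {
      return closed;
    }
  }

  // Drains totalRows rows in batches of at most batchSize and returns their sum.
  static long sumAll(ArrayReader reader, int totalRows, int batchSize) {
    reader.setBatchSize(batchSize);
    long sum = 0;
    IntBatch batch = null; // no batch to reuse on the first call
    int remaining = totalRows;
    try {
      while (remaining > 0) {
        int numRows = Math.min(batchSize, remaining);
        batch = reader.read(batch, numRows); // hand the previous batch back for reuse
        for (int i = 0; i < batch.size; i++) {
          sum += batch.values[i];
        }
        remaining -= numRows;
      }
    } finally {
      reader.close(); // release resources even if a read fails
    }
    return sum;
  }

  public static void main(String[] args) {
    int[] data = {1, 2, 3, 4, 5, 6, 7};
    System.out.println(sumAll(new ArrayReader(data), data.length, 3));
  }
}
```

In this sketch the caller must finish with a batch before calling read again, because the same container may be overwritten by the next call.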
-
setBatchSize
void setBatchSize(int batchSize)
-
setRowGroupInfo
@Deprecated
void setRowGroupInfo(org.apache.parquet.column.page.PageReadStore pages, java.util.Map<org.apache.parquet.hadoop.metadata.ColumnPath,org.apache.parquet.hadoop.metadata.ColumnChunkMetaData> metadata, long rowPosition)
Deprecated. since 1.8.0, will be removed in 1.9.0; use setRowGroupInfo(PageReadStore, Map) instead.
Sets the row group information to be used with this reader.
Parameters:
pages - row group information for all the columns
metadata - map of ColumnPath -> ColumnChunkMetaData for the row group
rowPosition - the row group's row offset in the parquet file
-
setRowGroupInfo
default void setRowGroupInfo(org.apache.parquet.column.page.PageReadStore pages, java.util.Map<org.apache.parquet.hadoop.metadata.ColumnPath,org.apache.parquet.hadoop.metadata.ColumnChunkMetaData> metadata)
Sets the row group information to be used with this reader.
Parameters:
pages - row group information for all the columns
metadata - map of ColumnPath -> ColumnChunkMetaData for the row group
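This page does not show the body of the default two-argument method. One common way to introduce a replacement overload as a default method while a deprecated abstract overload still exists is to have the new method delegate to the old one, so that existing implementations keep compiling. The sketch below illustrates that general Java pattern only; RowGroupAware and LegacyReader are hypothetical, String stands in for PageReadStore, a Map<String, String> stands in for the metadata map, and the delegated rowPosition of 0 is an assumption.

```java
import java.util.HashMap;
import java.util.Map;

public class DefaultOverloadSketch {
  // Hypothetical stand-in interface; the parameter types are simplified placeholders.
  interface RowGroupAware {
    // Older overload that implementations are still required to provide.
    @Deprecated
    void setRowGroupInfo(String pages, Map<String, String> metadata, long rowPosition);

    // Newer overload supplied as a default method.
    // Assumption: it delegates to the deprecated overload with a row position of 0.
    default void setRowGroupInfo(String pages, Map<String, String> metadata) {
      setRowGroupInfo(pages, metadata, 0L);
    }
  }

  // An implementation written against the older overload only.
  static final class LegacyReader implements RowGroupAware {
    String lastPages;
    long lastRowPosition = -1L;

    @Override
    public void setRowGroupInfo(String pages, Map<String, String> metadata, long rowPosition) {
      this.lastPages = pages;
      this.lastRowPosition = rowPosition;
    }
  }

  public static void main(String[] args) {
    LegacyReader reader = new LegacyReader();
    Map<String, String> metadata = new HashMap<>();
    metadata.put("col_a", "chunk-0");

    // Callers can already use the two-argument form; LegacyReader needs no changes.
    reader.setRowGroupInfo("row-group-0", metadata);
    System.out.println(reader.lastPages + " @ " + reader.lastRowPosition);
  }
}
```

With this arrangement, callers can migrate to the two-argument form first, and implementations can override it later, before the deprecated overload is removed.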
-
close
void close()
Release any resources allocated.