public class OrcReaderFactory extends Object implements org.apache.paimon.format.FormatReaderFactory
ColumnarRow records.| 限定符和类型 | 字段和说明 |
|---|---|
protected int |
batchSize |
protected List<OrcFilters.Predicate> |
conjunctPredicates |
protected SerializableHadoopConfigWrapper |
hadoopConfigWrapper |
protected org.apache.orc.TypeDescription |
schema |
| 构造器和说明 |
|---|
OrcReaderFactory(org.apache.hadoop.conf.Configuration hadoopConfig,
org.apache.paimon.types.RowType readType,
List<OrcFilters.Predicate> conjunctPredicates,
int batchSize) |
| 限定符和类型 | 方法和说明 |
|---|---|
static org.apache.orc.Reader |
createReader(org.apache.hadoop.conf.Configuration conf,
org.apache.paimon.fs.FileIO fileIO,
org.apache.paimon.fs.Path path) |
org.apache.paimon.format.orc.OrcReaderFactory.OrcVectorizedReader |
createReader(org.apache.paimon.format.FormatReaderFactory.Context context) |
org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch |
createReaderBatch(org.apache.paimon.fs.Path filePath,
org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch orcBatch,
org.apache.paimon.utils.Pool.Recycler<org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch> recycler)
Creates the
OrcReaderBatch structure, which is responsible for holding the data
structures that hold the batch data (column vectors, row arrays, ...) and the batch
conversion from the ORC representation to the result format. |
protected final SerializableHadoopConfigWrapper hadoopConfigWrapper
protected final org.apache.orc.TypeDescription schema
protected final List<OrcFilters.Predicate> conjunctPredicates
protected final int batchSize
public OrcReaderFactory(org.apache.hadoop.conf.Configuration hadoopConfig,
org.apache.paimon.types.RowType readType,
List<OrcFilters.Predicate> conjunctPredicates,
int batchSize)
hadoopConfig - the hadoop config for orc reader.conjunctPredicates - the filter predicates that can be evaluated.batchSize - the batch size of orc reader.public org.apache.paimon.format.orc.OrcReaderFactory.OrcVectorizedReader createReader(org.apache.paimon.format.FormatReaderFactory.Context context)
throws IOException
createReader 在接口中 org.apache.paimon.format.FormatReaderFactoryIOExceptionpublic org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch createReaderBatch(org.apache.paimon.fs.Path filePath,
org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch orcBatch,
org.apache.paimon.utils.Pool.Recycler<org.apache.paimon.format.orc.OrcReaderFactory.OrcReaderBatch> recycler)
OrcReaderBatch structure, which is responsible for holding the data
structures that hold the batch data (column vectors, row arrays, ...) and the batch
conversion from the ORC representation to the result format.public static org.apache.orc.Reader createReader(org.apache.hadoop.conf.Configuration conf,
org.apache.paimon.fs.FileIO fileIO,
org.apache.paimon.fs.Path path)
throws IOException
IOExceptionCopyright © 2023–2024 The Apache Software Foundation. All rights reserved.