public interface BulkInsertPartitioner<I> extends Serializable
The actual implementation of BulkInsertPartitioner is determined by the bulk insert
sort mode, BulkInsertSortMode, specified by
HoodieWriteConfig.BULK_INSERT_SORT_MODE (`hoodie.bulkinsert.sort.mode`).
| Modifier and Type | Method and Description |
|---|---|
boolean |
arePartitionRecordsSorted() |
default String |
getFileIdPfx(int partitionId)
Return file group id prefix for the given data partition.
|
default Option<WriteHandleFactory> |
getWriteHandleFactory(int partitionId)
Return write handle factory for the given partition.
|
I |
repartitionRecords(I records,
int outputPartitions)
Partitions the input records based on the number of output partitions as a hint.
|
static String[] |
tryPrependPartitionPathColumns(String[] columnNames,
HoodieWriteConfig config) |
I repartitionRecords(I records, int outputPartitions)
Note that, the number of output partitions may or may not be enforced, depending on the specific implementation.
records - Input Hoodie records.outputPartitions - Expected number of output partitions as a hint.boolean arePartitionRecordsSorted()
true if the records are sorted by partition-path; false otherwise.default String getFileIdPfx(int partitionId)
partitionId - data partitiondefault Option<WriteHandleFactory> getWriteHandleFactory(int partitionId)
partitionId - data partitionstatic String[] tryPrependPartitionPathColumns(String[] columnNames, HoodieWriteConfig config)
Copyright © 2023 The Apache Software Foundation. All rights reserved.