public class HDFSParquetImporter extends Object implements Serializable
| Modifier and Type | Class and Description |
|---|---|
static class |
HDFSParquetImporter.Config |
static class |
HDFSParquetImporter.FormatValidator |
| Modifier and Type | Field and Description |
|---|---|
static SimpleDateFormat |
PARTITION_FORMATTER |
| Constructor and Description |
|---|
HDFSParquetImporter(HDFSParquetImporter.Config cfg) |
| Modifier and Type | Method and Description |
|---|---|
protected org.apache.spark.api.java.JavaRDD<HoodieRecord<HoodieRecordPayload>> |
buildHoodieRecordsForImport(org.apache.spark.api.java.JavaSparkContext jsc,
String schemaStr) |
protected int |
dataImport(org.apache.spark.api.java.JavaSparkContext jsc) |
int |
dataImport(org.apache.spark.api.java.JavaSparkContext jsc,
int retry) |
protected <T extends HoodieRecordPayload> |
load(HoodieWriteClient client,
String instantTime,
org.apache.spark.api.java.JavaRDD<HoodieRecord<T>> hoodieRecords)
Imports records to Hoodie dataset
|
static void |
main(String[] args) |
public static final SimpleDateFormat PARTITION_FORMATTER
public HDFSParquetImporter(HDFSParquetImporter.Config cfg) throws IOException
IOExceptionpublic int dataImport(org.apache.spark.api.java.JavaSparkContext jsc,
int retry)
throws Exception
Exceptionprotected int dataImport(org.apache.spark.api.java.JavaSparkContext jsc)
throws IOException
IOExceptionprotected org.apache.spark.api.java.JavaRDD<HoodieRecord<HoodieRecordPayload>> buildHoodieRecordsForImport(org.apache.spark.api.java.JavaSparkContext jsc, String schemaStr) throws IOException
IOExceptionprotected <T extends HoodieRecordPayload> org.apache.spark.api.java.JavaRDD<WriteStatus> load(HoodieWriteClient client, String instantTime, org.apache.spark.api.java.JavaRDD<HoodieRecord<T>> hoodieRecords)
T - Typeclient - Hoodie ClientinstantTime - Instant TimehoodieRecords - Hoodie RecordsCopyright © 2019 The Apache Software Foundation. All rights reserved.