public class HoodieDeltaStreamer extends Object implements Serializable
HiveIncrementalPuller and apply it to the target
dataset. Does not maintain any state, queries at runtime to see how far behind the target dataset is from the source
dataset. This can be overriden to force sync from a timestamp.
In continuous mode, DeltaStreamer runs in loop-mode going through the below operations (a) pull-from-source (b)
write-to-sink (c) Schedule Compactions if needed (d) Conditionally Sync to Hive each cycle. For MOR table with
continuous mode enabled, a seperate compactor thread is allocated to execute compactions| Modifier and Type | Class and Description |
|---|---|
static class |
HoodieDeltaStreamer.AsyncCompactService
Async Compactor Service tha runs in separate thread.
|
static class |
HoodieDeltaStreamer.Config |
static class |
HoodieDeltaStreamer.DeltaSyncService
Syncs data either in single-run or in continuous mode.
|
static class |
HoodieDeltaStreamer.Operation |
| Modifier and Type | Field and Description |
|---|---|
static String |
CHECKPOINT_KEY |
| Constructor and Description |
|---|
HoodieDeltaStreamer(HoodieDeltaStreamer.Config cfg,
org.apache.spark.api.java.JavaSparkContext jssc) |
HoodieDeltaStreamer(HoodieDeltaStreamer.Config cfg,
org.apache.spark.api.java.JavaSparkContext jssc,
org.apache.hadoop.fs.FileSystem fs,
org.apache.hadoop.hive.conf.HiveConf hiveConf) |
| Modifier and Type | Method and Description |
|---|---|
HoodieDeltaStreamer.DeltaSyncService |
getDeltaSyncService() |
static void |
main(String[] args) |
void |
shutdownGracefully() |
void |
sync()
Main method to start syncing
|
public static String CHECKPOINT_KEY
public HoodieDeltaStreamer(HoodieDeltaStreamer.Config cfg, org.apache.spark.api.java.JavaSparkContext jssc) throws IOException
IOExceptionpublic HoodieDeltaStreamer(HoodieDeltaStreamer.Config cfg, org.apache.spark.api.java.JavaSparkContext jssc, org.apache.hadoop.fs.FileSystem fs, org.apache.hadoop.hive.conf.HiveConf hiveConf) throws IOException
IOExceptionpublic void shutdownGracefully()
public HoodieDeltaStreamer.DeltaSyncService getDeltaSyncService()
Copyright © 2019 The Apache Software Foundation. All rights reserved.