public static class QuickstartUtils.DataGenerator extends Object
| Constructor and Description |
|---|
DataGenerator() |
DataGenerator(String[] partitionPaths) |
| Modifier and Type | Method and Description |
|---|---|
void |
close() |
List<String> |
generateDeletes(List<org.apache.spark.sql.Row> rows)
Generates delete records for the passed in rows.
|
static org.apache.avro.generic.GenericRecord |
generateGenericRecord(String rowKey,
String riderName,
String driverName,
long timestamp) |
List<HoodieRecord> |
generateInserts(Integer n)
Generates new inserts, uniformly across the partition paths above.
|
Stream<HoodieRecord> |
generateInsertsStream(String randomString,
Integer n)
Generates new inserts, uniformly across the partition paths above.
|
static OverwriteWithLatestAvroPayload |
generateRandomValue(HoodieKey key,
String riderDriverSuffix)
Generates a new avro record of the above schema format, retaining the key if optionally provided.
|
List<HoodieRecord> |
generateUniqueUpdates(Integer n)
Generates new updates, one for each of the keys above
list
|
HoodieRecord |
generateUpdateRecord(HoodieKey key,
String randomString) |
List<HoodieRecord> |
generateUpdates(Integer n)
Generates new updates, randomly distributed across the keys above.
|
int |
getNumExistingKeys() |
public DataGenerator()
public DataGenerator(String[] partitionPaths)
public int getNumExistingKeys()
public static org.apache.avro.generic.GenericRecord generateGenericRecord(String rowKey, String riderName, String driverName, long timestamp)
public static OverwriteWithLatestAvroPayload generateRandomValue(HoodieKey key, String riderDriverSuffix) throws IOException
IOExceptionpublic Stream<HoodieRecord> generateInsertsStream(String randomString, Integer n)
public List<HoodieRecord> generateInserts(Integer n) throws IOException
IOExceptionpublic HoodieRecord generateUpdateRecord(HoodieKey key, String randomString) throws IOException
IOExceptionpublic List<HoodieRecord> generateUpdates(Integer n)
n - Number of updates (including dups)public List<HoodieRecord> generateUniqueUpdates(Integer n)
n - Number of updates (must be no more than number of existing keys)public List<String> generateDeletes(List<org.apache.spark.sql.Row> rows)
rows - List of Rows for which delete record need to be generatedpublic void close()
Copyright © 2023 The Apache Software Foundation. All rights reserved.