O - Read result data typepublic interface DeltaInputReader<O> extends Serializable
DeltaInputReader will read the configured input type and provide an RDD of records to the
client.| Modifier and Type | Method and Description |
|---|---|
org.apache.spark.api.java.JavaRDD<O> |
read(int numPartitions,
int numFiles,
double percentageRecordsPerFile) |
org.apache.spark.api.java.JavaRDD<O> |
read(int numPartitions,
int numFiles,
long approxNumRecords) |
org.apache.spark.api.java.JavaRDD<O> |
read(int numPartitions,
long approxNumRecords) |
org.apache.spark.api.java.JavaRDD<O> |
read(long approxNumRecords)
Attempts to reads an approximate number of records close to approxNumRecords.
|
org.apache.spark.api.java.JavaRDD<O> read(long approxNumRecords) throws IOException
IOExceptionorg.apache.spark.api.java.JavaRDD<O> read(int numPartitions, long approxNumRecords) throws IOException
IOException - Attempts to read approx number of records (exact if equal or more records available)
across requested number of
partitions.org.apache.spark.api.java.JavaRDD<O> read(int numPartitions, int numFiles, long approxNumRecords) throws IOException
IOException - Attempts to read approx number of records (exact if equal or more records available)
across requested number of
partitions and number of files.
1. Find numFiles across numPartitions
2. numRecordsToReadPerFile = approxNumRecords / numFilesorg.apache.spark.api.java.JavaRDD<O> read(int numPartitions, int numFiles, double percentageRecordsPerFile) throws IOException
IOException - Attempts to a % of records per file across requested number of partitions and number of files.
1. Find numFiles across numPartitions
2. numRecordsToReadPerFile = approxNumRecordsPerFile * percentageRecordsPerFileCopyright © 2023 The Apache Software Foundation. All rights reserved.