public abstract class DFSDeltaInputReader extends Object implements DeltaInputReader<org.apache.avro.generic.GenericRecord>
DeltaInputReader| Constructor and Description |
|---|
DFSDeltaInputReader() |
| Modifier and Type | Method and Description |
|---|---|
protected long |
analyzeSingleFile(String filePath)
Implementation of
DeltaInputReaders to provide a way to read a single file on DFS and provide an
average number of records across N files. |
protected List<String> |
getFilePathsToRead(String basePath,
org.apache.hadoop.fs.PathFilter filter,
long totalRecordsToRead) |
protected org.apache.hudi.common.util.collection.Pair<Integer,Integer> |
getFileStatusIndexRange(List<org.apache.hadoop.fs.FileStatus> fileStatuses,
long averageFileSize,
long totalSizeToRead) |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitread, read, read, readprotected List<String> getFilePathsToRead(String basePath, org.apache.hadoop.fs.PathFilter filter, long totalRecordsToRead) throws IOException
IOExceptionprotected org.apache.hudi.common.util.collection.Pair<Integer,Integer> getFileStatusIndexRange(List<org.apache.hadoop.fs.FileStatus> fileStatuses, long averageFileSize, long totalSizeToRead)
protected long analyzeSingleFile(String filePath)
DeltaInputReaders to provide a way to read a single file on DFS and provide an
average number of records across N files.Copyright © 2023 The Apache Software Foundation. All rights reserved.