public interface Scan
| Modifier and Type | Method and Description |
|---|---|
java.util.Optional<Expression> |
getRemainingFilter()
Get the remaining filter that is not guaranteed to be satisfied for the data Delta Kernel
returns.
|
CloseableIterator<ColumnarBatch> |
getScanFiles(TableClient tableClient)
Get an iterator of data files to scan.
|
Row |
getScanState(TableClient tableClient)
Get the scan state associated with the current scan.
|
static CloseableIterator<DataReadResult> |
readData(TableClient tableClient,
Row scanState,
CloseableIterator<Row> scanFileRowIter,
java.util.Optional<Expression> filter)
Get the data from the given scan files using the connector provided
TableClient. |
CloseableIterator<ColumnarBatch> getScanFiles(TableClient tableClient)
tableClient - TableClient instance to use in Delta Kernel.ColumnarBatchs where each row in each batch corresponds to one
scan filejava.util.Optional<Expression> getRemainingFilter()
Expression.Row getScanState(TableClient tableClient)
tableClient - TableClient instance to use in Delta Kernel.Row format.static CloseableIterator<DataReadResult> readData(TableClient tableClient, Row scanState, CloseableIterator<Row> scanFileRowIter, java.util.Optional<Expression> filter) throws java.io.IOException
TableClient.tableClient - Connector provided TableClient implementation.scanState - Scan state returned by getScanState(TableClient)scanFileRowIter - an iterator of Rows. Each Row represents one scan file
from the ColumnarBatch returned by
getScanFiles(TableClient)filter - An optional filter that can be used for data skipping while reading the
scan files.DataReadResults. Each
DataReadResult instance contains the data read and an optional selection
vector that indicates data rows as valid or invalid. It is the responsibility of the
caller to close this iterator.java.io.IOException - when error occurs while reading the data.