Type Parameters:
T - parser type
S - input split type

public interface FiniteFirehoseFactory<T extends InputRowParser,S> extends FirehoseFactory<T>

FiniteFirehoseFactory is designed for batch processing. Its implementations assume that the amount of input is limited.

| Modifier and Type | Method and Description |
|---|---|
| int | getNumSplits(): Returns the number of splits returned by getSplits(). |
| Stream<InputSplit<S>> | getSplits(): Returns a Stream for InputSplits. |
| default boolean | isSplittable(): Returns true if this FiniteFirehoseFactory supports parallel batch indexing. |
| FiniteFirehoseFactory<T,S> | withSplit(InputSplit<S> split): Returns the same FiniteFirehoseFactory but with the given InputSplit. |
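
The way these four methods fit together can be sketched as follows. This is a minimal, self-contained illustration and not Druid's implementation: `Split`, `FiniteSource`, `PathSource`, `inputs()` and `processAll` are hypothetical stand-ins invented for this sketch (the real types are `InputSplit` and `FiniteFirehoseFactory`, which also carry a parser type `T` and the inherited `connect` methods). It assumes a supervisor that asks for the splits, then hands each sub task a copy of the factory narrowed to a single split.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.Iterator;
import java.util.List;
import java.util.stream.Stream;

class SplitSketch {
    // Hypothetical stand-in for InputSplit<S>: wraps one unit of input.
    static final class Split<S> {
        private final S value;
        Split(S value) { this.value = value; }
        S get() { return value; }
    }

    // Hypothetical stand-in mirroring the four methods in the summary table.
    interface FiniteSource<S> {
        default boolean isSplittable() { return true; }
        Stream<Split<S>> getSplits() throws IOException;
        int getNumSplits() throws IOException;
        FiniteSource<S> withSplit(Split<S> split);
        // Not part of the documented interface: exposes what this source would read.
        List<S> inputs();
    }

    // Assumed example source: a fixed list of paths, one split per path.
    static final class PathSource implements FiniteSource<String> {
        private final List<String> paths;
        PathSource(List<String> paths) { this.paths = paths; }

        @Override
        public Stream<Split<String>> getSplits() {
            // Stream.map is lazy: a Split is only created when the consumer pulls it.
            return paths.stream().map(p -> new Split<String>(p));
        }

        @Override
        public int getNumSplits() { return paths.size(); }

        @Override
        public FiniteSource<String> withSplit(Split<String> split) {
            // Same kind of source, narrowed to the single given split.
            return new PathSource(Collections.singletonList(split.get()));
        }

        @Override
        public List<String> inputs() { return paths; }
    }

    // Supervisor side: each split gets its own "sub task" (here, one loop iteration).
    static List<String> processAll(FiniteSource<String> source) throws IOException {
        List<String> processed = new ArrayList<>();
        if (!source.isSplittable()) {
            // No parallel indexing: read everything in a single task.
            processed.addAll(source.inputs());
            return processed;
        }
        Iterator<Split<String>> splits = source.getSplits().iterator();
        while (splits.hasNext()) {
            FiniteSource<String> subTaskSource = source.withSplit(splits.next());
            processed.addAll(subTaskSource.inputs());
        }
        return processed;
    }

    public static void main(String[] args) throws IOException {
        FiniteSource<String> source =
            new PathSource(Arrays.asList("a.json", "b.json", "c.json"));
        System.out.println(source.getNumSplits() + " splits: " + processAll(source));
    }
}
```

Because the splits are pulled one at a time through an iterator, a source whose listing is expensive could, under the same assumptions, defer that work until each split is actually requested.
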

Methods inherited from interface FirehoseFactory: connect, connect

default boolean isSplittable()
Returns true if this FiniteFirehoseFactory supports parallel batch indexing.
Specified by: isSplittable in interface FirehoseFactory<T extends InputRowParser>

Stream<InputSplit<S>> getSplits() throws IOException
Returns a Stream for InputSplits. In parallel batch indexing, each InputSplit is processed by a sub task.
Listing splits may cause high overhead in some implementations. In that case, InputSplits should be listed lazily so that the listing overhead can be amortized.
Throws: IOException

int getNumSplits() throws IOException
Returns the number of splits returned by getSplits().
Throws: IOException

FiniteFirehoseFactory<T,S> withSplit(InputSplit<S> split)
Returns the same FiniteFirehoseFactory but with the given InputSplit. The returned FiniteFirehoseFactory is used by sub tasks in parallel batch indexing.

Copyright © 2011–2018 The Apache Software Foundation. All rights reserved.