Class TransformProcessRecordReader
- java.lang.Object
  - org.datavec.api.records.reader.impl.transform.TransformProcessRecordReader
- All Implemented Interfaces:
Closeable, Serializable, AutoCloseable, Configurable, RecordReader
public class TransformProcessRecordReader extends Object implements RecordReader
- See Also:
- Serialized Form
-
-
Field Summary
Fields (modifier and type, field):
- protected Record next
- protected RecordReader recordReader
- protected TransformProcess transformProcess
-
Fields inherited from interface org.datavec.api.records.reader.RecordReader
APPEND_LABEL, LABELS, NAME_SPACE
-
-
Constructor Summary
Constructors:
- TransformProcessRecordReader(RecordReader recordReader, TransformProcess transformProcess)
-
Method Summary
All Methods, Instance Methods, Concrete Methods (modifier and type, method: description):
- boolean batchesSupported(): This method returns true if the next(int) signature is supported by this RecordReader implementation.
- void close(): Closes this stream and releases any system resources associated with it.
- Configuration getConf(): Return the configuration used by this object.
- List<String> getLabels(): List of label strings.
- List<RecordListener> getListeners(): Get the record listeners for this record reader.
- boolean hasNext(): Whether there are any more records.
- void initialize(Configuration conf, InputSplit split): Called once at initialization.
- void initialize(InputSplit split): Called once at initialization.
- List<Record> loadFromMetaData(List<RecordMetaData> recordMetaDatas): Load multiple records from the given list of RecordMetaData instances.
- Record loadFromMetaData(RecordMetaData recordMetaData): Load a single record from the given RecordMetaData instance. Note: for data that isn't splittable (i.e., text data that needs to be scanned/split), it is more efficient to load multiple records at once using loadFromMetaData(List).
- List<Writable> next(): Get the next record.
- List<List<Writable>> next(int num): This method will be used if batchesSupported() returns true.
- Record nextRecord()
- List<Writable> record(URI uri, DataInputStream dataInputStream): Load the record from the given DataInputStream. Unlike next(), the internal state of the RecordReader is not modified. Implementations of this method should not close the DataInputStream.
- void reset(): Reset record reader iterator.
- boolean resetSupported()
- void setConf(Configuration conf): Set the configuration to be used by this object.
- void setListeners(Collection<RecordListener> listeners): Set the record listeners for this record reader.
- void setListeners(RecordListener... listeners): Set the record listeners for this record reader.
-
-
-
Field Detail
-
recordReader
protected RecordReader recordReader
-
transformProcess
protected TransformProcess transformProcess
-
next
protected Record next
-
-
Constructor Detail
-
TransformProcessRecordReader
public TransformProcessRecordReader(RecordReader recordReader, TransformProcess transformProcess)
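The constructor pairs a source RecordReader with a TransformProcess. As a rough illustration of that wrapping pattern only, the sketch below uses plain java.util types. TransformingReaderSketch, readAll and the column-dropping lambda are hypothetical stand-ins, not DataVec classes, and the actual implementation may differ.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;
import java.util.function.Function;

// Hypothetical stand-in for the wrapping pattern; NOT the DataVec implementation.
class TransformingReaderSketch implements Iterator<List<String>> {
    private final Iterator<List<String>> source;                   // plays the role of recordReader
    private final Function<List<String>, List<String>> transform;  // plays the role of transformProcess

    TransformingReaderSketch(Iterator<List<String>> source,
                             Function<List<String>, List<String>> transform) {
        this.source = source;
        this.transform = transform;
    }

    @Override
    public boolean hasNext() {
        return source.hasNext();
    }

    @Override
    public List<String> next() {
        // Read one raw record from the wrapped source, then transform it.
        return transform.apply(source.next());
    }

    // Convenience helper: pass every raw record through the transform.
    static List<List<String>> readAll(List<List<String>> raw,
                                      Function<List<String>, List<String>> transform) {
        TransformingReaderSketch reader = new TransformingReaderSketch(raw.iterator(), transform);
        List<List<String>> out = new ArrayList<>();
        while (reader.hasNext()) {
            out.add(reader.next());
        }
        return out;
    }

    public static void main(String[] args) {
        List<List<String>> raw = Arrays.asList(
                Arrays.asList("alice", "30"),
                Arrays.asList("bob", "25"));
        // Example transform (an assumption for illustration): drop the first column.
        System.out.println(readAll(raw, r -> r.subList(1, r.size())));
    }
}
```

Callers of such a wrapper only ever see transformed records, so downstream code does not need to know which transform was applied.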
-
-
Method Detail
-
initialize
public void initialize(InputSplit split) throws IOException, InterruptedException
Called once at initialization.
- Specified by:
initialize in interface RecordReader
- Parameters:
split - the split that defines the range of records to read
- Throws:
IOException
InterruptedException
-
initialize
public void initialize(Configuration conf, InputSplit split) throws IOException, InterruptedException
Called once at initialization.
- Specified by:
initialize in interface RecordReader
- Parameters:
conf - a configuration for initialization
split - the split that defines the range of records to read
- Throws:
IOException
InterruptedException
-
batchesSupported
public boolean batchesSupported()
Description copied from interface: RecordReader
This method returns true if the next(int) signature is supported by this RecordReader implementation.
- Specified by:
batchesSupported in interface RecordReader
- Returns:
-
next
public List<List<Writable>> next(int num)
Description copied from interface: RecordReader
This method will be used if batchesSupported() returns true.
- Specified by:
next in interface RecordReader
- Returns:
-
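Since next(int) is documented as being used only when batchesSupported() returns true, a caller may want a fallback that assembles a batch one record at a time. The sketch below shows one way to do that against a plain java.util.Iterator; BatchReadSketch and nextBatch are hypothetical names, not part of DataVec.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

// Hypothetical helper; an Iterator stands in for a reader without batch support.
class BatchReadSketch {
    // Collects up to num records one at a time; returns fewer if the source runs out.
    static <T> List<T> nextBatch(Iterator<T> reader, int num) {
        List<T> batch = new ArrayList<>();
        while (batch.size() < num && reader.hasNext()) {
            batch.add(reader.next());
        }
        return batch;
    }

    public static void main(String[] args) {
        Iterator<Integer> reader = Arrays.asList(1, 2, 3, 4, 5).iterator();
        // Keep requesting batches of two until the source is exhausted.
        while (reader.hasNext()) {
            System.out.println(nextBatch(reader, 2));
        }
    }
}
```

The final batch may be shorter than requested, so callers should not assume every batch has exactly num records.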
next
public List<Writable> next()
Get the next record.
- Specified by:
next in interface RecordReader
- Returns:
-
hasNext
public boolean hasNext()
Whether there are any more records.
- Specified by:
hasNext in interface RecordReader
- Returns:
-
getLabels
public List<String> getLabels()
List of label strings.
- Specified by:
getLabels in interface RecordReader
- Returns:
-
reset
public void reset()
Reset record reader iterator.
- Specified by:
reset in interface RecordReader
-
resetSupported
public boolean resetSupported()
- Specified by:
resetSupported in interface RecordReader
- Returns:
- True if the record reader can be reset, false otherwise. Note that some record readers cannot be reset - for example, if they are backed by a non-resettable input split (such as certain types of streams)
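Because some readers cannot be reset, code that makes several passes over the data can check resetSupported() before calling reset(). The sketch below illustrates that guard with a hypothetical SimpleReader interface whose method names mirror the ones documented here; it is a stand-in and does not use DataVec types.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class ResetGuardSketch {
    // Hypothetical minimal reader interface (not the DataVec RecordReader).
    interface SimpleReader {
        boolean hasNext();
        String next();
        boolean resetSupported();
        void reset();
    }

    // List-backed reader; the resettable flag simulates a non-resettable source.
    static class ListReader implements SimpleReader {
        private final List<String> data;
        private final boolean resettable;
        private int pos = 0;

        ListReader(List<String> data, boolean resettable) {
            this.data = data;
            this.resettable = resettable;
        }

        public boolean hasNext() { return pos < data.size(); }
        public String next() { return data.get(pos++); }
        public boolean resetSupported() { return resettable; }
        public void reset() {
            if (!resettable) {
                throw new UnsupportedOperationException("reset not supported");
            }
            pos = 0;
        }
    }

    // Reads up to `epochs` passes, stopping after the first pass if reset is unavailable.
    static List<String> readEpochs(SimpleReader reader, int epochs) {
        List<String> seen = new ArrayList<>();
        for (int e = 0; e < epochs; e++) {
            while (reader.hasNext()) {
                seen.add(reader.next());
            }
            if (!reader.resetSupported()) {
                break; // cannot rewind, so further passes are impossible
            }
            reader.reset();
        }
        return seen;
    }

    public static void main(String[] args) {
        List<String> data = Arrays.asList("a", "b");
        System.out.println(readEpochs(new ListReader(data, true), 2));
        System.out.println(readEpochs(new ListReader(data, false), 2));
    }
}
```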
-
record
public List<Writable> record(URI uri, DataInputStream dataInputStream) throws IOException
Load the record from the given DataInputStream. Unlike next(), the internal state of the RecordReader is not modified. Implementations of this method should not close the DataInputStream.
- Specified by:
record in interface RecordReader
- Parameters:
uri -
dataInputStream -
- Throws:
IOException - if an error occurs during reading from the input stream
-
nextRecord
public Record nextRecord()
Similar to next(), but returns a Record object, which may include metadata such as the source of the data.
- Specified by:
nextRecord in interface RecordReader
- Returns:
- next record
-
loadFromMetaData
public Record loadFromMetaData(RecordMetaData recordMetaData) throws IOException
Load a single record from the given RecordMetaData instance.
Note: for data that isn't splittable (i.e., text data that needs to be scanned/split), it is more efficient to load multiple records at once using loadFromMetaData(List).
- Specified by:
loadFromMetaData in interface RecordReader
- Parameters:
recordMetaData - Metadata for the record that we want to load from
- Returns:
- Single record for the given RecordMetaData instance
- Throws:
IOException - If an I/O error occurs during loading
-
loadFromMetaData
public List<Record> loadFromMetaData(List<RecordMetaData> recordMetaDatas) throws IOException
Load multiple records from the given list of RecordMetaData instances.
- Specified by:
loadFromMetaData in interface RecordReader
- Parameters:
recordMetaDatas - Metadata for the records that we want to load from
- Returns:
- Multiple records for the given RecordMetaData instances
- Throws:
IOException - If an I/O error occurs during loading
-
getListeners
public List<RecordListener> getListeners()
Get the record listeners for this record reader.
- Specified by:
getListeners in interface RecordReader
-
setListeners
public void setListeners(RecordListener... listeners)
Set the record listeners for this record reader.
- Specified by:
setListeners in interface RecordReader
- Parameters:
listeners -
-
setListeners
public void setListeners(Collection<RecordListener> listeners)
Set the record listeners for this record reader.
- Specified by:
setListeners in interface RecordReader
- Parameters:
listeners -
-
close
public void close() throws IOException
Closes this stream and releases any system resources associated with it. If the stream is already closed then invoking this method has no effect.
As noted in AutoCloseable.close(), cases where the close may fail require careful attention. It is strongly advised to relinquish the underlying resources and to internally mark the Closeable as closed, prior to throwing the IOException.
- Specified by:
close in interface AutoCloseable
- Specified by:
close in interface Closeable
- Throws:
IOException - if an I/O error occurs
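The Closeable contract described above (a second close has no effect) can be illustrated with a small stand-alone sketch. CountingResource and closeTwice are hypothetical names used only for this example; they say nothing about how this class actually releases its wrapped reader.

```java
import java.io.Closeable;

class CloseSketch {
    // Hypothetical resource that counts how often its resources are actually released.
    static class CountingResource implements Closeable {
        int releases = 0;
        private boolean closed = false;

        @Override
        public void close() {
            if (closed) {
                return;      // already closed: further calls have no effect
            }
            closed = true;   // mark as closed before doing any release work
            releases++;
        }
    }

    // Closes the same resource twice: once via try-with-resources, once explicitly.
    static int closeTwice() {
        CountingResource resource = new CountingResource();
        try (CountingResource auto = resource) {
            // Work with the resource here; it is closed automatically on exit.
        }
        resource.close();    // second call is a no-op
        return resource.releases;
    }

    public static void main(String[] args) {
        System.out.println(closeTwice());
    }
}
```

Any Closeable, including a RecordReader, can be managed with try-with-resources in the same way.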
-
setConf
public void setConf(Configuration conf)
Set the configuration to be used by this object.
- Specified by:
setConf in interface Configurable
- Parameters:
conf -
-
getConf
public Configuration getConf()
Return the configuration used by this object.
- Specified by:
getConf in interface Configurable
-
-