Class TransformProcessSequenceRecordReader
- java.lang.Object
-
- org.datavec.api.records.reader.impl.transform.TransformProcessSequenceRecordReader
-
- All Implemented Interfaces:
Closeable, Serializable, AutoCloseable, Configurable, RecordReader, SequenceRecordReader
public class TransformProcessSequenceRecordReader extends Object implements SequenceRecordReader
- See Also:
- Serialized Form
-
-
Field Summary
Fields (Modifier and Type, Field):
protected SequenceRecordReader sequenceRecordReader
protected TransformProcess transformProcess
Fields inherited from interface org.datavec.api.records.reader.RecordReader
APPEND_LABEL, LABELS, NAME_SPACE
-
-
Constructor Summary
Constructors:
TransformProcessSequenceRecordReader()
-
Method Summary
Methods (Modifier and Type, Method - Description):
boolean batchesSupported() - Returns true if the next(int) signature is supported by this RecordReader implementation.
void close() - Closes this stream and releases any system resources associated with it.
Configuration getConf() - Return the configuration used by this object.
List<String> getLabels() - List of label strings.
List<RecordListener> getListeners() - Get the record listeners for this record reader.
boolean hasNext() - Whether there are any more records.
void initialize(Configuration conf, InputSplit split) - Called once at initialization.
void initialize(InputSplit split) - Called once at initialization.
List<Record> loadFromMetaData(List<RecordMetaData> recordMetaDatas) - Load multiple records from the given list of RecordMetaData instances.
Record loadFromMetaData(RecordMetaData recordMetaData) - Load a single record from the given RecordMetaData instance. Note that for data that isn't splittable (i.e., text data that needs to be scanned/split), it is more efficient to load multiple records at once using loadFromMetaData(List).
List<SequenceRecord> loadSequenceFromMetaData(List<RecordMetaData> recordMetaDatas) - Load multiple sequence records from the given list of RecordMetaData instances.
SequenceRecord loadSequenceFromMetaData(RecordMetaData recordMetaData) - Load a single sequence record from the given RecordMetaData instance. Note that for data that isn't splittable (i.e., text data that needs to be scanned/split), it is more efficient to load multiple records at once using loadSequenceFromMetaData(List).
List<Writable> next() - Get the next record.
List<List<Writable>> next(int num) - This method will be used if batchesSupported() returns true.
Record nextRecord()
SequenceRecord nextSequence() - Similar to sequenceRecord(), but returns a SequenceRecord object that may include metadata such as the source of the data.
List<Writable> record(URI uri, DataInputStream dataInputStream) - Load the record from the given DataInputStream. Unlike next(), the internal state of the RecordReader is not modified. Implementations of this method should not close the DataInputStream.
void reset() - Reset record reader iterator.
boolean resetSupported()
List<List<Writable>> sequenceRecord() - Returns a sequence record.
List<List<Writable>> sequenceRecord(URI uri, DataInputStream dataInputStream) - Load a sequence record from the given DataInputStream. Unlike next(), the internal state of the RecordReader is not modified. Implementations of this method should not close the DataInputStream.
void setConf(Configuration conf) - Set the configuration to be used by this object.
void setListeners(Collection<RecordListener> listeners) - Set the record listeners for this record reader.
void setListeners(RecordListener... listeners) - Set the record listeners for this record reader.
-
-
-
Field Detail
-
sequenceRecordReader
protected SequenceRecordReader sequenceRecordReader
-
transformProcess
protected TransformProcess transformProcess
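The two fields suggest a wrapper: one reader supplies raw sequences and a transform is applied to each before it is returned. The following is only a minimal sketch of that idea using plain Java collections. `SimpleSequenceReader`, `TransformingSequenceReader`, `inMemory`, `dropFirstColumn`, and `readAll` are hypothetical names invented for illustration; they are not DataVec types, and the real class may behave differently (for example in how it handles metadata or listeners).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;
import java.util.function.Function;

class TransformingReaderSketch {

    // Hypothetical stand-in for a sequence reader; NOT the DataVec interface.
    interface SimpleSequenceReader {
        boolean hasNext();
        List<List<String>> sequenceRecord();
    }

    // Wraps another reader and applies a transform to every sequence it returns.
    static class TransformingSequenceReader implements SimpleSequenceReader {
        private final SimpleSequenceReader delegate;
        private final Function<List<List<String>>, List<List<String>>> transform;

        TransformingSequenceReader(SimpleSequenceReader delegate,
                Function<List<List<String>>, List<List<String>>> transform) {
            this.delegate = delegate;
            this.transform = transform;
        }

        @Override
        public boolean hasNext() {
            return delegate.hasNext();
        }

        @Override
        public List<List<String>> sequenceRecord() {
            // Read the raw sequence from the wrapped reader, then transform it.
            return transform.apply(delegate.sequenceRecord());
        }
    }

    // In-memory reader over a fixed list of sequences (assumption: test data only).
    static SimpleSequenceReader inMemory(List<List<List<String>>> data) {
        Iterator<List<List<String>>> it = data.iterator();
        return new SimpleSequenceReader() {
            @Override public boolean hasNext() { return it.hasNext(); }
            @Override public List<List<String>> sequenceRecord() { return it.next(); }
        };
    }

    // Example transform: drop the first column of every time step.
    static List<List<String>> dropFirstColumn(List<List<String>> sequence) {
        List<List<String>> out = new ArrayList<>();
        for (List<String> step : sequence) {
            out.add(new ArrayList<>(step.subList(1, step.size())));
        }
        return out;
    }

    // Drains a reader into a list of sequences.
    static List<List<List<String>>> readAll(SimpleSequenceReader reader) {
        List<List<List<String>>> all = new ArrayList<>();
        while (reader.hasNext()) {
            all.add(reader.sequenceRecord());
        }
        return all;
    }

    public static void main(String[] args) {
        List<List<List<String>>> data = Arrays.asList(
                Arrays.asList(Arrays.asList("id1", "a", "b"), Arrays.asList("id1", "c", "d")));
        SimpleSequenceReader reader = new TransformingSequenceReader(
                inMemory(data), TransformingReaderSketch::dropFirstColumn);
        System.out.println(readAll(reader)); // prints [[[a, b], [c, d]]]
    }
}
```

Keeping the transform separate from the source reader means the same transform can be reused over different inputs, which appears to be the motivation for pairing a reader field with a transform field.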
-
-
Method Detail
-
setConf
public void setConf(Configuration conf)
Set the configuration to be used by this object.
- Specified by:
setConf in interface Configurable
- Parameters:
conf -
-
getConf
public Configuration getConf()
Return the configuration used by this object.
- Specified by:
getConf in interface Configurable
-
sequenceRecord
public List<List<Writable>> sequenceRecord()
Returns a sequence record.
- Specified by:
sequenceRecord in interface SequenceRecordReader
- Returns:
- a sequence of records
-
batchesSupported
public boolean batchesSupported()
Description copied from interface: RecordReader
Returns true if the next(int) signature is supported by this RecordReader implementation.
- Specified by:
batchesSupported in interface RecordReader
- Returns:
-
next
public List<List<Writable>> next(int num)
Description copied from interface: RecordReader
This method will be used if batchesSupported() returns true.
- Specified by:
next in interface RecordReader
- Returns:
-
sequenceRecord
public List<List<Writable>> sequenceRecord(URI uri, DataInputStream dataInputStream) throws IOException
Load a sequence record from the given DataInputStream. Unlike next(), the internal state of the RecordReader is not modified. Implementations of this method should not close the DataInputStream.
- Specified by:
sequenceRecord in interface SequenceRecordReader
- Parameters:
uri -
dataInputStream -
- Throws:
IOException - if an error occurs while reading from the input stream
-
nextSequence
public SequenceRecord nextSequence()
Similar to sequenceRecord(), but returns a SequenceRecord object that may include metadata such as the source of the data.
- Specified by:
nextSequence in interface SequenceRecordReader
- Returns:
- next sequence record
-
loadSequenceFromMetaData
public SequenceRecord loadSequenceFromMetaData(RecordMetaData recordMetaData) throws IOException
Load a single sequence record from the given RecordMetaData instance.
Note that for data that isn't splittable (i.e., text data that needs to be scanned/split), it is more efficient to load multiple records at once using loadSequenceFromMetaData(List).
- Specified by:
loadSequenceFromMetaData in interface SequenceRecordReader
- Parameters:
recordMetaData - Metadata for the sequence record that we want to load from
- Returns:
- Single sequence record for the given RecordMetaData instance
- Throws:
IOException - If an I/O error occurs during loading
-
loadSequenceFromMetaData
public List<SequenceRecord> loadSequenceFromMetaData(List<RecordMetaData> recordMetaDatas) throws IOException
Load multiple sequence records from the given list of RecordMetaData instances.
- Specified by:
loadSequenceFromMetaData in interface SequenceRecordReader
- Parameters:
recordMetaDatas - Metadata for the records that we want to load from
- Returns:
- Multiple sequence records for the given RecordMetaData instances
- Throws:
IOException - If an I/O error occurs during loading
-
initialize
public void initialize(InputSplit split) throws IOException, InterruptedException
Called once at initialization.
- Specified by:
initialize in interface RecordReader
- Parameters:
split - the split that defines the range of records to read
- Throws:
IOException
InterruptedException
-
initialize
public void initialize(Configuration conf, InputSplit split) throws IOException, InterruptedException
Called once at initialization.
- Specified by:
initialize in interface RecordReader
- Parameters:
conf - a configuration for initialization
split - the split that defines the range of records to read
- Throws:
IOException
InterruptedException
-
next
public List<Writable> next()
Get the next record.
- Specified by:
next in interface RecordReader
- Returns:
-
hasNext
public boolean hasNext()
Whether there are any more records.
- Specified by:
hasNext in interface RecordReader
- Returns:
-
getLabels
public List<String> getLabels()
List of label strings.
- Specified by:
getLabels in interface RecordReader
- Returns:
-
reset
public void reset()
Reset record reader iterator.
- Specified by:
reset in interface RecordReader
-
resetSupported
public boolean resetSupported()
- Specified by:
resetSupported in interface RecordReader
- Returns:
- True if the record reader can be reset, false otherwise. Note that some record readers cannot be reset - for example, if they are backed by a non-resettable input split (such as certain types of streams)
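Because some readers cannot be reset, calling code that makes several passes over the data may want to check resetSupported() before calling reset(). The sketch below illustrates that guard with a hypothetical `SimpleReader` interface that only mirrors the method names documented here; it is not the DataVec RecordReader, and `overList`, `overIterator`, and `countOverEpochs` are invented for this example.

```java
import java.util.Iterator;
import java.util.List;

class ResetGuardSketch {

    // Hypothetical minimal contract, modelled on hasNext()/next()/reset()/resetSupported().
    interface SimpleReader {
        boolean hasNext();
        String next();
        void reset();
        boolean resetSupported();
    }

    // List-backed reader: resetting simply restarts iteration over the list.
    static SimpleReader overList(List<String> data) {
        return new SimpleReader() {
            private Iterator<String> it = data.iterator();
            @Override public boolean hasNext() { return it.hasNext(); }
            @Override public String next() { return it.next(); }
            @Override public void reset() { it = data.iterator(); }
            @Override public boolean resetSupported() { return true; }
        };
    }

    // One-shot reader: stands in for a source that cannot be replayed, such as a stream.
    static SimpleReader overIterator(Iterator<String> it) {
        return new SimpleReader() {
            @Override public boolean hasNext() { return it.hasNext(); }
            @Override public String next() { return it.next(); }
            @Override public void reset() {
                throw new UnsupportedOperationException("reset not supported");
            }
            @Override public boolean resetSupported() { return false; }
        };
    }

    // Counts records over up to `epochs` passes, stopping after the first pass
    // if the reader reports that it cannot be reset.
    static int countOverEpochs(SimpleReader reader, int epochs) {
        int count = 0;
        for (int epoch = 0; epoch < epochs; epoch++) {
            while (reader.hasNext()) {
                reader.next();
                count++;
            }
            if (!reader.resetSupported()) {
                break;
            }
            reader.reset();
        }
        return count;
    }
}
```

With a three-element list and two epochs, the list-backed reader yields six records in total, while the one-shot reader yields three and stops without throwing.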
-
record
public List<Writable> record(URI uri, DataInputStream dataInputStream) throws IOException
Load the record from the given DataInputStream. Unlike next(), the internal state of the RecordReader is not modified. Implementations of this method should not close the DataInputStream.
- Specified by:
record in interface RecordReader
- Parameters:
uri -
dataInputStream -
- Throws:
IOException - if an error occurs while reading from the input stream
-
nextRecord
public Record nextRecord()
Similar to next(), but returns a Record object that may include metadata such as the source of the data.
- Specified by:
nextRecord in interface RecordReader
- Returns:
- next record
-
loadFromMetaData
public Record loadFromMetaData(RecordMetaData recordMetaData) throws IOException
Load a single record from the given RecordMetaData instance.
Note that for data that isn't splittable (i.e., text data that needs to be scanned/split), it is more efficient to load multiple records at once using loadFromMetaData(List).
- Specified by:
loadFromMetaData in interface RecordReader
- Parameters:
recordMetaData - Metadata for the record that we want to load from
- Returns:
- Single record for the given RecordMetaData instance
- Throws:
IOException - If an I/O error occurs during loading
-
loadFromMetaData
public List<Record> loadFromMetaData(List<RecordMetaData> recordMetaDatas) throws IOException
Load multiple records from the given list of RecordMetaData instances.
- Specified by:
loadFromMetaData in interface RecordReader
- Parameters:
recordMetaDatas - Metadata for the records that we want to load from
- Returns:
- Multiple records for the given RecordMetaData instances
- Throws:
IOException - If an I/O error occurs during loading
-
getListeners
public List<RecordListener> getListeners()
Get the record listeners for this record reader.
- Specified by:
getListeners in interface RecordReader
-
setListeners
public void setListeners(RecordListener... listeners)
Set the record listeners for this record reader.
- Specified by:
setListeners in interface RecordReader
- Parameters:
listeners -
-
setListeners
public void setListeners(Collection<RecordListener> listeners)
Set the record listeners for this record reader.
- Specified by:
setListeners in interface RecordReader
- Parameters:
listeners -
-
close
public void close() throws IOException
Closes this stream and releases any system resources associated with it. If the stream is already closed then invoking this method has no effect.
As noted in AutoCloseable.close(), cases where the close may fail require careful attention. It is strongly advised to relinquish the underlying resources and to internally mark the Closeable as closed, prior to throwing the IOException.
- Specified by:
close in interface AutoCloseable
- Specified by:
close in interface Closeable
- Throws:
IOException - if an I/O error occurs
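The close() contract above (a repeated close has no effect, and the resource is marked closed before any failure is reported) can be illustrated with a small sketch built only on java.io.Closeable. `CountingResource` and `useAndClose` are hypothetical names for this example and say nothing about how this reader implements close() internally.

```java
import java.io.Closeable;

class CloseSketch {

    // Hypothetical resource that counts how often its underlying handle is released.
    static class CountingResource implements Closeable {
        private boolean closed = false;
        private int releases = 0;

        // Narrowed override: this sketch never throws, so no 'throws IOException' is declared.
        @Override
        public void close() {
            if (closed) {
                return; // already closed: invoking close() again has no effect
            }
            closed = true; // mark as closed first, so a failure below could not cause a second release
            releases++;    // stands in for relinquishing the underlying resource
        }

        boolean isClosed() { return closed; }
        int releaseCount() { return releases; }
    }

    // Uses the resource in try-with-resources, then closes it a second time on purpose.
    static int useAndClose(CountingResource resource) {
        try (CountingResource r = resource) {
            // ... read from r here ...
        }
        resource.close(); // second call is a no-op
        return resource.releaseCount();
    }
}
```

Since the class implements AutoCloseable, a try-with-resources block is usually the simplest way to make sure close() is called even when reading fails.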
-
-