- generateProjectionSchema(Schema, Map<String, Schema.Field>, List<String>) - Static method in class org.apache.hudi.hadoop.utils.HoodieRealtimeRecordReaderUtils
-
Generate a reader schema off the provided writeSchema, to just project out the provided columns.
- getAffectedPartitions(List<HoodieInstant>, HoodieTableMetaClient, HoodieTimeline, List<Path>) - Static method in class org.apache.hudi.hadoop.utils.HoodieInputFormatUtils
-
Extract partitions touched by the commitsToCheck.
- getBasePath() - Method in class org.apache.hudi.hadoop.realtime.HoodieRealtimeBootstrapBaseFileSplit
-
- getBasePath() - Method in class org.apache.hudi.hadoop.realtime.HoodieRealtimeFileSplit
-
- getBasePath() - Method in class org.apache.hudi.hadoop.realtime.HoodieRealtimePath
-
- getBasePath() - Method in interface org.apache.hudi.hadoop.realtime.RealtimeSplit
-
Return Base Path of the dataset.
- getBelongsToIncrementalQuery() - Method in class org.apache.hudi.hadoop.realtime.HoodieRealtimeBootstrapBaseFileSplit
-
- getBelongsToIncrementalQuery() - Method in class org.apache.hudi.hadoop.realtime.HoodieRealtimeFileSplit
-
- getBelongsToIncrementalQuery() - Method in class org.apache.hudi.hadoop.realtime.HoodieRealtimePath
-
- getBelongsToIncrementalQuery() - Method in interface org.apache.hudi.hadoop.realtime.RealtimeSplit
-
Returns the flag whether this split belongs to an Incremental Query
- getBootstrapFileSplit() - Method in class org.apache.hudi.hadoop.BootstrapBaseFileSplit
-
- getBootstrapFileStatus() - Method in class org.apache.hudi.hadoop.PathWithBootstrapFileStatus
-
- getCommitMetadata(HoodieInstant, HoodieTimeline) - Static method in class org.apache.hudi.hadoop.utils.HoodieInputFormatUtils
-
Returns the commit metadata of the given instant.
- getCommitsForIncrementalQuery(Job, String, HoodieTimeline) - Static method in class org.apache.hudi.hadoop.utils.HoodieInputFormatUtils
-
Get commits for incremental query from Hive map reduce configuration.
- getConf() - Method in class org.apache.hudi.hadoop.HoodieParquetInputFormatBase
-
- getConf() - Method in class org.apache.hudi.hadoop.HoodieROTablePathFilter
-
- getConf() - Method in class org.apache.hudi.hadoop.HoodieTableInputFormat
-
- getDeltaLogFiles() - Method in class org.apache.hudi.hadoop.realtime.HoodieRealtimeBootstrapBaseFileSplit
-
- getDeltaLogFiles() - Method in class org.apache.hudi.hadoop.realtime.HoodieRealtimeFileSplit
-
- getDeltaLogFiles() - Method in class org.apache.hudi.hadoop.realtime.HoodieRealtimePath
-
- getDeltaLogFiles() - Method in interface org.apache.hudi.hadoop.realtime.RealtimeSplit
-
- getDeltaLogFiles() - Method in class org.apache.hudi.hadoop.RealtimeFileStatus
-
- getDeltaLogPaths() - Method in interface org.apache.hudi.hadoop.realtime.RealtimeSplit
-
Return Log File Paths.
- getFileStatus(HoodieBaseFile) - Static method in class org.apache.hudi.hadoop.utils.HoodieInputFormatUtils
-
- getFileStatusUnchecked(HoodieBaseFile) - Static method in class org.apache.hudi.hadoop.HoodieCopyOnWriteTableInputFormat
-
- getFilteredCommitsTimeline(JobContext, HoodieTableMetaClient) - Static method in class org.apache.hudi.hadoop.utils.HoodieInputFormatUtils
-
Extract HoodieTimeline based on HoodieTableMetaClient.
- getGroupedIncrementalPaths() - Method in class org.apache.hudi.hadoop.InputPathHandler
-
- getHiveSchema() - Method in class org.apache.hudi.hadoop.realtime.AbstractRealtimeRecordReader
-
- getHoodieTimelineForIncrementalQuery(JobContext, String, HoodieTimeline) - Static method in class org.apache.hudi.hadoop.utils.HoodieInputFormatUtils
-
Get HoodieTimeline for incremental query from Hive map reduce configuration.
- getHoodieVirtualKeyInfo(HoodieTableMetaClient) - Static method in class org.apache.hudi.hadoop.HoodieCopyOnWriteTableInputFormat
-
- getIncrementalTableNames(JobContext) - Static method in class org.apache.hudi.hadoop.utils.HoodieHiveUtils
-
Returns a list of tableNames for which hoodie.
.consume.mode is set to incremental else returns empty List
- getInputFormat(HoodieFileFormat, boolean, Configuration) - Static method in class org.apache.hudi.hadoop.utils.HoodieInputFormatUtils
-
- getInputFormat(String, boolean, Configuration) - Static method in class org.apache.hudi.hadoop.utils.HoodieInputFormatUtils
-
- getInputFormatClassName(HoodieFileFormat, boolean) - Static method in class org.apache.hudi.hadoop.utils.HoodieInputFormatUtils
-
- getInputPathsShim(JobConf) - Method in class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat.HoodieCombineFileInputFormatShim
-
- getInputSplitShim() - Method in class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat.CombineHiveInputSplit
-
- getInputSplitShim() - Method in class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat.HoodieCombineFileInputFormatShim
-
- getIOColumnNameAndTypes(Configuration) - Static method in class org.apache.hudi.hadoop.HoodieColumnProjectionUtils
-
- getIOColumns(Configuration) - Static method in class org.apache.hudi.hadoop.HoodieColumnProjectionUtils
-
- getIOColumnTypes(Configuration) - Static method in class org.apache.hudi.hadoop.HoodieColumnProjectionUtils
-
- getJob() - Method in class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat.CombineHiveInputSplit
-
- getJobConf() - Method in class org.apache.hudi.hadoop.realtime.AbstractRealtimeRecordReader
-
- getLength() - Method in class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat.CombineHiveInputSplit
-
- getLength(int) - Method in class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat.CombineHiveInputSplit
-
Returns the length of the ith Path.
- getLength() - Method in class org.apache.hudi.hadoop.hive.HoodieCombineRealtimeFileSplit
-
- getLength(int) - Method in class org.apache.hudi.hadoop.hive.HoodieCombineRealtimeFileSplit
-
Returns the length of the ith Path.
- getLength() - Method in interface org.apache.hudi.hadoop.realtime.RealtimeSplit
-
The number of bytes in the file to process.
- getLengths() - Method in class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat.CombineHiveInputSplit
-
Returns an array containing the lengths of the files in the split.
- getLengths() - Method in class org.apache.hudi.hadoop.hive.HoodieCombineRealtimeFileSplit
-
Returns an array containing the lengths of the files in the split.
- getLocations() - Method in class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat.CombineHiveInputSplit
-
Returns all the Paths where this input-split resides.
- getLocations() - Method in class org.apache.hudi.hadoop.hive.HoodieCombineRealtimeFileSplit
-
Returns all the Paths where this input-split resides.
- getMaxCommit(JobConf, String) - Static method in class org.apache.hudi.hadoop.utils.HoodieHiveUtils
-
- getMaxCommitTime() - Method in class org.apache.hudi.hadoop.realtime.HoodieRealtimeBootstrapBaseFileSplit
-
- getMaxCommitTime() - Method in class org.apache.hudi.hadoop.realtime.HoodieRealtimeFileSplit
-
- getMaxCommitTime() - Method in class org.apache.hudi.hadoop.realtime.HoodieRealtimePath
-
- getMaxCommitTime() - Method in interface org.apache.hudi.hadoop.realtime.RealtimeSplit
-
Return Max Instant Time.
- getMaxCommitTime() - Method in class org.apache.hudi.hadoop.RealtimeFileStatus
-
- getMaxCompactionMemoryInBytes(JobConf) - Static method in class org.apache.hudi.hadoop.utils.HoodieRealtimeRecordReaderUtils
-
get the max compaction memory in bytes from JobConf.
- getNameToFieldMap(Schema) - Static method in class org.apache.hudi.hadoop.utils.HoodieRealtimeRecordReaderUtils
-
- getNonCombinablePathIndices(JobConf, Path[], int) - Method in class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat
-
Gets all the path indices that should not be combined.
- getNonHoodieInputPaths() - Method in class org.apache.hudi.hadoop.InputPathHandler
-
- getNthParent(Path, int) - Static method in class org.apache.hudi.hadoop.utils.HoodieHiveUtils
-
Gets the n'th parent for the Path.
- getNumPaths() - Method in class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat.CombineHiveInputSplit
-
Returns the number of Paths in the split.
- getNumPaths() - Method in class org.apache.hudi.hadoop.hive.HoodieCombineRealtimeFileSplit
-
Returns the number of Paths in the split.
- getOffset(int) - Method in class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat.CombineHiveInputSplit
-
Returns the start offset of the ith Path.
- getOffset(int) - Method in class org.apache.hudi.hadoop.hive.HoodieCombineRealtimeFileSplit
-
Returns the start offset of the ith Path.
- getOutputFormatClassName(HoodieFileFormat) - Static method in class org.apache.hudi.hadoop.utils.HoodieInputFormatUtils
-
- getParquetInputFormatClassName() - Method in class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat
-
- getParquetRealtimeInputFormatClassName() - Method in class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat
-
- getPartitionPathField() - Method in class org.apache.hudi.hadoop.realtime.HoodieVirtualKeyInfo
-
- getPartitionPathFieldIndex() - Method in class org.apache.hudi.hadoop.realtime.HoodieVirtualKeyInfo
-
- getPath() - Method in class org.apache.hudi.hadoop.FileStatusWithBootstrapBaseFile
-
- getPath(int) - Method in class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat.CombineHiveInputSplit
-
Returns the ith Path.
- getPath(int) - Method in class org.apache.hudi.hadoop.hive.HoodieCombineRealtimeFileSplit
-
Returns the ith Path.
- getPath() - Method in class org.apache.hudi.hadoop.LocatedFileStatusWithBootstrapBaseFile
-
- getPath() - Method in interface org.apache.hudi.hadoop.realtime.RealtimeSplit
-
The file containing this split's data.
- getPath() - Method in class org.apache.hudi.hadoop.RealtimeFileStatus
-
- getPaths() - Method in class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat.CombineHiveInputSplit
-
Returns all the Paths in the split.
- getPaths() - Method in class org.apache.hudi.hadoop.hive.HoodieCombineRealtimeFileSplit
-
Returns all the Paths in the split.
- getPathWithBootstrapFileStatus() - Method in class org.apache.hudi.hadoop.realtime.HoodieRealtimePath
-
- getPos() - Method in class org.apache.hudi.hadoop.BootstrapColumnStichingRecordReader
-
- getPos() - Method in class org.apache.hudi.hadoop.HoodieHFileRecordReader
-
- getPos() - Method in class org.apache.hudi.hadoop.realtime.HoodieCombineRealtimeRecordReader
-
- getPos() - Method in class org.apache.hudi.hadoop.realtime.HoodieEmptyRecordReader
-
- getPos() - Method in class org.apache.hudi.hadoop.realtime.HoodieRealtimeRecordReader
-
- getPos() - Method in class org.apache.hudi.hadoop.SafeParquetRecordReaderWrapper
-
- getProgress() - Method in class org.apache.hudi.hadoop.BootstrapColumnStichingRecordReader
-
- getProgress() - Method in class org.apache.hudi.hadoop.HoodieHFileRecordReader
-
- getProgress() - Method in class org.apache.hudi.hadoop.realtime.HoodieCombineRealtimeRecordReader
-
- getProgress() - Method in class org.apache.hudi.hadoop.realtime.HoodieEmptyRecordReader
-
- getProgress() - Method in class org.apache.hudi.hadoop.realtime.HoodieRealtimeRecordReader
-
- getProgress() - Method in class org.apache.hudi.hadoop.SafeParquetRecordReaderWrapper
-
- getReadColumnIDs(Configuration) - Static method in class org.apache.hudi.hadoop.HoodieColumnProjectionUtils
-
Returns an array of column ids(start from zero) which is set in the given
parameter conf.
- getReadColumnNames(Configuration) - Static method in class org.apache.hudi.hadoop.HoodieColumnProjectionUtils
-
- getReader() - Method in class org.apache.hudi.hadoop.realtime.HoodieRealtimeRecordReader
-
- getReaderSchema() - Method in class org.apache.hudi.hadoop.realtime.AbstractRealtimeRecordReader
-
- getRealtimeFileSplits() - Method in class org.apache.hudi.hadoop.hive.HoodieCombineRealtimeFileSplit
-
- getRecordKeyField() - Method in class org.apache.hudi.hadoop.realtime.HoodieVirtualKeyInfo
-
- getRecordKeyFieldIndex() - Method in class org.apache.hudi.hadoop.realtime.HoodieVirtualKeyInfo
-
- getRecordReader(InputSplit, JobConf, Reporter) - Method in class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat
-
Create a generic Hive RecordReader than can iterate over all chunks in a CombinedFileSplit.
- getRecordReader(InputSplit, JobConf, Reporter) - Method in class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat.HoodieCombineFileInputFormatShim
-
- getRecordReader(JobConf, CombineFileSplit, Reporter, Class<RecordReader<K, V>>) - Method in class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat.HoodieCombineFileInputFormatShim
-
- getRecordReader(InputSplit, JobConf, Reporter) - Method in class org.apache.hudi.hadoop.HoodieCopyOnWriteTableInputFormat
-
- getRecordReader(InputSplit, JobConf, Reporter) - Method in class org.apache.hudi.hadoop.HoodieHFileInputFormat
-
- getRecordReader(InputSplit, JobConf, Reporter) - Method in class org.apache.hudi.hadoop.HoodieParquetInputFormat
-
- getRecordReader(InputSplit, JobConf, Reporter) - Method in class org.apache.hudi.hadoop.realtime.HoodieHFileRealtimeInputFormat
-
- getRecordReader(InputSplit, JobConf, Reporter) - Method in class org.apache.hudi.hadoop.realtime.HoodieParquetRealtimeInputFormat
-
- getSerDeClassName(HoodieFileFormat) - Static method in class org.apache.hudi.hadoop.utils.HoodieInputFormatUtils
-
- getSnapshotPaths() - Method in class org.apache.hudi.hadoop.InputPathHandler
-
- getSplit() - Method in class org.apache.hudi.hadoop.realtime.AbstractRealtimeRecordReader
-
- getSplits(JobConf, int) - Method in class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat
-
Create Hive splits based on CombineFileSplit.
- getSplits(JobConf, int) - Method in class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat.HoodieCombineFileInputFormatShim
-
- getSplits(JobConf, int) - Method in class org.apache.hudi.hadoop.HoodieParquetInputFormatBase
-
- getSplits(JobConf, int) - Method in class org.apache.hudi.hadoop.realtime.HoodieMergeOnReadTableInputFormat
-
- getStart() - Method in interface org.apache.hudi.hadoop.realtime.RealtimeSplit
-
The position of the first byte in the file to process.
- getStartOffsets() - Method in class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat.CombineHiveInputSplit
-
Returns an array containing the startoffsets of the files in the split.
- getStartOffsets() - Method in class org.apache.hudi.hadoop.hive.HoodieCombineRealtimeFileSplit
-
Returns an array containing the start offsets of the files in the split.
- getTableMetaClientByPartitionPath(Configuration, Set<Path>) - Static method in class org.apache.hudi.hadoop.utils.HoodieInputFormatUtils
-
Extract HoodieTableMetaClient by partition path.
- getTableMetaClientForBasePathUnchecked(Configuration, Path) - Static method in class org.apache.hudi.hadoop.utils.HoodieInputFormatUtils
-
Extract HoodieTableMetaClient from a partition path (not base path)
- getTableMetaClientMap() - Method in class org.apache.hudi.hadoop.InputPathHandler
-
- getVirtualKeyInfo() - Method in class org.apache.hudi.hadoop.realtime.HoodieRealtimeBootstrapBaseFileSplit
-
- getVirtualKeyInfo() - Method in class org.apache.hudi.hadoop.realtime.HoodieRealtimeFileSplit
-
- getVirtualKeyInfo() - Method in class org.apache.hudi.hadoop.realtime.HoodieRealtimePath
-
- getVirtualKeyInfo() - Method in interface org.apache.hudi.hadoop.realtime.RealtimeSplit
-
Returns Virtual key info if meta fields are disabled.
- getWritePartitionPaths(List<HoodieCommitMetadata>) - Static method in class org.apache.hudi.hadoop.utils.HoodieInputFormatUtils
-
Returns all the incremental write partition paths as a set with the given commits metadata.
- getWriterSchema() - Method in class org.apache.hudi.hadoop.realtime.AbstractRealtimeRecordReader
-
- GLOBALLY_CONSISTENT_READ_TIMESTAMP - Static variable in class org.apache.hudi.hadoop.utils.HoodieHiveUtils
-
- groupFileStatusForSnapshotPaths(FileStatus[], String, Collection<HoodieTableMetaClient>) - Static method in class org.apache.hudi.hadoop.utils.HoodieInputFormatUtils
-
Takes in a list of filesStatus and a list of table metadatas.
- groupSnapshotPathsByMetaClient(Collection<HoodieTableMetaClient>, List<Path>) - Static method in class org.apache.hudi.hadoop.utils.HoodieInputFormatUtils
-
- hasNext() - Method in class org.apache.hudi.hadoop.RecordReaderValueIterator
-
- HiveHoodieTableFileIndex - Class in org.apache.hudi.hadoop
-
- HiveHoodieTableFileIndex(HoodieEngineContext, HoodieTableMetaClient, TypedProperties, HoodieTableQueryType, List<Path>, Option<String>, boolean) - Constructor for class org.apache.hudi.hadoop.HiveHoodieTableFileIndex
-
- HOODIE_COMMIT_TIME_COL_POS - Static variable in class org.apache.hudi.hadoop.utils.HoodieInputFormatUtils
-
- HOODIE_CONSUME_COMMIT - Static variable in class org.apache.hudi.hadoop.utils.HoodieHiveUtils
-
- HOODIE_CONSUME_MODE_PATTERN - Static variable in class org.apache.hudi.hadoop.utils.HoodieHiveUtils
-
- HOODIE_CONSUME_MODE_PATTERN_STRING - Static variable in class org.apache.hudi.hadoop.utils.HoodieHiveUtils
-
- HOODIE_CONSUME_PENDING_COMMITS - Static variable in class org.apache.hudi.hadoop.utils.HoodieHiveUtils
-
- HOODIE_INCREMENTAL_USE_DATABASE - Static variable in class org.apache.hudi.hadoop.utils.HoodieHiveUtils
-
- HOODIE_MAX_COMMIT_PATTERN - Static variable in class org.apache.hudi.hadoop.utils.HoodieHiveUtils
-
- HOODIE_PARTITION_PATH_COL_POS - Static variable in class org.apache.hudi.hadoop.utils.HoodieInputFormatUtils
-
- HOODIE_READ_COLUMNS_PROP - Static variable in class org.apache.hudi.hadoop.utils.HoodieInputFormatUtils
-
- HOODIE_RECORD_KEY_COL_POS - Static variable in class org.apache.hudi.hadoop.utils.HoodieInputFormatUtils
-
- HOODIE_START_COMMIT_PATTERN - Static variable in class org.apache.hudi.hadoop.utils.HoodieHiveUtils
-
- HOODIE_STOP_AT_COMPACTION_PATTERN - Static variable in class org.apache.hudi.hadoop.utils.HoodieHiveUtils
-
- HoodieColumnProjectionUtils - Class in org.apache.hudi.hadoop
-
Utility functions copied from Hive ColumnProjectionUtils.java.
- HoodieColumnProjectionUtils() - Constructor for class org.apache.hudi.hadoop.HoodieColumnProjectionUtils
-
- HoodieCombineFileInputFormatShim() - Constructor for class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat.HoodieCombineFileInputFormatShim
-
- HoodieCombineHiveInputFormat<K extends org.apache.hadoop.io.WritableComparable,V extends org.apache.hadoop.io.Writable> - Class in org.apache.hudi.hadoop.hive
-
This is just a copy of the org.apache.hadoop.hive.ql.io.CombineHiveInputFormat from Hive 2.x Search for **MOD** to
see minor modifications to support custom inputformat in CombineHiveInputFormat.
- HoodieCombineHiveInputFormat() - Constructor for class org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat
-
- HoodieCombineHiveInputFormat.AvoidSplitCombination - Interface in org.apache.hudi.hadoop.hive
-
This is a marker interface that is used to identify the formats where combine split generation is not applicable.
- HoodieCombineHiveInputFormat.CombineHiveInputSplit - Class in org.apache.hudi.hadoop.hive
-
CombineHiveInputSplit encapsulates an InputSplit with its corresponding inputFormatClassName.
- HoodieCombineHiveInputFormat.HoodieCombineFileInputFormatShim<K,V> - Class in org.apache.hudi.hadoop.hive
-
**MOD** This is the implementation of CombineFileInputFormat which is a copy of
org.apache.hadoop.hive.shims.HadoopShimsSecure.CombineFileInputFormatShim with changes in listStatus.
- HoodieCombineRealtimeFileSplit - Class in org.apache.hudi.hadoop.hive
-
Represents a CombineFileSplit for realtime tables.
- HoodieCombineRealtimeFileSplit() - Constructor for class org.apache.hudi.hadoop.hive.HoodieCombineRealtimeFileSplit
-
- HoodieCombineRealtimeFileSplit(JobConf, List<FileSplit>) - Constructor for class org.apache.hudi.hadoop.hive.HoodieCombineRealtimeFileSplit
-
- HoodieCombineRealtimeFileSplit.Builder - Class in org.apache.hudi.hadoop.hive
-
- HoodieCombineRealtimeHiveSplit - Class in org.apache.hudi.hadoop.hive
-
Represents a CombineHiveInputSplit for realtime tables.
- HoodieCombineRealtimeHiveSplit() - Constructor for class org.apache.hudi.hadoop.hive.HoodieCombineRealtimeHiveSplit
-
- HoodieCombineRealtimeHiveSplit(JobConf, CombineFileSplit, Map<Path, PartitionDesc>) - Constructor for class org.apache.hudi.hadoop.hive.HoodieCombineRealtimeHiveSplit
-
- HoodieCombineRealtimeRecordReader - Class in org.apache.hudi.hadoop.realtime
-
Allows to read multiple realtime file splits grouped together by CombineInputFormat.
- HoodieCombineRealtimeRecordReader(JobConf, CombineFileSplit, List<RecordReader>) - Constructor for class org.apache.hudi.hadoop.realtime.HoodieCombineRealtimeRecordReader
-
- HoodieCopyOnWriteTableInputFormat - Class in org.apache.hudi.hadoop
-
Base implementation of the Hive's FileInputFormat allowing for reading of Hudi's
Copy-on-Write (COW) tables in various configurations:
Snapshot mode: reading table's state as of particular timestamp (or instant, in Hudi's terms)
Incremental mode: reading table's state as of particular timestamp (or instant, in Hudi's terms)
External mode: reading non-Hudi partitions
NOTE: This class is invariant of the underlying file-format of the files being read
- HoodieCopyOnWriteTableInputFormat() - Constructor for class org.apache.hudi.hadoop.HoodieCopyOnWriteTableInputFormat
-
- HoodieEmptyRecordReader - Class in org.apache.hudi.hadoop.realtime
-
Dummy record for log only realtime split.
- HoodieEmptyRecordReader(RealtimeSplit, JobConf) - Constructor for class org.apache.hudi.hadoop.realtime.HoodieEmptyRecordReader
-
- HoodieHFileInputFormat - Class in org.apache.hudi.hadoop
-
HoodieInputFormat for HUDI datasets which store data in HFile base file format.
- HoodieHFileInputFormat() - Constructor for class org.apache.hudi.hadoop.HoodieHFileInputFormat
-
- HoodieHFileRealtimeInputFormat - Class in org.apache.hudi.hadoop.realtime
-
HoodieRealtimeInputFormat for HUDI datasets which store data in HFile base file format.
- HoodieHFileRealtimeInputFormat() - Constructor for class org.apache.hudi.hadoop.realtime.HoodieHFileRealtimeInputFormat
-
- HoodieHFileRecordReader - Class in org.apache.hudi.hadoop
-
- HoodieHFileRecordReader(Configuration, InputSplit, JobConf) - Constructor for class org.apache.hudi.hadoop.HoodieHFileRecordReader
-
- HoodieHiveUtils - Class in org.apache.hudi.hadoop.utils
-
- HoodieHiveUtils() - Constructor for class org.apache.hudi.hadoop.utils.HoodieHiveUtils
-
- HoodieInputFormat - Class in com.uber.hoodie.hadoop
-
Temporary class to allow seamless migration of com.uber.hoodie to org.apache.hudi .
- HoodieInputFormat() - Constructor for class com.uber.hoodie.hadoop.HoodieInputFormat
-
- HoodieInputFormatUtils - Class in org.apache.hudi.hadoop.utils
-
- HoodieInputFormatUtils() - Constructor for class org.apache.hudi.hadoop.utils.HoodieInputFormatUtils
-
- HoodieMergeOnReadTableInputFormat - Class in org.apache.hudi.hadoop.realtime
-
Base implementation of the Hive's FileInputFormat allowing for reading of Hudi's
Merge-on-Read (COW) tables in various configurations:
Snapshot mode: reading table's state as of particular timestamp (or instant, in Hudi's terms)
Incremental mode: reading table's state as of particular timestamp (or instant, in Hudi's terms)
External mode: reading non-Hudi partitions
- HoodieMergeOnReadTableInputFormat() - Constructor for class org.apache.hudi.hadoop.realtime.HoodieMergeOnReadTableInputFormat
-
- HoodieParquetInputFormat - Class in org.apache.hudi.hadoop
-
HoodieInputFormat which understands the Hoodie File Structure and filters files based on the Hoodie Mode.
- HoodieParquetInputFormat() - Constructor for class org.apache.hudi.hadoop.HoodieParquetInputFormat
-
- HoodieParquetInputFormat(HoodieCopyOnWriteTableInputFormat) - Constructor for class org.apache.hudi.hadoop.HoodieParquetInputFormat
-
- HoodieParquetInputFormatBase - Class in org.apache.hudi.hadoop
-
!!! PLEASE READ CAREFULLY !!!
NOTE: Hive bears optimizations which are based upon validating whether FileInputFormat
implementation inherits from MapredParquetInputFormat.
- HoodieParquetInputFormatBase(HoodieCopyOnWriteTableInputFormat) - Constructor for class org.apache.hudi.hadoop.HoodieParquetInputFormatBase
-
- HoodieParquetRealtimeInputFormat - Class in org.apache.hudi.hadoop.realtime
-
Input Format, that provides a real-time view of data in a Hoodie table.
- HoodieParquetRealtimeInputFormat() - Constructor for class org.apache.hudi.hadoop.realtime.HoodieParquetRealtimeInputFormat
-
- HoodieParquetSerde - Class in org.apache.hudi.hadoop.realtime
-
Simply extends ParquetHiveSerDe.
- HoodieParquetSerde() - Constructor for class org.apache.hudi.hadoop.realtime.HoodieParquetSerde
-
- HoodieRealtimeBootstrapBaseFileSplit - Class in org.apache.hudi.hadoop.realtime
-
- HoodieRealtimeBootstrapBaseFileSplit() - Constructor for class org.apache.hudi.hadoop.realtime.HoodieRealtimeBootstrapBaseFileSplit
-
NOTE: This ctor is necessary for Hive to be able to serialize and
then instantiate it when deserializing back
- HoodieRealtimeBootstrapBaseFileSplit(FileSplit, String, List<HoodieLogFile>, String, FileSplit, boolean, Option<HoodieVirtualKeyInfo>) - Constructor for class org.apache.hudi.hadoop.realtime.HoodieRealtimeBootstrapBaseFileSplit
-
- HoodieRealtimeConfig - Class in org.apache.hudi.hadoop.config
-
Class to hold props related to Hoodie RealtimeInputFormat and RealtimeRecordReader.
- HoodieRealtimeConfig() - Constructor for class org.apache.hudi.hadoop.config.HoodieRealtimeConfig
-
- HoodieRealtimeFileSplit - Class in org.apache.hudi.hadoop.realtime
-
FileSplit implementation that holds
Split corresponding to the base file
List of
HoodieLogFile that holds the delta to be merged (upon reading)
This split is correspondent to a single file-slice in the Hudi terminology.
- HoodieRealtimeFileSplit() - Constructor for class org.apache.hudi.hadoop.realtime.HoodieRealtimeFileSplit
-
- HoodieRealtimeFileSplit(FileSplit, HoodieRealtimePath) - Constructor for class org.apache.hudi.hadoop.realtime.HoodieRealtimeFileSplit
-
- HoodieRealtimeFileSplit(FileSplit, String, List<HoodieLogFile>, String, boolean, Option<HoodieVirtualKeyInfo>) - Constructor for class org.apache.hudi.hadoop.realtime.HoodieRealtimeFileSplit
-
- HoodieRealtimeInputFormat - Class in com.uber.hoodie.hadoop.realtime
-
Temporary class to allow seamless migration of com.uber.hoodie to org.apache.hudi .
- HoodieRealtimeInputFormat() - Constructor for class com.uber.hoodie.hadoop.realtime.HoodieRealtimeInputFormat
-
- HoodieRealtimeInputFormatUtils - Class in org.apache.hudi.hadoop.utils
-
- HoodieRealtimeInputFormatUtils() - Constructor for class org.apache.hudi.hadoop.utils.HoodieRealtimeInputFormatUtils
-
- HoodieRealtimePath - Class in org.apache.hudi.hadoop.realtime
-
Path implementation encoding additional information necessary to appropriately read
base files of the MOR tables, such as list of delta log files (holding updated records) associated
w/ the base file, etc.
- HoodieRealtimePath(Path, String, String, List<HoodieLogFile>, String, boolean, Option<HoodieVirtualKeyInfo>) - Constructor for class org.apache.hudi.hadoop.realtime.HoodieRealtimePath
-
- HoodieRealtimeRecordReader - Class in org.apache.hudi.hadoop.realtime
-
Realtime Record Reader which can do compacted (merge-on-read) record reading or unmerged reading (parquet and log
files read in parallel) based on job configuration.
- HoodieRealtimeRecordReader(RealtimeSplit, JobConf, RecordReader<NullWritable, ArrayWritable>) - Constructor for class org.apache.hudi.hadoop.realtime.HoodieRealtimeRecordReader
-
- HoodieRealtimeRecordReaderUtils - Class in org.apache.hudi.hadoop.utils
-
- HoodieRealtimeRecordReaderUtils() - Constructor for class org.apache.hudi.hadoop.utils.HoodieRealtimeRecordReaderUtils
-
- HoodieROTablePathFilter - Class in org.apache.hudi.hadoop
-
Given a path is a part of - Hoodie table = accepts ONLY the latest version of each path - Non-Hoodie table = then
always accept
- HoodieROTablePathFilter() - Constructor for class org.apache.hudi.hadoop.HoodieROTablePathFilter
-
- HoodieROTablePathFilter(Configuration) - Constructor for class org.apache.hudi.hadoop.HoodieROTablePathFilter
-
- HoodieTableInputFormat - Class in org.apache.hudi.hadoop
-
Abstract base class of the Hive's FileInputFormat implementations allowing for reading of Hudi's
Copy-on-Write (COW) and Merge-on-Read (MOR) tables
- HoodieTableInputFormat() - Constructor for class org.apache.hudi.hadoop.HoodieTableInputFormat
-
- HoodieVirtualKeyInfo - Class in org.apache.hudi.hadoop.realtime
-
Class to hold virtual key info when meta fields are disabled.
- HoodieVirtualKeyInfo(String, Option<String>, int, Option<Integer>) - Constructor for class org.apache.hudi.hadoop.realtime.HoodieVirtualKeyInfo
-