public abstract class VectorMapJoinCommonOperator extends MapJoinOperator implements VectorizationOperator, VectorizationContextRegion
Operator.Counter, Operator.OperatorFunc, Operator.StatefirstSmallTable, fullOuterBigTableRetainSize, hashMapRowGetters, isFullOuterMapJoin, isTestingNoHashTableLoad, loader, mapJoinTables, matchTracker, spilledMapJoinTablesemptyList, joinKeys, joinKeysObjectInspectors, posBigTablealias, aliasFilterTags, aliasFilterTagsNext, condn, countAfterReport, dummyObj, dummyObjVectors, filterMaps, filterTags, forwardCache, heartbeatInterval, intermediate, joinFilterObjectInspectors, joinFilters, joinValues, joinValuesObjectInspectors, joinValuesStandardObjectInspectors, needsPostEvaluation, noOuterJoin, NOTSKIPBIGTABLE, nullsafes, numAliases, offsets, order, residualJoinFilters, residualJoinFiltersOIs, rowContainerPostFilteredOuterJoin, rowContainerStandardObjectInspectors, skipVectors, spillTableDesc, totalSzabortOp, asyncInitOperations, cContext, childOperators, childOperatorsArray, childOperatorsTag, conf, CONTEXT_NAME_KEY, done, groupKeyObject, HIVE_COUNTER_CREATED_DYNAMIC_PARTITIONS, HIVE_COUNTER_CREATED_FILES, HIVE_COUNTER_FATAL, id, inputObjInspectors, numRows, operatorId, out, outputObjInspector, parentOperators, reporter, runTimeNumRows, state, statsMap| Modifier | Constructor and Description |
|---|---|
protected |
VectorMapJoinCommonOperator()
Kryo ctor.
|
|
VectorMapJoinCommonOperator(CompilationOpContext ctx) |
|
VectorMapJoinCommonOperator(CompilationOpContext ctx,
OperatorDesc conf,
VectorizationContext vContext,
VectorDesc vectorDesc) |
| Modifier and Type | Method and Description |
|---|---|
protected void |
commonSetup() |
protected void |
completeInitializationOp(Object[] os)
This method can be used to retrieve the results from async operations
started at init time - before the operator pipeline is started.
|
protected void |
determineCommonInfo(boolean isOuter) |
protected void |
displayBatchColumns(VectorizedRowBatch batch,
String batchName) |
void |
firstBatchSetup(VectorizedRowBatch batch) |
protected HashTableLoader |
getHashTableLoader(org.apache.hadoop.conf.Configuration hconf)
This override lets us substitute our own fast vectorized hash table loader.
|
VectorizationContext |
getInputVectorizationContext() |
protected abstract String |
getLoggingPrefix() |
protected String |
getLoggingPrefix(String className) |
VectorizationContext |
getOutputVectorizationContext() |
OperatorType |
getType()
Return the type of the specific operator among the
types in OperatorType.
|
VectorDesc |
getVectorDesc() |
void |
hashTableSetup() |
protected void |
initializeOp(org.apache.hadoop.conf.Configuration hconf)
Operator specific initialization.
|
protected void |
initLoggingPrefix(String className) |
void |
process(Object row,
int tag)
Process the row.
|
abstract void |
processBatch(VectorizedRowBatch batch) |
void |
setTestMapJoinTableContainer(int posSmallTable,
MapJoinTableContainer testMapJoinTableContainer,
MapJoinTableContainerSerDe mapJoinTableContainerSerDe) |
protected VectorizedRowBatch |
setupOverflowBatch() |
protected void |
setupVOutContext(List<String> outputColumnNames)
Setup the vectorized row batch description of the output of the native vectorized map join
operator.
|
canSkipJoinProcessing, canSkipReload, cleanUpInputFileChangedOp, closeOp, endGroup, generateFullOuterSmallTableNoMatches, generateMapMetaData, getCacheKey, getName, getOperatorName, getRefKey, getValueObjectInspectors, isInputFileChangeSensitive, loadHashTable, reloadHashTable, reProcessBigTable, setMapJoinKey, setMapJoinKeyNoNulls, spillBigTableRow, startGroupaddToAliasFilterTags, checkAndGenObject, getFilteredValue, getFilterTag, getNextSize, getPosToAliasMap, hasFilter, internalForward, opAllowedAfterMapJoin, opAllowedBeforeMapJoin, reportProgress, setPosToAliasMapabort, acceptLimitPushdown, allInitializedParentsAreClosed, areAllParentsInitialized, augmentPlan, batchNeedsClone, cleanUpInputFileChanged, clone, cloneOp, cloneRecursiveChildren, close, columnNamesRowResolvedCanBeObtained, createDummy, defaultEndGroup, defaultStartGroup, dump, dump, flush, flushRecursive, forward, getAdditionalCounters, getChildOperators, getChildren, getColumnExprMap, getCompilationOpContext, getConf, getConfiguration, getCounterName, getDone, getExecContext, getGroupKeyObject, getIdentifier, getInputObjInspectors, getIsReduceSink, getMarker, getNextCntr, getNumChild, getNumParent, getOperatorId, getOpTraits, getOutputObjInspector, getParentOperators, getReduceOutputName, getSchema, getStatistics, getStats, initEvaluators, initEvaluators, initEvaluatorsAndReturnStruct, initialize, initialize, initializeChildren, initializeLocalWork, initOperatorId, isUseBucketizedHiveInputFormat, jobClose, jobCloseOp, logicalEquals, logicalEqualsTree, logStats, opAllowedBeforeSortMergeJoin, opAllowedConvertMapJoin, passExecContext, preorderMap, processGroup, removeChild, removeChildAndAdoptItsChildren, removeParent, removeParents, replaceChild, replaceParent, replaceTabAlias, reset, setAlias, setChildOperators, setColumnExprMap, setCompilationOpContext, setConf, setDone, setExecContext, setGroupKeyObject, setInputContext, setInputObjInspectors, setMarker, setNextVectorBatchGroupStatus, setOpTraits, setOutputCollector, setParentOperators, setReporter, setSchema, setStatistics, setUseBucketizedHiveInputFormat, supportAutomaticSortMergeJoin, supportSkewJoinOptimization, supportUnionRemoveOptimization, toString, toString, toString, vectorForwardprotected transient String loggingPrefix
protected VectorMapJoinDesc vectorDesc
protected VectorMapJoinInfo vectorMapJoinInfo
protected boolean isOuterJoin
protected byte posSingleVectorMapJoinSmallTable
protected VectorizationContext vContext
protected VectorizationContext vOutContext
protected VectorMapJoinDesc.VectorMapJoinVariation vectorMapJoinVariation
protected VectorMapJoinDesc.HashTableKind hashTableKind
protected VectorMapJoinDesc.HashTableKeyType hashTableKeyType
protected int[] outputProjection
protected TypeInfo[] outputTypeInfos
protected VectorExpression[] bigTableFilterExpressions
protected VectorExpression[] bigTableKeyExpressions
protected VectorExpression[] bigTableValueExpressions
protected int[] bigTableKeyColumnMap
protected String[] bigTableKeyColumnNames
protected TypeInfo[] bigTableKeyTypeInfos
protected int[] bigTableValueColumnMap
protected String[] bigTableValueColumnNames
protected TypeInfo[] bigTableValueTypeInfos
protected int[] bigTableRetainColumnMap
protected TypeInfo[] bigTableRetainTypeInfos
protected int[] nonOuterSmallTableKeyColumnMap
protected TypeInfo[] nonOuterSmallTableKeyTypeInfos
protected VectorColumnOutputMapping outerSmallTableKeyMapping
protected VectorColumnSourceMapping fullOuterSmallTableKeyMapping
protected VectorColumnSourceMapping smallTableValueMapping
protected VectorColumnSourceMapping projectionMapping
protected int[] outerSmallTableKeyColumnMap
protected int[] smallTableValueColumnMap
protected int[] bigTableByteColumnVectorColumns
protected int[] nonOuterSmallTableKeyByteColumnVectorColumns
protected int[] outerSmallTableKeyByteColumnVectorColumns
protected int[] smallTableByteColumnVectorColumns
protected transient boolean useOverflowRepeatedThreshold
protected transient int overflowRepeatedThreshold
protected transient VectorCopyRow bigTableRetainedVectorCopy
protected int[] allSmallTableKeyColumnNums
protected boolean[] allSmallTableKeyColumnIncluded
protected transient VectorDeserializeRow<BinarySortableDeserializeRead> smallTableKeyOuterVectorDeserializeRow
protected transient VectorCopyRow nonOuterSmallTableKeyVectorCopy
protected transient VectorCopyRow outerSmallTableKeyVectorCopy
protected transient VectorDeserializeRow<LazyBinaryDeserializeRead> smallTableValueVectorDeserializeRow
protected transient VectorizedRowBatch overflowBatch
protected transient VectorizedRowBatch spillReplayBatch
protected transient boolean needCommonSetup
protected transient boolean needFirstBatchSetup
protected transient boolean needHashTableSetup
protected transient VectorMapJoinHashTable vectorMapJoinHashTable
protected transient long batchCounter
protected transient long rowCounter
protected VectorMapJoinCommonOperator()
public VectorMapJoinCommonOperator(CompilationOpContext ctx)
public VectorMapJoinCommonOperator(CompilationOpContext ctx, OperatorDesc conf, VectorizationContext vContext, VectorDesc vectorDesc) throws HiveException
HiveExceptionprotected abstract String getLoggingPrefix()
protected void initLoggingPrefix(String className)
protected void determineCommonInfo(boolean isOuter)
throws HiveException
HiveExceptionprotected void setupVOutContext(List<String> outputColumnNames)
protected HashTableLoader getHashTableLoader(org.apache.hadoop.conf.Configuration hconf)
getHashTableLoader in class MapJoinOperatorprotected void initializeOp(org.apache.hadoop.conf.Configuration hconf)
throws HiveException
OperatorinitializeOp in class MapJoinOperatorHiveExceptionprotected void completeInitializationOp(Object[] os) throws HiveException
OperatorcompleteInitializationOp in class MapJoinOperatorHiveExceptionpublic void setTestMapJoinTableContainer(int posSmallTable,
MapJoinTableContainer testMapJoinTableContainer,
MapJoinTableContainerSerDe mapJoinTableContainerSerDe)
setTestMapJoinTableContainer in class MapJoinOperatorprotected VectorizedRowBatch setupOverflowBatch() throws HiveException
HiveExceptionprotected void commonSetup()
throws HiveException
HiveExceptionpublic void firstBatchSetup(VectorizedRowBatch batch) throws HiveException
HiveExceptionpublic void hashTableSetup()
throws HiveException
HiveExceptionpublic abstract void processBatch(VectorizedRowBatch batch) throws HiveException
HiveExceptionpublic void process(Object row, int tag) throws HiveException
Operatorprocess in class MapJoinOperatorrow - The object representing the row.tag - The tag of the row usually means which parent this row comes from.
Rows with the same tag should have exactly the same rowInspector
all the time.HiveExceptionprotected void displayBatchColumns(VectorizedRowBatch batch, String batchName)
public OperatorType getType()
OperatorgetType in class MapJoinOperatorpublic VectorizationContext getInputVectorizationContext()
getInputVectorizationContext in interface VectorizationOperatorpublic VectorDesc getVectorDesc()
getVectorDesc in interface VectorizationOperatorpublic VectorizationContext getOutputVectorizationContext()
getOutputVectorizationContext in interface VectorizationContextRegionCopyright © 2024 The Apache Software Foundation. All rights reserved.