public interface LlapIo<T>
| Modifier and Type | Method and Description |
|---|---|
void |
close() |
long |
evictEntity(LlapDaemonProtocolProtos.EvictEntityRequestProto protoRequest)
Handles request to evict entities specified in the request object.
|
LlapDaemonProtocolProtos.CacheEntryList |
fetchCachedContentInfo()
Extract and return the cache content metadata.
|
org.apache.hadoop.mapred.InputFormat<org.apache.hadoop.io.NullWritable,T> |
getInputFormat(org.apache.hadoop.mapred.InputFormat<?,?> sourceInputFormat,
Deserializer serde) |
String |
getMemoryInfo() |
org.apache.orc.impl.OrcTail |
getOrcTailFromCache(org.apache.hadoop.fs.Path path,
org.apache.hadoop.conf.Configuration conf,
CacheTag tag,
Object fileKey)
Returns a deserialized OrcTail instance associated with the ORC file on the given path.
|
org.apache.hadoop.hive.common.io.encoded.MemoryBufferOrBuffers |
getParquetFooterBuffersFromCache(org.apache.hadoop.fs.Path path,
org.apache.hadoop.mapred.JobConf conf,
Object fileKey)
Returns the metadata buffers associated with the Parquet file on the given path.
|
void |
initCacheOnlyInputFormat(org.apache.hadoop.mapred.InputFormat<?,?> inputFormat) |
org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.NullWritable,org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch> |
llapVectorizedOrcReaderForPath(Object fileKey,
org.apache.hadoop.fs.Path path,
CacheTag tag,
List<Integer> tableIncludedCols,
org.apache.hadoop.mapred.JobConf conf,
long offset,
long length,
org.apache.hadoop.mapred.Reporter reporter)
Creates an LLAP record reader for a given file, by creating a split from this file, and passing it into a new
LLAP record reader.
|
void |
loadDataIntoCache(LlapDaemonProtocolProtos.CacheEntryList metadata)
Load the actual data into the cache based on the provided metadata.
|
long |
purge()
purge is best effort and will just release the buffers that are unlocked (refCount == 0).
|
boolean |
usingLowLevelCache() |
org.apache.hadoop.mapred.InputFormat<org.apache.hadoop.io.NullWritable,T> getInputFormat(org.apache.hadoop.mapred.InputFormat<?,?> sourceInputFormat, Deserializer serde)
void close()
String getMemoryInfo()
long purge()
org.apache.orc.impl.OrcTail getOrcTailFromCache(org.apache.hadoop.fs.Path path,
org.apache.hadoop.conf.Configuration conf,
CacheTag tag,
@Nullable
Object fileKey)
throws IOException
path - Orc file pathconf - jobConftag - a CacheTag instance must be provided as that's needed for cache insertionfileKey - fileId of the ORC file (either the Long fileId of HDFS or the SyntheticFileId).
Optional, if it is not provided, it will be generated, see:
org.apache.hadoop.hive.ql.io.HdfsUtils#getFileId()IOException - exorg.apache.hadoop.hive.common.io.encoded.MemoryBufferOrBuffers getParquetFooterBuffersFromCache(org.apache.hadoop.fs.Path path, org.apache.hadoop.mapred.JobConf conf, @Nullable Object fileKey) throws IOException
path - Parquet file pathconf - jobConffileKey - fileId of the Parquet file (either the Long fileId of HDFS or the SyntheticFileId).
Optional, if it is not provided, it will be generated, see:
org.apache.hadoop.hive.ql.io.HdfsUtils#getFileId()IOExceptionlong evictEntity(LlapDaemonProtocolProtos.EvictEntityRequestProto protoRequest)
protoRequest - lists Hive entities (DB, table, etc..) whose LLAP buffers should be evicted.void initCacheOnlyInputFormat(org.apache.hadoop.mapred.InputFormat<?,?> inputFormat)
org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.NullWritable,org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch> llapVectorizedOrcReaderForPath(Object fileKey, org.apache.hadoop.fs.Path path, CacheTag tag, List<Integer> tableIncludedCols, org.apache.hadoop.mapred.JobConf conf, long offset, long length, org.apache.hadoop.mapred.Reporter reporter) throws IOException
fileKey - - file ID, if null it will be determined during read but for additional performance costpath - - path for the file to readtag - - cache tag associated with this file (required for LLAP administrative purposes)tableIncludedCols - - list of column #'s to be read from the fileconf - - job conf for this read. Schema serialized herein should be aligned with tableIncludedColsoffset - - required offset to start reading fromlength - - required reading lengthIOExceptionLlapDaemonProtocolProtos.CacheEntryList fetchCachedContentInfo()
void loadDataIntoCache(LlapDaemonProtocolProtos.CacheEntryList metadata)
boolean usingLowLevelCache()
Copyright © 2024 The Apache Software Foundation. All rights reserved.