public class ExampleOutputFormat extends ParquetOutputFormat<Group>
must be provided the schema up front
ParquetOutputFormat.JobSummaryLevelADAPTIVE_BLOOM_FILTER_ENABLED, BLOCK_SIZE, BLOOM_FILTER_CANDIDATES_NUMBER, BLOOM_FILTER_ENABLED, BLOOM_FILTER_EXPECTED_NDV, BLOOM_FILTER_FPP, BLOOM_FILTER_MAX_BYTES, COLUMN_INDEX_TRUNCATE_LENGTH, COMPRESSION, DICTIONARY_PAGE_SIZE, ENABLE_DICTIONARY, ENABLE_JOB_SUMMARY, ESTIMATE_PAGE_SIZE_CHECK, JOB_SUMMARY_LEVEL, MAX_PADDING_BYTES, MAX_ROW_COUNT_FOR_PAGE_SIZE_CHECK, MEMORY_POOL_RATIO, MIN_MEMORY_ALLOCATION, MIN_ROW_COUNT_FOR_PAGE_SIZE_CHECK, PAGE_ROW_COUNT_LIMIT, PAGE_SIZE, PAGE_VALUE_COUNT_THRESHOLD, PAGE_WRITE_CHECKSUM_ENABLED, STATISTICS_TRUNCATE_LENGTH, VALIDATION, WRITE_SUPPORT_CLASS, WRITER_VERSION| Constructor and Description |
|---|
ExampleOutputFormat() |
| Modifier and Type | Method and Description |
|---|---|
static MessageType |
getSchema(org.apache.hadoop.mapreduce.Job job)
retrieve the schema from the conf
|
static void |
setSchema(org.apache.hadoop.mapreduce.Job job,
MessageType schema)
set the schema being written to the job conf
|
createEncryptionProperties, getAdaptiveBloomFilterEnabled, getBlockSize, getBlockSize, getBloomFilterEnabled, getBloomFilterMaxBytes, getCompression, getCompression, getDictionaryPageSize, getDictionaryPageSize, getEnableDictionary, getEnableDictionary, getEstimatePageSizeCheck, getJobSummaryLevel, getLongBlockSize, getMaxRowCountForPageSizeCheck, getMemoryManager, getMinRowCountForPageSizeCheck, getOutputCommitter, getPageSize, getPageSize, getPageWriteChecksumEnabled, getRecordWriter, getRecordWriter, getRecordWriter, getRecordWriter, getRecordWriter, getRecordWriter, getValidation, getValidation, getValueCountThreshold, getWriterVersion, getWriteSupport, getWriteSupportClass, isCompressionSet, isCompressionSet, setBlockSize, setColumnIndexTruncateLength, setColumnIndexTruncateLength, setCompression, setDictionaryPageSize, setEnableDictionary, setMaxPaddingSize, setMaxPaddingSize, setPageRowCountLimit, setPageRowCountLimit, setPageSize, setPageWriteChecksumEnabled, setPageWriteChecksumEnabled, setStatisticsTruncateLength, setValidation, setValidation, setWriteSupportClass, setWriteSupportClasscheckOutputSpecs, getCompressOutput, getDefaultWorkFile, getOutputCompressorClass, getOutputName, getOutputPath, getPathForWorkFile, getUniqueFile, getWorkOutputPath, setCompressOutput, setOutputCompressorClass, setOutputName, setOutputPathpublic static void setSchema(org.apache.hadoop.mapreduce.Job job,
MessageType schema)
job - a jobschema - the schema of the datapublic static MessageType getSchema(org.apache.hadoop.mapreduce.Job job)
job - a jobCopyright © 2023 The Apache Software Foundation. All rights reserved.