public class RewriteOptions extends Object
ParquetRewriter. See RewriteOptions.Builder for options description.| Modifier and Type | Class and Description |
|---|---|
static class |
RewriteOptions.Builder
Builder for
RewriteOptions which is used for constructing ParquetRewriter. |
| Modifier and Type | Method and Description |
|---|---|
org.apache.hadoop.conf.Configuration |
getConf()
Gets the
Configuration part of the rewrite options. |
List<String> |
getEncryptColumns() |
FileEncryptionProperties |
getFileEncryptionProperties() |
boolean |
getIgnoreJoinFilesMetadata() |
IndexCache.CacheStrategy |
getIndexCacheStrategy() |
List<org.apache.hadoop.fs.Path> |
getInputFiles()
Gets the input
Paths for the rewrite if they exist for all input files,
otherwise throws a RuntimeException. |
List<org.apache.hadoop.fs.Path> |
getInputFilesToJoin()
Gets the input
Paths for the rewrite if they exist for all input files to join,
otherwise throws a RuntimeException. |
Map<String,MaskMode> |
getMaskColumns() |
CompressionCodecName |
getNewCodecName() |
org.apache.hadoop.fs.Path |
getOutputFile()
Get the
Path for the rewrite if it exists, otherwise throws a RuntimeException. |
boolean |
getOverwriteInputWithJoinColumns() |
ParquetConfiguration |
getParquetConfiguration()
Gets the
ParquetConfiguration part of the rewrite options. |
List<InputFile> |
getParquetInputFiles()
Gets the
InputFiles for the rewrite. |
List<InputFile> |
getParquetInputFilesToJoin()
Gets the right
InputFiles to join during the rewrite. |
OutputFile |
getParquetOutputFile()
Get the
OutputFile for the rewrite. |
List<String> |
getPruneColumns() |
Map<String,String> |
getRenameColumns() |
public org.apache.hadoop.conf.Configuration getConf()
Configuration part of the rewrite options.Configurationpublic ParquetConfiguration getParquetConfiguration()
ParquetConfiguration part of the rewrite options.ParquetConfigurationpublic List<org.apache.hadoop.fs.Path> getInputFiles()
Paths for the rewrite if they exist for all input files,
otherwise throws a RuntimeException.List of the associated input Pathspublic List<org.apache.hadoop.fs.Path> getInputFilesToJoin()
Paths for the rewrite if they exist for all input files to join,
otherwise throws a RuntimeException.List of the associated input Paths to joinpublic List<InputFile> getParquetInputFiles()
InputFiles for the rewrite.public List<InputFile> getParquetInputFilesToJoin()
InputFiles to join during the rewrite.public org.apache.hadoop.fs.Path getOutputFile()
Path for the rewrite if it exists, otherwise throws a RuntimeException.Path if it existspublic OutputFile getParquetOutputFile()
OutputFile for the rewrite.OutputFilepublic CompressionCodecName getNewCodecName()
public FileEncryptionProperties getFileEncryptionProperties()
public IndexCache.CacheStrategy getIndexCacheStrategy()
public boolean getOverwriteInputWithJoinColumns()
public boolean getIgnoreJoinFilesMetadata()
Copyright © 2024 The Apache Software Foundation. All rights reserved.