Package org.apache.beam.runners.spark
Interface SparkCommonPipelineOptions
-
- All Superinterfaces:
org.apache.beam.sdk.options.ApplicationNameOptions,org.apache.beam.sdk.options.FileStagingOptions,org.apache.beam.sdk.transforms.display.HasDisplayData,org.apache.beam.sdk.options.PipelineOptions,org.apache.beam.sdk.options.StreamingOptions
- All Known Subinterfaces:
SparkContextOptions,SparkPipelineOptions,SparkPortableStreamingPipelineOptions,SparkStructuredStreamingPipelineOptions,TestSparkPipelineOptions
public interface SparkCommonPipelineOptions extends org.apache.beam.sdk.options.PipelineOptions, org.apache.beam.sdk.options.StreamingOptions, org.apache.beam.sdk.options.ApplicationNameOptions, org.apache.beam.sdk.options.FileStagingOptionsSpark runnerPipelineOptionshandles Spark execution-related configurations, such as the master address, and other user-related knobs.
-
-
Nested Class Summary
Nested Classes Modifier and Type Interface Description static classSparkCommonPipelineOptions.StorageLevelFactoryReturns Spark's default storage level for the Dataset or RDD API based on the respective runner.static classSparkCommonPipelineOptions.TmpCheckpointDirFactoryReturns the default checkpoint directory of /tmp/${job.name}.-
Nested classes/interfaces inherited from interface org.apache.beam.sdk.options.PipelineOptions
org.apache.beam.sdk.options.PipelineOptions.AtomicLongFactory, org.apache.beam.sdk.options.PipelineOptions.CheckEnabled, org.apache.beam.sdk.options.PipelineOptions.DirectRunner, org.apache.beam.sdk.options.PipelineOptions.JobNameFactory, org.apache.beam.sdk.options.PipelineOptions.UserAgentFactory
-
-
Field Summary
Fields Modifier and Type Field Description static java.lang.StringDEFAULT_MASTER_URL
-
Method Summary
All Methods Static Methods Instance Methods Abstract Methods Modifier and Type Method Description java.lang.StringgetCheckpointDir()java.lang.BooleangetEnableSparkMetricSinks()java.lang.StringgetSparkMaster()java.lang.StringgetStorageLevel()static voidprepareFilesToStage(SparkCommonPipelineOptions options)Classpath contains non jar files (eg.voidsetCheckpointDir(java.lang.String checkpointDir)voidsetEnableSparkMetricSinks(java.lang.Boolean enableSparkMetricSinks)voidsetSparkMaster(java.lang.String master)voidsetStorageLevel(java.lang.String storageLevel)-
Methods inherited from interface org.apache.beam.sdk.options.ApplicationNameOptions
getAppName, setAppName
-
Methods inherited from interface org.apache.beam.sdk.options.FileStagingOptions
getFilesToStage, setFilesToStage
-
Methods inherited from interface org.apache.beam.sdk.transforms.display.HasDisplayData
populateDisplayData
-
-
-
-
Field Detail
-
DEFAULT_MASTER_URL
static final java.lang.String DEFAULT_MASTER_URL
- See Also:
- Constant Field Values
-
-
Method Detail
-
getSparkMaster
@String("local[4]") java.lang.String getSparkMaster()
-
setSparkMaster
void setSparkMaster(java.lang.String master)
-
getCheckpointDir
@InstanceFactory(TmpCheckpointDirFactory.class) java.lang.String getCheckpointDir()
-
setCheckpointDir
void setCheckpointDir(java.lang.String checkpointDir)
-
getStorageLevel
@InstanceFactory(StorageLevelFactory.class) java.lang.String getStorageLevel()
-
setStorageLevel
void setStorageLevel(java.lang.String storageLevel)
-
getEnableSparkMetricSinks
@Boolean(true) java.lang.Boolean getEnableSparkMetricSinks()
-
setEnableSparkMetricSinks
void setEnableSparkMetricSinks(java.lang.Boolean enableSparkMetricSinks)
-
prepareFilesToStage
@Internal static void prepareFilesToStage(SparkCommonPipelineOptions options)
Classpath contains non jar files (eg. directories with .class files or empty directories) will cause exception in running log. Though theSparkContextcan handle this when running in local master, it's better not to include non-jars files in classpath.
-
-