Package org.apache.druid.indexer
Class DeterminePartitionsJob
- java.lang.Object
-
- org.apache.druid.indexer.DeterminePartitionsJob
-
- All Implemented Interfaces:
org.apache.druid.indexer.Jobby
public class DeterminePartitionsJob extends Object implements org.apache.druid.indexer.Jobby
Determines appropriate ShardSpecs for a job by determining whether or not partitioning is necessary, and if so, choosing the best dimension that satisfies the criteria:- Must have exactly one value per row.
- Must not generate oversized partitions. A dimension with N rows having the same value will necessarily put all those rows in the same partition, and that partition may be much larger than the target size.
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description static classDeterminePartitionsJob.DeterminePartitionsDimSelectionAssumeGroupedMapperThis DimSelection mapper runs on raw input data that we assume has already been grouped.static classDeterminePartitionsJob.DeterminePartitionsDimSelectionCombinerstatic classDeterminePartitionsJob.DeterminePartitionsDimSelectionOutputFormatstatic classDeterminePartitionsJob.DeterminePartitionsDimSelectionPartitionerstatic classDeterminePartitionsJob.DeterminePartitionsDimSelectionPostGroupByMapperThis DimSelection mapper runs on data generated by our GroupBy job.static classDeterminePartitionsJob.DeterminePartitionsDimSelectionReducerstatic classDeterminePartitionsJob.DeterminePartitionsGroupByMapperstatic classDeterminePartitionsJob.DeterminePartitionsGroupByReducerstatic classDeterminePartitionsJob.DimValueCount
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description StringgetErrorMessage()Map<String,Object>getStats()booleanrun()
-