trait SupportsPushDownAggregates extends ScanBuilder
A mix-in interface for ScanBuilder. Data sources can implement this interface to
push down aggregates. Spark assumes that the data source can't fully complete the
grouping work, and will group the data source output again. For queries like
"SELECT min(value) AS m FROM t GROUP BY key", after pushing down the aggregate
to the data source, the data source can still output data with duplicated keys, which is OK
as Spark will do GROUP BY key again. The final query plan can be something like this:
Aggregate [key#1], [min(min(value)#2) AS m#3]
+- RelationV2[key#1, min(value)#2]
Similarly, if there is no grouping expression, the data source can still output more than one row.
When pushing down operators, Spark pushes down filters to the data source first, then pushes down aggregates or applies column pruning. Depending on the data source implementation, aggregates may or may not be pushed down together with filters. If pushed filters still need to be evaluated after scanning, aggregates can't be pushed down.
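The re-grouping described above can be modelled without Spark. The following is a minimal sketch, assuming a source that aggregates each partition on its own; `Row`, `sourceOutput`, and `finalAggregate` are hypothetical names used only for illustration and are not part of the Spark API.

```scala
object PartialAggregationSketch {
  // One (key, value) row of the hypothetical table t.
  final case class Row(key: String, value: Int)

  // What a source might return after accepting min(value) GROUP BY key:
  // each partition is aggregated independently, so a key can repeat
  // across partitions in the combined output.
  def sourceOutput(partitions: Seq[Seq[Row]]): Seq[(String, Int)] =
    partitions.flatMap { part =>
      part.groupBy(_.key).map { case (k, rows) => k -> rows.map(_.value).min }
    }

  // Models Spark's final Aggregate node: group by key again and take
  // the min of the partial mins, i.e. min(min(value)).
  def finalAggregate(partials: Seq[(String, Int)]): Map[String, Int] =
    partials.groupBy(_._1).map { case (k, ps) => k -> ps.map(_._2).min }
}
```

With partitions `Seq(Row("a", 3), Row("b", 5))` and `Seq(Row("a", 1))`, the source output contains key `a` twice, and the final aggregate collapses it to `Map("a" -> 1, "b" -> 5)`. This works for `min` because the minimum of partial minimums equals the overall minimum.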
- Annotations: @Evolving()
- Since: 3.2.0
Abstract Value Members
- abstract def build(): Scan
  - Definition Classes: ScanBuilder
- abstract def pushAggregation(aggregation: Aggregation): Boolean
  Pushes down Aggregation to the data source. The data source scan output columns should be ordered as follows: grouping columns first, then aggregate columns (in the same order as the aggregate functions in the given Aggregation).
  - returns: true if the aggregation can be pushed down to the data source, false otherwise.
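The contract of `pushAggregation` (return `true` only when the aggregation is accepted, and emit grouping columns before aggregate columns) might be sketched as follows. To stay runnable without Spark on the classpath, this uses simplified stand-in types: `SimpleAggregation`, `MinOf`, `MaxOf`, `AvgOf`, and `SketchScanBuilder` are hypothetical and only mimic the shape of Spark's `Aggregation` and `ScanBuilder`. The choice to support only min and max is likewise an assumption of the sketch.

```scala
object PushAggregationSketch {
  // Hypothetical stand-ins for Spark's Aggregation and aggregate function classes.
  sealed trait AggFn { def column: String }
  final case class MinOf(column: String) extends AggFn
  final case class MaxOf(column: String) extends AggFn
  final case class AvgOf(column: String) extends AggFn
  final case class SimpleAggregation(groupBy: Seq[String], functions: Seq[AggFn])

  // Column label for one aggregate function, e.g. "min(value)".
  private def label(fn: AggFn): String = fn match {
    case MinOf(c) => s"min($c)"
    case MaxOf(c) => s"max($c)"
    case AvgOf(c) => s"avg($c)"
  }

  final class SketchScanBuilder {
    private var pushed: Option[SimpleAggregation] = None

    // Accept the aggregation only if every function is one this
    // (hypothetical) source can evaluate; otherwise leave state untouched.
    def pushAggregation(aggregation: SimpleAggregation): Boolean = {
      val supported = aggregation.functions.forall {
        case _: MinOf | _: MaxOf => true
        case _                   => false
      }
      if (supported) pushed = Some(aggregation)
      supported
    }

    // Scan output order: grouping columns first, then one column per
    // aggregate function, in the order given in the aggregation.
    // Returns an empty list when nothing was pushed (a simplification).
    def outputColumns: Seq[String] = pushed match {
      case Some(agg) => agg.groupBy ++ agg.functions.map(label)
      case None      => Seq.empty
    }
  }
}
```

In this sketch, pushing `SimpleAggregation(Seq("key"), Seq(MinOf("value"), MaxOf("value")))` returns `true` and yields the column order `key`, `min(value)`, `max(value)`, while an aggregation containing `AvgOf` is rejected and leaves the builder unchanged.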
Concrete Value Members
- final def !=(arg0: Any): Boolean
  - Definition Classes: AnyRef → Any
- final def ##(): Int
  - Definition Classes: AnyRef → Any
- final def ==(arg0: Any): Boolean
  - Definition Classes: AnyRef → Any
- final def asInstanceOf[T0]: T0
  - Definition Classes: Any
- def clone(): AnyRef
  - Attributes: protected[lang]
  - Definition Classes: AnyRef
  - Annotations: @throws( ... ) @native()
- final def eq(arg0: AnyRef): Boolean
  - Definition Classes: AnyRef
- def equals(arg0: Any): Boolean
  - Definition Classes: AnyRef → Any
- def finalize(): Unit
  - Attributes: protected[lang]
  - Definition Classes: AnyRef
  - Annotations: @throws( classOf[java.lang.Throwable] )
- final def getClass(): Class[_]
  - Definition Classes: AnyRef → Any
  - Annotations: @native()
- def hashCode(): Int
  - Definition Classes: AnyRef → Any
  - Annotations: @native()
- final def isInstanceOf[T0]: Boolean
  - Definition Classes: Any
- final def ne(arg0: AnyRef): Boolean
  - Definition Classes: AnyRef
- final def notify(): Unit
  - Definition Classes: AnyRef
  - Annotations: @native()
- final def notifyAll(): Unit
  - Definition Classes: AnyRef
  - Annotations: @native()
- final def synchronized[T0](arg0: ⇒ T0): T0
  - Definition Classes: AnyRef
- def toString(): String
  - Definition Classes: AnyRef → Any
- final def wait(): Unit
  - Definition Classes: AnyRef
  - Annotations: @throws( ... )
- final def wait(arg0: Long, arg1: Int): Unit
  - Definition Classes: AnyRef
  - Annotations: @throws( ... )
- final def wait(arg0: Long): Unit
  - Definition Classes: AnyRef
  - Annotations: @throws( ... ) @native()