huaxingao commented on a change in pull request #32049: URL: https://github.com/apache/spark/pull/32049#discussion_r638464112
##########
File path: sql/catalyst/src/main/java/org/apache/spark/sql/connector/read/SupportsPushDownAggregates.java
##########
@@ -0,0 +1,60 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements. See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License. You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.spark.sql.connector.read;
+
+import org.apache.spark.annotation.Evolving;
+import org.apache.spark.sql.sources.Aggregation;
+import org.apache.spark.sql.types.StructType;
+
+/**
+ * A mix-in interface for {@link ScanBuilder}. Data source can implement this interface to
+ * push down aggregates to the data source.
+ *
+ * @since 3.2.0
+ */
+@Evolving
+public interface SupportsPushDownAggregates extends ScanBuilder {
+
+  /**
+   * Pushes down Aggregation to datasource.
+   * The Aggregation can be pushed down only if all the Aggregate Functions can
+   * be pushed down.
+   */
+  void pushAggregation(Aggregation aggregation);
+
+  /**
+   * Returns the aggregation that are pushed to the data source via
+   * {@link #pushAggregation(Aggregation aggregation)}.
+   */
+  Aggregation pushedAggregation();
+
+  /**
+   * Returns the schema of the pushed down aggregates
+   */
+  StructType getPushDownAggSchema();
+
+  /**
+   * Indicate if the data source only supports global aggregated push down
+   */
+  boolean supportsGlobalAggregatePushDownOnly();

Review comment:
I use `supportsGlobalAggregatePushDownOnly` to indicate whether the data source can push down only global aggregates, i.e. whether it lacks support for pushing down `group by`, and I use `supportsPushDownAggregateWithFilter` to indicate whether the data source supports pushing down an aggregate together with a filter. For a data source that doesn't support pushing down `group by`, or doesn't support pushing down an aggregate with a filter, I exit right away when I see a `group by` or a filter. I think I need both flags to decide whether to push down `group by` and whether to push down a filter with the aggregate.
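
The early-exit check described in the comment above could be sketched roughly as follows. This is only an illustration, not the code in the pull request: the `Capabilities` interface, the `caps` helper, and `canPushDown` are hypothetical stand-ins, and only the two flag names are taken from the discussion.

```java
public class PushDownGate {

  /** Hypothetical stand-in that exposes only the two flags named in the review comment. */
  interface Capabilities {
    boolean supportsGlobalAggregatePushDownOnly();
    boolean supportsPushDownAggregateWithFilter();
  }

  /**
   * Returns true if the aggregate may be pushed down, under the assumption
   * that the planner already knows whether the query has a group by and
   * whether it has a filter.
   */
  static boolean canPushDown(Capabilities caps, boolean hasGroupBy, boolean hasFilter) {
    // A source that handles only global aggregates cannot take a group by.
    if (hasGroupBy && caps.supportsGlobalAggregatePushDownOnly()) {
      return false;
    }
    // A source that cannot combine aggregates with filters must not see a filter.
    if (hasFilter && !caps.supportsPushDownAggregateWithFilter()) {
      return false;
    }
    return true;
  }

  /** Hypothetical helper that builds a Capabilities value from two booleans. */
  static Capabilities caps(boolean globalOnly, boolean withFilter) {
    return new Capabilities() {
      @Override
      public boolean supportsGlobalAggregatePushDownOnly() {
        return globalOnly;
      }

      @Override
      public boolean supportsPushDownAggregateWithFilter() {
        return withFilter;
      }
    };
  }

  public static void main(String[] args) {
    Capabilities globalOnlyNoFilter = caps(true, false);
    // A global aggregate without a filter is the only shape this source accepts.
    System.out.println(canPushDown(globalOnlyNoFilter, false, false));
    System.out.println(canPushDown(globalOnlyNoFilter, true, false));
    System.out.println(canPushDown(globalOnlyNoFilter, false, true));
  }
}
```

With both flags available, the caller can reject an unsupported query shape before building any pushed-down aggregation, which is the "exit right away" behaviour described above.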
