[GitHub] spark pull request #15363: [SPARK-17791][SQL] Join reordering using star sch...

ioana-delaney Sat, 18 Mar 2017 12:28:17 -0700

Github user ioana-delaney commented on a diff in the pull request:

    https://github.com/apache/spark/pull/15363#discussion_r106789008
  
    --- Diff: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/joins.scala 
---
    @@ -20,19 +20,347 @@ package org.apache.spark.sql.catalyst.optimizer
     import scala.annotation.tailrec
     
     import org.apache.spark.sql.catalyst.expressions._
    -import org.apache.spark.sql.catalyst.planning.ExtractFiltersAndInnerJoins
    +import 
org.apache.spark.sql.catalyst.planning.{ExtractFiltersAndInnerJoins, 
PhysicalOperation}
     import org.apache.spark.sql.catalyst.plans._
     import org.apache.spark.sql.catalyst.plans.logical._
     import org.apache.spark.sql.catalyst.rules._
    +import org.apache.spark.sql.catalyst.CatalystConf
    +
    +/**
    + * Encapsulates star-schema join detection.
    + */
    +case class StarSchemaDetection(conf: CatalystConf) extends PredicateHelper 
{
    +
    +  /**
    +   * Star schema consists of one or more fact tables referencing a number 
of dimension
    +   * tables. In general, star-schema joins are detected using the 
following conditions:
    +   *  1. Informational RI constraints (reliable detection)
    +   *    + Dimension contains a primary key that is being joined to the 
fact table.
    +   *    + Fact table contains foreign keys referencing multiple dimension 
tables.
    +   *  2. Cardinality based heuristics
    +   *    + Usually, the table with the highest cardinality is the fact 
table.
    +   *    + Table being joined with the most number of tables is the fact 
table.
    +   *
    +   * To detect star joins, the algorithm uses a combination of the above 
two conditions.
    +   * The fact table is chosen based on the cardinality heuristics, and the 
dimension
    +   * tables are chosen based on the RI constraints. A star join will 
consist of the largest
    +   * fact table joined with the dimension tables on their primary keys. To 
detect that a
    +   * column is a primary key, the algorithm uses table and column 
statistics.
    +   *
    +   * Since Catalyst only supports left-deep tree plans, the algorithm 
currently returns only
    +   * the star join with the largest fact table. Choosing the largest fact 
table on the
    +   * driving arm to avoid large inners is in general a good heuristic. 
This restriction can
    +   * be lifted with support for bushy tree plans.
    +   *
    +   * The highlights of the algorithm are the following:
    +   *
    +   * Given a set of joined tables/plans, the algorithm first verifies if 
they are eligible
    +   * for star join detection. An eligible plan is a base table access with 
valid statistics.
    +   * A base table access represents Project or Filter operators above a 
LeafNode. Conservatively,
    +   * the algorithm only considers base table access as part of a star join 
since they provide
    +   * reliable statistics.
    +   *
    +   * If some of the plans are not base table access, or statistics are not 
available, the algorithm
    +   * returns an empty star join plan since, in the absence of statistics, 
it cannot make
    +   * good planning decisions. Otherwise, the algorithm finds the table 
with the largest cardinality
    +   * (number of rows), which is assumed to be a fact table.
    +   *
    +   * Next, it computes the set of dimension tables for the current fact 
table. A dimension table
    +   * is assumed to be in a RI relationship with a fact table. To infer 
column uniqueness,
    +   * the algorithm compares the number of distinct values with the total 
number of rows in the
    +   * table. If their relative difference is within certain limits (i.e. 
ndvMaxError * 2, adjusted
    +   * based on 1TB TPC-DS data), the column is assumed to be unique.
    +   */
    +  def findStarJoins(
    +      input: Seq[LogicalPlan],
    +      conditions: Seq[Expression]): Seq[Seq[LogicalPlan]] = {
    +
    +    val emptyStarJoinPlan = Seq.empty[Seq[LogicalPlan]]
    +
    +    if (!conf.starSchemaDetection || input.size < 2) {
    +      emptyStarJoinPlan
    +    } else {
    +      // Find if the input plans are eligible for star join detection.
    +      // An eligible plan is a base table access with valid statistics.
    +      val foundEligibleJoin = input.forall {
    +        case PhysicalOperation(_, _, t: LeafNode) if 
t.stats(conf).rowCount.isDefined => true
    +        case _ => false
    +      }
    +
    +      if (!foundEligibleJoin) {
    +        // Some plans don't have stats or are complex plans. 
Conservatively,
    +        // return an empty star join. This restriction can be lifted
    +        // once statistics are propagated in the plan.
    +        emptyStarJoinPlan
    +      } else {
    +        // Find the fact table using cardinality based heuristics i.e.
    +        // the table with the largest number of rows.
    +        val sortedFactTables = input.map { plan =>
    +          TableAccessCardinality(plan, getTableAccessCardinality(plan))
    +        }.collect { case t @ TableAccessCardinality(_, Some(_)) =>
    +          t
    +        }.sortBy(_.size)(implicitly[Ordering[Option[BigInt]]].reverse)
    +
    +        sortedFactTables match {
    +          case Nil =>
    +            emptyStarJoinPlan
    +          case table1 :: table2 :: _
    +            if table2.size.get.toDouble > conf.starSchemaFTRatio * 
table1.size.get.toDouble =>
    +            // If the top largest tables have comparable number of rows, 
return an empty star plan.
    +            // This restriction will be lifted when the algorithm is 
generalized
    +            // to return multiple star plans.
    +            emptyStarJoinPlan
    +          case TableAccessCardinality(factTable, _) :: _ =>
    +            // Find the fact table joins.
    +            val allFactJoins = input.filterNot { plan =>
    +              plan eq factTable
    +            }.filter { plan =>
    +              val joinCond = findJoinConditions(factTable, plan, 
conditions)
    +              joinCond.nonEmpty
    +            }
    +
    +            // Find the corresponding join conditions.
    +            val allFactJoinCond = allFactJoins.flatMap { plan =>
    +              val joinCond = findJoinConditions(factTable, plan, 
conditions)
    +              joinCond
    +            }
    +
    +            // Verify if the join columns have valid statistics
    +            val areStatsAvailable = allFactJoins.forall { dimTable =>
    +              allFactJoinCond.exists {
    +                case BinaryComparison(lhs: AttributeReference, rhs: 
AttributeReference) =>
    +                  val dimCol = if (dimTable.outputSet.contains(lhs)) lhs 
else rhs
    +                  val factCol = if (factTable.outputSet.contains(lhs)) lhs 
else rhs
    +                  hasStatistics(dimCol, dimTable) && 
hasStatistics(factCol, factTable)
    +                case _ => false
    +              }
    +            }
    +
    +            if (!areStatsAvailable) {
    +              emptyStarJoinPlan
    +            } else {
    +              // Find the subset of dimension tables. A dimension table is 
assumed to be in
    +              // RI relationship with the fact table. Also, only consider 
equi-joins
    +              // between a fact and a dimension table.
    +              val eligibleDimPlans = allFactJoins.filter { dimTable =>
    +                allFactJoinCond.exists {
    +                  case cond @ BinaryComparison(lhs: AttributeReference, 
rhs: AttributeReference)
    +                    if cond.isInstanceOf[EqualTo] || 
cond.isInstanceOf[EqualNullSafe] =>
    +                    val dimCol = if (dimTable.outputSet.contains(lhs)) lhs 
else rhs
    +                    isUnique(dimCol, dimTable)
    +                  case _ => false
    +                }
    +              }
    +
    +              if (eligibleDimPlans.isEmpty) {
    +                // An eligible star join was not found because the join is 
not
    +                // an RI join, or the star join is an expanding join.
    +                emptyStarJoinPlan
    +              } else {
    +                Seq(factTable +: eligibleDimPlans)
    +              }
    +            }
    +        }
    +      }
    +    }
    +  }
    +
    +  /**
    +   * Reorders a star join based on heuristics:
    +   *   1) Finds the star join with the largest fact table and places it on 
the driving
    +   *      arm of the left-deep tree. This plan avoids large table access 
on the inner, and
    +   *      thus favor hash joins.
    +   *   2) Applies the most selective dimensions early in the plan to 
reduce the amount of
    +   *      data flow.
    +   */
    +  def reorderStarJoins(
    +      input: Seq[(LogicalPlan, InnerLike)],
    +      conditions: Seq[Expression]): Seq[(LogicalPlan, InnerLike)] = {
    +    assert(input.size >= 2)
    +
    +    val emptyStarJoinPlan = Seq.empty[(LogicalPlan, InnerLike)]
    +
    +    // Find the eligible star plans. Currently, it only returns
    +    // the star join with the largest fact table.
    +    val eligibleJoins = input.collect{ case (plan, Inner) => plan }
    +    val starPlans = findStarJoins(eligibleJoins, conditions)
    +
    +    if (starPlans.isEmpty) {
    +      emptyStarJoinPlan
    +    } else {
    +      val starPlan = starPlans.head
    +      val (factTable, dimTables) = (starPlan.head, starPlan.tail)
    +
    +      // Only consider selective joins. This case is detected by observing 
local predicates
    +      // on the dimension tables. In a star schema relationship, the join 
between the fact and the
    +      // dimension table is a FK-PK join. Heuristically, a selective 
dimension may reduce
    +      // the result of a join.
    +      // Also, conservatively assume that a fact table is joined with more 
than one dimension.
    +      if (dimTables.size >= 2 && isSelectiveStarJoin(dimTables, 
conditions)) {
    +        val reorderDimTables = dimTables.map { plan =>
    +          TableAccessCardinality(plan, getTableAccessCardinality(plan))
    +        }.sortBy(_.size).map {
    +          case TableAccessCardinality(p1, _) => p1
    +        }
    +
    +        val reorderStarPlan = factTable +: reorderDimTables
    +        reorderStarPlan.map(plan => (plan, Inner))
    +      } else {
    +        emptyStarJoinPlan
    +      }
    +    }
    +  }
    +
    +  /**
    +   * Determines if a column referenced by a base table access is a primary 
key.
    +   * A column is a PK if it is not nullable and has unique values.
    +   * To determine if a column has unique values in the absence of 
informational
    +   * RI constraints, the number of distinct values is compared to the total
    +   * number of rows in the table. If their relative difference
    +   * is within the expected limits (i.e. 2 * 
spark.sql.statistics.ndv.maxError based
    +   * on TPCDS data results), the column is assumed to have unique values.
    +   */
    +  private def isUnique(
    +      column: Attribute,
    +      plan: LogicalPlan): Boolean = plan match {
    +    case PhysicalOperation(_, _, t: LeafNode) =>
    +      val leafCol = findLeafNodeCol(column, plan)
    +      leafCol match {
    +        case Some(col) if t.outputSet.contains(col) =>
    +          val stats = t.stats(conf)
    +          stats.rowCount match {
    +            case Some(rowCount) if rowCount >= 0 =>
    +              if (stats.attributeStats.nonEmpty && 
stats.attributeStats.contains(col)) {
    +                val colStats = stats.attributeStats.get(col)
    +                if (colStats.get.nullCount > 0) {
    +                  false
    +                } else {
    +                  val distinctCount = colStats.get.distinctCount
    +                  val relDiff = math.abs((distinctCount.toDouble / 
rowCount.toDouble) - 1.0d)
    +                  // ndvMaxErr adjusted based on TPCDS 1TB data results
    +                  relDiff <= conf.ndvMaxError * 2
    +                }
    +              } else {
    +                false
    +              }
    +            case None => false
    +          }
    +        case None => false
    +      }
    +    case _ => false
    +  }
    +
    +  /**
    +   * Given a column over a base table access, it returns
    +   * the leaf node column from which the input column is derived.
    +   */
    +  @tailrec
    +  private def findLeafNodeCol(
    +      column: Attribute,
    +      plan: LogicalPlan): Option[Attribute] = plan match {
    +    case pl @ PhysicalOperation(_, _, _: LeafNode) =>
    +      pl match {
    +        case t: LeafNode if t.outputSet.contains(column) =>
    +          Option(column)
    +        case p: Project if p.outputSet.exists(_.semanticEquals(column)) =>
    +          val col = p.outputSet.find(_.semanticEquals(column)).get
    +          findLeafNodeCol(col, p.child)
    +        case f: Filter =>
    +          findLeafNodeCol(column, f.child)
    +        case _ => None
    +      }
    +    case _ => None
    +  }
    +
    +  /**
    +   * Checks if a column has statistics.
    +   * The column is assumed to be over a base table access.
    +   */
    +  private def hasStatistics(
    +      column: Attribute,
    +      plan: LogicalPlan): Boolean = plan match {
    +    case PhysicalOperation(_, _, t: LeafNode) =>
    --- End diff --
    
    @cloud-fan I am using the column statistics to infer the "uniqueness" of a 
column in a user table. For that I want to use the stats on the base relation 
and not the ones computed by CBO.



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark pull request #15363: [SPARK-17791][SQL] Join reordering using star sch...

Reply via email to