Github user viirya commented on a diff in the pull request:

    https://github.com/apache/spark/pull/21240#discussion_r186276709
  
    --- Diff: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/generators.scala
 ---
    @@ -222,6 +222,51 @@ case class Stack(children: Seq[Expression]) extends 
Generator {
       }
     }
     
    +/**
    + * Replicate the row N times. N is specified as the first argument to the 
function.
    + * {{{
    + *   SELECT replicate_rows(2, "val1", "val2") ->
    + *   2  val1  val2
    + *   2  val1  val2
    + *  }}}
    + */
    +@ExpressionDescription(
    +usage = "_FUNC_(n, expr1, ..., exprk) - Replicates `n`, `expr1`, ..., 
`exprk` into `n` rows.",
    --- End diff --
    
    I checked the design doc for INTERSECT ALL and EXCEPT ALL. Looks like the 
`n` is always stripped after Generate operation. So why we need to keep `n` in  
`ReplicateRows` outputs? Can we do it like:
    
    ```
    > SELECT _FUNC_(2, "val1", "val2");
      val1  val2
      val1  val2
    ```


---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to