Cheng Lian created SPARK-13554:
----------------------------------

             Summary: Migrate typed relational operations
                 Key: SPARK-13554
                 URL: https://issues.apache.org/jira/browse/SPARK-13554
             Project: Spark
          Issue Type: Sub-task
          Components: SQL
    Affects Versions: 2.0.0
            Reporter: Cheng Lian
            Assignee: Cheng Lian


The following methods and their corresponding tests should be migrated to Dataset:

{noformat}
- Relational operations

  - Typed relational operations

    - as(String): Dataset[T] // Subquery
    - filter(Column): Dataset[T]
    - filter(String): Dataset[T]
    - where(Column): Dataset[T]
    - where(String): Dataset[T]
    - limit(n): Dataset[T]
    - sortWithinPartitions(String, String*): Dataset[T]
    - sortWithinPartitions(Column*): Dataset[T]
    - sort(String, String*): Dataset[T]
    - sort(Column*): Dataset[T]
    - orderBy(String, String*): Dataset[T]
    - orderBy(Column*): Dataset[T]
    - randomSplit(Array[Double], Long): Array[Dataset[T]]
    - randomSplit(Array[Double]): Array[Dataset[T]]

    - Set operations

      - unionAll // alias of union (remove it?)
      - except // alias of subtract (remove it?)

    - Repartitioning

      - repartition(Int, Column*): Dataset[T]
      - repartition(Column*): Dataset[T]

    - explode[A <: Product: TypeTag](Column*)(Row => TraversableOnce[A]): Dataset[A]
    - explode[A, B: TypeTag](String, String)(A => TraversableOnce[B]): Dataset[B]
{noformat}




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
