Cheng Lian created SPARK-13554:
----------------------------------
Summary: Migrate typed relational operations
Key: SPARK-13554
URL: https://issues.apache.org/jira/browse/SPARK-13554
Project: Spark
Issue Type: Sub-task
Components: SQL
Affects Versions: 2.0.0
Reporter: Cheng Lian
Assignee: Cheng Lian
Should migrate the following methods and corresponding tests to Dataset:
{noformat}
- Relational operations
- Typed relational operations
- as(String): Dataset[T] // Subquery
- filter(Column): Dataset[T]
- filter(String): Dataset[T]
- where(Column): Dataset[T]
- where(String): Dataset[T]
- limit(n): Dataset[T]
- sortWithinPartitions(String, String*): Dataset[T]
- sortWithinPartitions(Column*): Dataset[T]
- sort(String, String*): Dataset[T]
- sort(Column*): Dataset[T]
- orderBy(String, String*): Dataset[T]
- orderBy(Column*): Dataset[T]
- randomSplit(Array[Double], Long): Array[Dataset[T]]
- randomSplit(Array[Double]): Array[Dataset[T]]
- Set operations
- unionAll // alias of union (remove it?)
- except // alias of substract (remove it?)
- Repartitioning
- repartition(Int, Column*): Dataset[T]
- repartition(Column*): Dataset[T]
- explode[A <: Product: TypeTag](Column*)(Row => TraversableOnce[A]):
Dataset[A]
- explode[A, B: TypeTag](String, String)(A => TraversableOnce[B]):
Dataset[B]
{noformat}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]