[ https://issues.apache.org/jira/browse/SPARK-13554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Apache Spark reassigned SPARK-13554:
------------------------------------
Assignee: Cheng Lian (was: Apache Spark)
> Migrate typed relational operations
> -----------------------------------
>
> Key: SPARK-13554
> URL: https://issues.apache.org/jira/browse/SPARK-13554
> Project: Spark
> Issue Type: Sub-task
> Components: SQL
> Affects Versions: 2.0.0
> Reporter: Cheng Lian
> Assignee: Cheng Lian
>
> The following methods and their corresponding tests should be migrated to Dataset:
> {noformat}
> - Relational operations
> - Typed relational operations
> - as(String): Dataset[T] // Subquery
> - filter(Column): Dataset[T]
> - filter(String): Dataset[T]
> - where(Column): Dataset[T]
> - where(String): Dataset[T]
> - limit(n): Dataset[T]
> - sortWithinPartitions(String, String*): Dataset[T]
> - sortWithinPartitions(Column*): Dataset[T]
> - sort(String, String*): Dataset[T]
> - sort(Column*): Dataset[T]
> - orderBy(String, String*): Dataset[T]
> - orderBy(Column*): Dataset[T]
> - randomSplit(Array[Double], Long): Array[Dataset[T]]
> - randomSplit(Array[Double]): Array[Dataset[T]]
> - Set operations
> - unionAll // alias of union (remove it?)
> - except // alias of subtract (remove it?)
> - Repartitioning
> - repartition(Int, Column*): Dataset[T]
> - repartition(Column*): Dataset[T]
> - explode[A <: Product: TypeTag](Column*)(Row => TraversableOnce[A]): Dataset[A]
> - explode[A, B: TypeTag](String, String)(A => TraversableOnce[B]): Dataset[B]
> {noformat}