Tor Myklebust created SPARK-1672:
------------------------------------
Summary: Support separate partitioners (and numbers of partitions)
for users and products
Key: SPARK-1672
URL: https://issues.apache.org/jira/browse/SPARK-1672
Project: Spark
Issue Type: Improvement
Reporter: Tor Myklebust
Priority: Minor
The user ought to be able to specify a partitioning of his data if he knows a
good one. It's convenient to have separate partitioners for users and products
so that no strange mapping step needs to happen.
It may also be reasonable to partition the users and products into different
numbers of partitions (for instance, to balance memory requirements) if the
dataset is tall, thin, and very sparse.
--
This message was sent by Atlassian JIRA
(v6.2#6252)