GitHub user mengxr commented on the pull request:
https://github.com/apache/spark/pull/5614#issuecomment-95374557
That is different. In FPGrowth, we don't really care about the item type as
long as the items are serializable, so it is not necessary to map Python
objects to their equivalent JVM objects through SerDes. Instead, we can pickle
the items on the Python side and treat all items as strings on the JVM side.
I'm not sure whether this optimization is worth doing. Maybe we should wait
and see whether there are issues with the current implementation first.
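
A minimal sketch of the idea, in plain Python with no Spark involved. The
helper names (`encode_item`, `decode_item`, `count_opaque_items`) are
hypothetical and are not part of the PySpark API. The counting function only
stands in for the JVM side: it compares tokens as strings and never inspects
the item type. The sketch assumes that equal items pickle to identical bytes,
which holds for simple values such as the ones below but is not guaranteed for
every Python object.

```python
import base64
import pickle
from collections import Counter


def encode_item(item):
    """Pickle an arbitrary Python object and return it as an ASCII string."""
    return base64.b64encode(pickle.dumps(item)).decode("ascii")


def decode_item(token):
    """Recover the original Python object from its string token."""
    return pickle.loads(base64.b64decode(token.encode("ascii")))


def count_opaque_items(transactions):
    """Stand-in for the JVM side: count in how many transactions each
    token appears, using string equality only."""
    counts = Counter()
    for transaction in transactions:
        counts.update(set(transaction))
    return counts


# Items of mixed Python types; the counting step never needs to know them.
transactions = [[("a", 1), 2, "x"], [("a", 1), "x"], [2, "x"]]
encoded = [[encode_item(item) for item in t] for t in transactions]

counts = count_opaque_items(encoded)

# Decode only the tokens that meet the (assumed) minimum count of 2.
frequent = {decode_item(tok): n for tok, n in counts.items() if n >= 2}
```

The only place where the item type matters is the final decoding step on the
Python side, which is what makes a type-specific SerDes mapping unnecessary in
this scheme.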