Github user mengxr commented on the pull request:
https://github.com/apache/spark/pull/8546#issuecomment-140283221
@HuJiayin Could you add a description to the PR title? The JIRA number
doesn't describe the content. For the implementation, maybe we can avoid
duplicating code. How about not caching the cost RDD if the number of features
is small? It should be able to reduce the storage overhead, but it might be
slower than before.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]