Github user mateiz commented on the pull request: https://github.com/apache/spark/pull/1165#issuecomment-49685834 I see, we should revisit that, but probably later. IMO if you're going to store them serialized, you're going to pay for that deserialization cost later anyway. With the current logic we might just never cache them, even though a serialized / compressed form may be reasonable in size.
--- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---