jfn6030217 opened a new pull request #164:
URL: https://github.com/apache/incubator-spot/pull/164


   …large sets
   
   https://issues.apache.org/jira/projects/SPOT/issues/SPOT-286
   identifies an issue with calling org.apache.spark.sql.functions.broadcast()
   on large datasets which may exceed a limit and cause an exception.
   
   It was determined wrapping a dataset in this way is a performance
   optimization, and the justification for this optimization is not clear.
   
   It was decided it was preferable to undo the optimization than allow
   it to fail for some situations.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to