jfn6030217 opened a new pull request #164: URL: https://github.com/apache/incubator-spot/pull/164
…large sets https://issues.apache.org/jira/projects/SPOT/issues/SPOT-286 identifies an issue with calling org.apache.spark.sql.functions.broadcast() on large datasets which may exceed a limit and cause an exception. It was determined wrapping a dataset in this way is a performance optimization, and the justification for this optimization is not clear. It was decided it was preferable to undo the optimization than allow it to fail for some situations. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
