Datasource API V2 and checkpointing

2018-04-23 Thread Thakrar, Jayesh
I was wondering: when checkpointing is enabled, who does the actual work, the streaming datasource or the execution engine/driver? I have written a small/trivial datasource that just generates strings. After enabling checkpointing, I do see a folder being created under the checkpoint folder, but

Spark+AI Summit 2018 (promo code within)

2018-04-23 Thread Scott walent
Spark+AI Summit is only 6 weeks away. Keynotes this year include talks from Tesla, Apple, Databricks, Andreessen Horowitz and many more! Use code "SparkList" and save 15% when registering at http://databricks.com/sparkaisummit We hope to see you there. -Scott

ShuffledHashJoin's selection criteria

2018-04-23 Thread Jacek Laskowski
Hi, I've been reviewing the code of JoinSelection for ShuffledHashJoin and can't understand how !RowOrdering.isOrderable(leftKeys) can ever be met for the second case (copying the entire code for a quick look): if !conf.preferSortMergeJoin && canBuildRight(joinType) &&

Unsubscribe

2018-04-23 Thread varma dantuluri
Unsubscribe -- Regards, Varma Dantuluri

Re: Sort-merge join improvement

2018-04-23 Thread Petar Zecevic
Hi, the PR tests completed successfully (https://github.com/apache/spark/pull/21109). Can you please review the patch and merge it upstream if you think it's OK? Thanks, Petar On 4/18/2018 at 4:52 PM, Petar Zecevic wrote: As instructed offline, I opened a JIRA for this: