Fokko commented on issue #28238: [SPARK-23647][PYTHON][SQL][2.4] Adds more 
types for hint in pyspark
URL: https://github.com/apache/spark/pull/28238#issuecomment-615590452
 
 
   I know how tedious is it to cherry-pick back to maintenance versions. But 
this is really a bug. However, I should have provided more information. I'm not 
the only one that's running into this. For example, this StackOverflow: 
https://stackoverflow.com/questions/56242674/why-am-i-getting-an-exception-when-using-a-range-join-hint
   
   So the bug is with the Python API. Everything works in the SQL API:
   
   
![image](https://user-images.githubusercontent.com/1134248/79629889-3af7ba80-814d-11ea-909a-5c7f574854c5.png)
   
   But when doing exactly the same using the Python Dataframe API:
   
   
![image](https://user-images.githubusercontent.com/1134248/79629929-9c1f8e00-814d-11ea-8920-f8e32faddc2e.png)
   
   It requires.a string as an argument for the `.hint` API. However, this won't 
work:
   
   
![image](https://user-images.githubusercontent.com/1134248/79629961-ccffc300-814d-11ea-9d52-aafaebccc93b.png)
   
   It is described in the docs that it should be possible: 
https://docs.databricks.com/delta/join-performance/range-join.html#enable-range-join-using-a-range-join-hint
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to