Re: Installing Apache Sedona and its dependencies

2021-03-09 Thread Adam Binford
Can't comment on the runtime, but there was a bug that prevented global indexing from being used in a lot of cases (see https://github.com/apache/incubator-sedona/pull/511), including any attempts from SQL directly. The non-indexed join is very memory inefficient right now (it loads all

Re: Installing Apache Sedona and its dependencies

2021-03-08 Thread Robert Bozsik
Hi Jia, Thanks very much for your help! The setup worked, I managed to run Apache Sedona in a Jupyter Notebook :) However, another problem has occurred. I have two cases: 1. small join: gdf1 contains POLYGONs, shape: (250 rows, 3 columns), gdf2 contains POINTs, shape: (2+ million rows, 5

Re: Installing Apache Sedona and its dependencies

2021-03-06 Thread Jia Yu
Hi Robert, The tutorial you found on our website is a step-by-step tutorial for Python Jupyter. In that tutorial, pipenv will install all dependencies from binder/Pipfile: https://github.com/apache/incubator-sedona/tree/master/binder If you run into any specific issues, you can post here and we

Installing Apache Sedona and its dependencies

2021-03-06 Thread Robert Bozsik
Hi all, I am new to PySpark and programming. I would like to do a spatial join between two geographical datasets, one consists of 50+ million rows. Is there here anyone who could explain to me step by step how to install Apache Sedona (GeoSpark) and its dependencies on a Mac? After the