Can't comment on the runtime, but there was a bug that prevented global
indexing from being used in a lot of cases (see
https://github.com/apache/incubator-sedona/pull/511), including any
attempts from SQL directly. The non-indexed join is very memory inefficient
right now (it loads all
Hi Jia,
Thanks very much for your help!
The setup worked, I managed to run Apache Sedona in a Jupyter Notebook :)
However, another problem has occurred.
I have two cases:
1. small join: gdf1 contains POLYGONs, shape: (250 rows, 3 columns), gdf2
contains POINTs, shape: (2+ million rows, 5
Hi Robert,
The tutorial you found on our website is a step-by-step tutorial for Python
Jupyter. In that tutorial, pipenv will install all dependencies from
binder/Pipfile:
https://github.com/apache/incubator-sedona/tree/master/binder
If you run into any specific issues, you can post here and we
Hi all,
I am new to PySpark and programming.
I would like to do a spatial join between two geographical datasets, one
consists of 50+ million rows.
Is there here anyone who could explain to me step by step how to install Apache
Sedona (GeoSpark) and its dependencies on a Mac?
After the