Hi all,
I am new to PySpark and programming.
I would like to do a spatial join between two geographical datasets, one
of which has more than 50 million rows.
Is there anyone here who could explain to me, step by step, how to install
Apache Sedona (formerly GeoSpark) and its dependencies on a Mac?
After the installation I would like to run it locally in a virtual environment,
first in a Jupyter Notebook and then from a .py file.
On the official website I have found a quick start guide:
https://sedona.apache.org/download/overview/
and a Python Jupyter Notebook Examples guide:
https://sedona.apache.org/tutorial/jupyter-notebook/
However, it is still not clear to me how to install it and get it running.
Unfortunately, I have not found a useful step-by-step guide through Google or
YouTube, and I feel stuck in an endless loop of reading link after link, each
of which describes a different solution.
Thanks a lot in advance,
Robert