H Netanel,

Thanks for asking. I think there are a couple of issues to be improved, as
listed here:
https://lists.apache.org/thread.html/r4ebf7c253fff77900cac8b6ad478c8f4808b707062dd8c6b40853a6f%40%3Cdev.sedona.apache.org%3E

On the list given by Pawel and my thoughts, I think the most important
thing to be fixed is that (sorted by importance):

1. Replace the object serializer in RDD API with the WKB serializer you
wrote before, currently only DataFrame API uses WKB based serializer. In
Sedona-core, there is a small benchmark that can test the performance of
the WKB serializer.
2. Allow broadcast join for big+small data frames, or at least give users a
way to force the broadcast join. Currently, once the optimized Sedona join
is enabled, the broadcast join is automatically disabled.
3. Allow null spatial objects in SpatialRDD and DataFrame. Or at least give
users a meaningful exception: https://issues.apache.org/jira/browse/SEDONA-4

Pawel was working on some improvements for GeoJSON loading. So we can leave
it for him.

Do you have any other issues you want to work on?

Thanks,
Jia

On Wed, Feb 17, 2021 at 5:35 AM Netanel Malka <[email protected]> wrote:

> Hi Jia,
> Are there any features that we want to develop? or fix some bugs?
>
> My free time now allows me to make some contributions to the project.
>
> Thanks,
>
>

Reply via email to