H Netanel, Thanks for asking. I think there are a couple of issues to be improved, as listed here: https://lists.apache.org/thread.html/r4ebf7c253fff77900cac8b6ad478c8f4808b707062dd8c6b40853a6f%40%3Cdev.sedona.apache.org%3E
On the list given by Pawel and my thoughts, I think the most important thing to be fixed is that (sorted by importance): 1. Replace the object serializer in RDD API with the WKB serializer you wrote before, currently only DataFrame API uses WKB based serializer. In Sedona-core, there is a small benchmark that can test the performance of the WKB serializer. 2. Allow broadcast join for big+small data frames, or at least give users a way to force the broadcast join. Currently, once the optimized Sedona join is enabled, the broadcast join is automatically disabled. 3. Allow null spatial objects in SpatialRDD and DataFrame. Or at least give users a meaningful exception: https://issues.apache.org/jira/browse/SEDONA-4 Pawel was working on some improvements for GeoJSON loading. So we can leave it for him. Do you have any other issues you want to work on? Thanks, Jia On Wed, Feb 17, 2021 at 5:35 AM Netanel Malka <[email protected]> wrote: > Hi Jia, > Are there any features that we want to develop? or fix some bugs? > > My free time now allows me to make some contributions to the project. > > Thanks, > >
