Kontinuation commented on issue #860:
URL: https://github.com/apache/sedona/issues/860#issuecomment-1588959368

   Apache Sedona requires that Shapfiles be stored in specific directory 
structures to be loaded by `ShapefileReader.readToGeometryRDD`. For example, 
the Geofabrick OSM roads Shapefiles need to be stored in the following 
structure to be loaded:
   
   ```
   /dbfs/mnt/spatial/gis_osm_roads_free_1
   ├── gis_osm_roads_free_1.shp
   ├── gis_osm_roads_free_1.shx
   ├── gis_osm_roads_free_1.prj
   ...
   ```
   
   Then the Shapefiles could be loaded and analyzed. If you are reading from 
DBFS, you can specify the input path as `dbfs://...`. I found that using the 
local mount point `/dbfs/...` has some path translation problems.
   
   ```python
   spatialRdd = ShapefileReader.readToGeometryRDD(sc, 
"dbfs:/mnt/spatial/gis_osm_roads_free_1")
   spatialRdd.analyze()
   ```
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to