karthik892 opened a new issue, #1055:
URL: https://github.com/apache/sedona/issues/1055

   I have created a python application that does the following
   
   1. Read a CSV file containing a series on lat, long values
   2. Create a point geometry column from the lat long using 
```.withColumn("geo_point", F.expr("ST_POINT(...)"))```
   3. Save table A using ```createOrReplaceTempView("A")```
   4. Read in another CSV file which contains a series of polygons defined in 
WKT format
   5. Create a polygon geometry column from the wkt using 
```.withColumn("geometry", F.expr("ST_GeomFromWKT(wkt_geometry)"))```
   6. Save table B using ```createOrReplaceTempView("B")```
   7. Run a query using ```ST_Intersects``` and display the results
   
   ## Expected behavior
   I expect to see the result of my spatial join, or an empty result set, or an 
SQL error
   ## Actual behavior
   I receive the following error, usually at step 4 (Is Sedona automatically 
detecting the WKT string?):
   ```
   py4j.protocol.Py4JJavaError: An error occurred while calling o34.sql.
   : java.lang.NoSuchMethodError: 
org.apache.commons.text.StringSubstitutor.setEnableUndefinedVariableException(Z)Lorg/apache/commons/text/StringSubstitutor;
           at 
org.apache.spark.ErrorClassesJsonReader.getErrorMessage(ErrorClassesJSONReader.scala:49)
   ...
   
   ```
   ## Steps to reproduce the problem
   See steps listed above
   ## Settings
   
   Sedona version = 1.5.0
   
   Apache Spark version = 3.4.1 (also tried 3.5.0)
   
   Apache Flink version = Not using Flink
   
   API type = Python
   
   Scala version = 2.12
   
   JRE version = 11.0.1 (OpenJDK)
   
   Python version = 3.8.10
   
   Environment = Standalone ... sort of ...
   I am running a Virtual Hadoop cluster consisting of 5 VirtualBox VM's 
running Hadoop. That's 1 Name node and 4 Data Nodes.
   I am also using vagrant to make the process of creating and destroying these 
VM's much easier.
   I am using Ubuntu 20.04 and Hadoop 3.3.6
   
   I also have an SO question trying to find an answer to this problem - 
https://stackoverflow.com/questions/77326231/receiving-nosuchmethoderror-when-running-sql-query-in-a-pyspark-application-wi
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@sedona.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to