Martin Mauch created SPARK-20745:
------------------------------------

             Summary: Data gets wrongly copied from one row to others, possibly 
related to named structs
                 Key: SPARK-20745
                 URL: https://issues.apache.org/jira/browse/SPARK-20745
             Project: Spark
          Issue Type: Bug
          Components: Input/Output
    Affects Versions: 2.1.1
            Reporter: Martin Mauch


We encountered a strange bug where Spark copies data over from one row to other 
rows. It might be related to named structs, at least the minimal repro we were 
able to achieve involves them: 
https://github.com/crealytics/spark_bug/blob/master/src/test/scala/spark/DataFrameConversionsSpec.scala
The interesting part is that Spark behaves correctly when the DataFrame is 
cached (see the 2nd example) and also if you run the failing example a second 
time (see 1st vs 3rd example).
You should be able to check out the above project and reproduce the problem with
sbt test



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to