[ https://issues.apache.org/jira/browse/SPARK-20113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15944079#comment-15944079 ]
Sean Owen commented on SPARK-20113:
-----------------------------------
If there is no primary key, how would anything know that the data is already
inserted? There is no notion of sameness by which to decide that the data is
already there.
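
To illustrate the point about sameness, here is a minimal sketch using
Python's built-in sqlite3 module rather than MySQL or Spark (the table and
column names are hypothetical). Without a primary key the database accepts an
identical row a second time; with one, the duplicate is rejected.

```python
import sqlite3


def insert_twice(create_sql):
    """Insert the same row twice and return the resulting row count."""
    conn = sqlite3.connect(":memory:")
    conn.execute(create_sql)
    for _ in range(2):
        try:
            conn.execute("INSERT INTO t (id, name) VALUES (1, 'a')")
        except sqlite3.IntegrityError:
            # Only a key constraint lets the database see the row as a duplicate.
            pass
    count = conn.execute("SELECT COUNT(*) FROM t").fetchone()[0]
    conn.close()
    return count


# Hypothetical table definitions: one without a key, one with a primary key.
rows_without_key = insert_twice("CREATE TABLE t (id INTEGER, name TEXT)")
rows_with_key = insert_twice("CREATE TABLE t (id INTEGER PRIMARY KEY, name TEXT)")
print(rows_without_key, rows_with_key)
```
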
> overwrite mode appends data on MySQL table that does not have a primary key
> ---------------------------------------------------------------------------
>
> Key: SPARK-20113
> URL: https://issues.apache.org/jira/browse/SPARK-20113
> Project: Spark
> Issue Type: Bug
> Components: Input/Output
> Affects Versions: 2.0.1
> Reporter: Bhanu Akaveeti
>
> DataFrame.write in overwrite mode appends data to a MySQL table that does not
> have a primary key:
>
> df_mysql.write \
>     .mode("overwrite") \
>     .jdbc("jdbc:mysql://ip-address/database", "MySQL_Table",
>           properties={"user": "MySQL_user", "password": "MySQL_pw"})
>
> When the above script is run twice, the data is inserted twice. I also tried
> option("truncate", "true"), but the data is still appended to the MySQL table.
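
For reference, the behaviour the reporter expects from overwrite mode can be
sketched without Spark. This is only an illustration using sqlite3:
`write_rows` is a hypothetical helper, not Spark's API, and it assumes
overwrite means "clear the existing rows, then insert", which does not depend
on a primary key.

```python
import sqlite3


def write_rows(conn, rows, mode):
    """Hypothetical helper: 'overwrite' clears the table first, 'append' does not."""
    if mode == "overwrite":
        conn.execute("DELETE FROM mysql_table")
    elif mode != "append":
        raise ValueError(f"unsupported mode: {mode}")
    conn.executemany("INSERT INTO mysql_table (id, name) VALUES (?, ?)", rows)
    conn.commit()


def row_count(conn):
    """Return the number of rows currently in the table."""
    return conn.execute("SELECT COUNT(*) FROM mysql_table").fetchone()[0]


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE mysql_table (id INTEGER, name TEXT)")  # no primary key
rows = [(1, "a"), (2, "b")]

# Running the overwrite twice should leave a single copy of the data.
write_rows(conn, rows, "overwrite")
write_rows(conn, rows, "overwrite")
count_after_overwrite = row_count(conn)

# Appending afterwards adds a second copy.
write_rows(conn, rows, "append")
count_after_append = row_count(conn)
```

Under this model the row count stays the same across repeated overwrites and
grows only on append, which is the difference the report describes.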