[ 
https://issues.apache.org/jira/browse/SPARK-46482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ivan Sadikov updated SPARK-46482:
---------------------------------
    Description: 
SPARK-43049 causes performance regression when writing string fields to an 
Oracle database due to strings written as CLOB instead of VARCHAR2. CLOB is 
known to have bad performance in Oracle so when creating a table and writing to 
it using Spark, SQL statement would take at least 5x performance hit from the 
original patch (2 min vs 10+ min).

I confirmed internally that running a job with the commit reverted brings back 
the original performance numbers.

 

 

 

> Revert SPARK-43049 due to performance regression
> ------------------------------------------------
>
>                 Key: SPARK-46482
>                 URL: https://issues.apache.org/jira/browse/SPARK-46482
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 3.4.1, 3.5.0
>            Reporter: Ivan Sadikov
>            Priority: Major
>
> SPARK-43049 causes performance regression when writing string fields to an 
> Oracle database due to strings written as CLOB instead of VARCHAR2. CLOB is 
> known to have bad performance in Oracle so when creating a table and writing 
> to it using Spark, SQL statement would take at least 5x performance hit from 
> the original patch (2 min vs 10+ min).
> I confirmed internally that running a job with the commit reverted brings 
> back the original performance numbers.
>  
>  
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to