[ https://issues.apache.org/jira/browse/SPARK-46482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ivan Sadikov updated SPARK-46482:
---------------------------------
    Description:
SPARK-43049 causes a performance regression when writing string fields to an Oracle database because strings are written as CLOB instead of VARCHAR2. CLOB is known to perform poorly in Oracle, so when Spark creates a table and writes to it, the internal SQL statement that writes 20+ string fields takes at least a 5x performance hit as a result of the original patch (2 min vs 10+ min).

I confirmed internally that running a job with the commit reverted brings back the original performance numbers.

  was:
SPARK-43049 causes a performance regression when writing string fields to an Oracle database because strings are written as CLOB instead of VARCHAR2. CLOB is known to perform poorly in Oracle, so when Spark creates a table and writes to it, the SQL statement takes at least a 5x performance hit as a result of the original patch (2 min vs 10+ min).

I confirmed internally that running a job with the commit reverted brings back the original performance numbers.

> Revert SPARK-43049 due to performance regression of using CLOB
> --------------------------------------------------------------
>
>                 Key: SPARK-46482
>                 URL: https://issues.apache.org/jira/browse/SPARK-46482
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 3.4.1, 3.5.0
>            Reporter: Ivan Sadikov
>            Priority: Major
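For context, the type mapping described above takes effect when Spark generates the CREATE TABLE statement for a JDBC write. A possible stopgap on affected versions, sketched below in Python, is to supply explicit column types through the JDBC writer's `createTableColumnTypes` option so that string columns are created with a bounded VARCHAR type rather than the dialect default. The helper name `varchar_column_types`, the column names, and the length of 255 are hypothetical. Treat this as an assumption about a workaround, not something the issue itself confirms. Values longer than the chosen length would be rejected by the database.

```python
def varchar_column_types(columns, length=255):
    """Build a value for Spark's JDBC `createTableColumnTypes` option.

    Each column is declared as VARCHAR(length). The helper only builds the
    option string; it does not talk to Spark or to the database.
    """
    if length <= 0:
        raise ValueError("length must be positive")
    return ", ".join(f"{name} VARCHAR({length})" for name in columns)


# Hypothetical column names, for illustration only.
string_columns = ["first_name", "last_name", "city"]
column_types = varchar_column_types(string_columns, length=255)
print(column_types)

# With a real DataFrame `df` and Oracle connection details (not shown here),
# the string would be passed to the JDBC writer roughly like this:
#   df.write.format("jdbc")
#       .option("createTableColumnTypes", column_types)
#       ... url, dbtable, credentials ...
#       .save()
```

The option only affects table creation, so it has no effect when writing into a table that already exists with CLOB columns.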