GitHub user srowen commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14683#discussion_r75576508
  
    --- Diff: docs/sql-programming-guide.md ---
    @@ -1058,6 +1058,20 @@ the Data Sources API. The following options are supported:
          The JDBC fetch size, which determines how many rows to fetch per round trip. This can help performance on JDBC drivers which default to low fetch size (eg. Oracle with 10 rows).
         </td>
       </tr>
    +  
    +  <tr>
    +    <td><code>truncate</code></td>
    +    <td>
    +     This is a JDBC writer related option. To truncate the existing table before inserting the new data. This option only works with <code>SaveMode.Overwrite</code>. Without this option, Spark will drop the entire table, including its table definitions as well. <code>truncate</code> way is more efficient and ideal for cleaning out data from existing temp table. Its default value is <code>false</code>.
    --- End diff ---
    
    May I suggest somewhat different text here? I'd like to emphasize why you _wouldn't_ use it:
    
    When `SaveMode.Overwrite` is enabled, this option causes Spark to truncate an existing table instead of dropping and recreating it. This can be more efficient, and prevents the table metadata (e.g. indices) from being removed. However, it will not work in some cases, such as when the new data has a different schema. It defaults to `false`.
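
    For illustration, a minimal sketch of what the option looks like from the writer side. The JDBC URL, table name, input path, and connection properties below are placeholders, not anything from this PR:

    ```scala
    import java.util.Properties

    import org.apache.spark.sql.{SaveMode, SparkSession}

    val spark = SparkSession.builder().appName("JdbcTruncateSketch").getOrCreate()

    // Placeholder input; any DataFrame whose schema matches the target table works.
    val df = spark.read.parquet("/tmp/input")

    // With truncate=true, Overwrite truncates the existing table rather than
    // dropping and recreating it, so metadata such as indices is preserved.
    df.write
      .mode(SaveMode.Overwrite)
      .option("truncate", "true")
      .jdbc("jdbc:postgresql://host:5432/db", "target_table", new Properties())
    ```

    If the incoming DataFrame's schema differs from the target table's, the truncate-based path can't apply, which is the caveat the suggested text calls out.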

