[ 
https://issues.apache.org/jira/browse/DERBY-4555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12836671#action_12836671
 ] 

Yair Lenga commented on DERBY-4555:
-----------------------------------

I really like the idea of exposing "Bulk Insert" to JDBC Clients. We have 
chosen the "SYSCS_IMPORT_TABLE" approach over plain SQL, because it deliver 10X 
speedup of upload time. In our case,  we have 20K rows of 30 columns: we went 
from ~40 seconds to less then 5 seconds.

In addition, I would like to emphasize that adding bulk import does not mean 
that we do not want the direct upload from delimited format. CSV format is the 
dominant input format. Being able to ship it "as-is" is a big helper, when we 
are writing data integration jobs with MS* products, uploading "BCP" like data 
from other databases, for test scripts, etc.

I think that both "Bulk Insert" feature, and more flexible delimited file 
upload are important features..

> Expand SYSCS_IMPORT_TABLE to accept CSV file with header lines
> --------------------------------------------------------------
>
>                 Key: DERBY-4555
>                 URL: https://issues.apache.org/jira/browse/DERBY-4555
>             Project: Derby
>          Issue Type: Improvement
>          Components: Miscellaneous
>            Reporter: Yair Lenga
>
> The SYSCS_IMPORT_TABLE (and SYSCS_IMPORT_DATA) function allow import of data 
> from external resources. In general, they can process CSV files that created 
> with various tools - with one exception: the header line.
> While there is no accepted standard, most tools will include a header line in 
> the CSV file with column names. This convention is supported in Excel and 
> many other tools.
> My Request: extend the SYSCS_IMPORT_TABLe and SYSCS_IMPORT_DATA (and other 
> related procedures) to include an extra indicator for the number of header 
> lines to be ignored.
> As an extra bonus it will be help is the SYSCS_IMPORT_DATA will accept column 
> names (instead of column indexes) in the 'COLUMNINDEXES' arguments. E.g., it 
> should be possible to indicate COLUMNINDEXES of '1,3,sales,5,'. This feature 
> will make it significantly easier to handle cases where the external input 
> files is extended to include additional columns.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to