[ 
https://issues.apache.org/jira/browse/BEAM-12715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anonymous updated BEAM-12715:
-----------------------------
    Status: Triage Needed  (was: Resolved)

> SnowflakeWrite fails in batch mode when the number of shards is > 1000
> ----------------------------------------------------------------------
>
>                 Key: BEAM-12715
>                 URL: https://issues.apache.org/jira/browse/BEAM-12715
>             Project: Beam
>          Issue Type: Bug
>          Components: io-java-snowflake
>            Reporter: Daniel Mateus Pires
>            Priority: P2
>             Fix For: 2.33.0
>
>          Time Spent: 1h
>  Remaining Estimate: 0h
>
> When writing to Snowflake in batch mode, if the number of files to import is 
> more than 1000, the load will fail
> From the Snowflake docs
> {quote}Of the three options for identifying/specifying data files to load 
> from a stage, providing a discrete list of files is generally the fastest; 
> however, the FILES parameter supports a maximum of 1,000 files, meaning a 
> COPY command executed with the FILES parameter can only load up to 1,000 
> files.
> {quote}
> I noticed that the Snowflake Write in batch mode ignores the number of shards 
> set by the user, and I think the first step should be to get the number of 
> shards before writing.
> Longer term, should Beam issue multiple COPY statements with a distinct list 
> of files when the number of files is more than 1000? Maybe inside the same 
> transaction (BEGIN; END; block)
>  
> Also, I wanted to set the Jira issue component as io-java-snowflake but it 
> does not exist



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to