[ 
https://issues.apache.org/jira/browse/GOBBLIN-968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chirag Todarka updated GOBBLIN-968:
-----------------------------------
    Labels: easyfix  (was: PR easyfix)

> Honor file split size for HadoopFileInputSource
> -----------------------------------------------
>
>                 Key: GOBBLIN-968
>                 URL: https://issues.apache.org/jira/browse/GOBBLIN-968
>             Project: Apache Gobblin
>          Issue Type: Bug
>          Components: gobblin-core
>    Affects Versions: 0.14.0
>            Reporter: Chirag Todarka
>            Assignee: Abhishek Tiwari
>            Priority: Major
>              Labels: easyfix
>   Original Estimate: 24h
>          Time Spent: 10m
>  Remaining Estimate: 23h 50m
>
> For HadoopFileInputSource it is splitting the source file only in default 
> chunk size (which is 128MB) and not honoring the min and mix split size 
> mentioned in pull file.
> For example even if below entires are present in pull file it will still 
> chunk the source file in smaller 134217728 bytes files and not in 268435456 
> bytes files.
> {{source.hadoop.file.input.split.minsize=268435456}}
> {{source.hadoop.file.input.split.maxsize=268435456}}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to