[jira] [Updated] (HIVE-13291) ORC BI Split strategy should consider block size instead of file size

2016-03-22 Thread Prasanth Jayachandran (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-13291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Prasanth Jayachandran updated HIVE-13291:
-
Fix Version/s: 1.3.0

> ORC BI Split strategy should consider block size instead of file size
> -
>
> Key: HIVE-13291
> URL: https://issues.apache.org/jira/browse/HIVE-13291
> Project: Hive
> Issue Type: Bug
> Components: ORC
> Affects Versions: 2.1.0
> Reporter: Gopal V
> Assignee: Prasanth Jayachandran
> Fix For: 1.3.0, 2.1.0
>
> Attachments: HIVE-13291-branch-1.patch, HIVE-13291.1.patch,
> HIVE-13291.2.patch, HIVE-13291.3.patch
>
>
> When the split strategy is forced to "BI" (using
> hive.exec.orc.split.strategy), the entire file is treated as a single split.
> This can be inefficient when the files are large. Instead, BI should
> consider splitting at block boundaries.
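
As an illustration of the behavior requested above, here is a minimal sketch of cutting one file into splits at block boundaries instead of emitting a single whole-file split. This is not the code from the attached patches: the class name BlockSplitSketch, the Split type, and the method splitAtBlockBoundaries are hypothetical, and the sketch works on plain file and block lengths rather than on Hive or HDFS APIs.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not the HIVE-13291 patch: one split per block-sized
// byte range of a file, instead of one split covering the whole file.
public class BlockSplitSketch {

    /** Hypothetical value type: the byte range [offset, offset + length) of one file. */
    public static final class Split {
        public final long offset;
        public final long length;

        public Split(long offset, long length) {
            this.offset = offset;
            this.length = length;
        }

        @Override
        public String toString() {
            return "Split(offset=" + offset + ", length=" + length + ")";
        }
    }

    /**
     * Returns one split per block. Every split starts on a multiple of
     * blockSize; the last split covers whatever remains of the file.
     */
    public static List<Split> splitAtBlockBoundaries(long fileLength, long blockSize) {
        if (fileLength < 0) {
            throw new IllegalArgumentException("fileLength must not be negative");
        }
        if (blockSize <= 0) {
            throw new IllegalArgumentException("blockSize must be positive");
        }
        List<Split> splits = new ArrayList<>();
        for (long offset = 0; offset < fileLength; offset += blockSize) {
            long length = Math.min(blockSize, fileLength - offset);
            splits.add(new Split(offset, length));
        }
        return splits;
    }

    public static void main(String[] args) {
        long mb = 1024L * 1024L;
        // Example figures only: a 600 MB file with a 256 MB block size.
        for (Split s : splitAtBlockBoundaries(600 * mb, 256 * mb)) {
            System.out.println(s);
        }
    }
}
```

With the example figures in main, a file that would otherwise be a single split is cut into three ranges, so three tasks can each read one block's worth of data. A file no larger than one block still yields a single split, and an empty file yields none.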



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-13291) ORC BI Split strategy should consider block size instead of file size

2016-03-22 Thread Prasanth Jayachandran (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-13291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Prasanth Jayachandran updated HIVE-13291:
-
Attachment: HIVE-13291-branch-1.patch

Committed the attached patch to branch-1.

> ORC BI Split strategy should consider block size instead of file size
> -
>
> Key: HIVE-13291
> URL: https://issues.apache.org/jira/browse/HIVE-13291
> Project: Hive
> Issue Type: Bug
> Components: ORC
> Affects Versions: 2.1.0
> Reporter: Gopal V
> Assignee: Prasanth Jayachandran
> Fix For: 1.3.0, 2.1.0
>
> Attachments: HIVE-13291-branch-1.patch, HIVE-13291.1.patch,
> HIVE-13291.2.patch, HIVE-13291.3.patch
>
>
> When the split strategy is forced to "BI" (using
> hive.exec.orc.split.strategy), the entire file is treated as a single split.
> This can be inefficient when the files are large. Instead, BI should
> consider splitting at block boundaries.





[jira] [Updated] (HIVE-13291) ORC BI Split strategy should consider block size instead of file size

2016-03-21 Thread Prasanth Jayachandran (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-13291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Prasanth Jayachandran updated HIVE-13291:
-
Resolution: Fixed
Fix Version/s: 2.1.0
Status: Resolved (was: Patch Available)

Committed to master

> ORC BI Split strategy should consider block size instead of file size
> -
>
> Key: HIVE-13291
> URL: https://issues.apache.org/jira/browse/HIVE-13291
> Project: Hive
> Issue Type: Bug
> Components: ORC
> Affects Versions: 2.1.0
> Reporter: Gopal V
> Assignee: Prasanth Jayachandran
> Fix For: 2.1.0
>
> Attachments: HIVE-13291.1.patch, HIVE-13291.2.patch,
> HIVE-13291.3.patch
>
>
> When the split strategy is forced to "BI" (using
> hive.exec.orc.split.strategy), the entire file is treated as a single split.
> This can be inefficient when the files are large. Instead, BI should
> consider splitting at block boundaries.





[jira] [Updated] (HIVE-13291) ORC BI Split strategy should consider block size instead of file size

2016-03-15 Thread Prasanth Jayachandran (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-13291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Prasanth Jayachandran updated HIVE-13291:
-
Attachment: HIVE-13291.3.patch

> ORC BI Split strategy should consider block size instead of file size
> -
>
> Key: HIVE-13291
> URL: https://issues.apache.org/jira/browse/HIVE-13291
> Project: Hive
> Issue Type: Bug
> Components: ORC
> Affects Versions: 2.1.0
> Reporter: Gopal V
> Assignee: Prasanth Jayachandran
> Attachments: HIVE-13291.1.patch, HIVE-13291.2.patch,
> HIVE-13291.3.patch
>
>
> When the split strategy is forced to "BI" (using
> hive.exec.orc.split.strategy), the entire file is treated as a single split.
> This can be inefficient when the files are large. Instead, BI should
> consider splitting at block boundaries.





[jira] [Updated] (HIVE-13291) ORC BI Split strategy should consider block size instead of file size

2016-03-15 Thread Prasanth Jayachandran (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-13291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Prasanth Jayachandran updated HIVE-13291:
-
Attachment: (was: HIVE-13291.3.patch)

> ORC BI Split strategy should consider block size instead of file size
> -
>
> Key: HIVE-13291
> URL: https://issues.apache.org/jira/browse/HIVE-13291
> Project: Hive
> Issue Type: Bug
> Components: ORC
> Affects Versions: 2.1.0
> Reporter: Gopal V
> Assignee: Prasanth Jayachandran
> Attachments: HIVE-13291.1.patch, HIVE-13291.2.patch
>
>
> When the split strategy is forced to "BI" (using
> hive.exec.orc.split.strategy), the entire file is treated as a single split.
> This can be inefficient when the files are large. Instead, BI should
> consider splitting at block boundaries.





[jira] [Updated] (HIVE-13291) ORC BI Split strategy should consider block size instead of file size

2016-03-15 Thread Prasanth Jayachandran (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-13291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Prasanth Jayachandran updated HIVE-13291:
-
Attachment: HIVE-13291.3.patch

Addressed [~gopalv]'s Review Board (RB) comments.

> ORC BI Split strategy should consider block size instead of file size
> -
>
> Key: HIVE-13291
> URL: https://issues.apache.org/jira/browse/HIVE-13291
> Project: Hive
> Issue Type: Bug
> Components: ORC
> Affects Versions: 2.1.0
> Reporter: Gopal V
> Assignee: Prasanth Jayachandran
> Attachments: HIVE-13291.1.patch, HIVE-13291.2.patch
>
>
> When the split strategy is forced to "BI" (using
> hive.exec.orc.split.strategy), the entire file is treated as a single split.
> This can be inefficient when the files are large. Instead, BI should
> consider splitting at block boundaries.





[jira] [Updated] (HIVE-13291) ORC BI Split strategy should consider block size instead of file size

2016-03-15 Thread Prasanth Jayachandran (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-13291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Prasanth Jayachandran updated HIVE-13291:
-
Attachment: HIVE-13291.2.patch

> ORC BI Split strategy should consider block size instead of file size
> -
>
> Key: HIVE-13291
> URL: https://issues.apache.org/jira/browse/HIVE-13291
> Project: Hive
> Issue Type: Bug
> Components: ORC
> Affects Versions: 2.1.0
> Reporter: Gopal V
> Assignee: Prasanth Jayachandran
> Attachments: HIVE-13291.1.patch, HIVE-13291.2.patch
>
>
> When the split strategy is forced to "BI" (using
> hive.exec.orc.split.strategy), the entire file is treated as a single split.
> This can be inefficient when the files are large. Instead, BI should
> consider splitting at block boundaries.





[jira] [Updated] (HIVE-13291) ORC BI Split strategy should consider block size instead of file size

2016-03-15 Thread Prasanth Jayachandran (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-13291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Prasanth Jayachandran updated HIVE-13291:
-
Status: Patch Available  (was: Open)

> ORC BI Split strategy should consider block size instead of file size
> -
>
> Key: HIVE-13291
> URL: https://issues.apache.org/jira/browse/HIVE-13291
> Project: Hive
> Issue Type: Bug
> Components: ORC
> Affects Versions: 2.1.0
> Reporter: Gopal V
> Assignee: Prasanth Jayachandran
> Attachments: HIVE-13291.1.patch
>
>
> When the split strategy is forced to "BI" (using
> hive.exec.orc.split.strategy), the entire file is treated as a single split.
> This can be inefficient when the files are large. Instead, BI should
> consider splitting at block boundaries.





[jira] [Updated] (HIVE-13291) ORC BI Split strategy should consider block size instead of file size

2016-03-15 Thread Prasanth Jayachandran (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-13291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Prasanth Jayachandran updated HIVE-13291:
-
Attachment: HIVE-13291.1.patch

> ORC BI Split strategy should consider block size instead of file size
> -
>
> Key: HIVE-13291
> URL: https://issues.apache.org/jira/browse/HIVE-13291
> Project: Hive
> Issue Type: Bug
> Components: ORC
> Affects Versions: 2.1.0
> Reporter: Gopal V
> Assignee: Prasanth Jayachandran
> Attachments: HIVE-13291.1.patch
>
>
> When the split strategy is forced to "BI" (using
> hive.exec.orc.split.strategy), the entire file is treated as a single split.
> This can be inefficient when the files are large. Instead, BI should
> consider splitting at block boundaries.


