[jira] [Updated] (HIVE-23824) LLAP - add API to look up ORC metadata for certain Path

2020-07-14 Thread Peter Vary (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Peter Vary updated HIVE-23824:
--
Fix Version/s: 4.0.0
   Resolution: Fixed
   Status: Resolved  (was: Patch Available)

Pushed to master.

Thanks for the patch [~szita]!

> LLAP - add API to look up ORC metadata for certain Path
> ---
>
> Key: HIVE-23824
> URL: https://issues.apache.org/jira/browse/HIVE-23824
> Project: Hive
>  Issue Type: Improvement
>Reporter: Ádám Szita
>Assignee: Ádám Szita
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
>  Time Spent: 1h
>  Remaining Estimate: 0h
>
> LLAP IO supports caching but currently this is only done via LlapRecordReader 
> / using splits, aka good old mapreduce way.
> At certain times it would worth to leverage the caching of files on certain 
> paths, that are not necessarily associated with a record reader directly. An 
> example of this could be the caching of ACID delete delta files, as they are 
> currently being read without caching.
> With this patch we'd extend the LLAP API and offer another entry point for 
> retrieving metadata of ORC files.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HIVE-23824) LLAP - add API to look up ORC metadata for certain Path

2020-07-10 Thread Jira


 [ 
https://issues.apache.org/jira/browse/HIVE-23824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ádám Szita updated HIVE-23824:
--
Status: Patch Available  (was: Open)

> LLAP - add API to look up ORC metadata for certain Path
> ---
>
> Key: HIVE-23824
> URL: https://issues.apache.org/jira/browse/HIVE-23824
> Project: Hive
>  Issue Type: Improvement
>Reporter: Ádám Szita
>Assignee: Ádám Szita
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> LLAP IO supports caching but currently this is only done via LlapRecordReader 
> / using splits, aka good old mapreduce way.
> At certain times it would worth to leverage the caching of files on certain 
> paths, that are not necessarily associated with a record reader directly. An 
> example of this could be the caching of ACID delete delta files, as they are 
> currently being read without caching.
> With this patch we'd extend the LLAP API and offer another entry point for 
> retrieving metadata of ORC files.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HIVE-23824) LLAP - add API to look up ORC metadata for certain Path

2020-07-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HIVE-23824:
--
Labels: pull-request-available  (was: )

> LLAP - add API to look up ORC metadata for certain Path
> ---
>
> Key: HIVE-23824
> URL: https://issues.apache.org/jira/browse/HIVE-23824
> Project: Hive
>  Issue Type: Improvement
>Reporter: Ádám Szita
>Assignee: Ádám Szita
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> LLAP IO supports caching but currently this is only done via LlapRecordReader 
> / using splits, aka good old mapreduce way.
> At certain times it would worth to leverage the caching of files on certain 
> paths, that are not necessarily associated with a record reader directly. An 
> example of this could be the caching of ACID delete delta files, as they are 
> currently being read without caching.
> With this patch we'd extend the LLAP API and offer another entry point for 
> retrieving metadata of ORC files.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HIVE-23824) LLAP - add API to look up ORC metadata for certain Path

2020-07-10 Thread Jira


 [ 
https://issues.apache.org/jira/browse/HIVE-23824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ádám Szita updated HIVE-23824:
--
Description: 
LLAP IO supports caching but currently this is only done via LlapRecordReader / 
using splits, aka good old mapreduce way.

At certain times it would worth to leverage the caching of files on certain 
paths, that are not necessarily associated with a record reader directly. An 
example of this could be the caching of ACID delete delta files, as they are 
currently being read without caching.

With this patch we'd extend the LLAP API and offer another entry point for 
retrieving metadata of ORC files.

> LLAP - add API to look up ORC metadata for certain Path
> ---
>
> Key: HIVE-23824
> URL: https://issues.apache.org/jira/browse/HIVE-23824
> Project: Hive
>  Issue Type: Improvement
>Reporter: Ádám Szita
>Assignee: Ádám Szita
>Priority: Major
>
> LLAP IO supports caching but currently this is only done via LlapRecordReader 
> / using splits, aka good old mapreduce way.
> At certain times it would worth to leverage the caching of files on certain 
> paths, that are not necessarily associated with a record reader directly. An 
> example of this could be the caching of ACID delete delta files, as they are 
> currently being read without caching.
> With this patch we'd extend the LLAP API and offer another entry point for 
> retrieving metadata of ORC files.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)