[ 
https://issues.apache.org/jira/browse/PARQUET-2064?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17379001#comment-17379001
 ] 

ASF GitHub Bot commented on PARQUET-2064:
-----------------------------------------

gszadovszky commented on pull request #918:
URL: https://github.com/apache/parquet-mr/pull/918#issuecomment-878037608


   @shangxinli, unfortunately we still struggle with not having proper low 
level API for our clients. We did not add the ranges and such to the public API 
because they should not be required there. Since both Spark and Hive uses lower 
level APIs (that was not designed to be public originally) I don't think we 
have any other choice for now to make all the necessary classes/methods public.
   
   Meanwhile, I've added `RowRanges` to the package 
`org.apache.parquet.internal.filter2.columnindex` (note `internal`) to make it 
clear that even though the class is public it is not for our clients. So, if we 
really want to make this public we also need to move it to another package.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


> Make Range public accessible in RowRanges
> -----------------------------------------
>
>                 Key: PARQUET-2064
>                 URL: https://issues.apache.org/jira/browse/PARQUET-2064
>             Project: Parquet
>          Issue Type: New Feature
>            Reporter: Xinli Shang
>            Assignee: Xinli Shang
>            Priority: Major
>
> When rolling out to Presto, I found we need to know the boundaries of each 
> Range in RowRanges. It is still doable with Iterator but Presto has. batch 
> reader, we cannot use iterator for each row. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to