[
https://issues.apache.org/jira/browse/PARQUET-2117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17509355#comment-17509355
]
ASF GitHub Bot commented on PARQUET-2117:
-
shangxinli merged pull request #945:
URL: https://github.com/apache/parquet-mr/pull/945
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: dev-unsubs
[
https://issues.apache.org/jira/browse/PARQUET-2042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17509336#comment-17509336
]
ASF GitHub Bot commented on PARQUET-2042:
-
shangxinli commented on pull request #900:
URL: https://github.com/apache/parquet-mr/pull/900#issuecomment-1073074540
Can you squash all the commits?
--
[
https://issues.apache.org/jira/browse/PARQUET-2006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17509332#comment-17509332
]
ASF GitHub Bot commented on PARQUET-2006:
-
shangxinli commented on pull request #950:
URL: https://github.com/apache/parquet-mr/pull/950#issuecomment-1073074030
Hi @huaxingao, thanks for working on it. I just did a first-round review
and left some comments. Once they are addressed, I will have another look.
--
I can take your comment two ways: what is the downside of large pages, or
what is the downside of small row groups?
One of the key considerations I've dealt with is that a page is the unit of
compression and, if I recall correctly, Parquet uses block rather than
stream compression. This means you typi
Hi,
I am trying to understand the benefits of using multiple data pages and
indexes vs multiple row groups.
Some basics first:
Row groups ensure that a sequence of rows is "aligned" at the group
boundary, independently of how the rows are divided into pages:
row group 1:
c1: |--p11--|--p12--|---p13-