[jira] [Commented] (PARQUET-2372) Avoid unnecessary reading of RowGroup data during rewriting

2023-11-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17784255#comment-17784255 ] ASF GitHub Bot commented on PARQUET-2372: - ConeyLiu commented on PR #1183: URL:

Re: [PR] PARQUET-2372: Avoid unnecessary reading of RowGroup data during rewriting [parquet-mr]

2023-11-08 Thread via GitHub
ConeyLiu commented on PR #1183: URL: https://github.com/apache/parquet-mr/pull/1183#issuecomment-1803078643 Thanks @wgtmac -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

[jira] [Resolved] (PARQUET-2321) allow customized buffer size when creating ArrowInputStream for a column PageReader

2023-11-08 Thread Jinpeng Zhou (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jinpeng Zhou resolved PARQUET-2321. --- Resolution: Won't Do > allow customized buffer size when creating ArrowInputStream for a

[jira] [Commented] (PARQUET-2321) allow customized buffer size when creating ArrowInputStream for a column PageReader

2023-11-08 Thread Jinpeng Zhou (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17784252#comment-17784252 ] Jinpeng Zhou commented on PARQUET-2321: ---  I think we can close this one for now as it may not

[jira] [Commented] (PARQUET-2371) Resolve japicmp failure for CI

2023-11-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17784157#comment-17784157 ] ASF GitHub Bot commented on PARQUET-2371: - gszadovszky commented on PR #1181: URL:

Re: [PR] PARQUET-2371: Resolve japicmp CI failure [parquet-mr]

2023-11-08 Thread via GitHub
gszadovszky commented on PR #1181: URL: https://github.com/apache/parquet-mr/pull/1181#issuecomment-1802357813 Thanks for the clarification, @amousavigourabi. We cannot do anything but excluding these classes, then, I'm afraid. But the comment is a bit misleading. Have you tried creating

[jira] [Commented] (PARQUET-2371) Resolve japicmp failure for CI

2023-11-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17784139#comment-17784139 ] ASF GitHub Bot commented on PARQUET-2371: - amousavigourabi commented on PR #1181: URL:

Re: [PR] PARQUET-2371: Resolve japicmp CI failure [parquet-mr]

2023-11-08 Thread via GitHub
amousavigourabi commented on PR #1181: URL: https://github.com/apache/parquet-mr/pull/1181#issuecomment-1802300343 @gszadovszky, the issue at hand was not that adding methods to already existing interfaces caused failures. As you noted, this would be perfectly fine and expected behaviour.

Re: [VOTE][FORMAT] Add repetition, definition and variable length size metadata statistics

2023-11-08 Thread Gábor Szádovszky
+1 (binding) Cheers, Gabor On 2023/11/07 02:46:37 Xinli shang wrote: > +1 (binding) > > On Mon, Nov 6, 2023 at 4:56 PM Gang Wu wrote: > > > +1 (non-binding) > > > > Best, > > Gang > > > > On Tue, Nov 7, 2023 at 3:57 AM Ed Seidl wrote: > > > > > +1 (non-binding) > > > > > > Thanks! > > > Ed >

[jira] [Commented] (PARQUET-2371) Resolve japicmp failure for CI

2023-11-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17784132#comment-17784132 ] ASF GitHub Bot commented on PARQUET-2371: - gszadovszky commented on PR #1181: URL:

Re: [PR] PARQUET-2371: Resolve japicmp CI failure [parquet-mr]

2023-11-08 Thread via GitHub
gszadovszky commented on PR #1181: URL: https://github.com/apache/parquet-mr/pull/1181#issuecomment-1802279769 Sorry @wgtmac, @ConeyLiu, for the late response. I don't think it is a bug in japicmp to fail with a new method in an existing interface. It would fail a 3rd party if it

[jira] [Updated] (PARQUET-1646) [C++] Use arrow::Buffer for buffered dictionary indices in DictEncoder instead of std::vector

2023-11-08 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated PARQUET-1646: Fix Version/s: cpp-15.0.0 (was: cpp-14.0.0) > [C++] Use

[jira] [Updated] (PARQUET-2099) [C++] Statistics::num_values() is misleading

2023-11-08 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated PARQUET-2099: Fix Version/s: cpp-15.0.0 (was: cpp-14.0.0) > [C++]

[jira] [Updated] (PARQUET-2321) allow customized buffer size when creating ArrowInputStream for a column PageReader

2023-11-08 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated PARQUET-2321: Fix Version/s: cpp-15.0.0 (was: cpp-14.0.0) > allow customized

[jira] [Commented] (PARQUET-2372) Avoid unnecessary reading of RowGroup data during rewriting

2023-11-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17784103#comment-17784103 ] ASF GitHub Bot commented on PARQUET-2372: - wgtmac merged PR #1183: URL:

[jira] [Resolved] (PARQUET-2372) Avoid unnecessary reading of RowGroup data during rewriting

2023-11-08 Thread Gang Wu (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Wu resolved PARQUET-2372. -- Fix Version/s: 1.14.0 Assignee: Xianyang Liu Resolution: Fixed > Avoid unnecessary

Re: [PR] PARQUET-2372: Avoid unnecessary reading of RowGroup data during rewriting [parquet-mr]

2023-11-08 Thread via GitHub
wgtmac merged PR #1183: URL: https://github.com/apache/parquet-mr/pull/1183 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[jira] [Commented] (PARQUET-2261) [Format] Add statistics that reflect decoded size to metadata

2023-11-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17784098#comment-17784098 ] ASF GitHub Bot commented on PARQUET-2261: - wgtmac commented on PR #197: URL:

Re: [PR] PARQUET-2261: add statistics for better estimating unencoded/uncompressed sizes and finer grained filtering [parquet-format]

2023-11-08 Thread via GitHub
wgtmac commented on PR #197: URL: https://github.com/apache/parquet-format/pull/197#issuecomment-1802124276 Could you help the vote on mailing list: https://lists.apache.org/thread/wgobz41mfldbhqpg9q4mdwypghg2cxg2? Help is needed from the PMC members. @ggershinsky @gszadovszky @julienledem

[jira] [Resolved] (PARQUET-2364) Encrypt all columns option

2023-11-08 Thread Gidon Gershinsky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gidon Gershinsky resolved PARQUET-2364. --- Fix Version/s: 1.14.0 Resolution: Fixed > Encrypt all columns option >

[jira] [Resolved] (PARQUET-2370) Crypto factory activation of "all column encryption" mode

2023-11-08 Thread Gidon Gershinsky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gidon Gershinsky resolved PARQUET-2370. --- Resolution: Fixed > Crypto factory activation of "all column encryption" mode >

[jira] [Updated] (PARQUET-2370) Crypto factory activation of "all column encryption" mode

2023-11-08 Thread Gidon Gershinsky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gidon Gershinsky updated PARQUET-2370: -- Fix Version/s: 1.14.0 > Crypto factory activation of "all column encryption" mode >

[jira] [Commented] (PARQUET-2274) Remove Yetus

2023-11-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17783934#comment-17783934 ] ASF GitHub Bot commented on PARQUET-2274: - Fokko merged PR #1056: URL:

[jira] [Resolved] (PARQUET-2274) Remove Yetus

2023-11-08 Thread Fokko Driesprong (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Fokko Driesprong resolved PARQUET-2274. --- Resolution: Fixed > Remove Yetus > - > > Key:

[jira] [Commented] (PARQUET-2274) Remove Yetus

2023-11-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17783932#comment-17783932 ] ASF GitHub Bot commented on PARQUET-2274: - Fokko commented on PR #1056: URL:

Re: [PR] PARQUET-2274: Remove Yetus [parquet-mr]

2023-11-08 Thread via GitHub
Fokko commented on PR #1056: URL: https://github.com/apache/parquet-mr/pull/1056#issuecomment-1801331950 Thank you @sekikn for confirming. I'll move this forward. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

[jira] [Commented] (PARQUET-2355) Deprecate parquet-thrift

2023-11-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17783930#comment-17783930 ] ASF GitHub Bot commented on PARQUET-2355: - Fokko closed pull request #1175: PARQUET-2355:

[jira] [Commented] (PARQUET-2355) Deprecate parquet-thrift

2023-11-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17783931#comment-17783931 ] ASF GitHub Bot commented on PARQUET-2355: - Fokko commented on PR #1175: URL:

Re: [PR] PARQUET-2274: Remove Yetus [parquet-mr]

2023-11-08 Thread via GitHub
Fokko merged PR #1056: URL: https://github.com/apache/parquet-mr/pull/1056 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] PARQUET-2355: Deprecate `parquet-thrift` [parquet-mr]

2023-11-08 Thread via GitHub
Fokko closed pull request #1175: PARQUET-2355: Deprecate `parquet-thrift` URL: https://github.com/apache/parquet-mr/pull/1175 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

Re: [PR] PARQUET-2355: Deprecate `parquet-thrift` [parquet-mr]

2023-11-08 Thread via GitHub
Fokko commented on PR #1175: URL: https://github.com/apache/parquet-mr/pull/1175#issuecomment-1801330968 @SinghAsDev and @tlazaro I think we can make it work with the latest version of Elephantbird. Let me close this for now. -- This is an automated message from the Apache Git Service.

[jira] [Commented] (PARQUET-2355) Deprecate parquet-thrift

2023-11-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17783928#comment-17783928 ] ASF GitHub Bot commented on PARQUET-2355: - Fokko closed pull request #1158: PARQUET-2355:

Re: [PR] PARQUET-2355: Remove `parquet-thrift` [parquet-mr]

2023-11-08 Thread via GitHub
Fokko closed pull request #1158: PARQUET-2355: Remove `parquet-thrift` URL: https://github.com/apache/parquet-mr/pull/1158 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

Re: [PR] Bump Elephantbird [parquet-mr]

2023-11-08 Thread via GitHub
Fokko closed pull request #1156: Bump Elephantbird URL: https://github.com/apache/parquet-mr/pull/1156 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: