[jira] [Commented] (PARQUET-1369) [Python] Unavailable Parquet column statistics from Spark-generated file

2018-08-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584436#comment-16584436 ] ASF GitHub Bot commented on PARQUET-1369: rgruener opened a new pull request #491:

[jira] [Updated] (PARQUET-1369) [Python] Unavailable Parquet column statistics from Spark-generated file

2018-08-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated PARQUET-1369: Labels: parquet pull-request-available (was: parquet) > [Python] Unavailable Parquet

[jira] [Assigned] (PARQUET-1256) [C++] Add --print-key-value-metadata option to parquet_reader tool

2018-08-17 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1256: Assignee: Jacek Pliszka > [C++] Add --print-key-value-metadata option to

[jira] [Resolved] (PARQUET-1256) [C++] Add --print-key-value-metadata option to parquet_reader tool

2018-08-17 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1256. Resolution: Fixed Issue resolved by pull request 450

[jira] [Commented] (PARQUET-1256) [C++] Add --print-key-value-metadata option to parquet_reader tool

2018-08-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584429#comment-16584429 ] ASF GitHub Bot commented on PARQUET-1256: wesm closed pull request #450: PARQUET-1256: Add

[jira] [Updated] (PARQUET-1256) [C++] Add --print-key-value-metadata option to parquet_reader tool

2018-08-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated PARQUET-1256: Labels: patch pull-request-available (was: patch) > [C++] Add

[jira] [Comment Edited] (PARQUET-1370) [C++] Read consecutive column chunks in a single scan

2018-08-17 Thread Robert Gruener (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584397#comment-16584397 ] Robert Gruener edited comment on PARQUET-1370 at 8/17/18 9:20 PM:

[jira] [Commented] (PARQUET-1370) [C++] Read consecutive column chunks in a single scan

2018-08-17 Thread Robert Gruener (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584397#comment-16584397 ] Robert Gruener commented on PARQUET-1370: That seems to only be the case for python3. Do the

[jira] [Commented] (PARQUET-1384) [C++] Clang compiler warnings in bloom_filter-test.cc

2018-08-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584161#comment-16584161 ] ASF GitHub Bot commented on PARQUET-1384: wesm closed pull request #490: PARQUET-1384: fix

Re: Parquet sync meeting minutes

2018-08-17 Thread Zoltan Ivanfi
Hi, Sorry, that was an error on my side: I suggested that Nandor add a TLDR section with this title. I agree with your comment, Wes; "outcome" would have been a better choice of word than "decision". Br, Zoltan On Fri, Aug 17, 2018 at 6:36 PM Wes McKinney wrote: > hi Nandor, > > A fine detail, and

Re: Parquet sync meeting minutes

2018-08-17 Thread Wes McKinney
hi Nandor, A fine detail, and I may be wrong, but I don't think decisions can technically be made on a call, because time zones do not always permit everyone to join and not all collaborators are comfortable having live discussions in English. See [1]. You can present the consensus of the

[jira] [Commented] (PARQUET-1389) Improve value skipping at page synchronization

2018-08-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584107#comment-16584107 ] ASF GitHub Bot commented on PARQUET-1389: gszadovszky opened a new pull request #514:

[jira] [Updated] (PARQUET-1389) Improve value skipping at page synchronization

2018-08-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated PARQUET-1389: Labels: pull-request-available (was: ) > Improve value skipping at page synchronization

[jira] [Resolved] (PARQUET-1310) Column indexes: Filtering

2018-08-17 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1310. Resolution: Fixed > Column indexes: Filtering

[jira] [Commented] (PARQUET-1384) [C++] Clang compiler warnings in bloom_filter-test.cc

2018-08-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583997#comment-16583997 ] ASF GitHub Bot commented on PARQUET-1384: cjjnjust closed pull request #488: PARQUET-1384: fix

[jira] [Commented] (PARQUET-1384) [C++] Clang compiler warnings in bloom_filter-test.cc

2018-08-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583996#comment-16583996 ] ASF GitHub Bot commented on PARQUET-1384: cjjnjust opened a new pull request #490:

[jira] [Commented] (PARQUET-1385) [C++] bloom_filter-test is very slow under valgrind

2018-08-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583976#comment-16583976 ] ASF GitHub Bot commented on PARQUET-1385: wesm closed pull request #489: PARQUET-1385: Do not

[jira] [Resolved] (PARQUET-1382) [C++] Prepare for arrow::test namespace removal

2018-08-17 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1382. Resolution: Fixed Fix Version/s: cpp-1.5.0 Issue resolved by pull request 487

[jira] [Commented] (PARQUET-1382) [C++] Prepare for arrow::test namespace removal

2018-08-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1382?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583972#comment-16583972 ] ASF GitHub Bot commented on PARQUET-1382: wesm closed pull request #487: PARQUET-1382: [C++]

[jira] [Created] (PARQUET-1389) Improve value skipping at page synchronization

2018-08-17 Thread Gabor Szadovszky (JIRA)
Gabor Szadovszky created PARQUET-1389: Summary: Improve value skipping at page synchronization Key: PARQUET-1389 URL: https://issues.apache.org/jira/browse/PARQUET-1389 Project: Parquet

Parquet sync meeting minutes

2018-08-17 Thread Nandor Kollar
Topics discussed and decisions (meeting held on 2018 August 15th, at 6 pm CET / 9 am PST): - Aligning page row boundaries between different columns: Debated, please follow up - Remove Java specific code from parquet-format: Accepted - Column encryption: Please review - Parquet-format release:

[jira] [Commented] (PARQUET-1383) Parquet tools should print logical type instead of (or besides) original type

2018-08-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583826#comment-16583826 ] ASF GitHub Bot commented on PARQUET-1383: nandorKollar opened a new pull request #513:

[jira] [Updated] (PARQUET-1383) Parquet tools should print logical type instead of (or besides) original type

2018-08-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated PARQUET-1383: Labels: pull-request-available (was: ) > Parquet tools should print logical type

[jira] [Commented] (PARQUET-1387) Nanosecond precision time and timestamp - parquet-format

2018-08-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583819#comment-16583819 ] ASF GitHub Bot commented on PARQUET-1387: nandorKollar opened a new pull request #102:

[jira] [Updated] (PARQUET-1387) Nanosecond precision time and timestamp - parquet-format

2018-08-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated PARQUET-1387: Labels: pull-request-available (was: ) > Nanosecond precision time and timestamp -

[jira] [Created] (PARQUET-1388) Nanosecond precision time and timestamp - parquet-mr

2018-08-17 Thread Nandor Kollar (JIRA)
Nandor Kollar created PARQUET-1388: Summary: Nanosecond precision time and timestamp - parquet-mr Key: PARQUET-1388 URL: https://issues.apache.org/jira/browse/PARQUET-1388 Project: Parquet

[jira] [Updated] (PARQUET-1387) Nanosecond precision time and timestamp - parquet-format

2018-08-17 Thread Nandor Kollar (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nandor Kollar updated PARQUET-1387: Fix Version/s: (was: format-2.6.0) > Nanosecond precision time and timestamp -

[jira] [Updated] (PARQUET-1387) Nanosecond precision time and timestamp - parquet-format

2018-08-17 Thread Nandor Kollar (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nandor Kollar updated PARQUET-1387: Fix Version/s: format-2.6.0 > Nanosecond precision time and timestamp - parquet-format

[jira] [Created] (PARQUET-1387) Nanosecond precision time and timestamp - parquet-format

2018-08-17 Thread Nandor Kollar (JIRA)
Nandor Kollar created PARQUET-1387: Summary: Nanosecond precision time and timestamp - parquet-format Key: PARQUET-1387 URL: https://issues.apache.org/jira/browse/PARQUET-1387 Project: Parquet

[jira] [Created] (PARQUET-1386) Fix issues of NaN and +-0.0 in case of float/double column indexes

2018-08-17 Thread Gabor Szadovszky (JIRA)
Gabor Szadovszky created PARQUET-1386: Summary: Fix issues of NaN and +-0.0 in case of float/double column indexes Key: PARQUET-1386 URL: https://issues.apache.org/jira/browse/PARQUET-1386

[jira] [Commented] (PARQUET-1385) [C++] bloom_filter-test is very slow under valgrind

2018-08-17 Thread Junjie Chen (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583489#comment-16583489 ] Junjie Chen commented on PARQUET-1385: std::seed_seq::generate takes more than 75% cpu cycles

[jira] [Resolved] (PARQUET-1308) [C++] parquet::arrow should use thread pool, not ParallelFor

2018-08-17 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1308. Resolution: Fixed Fix Version/s: cpp-1.5.0 Issue resolved by pull request 467

[jira] [Commented] (PARQUET-1308) [C++] parquet::arrow should use thread pool, not ParallelFor

2018-08-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583431#comment-16583431 ] ASF GitHub Bot commented on PARQUET-1308: wesm closed pull request #467: PARQUET-1308: [C++]

[jira] [Updated] (PARQUET-1308) [C++] parquet::arrow should use thread pool, not ParallelFor

2018-08-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated PARQUET-1308: Labels: pull-request-available (was: ) > [C++] parquet::arrow should use thread pool,

[jira] [Commented] (PARQUET-1385) [C++] bloom_filter-test is very slow under valgrind

2018-08-17 Thread Junjie Chen (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583425#comment-16583425 ] Junjie Chen commented on PARQUET-1385: The GetRandomString function is very slow, I can change to

[jira] [Updated] (PARQUET-1384) [C++] Clang compiler warnings in bloom_filter-test.cc

2018-08-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated PARQUET-1384: Labels: pull-request-available (was: ) > [C++] Clang compiler warnings in

[jira] [Updated] (PARQUET-1385) [C++] bloom_filter-test is very slow under valgrind

2018-08-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated PARQUET-1385: Labels: pull-request-available (was: ) > [C++] bloom_filter-test is very slow under

[jira] [Commented] (PARQUET-1385) [C++] bloom_filter-test is very slow under valgrind

2018-08-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583419#comment-16583419 ] ASF GitHub Bot commented on PARQUET-1385: wesm opened a new pull request #489: PARQUET-1385:

[jira] [Commented] (PARQUET-1384) [C++] Clang compiler warnings in bloom_filter-test.cc

2018-08-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583417#comment-16583417 ] ASF GitHub Bot commented on PARQUET-1384: cjjnjust opened a new pull request #488:

[jira] [Assigned] (PARQUET-1385) [C++] bloom_filter-test is very slow under valgrind

2018-08-17 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1385: Assignee: Wes McKinney > [C++] bloom_filter-test is very slow under valgrind

[jira] [Created] (PARQUET-1385) [C++] bloom_filter-test is very slow under valgrind

2018-08-17 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1385: Summary: [C++] bloom_filter-test is very slow under valgrind Key: PARQUET-1385 URL: https://issues.apache.org/jira/browse/PARQUET-1385 Project: Parquet

[jira] [Updated] (PARQUET-1380) [C++] move Bloom filter test binary to parquet-testing repo

2018-08-17 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1380: Summary: [C++] move Bloom filter test binary to parquet-testing repo (was: move Bloom