[jira] [Resolved] (PARQUET-1218) [C++] More informative error message on too short pages

2018-02-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1218. --- Resolution: Fixed Resolved by PR

[jira] [Commented] (PARQUET-1218) [C++] More informative error message on too short pages

2018-02-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370876#comment-16370876 ] ASF GitHub Bot commented on PARQUET-1218: - wesm closed pull request #438: PARQUET-1218: More

[jira] [Commented] (PARQUET-1233) [CPP ]Enable option to switch between stl classes and boost classes for thrift header

2018-02-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370854#comment-16370854 ] ASF GitHub Bot commented on PARQUET-1233: - wesm commented on issue #443: PARQUET-1233: Enable

[jira] [Commented] (PARQUET-1233) [CPP ]Enable option to switch between stl classes and boost classes for thrift header

2018-02-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370847#comment-16370847 ] ASF GitHub Bot commented on PARQUET-1233: - wesm commented on issue #443: PARQUET-1233: Enable

[jira] [Commented] (PARQUET-1233) [CPP ]Enable option to switch between stl classes and boost classes for thrift header

2018-02-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370839#comment-16370839 ] ASF GitHub Bot commented on PARQUET-1233: - wesm commented on issue #443: PARQUET-1233: Enable

[jira] [Commented] (PARQUET-1225) NaN values may lead to incorrect filtering under certain circumstances

2018-02-20 Thread Deepak Majeti (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370767#comment-16370767 ] Deepak Majeti commented on PARQUET-1225: [~boroknagyz] I opened a PR here

[jira] [Commented] (PARQUET-1225) NaN values may lead to incorrect filtering under certain circumstances

2018-02-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370762#comment-16370762 ] ASF GitHub Bot commented on PARQUET-1225: - majetideepak opened a new pull request #444:

[jira] [Commented] (PARQUET-1225) NaN values may lead to incorrect filtering under certain circumstances

2018-02-20 Thread JIRA
[ https://issues.apache.org/jira/browse/PARQUET-1225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370706#comment-16370706 ] Zoltán Borók-Nagy commented on PARQUET-1225: Hi [~mdeepak], the proposed quick fix for

Re: [VOTE] Release Apache Parquet C++ 1.4.0 RC0

2018-02-20 Thread Zoltan Borok-Nagy
Hi, I'm implementing the quick fix for Impala. The current proposal for the write path fix is to behave like the fmax()/fmin() functions in math.h, ie. ignore NaNs, except for the case when all the values are NaN. http://en.cppreference.com/w/c/numeric/math/fmax

[jira] [Updated] (PARQUET-1224) Implement specification-compliant floating point comparison

2018-02-20 Thread Uwe L. Korn (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe L. Korn updated PARQUET-1224: - Fix Version/s: (was: cpp-1.4.0) > Implement specification-compliant floating point

[jira] [Updated] (PARQUET-1224) Implement specification-compliant floating point comparison

2018-02-20 Thread Uwe L. Korn (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe L. Korn updated PARQUET-1224: - Fix Version/s: cpp-1.4.0 > Implement specification-compliant floating point comparison >

[jira] [Commented] (PARQUET-1225) NaN values may lead to incorrect filtering under certain circumstances

2018-02-20 Thread Deepak Majeti (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370602#comment-16370602 ] Deepak Majeti commented on PARQUET-1225: Is Impala not handling the write path? > NaN values

[jira] [Commented] (PARQUET-1225) NaN values may lead to incorrect filtering under certain circumstances

2018-02-20 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370546#comment-16370546 ] Zoltan Ivanfi commented on PARQUET-1225: Please note there already is a [review request for an

[jira] [Commented] (PARQUET-1218) [C++] More informative error message on too short pages

2018-02-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370515#comment-16370515 ] ASF GitHub Bot commented on PARQUET-1218: - xhochy commented on a change in pull request #438:

[jira] [Commented] (PARQUET-860) ParquetWriter.getDataSize NullPointerException after closed

2018-02-20 Thread e.birukov (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370498#comment-16370498 ] e.birukov commented on PARQUET-860: --- [~rdblue], thank you for detailed answer. I adapt the architecture

[jira] [Commented] (PARQUET-860) ParquetWriter.getDataSize NullPointerException after closed

2018-02-20 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370458#comment-16370458 ] Ryan Blue commented on PARQUET-860: --- The S3 file system implementation should retry and recover if it

[jira] [Commented] (PARQUET-860) ParquetWriter.getDataSize NullPointerException after closed

2018-02-20 Thread e.birukov (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370435#comment-16370435 ] e.birukov commented on PARQUET-860: --- [~rdblue], You wrote "Most of the time, we assume that an

[jira] [Commented] (PARQUET-1233) [CPP ]Enable option to switch between stl classes and boost classes for thrift header

2018-02-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370414#comment-16370414 ] ASF GitHub Bot commented on PARQUET-1233: - majetideepak commented on a change in pull request

[jira] [Commented] (PARQUET-1233) [CPP ]Enable option to switch between stl classes and boost classes for thrift header

2018-02-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370413#comment-16370413 ] ASF GitHub Bot commented on PARQUET-1233: - majetideepak commented on a change in pull request

[jira] [Commented] (PARQUET-1233) [CPP ]Enable option to switch between stl classes and boost classes for thrift header

2018-02-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370412#comment-16370412 ] ASF GitHub Bot commented on PARQUET-1233: - majetideepak commented on a change in pull request

[jira] [Commented] (PARQUET-1233) [CPP ]Enable option to switch between stl classes and boost classes for thrift header

2018-02-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370411#comment-16370411 ] ASF GitHub Bot commented on PARQUET-1233: - majetideepak commented on a change in pull request

[jira] [Commented] (PARQUET-1233) [CPP ]Enable option to switch between stl classes and boost classes for thrift header

2018-02-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370409#comment-16370409 ] ASF GitHub Bot commented on PARQUET-1233: - majetideepak commented on a change in pull request

[jira] [Commented] (PARQUET-1233) [CPP ]Enable option to switch between stl classes and boost classes for thrift header

2018-02-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370408#comment-16370408 ] ASF GitHub Bot commented on PARQUET-1233: - majetideepak commented on a change in pull request

[jira] [Commented] (PARQUET-1233) [CPP ]Enable option to switch between stl classes and boost classes for thrift header

2018-02-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370333#comment-16370333 ] ASF GitHub Bot commented on PARQUET-1233: - wesm commented on a change in pull request #443:

[jira] [Commented] (PARQUET-1233) [CPP ]Enable option to switch between stl classes and boost classes for thrift header

2018-02-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370332#comment-16370332 ] ASF GitHub Bot commented on PARQUET-1233: - wesm commented on a change in pull request #443:

[jira] [Commented] (PARQUET-1219) [C++] Update release-candidate script links to gitbox

2018-02-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370325#comment-16370325 ] ASF GitHub Bot commented on PARQUET-1219: - wesm closed pull request #439: PARQUET-1219: Update

[jira] [Resolved] (PARQUET-1219) [C++] Update release-candidate script links to gitbox

2018-02-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1219. --- Resolution: Fixed Fix Version/s: cpp-1.4.0 Issue resolved by pull request 439

[jira] [Commented] (PARQUET-1221) [C++] Extend release README

2018-02-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370320#comment-16370320 ] ASF GitHub Bot commented on PARQUET-1221: - wesm closed pull request #441: PARQUET-1221: Extend

[jira] [Resolved] (PARQUET-1220) [C++] Don't build Thrift examples and tutorials in the ExternalProject

2018-02-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1220. --- Resolution: Fixed Fix Version/s: cpp-1.4.0 Issue resolved by pull request 440

[jira] [Commented] (PARQUET-1220) [C++] Don't build Thrift examples and tutorials in the ExternalProject

2018-02-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370323#comment-16370323 ] ASF GitHub Bot commented on PARQUET-1220: - wesm closed pull request #440: PARQUET-1220: Don't

[jira] [Resolved] (PARQUET-1221) [C++] Extend release README

2018-02-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1221. --- Resolution: Fixed Fix Version/s: cpp-1.4.0 Issue resolved by pull request 441

[jira] [Assigned] (PARQUET-1221) [C++] Extend release README

2018-02-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1221: - Assignee: Uwe L. Korn > [C++] Extend release README > --- > >

[jira] [Resolved] (PARQUET-1226) [C++] Fix new build warnings with clang 5.0

2018-02-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1226. --- Resolution: Fixed Resolved by PR

[jira] [Commented] (PARQUET-1226) [C++] Fix new build warnings with clang 5.0

2018-02-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370318#comment-16370318 ] ASF GitHub Bot commented on PARQUET-1226: - wesm closed pull request #442: PARQUET-1226: Fixes

[jira] [Commented] (PARQUET-860) ParquetWriter.getDataSize NullPointerException after closed

2018-02-20 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370317#comment-16370317 ] Ryan Blue commented on PARQUET-860: --- [~e.birukov], this issue is not related to the problem you're

Re: Inconsistent float/double sort order in spec and implementations can lead to incorrect results

2018-02-20 Thread Alexander Behm
Today, Impala does not evaluate " != " against stats, but as Zoltan pointed out there is a way to reasonably do that. It does not work if we ignore NaN though, so we need to be careful. On Tue, Feb 20, 2018 at 9:24 AM, Zoltan Ivanfi wrote: > In parquet-mr, if you are looking

Re: Inconsistent float/double sort order in spec and implementations can lead to incorrect results

2018-02-20 Thread Zoltan Ivanfi
In parquet-mr, if you are looking for a value that is not equal to some reference value r and stats are min = r and max = r then that row group is discarded, because there can not be any other values in that row group. On Tue, Feb 20, 2018 at 6:21 PM Jim Apple wrote: > For

Re: Inconsistent float/double sort order in spec and implementations can lead to incorrect results

2018-02-20 Thread Jim Apple
For that predicate in particular, does Impala use stats already? Let's say a column contains only the intuitive notion of floats: no NaNs, no infs, no -0.0. If we are filtering for $COL != a and the row-group stats are b <= $COL <= c, were a < b, we can know that the whole row group can be

Re: Inconsistent float/double sort order in spec and implementations can lead to incorrect results

2018-02-20 Thread Alexander Behm
On Mon, Feb 19, 2018 at 8:04 AM, Zoltan Ivanfi wrote: > Hi, > > Tim, I added your suggestion to introduce a new ColumnOrder to PARQUET-1222 > as the preferred > solution. > > Alex, not writing min/max if there is a NaN is

Re: [VOTE] Release Apache Parquet C++ 1.4.0 RC0

2018-02-20 Thread Uwe L. Korn
Due to the issues raised, I will close this RC and once all patches are merged, I will build a new one. Uwe On Tue, Feb 20, 2018, at 1:48 AM, Deepak Majeti wrote: > Wes, Zoltan, > > I am taking a look at the issue now. I will handle the patch for this one. > Thanks! > > On Tue, Feb 20, 2018

[jira] [Commented] (PARQUET-1233) [CPP ]Enable option to switch between stl classes and boost classes for thrift header

2018-02-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370165#comment-16370165 ] ASF GitHub Bot commented on PARQUET-1233: - majetideepak opened a new pull request #443:

[jira] [Updated] (PARQUET-1225) NaN values may lead to incorrect filtering under certain circumstances

2018-02-20 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi updated PARQUET-1225: --- Description: _This JIRA describes a generic problem with floating point comparisons that

[jira] [Created] (PARQUET-1233) [CPP ]Enable option to switch between stl classes and boost classes for thrift header

2018-02-20 Thread Deepak Majeti (JIRA)
Deepak Majeti created PARQUET-1233: -- Summary: [CPP ]Enable option to switch between stl classes and boost classes for thrift header Key: PARQUET-1233 URL: https://issues.apache.org/jira/browse/PARQUET-1233

[jira] [Created] (PARQUET-1232) Document the modular encryption in parquet-format

2018-02-20 Thread Gidon Gershinsky (JIRA)
Gidon Gershinsky created PARQUET-1232: - Summary: Document the modular encryption in parquet-format Key: PARQUET-1232 URL: https://issues.apache.org/jira/browse/PARQUET-1232 Project: Parquet

Save the date: ApacheCon North America, September 24-27 in Montréal

2018-02-20 Thread Rich Bowen
Dear Apache Enthusiast, (You’re receiving this message because you’re subscribed to a user@ or dev@ list of one or more Apache Software Foundation projects.) We’re pleased to announce the upcoming ApacheCon [1] in Montréal, September 24-27. This event is all about you — the Apache project

[jira] [Created] (PARQUET-1231) Not able to load the LocalFileSystem class

2018-02-20 Thread Persistent NGP (JIRA)
Persistent NGP created PARQUET-1231: --- Summary: Not able to load the LocalFileSystem class Key: PARQUET-1231 URL: https://issues.apache.org/jira/browse/PARQUET-1231 Project: Parquet Issue

[jira] [Comment Edited] (PARQUET-860) ParquetWriter.getDataSize NullPointerException after closed

2018-02-20 Thread e.birukov (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370053#comment-16370053 ] e.birukov edited comment on PARQUET-860 at 2/20/18 1:37 PM: I get the same

[jira] [Commented] (PARQUET-860) ParquetWriter.getDataSize NullPointerException after closed

2018-02-20 Thread e.birukov (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16370053#comment-16370053 ] e.birukov commented on PARQUET-860: --- I get the same error This happens when I get the temporary

[jira] [Updated] (PARQUET-1229) parquet-mr code changes for encryption support

2018-02-20 Thread Gidon Gershinsky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gidon Gershinsky updated PARQUET-1229: -- Description: # new package and classes for crypto operations # addition of

[jira] [Created] (PARQUET-1230) CLI tools for encrypted files

2018-02-20 Thread Gidon Gershinsky (JIRA)
Gidon Gershinsky created PARQUET-1230: - Summary: CLI tools for encrypted files Key: PARQUET-1230 URL: https://issues.apache.org/jira/browse/PARQUET-1230 Project: Parquet Issue Type:

[jira] [Created] (PARQUET-1229) parquet-mr code changes for encryption support

2018-02-20 Thread Gidon Gershinsky (JIRA)
Gidon Gershinsky created PARQUET-1229: - Summary: parquet-mr code changes for encryption support Key: PARQUET-1229 URL: https://issues.apache.org/jira/browse/PARQUET-1229 Project: Parquet

[jira] [Created] (PARQUET-1228) parquet-format code changes for encryption support

2018-02-20 Thread Gidon Gershinsky (JIRA)
Gidon Gershinsky created PARQUET-1228: - Summary: parquet-format code changes for encryption support Key: PARQUET-1228 URL: https://issues.apache.org/jira/browse/PARQUET-1228 Project: Parquet

[jira] [Created] (PARQUET-1227) Thrift crypto metadata structures

2018-02-20 Thread Gidon Gershinsky (JIRA)
Gidon Gershinsky created PARQUET-1227: - Summary: Thrift crypto metadata structures Key: PARQUET-1227 URL: https://issues.apache.org/jira/browse/PARQUET-1227 Project: Parquet Issue Type: