Please review Parquet-1396 - Crypto Interface for Schema Activation of Parquet Encryption

2019-01-17 Thread Xinli shang
Dear all, As Parquet-1178 passed the vote, I would like to bring Parquet-1396 (Crypto Interface for Schema Activation of Parquet Encryption

[jira] [Updated] (PARQUET-1396) Cryptodata Interface for Schema Activation of Parquet Encryption

2019-01-17 Thread Xinli Shang (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinli Shang updated PARQUET-1396: - Description: This JIRA is an extension to Parquet Modular Encryption Jira(PARQUET-1178) that

[jira] [Updated] (PARQUET-1396) Cryptodata Interface for Schema Activation of Parquet Encryption

2019-01-17 Thread Xinli Shang (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinli Shang updated PARQUET-1396: - Description: This JIRA is an extension to Parquet Modular Encryption Jira(PARQUET-1178) that

[jira] [Updated] (PARQUET-1396) Cryptodata Interface for Schema Activation of Parquet Encryption

2019-01-17 Thread Xinli Shang (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinli Shang updated PARQUET-1396: - Summary: Cryptodata Interface for Schema Activation of Parquet Encryption (was: Cryptodata

Re: [VOTE] Release Apache Parquet 1.11.0 RC3

2019-01-17 Thread Zoltan Ivanfi
Hi, Friendly reminder to please vote for the release. We need 2 more binding +1 votes. Thanks, Zoltan On Sat, Jan 12, 2019 at 3:07 AM 俊杰陈 wrote: > +1 (non-binding) > * contents looks good > * unit tests passed > > > Zoltan Ivanfi 于2019年1月11日周五 下午9:31写道: > > > +1 (binding) > > > > *

Adding more timestamp types to on-disk storage formats

2019-01-17 Thread Zoltan Ivanfi
Hi, There is an ongoing effort amongst the SQL engines of the Hadoop stack to support different timestamp semantics. This development has some implications for the low-level timestamp types as well. The new timestamp types added to the different SQL engines will rely on the decisions of the lower

Re: [Discussion] How to build bloom filter in parquet

2019-01-17 Thread Zoltan Ivanfi
Hi, I like the idea of specifying the maximum acceptable size of the bloom filter bit vector. I think it would be much better than specifying the expected number of distinct values (which we can not expect from the API consumer in my opinion). The desired false positives probability could still

[jira] [Commented] (PARQUET-1328) [java]Bloom filter read/write implementation

2019-01-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16745159#comment-16745159 ] ASF GitHub Bot commented on PARQUET-1328: - cjjnjust commented on pull request #587:

[jira] [Commented] (PARQUET-1328) [java]Bloom filter read/write implementation

2019-01-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16745158#comment-16745158 ] ASF GitHub Bot commented on PARQUET-1328: - cjjnjust commented on pull request #587:

Re: [Discussion] How to build bloom filter in parquet

2019-01-17 Thread Gabor Szadovszky
Thanks for raising this, Junjie. One more topic worth to add: Which columns do we want to write bloom filters for? May it depend on the type? Is bloom filter required if we have dictionary? Is bloom filter required if the column is ordered and we have column indexes? (etc.) On Thu, Jan 17,

[Discussion] How to build bloom filter in parquet

2019-01-17 Thread 俊杰陈
Hi Parquet Developers In the bloom filter design doc we have discussed and determined bloom filter definition, now I'd like to invite you to discuss how to build a bloom filter in parquet. In my current implementation, a bloom filter is created first according to specified number of distinct

RE: Date and time for next parquet sync

2019-01-17 Thread Santlal J Gupta
Hi team, This email id is not available from this Friday onwards. Please add my another mail id(santlal561...@gmail.com) in parquet sync meeting. From my mail id(santlal561...@gmail.com), I already sent a meeting request. Thanks Santlal J Gupta -Original Message- From: Santlal J Gupta