Re: [VOTE] Add BYTE_STREAM_SPLIT encoding to Apache Parquet

2019-11-05 Thread Wes McKinney
+1 from me on adding the FP encoding
On Sat, Nov 2, 2019 at 4:51 AM Radev, Martin wrote:
> Hello all,
>
> thanks for the vote Ryan and to Wes for the feedback.
>
> The concern with regards to adding more complex features in the Parquet spec is valid.
>
> However, the proposed encoding

Reading past RLE/BitPacking stream

2019-11-05 Thread Jan Morlock
Hi, we have a feed-based distributed system and we are facing the problem that sometimes one special feed produces a parquet file where further processing fails with the following error message:
19/10/30 16:11:22 INFO zlib.ZlibFactory: Successfully loaded & initialized native-zlib library

[jira] [Commented] (PARQUET-112) RunLengthBitPackingHybridDecoder: Reading past RLE/BitPacking stream.

2019-11-05 Thread Jan Morlock (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16967858#comment-16967858 ] Jan Morlock commented on PARQUET-112: any news here? We are _sometimes_ facing the same problem

Re: release process - using rc tags

2019-11-05 Thread Gabor Szadovszky
Thanks everyone for the support. I've created the jira PARQUET-1687 to track this and also 3 PRs for the site, mr and format

[jira] [Commented] (PARQUET-1679) Invalid SchemaException for UUID while using AvroParquetWriter

2019-11-05 Thread Felix Kizhakkel Jose (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16967620#comment-16967620 ] Felix Kizhakkel Jose commented on PARQUET-1679: Hi [~q.xu], Thank you so much. Do you

[jira] [Commented] (PARQUET-1687) Update release process

2019-11-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16967630#comment-16967630 ] ASF GitHub Bot commented on PARQUET-1687: gszadovszky commented on pull request #697:

[jira] [Updated] (PARQUET-1687) Update release process

2019-11-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated PARQUET-1687: Labels: pull-request-available (was: ) > Update release process >

[jira] [Commented] (PARQUET-1687) Update release process

2019-11-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16967639#comment-16967639 ] ASF GitHub Bot commented on PARQUET-1687: gszadovszky commented on pull request #155:

[GitHub] [parquet-site] gszadovszky opened a new pull request #2: PARQUET-1687: Update release process

2019-11-05 Thread GitBox
gszadovszky opened a new pull request #2: PARQUET-1687: Update release process
URL: https://github.com/apache/parquet-site/pull/2
Update the link of the keys file to the official one. The official link is required by the Apache release process. Update the usage of the prepare script

[jira] [Updated] (PARQUET-1687) Update release process

2019-11-05 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1687: Description: Our current tagging policy in the release process requires to use the