Re: Date and time for the next Parquet sync

2018-04-21 Thread Lars Volker
I sent an invite for the proposed time. Please let me know if you would like to be added to the meeting but haven't received an invite. Cheers, Lars On Fri, Apr 20, 2018 at 3:11 PM, Julien Le Dem wrote: > +1 > > On Wed, Apr 18, 2018 at 9:23 AM, Zoltan Ivanfi

[jira] [Commented] (PARQUET-1276) [C++] Reduce the amount of memory used for writing null decimal values

2018-04-21 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16446838#comment-16446838 ] ASF GitHub Bot commented on PARQUET-1276: - cpcloud commented on a change in pull request #459:

[jira] [Assigned] (PARQUET-1128) [Java] Upgrade the Apache Arrow version to 0.8.0 for SchemaConverter

2018-04-21 Thread Uwe L. Korn (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe L. Korn reassigned PARQUET-1128: Assignee: Masayuki Takahashi > [Java] Upgrade the Apache Arrow version to 0.8.0 for

[jira] [Resolved] (PARQUET-1128) [Java] Upgrade the Apache Arrow version to 0.8.0 for SchemaConverter

2018-04-21 Thread Uwe L. Korn (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe L. Korn resolved PARQUET-1128. -- Resolution: Fixed Fix Version/s: 1.11 Issue resolved by pull request 443

[jira] [Commented] (PARQUET-1128) [Java] Upgrade the Apache Arrow version to 0.8.0 for SchemaConverter

2018-04-21 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16446810#comment-16446810 ] ASF GitHub Bot commented on PARQUET-1128: - xhochy closed pull request #443: PARQUET-1128: [Java]

Brotli-codec dependency issue when building from source

2018-04-21 Thread Andy Grove
Hi, I’ve been following the instructions in the README to get parquet-mr building locally but I am running into this dependency issue: [ERROR] Failed to execute goal on project parquet-hadoop: Could not resolve dependencies for project org.apache.parquet:parquet-hadoop:jar:1.10.1-SNAPSHOT:

[jira] [Updated] (PARQUET-400) Error reading some files after PARQUET-77 bytebuffer read path

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-400: - Fix Version/s: 1.8.2 > Error reading some files after PARQUET-77 bytebuffer read path >

[jira] [Updated] (PARQUET-343) Caching nulls on group node to improve write performance on wide schema sparse data

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-343: - Fix Version/s: 1.8.2 > Caching nulls on group node to improve write performance on wide

[jira] [Updated] (PARQUET-358) Add support for temporal logical types to AVRO/Parquet conversion

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-358: - Fix Version/s: 1.8.2 > Add support for temporal logical types to AVRO/Parquet conversion

[jira] [Updated] (PARQUET-363) Cannot construct empty MessageType for ReadContext.requestedSchema

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-363: - Fix Version/s: 1.8.2 > Cannot construct empty MessageType for

[jira] [Updated] (PARQUET-364) Parquet-avro cannot decode Avro/Thrift array of primitive array (e.g. array<array>)

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-364: - Fix Version/s: 1.8.2 > Parquet-avro cannot decode Avro/Thrift array of primitive array

[jira] [Updated] (PARQUET-355) Create Integration tests to validate statistics

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-355: - Fix Version/s: 1.8.2 > Create Integration tests to validate statistics >

[jira] [Updated] (PARQUET-791) Predicate pushing down on missing columns should work on UserDefinedPredicate too

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-791: - Fix Version/s: 1.8.2 > Predicate pushing down on missing columns should work on

[jira] [Updated] (PARQUET-651) Parquet-avro fails to decode array of record with a single field name "element" correctly

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-651: - Fix Version/s: 1.8.2 > Parquet-avro fails to decode array of record with a single field

[jira] [Updated] (PARQUET-430) Change to use Locale parameterized version of String.toUpperCase()/toLowerCase

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-430: - Fix Version/s: 1.8.2 > Change to use Locale parameterized version of

[jira] [Updated] (PARQUET-726) TestMemoryManager consistently fails

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-726: - Fix Version/s: 1.8.2 > TestMemoryManager consistently fails >

[jira] [Updated] (PARQUET-393) release parquet-format 2.3.1

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-393: - Fix Version/s: 1.8.2 > release parquet-format 2.3.1 > > >

[jira] [Updated] (PARQUET-743) DictionaryFilters can re-use StreamBytesInput when compressed

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-743: - Fix Version/s: 1.8.2 > DictionaryFilters can re-use StreamBytesInput when compressed >

[jira] [Updated] (PARQUET-353) Compressors not getting recycled while writing parquet files, causing memory leak

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-353: - Fix Version/s: 1.8.2 > Compressors not getting recycled while writing parquet files,

[jira] [Updated] (PARQUET-241) ParquetInputFormat.getFooters() should return in the same order as what listStatus() returns

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-241: - Fix Version/s: 1.8.2 > ParquetInputFormat.getFooters() should return in the same order

[jira] [Updated] (PARQUET-396) The builder for AvroParquetReader loses the record type

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-396: - Fix Version/s: 1.8.2 > The builder for AvroParquetReader loses the record type >

[jira] [Updated] (PARQUET-495) Fix mismatches in Types class comments

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-495: - Fix Version/s: 1.8.2 > Fix mismatches in Types class comments >

[jira] [Updated] (PARQUET-544) ParquetWriter.close() throws NullPointerException on second call, improper implementation of Closeable contract

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-544: - Fix Version/s: 1.8.2 > ParquetWriter.close() throws NullPointerException on second call,

[jira] [Updated] (PARQUET-801) Allow UserDefinedPredicates in DictionaryFilter

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-801: - Fix Version/s: 1.8.2 > Allow UserDefinedPredicates in DictionaryFilter >

[jira] [Updated] (PARQUET-356) Add ElephantBird section to LICENSE file

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-356: - Fix Version/s: 1.8.2 > Add ElephantBird section to LICENSE file >

[jira] [Updated] (PARQUET-318) Remove unnecessary objectmapper from ParquetMetadata

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-318: - Fix Version/s: 1.8.2 > Remove unnecessary objectmapper from ParquetMetadata >

[jira] [Updated] (PARQUET-431) Make ParquetOutputFormat.memoryManager volatile

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-431: - Fix Version/s: 1.8.2 > Make ParquetOutputFormat.memoryManager volatile >

[jira] [Updated] (PARQUET-585) Slowly ramp up sizes of int[]s in IntList to keep sizes small when data sets are small

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-585: - Fix Version/s: 1.8.2 > Slowly ramp up sizes of int[]s in IntList to keep sizes small

[jira] [Updated] (PARQUET-352) Add tags to "created by" metadata in the file footer

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-352: - Fix Version/s: 1.8.2 > Add tags to "created by" metadata in the file footer >

[jira] [Updated] (PARQUET-389) Filter predicates should work with missing columns

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-389: - Fix Version/s: 1.8.2 > Filter predicates should work with missing columns >

[jira] [Updated] (PARQUET-612) Add compression to FileEncodingIT tests

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-612: - Fix Version/s: 1.8.2 > Add compression to FileEncodingIT tests >

[jira] [Updated] (PARQUET-415) ByteBufferBackedBinary serialization is broken

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-415: - Fix Version/s: 1.8.2 > ByteBufferBackedBinary serialization is broken >

[jira] [Updated] (PARQUET-528) Fix flush() for RecordConsumer and implementations

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-528: - Fix Version/s: 1.8.2 > Fix flush() for RecordConsumer and implementations >

[jira] [Updated] (PARQUET-484) Warn when Decimal is stored as INT64 while could be stored as INT32

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-484: - Fix Version/s: 1.8.2 > Warn when Decimal is stored as INT64 while could be stored as

[jira] [Updated] (PARQUET-674) Add an abstraction to get the length of a stream

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-674: - Fix Version/s: 1.8.2 > Add an abstraction to get the length of a stream >

[jira] [Updated] (PARQUET-548) Add Java metadata for PageEncodingStats

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-548: - Fix Version/s: 1.8.2 > Add Java metadata for PageEncodingStats >

[jira] [Updated] (PARQUET-432) Complete a todo for method ColumnDescriptor.compareTo()

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-432: - Fix Version/s: 1.8.2 > Complete a todo for method ColumnDescriptor.compareTo() >

[jira] [Updated] (PARQUET-560) Incorrect synchronization in SnappyCompressor

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-560: - Fix Version/s: 1.8.2 > Incorrect synchronization in SnappyCompressor >

[jira] [Updated] (PARQUET-384) Add Dictionary Based Filtering to Filter2 API

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-384: - Fix Version/s: 1.8.2 > Add Dictionary Based Filtering to Filter2 API >

[jira] [Updated] (PARQUET-654) Make record-level filtering optional

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-654: - Fix Version/s: 1.8.2 > Make record-level filtering optional >

[jira] [Updated] (PARQUET-685) Deprecated ParquetInputSplit constructor passes parameters in the wrong order.

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-685: - Fix Version/s: 1.8.2 > Deprecated ParquetInputSplit constructor passes parameters in the

[jira] [Updated] (PARQUET-340) totalMemoryPool is truncated to 32 bits

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-340: - Fix Version/s: 1.8.2 > totalMemoryPool is truncated to 32 bits >

[jira] [Updated] (PARQUET-341) Improve write performance with wide schema sparse data

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-341: - Fix Version/s: 1.8.2 > Improve write performance with wide schema sparse data >

[jira] [Updated] (PARQUET-305) Logger instantiated for package org.apache.parquet may be GC-ed

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-305: - Fix Version/s: 1.8.2 > Logger instantiated for package org.apache.parquet may be GC-ed >

[jira] [Updated] (PARQUET-99) Large rows cause unnecessary OOM exceptions

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-99?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-99: Fix Version/s: 1.8.2 > Large rows cause unnecessary OOM exceptions >

[jira] [Updated] (PARQUET-421) Fix mismatch of javadoc names and method parameters in module encoding, column, and hadoop

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-421: - Fix Version/s: 1.8.2 > Fix mismatch of javadoc names and method parameters in module

[jira] [Updated] (PARQUET-373) MemoryManager tests are flaky

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-373: - Fix Version/s: 1.8.2 > MemoryManager tests are flaky > - > >

[jira] [Updated] (PARQUET-645) DictionaryFilter incorrectly handles null

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-645: - Fix Version/s: 1.8.2 > DictionaryFilter incorrectly handles null >

[jira] [Updated] (PARQUET-783) H2SeekableInputStream does not close its underlying FSDataInputStream, leading to connection leaks

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-783: - Fix Version/s: 1.8.2 > H2SeekableInputStream does not close its underlying

[jira] [Updated] (PARQUET-753) GroupType.union() doesn't merge the original type

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-753: - Fix Version/s: 1.8.2 > GroupType.union() doesn't merge the original type >

[jira] [Updated] (PARQUET-686) Allow for Unsigned Statistics in Binary Type

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-686: - Fix Version/s: 1.8.2 > Allow for Unsigned Statistics in Binary Type >

[jira] [Updated] (PARQUET-361) Add prerelease logic to semantic versions

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-361: - Fix Version/s: 1.8.2 > Add prerelease logic to semantic versions >

[jira] [Updated] (PARQUET-349) VersionParser does not handle versions like "parquet-mr 1.6.0rc4"

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-349: - Fix Version/s: 1.8.2 > VersionParser does not handle versions like "parquet-mr 1.6.0rc4"

[jira] [Updated] (PARQUET-581) Min/max row count for page size check are conflated in some places

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-581: - Fix Version/s: 1.8.2 > Min/max row count for page size check are conflated in some

[jira] [Updated] (PARQUET-623) DeltaByteArrayReader has incorrect skip behaviour

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-623: - Fix Version/s: 1.8.2 > DeltaByteArrayReader has incorrect skip behaviour >

[jira] [Updated] (PARQUET-220) Unnecessary warning in ParquetRecordReader.initialize

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-220: - Fix Version/s: 1.8.2 > Unnecessary warning in ParquetRecordReader.initialize >

[jira] [Updated] (PARQUET-422) Fix a potential bug in MessageTypeParser where we ignore and overwrite the initial value of a method parameter

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-422?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-422: - Fix Version/s: 1.8.2 > Fix a potential bug in MessageTypeParser where we ignore and

[jira] [Updated] (PARQUET-511) Integer overflow on counting values in column

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-511: - Fix Version/s: 1.8.2 > Integer overflow on counting values in column >

[jira] [Updated] (PARQUET-571) Fix potential leak in ParquetFileReader.close()

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-571: - Fix Version/s: 1.8.2 > Fix potential leak in ParquetFileReader.close() >

[jira] [Updated] (PARQUET-423) Make writing Avro to Parquet less noisy

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-423: - Fix Version/s: 1.8.2 > Make writing Avro to Parquet less noisy >

[jira] [Updated] (PARQUET-378) Add thoroughly parquet test encodings

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-378?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-378: - Fix Version/s: 1.8.2 > Add thoroughly parquet test encodings >

[jira] [Updated] (PARQUET-387) TwoLevelListWriter does not handle null values in array

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-387: - Fix Version/s: 1.8.2 > TwoLevelListWriter does not handle null values in array >

[jira] [Updated] (PARQUET-751) DictionaryFilter patch broke column projection

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-751: - Fix Version/s: 1.8.2 > DictionaryFilter patch broke column projection >

[jira] [Updated] (PARQUET-660) Writing Protobuf messages with extensions results in an error or data corruption.

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-660: - Fix Version/s: 1.8.2 > Writing Protobuf messages with extensions results in an error or

[jira] [Updated] (PARQUET-380) Cascading and scrooge builds fail when using thrift 0.9.0

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-380: - Fix Version/s: 1.8.2 > Cascading and scrooge builds fail when using thrift 0.9.0 >

[jira] [Updated] (PARQUET-382) Add a way to append encoded blocks in ParquetFileWriter

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-382: - Fix Version/s: 1.8.2 > Add a way to append encoded blocks in ParquetFileWriter >

[jira] [Updated] (PARQUET-669) Allow reading file footers from input streams when writing metadata files

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-669: - Fix Version/s: 1.8.2 > Allow reading file footers from input streams when writing

[jira] [Updated] (PARQUET-342) Can't build Parquet on Java 6

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-342: - Fix Version/s: 1.8.2 > Can't build Parquet on Java 6 > - > >

[jira] [Updated] (PARQUET-529) Avoid evoking job.toString() in ParquetLoader

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-529: - Fix Version/s: 1.8.2 > Avoid evoking job.toString() in ParquetLoader >

[jira] [Updated] (PARQUET-413) Test failures for Java 8

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-413: - Fix Version/s: 1.8.2 > Test failures for Java 8 > > >

[jira] [Updated] (PARQUET-335) Avro object model should not require MAP_KEY_VALUE

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-335: - Fix Version/s: 1.8.2 > Avro object model should not require MAP_KEY_VALUE >

[jira] [Updated] (PARQUET-642) Improve performance of ByteBuffer based read / write paths

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-642: - Fix Version/s: 1.8.2 > Improve performance of ByteBuffer based read / write paths >

[jira] [Updated] (PARQUET-348) shouldIgnoreStatistics too noisy

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-348: - Fix Version/s: 1.8.2 > shouldIgnoreStatistics too noisy >

[jira] [Updated] (PARQUET-580) Potentially unnecessary creation of large int[] in IntList for columns that aren't used

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-580: - Fix Version/s: 1.8.2 > Potentially unnecessary creation of large int[] in IntList for

[jira] [Updated] (PARQUET-372) Parquet stats can have awkwardly large values

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-372: - Fix Version/s: 1.8.2 > Parquet stats can have awkwardly large values >

Re: Releasing parquet-mr 1.8.3

2018-04-21 Thread Gabor Szadovszky
Just realized that the issue about closing files (PARQUET-783 ) is already on the 1.8 branch. It was released in 1.8.2. Unfortunately, none of the JIRAs released in 1.8.2 is set in the Fix Versions. 1.8.2 is also completely missing in the

Re: Releasing parquet-mr 1.8.3

2018-04-21 Thread Gabor Szadovszky
Thanks a lot, Ryan. I’ve already made a change on the 1.8 branch to use jdk7 in Travis. We’ll also do the release build with java7. The commit for PARQUET-852 will be reverted on the 1.8 branch. I’ve looked though the bug fixes of 1.10.0 and found the one you’ve mentioned. Linked all related

[jira] [Commented] (PARQUET-1253) Support for new logical type representation

2018-04-21 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16446754#comment-16446754 ] ASF GitHub Bot commented on PARQUET-1253: - gszadovszky commented on a change in pull request