[jira] [Updated] (PARQUET-1681) Avro's isElementType() change breaks the reading of some parquet(1.8.1) files

2019-10-18 Thread Xinli Shang (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinli Shang updated PARQUET-1681: - Description: When using the Avro schema below to write a parquet(1.8.1) file and then read

[jira] [Comment Edited] (PARQUET-1679) Invalid SchemaException for UUID while using AvroParquetWriter

2019-10-18 Thread Felix Kizhakkel Jose (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16954944#comment-16954944 ] Felix Kizhakkel Jose edited comment on PARQUET-1679 at 10/18/19 8:08 PM:

[jira] [Commented] (PARQUET-1679) Invalid SchemaException for UUID while using AvroParquetWriter

2019-10-18 Thread Felix Kizhakkel Jose (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16954944#comment-16954944 ] Felix Kizhakkel Jose commented on PARQUET-1679: --- Hi [~q.xu],  Thank you for the quick

[jira] [Commented] (PARQUET-1679) Invalid SchemaException for UUID while using AvroParquetWriter

2019-10-18 Thread Qinghui Xu (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16954937#comment-16954937 ] Qinghui Xu commented on PARQUET-1679: - It seems that your schema builder (`ReflectData`) takes

[jira] [Commented] (PARQUET-1679) Invalid SchemaException for UUID while using AvroParquetWriter

2019-10-18 Thread Felix Kizhakkel Jose (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16954922#comment-16954922 ] Felix Kizhakkel Jose commented on PARQUET-1679: ---

[jira] [Created] (PARQUET-1681) Avro's isElementType() change breaks the reading of some parquet(1.8.1) files

2019-10-18 Thread Xinli Shang (Jira)
Xinli Shang created PARQUET-1681: Summary: Avro's isElementType() change breaks the reading of some parquet(1.8.1) files Key: PARQUET-1681 URL: https://issues.apache.org/jira/browse/PARQUET-1681

[jira] [Commented] (PARQUET-1679) Invalid SchemaException for UUID while using AvroParquetWriter

2019-10-18 Thread Qinghui Xu (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16954915#comment-16954915 ] Qinghui Xu commented on PARQUET-1679: - Do you have a stacktrace or something? > Invalid

Help on Parquet Write Slowness and UUID support

2019-10-18 Thread Kizhakkel Jose, Felix
Hello, I am from Philips Architecture team, where I am working on a POC to compare different data models [ Parquet/Avro/Json]. But I see Parquet is very slow while writing [pojo to Parquet file]. I have created two issues in Parquet project. One is regarding the slowness of ParquetWritter

Re: Updating parquet web site

2019-10-18 Thread Driesprong, Fokko
Great work! Op vr 18 okt. 2019 om 17:53 schreef Ryan Blue > Sounds good to me! Thanks for taking care of this. > > On Fri, Oct 18, 2019 at 1:44 AM Gabor Szadovszky wrote: > > > Hi Uwe, > > > > parquet-site sounds good to me. > > > > Cheers, > > Gabor > > > > On Fri, Oct 18, 2019 at 10:19 AM

[jira] [Updated] (PARQUET-1678) [C++] Provide classes for reading/writing using input/output operators

2019-10-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1678?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated PARQUET-1678: Labels: pull-request-available (was: ) > [C++] Provide classes for reading/writing

Re: Updating parquet web site

2019-10-18 Thread Ryan Blue
Sounds good to me! Thanks for taking care of this. On Fri, Oct 18, 2019 at 1:44 AM Gabor Szadovszky wrote: > Hi Uwe, > > parquet-site sounds good to me. > > Cheers, > Gabor > > On Fri, Oct 18, 2019 at 10:19 AM Uwe L. Korn wrote: > > > Hello Gabor, > > > > can we call this for clarity

[jira] [Commented] (PARQUET-1679) Invalid SchemaException for UUID while using AvroParquetWriter

2019-10-18 Thread Felix Kizhakkel Jose (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16954634#comment-16954634 ] Felix Kizhakkel Jose commented on PARQUET-1679: --- Could someone please help me on this? I

[jira] [Commented] (PARQUET-1680) Parquet Java Serialization is very slow

2019-10-18 Thread Felix Kizhakkel Jose (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16954631#comment-16954631 ] Felix Kizhakkel Jose commented on PARQUET-1680: --- Could someone please help me on this? >

Re: Working on 1.11.0 RC7

2019-10-18 Thread Driesprong, Fokko
Perfect, thanks Gabor. Cheers, Fokko Op vr 18 okt. 2019 om 14:24 schreef Gabor Szadovszky : > Hi Fokko, > > There is no separate branch. Based on the discussion on the yesterday > parquet sync 1.11.0 is planned to be released from master. > > Cheers, > Gabor > > Driesprong, Fokko ezt írta

[jira] [Commented] (PARQUET-1496) [Java] Update Scala for JDK 11 compatibility

2019-10-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16954596#comment-16954596 ] ASF GitHub Bot commented on PARQUET-1496: - xhochy commented on pull request #605: PARQUET-1496:

[jira] [Commented] (PARQUET-1496) [Java] Update Scala for JDK 11 compatibility

2019-10-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16954565#comment-16954565 ] ASF GitHub Bot commented on PARQUET-1496: - Fokko commented on pull request #693: PARQUET-1496:

Re: Working on 1.11.0 RC7

2019-10-18 Thread Gabor Szadovszky
Hi Fokko, There is no separate branch. Based on the discussion on the yesterday parquet sync 1.11.0 is planned to be released from master. Cheers, Gabor Driesprong, Fokko ezt írta (időpont: 2019. okt. 18., P 14:09): > Thanks for doing the release Gabor, > > Is there a branch for 1.11.0?

Re: Working on 1.11.0 RC7

2019-10-18 Thread Driesprong, Fokko
Thanks for doing the release Gabor, Is there a branch for 1.11.0? Please let me know. Cheers, Fokko Op vr 18 okt. 2019 om 09:55 schreef Gabor Szadovszky : > Dear All, > > In the next couple of weeks I'll be working on the next release candidate > of 1.11.0. If you have any ongoing issues that

Re: custom CompressionCodec support

2019-10-18 Thread Driesprong, Fokko
Hi Falak, I was able to set the compression level in Spark using spark.io.compression.zstd.level. Cheers, Fokko Op do 17 okt. 2019 om 20:53 schreef Radev, Martin : > Hi Falak, > > > I was one of the people who recently exposed this to Arrow but this is not > part of the Parquet specification.

Re: Updating parquet web site

2019-10-18 Thread Gabor Szadovszky
Hi Uwe, parquet-site sounds good to me. Cheers, Gabor On Fri, Oct 18, 2019 at 10:19 AM Uwe L. Korn wrote: > Hello Gabor, > > can we call this for clarity https://github.com/apache/parquet-site ? > > Thanks > Uwe > > On Fri, Oct 18, 2019, at 9:46 AM, Gabor Szadovszky wrote: > > Dear All, > >

Re: Updating parquet web site

2019-10-18 Thread Uwe L. Korn
Hello Gabor, can we call this for clarity https://github.com/apache/parquet-site ? Thanks Uwe On Fri, Oct 18, 2019, at 9:46 AM, Gabor Szadovszky wrote: > Dear All, > > There are some stuff on our web site that is ready for update (since a > while). To spin up the process it would be great if

Working on 1.11.0 RC7

2019-10-18 Thread Gabor Szadovszky
Dear All, In the next couple of weeks I'll be working on the next release candidate of 1.11.0. If you have any ongoing issues that you think will be nice to have in 1.11.0, please set "Fix Version/s" accordingly. (If it is not really targeted to 1.11.0, please, remove the related tag.) If you

[jira] [Resolved] (PARQUET-1570) Publish 1.11.0 to maven central

2019-10-18 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1570. --- Resolution: Duplicate Publishing artifacts to the maven repo is part of the

Updating parquet web site

2019-10-18 Thread Gabor Szadovszky
Dear All, There are some stuff on our web site that is ready for update (since a while). To spin up the process it would be great if we could follow the same git PR process we already have for our existing git repos. Jim has already created PARQUET-1675

[jira] [Assigned] (PARQUET-1675) Switch to git for website

2019-10-18 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1675: - Assignee: Gabor Szadovszky > Switch to git for website >

[jira] [Resolved] (PARQUET-1650) Implement unit test to validate column/offset indexes

2019-10-18 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1650. --- Resolution: Fixed > Implement unit test to validate column/offset indexes >

[jira] [Commented] (PARQUET-1650) Implement unit test to validate column/offset indexes

2019-10-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16954311#comment-16954311 ] ASF GitHub Bot commented on PARQUET-1650: - gszadovszky commented on pull request #675: