Test failures from flaky test

2015-10-19 Thread Ryan Blue
There's a flaky test in TestMemoryManager that is causing test failures 
for all of the new PRs. I fixed the problem, but it's still waiting for 
a +1 from a committer. (Thanks to Sergio for reviewing!)


Could someone take a quick look so we can get CI back on track? Thanks!

rb

--
Ryan Blue


[jira] [Commented] (PARQUET-349) VersionParser does not handle versions like "parquet-mr 1.6.0rc4"

2015-10-19 Thread Reuben Kuhnert (JIRA)

[ 
https://issues.apache.org/jira/browse/PARQUET-349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14963892#comment-14963892
 ] 

Reuben Kuhnert commented on PARQUET-349:


Added patch (PR-283).

> VersionParser does not handle versions like "parquet-mr 1.6.0rc4"
> -
>
> Key: PARQUET-349
> URL: https://issues.apache.org/jira/browse/PARQUET-349
> Project: Parquet
>  Issue Type: Bug
>  Components: parquet-mr
>Reporter: Alex Levenson
>Assignee: Reuben Kuhnert
>Priority: Minor
>
> I'm not sure how we missed this one (I thought I added a test for this but 
> apparently not :p) but the tests don't cover it, and the parser can't parse 
> it even though the intention was to support a missing (build abcd) section in 
> the version string.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


Error Compiling Parquet 1.8.2-SNAPSHOT

2015-10-19 Thread Chris Mathews

Can anyone please help with the following ?
Not sure if I have done something wrong here.

downloaded zip from https://github.com/apache/parquet-mr/archive/master.zip

mvn -U clean install

[INFO] Scanning for projects...
[WARNING] 
[WARNING] Some problems were encountered while building the effective model for 
org.apache.parquet:parquet-scala_2.10:jar:1.8.2-SNAPSHOT
[WARNING] 'artifactId' contains an expression but should be a constant. @ 
org.apache.parquet:parquet-scala_${scala.binary.version}:[unknown-version], 
/Users/cmathews/PROJECTS/HADOOP/parquet-mr-master/parquet-scala/pom.xml, line 
29, column 15
[WARNING] 'build.plugins.plugin.version' for org.scala-tools:maven-scala-plugin 
is missing. @ 
org.apache.parquet:parquet-scala_${scala.binary.version}:[unknown-version], 
/Users/cmathews/PROJECTS/HADOOP/parquet-mr-master/parquet-scala/pom.xml, line 
71, column 15
[WARNING] 
[WARNING] Some problems were encountered while building the effective model for 
org.apache.parquet:parquet-scrooge_2.10:jar:1.8.2-SNAPSHOT
[WARNING] 'artifactId' contains an expression but should be a constant. @ 
org.apache.parquet:parquet-scrooge_${scala.binary.version}:[unknown-version], 
/Users/cmathews/PROJECTS/HADOOP/parquet-mr-master/parquet-scrooge/pom.xml, line 
29, column 15
[WARNING] 
[WARNING] It is highly recommended to fix these problems because they threaten 
the stability of your build.
[WARNING] 
[WARNING] For this reason, future Maven versions might no longer support 
building such malformed projects.
[WARNING] 
[INFO] 
[INFO] Reactor Build Order:
[INFO] 
[INFO] Apache Parquet MR
[INFO] Apache Parquet Generator
[INFO] Apache Parquet Common
[INFO] Apache Parquet Encodings
[INFO] Apache Parquet Column
[INFO] Apache Parquet Jackson
[INFO] Apache Parquet Hadoop
[INFO] Apache Parquet Avro
[INFO] Apache Parquet Benchmarks
[INFO] Apache Parquet Pig
[INFO] Apache Parquet Thrift
[INFO] Apache Parquet Cascading
[INFO] Apache Parquet Pig Bundle
[INFO] Apache Parquet Protobuf
[INFO] Apache Parquet Scala
[INFO] Apache Parquet Scrooge
[INFO] Apache Parquet Hadoop Bundle
[INFO] Apache Parquet Hive
[INFO] Apache Parquet Hive Binding Parent
[INFO] Apache Parquet Hive Binding Interface
[INFO] Apache Parquet Hive 0.10 Binding
[INFO] Apache Parquet Hive 0.12 Binding
[INFO] Apache Parquet Hive Binding Factory
[INFO] Apache Parquet Hive Binding Bundle
[INFO] Apache Parquet Hive Storage Handler
[INFO] Apache Parquet Hive Bundle
[INFO] Apache Parquet Tools
[INFO] 
[INFO] 
[INFO] Building Apache Parquet MR 1.8.2-SNAPSHOT
[INFO] 
[INFO] 
[INFO] --- maven-clean-plugin:2.5:clean (default-clean) @ parquet ---
[INFO] Deleting /Users/cmathews/PROJECTS/HADOOP/parquet-mr-master/target
[INFO] 
[INFO] --- buildnumber-maven-plugin:1.1:create (default) @ parquet ---
[INFO] Checking for local modifications: skipped.
[INFO] Updating project files from SCM: skipped.
[INFO] Executing: /bin/sh -c cd 
/Users/cmathews/PROJECTS/HADOOP/parquet-mr-master && git rev-parse --verify HEAD
[INFO] Working directory: /Users/cmathews/PROJECTS/HADOOP/parquet-mr-master
[INFO] Storing buildNumber: null at timestamp: 1445276335909
[INFO] Executing: /bin/sh -c cd 
/Users/cmathews/PROJECTS/HADOOP/parquet-mr-master && git rev-parse --verify HEAD
[INFO] Working directory: /Users/cmathews/PROJECTS/HADOOP/parquet-mr-master
[INFO] Storing buildScmBranch: UNKNOWN_BRANCH
[INFO] 
[INFO] --- maven-remote-resources-plugin:1.5:process (default) @ parquet ---
[INFO] 
[INFO] --- apache-rat-plugin:0.11:check (default) @ parquet ---
[INFO] 70 implicit excludes (use -debug for more details).
[INFO] Exclude: **/*.parquet
[INFO] Exclude: **/*.avro
[INFO] Exclude: **/*.json
[INFO] Exclude: **/names.txt
[INFO] Exclude: **/*.avsc
[INFO] Exclude: **/*.iml
[INFO] Exclude: **/*.log
[INFO] Exclude: **/*.md.vm
[INFO] Exclude: **/.classpath
[INFO] Exclude: **/.project
[INFO] Exclude: **/.settings/**
[INFO] Exclude: **/build/**
[INFO] Exclude: **/target/**
[INFO] Exclude: .git/**
[INFO] Exclude: .gitignore
[INFO] Exclude: .idea/**
[INFO] Exclude: */jdiff/*.xml
[INFO] Exclude: .travis.yml
[INFO] Exclude: licenses/**
[INFO] Exclude: protobuf_install/**
[INFO] Exclude: thrift-0.7.0/**
[INFO] Exclude: thrift-0.7.0.tar.gz
[INFO] Exclude: **/dependency-reduced-pom.xml
[INFO] 17 resources included (use -debug for more details)
Warning:  org.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser: Property 
'http://www.oracle.com/xml/jaxp/properties/entityExpansionLimit' is not 
recognized.
Compiler warnings:
  WARNING:  'org.apache.xerces.jaxp.SAXParserImpl: Property 
'http://javax.xml.XMLConstants/property/accessExternalDTD' is not recognized.'
Warning:  org.apache.xerces.parsers.SAXParser: Feature 

[jira] [Commented] (PARQUET-124) parquet.hadoop.ParquetOutputCommitter.commitJob() throws parquet.io.ParquetEncodingException

2015-10-19 Thread Alex Nastetsky (JIRA)

[ 
https://issues.apache.org/jira/browse/PARQUET-124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14964192#comment-14964192
 ] 

Alex Nastetsky commented on PARQUET-124:


Still seeing this issue, at least when running locally. I think it needs to do 
something like FileSystem.get(new URI("file:///"), new 
Configuration()).resolvePath(X) on both "root" and "new Path(footerPath)", 
except using the correct file system.

> parquet.hadoop.ParquetOutputCommitter.commitJob() throws 
> parquet.io.ParquetEncodingException
> 
>
> Key: PARQUET-124
> URL: https://issues.apache.org/jira/browse/PARQUET-124
> Project: Parquet
>  Issue Type: Bug
>  Components: parquet-mr
>Affects Versions: 1.6.0
>Reporter: Chris Albright
>Priority: Minor
> Fix For: 1.6.0
>
> Attachments: PARQUET-124-test
>
>
> I'm running an example combining Avro, Spark and Parquet 
> (https://github.com/massie/spark-parquet-example), and in the process of 
> updating the library versions, am getting the warning below.
> The version of Parquet-Hadoop in the original example is 1.0.0. I am using 
> 1.6.0rc3
> The ParquetFileWriter.mergeFooters(Path, List) method is performing a 
> check to ensure the footers are all for files in the output directory. The 
> output directory is supplied by ParquetFileWriter.writeMetadataFile; in 
> 1.0.0, the output path was converted to a fully qualified output path before 
> the call to mergeFooters, but in 1.6.0rc[2,3] that conversion happens after 
> the call to mergeFooters. Because of this, the check within merge footers is 
> failing (the URI for the footers starts with file:, but not the URI for the 
> root path does not)
> Here is the warning message and stacktrace.
> Oct 30, 2014 9:11:31 PM WARNING: parquet.hadoop.ParquetOutputCommitter: could 
> not write summary file for /tmp/1414728690018-0/output
> parquet.io.ParquetEncodingException: 
> file:/tmp/1414728690018-0/output/part-r-0.parquet invalid: all the files 
> must be contained in the root /tmp/1414728690018-0/output
>   at 
> parquet.hadoop.ParquetFileWriter.mergeFooters(ParquetFileWriter.java:422)
>   at 
> parquet.hadoop.ParquetFileWriter.writeMetadataFile(ParquetFileWriter.java:398)
>   at 
> parquet.hadoop.ParquetOutputCommitter.commitJob(ParquetOutputCommitter.java:50)
>   at 
> org.apache.spark.rdd.PairRDDFunctions.saveAsNewAPIHadoopDataset(PairRDDFunctions.scala:936)
>   at 
> org.apache.spark.rdd.PairRDDFunctions.saveAsNewAPIHadoopFile(PairRDDFunctions.scala:832)
>   at 
> com.zenfractal.SparkParquetExample$.main(SparkParquetExample.scala:72)
>   at com.zenfractal.SparkParquetExample.main(SparkParquetExample.scala)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:606)
>   at com.intellij.rt.execution.application.AppMain.main(AppMain.java:134)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (PARQUET-373) MemoryManager tests are flaky

2015-10-19 Thread Ryan Blue (JIRA)

 [ 
https://issues.apache.org/jira/browse/PARQUET-373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ryan Blue resolved PARQUET-373.
---
Resolution: Fixed

> MemoryManager tests are flaky
> -
>
> Key: PARQUET-373
> URL: https://issues.apache.org/jira/browse/PARQUET-373
> Project: Parquet
>  Issue Type: Bug
>  Components: parquet-mr
>Affects Versions: 1.8.0
>Reporter: Ryan Blue
>Assignee: Ryan Blue
> Fix For: 1.9.0
>
>
> The memory manager tests are flaky, depending on the heap allocation for the 
> JVM they run in. This is caused by over-specific tests that assert the memory 
> allocation down to the byte and the fact that some assertions implicitly cast 
> long values to doubles to use the "within" form of assertEquals.
> The tests should not validate a specific allocation strategy, but should 
> instead assert that:
> 1. The allocation for a file is the row group size until room runs out
> 2. When scaling row groups, the total allocation does not exceed the pool size



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Assigned] (PARQUET-349) VersionParser does not handle versions like "parquet-mr 1.6.0rc4"

2015-10-19 Thread Reuben Kuhnert (JIRA)

 [ 
https://issues.apache.org/jira/browse/PARQUET-349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Reuben Kuhnert reassigned PARQUET-349:
--

Assignee: Reuben Kuhnert

> VersionParser does not handle versions like "parquet-mr 1.6.0rc4"
> -
>
> Key: PARQUET-349
> URL: https://issues.apache.org/jira/browse/PARQUET-349
> Project: Parquet
>  Issue Type: Bug
>  Components: parquet-mr
>Reporter: Alex Levenson
>Assignee: Reuben Kuhnert
>Priority: Minor
>
> I'm not sure how we missed this one (I thought I added a test for this but 
> apparently not :p) but the tests don't cover it, and the parser can't parse 
> it even though the intention was to support a missing (build abcd) section in 
> the version string.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)