[jira] [Commented] (PARQUET-268) Build is failing with parquet-scrooge errors.

2015-04-29 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-268?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14520532#comment-14520532 ] Ryan Blue commented on PARQUET-268: --- I'm going to do the downgrade and ignore the

[jira] [Resolved] (PARQUET-268) Build is failing with parquet-scrooge errors.

2015-04-29 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-268. --- Resolution: Fixed Assignee: Ryan Blue Build is failing with parquet-scrooge errors.

[jira] [Assigned] (PARQUET-270) Add legend to parquet-tools readme.md

2015-04-29 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue reassigned PARQUET-270: - Assignee: Ryan Blue Add legend to parquet-tools readme.md

[jira] [Resolved] (PARQUET-280) Please create a DOAP file for your TLP

2015-05-14 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-280. --- Resolution: Fixed Assignee: Julien Le Dem Thanks, Julien! Please create a DOAP file for your

[jira] [Resolved] (PARQUET-253) AvroSchemaConverter has confusing Javadoc

2015-05-15 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-253. --- Resolution: Fixed Merged #173. Thanks! AvroSchemaConverter has confusing Javadoc

[jira] [Commented] (PARQUET-98) filter2 API performance regression

2015-05-19 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-98?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14551393#comment-14551393 ] Ryan Blue commented on PARQUET-98: -- [~phraktle], to save you some time, the 1.7.0 release

[jira] [Commented] (PARQUET-222) parquet writer runs into OOM during writing when calling DataFrame.saveAsParquetFile in Spark SQL

2015-06-05 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14574894#comment-14574894 ] Ryan Blue commented on PARQUET-222: --- [~phatak.dev]: the problem is probably the number

[jira] [Resolved] (PARQUET-314) Fix broken equals implementation(s)

2015-06-22 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-314. --- Resolution: Fixed Fix Version/s: 1.8.0 Merged. Thanks for catching this and fixing it,

[jira] [Commented] (PARQUET-41) Add bloom filters to parquet statistics

2015-06-23 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-41?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14597906#comment-14597906 ] Ryan Blue commented on PARQUET-41: -- Interesting, I hadn't heard about the counting bloom

[jira] [Resolved] (PARQUET-306) Improve alignment between row groups and HDFS blocks

2015-06-22 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-306. --- Resolution: Fixed Fix Version/s: 1.8.0 Merged #211. Thanks for reviewing, Alex! Improve

[jira] [Resolved] (PARQUET-317) writeMetaDataFile crashes when a relative root Path is used

2015-06-25 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-317. --- Resolution: Fixed Fix Version/s: 1.8.0 Merged #228. Thanks for fixing this, Steven!

[jira] [Updated] (PARQUET-317) writeMetaDataFile crashes when a relative root Path is used

2015-06-25 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-317: -- Assignee: Steven She writeMetaDataFile crashes when a relative root Path is used

[jira] [Resolved] (PARQUET-248) Simplify ParquetWriters's constructors

2015-06-25 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-248. --- Resolution: Fixed Fix Version/s: 1.8.0 Added a builder class that can be extended by object

[jira] [Commented] (PARQUET-41) Add bloom filters to parquet statistics

2015-06-26 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-41?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14602532#comment-14602532 ] Ryan Blue commented on PARQUET-41: -- Thanks for working on this, [~Ferd], it's great to be

[jira] [Commented] (PARQUET-41) Add bloom filters to parquet statistics

2015-06-24 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-41?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14600367#comment-14600367 ] Ryan Blue commented on PARQUET-41: -- I don't think the counting bloom filter idea is worth

[jira] [Commented] (PARQUET-152) Encoding issue with fixed length byte arrays

2015-06-18 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14592474#comment-14592474 ] Ryan Blue commented on PARQUET-152: --- I think the RLE_DICTIONARY behavior is probably

[jira] [Comment Edited] (PARQUET-41) Add bloom filters to parquet statistics

2015-06-26 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-41?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14602532#comment-14602532 ] Ryan Blue edited comment on PARQUET-41 at 6/26/15 7:53 PM: ---

[jira] [Resolved] (PARQUET-293) ScalaReflectionException when trying to convert an RDD of Scrooge to a DataFrame

2015-06-10 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-293. --- Resolution: Duplicate Closing as a duplicate. Please follow SPARK-8288 instead.

[jira] [Commented] (PARQUET-293) ScalaReflectionException when trying to convert an RDD of Scrooge to a DataFrame

2015-06-10 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14580868#comment-14580868 ] Ryan Blue commented on PARQUET-293: --- Linking to the issue that replaces this.

[jira] [Commented] (PARQUET-222) parquet writer runs into OOM during writing when calling DataFrame.saveAsParquetFile in Spark SQL

2015-06-10 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14580861#comment-14580861 ] Ryan Blue commented on PARQUET-222: --- Okay, so it sounds like you're talking about

[jira] [Resolved] (PARQUET-178) META-INF for slf4j should not be in parquet-format jar

2015-06-16 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-178. --- Resolution: Fixed Assignee: Ryan Blue Merged. Thanks for letting us know about this [~koert]!

[jira] [Created] (PARQUET-308) Add accessor to ParquetWriter to get current data size

2015-06-16 Thread Ryan Blue (JIRA)
Ryan Blue created PARQUET-308: - Summary: Add accessor to ParquetWriter to get current data size Key: PARQUET-308 URL: https://issues.apache.org/jira/browse/PARQUET-308 Project: Parquet Issue

[jira] [Commented] (PARQUET-246) ArrayIndexOutOfBoundsException with Parquet write version v2

2015-06-17 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14590353#comment-14590353 ] Ryan Blue commented on PARQUET-246: --- [~michael] can you answer my questions about this?

[jira] [Updated] (PARQUET-309) Remove unnecessary compile dependency on parquet-generator

2015-06-17 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-309: -- Assignee: Konstantin Shaposhnikov Remove unnecessary compile dependency on parquet-generator

[jira] [Resolved] (PARQUET-309) Remove unnecessary compile dependency on parquet-generator

2015-06-17 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-309. --- Resolution: Fixed Fix Version/s: 1.8.0 Remove unnecessary compile dependency on

[jira] [Commented] (PARQUET-41) Add bloom filters to parquet statistics

2015-06-17 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-41?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14590073#comment-14590073 ] Ryan Blue commented on PARQUET-41: -- Great, thanks [~Ferd]! Could you also tell us a bit

[jira] [Commented] (PARQUET-246) ArrayIndexOutOfBoundsException with Parquet write version v2

2015-06-17 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14590045#comment-14590045 ] Ryan Blue commented on PARQUET-246: --- Should we also update the read side so we can

[jira] [Resolved] (PARQUET-39) Simplify ParquetReader's constructors

2015-05-28 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-39?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-39. -- This was added in https://github.com/apache/parquet-mr/commit/ad32bf0fd111ab473ad1080cde11de39e3c5a67f

[jira] [Commented] (PARQUET-293) ScalaReflectionException when trying to convert an RDD of Scrooge to a DataFrame

2015-05-28 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14563496#comment-14563496 ] Ryan Blue commented on PARQUET-293: --- [~lian cheng], could you take a look at this?

[jira] [Updated] (PARQUET-151) Null Pointer exception in parquet.hadoop.ParquetFileWriter.mergeFooters

2015-06-01 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-151: -- Assignee: Yash Datta Null Pointer exception in parquet.hadoop.ParquetFileWriter.mergeFooters

[jira] [Created] (PARQUET-296) Set master branch version back to 1.8.0-SNAPSHOT

2015-06-01 Thread Ryan Blue (JIRA)
Ryan Blue created PARQUET-296: - Summary: Set master branch version back to 1.8.0-SNAPSHOT Key: PARQUET-296 URL: https://issues.apache.org/jira/browse/PARQUET-296 Project: Parquet Issue Type: Bug

[jira] [Commented] (PARQUET-266) Add support for lists of primitives to Pig schema converter

2015-05-27 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14561923#comment-14561923 ] Ryan Blue commented on PARQUET-266: --- [~dweeks-netflix] or [~julienledem], you guys are

[jira] [Commented] (PARQUET-292) Release Parquet 1.8.0

2015-05-27 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14561927#comment-14561927 ] Ryan Blue commented on PARQUET-292: --- Adding PARQUET-265 instead of PARQUET-263.

[jira] [Commented] (PARQUET-292) Release Parquet 1.8.0

2015-05-27 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14561951#comment-14561951 ] Ryan Blue commented on PARQUET-292: --- Adding PARQUET-201, which was a bug fix pushed out

[jira] [Resolved] (PARQUET-199) Add a callback when the MemoryManager adjusts row group size

2015-05-27 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-199. --- Resolution: Fixed Fix Version/s: 1.8.0 This was merged a few days ago, just forgot to close.

[jira] [Resolved] (PARQUET-285) Implement nested types write rules in parquet-avro

2015-06-01 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-285. --- Resolution: Fixed Merged #198. Implement nested types write rules in parquet-avro

[jira] [Updated] (PARQUET-251) Binary column statistics error when reuse byte[] among rows

2015-07-01 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-251: -- Fix Version/s: (was: 2.0.0) 1.8.0 Binary column statistics error when reuse

[jira] [Resolved] (PARQUET-324) row count incorrect if data file has more than 2^31 rows

2015-07-03 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-324. --- Resolution: Fixed Fix Version/s: 1.8.0 Thanks for contributing the fix, [~tfriedr]! row

[jira] [Resolved] (PARQUET-320) Restore semver checks

2015-07-01 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-320. --- Resolution: Fixed Merged #230 Restore semver checks - Key:

[jira] [Resolved] (PARQUET-223) Add Map and List builiders

2015-05-26 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-223. --- Resolution: Fixed Fix Version/s: 1.8.0 I committed this. Thanks for the contribution

[jira] [Created] (PARQUET-361) Add prerelease logic to semantic versions

2015-08-19 Thread Ryan Blue (JIRA)
Ryan Blue created PARQUET-361: - Summary: Add prerelease logic to semantic versions Key: PARQUET-361 URL: https://issues.apache.org/jira/browse/PARQUET-361 Project: Parquet Issue Type:

[jira] [Resolved] (PARQUET-361) Add prerelease logic to semantic versions

2015-08-20 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-361. --- Resolution: Fixed Assignee: Ryan Blue Add prerelease logic to semantic versions

[jira] [Resolved] (PARQUET-316) Run.sh is broken in parquet-benchmarks

2015-06-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-316. --- Resolution: Fixed Fix Version/s: 1.8.0 Merged Nezih's PR. Thanks for fixing this! Run.sh is

[jira] [Commented] (PARQUET-146) make Parquet compile with java 7 instead of java 6

2015-06-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14608848#comment-14608848 ] Ryan Blue commented on PARQUET-146: --- We should discuss this on the mailing list. We've

[jira] [Created] (PARQUET-320) Restore semver checks

2015-06-29 Thread Ryan Blue (JIRA)
Ryan Blue created PARQUET-320: - Summary: Restore semver checks Key: PARQUET-320 URL: https://issues.apache.org/jira/browse/PARQUET-320 Project: Parquet Issue Type: Bug Components:

[jira] [Commented] (PARQUET-41) Add bloom filters to parquet statistics

2015-06-29 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-41?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14606674#comment-14606674 ] Ryan Blue commented on PARQUET-41: -- I should also point out there's a table on the first

[jira] [Updated] (PARQUET-321) Set the HDFS padding default to 8MB

2015-06-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-321: -- Summary: Set the HDFS padding default to 8MB (was: Set the HDFS padding default to 16MB) Set the

[jira] [Created] (PARQUET-321) Set the HDFS padding default to 16MB

2015-06-30 Thread Ryan Blue (JIRA)
Ryan Blue created PARQUET-321: - Summary: Set the HDFS padding default to 16MB Key: PARQUET-321 URL: https://issues.apache.org/jira/browse/PARQUET-321 Project: Parquet Issue Type: Improvement

[jira] [Commented] (PARQUET-144) read a single file outside of mapreduce framework

2015-07-31 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14649334#comment-14649334 ] Ryan Blue commented on PARQUET-144: --- [~hy5446]: you can read files outside of MR using

[jira] [Resolved] (PARQUET-144) read a single file outside of mapreduce framework

2015-07-31 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-144. --- Resolution: Not A Problem I'm resolving this as not a problem because it is a request for

[jira] [Commented] (PARQUET-344) Limit the number of rows per block and per split

2015-07-28 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14644711#comment-14644711 ] Ryan Blue commented on PARQUET-344: --- [~QuentinFra], you can currently set the row group

[jira] [Commented] (PARQUET-347) Thrift projection does not handle new (optional) fields in requestedSchema

2015-07-28 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14644955#comment-14644955 ] Ryan Blue commented on PARQUET-347: --- Seems like we should more generally take a look at

[jira] [Commented] (PARQUET-355) Create Integration tests to validate statistics

2015-08-07 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14662000#comment-14662000 ] Ryan Blue commented on PARQUET-355: --- [~sircodesalot], thanks for working on this! Can

[jira] [Commented] (PARQUET-358) Add support for temporal logical types to AVRO/Parquet conversion

2015-08-14 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14697252#comment-14697252 ] Ryan Blue commented on PARQUET-358: --- Thanks for opening an issue on this one,

[jira] [Created] (PARQUET-356) Add ElephantBird section to LICENSE file

2015-08-12 Thread Ryan Blue (JIRA)
Ryan Blue created PARQUET-356: - Summary: Add ElephantBird section to LICENSE file Key: PARQUET-356 URL: https://issues.apache.org/jira/browse/PARQUET-356 Project: Parquet Issue Type: Task

[jira] [Created] (PARQUET-335) Avro object model should not require MAP_KEY_VALUE

2015-07-15 Thread Ryan Blue (JIRA)
Ryan Blue created PARQUET-335: - Summary: Avro object model should not require MAP_KEY_VALUE Key: PARQUET-335 URL: https://issues.apache.org/jira/browse/PARQUET-335 Project: Parquet Issue Type:

[jira] [Updated] (PARQUET-327) Show statistics in the dump output

2015-07-15 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-327: -- Fix Version/s: (was: 1.8.0) 1.9.0 Show statistics in the dump output

[jira] [Updated] (PARQUET-288) Add dictionary support to Avro converters

2015-07-15 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-288: -- Fix Version/s: (was: 1.8.0) Add dictionary support to Avro converters

[jira] [Updated] (PARQUET-337) binary fields inside map/set/list are not handled in parquet-scrooge

2015-07-16 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-337: -- Assignee: Jake Donham binary fields inside map/set/list are not handled in parquet-scrooge

[jira] [Commented] (PARQUET-339) Add Alex Levenson to KEYS file

2015-07-17 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14631811#comment-14631811 ] Ryan Blue commented on PARQUET-339: --- I'm fine just pushing changes like this, though we

[jira] [Created] (PARQUET-332) Incompatible changes in o.a.p.thrift.projection

2015-07-13 Thread Ryan Blue (JIRA)
Ryan Blue created PARQUET-332: - Summary: Incompatible changes in o.a.p.thrift.projection Key: PARQUET-332 URL: https://issues.apache.org/jira/browse/PARQUET-332 Project: Parquet Issue Type: Bug

[jira] [Updated] (PARQUET-241) ParquetInputFormat.getFooters() should return in the same order as what listStatus() returns

2015-10-29 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-241: -- Assignee: Mingyu Kim > ParquetInputFormat.getFooters() should return in the same order as what >

[jira] [Resolved] (PARQUET-241) ParquetInputFormat.getFooters() should return in the same order as what listStatus() returns

2015-10-29 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-241. --- Resolution: Fixed Merged #164. Thanks [~mkim] for the contribution! (And sorry this took so long.

[jira] [Updated] (PARQUET-241) ParquetInputFormat.getFooters() should return in the same order as what listStatus() returns

2015-10-29 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-241: -- Fix Version/s: 1.9.0 > ParquetInputFormat.getFooters() should return in the same order as what >

[jira] [Resolved] (PARQUET-369) Shading SLF4J prevents SLF4J locating org.slf4j.impl.StaticLoggerBinder

2015-10-27 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-369. --- Resolution: Fixed Assignee: Ryan Blue Fix Version/s: format-2.3.1 > Shading SLF4J

[jira] [Commented] (PARQUET-241) ParquetInputFormat.getFooters() should return in the same order as what listStatus() returns

2015-10-27 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14977329#comment-14977329 ] Ryan Blue commented on PARQUET-241: --- [~skonto], I think that most formats are consistent by accident,

[jira] [Commented] (PARQUET-241) ParquetInputFormat.getFooters() should return in the same order as what listStatus() returns

2015-10-28 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978699#comment-14978699 ] Ryan Blue commented on PARQUET-241: --- Building 1.7.0 shouldn't make a difference because this issue is

[jira] [Commented] (PARQUET-389) Filter predicates should work with missing columns

2015-10-28 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978702#comment-14978702 ] Ryan Blue commented on PARQUET-389: --- I agree, assuming that by "merged" you mean resolving the

[jira] [Commented] (PARQUET-140) Allow clients to control the GenericData object that is used to read Avro records

2015-10-28 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978707#comment-14978707 ] Ryan Blue commented on PARQUET-140: --- [~DeaconDesperado], you are correct. This allows you to use

[jira] [Commented] (PARQUET-391) Parquet build fails with thrift9 profile

2015-11-10 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14999133#comment-14999133 ] Ryan Blue commented on PARQUET-391: --- I think this is a duplicate of PARQUET-380. There's a PR with a

[jira] [Commented] (PARQUET-124) parquet.hadoop.ParquetOutputCommitter.commitJob() throws parquet.io.ParquetEncodingException

2015-11-09 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14996949#comment-14996949 ] Ryan Blue commented on PARQUET-124: --- [~swethakasireddy], it looks like this wasn't completely addressed

[jira] [Commented] (PARQUET-390) GroupType.union(Type toMerge, boolean strict) does not honor strict parameter

2015-11-09 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14997083#comment-14997083 ] Ryan Blue commented on PARQUET-390: --- You're right that my suggestion is a much larger issue. For this

[jira] [Commented] (PARQUET-380) Cascading and scrooge builds fail when using thrift 0.9.0

2015-11-17 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15009063#comment-15009063 ] Ryan Blue commented on PARQUET-380: --- There are build failures from thrift's SLF4J dependency. I just

[jira] [Commented] (PARQUET-41) Add bloom filters to parquet statistics

2015-11-17 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-41?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15009099#comment-15009099 ] Ryan Blue commented on PARQUET-41: -- [~Ferd], I think we need a design doc for this feature and some data

[jira] [Updated] (PARQUET-390) GroupType.union(Type toMerge, boolean strict) does not honor strict parameter

2015-11-04 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-390: -- Labels: newbie parquet (was: parquet) > GroupType.union(Type toMerge, boolean strict) does not honor

[jira] [Commented] (PARQUET-390) GroupType.union(Type toMerge, boolean strict) does not honor strict parameter

2015-11-04 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14989939#comment-14989939 ] Ryan Blue commented on PARQUET-390: --- Thanks for the bug report, Michael. I think you're right about

[jira] [Resolved] (PARQUET-373) MemoryManager tests are flaky

2015-10-19 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-373. --- Resolution: Fixed > MemoryManager tests are flaky > - > >

[jira] [Updated] (PARQUET-246) ArrayIndexOutOfBoundsException with Parquet write version v2

2015-07-09 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-246: -- Fix Version/s: (was: 2.0.0) 1.8.0 ArrayIndexOutOfBoundsException with Parquet

[jira] [Commented] (PARQUET-246) ArrayIndexOutOfBoundsException with Parquet write version v2

2015-07-09 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14620919#comment-14620919 ] Ryan Blue commented on PARQUET-246: --- The {{parquet.split.files}} option will read all

[jira] [Resolved] (PARQUET-246) ArrayIndexOutOfBoundsException with Parquet write version v2

2015-07-09 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-246. --- Resolution: Fixed Assignee: Konstantin Shaposhnikov Closing this now that the read side has a

[jira] [Commented] (PARQUET-380) Cascading and scrooge builds fail when using thrift 0.9.0

2015-11-17 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15009838#comment-15009838 ] Ryan Blue commented on PARQUET-380: --- When I add the dependency for libthrift, I get an error somewhere

[jira] [Resolved] (PARQUET-380) Cascading and scrooge builds fail when using thrift 0.9.0

2015-11-17 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-380. --- Resolution: Fixed Fixed. Thanks for the push, [~saucam]! > Cascading and scrooge builds fail when

[jira] [Commented] (PARQUET-344) Limit the number of rows per block and per split

2015-08-25 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14712108#comment-14712108 ] Ryan Blue commented on PARQUET-344: --- Thanks Quentin! I like Dan's idea of limiting the

[jira] [Created] (PARQUET-373) MemoryManager tests are flaky

2015-09-11 Thread Ryan Blue (JIRA)
Ryan Blue created PARQUET-373: - Summary: MemoryManager tests are flaky Key: PARQUET-373 URL: https://issues.apache.org/jira/browse/PARQUET-373 Project: Parquet Issue Type: Bug

[jira] [Resolved] (PARQUET-335) Avro object model should not require MAP_KEY_VALUE

2015-09-11 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-335. --- Resolution: Fixed > Avro object model should not require MAP_KEY_VALUE >

[jira] [Created] (PARQUET-372) Parquet stats can have awkwardly large values

2015-09-10 Thread Ryan Blue (JIRA)
Ryan Blue created PARQUET-372: - Summary: Parquet stats can have awkwardly large values Key: PARQUET-372 URL: https://issues.apache.org/jira/browse/PARQUET-372 Project: Parquet Issue Type: Bug

[jira] [Commented] (PARQUET-379) PrimitiveType.union erases original type

2015-09-28 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14933846#comment-14933846 ] Ryan Blue commented on PARQUET-379: --- I think this is part of a larger issue of handling schema

[jira] [Commented] (PARQUET-369) Shading SLF4J prevents SLF4J locating org.slf4j.impl.StaticLoggerBinder

2015-09-24 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14906920#comment-14906920 ] Ryan Blue commented on PARQUET-369: --- I should also note: I've verified that there are no org.slf4j.*

[jira] [Commented] (PARQUET-369) Shading SLF4J prevents SLF4J locating org.slf4j.impl.StaticLoggerBinder

2015-09-24 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14906993#comment-14906993 ] Ryan Blue commented on PARQUET-369: --- I've updated the PR to shade slf4j-nop and confirmed that

[jira] [Commented] (PARQUET-383) ParquetOutputCommitter should propagate errors when writing metadata files

2015-09-24 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14907123#comment-14907123 ] Ryan Blue commented on PARQUET-383: --- I think this is a good idea. I'd make the error fatal only if the

[jira] [Commented] (PARQUET-369) Shading SLF4J prevents SLF4J locating org.slf4j.impl.StaticLoggerBinder

2015-09-24 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14906986#comment-14906986 ] Ryan Blue commented on PARQUET-369: --- Ignore my comment above, I just tested out the partial relocation

[jira] [Created] (PARQUET-382) Add a way to append encoded blocks in ParquetFileWriter

2015-09-24 Thread Ryan Blue (JIRA)
Ryan Blue created PARQUET-382: - Summary: Add a way to append encoded blocks in ParquetFileWriter Key: PARQUET-382 URL: https://issues.apache.org/jira/browse/PARQUET-382 Project: Parquet Issue

[jira] [Assigned] (PARQUET-372) Parquet stats can have awkwardly large values

2015-09-25 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue reassigned PARQUET-372: - Assignee: Ryan Blue > Parquet stats can have awkwardly large values >

[jira] [Updated] (PARQUET-372) Parquet stats can have awkwardly large values

2015-09-25 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-372: -- Description: If a column is storing very large values, say 2-4 MB, then the page header's min and max

[jira] [Commented] (PARQUET-34) Add support for repeated columns in the filter2 API

2015-12-02 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-34?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15036944#comment-15036944 ] Ryan Blue commented on PARQUET-34: -- [~f.pompermaier], I don't think anyone has extra cycles to spend

[jira] [Resolved] (PARQUET-382) Add a way to append encoded blocks in ParquetFileWriter

2015-12-08 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-382. --- Resolution: Fixed Fix Version/s: 1.9.0 Merged #278. Thanks for reviewing, Sergio! > Add a

[jira] [Commented] (PARQUET-402) Apache Pig cannot store Map data type into Parquet format

2015-12-10 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15051521#comment-15051521 ] Ryan Blue commented on PARQUET-402: --- Is there anything we can do about it? Maybe we should at least

[jira] [Updated] (PARQUET-393) release parquet-format 2.3.1

2015-12-10 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-393: -- Summary: release parquet-format 2.3.1 (was: release parquet-format 2.4.0) > release parquet-format

[jira] [Updated] (PARQUET-346) ThriftSchemaConverter throws for unknown struct or union type

2015-12-14 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-346: -- Fix Version/s: (was: 2.0.0) 1.9.0 > ThriftSchemaConverter throws for unknown

[jira] [Commented] (PARQUET-405) Backwards-incompatible change to thrift metadata

2015-12-14 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15057013#comment-15057013 ] Ryan Blue commented on PARQUET-405: --- Thanks, Ben! Both for reporting the issue and for helping us keep

  1   2   3   4   >