[jira] [Assigned] (PARQUET-485) Decouple data page delimiting from column reader / scanner classes, create test fixtures

2016-01-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-485: Assignee: Wes McKinney > Decouple data page delimiting from column reader / scanner

[jira] [Commented] (PARQUET-479) Improve/expand functional unit tests

2016-01-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15125001#comment-15125001 ] Wes McKinney commented on PARQUET-479: -- I thought some more about this, and I'm not supportive of

[jira] [Created] (PARQUET-485) Decouple data page delimiting from column reader / scanner classes, create test fixtures

2016-01-30 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-485: Summary: Decouple data page delimiting from column reader / scanner classes, create test fixtures Key: PARQUET-485 URL: https://issues.apache.org/jira/browse/PARQUET-485

[jira] [Comment Edited] (PARQUET-466) Make parquet-format a git submodule and add tool for updating generated Thrift code

2016-01-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15125105#comment-15125105 ] Wes McKinney edited comment on PARQUET-466 at 1/30/16 10:27 PM: In the

[jira] [Commented] (PARQUET-467) Implement and test BIT_PACKED level encoding / decoding

2016-01-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15125162#comment-15125162 ] Wes McKinney commented on PARQUET-467: -- Please see

[jira] [Commented] (PARQUET-466) Make parquet-format a git submodule and add tool for updating generated Thrift code

2016-01-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15125105#comment-15125105 ] Wes McKinney commented on PARQUET-466: -- In this interest of parsimonious development, I propose to

[jira] [Commented] (PARQUET-467) Implement and test BIT_PACKED level encoding / decoding

2016-01-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15125213#comment-15125213 ] Wes McKinney commented on PARQUET-467: -- Yes: * BIT_PACKED uses {{BitReader}} * RLE uses

[jira] [Commented] (PARQUET-467) Implement and test BIT_PACKED level encoding / decoding

2016-01-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15125221#comment-15125221 ] Wes McKinney commented on PARQUET-467: -- If you try to read a file that uses this encoding,

[jira] [Assigned] (PARQUET-496) Fix cpplint configuration to be more restrictive

2016-02-01 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-496: Assignee: Wes McKinney > Fix cpplint configuration to be more restrictive >

[jira] [Commented] (PARQUET-496) Fix cpplint configuration to be more restrictive

2016-02-01 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15126889#comment-15126889 ] Wes McKinney commented on PARQUET-496: -- See https://github.com/apache/parquet-cpp/pull/33 > Fix

[jira] [Assigned] (PARQUET-468) Add a cmake option to generate the Parquet thrift headers with the thriftc in the environment

2016-02-01 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-468: Assignee: Wes McKinney > Add a cmake option to generate the Parquet thrift headers with

[jira] [Assigned] (PARQUET-478) Reassembly algorithms for nested in-memory columnar memory layout

2016-02-01 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-478: Assignee: Wes McKinney > Reassembly algorithms for nested in-memory columnar memory

[jira] [Assigned] (PARQUET-498) Add a ColumnChunk builder abstraction as part of creating new row groups

2016-02-01 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-498: Assignee: Wes McKinney > Add a ColumnChunk builder abstraction as part of creating new

[jira] [Commented] (PARQUET-502) Scanner segfaults when reading a FIXED_LEN_BYTE_ARRAY column

2016-02-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15128560#comment-15128560 ] Wes McKinney commented on PARQUET-502: -- Let's leave this open until we have unit tests to verify?

[jira] [Assigned] (PARQUET-501) Add an OutputStream abstraction (capable of memory allocation) for Encoder public API

2016-02-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-501: Assignee: Wes McKinney > Add an OutputStream abstraction (capable of memory allocation)

[jira] [Created] (PARQUET-501) Add an OutputStream abstraction (capable of memory allocation) for Encoder public API

2016-02-02 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-501: Summary: Add an OutputStream abstraction (capable of memory allocation) for Encoder public API Key: PARQUET-501 URL: https://issues.apache.org/jira/browse/PARQUET-501

[jira] [Commented] (PARQUET-442) Convert flat SchemaElement vector to implied nested schema data structure

2016-02-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15129973#comment-15129973 ] Wes McKinney commented on PARQUET-442: -- There's a lot more to do here than fits in one JIRA. I'll

[jira] [Commented] (PARQUET-476) Add a utility function to print the raw repetition / definition levels to an std::ostream

2016-02-03 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15130808#comment-15130808 ] Wes McKinney commented on PARQUET-476: -- It would be best to wait for PARQUET-442 (this week) before

[jira] [Updated] (PARQUET-438) Update RLE encoder/decoder modules from Impala upstream changes and adapt unit tests

2016-01-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-438: - Summary: Update RLE encoder/decoder modules from Impala upstream changes and adapt unit tests

[jira] [Commented] (PARQUET-438) Update RLE encoder/decoder modules from Impala upstream changes and adapt unit tests

2016-01-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15124123#comment-15124123 ] Wes McKinney commented on PARQUET-438: -- See https://github.com/apache/parquet-cpp/pull/31 I'm

[jira] [Updated] (PARQUET-462) Implement a LevelDecoder class (like Impala) which dispatches to RLE or BIT_PACKED decoding as appropriate

2016-01-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-462: - Summary: Implement a LevelDecoder class (like Impala) which dispatches to RLE or BIT_PACKED

[jira] [Commented] (PARQUET-467) Implement and test BIT_PACKED level encoding / decoding

2016-01-31 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15125389#comment-15125389 ] Wes McKinney commented on PARQUET-467: -- Yes (both JIRAs need test cases, PARQUET-485 will make that

[jira] [Commented] (PARQUET-467) Check for and raise error for deprecated BIT_PACKED encoding

2016-01-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15124182#comment-15124182 ] Wes McKinney commented on PARQUET-467: -- Per PARQUET-462 we can go ahead and implement this level

[jira] [Commented] (PARQUET-438) Update RLE encoder/decoder modules from Impala upstream changes and adapt unit tests

2016-01-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15124173#comment-15124173 ] Wes McKinney commented on PARQUET-438: -- [~mdeepak] If you identify a specific problem with the

[jira] [Created] (PARQUET-475) Run DebugPrint on all data files in the data/ directory

2016-01-27 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-475: Summary: Run DebugPrint on all data files in the data/ directory Key: PARQUET-475 URL: https://issues.apache.org/jira/browse/PARQUET-475 Project: Parquet

[jira] [Resolved] (PARQUET-531) Can't read past first page in a column

2016-02-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-531. -- Resolution: Fixed This was fixed in https://github.com/apache/parquet-cpp/pull/62. I verified

[jira] [Updated] (PARQUET-520) Add version of LocalFileSource that uses memory-mapping for zero-copy reads

2016-02-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-520: - Description: Repurposed this JIRA after PARQUET-533. Memory-mapping will save us memory

[jira] [Commented] (PARQUET-545) Support Decimal values

2016-02-25 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15168396#comment-15168396 ] Wes McKinney commented on PARQUET-545: -- Can we scope this issue a bit (what does "support" mean)?

[jira] [Resolved] (PARQUET-539) Enable include_order cpplint check

2016-02-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-539. -- Resolution: Fixed Assignee: Wes McKinney done in

[jira] [Assigned] (PARQUET-493) Adapt DictEncoder from Impala (or implement a new one) and unit test

2016-02-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-493: Assignee: Wes McKinney > Adapt DictEncoder from Impala (or implement a new one) and unit

[jira] [Commented] (PARQUET-494) Implement PLAIN_DICTIONARY encoding and decoding

2016-02-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-494?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15166711#comment-15166711 ] Wes McKinney commented on PARQUET-494: -- see patch https://github.com/apache/parquet-cpp/pull/64 >

[jira] [Assigned] (PARQUET-494) Implement PLAIN_DICTIONARY encoding and decoding

2016-02-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-494: Assignee: Wes McKinney (was: Deepak Majeti) > Implement PLAIN_DICTIONARY encoding and

[jira] [Commented] (PARQUET-493) Adapt DictEncoder from Impala (or implement a new one) and unit test

2016-02-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15166709#comment-15166709 ] Wes McKinney commented on PARQUET-493: -- see patch https://github.com/apache/parquet-cpp/pull/64 >

[jira] [Created] (PARQUET-550) Large file concerns with fseek/ftell

2016-02-29 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-550: Summary: Large file concerns with fseek/ftell Key: PARQUET-550 URL: https://issues.apache.org/jira/browse/PARQUET-550 Project: Parquet Issue Type: Bug

[jira] [Commented] (PARQUET-520) Add version of LocalFileSource that uses memory-mapping for zero-copy reads

2016-02-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15172327#comment-15172327 ] Wes McKinney commented on PARQUET-520: -- see patch: https://github.com/apache/parquet-cpp/pull/66 >

[jira] [Commented] (PARQUET-463) Add DCHECK* macros for assertions in debug builds

2016-02-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15172507#comment-15172507 ] Wes McKinney commented on PARQUET-463: -- see patch https://github.com/apache/parquet-cpp/pull/67 >

[jira] [Assigned] (PARQUET-482) Organize src code file structure to have a very clear folder with public headers.

2016-02-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-482: Assignee: Wes McKinney > Organize src code file structure to have a very clear folder

[jira] [Commented] (PARQUET-519) Disable compiler warning supressions and fix compiler warnings with -O3

2016-02-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15173023#comment-15173023 ] Wes McKinney commented on PARQUET-519: -- Patch available:

[jira] [Resolved] (PARQUET-516) Add better error handling for reading local files

2016-02-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-516. -- Resolution: Fixed Fixed in

[jira] [Updated] (PARQUET-519) Disable compiler warning supressions and fix all DEBUG build warnings

2016-02-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-519: - Summary: Disable compiler warning supressions and fix all DEBUG build warnings (was: Disable

[jira] [Commented] (PARQUET-519) Disable compiler warning supressions and fix all DEBUG build warnings

2016-02-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15173033#comment-15173033 ] Wes McKinney commented on PARQUET-519: -- I'm creating a separate JIRA for looking at the compiler

[jira] [Created] (PARQUET-551) Handle compiler warnings due to disabled DCHECKs in release builds

2016-02-29 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-551: Summary: Handle compiler warnings due to disabled DCHECKs in release builds Key: PARQUET-551 URL: https://issues.apache.org/jira/browse/PARQUET-551 Project: Parquet

[jira] [Commented] (PARQUET-518) Review usages of size_t and unsigned integers generally per Google style guide

2016-02-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15170246#comment-15170246 ] Wes McKinney commented on PARQUET-518: -- Patch available here

[jira] [Resolved] (PARQUET-493) Adapt DictEncoder from Impala (or implement a new one) and unit test

2016-02-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-493. -- Resolution: Fixed Complete in

[jira] [Resolved] (PARQUET-526) Add more complete unit test coverage for column Scanner implementations

2016-02-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-526. -- Resolution: Fixed Resolved by combined patch

[jira] [Resolved] (PARQUET-502) Scanner segfaults when its batch size is smaller than the number of rows

2016-02-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-502. -- Resolution: Fixed The fix for this was combined with the patch

[jira] [Created] (PARQUET-547) Refactor most templates to use DataType structs rather than the Type::type enum

2016-02-25 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-547: Summary: Refactor most templates to use DataType structs rather than the Type::type enum Key: PARQUET-547 URL: https://issues.apache.org/jira/browse/PARQUET-547

[jira] [Commented] (PARQUET-277) Remove boost dependency

2016-01-19 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15107419#comment-15107419 ] Wes McKinney commented on PARQUET-277: -- With PARQUET-416 we are now on C++11 and the only boost

[jira] [Commented] (PARQUET-437) Ship googletest thirdparty dependency and add cmake tools (ADD_PARQUET_TEST) to simplify adding new unit tests

2016-01-19 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15107610#comment-15107610 ] Wes McKinney commented on PARQUET-437: -- See https://github.com/apache/parquet-cpp/pull/19 > Ship

[jira] [Commented] (PARQUET-238) Unable to Install C++ Driver - reference to 'share_ptr' is ambiguous

2016-01-19 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15107425#comment-15107425 ] Wes McKinney commented on PARQUET-238: -- [~aaronbenz] now that we are on C++11 with PARQUET-416, can

[jira] [Created] (PARQUET-434) Add a ParquetFileReader class to encapsulate some low-level details of interacting with Parquet files

2016-01-19 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-434: Summary: Add a ParquetFileReader class to encapsulate some low-level details of interacting with Parquet files Key: PARQUET-434 URL:

[jira] [Assigned] (PARQUET-437) Ship googletest thirdparty dependency and add cmake tools (ADD_PARQUET_TEST) to simplify adding new unit tests

2016-01-19 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-437: Assignee: Wes McKinney > Ship googletest thirdparty dependency and add cmake tools

[jira] [Created] (PARQUET-438) Adapt any relevant encoding and compression unit tests from Impala

2016-01-19 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-438: Summary: Adapt any relevant encoding and compression unit tests from Impala Key: PARQUET-438 URL: https://issues.apache.org/jira/browse/PARQUET-438 Project: Parquet

[jira] [Created] (PARQUET-436) Implement ParquetFileWriter class entry point for generating new Parquet files

2016-01-19 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-436: Summary: Implement ParquetFileWriter class entry point for generating new Parquet files Key: PARQUET-436 URL: https://issues.apache.org/jira/browse/PARQUET-436

[jira] [Updated] (PARQUET-447) Add Debug and Release build types and associated compiler flags

2016-01-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-447: - Issue Type: Improvement (was: Bug) > Add Debug and Release build types and associated compiler

[jira] [Created] (PARQUET-448) Add cmake option to skip building the unit tests

2016-01-20 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-448: Summary: Add cmake option to skip building the unit tests Key: PARQUET-448 URL: https://issues.apache.org/jira/browse/PARQUET-448 Project: Parquet Issue

[jira] [Created] (PARQUET-449) Update to latest parquet.thrift

2016-01-20 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-449: Summary: Update to latest parquet.thrift Key: PARQUET-449 URL: https://issues.apache.org/jira/browse/PARQUET-449 Project: Parquet Issue Type: Improvement

[jira] [Created] (PARQUET-447) Add Debug and Release build types and associated compiler flags

2016-01-20 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-447: Summary: Add Debug and Release build types and associated compiler flags Key: PARQUET-447 URL: https://issues.apache.org/jira/browse/PARQUET-447 Project: Parquet

[jira] [Commented] (PARQUET-433) Specialize ColumnReaders based on the column type

2016-01-21 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15111375#comment-15111375 ] Wes McKinney commented on PARQUET-433: -- I'd love to have it within the next couple days. I can also

[jira] [Commented] (PARQUET-433) Specialize ColumnReaders based on the column type

2016-01-21 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15111352#comment-15111352 ] Wes McKinney commented on PARQUET-433: -- [~asandryh] where do you stand on your patch for this? I

[jira] [Created] (PARQUET-456) Add zlib codec support

2016-01-22 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-456: Summary: Add zlib codec support Key: PARQUET-456 URL: https://issues.apache.org/jira/browse/PARQUET-456 Project: Parquet Issue Type: New Feature

[jira] [Created] (PARQUET-457) Add compressor-decompressor unit tests

2016-01-22 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-457: Summary: Add compressor-decompressor unit tests Key: PARQUET-457 URL: https://issues.apache.org/jira/browse/PARQUET-457 Project: Parquet Issue Type: Test

[jira] [Created] (PARQUET-458) Implement support for DataPageV2

2016-01-22 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-458: Summary: Implement support for DataPageV2 Key: PARQUET-458 URL: https://issues.apache.org/jira/browse/PARQUET-458 Project: Parquet Issue Type: New Feature

[jira] [Created] (PARQUET-455) Fix compiler warnings on OS X / Clang

2016-01-22 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-455: Summary: Fix compiler warnings on OS X / Clang Key: PARQUET-455 URL: https://issues.apache.org/jira/browse/PARQUET-455 Project: Parquet Issue Type: Bug

[jira] [Created] (PARQUET-451) Add a RowGroup reader interface class

2016-01-22 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-451: Summary: Add a RowGroup reader interface class Key: PARQUET-451 URL: https://issues.apache.org/jira/browse/PARQUET-451 Project: Parquet Issue Type: New

[jira] [Commented] (PARQUET-462) Create a new Level class for definition and repetition values

2016-01-25 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115790#comment-15115790 ] Wes McKinney commented on PARQUET-462: -- Could you explain this in more detail, especially in the

[jira] [Resolved] (PARQUET-238) Unable to Install C++ Driver - reference to 'share_ptr' is ambiguous

2016-01-25 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-238. -- Resolution: Resolved This is resolved with PARQUET-418 and PARQUET-267. Please let us know if

[jira] [Comment Edited] (PARQUET-238) Unable to Install C++ Driver - reference to 'share_ptr' is ambiguous

2016-01-25 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15116456#comment-15116456 ] Wes McKinney edited comment on PARQUET-238 at 1/26/16 1:20 AM: --- This is

[jira] [Created] (PARQUET-464) Add cmake option and #defines to enable/disable struct packing

2016-01-25 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-464: Summary: Add cmake option and #defines to enable/disable struct packing Key: PARQUET-464 URL: https://issues.apache.org/jira/browse/PARQUET-464 Project: Parquet

[jira] [Commented] (PARQUET-449) Update to latest parquet.thrift

2016-01-25 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15116337#comment-15116337 ] Wes McKinney commented on PARQUET-449: -- [~nongli] the GitHub PR is still outstanding > Update to

[jira] [Created] (PARQUET-445) Batch/vectorized decoding of array sizes within each repetition level

2016-01-20 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-445: Summary: Batch/vectorized decoding of array sizes within each repetition level Key: PARQUET-445 URL: https://issues.apache.org/jira/browse/PARQUET-445 Project:

[jira] [Commented] (PARQUET-434) Add a ParquetFileReader class to encapsulate some low-level details of interacting with Parquet files

2016-01-19 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15107967#comment-15107967 ] Wes McKinney commented on PARQUET-434: -- [~asandryh] have a look; I tried to steer clear of code

[jira] [Updated] (PARQUET-437) Incorporate googletest thirdparty dependency and add cmake tools (ADD_PARQUET_TEST) to simplify adding new unit tests

2016-01-19 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-437: - Description: (was: The googletest developers recommend shipping gtest and building it

[jira] [Commented] (PARQUET-434) Add a ParquetFileReader class to encapsulate some low-level details of interacting with Parquet files

2016-01-19 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15107960#comment-15107960 ] Wes McKinney commented on PARQUET-434: -- See https://github.com/apache/parquet-cpp/pull/20 I'd like

[jira] [Updated] (PARQUET-437) Incorporate googletest thirdparty dependency and add cmake tools (ADD_PARQUET_TEST) to simplify adding new unit tests

2016-01-19 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-437: - Summary: Incorporate googletest thirdparty dependency and add cmake tools (ADD_PARQUET_TEST) to

[jira] [Updated] (PARQUET-442) Convert flat SchemaElement vector to implied nested schema data structure

2016-01-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-442: - Issue Type: New Feature (was: Bug) > Convert flat SchemaElement vector to implied nested schema

[jira] [Updated] (PARQUET-441) Schema resolution: one, two, and three-level array encoding

2016-01-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-441: - Issue Type: New Feature (was: Bug) > Schema resolution: one, two, and three-level array

[jira] [Created] (PARQUET-443) Schema resolution: map encoding

2016-01-20 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-443: Summary: Schema resolution: map encoding Key: PARQUET-443 URL: https://issues.apache.org/jira/browse/PARQUET-443 Project: Parquet Issue Type: New Feature

[jira] [Commented] (PARQUET-446) Hide thrift dependency in parquet-cpp

2016-01-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15109706#comment-15109706 ] Wes McKinney commented on PARQUET-446: -- Makes sense; it would be nice to extract the

[jira] [Created] (PARQUET-441) Schema resolution: one, two, and three-level array encoding

2016-01-20 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-441: Summary: Schema resolution: one, two, and three-level array encoding Key: PARQUET-441 URL: https://issues.apache.org/jira/browse/PARQUET-441 Project: Parquet

[jira] [Created] (PARQUET-444) Metadata generation: Nested physical schema builder

2016-01-20 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-444: Summary: Metadata generation: Nested physical schema builder Key: PARQUET-444 URL: https://issues.apache.org/jira/browse/PARQUET-444 Project: Parquet Issue

[jira] [Resolved] (PARQUET-440) Error handling: C++ exceptions or Status

2016-01-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-440. -- Resolution: Resolved > Error handling: C++ exceptions or Status >

[jira] [Commented] (PARQUET-459) Improve handling of null values

2016-01-23 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15113835#comment-15113835 ] Wes McKinney commented on PARQUET-459: -- Do you have a patch for PARQUET-428 somewhere? Re:

[jira] [Comment Edited] (PARQUET-459) Improve handling of null values

2016-01-23 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15113835#comment-15113835 ] Wes McKinney edited comment on PARQUET-459 at 1/23/16 4:50 PM: --- Do you have

[jira] [Commented] (PARQUET-459) Improve handling of null values

2016-01-23 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15114022#comment-15114022 ] Wes McKinney commented on PARQUET-459: -- The value decoders are already internally buffering arrays

[jira] [Commented] (PARQUET-453) Refactor parquet_reader.cc into a ParquetFileReader::DebugPrint method

2016-01-23 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15114000#comment-15114000 ] Wes McKinney commented on PARQUET-453: -- This is done as part of

[jira] [Commented] (PARQUET-451) Add a RowGroup reader interface class

2016-01-23 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15113999#comment-15113999 ] Wes McKinney commented on PARQUET-451: -- This is done in

[jira] [Created] (PARQUET-468) Add a cmake option to generate the Parquet thrift headers with the thriftc in the environment

2016-01-26 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-468: Summary: Add a cmake option to generate the Parquet thrift headers with the thriftc in the environment Key: PARQUET-468 URL: https://issues.apache.org/jira/browse/PARQUET-468

[jira] [Commented] (PARQUET-468) Add a cmake option to generate the Parquet thrift headers with the thriftc in the environment

2016-01-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15117884#comment-15117884 ] Wes McKinney commented on PARQUET-468: -- see PARQUET-469 > Add a cmake option to generate the

[jira] [Updated] (PARQUET-469) Roll back Thrift bindings to 0.9.1

2016-01-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-469: - Summary: Roll back Thrift bindings to 0.9.1 (was: Roll back Thrift bindings to 0.9.0) > Roll

[jira] [Created] (PARQUET-470) Thrift 0.9.3 cannot be used in conjunction with googletest and C++11 on Linux

2016-01-26 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-470: Summary: Thrift 0.9.3 cannot be used in conjunction with googletest and C++11 on Linux Key: PARQUET-470 URL: https://issues.apache.org/jira/browse/PARQUET-470

[jira] [Commented] (PARQUET-470) Thrift 0.9.3 cannot be used in conjunction with googletest and C++11 on Linux

2016-01-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15117941#comment-15117941 ] Wes McKinney commented on PARQUET-470: -- https://github.com/apache/parquet-cpp/pull/25 > Thrift

[jira] [Updated] (PARQUET-469) Roll back Thrift bindings to 0.9.0

2016-01-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-469: - Summary: Roll back Thrift bindings to 0.9.0 (was: Roll back Thrift bindings to 0.9.1) > Roll

[jira] [Commented] (PARQUET-469) Roll back Thrift bindings to 0.9.0

2016-01-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15117966#comment-15117966 ] Wes McKinney commented on PARQUET-469: -- On Linux, Thrift bindings compiled with 0.9.1 or higher have

[jira] [Updated] (PARQUET-472) Clean up InputStream ownership semantics in ColumnReader

2016-01-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-472: - Component/s: parquet-cpp > Clean up InputStream ownership semantics in ColumnReader >

[jira] [Created] (PARQUET-472) Clean up InputStream ownership semantics in ColumnReader

2016-01-26 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-472: Summary: Clean up InputStream ownership semantics in ColumnReader Key: PARQUET-472 URL: https://issues.apache.org/jira/browse/PARQUET-472 Project: Parquet

[jira] [Created] (PARQUET-440) Error handling: C++ exceptions or Status

2016-01-19 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-440: Summary: Error handling: C++ exceptions or Status Key: PARQUET-440 URL: https://issues.apache.org/jira/browse/PARQUET-440 Project: Parquet Issue Type: New

[jira] [Commented] (PARQUET-459) Improve handling of null values

2016-01-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15113492#comment-15113492 ] Wes McKinney commented on PARQUET-459: -- Makes sense. Want to make you aware of

[jira] [Commented] (PARQUET-459) Improve handling of null values

2016-01-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15113509#comment-15113509 ] Wes McKinney commented on PARQUET-459: -- Related to PARQUET-435. A main issue here is that there are

[jira] [Created] (PARQUET-533) Simplify RandomAccessSource API to combine Seek/Read

2016-02-14 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-533: Summary: Simplify RandomAccessSource API to combine Seek/Read Key: PARQUET-533 URL: https://issues.apache.org/jira/browse/PARQUET-533 Project: Parquet

<    1   2   3   4   5   6   7   8   9   10   >