flykobe cheng created PARQUET-460:
-
Summary: Parquet files concat tool
Key: PARQUET-460
URL: https://issues.apache.org/jira/browse/PARQUET-460
Project: Parquet
Issue Type: Improvement
Hi everyone,
In parquet.thrift the definition of struct ColumnMetaData
1. The field "path_in_schema" is a string list, should not there be only
one path in the schema for a specified column? And in parquet-hadoop the
corresponding class "ColumnChunkMetaData" there is the field
[
https://issues.apache.org/jira/browse/PARQUET-459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15113835#comment-15113835
]
Wes McKinney commented on PARQUET-459:
--
Do you have a patch for PARQUET-428 somewhere?
Re:
[
https://issues.apache.org/jira/browse/PARQUET-459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15113835#comment-15113835
]
Wes McKinney edited comment on PARQUET-459 at 1/23/16 4:50 PM:
---
Do you have
[
https://issues.apache.org/jira/browse/PARQUET-459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15114022#comment-15114022
]
Wes McKinney commented on PARQUET-459:
--
The value decoders are already internally buffering arrays
Inline.
On Sat, Jan 23, 2016 at 8:48 AM, Tenghuan He wrote:
> Hi everyone,
>
> In parquet.thrift the definition of struct ColumnMetaData
>
> 1. The field "path_in_schema" is a string list, should not there be only
> one path in the schema for a specified
[
https://issues.apache.org/jira/browse/PARQUET-453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15114000#comment-15114000
]
Wes McKinney commented on PARQUET-453:
--
This is done as part of
[
https://issues.apache.org/jira/browse/PARQUET-451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15113999#comment-15113999
]
Wes McKinney commented on PARQUET-451:
--
This is done in
I expect this to be difficult. This is roughly 3 orders of magnitude more
than even a typical wide table use case.
Answers inline.
On Thu, Jan 21, 2016 at 2:10 PM, Krishna wrote:
> We are considering using Parquet for storing a matrix that is dense and
> very, very wide
[
https://issues.apache.org/jira/browse/PARQUET-459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15114012#comment-15114012
]
Deepak Majeti commented on PARQUET-459:
---
[~wesmckinn] I made a pull request for PARQUET-428 here
10 matches