[ https://issues.apache.org/jira/browse/PARQUET-1189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ryan Blue updated PARQUET-1189:
---
Issue Type: Task (was: Bug)
> Release Parquet Java 1.10
> -
>
>
[ https://issues.apache.org/jira/browse/PARQUET-1169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329326#comment-16329326 ]
Jian Fang commented on PARQUET-1169:
[~xhochy] I updated our parquet-cpp and arrow with master
[ https://issues.apache.org/jira/browse/PARQUET-1193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329261#comment-16329261 ]
ASF GitHub Bot commented on PARQUET-1193:
-
xhochy commented on a change in pull request #430:
This work would only involve the Arrow interface in src/parquet/arrow
(converting from Arrow representation to repetition/definition level
encoding, and back), so you wouldn't need to master the whole Parquet
codebase, at least. I'd like to help with this work, but realistically
I won't have
[ https://issues.apache.org/jira/browse/PARQUET-1084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328893#comment-16328893 ]
Wes McKinney commented on PARQUET-1084:
---
I see. I would say in that file we should indicate that
Optimizing compression ratios is one issue, optimizing page granularity for
column indexes is another, and a third issue is that there is per-page
metadata in the Parquet footer in Thrift format that has to be interpreted
before anything in the file can be accessed. Too many pages could slow down
I also have a use-case that requires lists-of-structs and encountered that
limitation in pyarrow. Just one level deep would enable a lot of HEP data.
I've worked out the logic of converting Parquet definition and repetition
levels into Arrow-style arrays:
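As a rough illustration of the kind of logic described above (not the commenter's own code, which is not included here), the following Python sketch rebuilds Arrow-style list offsets, a validity list, and child values from definition and repetition levels. It assumes a single optional list column with optional int elements in the standard three-level layout, so the maximum definition level is 3 and the maximum repetition level is 1. The function name and the level meanings spelled out in the comments are assumptions made for this example.

```python
def levels_to_list_array(def_levels, rep_levels, values):
    """Convert Parquet levels into (offsets, validity, child) for one list column.

    Assumed level meanings (optional list of optional ints):
      def 0 = list is null, 1 = list is empty,
      def 2 = element is null, 3 = element is present
      rep 0 = first entry of a new row, 1 = continuation of the current list
    `values` holds only the non-null leaf values, in order.
    """
    offsets = []    # start offset of each row; the final end offset is appended last
    validity = []   # one flag per row: False means the list itself is null
    child = []      # flattened elements, with None for null elements
    value_iter = iter(values)

    for d, r in zip(def_levels, rep_levels):
        if r == 0:
            # A repetition level of 0 opens a new row.
            offsets.append(len(child))
            validity.append(d >= 1)
        if d == 3:
            child.append(next(value_iter))
        elif d == 2:
            child.append(None)
        # d == 0 or d == 1 contribute no child element.

    offsets.append(len(child))
    return offsets, validity, child


# Rows: [[1, 2], None, [], [None, 3]]
offsets, validity, child = levels_to_list_array(
    def_levels=[3, 3, 0, 1, 2, 3],
    rep_levels=[0, 1, 0, 0, 0, 1],
    values=[1, 2, 3],
)
print(offsets, validity, child)
```

A list of structs would need the same offset and validity bookkeeping applied once per child field, which this single-column sketch does not attempt.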
Uwe L. Korn created PARQUET-1196:
Summary: [C++] Provide a parquet_arrow example project incl. CMake setup
Key: PARQUET-1196
URL: https://issues.apache.org/jira/browse/PARQUET-1196
Project: Parquet
Hello dear Parquet developers,
I am using your file format and enjoy it a lot!
One feature I find missing is allowing a 'column.mappings' option in the
SERDEPROPERTIES clause similar to org.openx.data.jsonserde.JsonSerDe.
I think this is a very desirable feature among many analysts/developers
[ https://issues.apache.org/jira/browse/PARQUET-1193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328631#comment-16328631 ]
ASF GitHub Bot commented on PARQUET-1193:
-
majetideepak opened a new pull request #430:
[ https://issues.apache.org/jira/browse/PARQUET-1084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328489#comment-16328489 ]
Jakob Blomer commented on PARQUET-1084:
---
That's very interesting, many thanks to all of you for