Timothy Miller created PARQUET-2135:
---
Summary: Performance optimizations: Merged all
LittleEndianDataInputStream functionality into ByteBufferInputStream
Key: PARQUET-2135
URL:
[
https://issues.apache.org/jira/browse/PARQUET-2135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17516909#comment-17516909
]
Timothy Miller commented on PARQUET-2135:
-
Extra note:
The reason PlainValuesReader still
[
https://issues.apache.org/jira/browse/PARQUET-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17519683#comment-17519683
]
Timothy Miller commented on PARQUET-1681:
-
Is this related to PARQUET-2069? It looks like it
[
https://issues.apache.org/jira/browse/PARQUET-2133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17519690#comment-17519690
]
Timothy Miller commented on PARQUET-2133:
-
Have you started working on implementing this? What
[
https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17519019#comment-17519019
]
Timothy Miller commented on PARQUET-2069:
-
Based on the fact that the option is named "old"
[
https://issues.apache.org/jira/browse/PARQUET-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17518883#comment-17518883
]
Timothy Miller commented on PARQUET-2126:
-
This bug isn't affecting me. My employer has tasked
[
https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Timothy Miller updated PARQUET-2069:
Attachment: parquet-diff.png
> Parquet file containing arrays, written by Parquet-MR,
[
https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17518968#comment-17518968
]
Timothy Miller edited comment on PARQUET-2069 at 4/7/22 3:53 PM:
-
An
[
https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17518968#comment-17518968
]
Timothy Miller commented on PARQUET-2069:
-
An initial look at this suggests that the writer is
[
https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17518968#comment-17518968
]
Timothy Miller edited comment on PARQUET-2069 at 4/7/22 3:50 PM:
-
An
[
https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17519231#comment-17519231
]
Timothy Miller commented on PARQUET-2069:
-
With the original file, the debug message says this
[
https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17519218#comment-17519218
]
Timothy Miller commented on PARQUET-2069:
-
Here's a log message that shows why it's failing:
[
https://issues.apache.org/jira/browse/PARQUET-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17518336#comment-17518336
]
Timothy Miller commented on PARQUET-2126:
-
Does the resolution of DRILL-8139 mean that
[
https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522347#comment-17522347
]
Timothy Miller commented on PARQUET-2069:
-
This appears to occur due to the reader and writer
[
https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522406#comment-17522406
]
Timothy Miller commented on PARQUET-2069:
-
I found a fix for the problem. This is going to look
[
https://issues.apache.org/jira/browse/PARQUET-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17525795#comment-17525795
]
Timothy Miller commented on PARQUET-1681:
-
Have a look at my further analysis of PARQUET-2069.
[
https://issues.apache.org/jira/browse/PARQUET-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17526497#comment-17526497
]
Timothy Miller commented on PARQUET-2126:
-
Alright. You have a point. If the maintainers want
[
https://issues.apache.org/jira/browse/PARQUET-1928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17526575#comment-17526575
]
Timothy Miller commented on PARQUET-1928:
-
It looks like the change was already merged.
[
https://issues.apache.org/jira/browse/PARQUET-1928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17526573#comment-17526573
]
Timothy Miller commented on PARQUET-1928:
-
Is there a reason why patches such as this are not
[
https://issues.apache.org/jira/browse/PARQUET-2125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17526698#comment-17526698
]
Timothy Miller commented on PARQUET-2125:
-
Precisely how meaningful is it to provide this
[
https://issues.apache.org/jira/browse/PARQUET-2098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17526699#comment-17526699
]
Timothy Miller commented on PARQUET-2098:
-
[~gershinsky] Did you ever get around to this? If
[
https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17526736#comment-17526736
]
Timothy Miller commented on PARQUET-2140:
-
I'll look at this on Monday. Have you tried writing
Timothy Miller created PARQUET-2139:
---
Summary: Bogus file offset for ColumnMetaData written to
ColumnChunk metadata of single parquet files
Key: PARQUET-2139
URL:
[
https://issues.apache.org/jira/browse/PARQUET-2139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17525229#comment-17525229
]
Timothy Miller commented on PARQUET-2139:
-
Of course, I'll be embarrassed if this turns out to
[
https://issues.apache.org/jira/browse/PARQUET-2139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17525272#comment-17525272
]
Timothy Miller edited comment on PARQUET-2139 at 4/20/22 8:37 PM:
--
I
[
https://issues.apache.org/jira/browse/PARQUET-2139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17525264#comment-17525264
]
Timothy Miller commented on PARQUET-2139:
-
I've noticed a few places that could be at fault
[
https://issues.apache.org/jira/browse/PARQUET-2139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17525272#comment-17525272
]
Timothy Miller commented on PARQUET-2139:
-
I just noticed that the file_offset field in
[
https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17528263#comment-17528263
]
Timothy Miller commented on PARQUET-2140:
-
Never mind on that PR. It breaks other things. If
[
https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17528162#comment-17528162
]
Timothy Miller commented on PARQUET-2140:
-
I'm going to get back onto this today, so I'll
[
https://issues.apache.org/jira/browse/PARQUET-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527556#comment-17527556
]
Timothy Miller edited comment on PARQUET-2142 at 4/26/22 1:19 PM:
--
I
[
https://issues.apache.org/jira/browse/PARQUET-2098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17528160#comment-17528160
]
Timothy Miller commented on PARQUET-2098:
-
I don't personally have a use case. I was just
[
https://issues.apache.org/jira/browse/PARQUET-2104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17528167#comment-17528167
]
Timothy Miller commented on PARQUET-2104:
-
As I mentioned in
[
https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17528175#comment-17528175
]
Timothy Miller commented on PARQUET-2140:
-
Here's my minimal parquet reader that I've been
[
https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17528248#comment-17528248
]
Timothy Miller commented on PARQUET-2140:
-
I'm still working on this, but the problem appears
[
https://issues.apache.org/jira/browse/PARQUET-2143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527558#comment-17527558
]
Timothy Miller commented on PARQUET-2143:
-
I didn't notice this earlier, but the
[
https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527559#comment-17527559
]
Timothy Miller commented on PARQUET-2140:
-
I just realized that I managed to duplicate this in
[
https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527559#comment-17527559
]
Timothy Miller edited comment on PARQUET-2140 at 4/25/22 3:50 PM:
--
I
[
https://issues.apache.org/jira/browse/PARQUET-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527556#comment-17527556
]
Timothy Miller commented on PARQUET-2142:
-
I added -verbose:class to the java command line, and
[
https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527592#comment-17527592
]
Timothy Miller commented on PARQUET-2140:
-
The error is caused in
[
https://issues.apache.org/jira/browse/PARQUET-2143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Timothy Miller updated PARQUET-2143:
Summary: parquet-cli with hadoop throws java.lang.RuntimeException on any
parquet file
[
https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527480#comment-17527480
]
Timothy Miller commented on PARQUET-2140:
-
I can't reproduce this bug with parquet-tools or
[
https://issues.apache.org/jira/browse/PARQUET-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Timothy Miller updated PARQUET-2142:
Priority: Blocker (was: Major)
> parquet-cli throws java.lang.NoSuchMethodError on any
Timothy Miller created PARQUET-2143:
---
Summary: parquet-cli with hadoop throws java.lang.RuntimeException
Key: PARQUET-2143
URL: https://issues.apache.org/jira/browse/PARQUET-2143
Project: Parquet
[
https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527519#comment-17527519
]
Timothy Miller commented on PARQUET-2140:
-
I've been trying to reproduce this with parquet-cli,
[
https://issues.apache.org/jira/browse/PARQUET-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Timothy Miller updated PARQUET-2142:
Description:
I can't do even basic things with parquet-cli from 1.13.0-SNAPSHOT.
Steps
[ https://issues.apache.org/jira/browse/PARQUET-2142 ]
Timothy Miller deleted comment on PARQUET-2142:
-
was (Author: JIRAUSER287471):
Jira swallowed the asterisks from the "java -cp" command, but you can see them
in the README.md file.
[
https://issues.apache.org/jira/browse/PARQUET-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Timothy Miller updated PARQUET-2142:
Summary: parquet-cli without hadoop throws java.lang.NoSuchMethodError on
any parquet
Timothy Miller created PARQUET-2142:
---
Summary: parquet-cli throws
Key: PARQUET-2142
URL: https://issues.apache.org/jira/browse/PARQUET-2142
Project: Parquet
Issue Type: Bug
[
https://issues.apache.org/jira/browse/PARQUET-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Timothy Miller updated PARQUET-2142:
Component/s: parquet-cli
Affects Version/s: 1.13.0
Description:
I
[
https://issues.apache.org/jira/browse/PARQUET-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527511#comment-17527511
]
Timothy Miller commented on PARQUET-2142:
-
Jira swallowed the asterisks from the "java -cp"
[
https://issues.apache.org/jira/browse/PARQUET-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527511#comment-17527511
]
Timothy Miller edited comment on PARQUET-2142 at 4/25/22 1:55 PM:
--
[
https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17528393#comment-17528393
]
Timothy Miller commented on PARQUET-2140:
-
I got this information from a combination of
[
https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17540138#comment-17540138
]
Timothy Miller commented on PARQUET-2140:
-
There's a slight chance that
[
https://issues.apache.org/jira/browse/PARQUET-2143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17540143#comment-17540143
]
Timothy Miller commented on PARQUET-2143:
-
There's a slight chance that
[
https://issues.apache.org/jira/browse/PARQUET-2143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17542998#comment-17542998
]
Timothy Miller commented on PARQUET-2143:
-
I tested PR #957 on this, and it doesn't fix the
Timothy Miller created PARQUET-2147:
---
Summary: Can't run ParquetMR test in IDEs
Key: PARQUET-2147
URL: https://issues.apache.org/jira/browse/PARQUET-2147
Project: Parquet
Issue Type: Bug
[
https://issues.apache.org/jira/browse/PARQUET-2147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Timothy Miller updated PARQUET-2147:
Summary: Can't run ParquetMR tests in IDEs (was: Can't run ParquetMR test
in IDEs)
>
[
https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17536888#comment-17536888
]
Timothy Miller edited comment on PARQUET-2069 at 5/13/22 9:02 PM:
--
I
[
https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17536888#comment-17536888
]
Timothy Miller commented on PARQUET-2069:
-
I managed to probe this just a bit. No idea why this
[
https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17536902#comment-17536902
]
Timothy Miller commented on PARQUET-2069:
-
Yup. If I force prepareForRead() to ignore the avro
[
https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17536894#comment-17536894
]
Timothy Miller commented on PARQUET-2069:
-
So, where does the avro schema come from in the
[
https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17537604#comment-17537604
]
Timothy Miller commented on PARQUET-2069:
-
Well, I tried modifying prepareForRead to just
[
https://issues.apache.org/jira/browse/PARQUET-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17554652#comment-17554652
]
Timothy Miller commented on PARQUET-2159:
-
Could you add a link to the PR, please?
> Parquet
[
https://issues.apache.org/jira/browse/PARQUET-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555099#comment-17555099
]
Timothy Miller commented on PARQUET-2159:
-
I frequently wish Java had a preprocessor like C++
[
https://issues.apache.org/jira/browse/PARQUET-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555142#comment-17555142
]
Timothy Miller commented on PARQUET-2159:
-
If this is already being generated at runtime, then
[
https://issues.apache.org/jira/browse/PARQUET-2153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17553635#comment-17553635
]
Timothy Miller commented on PARQUET-2153:
-
Is this related to anything fixed by
[
https://issues.apache.org/jira/browse/PARQUET-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17573826#comment-17573826
]
Timothy Miller commented on PARQUET-2126:
-
Thanks, Steve. I really like your suggestion. I
[
https://issues.apache.org/jira/browse/PARQUET-2181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17601415#comment-17601415
]
Timothy Miller commented on PARQUET-2181:
-
Is this related to PARQUET-2069? Or maybe
[
https://issues.apache.org/jira/browse/PARQUET-2171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17578470#comment-17578470
]
Timothy Miller commented on PARQUET-2171:
-
This might synergize well with the bulk I/O features
[
https://issues.apache.org/jira/browse/PARQUET-2171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17579770#comment-17579770
]
Timothy Miller commented on PARQUET-2171:
-
The parquet reader has two phases of reading. One