[jira] [Created] (PARQUET-2135) Performance optimizations: Merged all LittleEndianDataInputStream functionality into ByteBufferInputStream

2022-04-01 Thread Timothy Miller (Jira)
Timothy Miller created PARQUET-2135: --- Summary: Performance optimizations: Merged all LittleEndianDataInputStream functionality into ByteBufferInputStream Key: PARQUET-2135 URL:

[jira] [Commented] (PARQUET-2135) Performance optimizations: Merged all LittleEndianDataInputStream functionality into ByteBufferInputStream

2022-04-04 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17516909#comment-17516909 ] Timothy Miller commented on PARQUET-2135: - Extra note: The reason PlainValuesReader still

[jira] [Commented] (PARQUET-1681) Avro's isElementType() change breaks the reading of some parquet(1.8.1) files

2022-04-08 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17519683#comment-17519683 ] Timothy Miller commented on PARQUET-1681: - Is this related to PARQUET-2069? It looks like it

[jira] [Commented] (PARQUET-2133) Support Int8 and Int16 as basic type

2022-04-08 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17519690#comment-17519690 ] Timothy Miller commented on PARQUET-2133: - Have you started working on implementing this? What

[jira] [Commented] (PARQUET-2069) Parquet file containing arrays, written by Parquet-MR, cannot be read again by Parquet-MR

2022-04-07 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17519019#comment-17519019 ] Timothy Miller commented on PARQUET-2069: - Based on the fact that the option is named "old"

[jira] [Commented] (PARQUET-2126) Thread safety bug in CodecFactory

2022-04-07 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17518883#comment-17518883 ] Timothy Miller commented on PARQUET-2126: - This bug isn't affecting me. My employer has tasked

[jira] [Updated] (PARQUET-2069) Parquet file containing arrays, written by Parquet-MR, cannot be read again by Parquet-MR

2022-04-07 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Miller updated PARQUET-2069: Attachment: parquet-diff.png > Parquet file containing arrays, written by Parquet-MR,

[jira] [Comment Edited] (PARQUET-2069) Parquet file containing arrays, written by Parquet-MR, cannot be read again by Parquet-MR

2022-04-07 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17518968#comment-17518968 ] Timothy Miller edited comment on PARQUET-2069 at 4/7/22 3:53 PM: - An 

[jira] [Commented] (PARQUET-2069) Parquet file containing arrays, written by Parquet-MR, cannot be read again by Parquet-MR

2022-04-07 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17518968#comment-17518968 ] Timothy Miller commented on PARQUET-2069: - An initial look at this suggests that the writer is

[jira] [Comment Edited] (PARQUET-2069) Parquet file containing arrays, written by Parquet-MR, cannot be read again by Parquet-MR

2022-04-07 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17518968#comment-17518968 ] Timothy Miller edited comment on PARQUET-2069 at 4/7/22 3:50 PM: - An 

[jira] [Commented] (PARQUET-2069) Parquet file containing arrays, written by Parquet-MR, cannot be read again by Parquet-MR

2022-04-07 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17519231#comment-17519231 ] Timothy Miller commented on PARQUET-2069: - With the original file, the debug message says this

[jira] [Commented] (PARQUET-2069) Parquet file containing arrays, written by Parquet-MR, cannot be read again by Parquet-MR

2022-04-07 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17519218#comment-17519218 ] Timothy Miller commented on PARQUET-2069: - Here's a log message that shows why it's failing:

[jira] [Commented] (PARQUET-2126) Thread safety bug in CodecFactory

2022-04-06 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17518336#comment-17518336 ] Timothy Miller commented on PARQUET-2126: - Does the resolution of DRILL-8139 mean that

[jira] [Commented] (PARQUET-2069) Parquet file containing arrays, written by Parquet-MR, cannot be read again by Parquet-MR

2022-04-14 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522347#comment-17522347 ] Timothy Miller commented on PARQUET-2069: - This appears to occur due to the reader and writer

[jira] [Commented] (PARQUET-2069) Parquet file containing arrays, written by Parquet-MR, cannot be read again by Parquet-MR

2022-04-14 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522406#comment-17522406 ] Timothy Miller commented on PARQUET-2069: - I found a fix for the problem. This is going to look

[jira] [Commented] (PARQUET-1681) Avro's isElementType() change breaks the reading of some parquet(1.8.1) files

2022-04-21 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17525795#comment-17525795 ] Timothy Miller commented on PARQUET-1681: - Have a look at my further analysis of PARQUET-2069.

[jira] [Commented] (PARQUET-2126) Thread safety bug in CodecFactory

2022-04-22 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17526497#comment-17526497 ] Timothy Miller commented on PARQUET-2126: - Alright. You have a point. If the maintainers want

[jira] [Commented] (PARQUET-1928) Interpret Parquet INT96 type as FIXED[12] AVRO Schema

2022-04-22 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17526575#comment-17526575 ] Timothy Miller commented on PARQUET-1928: - It looks like the change was already merged.

[jira] [Commented] (PARQUET-1928) Interpret Parquet INT96 type as FIXED[12] AVRO Schema

2022-04-22 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17526573#comment-17526573 ] Timothy Miller commented on PARQUET-1928: - Is there a reason why patches such as this are not

[jira] [Commented] (PARQUET-2125) ParquetFileReader has a currentBlock information in a private field

2022-04-22 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17526698#comment-17526698 ] Timothy Miller commented on PARQUET-2125: - Precisely how meaningful is it to provide this

[jira] [Commented] (PARQUET-2098) Add more methods into interface of BlockCipher

2022-04-22 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17526699#comment-17526699 ] Timothy Miller commented on PARQUET-2098: - [~gershinsky] Did you ever get around to this? If

[jira] [Commented] (PARQUET-2140) parquet-cli unable to read UUID values

2022-04-22 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17526736#comment-17526736 ] Timothy Miller commented on PARQUET-2140: - I'll look at this on Monday. Have you tried writing

[jira] [Created] (PARQUET-2139) Bogus file offset for ColumnMetaData written to ColumnChunk metadata of single parquet files

2022-04-20 Thread Timothy Miller (Jira)
Timothy Miller created PARQUET-2139: --- Summary: Bogus file offset for ColumnMetaData written to ColumnChunk metadata of single parquet files Key: PARQUET-2139 URL:

[jira] [Commented] (PARQUET-2139) Bogus file offset for ColumnMetaData written to ColumnChunk metadata of single parquet files

2022-04-20 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17525229#comment-17525229 ] Timothy Miller commented on PARQUET-2139: - Of course, I'll be embarrassed if this turns out to

[jira] [Comment Edited] (PARQUET-2139) Bogus file offset for ColumnMetaData written to ColumnChunk metadata of single parquet files

2022-04-20 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17525272#comment-17525272 ] Timothy Miller edited comment on PARQUET-2139 at 4/20/22 8:37 PM: -- I

[jira] [Commented] (PARQUET-2139) Bogus file offset for ColumnMetaData written to ColumnChunk metadata of single parquet files

2022-04-20 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17525264#comment-17525264 ] Timothy Miller commented on PARQUET-2139: - I've noticed a few places that could be at fault

[jira] [Commented] (PARQUET-2139) Bogus file offset for ColumnMetaData written to ColumnChunk metadata of single parquet files

2022-04-20 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17525272#comment-17525272 ] Timothy Miller commented on PARQUET-2139: - I just noticed that the file_offset field in

[jira] [Commented] (PARQUET-2140) parquet-cli unable to read UUID values

2022-04-26 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17528263#comment-17528263 ] Timothy Miller commented on PARQUET-2140: - Never mind on that PR. It breaks other things. If

[jira] [Commented] (PARQUET-2140) parquet-cli unable to read UUID values

2022-04-26 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17528162#comment-17528162 ] Timothy Miller commented on PARQUET-2140: - I'm going to get back onto this today, so I'll

[jira] [Comment Edited] (PARQUET-2142) parquet-cli without hadoop throws java.lang.NoSuchMethodError on any parquet file access command

2022-04-26 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527556#comment-17527556 ] Timothy Miller edited comment on PARQUET-2142 at 4/26/22 1:19 PM: -- I

[jira] [Commented] (PARQUET-2098) Add more methods into interface of BlockCipher

2022-04-26 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17528160#comment-17528160 ] Timothy Miller commented on PARQUET-2098: - I don't personally have a use case. I was just

[jira] [Commented] (PARQUET-2104) parquet-cli broken in master

2022-04-26 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17528167#comment-17528167 ] Timothy Miller commented on PARQUET-2104: - As I mentioned in

[jira] [Commented] (PARQUET-2140) parquet-cli unable to read UUID values

2022-04-26 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17528175#comment-17528175 ] Timothy Miller commented on PARQUET-2140: - Here's my minimal parquet reader that I've been

[jira] [Commented] (PARQUET-2140) parquet-cli unable to read UUID values

2022-04-26 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17528248#comment-17528248 ] Timothy Miller commented on PARQUET-2140: - I'm still working on this, but the problem appears

[jira] [Commented] (PARQUET-2143) parquet-cli with hadoop throws java.lang.RuntimeException on any parquet file access

2022-04-25 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527558#comment-17527558 ] Timothy Miller commented on PARQUET-2143: - I didn't notice this earlier, but the 

[jira] [Commented] (PARQUET-2140) parquet-cli unable to read UUID values

2022-04-25 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527559#comment-17527559 ] Timothy Miller commented on PARQUET-2140: - I just realized that I managed to duplicate this in

[jira] [Comment Edited] (PARQUET-2140) parquet-cli unable to read UUID values

2022-04-25 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527559#comment-17527559 ] Timothy Miller edited comment on PARQUET-2140 at 4/25/22 3:50 PM: -- I

[jira] [Commented] (PARQUET-2142) parquet-cli without hadoop throws java.lang.NoSuchMethodError on any parquet file access command

2022-04-25 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527556#comment-17527556 ] Timothy Miller commented on PARQUET-2142: - I added -verbose:class to the java command line, and

[jira] [Commented] (PARQUET-2140) parquet-cli unable to read UUID values

2022-04-25 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527592#comment-17527592 ] Timothy Miller commented on PARQUET-2140: - The error is caused in

[jira] [Updated] (PARQUET-2143) parquet-cli with hadoop throws java.lang.RuntimeException on any parquet file access

2022-04-25 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Miller updated PARQUET-2143: Summary: parquet-cli with hadoop throws java.lang.RuntimeException on any parquet file

[jira] [Commented] (PARQUET-2140) parquet-cli unable to read UUID values

2022-04-25 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527480#comment-17527480 ] Timothy Miller commented on PARQUET-2140: - I can't reproduce this bug with parquet-tools or

[jira] [Updated] (PARQUET-2142) parquet-cli throws java.lang.NoSuchMethodError on any parquet file access command

2022-04-25 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Miller updated PARQUET-2142: Priority: Blocker (was: Major) > parquet-cli throws java.lang.NoSuchMethodError on any

[jira] [Created] (PARQUET-2143) parquet-cli with hadoop throws java.lang.RuntimeException

2022-04-25 Thread Timothy Miller (Jira)
Timothy Miller created PARQUET-2143: --- Summary: parquet-cli with hadoop throws java.lang.RuntimeException Key: PARQUET-2143 URL: https://issues.apache.org/jira/browse/PARQUET-2143 Project: Parquet

[jira] [Commented] (PARQUET-2140) parquet-cli unable to read UUID values

2022-04-25 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527519#comment-17527519 ] Timothy Miller commented on PARQUET-2140: - I've been trying to reproduce this with parquet-cli,

[jira] [Updated] (PARQUET-2142) parquet-cli throws java.lang.NoSuchMethodError on any parquet file access command

2022-04-25 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Miller updated PARQUET-2142: Description: I can't do even basic things with parquet-cli from 1.13.0-SNAPSHOT. Steps

[jira] (PARQUET-2142) parquet-cli throws java.lang.NoSuchMethodError on any parquet file access command

2022-04-25 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2142 ] Timothy Miller deleted comment on PARQUET-2142: - was (Author: JIRAUSER287471): Jira swallowed the asterisks from the "java -cp" command, but you can see them in the README.md file.

[jira] [Updated] (PARQUET-2142) parquet-cli without hadoop throws java.lang.NoSuchMethodError on any parquet file access command

2022-04-25 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Miller updated PARQUET-2142: Summary: parquet-cli without hadoop throws java.lang.NoSuchMethodError on any parquet

[jira] [Created] (PARQUET-2142) parquet-cli throws

2022-04-25 Thread Timothy Miller (Jira)
Timothy Miller created PARQUET-2142: --- Summary: parquet-cli throws Key: PARQUET-2142 URL: https://issues.apache.org/jira/browse/PARQUET-2142 Project: Parquet Issue Type: Bug

[jira] [Updated] (PARQUET-2142) parquet-cli throws java.lang.NoSuchMethodError on any parquet file access command

2022-04-25 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Miller updated PARQUET-2142: Component/s: parquet-cli Affects Version/s: 1.13.0 Description: I

[jira] [Commented] (PARQUET-2142) parquet-cli throws java.lang.NoSuchMethodError on any parquet file access command

2022-04-25 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527511#comment-17527511 ] Timothy Miller commented on PARQUET-2142: - Jira swallowed the asterisks from the "java -cp"

[jira] [Comment Edited] (PARQUET-2142) parquet-cli throws java.lang.NoSuchMethodError on any parquet file access command

2022-04-25 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527511#comment-17527511 ] Timothy Miller edited comment on PARQUET-2142 at 4/25/22 1:55 PM: --

[jira] [Commented] (PARQUET-2140) parquet-cli unable to read UUID values

2022-04-26 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17528393#comment-17528393 ] Timothy Miller commented on PARQUET-2140: - I got this information from a combination of

[jira] [Commented] (PARQUET-2140) parquet-cli unable to read UUID values

2022-05-20 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17540138#comment-17540138 ] Timothy Miller commented on PARQUET-2140: - There's a slight chance that

[jira] [Commented] (PARQUET-2143) parquet-cli with hadoop throws java.lang.RuntimeException on any parquet file access

2022-05-20 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17540143#comment-17540143 ] Timothy Miller commented on PARQUET-2143: - There's a slight chance that

[jira] [Commented] (PARQUET-2143) parquet-cli with hadoop throws java.lang.RuntimeException on any parquet file access

2022-05-27 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17542998#comment-17542998 ] Timothy Miller commented on PARQUET-2143: - I tested PR #957 on this, and it doesn't fix the

[jira] [Created] (PARQUET-2147) Can't run ParquetMR test in IDEs

2022-05-13 Thread Timothy Miller (Jira)
Timothy Miller created PARQUET-2147: --- Summary: Can't run ParquetMR test in IDEs Key: PARQUET-2147 URL: https://issues.apache.org/jira/browse/PARQUET-2147 Project: Parquet Issue Type: Bug

[jira] [Updated] (PARQUET-2147) Can't run ParquetMR tests in IDEs

2022-05-13 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Miller updated PARQUET-2147: Summary: Can't run ParquetMR tests in IDEs (was: Can't run ParquetMR test in IDEs) >

[jira] [Comment Edited] (PARQUET-2069) Parquet file containing arrays, written by Parquet-MR, cannot be read again by Parquet-MR

2022-05-13 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17536888#comment-17536888 ] Timothy Miller edited comment on PARQUET-2069 at 5/13/22 9:02 PM: -- I

[jira] [Commented] (PARQUET-2069) Parquet file containing arrays, written by Parquet-MR, cannot be read again by Parquet-MR

2022-05-13 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17536888#comment-17536888 ] Timothy Miller commented on PARQUET-2069: - I managed to probe this just a bit. No idea why this

[jira] [Commented] (PARQUET-2069) Parquet file containing arrays, written by Parquet-MR, cannot be read again by Parquet-MR

2022-05-13 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17536902#comment-17536902 ] Timothy Miller commented on PARQUET-2069: - Yup. If I force prepareForRead() to ignore the avro

[jira] [Commented] (PARQUET-2069) Parquet file containing arrays, written by Parquet-MR, cannot be read again by Parquet-MR

2022-05-13 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17536894#comment-17536894 ] Timothy Miller commented on PARQUET-2069: - So, where does the avro schema come from in the

[jira] [Commented] (PARQUET-2069) Parquet file containing arrays, written by Parquet-MR, cannot be read again by Parquet-MR

2022-05-16 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17537604#comment-17537604 ] Timothy Miller commented on PARQUET-2069: - Well, I tried modifying prepareForRead to just

[jira] [Commented] (PARQUET-2159) Parquet bit-packing de/encode optimization

2022-06-15 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17554652#comment-17554652 ] Timothy Miller commented on PARQUET-2159: - Could you add a link to the PR, please? > Parquet

[jira] [Commented] (PARQUET-2159) Parquet bit-packing de/encode optimization

2022-06-16 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555099#comment-17555099 ] Timothy Miller commented on PARQUET-2159: - I frequently wish Java had a preprocessor like C++

[jira] [Commented] (PARQUET-2159) Parquet bit-packing de/encode optimization

2022-06-16 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555142#comment-17555142 ] Timothy Miller commented on PARQUET-2159: - If this is already being generated at runtime, then

[jira] [Commented] (PARQUET-2153) Cannot read schema from parquet file

2022-06-13 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17553635#comment-17553635 ] Timothy Miller commented on PARQUET-2153: - Is this related to anything fixed by

[jira] [Commented] (PARQUET-2126) Thread safety bug in CodecFactory

2022-08-01 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17573826#comment-17573826 ] Timothy Miller commented on PARQUET-2126: - Thanks, Steve. I really like your suggestion. I

[jira] [Commented] (PARQUET-2181) parquet-cli fails at supporting parquet-protobuf generated files

2022-09-07 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17601415#comment-17601415 ] Timothy Miller commented on PARQUET-2181: - Is this related to PARQUET-2069? Or maybe

[jira] [Commented] (PARQUET-2171) Implement vectored IO in parquet file format

2022-08-11 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17578470#comment-17578470 ] Timothy Miller commented on PARQUET-2171: - This might synergize well with the bulk I/O features

[jira] [Commented] (PARQUET-2171) Implement vectored IO in parquet file format

2022-08-15 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17579770#comment-17579770 ] Timothy Miller commented on PARQUET-2171: - The parquet reader has two phases of reading. One