[GitHub] [parquet-mr] steveloughran commented on pull request #970: PARQUET-2150: parquet-protobuf to compile on Mac M1

2022-07-19 Thread GitBox
steveloughran commented on PR #970: URL: https://github.com/apache/parquet-mr/pull/970#issuecomment-1189454282 resolved by #973 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [parquet-mr] steveloughran closed pull request #970: PARQUET-2150: parquet-protobuf to compile on Mac M1

2022-07-19 Thread GitBox
steveloughran closed pull request #970: PARQUET-2150: parquet-protobuf to compile on Mac M1 URL: https://github.com/apache/parquet-mr/pull/970 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

Parquet read performance question

2022-07-19 Thread Sol Lederman
Hi, I've created a parquet file with 205 million records that I'm hoping to query quickly. Details of my struggle are here: https://stackoverflow.com/questions/72984734/large-parquet-file-really-slow-to-query I'd appreciate any help, or even next steps. Thanks! Sol

[jira] [Commented] (PARQUET-2150) parquet-protobuf to compile on mac M1

2022-07-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17568692#comment-17568692 ] ASF GitHub Bot commented on PARQUET-2150: - steveloughran commented on PR #970: URL:

[jira] [Commented] (PARQUET-2150) parquet-protobuf to compile on mac M1

2022-07-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17568693#comment-17568693 ] ASF GitHub Bot commented on PARQUET-2150: - steveloughran closed pull request #970:

[jira] [Resolved] (PARQUET-2150) parquet-protobuf to compile on mac M1

2022-07-19 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran resolved PARQUET-2150. - Resolution: Not A Problem with PARQUET-2155 this problem is implicitly fixed. >

[GitHub] [parquet-site] vinooganesh opened a new pull request, #28: Add instructions about how to subscribe to dev list

2022-07-19 Thread GitBox
vinooganesh opened a new pull request, #28: URL: https://github.com/apache/parquet-site/pull/28 Added instructions on how to subscribe. @shangxinli -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

Re: join mailing list

2022-07-19 Thread Aaron Niskode-Dossett
Hi Sol, Welcome! You can send an email to "dev-subscr...@parquet.apache.org" to start that process. We should probably add a note about that here: https://parquet.apache.org/community/ I don't see the explicit instructions in the community section. Best, Aaron On Mon, Jul 18, 2022 at 6:06 PM

Re: Is there a parquet users list?

2022-07-19 Thread Vinoo Ganesh
Hi Sol, There isn't a users list for parquet. You can ask questions here on the dev list. I'll update the website to make this more clear. Thanks, Vinoo Ganesh | vinoo.gan...@gmail.com On Tue, Jul 19, 2022 at 12:10 PM Sol Lederman wrote: > Sorry to spam the dev list. I can't find a

Is there a parquet users list?

2022-07-19 Thread Sol Lederman
Sorry to spam the dev list. I can't find a parquet users list and I got no response to my parquet issue on Stack Overflow. Is there a good place to ask user questions? I've spent several hours googling and reading things on the Web. Thanks. Sol

Re: join mailing list

2022-07-19 Thread Vinoo Ganesh
Good point, Aaron. I'll add this onto the website. Thanks, Vinoo Ganesh | vinoo.gan...@gmail.com On Tue, Jul 19, 2022 at 8:58 AM Aaron Niskode-Dossett wrote: > Hi Sol, > > Welcome! You can send an email to "dev-subscr...@parquet.apache.org" to > start that process. > > We should probably

[jira] [Commented] (PARQUET-2126) Thread safety bug in CodecFactory

2022-07-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17568612#comment-17568612 ] ASF GitHub Bot commented on PARQUET-2126: - shangxinli commented on PR #959: URL:

[GitHub] [parquet-mr] ggershinsky merged pull request #973: PARQUET-2155: Upgrade protobuf version to 3.17.3

2022-07-19 Thread GitBox
ggershinsky merged PR #973: URL: https://github.com/apache/parquet-mr/pull/973 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[jira] [Commented] (PARQUET-2158) Upgrade Hadoop dependency to version 3.2.0

2022-07-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17568608#comment-17568608 ] ASF GitHub Bot commented on PARQUET-2158: - shangxinli merged PR #976: URL:

[GitHub] [parquet-mr] shangxinli merged pull request #976: PARQUET-2158: Upgrade Hadoop dependency to version 3.2.0

2022-07-19 Thread GitBox
shangxinli merged PR #976: URL: https://github.com/apache/parquet-mr/pull/976 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [parquet-mr] shangxinli commented on pull request #959: PARQUET-2126: Make cached (de)compressors thread-safe

2022-07-19 Thread GitBox
shangxinli commented on PR #959: URL: https://github.com/apache/parquet-mr/pull/959#issuecomment-1189198163 @theosib-amazon Do you still have time for addressing the feedback? I think we are very close to merge. -- This is an automated message from the Apache Git Service. To respond to

[GitHub] [parquet-mr] shangxinli commented on pull request #973: PARQUET-2155: Upgrade protobuf version to 3.17.3

2022-07-19 Thread GitBox
shangxinli commented on PR #973: URL: https://github.com/apache/parquet-mr/pull/973#issuecomment-1189167965 LGTM -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

[jira] [Commented] (PARQUET-2155) Upgrade protobuf version to 3.20.1

2022-07-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17568599#comment-17568599 ] ASF GitHub Bot commented on PARQUET-2155: - shangxinli commented on PR #973: URL:

[jira] [Commented] (PARQUET-2134) Incorrect type checking in HadoopStreams.wrap

2022-07-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17568609#comment-17568609 ] ASF GitHub Bot commented on PARQUET-2134: - shangxinli commented on code in PR #971: URL:

[GitHub] [parquet-mr] shangxinli commented on a diff in pull request #971: PARQUET-2134: Improve binding to ByteBufferReadable

2022-07-19 Thread GitBox
shangxinli commented on code in PR #971: URL: https://github.com/apache/parquet-mr/pull/971#discussion_r924651838 ## parquet-hadoop/src/main/java/org/apache/parquet/hadoop/util/HadoopStreams.java: ## @@ -50,51 +46,45 @@ public class HadoopStreams { */ public static

[jira] [Commented] (PARQUET-2155) Upgrade protobuf version to 3.20.1

2022-07-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17568617#comment-17568617 ] ASF GitHub Bot commented on PARQUET-2155: - ggershinsky merged PR #973: URL: