[GitHub] [parquet-mr] ggershinsky commented on pull request #950: PARQUET-2006: Column resolution by ID

2022-03-22 Thread GitBox
ggershinsky commented on pull request #950: URL: https://github.com/apache/parquet-mr/pull/950#issuecomment-1075923688 I'll join too. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [parquet-mr] ggershinsky commented on pull request #950: PARQUET-2006: Column resolution by ID

2022-03-22 Thread GitBox
ggershinsky commented on pull request #950: URL: https://github.com/apache/parquet-mr/pull/950#issuecomment-1074835875 Thanks @huaxingao , one more question / clarification. In the writer, > field_id has to be unique in the entire schema, otherwise, an Exception will be thrown.

[GitHub] [parquet-mr] ggershinsky commented on pull request #950: PARQUET-2006: Column resolution by ID

2022-03-21 Thread GitBox
ggershinsky commented on pull request #950: URL: https://github.com/apache/parquet-mr/pull/950#issuecomment-1073553770 hi @huaxingao , can you describe the lifecycle of the column IDs at a high level, either in the PR description, or in a comment? Where these IDs are stored (if in footer