[ https://issues.apache.org/jira/browse/PARQUET-2347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17774962#comment-17774962 ]
ASF GitHub Bot commented on PARQUET-2347: ----------------------------------------- amousavigourabi commented on code in PR #1141: URL: https://github.com/apache/parquet-mr/pull/1141#discussion_r1358428311 ########## parquet-hadoop/src/main/java/org/apache/parquet/hadoop/api/ReadSupport.java: ########## @@ -75,14 +76,32 @@ public ReadContext init( throw new UnsupportedOperationException("Override init(InitContext)"); } + /** + * called in {@link org.apache.hadoop.mapreduce.InputFormat#getSplits(org.apache.hadoop.mapreduce.JobContext)} in the front end + * + * @param configuration the configuration + * @param keyValueMetaData the app specific metadata from the file + * @param fileSchema the schema of the file + * @return the readContext that defines how to read the file + * + * @deprecated override {@link ReadSupport#init(InitContext)} instead + */ + @Deprecated Review Comment: This PR is focussed on transitioning from `Configuration` to the `ParquetConfiguration` interface. This included some calls to deprecated methods which I could not very quickly transition away from. I would consider this out-of-scope for this PR. > Add interface layer between Parquet and Hadoop Configuration > ------------------------------------------------------------ > > Key: PARQUET-2347 > URL: https://issues.apache.org/jira/browse/PARQUET-2347 > Project: Parquet > Issue Type: Improvement > Components: parquet-mr > Reporter: Atour Mousavi Gourabi > Priority: Minor > > Parquet relies heavily on a few Hadoop classes, such as its Configuration > class, which is used throughout Parquet's reading and writing logic. If we > include our own interface for this, this could potentially allow users to use > Parquet's readers and writers without the Hadoop dependency later on. > In order to preserve backward compatibility and avoid breaking downstream > projects, the constructors and methods using Hadoop's constructor should be > preserved for the time being, though I would favour deprecation in the near > future. > This is part of an effort that has been [discussed on the dev mailing > list|https://lists.apache.org/thread/4wl0l3d9dkpx4w69jx3rwnjk034dtqr8]. -- This message was sent by Atlassian Jira (v8.20.10#820010)