iemejia commented on pull request #12064: URL: https://github.com/apache/beam/pull/12064#issuecomment-648252297
Hi, PR looks perfect but I have a question. Why do you need to pass a `Configuration` object or in other words what are you trying to configure? Something FileSystem related or Parquet related? Just with this in mind. We are trying to keep ParquetIO public API a bit 'isolated' of Hadoop API to make evolution 'easier' for not Hadoop cases (https://issues.apache.org/jira/browse/PARQUET-1126) I am even feeling inclined if what you want to achieve is Parquet configuration (`parquet.avro.add-list-element-records`, `parquet.avro.write-old-list-structure`) to pass those in a Map. But if what you want is FileSystem configuration I am wondering why the configuration on HDFS does not make it, did you try that if that's the case? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
