boyuanzz commented on pull request #12223: URL: https://github.com/apache/beam/pull/12223#issuecomment-660321284
I'm curious what kind of splittable target that the `ParquetIO`. It seems like a file could have many row groups and each row groups would have many records. Does `ParquetIO` want to split at row groups level, for example, split groups of 2 into one group each, or at records level? Also should we consider x-lang usage of `ParquetIO`? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
