[ https://issues.apache.org/jira/browse/HADOOP-19385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17912091#comment-17912091 ]
ASF GitHub Bot commented on HADOOP-19385: ----------------------------------------- steveloughran opened a new pull request, #7285: URL: https://github.com/apache/hadoop/pull/7285 Add a module hadoop-cloud-storage-project/hadoop-format-testingn for testing integration with file formats: vector IO, bulk delete, format-aware input streams. ### How was this patch tested? TODO ### For code changes: - [ ] Does the title or this PR starts with the corresponding JIRA issue id (e.g. 'HADOOP-17799. Your PR title ...')? - [ ] Object storage: have the integration tests been executed and the endpoint declared according to the connector-specific documentation? - [X] If adding new dependencies to the code, are these dependencies licensed in a way that is compatible for inclusion under [ASF 2.0](http://www.apache.org/legal/resolved.html#category-a)? - [ ] If applicable, have you updated the `LICENSE`, `LICENSE-binary`, `NOTICE-binary` files? > S3A: add a file-format-parsing module for testing format parsing > ---------------------------------------------------------------- > > Key: HADOOP-19385 > URL: https://issues.apache.org/jira/browse/HADOOP-19385 > Project: Hadoop Common > Issue Type: Sub-task > Components: fs/azure, fs/s3 > Affects Versions: 3.4.2 > Reporter: Steve Loughran > Priority: Major > > Create a cloud-storage/format-parsing module declaring various file formats > as dependencies (parquet, iceberg, orc) purely for integration/regression > testing store support for them. > h2. Parquet > for parquet reading we'd want > * parquet lib > * samples of well formed files > * samples of malformed files. > Test runs would upload the files then open then. > h2. Iceberg > Verify bulk delete through iceberg FileIO api. -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org