paleolimbot commented on PR #7: URL: https://github.com/apache/arrow-experiments/pull/7#issuecomment-1983849933
> Could in theory also live in https://github.com/apache/arrow-testing Possibly! That repo is pulled many times an hour by CI and so I'm not sure this dataset is a good fit (since it is intentionally not trivially small). The files in arrow-testing are very good for testing but are meaningless for examples (e.g., random UTF-8 characters are not very compelling)....something like this might be good for use in documentation, blog posts, or a cookbook, but is not very good for testing. I added a note to the data README...this is all new territory, but I think that as with everything in this repo, any data here should either find a purpose elsewhere or be removed when it's clear there is no such home. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
