jerqi commented on issue #1030: URL: https://github.com/apache/incubator-uniffle/issues/1030#issuecomment-1667121730
> With my limited experience, I think s3 is a good choice in this issue. > > If we choose to support s3, then it will easily extend to other filesystems (NFS, CIFS, EFS, GCS fuse, Azure File System) by using a solution like MinIO. > > > Yeah, a lot of small index files will not work well in object storage like S3. Maybe a good idea to store it in other places. Or is it possible to serve index files from Spark driver? > > I think we may have other solution for merging small index files like [application-and-practice-of-spark-small-file-merging-function-on-aws-s3](https://aws.amazon.com/cn/blogs/china/application-and-practice-of-spark-small-file-merging-function-on-aws-s3/)? > > Feel free to correct me if I am wrong. I will propose a document at this weekend. First, we can only define some interfaces. Then, we will implement some object systems according to the need of xianjingfeng and zuston. @xianjingfeng What object system do you want to implement? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
