nfarah86 commented on issue #8824: URL: https://github.com/apache/hudi/issues/8824#issuecomment-1569461129
following up from slack: 6 years of data in the active timeline is a lot of data. 1) what kind of queries are you running? Do you need incremental queries across 6 years of data? 2) Do you have a multi-writer situation where multiple writers are writing to the same table? 3) Can you share the Hudi timeline in the .hoodie folder? 4) is the data mostly insert or upsert or a mixed of both? 5) How are you partitioning the data? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
