sweir-thescore commented on issue #12734: URL: https://github.com/apache/hudi/issues/12734#issuecomment-2625025969
Hey @rangareddy, For part 1 of your question, I believe you are asking about this setting, [hoodie.embed.timeline.server.reuse.enabled](https://hudi.apache.org/docs/configurations/#hoodieembedtimelineserverreuseenabled), which we use the default of false for. We do not believe there are multiple writers because we have a dispatch service that only kicks off a single dataproc job at a time for a particular ingest pipeline. If a dataproc job is already running for the pipeline, this dispatch service does not start another dataproc job. Pairing that with SINGLE_WRITER mode, we do not believe there could be multiple writers. As for the second question, can you help define what files you are particularly looking for for the hoodie timeline? We had to sort the issue in the interrim due to production data needing to be ingested, but we may have historical, non-current hoodie metadata files if there are particular ones you are asking for. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
