sweir-thescore commented on issue #12734:
URL: https://github.com/apache/hudi/issues/12734#issuecomment-2625025969

   Hey @rangareddy,
   
   For part 1 of your question, I believe you are asking about the setting 
[hoodie.embed.timeline.server.reuse.enabled](https://hudi.apache.org/docs/configurations/#hoodieembedtimelineserverreuseenabled),
 which we leave at its default of false. We do not believe there are multiple 
writers, because our dispatch service only kicks off a single dataproc job at 
a time for a particular ingest pipeline. If a dataproc job is already running 
for the pipeline, the dispatch service does not start another one. Combined 
with SINGLE_WRITER mode, we do not see how there could be multiple writers.
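
   To make the configuration described above concrete, here is a minimal sketch in Python. It is an illustration under stated assumptions, not our actual job code: the table name and the helper `is_single_writer_config` are hypothetical, and `hoodie.write.concurrency.mode` is assumed to be the key behind the SINGLE_WRITER mode mentioned above.

```python
# Illustrative writer options; the table name is a placeholder.
hudi_options = {
    "hoodie.table.name": "example_table",
    # Assumed key for the SINGLE_WRITER mode mentioned in the comment.
    "hoodie.write.concurrency.mode": "SINGLE_WRITER",
    # Left at its default of false, as described in the comment.
    "hoodie.embed.timeline.server.reuse.enabled": "false",
}


def is_single_writer_config(options: dict) -> bool:
    """Hypothetical sanity check: True when the options match the setup above."""
    mode = options.get("hoodie.write.concurrency.mode", "SINGLE_WRITER")
    reuse = options.get("hoodie.embed.timeline.server.reuse.enabled", "false")
    return mode.upper() == "SINGLE_WRITER" and reuse.lower() == "false"
```

   In a PySpark job, a dictionary like this would typically be passed via `df.write.format("hudi").options(**hudi_options)`. Note that the check only covers configuration; the guarantee that one job runs at a time comes from the dispatch service, not from these settings.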
   
   As for the second question, could you specify which hoodie timeline files 
you are looking for? We had to sort out the issue in the interim because 
production data needed to be ingested, but we may still have historical, 
non-current hoodie metadata files if there are particular ones you need.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

