snichols opened a new issue, #13618:
URL: https://github.com/apache/druid/issues/13618

   ### Affected Version
   
   24.0.2
   
   ### Description
   
   Don't get me wrong here, I love so much about Druid!  I love love love it!
   
   But I'm a noob at operating a Druid cluster in a production environment.  I'm 
using druid-operator and it works really well.  I can stand up clusters and 
they work great!  Fantastic.
   
   Where I'm running into issues is when I delete streaming datasources and 
attempt to reconstitute them.  Here are the repro steps:
   
   1. Stand up a fresh Druid cluster using S3 for deep storage.
   2. Set up a Kafka ingest supervisor to pull records from a topic.
   3. Let that supervisor work long enough to persist segments.  An hour, days, 
it's dealer's choice!
   4. Terminate the supervisor.
   5. Wait for the Kafka ingest task to finish.
   6. Mark all datasource segments as unused.
   7. Run a kill task for said datasource.
   8. Wait for kill task to complete.
   9. Observe that there's no datasource in the datasource list.
   10. Observe that there are no segments listed in the segments list.
   11. Set up a Kafka ingest supervisor to pull records from a topic with the 
same settings as in step 2.
   12. Watch as hilarious bugs occur.  It could be that old segment metadata 
interferes or perhaps a topic name change causes weird exceptions.  In any 
case, this never works cleanly.
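   
   For reference, steps 4, 6 and 7 can be sketched as plain HTTP calls.  This is 
only a minimal sketch, not the exact commands I ran: the router address, the 
datasource name, and the assumption that the supervisor ID matches the 
datasource name are all hypothetical, and the endpoint paths are as I understand 
them from the Druid API docs, so please check them against 24.0.2.  The helper 
names `build_request` and `teardown_requests` are made up for illustration.  The 
sketch only builds the requests; nothing is sent unless you uncomment the 
`urlopen` call.

```python
import json
import urllib.request

# Assumption: Overlord and Coordinator APIs are both reachable through one router.
ROUTER = "http://localhost:8888"  # hypothetical address


def build_request(method, path, payload=None):
    """Build (but do not send) a request against the router, with an optional JSON body."""
    data = json.dumps(payload).encode("utf-8") if payload is not None else None
    req = urllib.request.Request(ROUTER + path, data=data, method=method)
    if data is not None:
        req.add_header("Content-Type", "application/json")
    return req


def teardown_requests(datasource, supervisor_id, interval):
    """Return the requests for steps 4, 6 and 7, in that order."""
    return [
        # Step 4: terminate the supervisor.
        build_request(
            "POST", f"/druid/indexer/v1/supervisor/{supervisor_id}/terminate"
        ),
        # Step 6: mark all segments of the datasource as unused.
        build_request("DELETE", f"/druid/coordinator/v1/datasources/{datasource}"),
        # Step 7: submit a kill task covering the given interval.
        build_request(
            "POST",
            "/druid/indexer/v1/task",
            {"type": "kill", "dataSource": datasource, "interval": interval},
        ),
    ]


if __name__ == "__main__":
    # Hypothetical datasource name and a deliberately wide interval.
    for req in teardown_requests(
        "my_datasource", "my_datasource", "1000-01-01/3000-01-01"
    ):
        print(req.get_method(), req.full_url)
        # urllib.request.urlopen(req)  # uncomment to actually send the request
```

   Steps 5 and 8 (waiting for the ingest and kill tasks to finish) are left out 
of the sketch; I just watched the task list until they completed.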
   
   I'd like to suggest that when a datasource is deleted, all references to 
the datasource are actually removed from the metadata store.  Am I missing some 
reason why this shouldn't be the case already?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

