paul-rogers commented on issue #13816:
URL: https://github.com/apache/druid/issues/13816#issuecomment-1474377866

   Generalizing, the "catalog" becomes Druid's existing metadata (with a new 
item to store an MSQ ingest query as a "procedure"):
   
   * MSQ queries (for batch ingest) (new)
   * Supervisors (for streaming)
   * Compaction specs
   * Load rules (including cold tier rules, when that feature lands)
   * Data aging rules (proposed)
   
   The "catalog" is just a conceptual term, in this case, for the 
datasource-related APIs we already have. The user is responsible for ensuring 
consistency across columns as data moves from ingestion through compaction (and 
in queries). This gives the user the ultimate freedom to evolve columns in any 
way that the app needs.
   
   This is a task-focused view of metadata: first specify the task, then 
specify how to work with columns for that task. It contrasts with the original 
column-focused view: provide information for each column, and let the task use 
that common information to work out what that task should do.
   
   Both work. Our job is to debate the pros and cons and choose one for 
implementation.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to