codeant-ai-for-open-source[bot] commented on PR #37557:
URL: https://github.com/apache/superset/pull/37557#issuecomment-3821284443

   ## **Sequence Diagram**
   
   The PR threads UUIDs from dataset YAML configs through example discovery 
into the generic Parquet loader and changes dataset lookup to prefer UUID (with 
schema fallback). This prevents duplicate datasets and backfills UUID/schema on 
existing metadata when needed.
   
   ```mermaid
   sequenceDiagram
       participant CLI
       participant DataLoading
       participant GenericLoader
       participant Database
   
       CLI->>DataLoading: discover_datasets() -> read dataset.yaml (includes 
uuid)
       DataLoading->>GenericLoader: create_generic_loader(..., uuid=from_yaml)
       CLI->>GenericLoader: invoke loader -> load_parquet_table(uuid)
       GenericLoader->>Database: _find_dataset(uuid first; else 
table_name+schema)
       alt Dataset found by UUID
           Database-->>GenericLoader: return existing SqlaTable (no changes)
       else Not found
           GenericLoader->>Database: create/load table, create/merge SqlaTable
           GenericLoader->>Database: set/backfill tbl.uuid and tbl.schema if 
provided
           Database-->>GenericLoader: merged SqlaTable
       end
       GenericLoader-->>CLI: return dataset (matched or created)
   ```
   
   ---
   *Generated by [CodeAnt AI](https://codeant.ai)*
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to