Hi folks,

As AI agents, BI tools, notebooks, and query engines increasingly consume
the same data, semantic definitions such as metrics and dimensions are
often duplicated across multiple systems. This leads to inconsistent
definitions, duplicated effort, and governance challenges. The rise of AI
agents further amplifies this problem, as agents rely on semantic context
to understand data and reason about business concepts. Without a shared
semantic layer, organizations often end up maintaining multiple versions of
the same business definitions across tools and applications.

JB and I would like to start a discussion on adding semantic layer support
to Apache Polaris so semantic models can be defined once, governed
centrally, and consumed consistently across tools. The proposal[1]
introduces semantic models as a first class Polaris entity using the Open
Semantic Interchange (OSI)[2] specification[3]. At a high level, the
proposal adds:

   - A new SEMANTIC_MODEL entity type
   - CRUD APIs for semantic models
   - Schema validation and authorization

Polaris remains a metadata service and does not execute metrics or semantic
queries.
Feedback on the overall direction, design, and OSI adoption would be
greatly appreciated.

1.
https://docs.google.com/document/d/1ZdI-1w_5LbyCMhvUhLCtOt-N1Z89L2P-oiGLaYayCZg/edit?usp=sharing
2. https://open-semantic-interchange.org
3.
https://github.com/open-semantic-interchange/OSI/blob/main/core-spec/spec.md


Yufei

Reply via email to