Re: Unstructure data support in Apache Polaris

2024-12-06 Thread Yufei Gu
Thanks Dmitri for the thoughtful review! If we rely on Iceberg manifest files to transactionally track files, it needs additional APIs or engines to manage file operations. In contrast, the approach in the proposal allows users to seamlessly add, delete, or update files without any new API depend

Re: Unstructure data support in Apache Polaris

2024-12-06 Thread Dmitri Bourlatchkov
Hi Yufei, Interesting proposal. I commented in the doc. WDYT about using Iceberg metadata to list the stage files (in manifests)? Thanks, Dmitri. On Thu, Dec 5, 2024 at 6:21 PM Yufei Gu wrote: > Hi Folks, > > Polaris has become a cornerstone for managing structured data across > diverse proce

Re: Unstructure data support in Apache Polaris

2024-12-05 Thread Yufei Gu
Enabled the commenting permission Yufei On Thu, Dec 5, 2024 at 3:25 PM Laurent Goujon wrote: > Can you allow commenting on the doc? unless feedback should be provided via > the mailing list? > > On Thu, Dec 5, 2024 at 3:21 PM Yufei Gu wrote: > > > Hi Folks, > > > > Polaris has become a corner

Re: Unstructure data support in Apache Polaris

2024-12-05 Thread Laurent Goujon
Can you allow commenting on the doc? unless feedback should be provided via the mailing list? On Thu, Dec 5, 2024 at 3:21 PM Yufei Gu wrote: > Hi Folks, > > Polaris has become a cornerstone for managing structured data across > diverse processing engines, ensuring high performance and reliabilit

Unstructure data support in Apache Polaris

2024-12-05 Thread Yufei Gu
Hi Folks, Polaris has become a cornerstone for managing structured data across diverse processing engines, ensuring high performance and reliability. To further enhance its capabilities, we propose extending Polaris to support unstructured data. This will enable it to handle a broader range of dat