Hi, Jan:

Thanks for the update. Left some comments and let's continue the discussion
in doc.

On Tue, Nov 7, 2023 at 4:26 PM Jan Kaul <jank...@mailbox.org.invalid> wrote:

> Hi all,
>
> thanks again to those who left some comments on the Iceberg Materialized
> View spec. The discussions showed that there are still some open questions
> that need to be answered. I added a section to the end of the document to
> highlight the decisions that need to be made. Please have another look, I
> would really appreciate your input.
>
> Best wiches,
>
> Jan
> On 27.10.23 11:48, Jan Kaul wrote:
>
> Thank you Dan and the others for your helpful comments. I've added some
> sections to address the points that you mentioned. I'm not really sure what
> you mean by fail after grace period.
>
> I've found a design document for the trino materialized views and tried to
> incorporate some of the points. I'm not really sure where to find the rest
> of the discussion. If you can think of important points please make a
> comment in the google doc so that I can add them to the document.
>
> Best wishes
>
> Jan
> On 26.10.23 20:57, Daniel Weeks wrote:
>
> I added a few comments to the doc, but I think there a few other things
> that probably need to be considered:
>
> Do we define behaviors around freshness (fail after grace period or fall
> back to view definition), what is the expectation for refreshes
> (manual/just-in-time/lazy), etc.
>
> I think there was a fair amount of discussion around this when Trino
> implemented materialized views, so it may be good to revisit those
> discussions and make sure we're covering the major areas.
>
> -Dan
>
> On Thu, Oct 26, 2023 at 11:50 AM Jean-Baptiste Onofré <j...@nanthrax.net>
> wrote:
>
>> Daniel is right, we deviated :)
>>
>> OK Brian, let's do that.
>>
>> Apologies.
>>
>> Regards
>> JB
>>
>> On Thu, Oct 26, 2023 at 8:40 PM Brian Olsen <bitsondata...@gmail.com>
>> wrote:
>> >
>> > Agreed, apologies to Jan :). JB, let's discuss this at the sync this
>> Wed, and after that we can create a new thread if needed.
>> >
>> > On Thu, Oct 26, 2023 at 1:38 PM Daniel Weeks <dwe...@apache.org> wrote:
>> >>
>> >> JB and Brian,
>> >>
>> >> I think we should probably move this discussion to a discuss thread
>> specifically for the topics you want to address.
>> >>
>> >> We've had a few instances now where the original intent of the thread
>> is redirected to talk about other subjects.  I don't feel this is a good
>> approach because, while it is on the apache mailing list, the topic of the
>> thread doesn't reflect the content, so you don't get the right
>> audience/level of engagement or buy-in.
>> >>
>> >> I'm not disagreeing with trying to improve how we communicate and
>> track improvements/proposals/etc, but I think we should try to keep the
>> thread on topic.
>> >>
>> >> Thanks,
>> >> -Dan
>> >>
>> >> On Thu, Oct 26, 2023 at 9:26 AM Jean-Baptiste Onofré <j...@nanthrax.net>
>> wrote:
>> >>>
>> >>> Oh, I don't say we have to provide a user mailing list. Personally, I
>> >>> like mailing list mainly because we have https://lists.apache.org/
>> >>> where we can browse and search on the mailing lists.
>> >>> A lot of Apache projects are using Slack or Zulip, but in parallel of
>> >>> mailing lists. As we say at Apache: "if it doesn't happen on the
>> >>> mailing list, it never happens".
>> >>> That said I would distinguish:
>> >>> - for dev, obviously we can use Slack for discussion, community
>> >>> meetings, etc, but we have to send main topics/discussions on the dev
>> >>> mailing list.
>> >>> - for user, I think Slack is good, but I like the user mailing list,
>> >>> to track/search/async communication as well.
>> >>>
>> >>> That's another discussion anyway, let's focus on the design proposals
>> >>> space: my understanding is that we want to have a space listing all
>> >>> proposals, for review, tagged as "done" or "in progress". Right ?
>> >>> I don't think a forum/stack overflow like would help here (it helps
>> >>> for users, not for dev/technical/design proposals).
>> >>>
>> >>> At Apache Beam, we have a similar page as at Iceberg:
>> >>> https://beam.apache.org/roadmap/ where you can click on roadmap items
>> >>> for details (https://beam.apache.org/roadmap/portability/).
>> >>> So, initially, I proposed to update
>> >>> https://iceberg.apache.org/roadmap/ with proposals (status
>> >>> "discussion").  As most of the proposals (all ?) come as Google Link,
>> >>> we can change a bit the look'n feel of this page including the list of
>> >>> proposals.
>> >>>
>> >>> That could be a first move, we can update later.
>> >>>
>> >>> Regards
>> >>> JB
>> >>>
>> >>> On Thu, Oct 26, 2023 at 5:54 PM Brian Olsen <bitsondata...@gmail.com>
>> wrote:
>> >>> >
>> >>> > Yeah, unfortunately there's no way to limit the functionality to
>> only facilitate this. In fact, the product that gets closest to it is
>> GitHub Issues.
>> >>> >
>> >>> > I believe putting the onus on developers deeply involved in the
>> project makes sense. Expecting users, especially newer users of a newer
>> generation will use an email list is unlikely, especially if they're in a
>> discovery mode and figuring out how to solve an issue. A lot of garnering
>> adoption from users is lowering every barrier to entry as well as lowering
>> time to that first hello world dopamine hit.
>> >>> >
>> >>> > I'm middle millennial and even I find using email for discussion
>> outside of my mental model/preference but I also see the benefits.
>> >>> >
>> >>> > On Thu, Oct 26, 2023 at 10:45 AM Jean-Baptiste Onofré <
>> j...@nanthrax.net> wrote:
>> >>> >>
>> >>> >> The idea is really to "square" GH Discussion only to
>> roadmap/design proposals.
>> >>> >>
>> >>> >> For "user support", more than Slack, I would love to see
>> >>> >> u...@iceberg.apache.org.
>> >>> >>
>> >>> >> So I would distinguish:
>> >>> >> - the design/spec proposals where we could use GH Discussions. If
>> >>> >> people use GH Discussion for support questions, then we can move
>> to GH
>> >>> >> Issue or direct to the mailing list/slack.
>> >>> >> - the user "support" should be on user mailing list and/or Slack
>> >>> >>
>> >>> >> You have a valid point: GH Discussions could be hard to manage
>> because
>> >>> >> most users will use it as a "support forum".
>> >>> >>
>> >>> >> My point is really:
>> >>> >> - we need a central space for design/spec proposals
>> >>> >> - it has to be on Iceberg community and visible for all
>> >>> >>
>> >>> >> Regards
>> >>> >> JB
>> >>> >>
>> >>> >> On Thu, Oct 26, 2023 at 5:30 PM Brian Olsen <
>> bitsondata...@gmail.com> wrote:
>> >>> >> >
>> >>> >> > GitHub Discussions could be a solution that we should consider.
>> We used it on the Trino side but still have mixed results with it. On one
>> hand, there's a lot of overlap between creating Issues and Discussions. In
>> fact, GitHub allows you to migrate Issues that only involve discussing a
>> topic, or something that can't immediately be tied to any upcoming work to
>> be a discussion. This keeps the Issue backlog focused on actionable
>> requests.
>> >>> >> >
>> >>> >> > That said, Discussions can become difficult to maintain if no
>> person or body of people drives it. Of course, the community will drive it
>> to some degree, especially when it's new and shiny, but GitHub Discussions,
>> much like Slack, becomes a support channel that encourages the messy human
>> interactions that help us arrive at a solution. So the question is do we
>> want to open Discussions knowing that it may become a second support
>> channel compared to Slack? Would we want to use Discussions in place of
>> Slack so that there's still a single triage channel?
>> >>> >> >
>> >>> >> > I personally lean towards keeping a single real-time
>> "support-like" channel in the community, otherwise, you will fragment the
>> attention of the community. Most of what we would need to support the
>> centralization of proposals can be accomplished with Issues. Slack still
>> seems to be the dominant interactive system of choice and where we are now
>> so I wouldn't suggest moving that. I do think this is worth a discussion at
>> the next sync so I'll add it.
>> >>> >> >
>> >>> >> > In full transparency, Tabular is building an Iceberg-focused
>> Discourse forum (not to be confused with Discord) instance to solve the
>> problem of centralizing discussions in the community to wiki-style answers
>> we can link to and having dedicated content curators to those solutions.
>> Think of it as an Iceberg-specific Stack Overflow with lightened rules to
>> allow more open discussion. Adding GitHub discussions wouldn't collide with
>> our goals as it would become another signal that we could use to inform the
>> answers on our forum. It still comes back to the value given the cost for
>> the community to manage it.
>> >>> >> >
>> >>> >> > I know I have a lot of thoughts around this and its because I've
>> been down this road before, but perhaps there's a nuance I'm not seeing yet.
>> >>> >> >
>> >>> >> > On Thu, Oct 26, 2023 at 7:15 AM Jean-Baptiste Onofré <
>> j...@nanthrax.net> wrote:
>> >>> >> >>
>> >>> >> >> Just to be clear: we can GH Discussions subjects template via
>> >>> >> >> .asf.yaml but we have to open a ticket to INFRA to enable it.
>> >>> >> >>
>> >>> >> >> Regards
>> >>> >> >> JB
>> >>> >> >>
>> >>> >> >> On Thu, Oct 26, 2023 at 1:56 PM Jean-Baptiste Onofré <
>> j...@nanthrax.net> wrote:
>> >>> >> >> >
>> >>> >> >> > Hi Brian
>> >>> >> >> >
>> >>> >> >> > I like the idea of GitHub. Why not enabling (in .asf.yml)
>> GitHub
>> >>> >> >> > discussions ? A GitHub Discussion could be a good place to
>> share the
>> >>> >> >> > doc and exchange both in the doc and in the discussion
>> comments.
>> >>> >> >> >
>> >>> >> >> > Regards
>> >>> >> >> > JB
>> >>> >> >> >
>> >>> >> >> > On Thu, Oct 26, 2023 at 1:13 PM Brian Olsen <
>> bitsondata...@gmail.com> wrote:
>> >>> >> >> > >
>> >>> >> >> > > Hey JB,
>> >>> >> >> > >
>> >>> >> >> > > I totally agree we need a place to centralize this but I'm
>> nit a huge fan of all the lists we currently have going on the site. SSGs
>> are just not an accessible method of storing lists. ( roadmap, blogs,
>> videos, etc..).
>> >>> >> >> > >
>> >>> >> >> > > The roadmap is barely touched for this reason. I want to
>> propose we move roadmap to GitHub projects.
>> >>> >> >> > >
>> >>> >> >> > > Likewise, I feel like somewhere on GitHub might be a better
>> location for this type of thing.
>> >>> >> >> > >
>> >>> >> >> > > Maybe posting these in GitHub issues and adding a proposal
>> label?
>> >>> >> >> > >
>> >>> >> >> > > On Tue, Oct 24, 2023 at 9:28 AM Jean-Baptiste Onofré <
>> j...@nanthrax.net> wrote:
>> >>> >> >> > >>
>> >>> >> >> > >> Hi Jan
>> >>> >> >> > >>
>> >>> >> >> > >> Thanks for the reminder. I will take a look.
>> >>> >> >> > >>
>> >>> >> >> > >> As proposed by Renjie a few days ago, it would be great to
>> >>> >> >> > >> gather/store all document proposals in a central place.
>> >>> >> >> > >>
>> >>> >> >> > >> If there are no objections, I will prepare a PR for the
>> website about
>> >>> >> >> > >> that (with a space listing/linking all proposals).
>> >>> >> >> > >>
>> >>> >> >> > >> Regards
>> >>> >> >> > >> JB
>> >>> >> >> > >>
>> >>> >> >> > >>
>> >>> >> >> > >>
>> >>> >> >> > >> On Tue, Oct 24, 2023 at 9:22 AM Jan Kaul
>> <jank...@mailbox.org.invalid> <jank...@mailbox.org.invalid> wrote:
>> >>> >> >> > >> >
>> >>> >> >> > >> > Hi all,
>> >>> >> >> > >> >
>> >>> >> >> > >> > I've created an issue to propose a design for a
>> Materialized View Spec a while ago. After further discussion we reached a
>> first draft for the spec. It would be great if you could have another look
>> at the design and share your feedback.
>> >>> >> >> > >> >
>> >>> >> >> > >> > Here is the google doc:
>> https://docs.google.com/document/d/1UnhldHhe3Grz8JBngwXPA6ZZord1xMedY5ukEhZYF-A/edit?usp=sharing
>> >>> >> >> > >> >
>> >>> >> >> > >> > Thanks in advance,
>> >>> >> >> > >> >
>> >>> >> >> > >> > Jan
>>
>

Reply via email to