Re: Topics [was: news from the topic experiment]

Pierre-Yves David Thu, 13 Oct 2016 07:02:14 -0700


On 10/13/2016 07:33 AM, Erik van Zijst wrote:

Working on our Bitbucket spike I wondered if topics could perhaps
benefit from a small simplification. Instead of adding the topic name
as an additional field, what if we defined a topic commit by merely
adding a boolean property to the meta dict; e.g. `is_topic: True`?
Named branches would not have the property set.

As a foreword, I agree the frontier between named branch and topics isthin. The difference between them are useful but if we can make theirmanagement really close, that would be useful. If we did not had thebackward compatibility constrains, I would happily have `hg branch foo`create a "topics" and something like `hg branch --long-lived` createwhat currently is a named branched.

So, thanks for exploring possibilities to make this frontier thiner.However, I can see some issues with some aspects of this proposal, usingthe same field for either branch or topic make them mostly mutuallyexclusive. Publishing a topic on a named branch would require to alterthe changesets data (and therefore, hash) This means people could usetopic only with the default branch (or significant more complexity towork around this). As I understand, Bitbucket enforces a single masterbranch so that might actually fit your model. This is probably toorestricting for the general project (more about that later).

It would seem to me that this could have some benefits:

1. There would be no risk of a name collision between the branch and
topic namespaces.

I'm not certain this actually avoid the risk of name collision. Peoplecould use the same branch/topic name on different changesets withdifferent values for the flag. That would lead to both a topic and anamed branch to exists with the same name.

In all cases we should have the local UI fight hard to prevent people tocreate collisions between branch and topic. (And some descent way topoint out name conflict if they appends).

During our little Bitbucket spike and demo during the sprint we made
the assumption that topics and branches should never clash, which
allowed me to put them in the existing, single branches namespace.
This would seem desirable as topics are essentially branches, with the
only real difference being their anticipated longevity. This greatly
simplified the UI as hardly anything needed to be modified.


That is a interesting though.

I think there is some value in having the option to use a (branch,topic) pairs (more about this later), However, the same as people arenot really exposed to named branch until they get out of default, a UIcould still omit all named branch informations as long as nothing elsethan "default" exists.

2. Interoperability with non-topic clients would mostly just work.

Currently, when cloning a topics repo with an old client, all topics
would appear as an amorphous mess of anonymous heads on default.
Instead, if we dropped the separate topic name field and just used the
branch name as the topic name, an old client would see the same layout
as a topic-enabled client and while an old client would not be able to
create new topic commits, read-only clients should be totally fine.
This could be a big boon for existing ecosystem tools like CI servers
that wouldn't have to be modified.

There is some very interesting ramification to this proposal. Even if wedo not go with the flag approach. We store the full (branch, topic) pairinto the branch field. For example we could use ":" as a separatorbranch=BRANCH:TOPIC. Not only this would allow old clients to transportthe data (we already have that) but this also mean old client can alsoview and preserve that data (and this is new) even if it does not getthe behavior improvement related to topic. That would be a largeusability boost for old client.


This is a great lead thank you very much.

The only downside I can think of is that when the topic and original
branch name are separate fields, that a topic sort of remains
associated with the branch it was based on. This would provide
implicit merge and rebase targets and therefore slightly shorter
commands. However, I'm not sure that's worth giving up the above
points.

I think having the (named, topics) pair is really useful, especially wecan expect great gains from sensible and clear default (merge, rebase,behind computation, etc). In addition, as we can keep hiding the namedbranch concept for all users who do not needs it. I think we can have agood trade-off regarding extra-feature vs extra-complexity while keepingan initial complexity similar to not having named branch at all.(But as usual, I'm open to be convinced that the right trade-off issomewhere else)

We need to explore a bit more the consequence of having the same topicon multiple branches, but I'm not too worried we can eventually definessome good behavior+constrains pairs that makes this working.

I do realize it's quite possible I'm overlooking other reasons for
having topics encoded as an additional namespace, separate from the
named branch name.


That email was very useful, please send more of them ☺♥

Cheers,

--
Pierre-Yves David
_______________________________________________
Mercurial-devel mailing list
Mercurial-devel@mercurial-scm.org
https://www.mercurial-scm.org/mailman/listinfo/mercurial-devel

Re: Topics [was: news from the topic experiment]

Reply via email to