Re: [PROPOSAL] Deprecate config key aliases / SetFromFlag

Aled Sage Wed, 23 Nov 2016 03:28:08 -0800

Thanks Alex.

For aliases for the _type_, agreed. We'll need to be careful how wechoose/register those aliases. That feels like a different discussionfrom this one, which focuses on configKey names though.

---

For "replace the ad hoc code for different types e.g. EntitySpec,PolicySpec, ..."

My understanding was that #363 would radically change our YAML parsing(and later persistence). However, we would still generate the Javaobjects like `EntitySpec` and still use the Entity Factory forconstructing them. I assumed we would still (at the java backend level)have the ConfigKey class.

I agree we can make other big improvements for getting rid ofdifferences between EntitySpec, PolicySpec etc, and differences betweenhow Entity and Policy handle config/attributes etc. Again, that feelslike a different topic.

---

You said "for readability i'd say that a comfortable alias is nicer towork with and should be the canonical form, as opposed to a long andugly identifier".

I question whether we need the long and ugly identifier at all (exceptas a deprecated name for backwards compatibility).

Perhaps we also have a difference of opinion of how the alias shouldbehave. I think that, if it exists, it should be a genuine alias thatcan be used anywhere that the other name(s) can be used. Do you think weshould have different rules for where the alias can be used and how itbehaves (e.g. never inherited in the runtime management hierarchy,and/or never inherited by sub-types)?

---

I'll respond on the separate email thread ("Deprecate @SetFromFlag")about how best to move towards those usability improvements.

(So this thread is just about whether we should have aliases - they areseparate questions).


Aled


On 23/11/2016 10:27, Alex Heneveld wrote:

concrete example where aliases are essential in my view: whenentering the _type_. how many of us switch to an IDE or grep to findthe package.Class name then paste it in. having `server` as an aliasfor the type makes it much easier to write yaml.
my canonical form vision is that in the ui we can highlight wherever anon-canonical form is being used and give the option to switch. forreadability i'd say that a comfortable alias is nicer to work with andshould be the canonical form, as opposed to a long and uglyidentifier. so re your obj #1 we want some aliases: it's not adead-end code path. discarding them altogether is going to causeother issues. (that's not to say we can't clean up their usage, whichof course we can.)
btw #363 allows design time yaml to be converted to canonical form notjust persisted state of deployed models (and yes, i think it shouldreplace the ad hoc code for different types eg EntitySpec, PolicySpec,....).
the real issue in my view is that we have too many aliases and theyare used inconsistently without good docs/help. a canonical form andinteractive help on keys will go a long way towards solving that.simply deprecating aliases without that is just going to beirritating. similarly i'd like us to have a better tie in with sourcecontrol and developer workflow before advocating a big change toblueprints (i think this will help with #2; ie have in-productwarnings on deprecated usage and paths to upgrade, before wedeprecate/drop aliases ... this is an issue either with your proposalor mine).
so my strong preference is to focus on those usability items first,and *then* look at eliminating some aliases. any other path is goingto be even more disruptive for users!
--a


On 23/11/2016 10:09, Aled Sage wrote:
// The @SetFromFlag is an implementation detail - this proposal isjust to discuss whether we should have aliases.
As Alex says, the main problem being solved is that having multiplenames makes things confusing (even if we were to fix otherinconsistencies so that those names could be used in the same wayanywhere).
---
Alex suggests having a "clear preferred way -- a canonical form ifyou like -- for any blueprint, and the ability to output things inthat format."
Two things scare me about that:

1. It suggests that we go to the effort of supporting an alternative
   name, but make sure we never use that alternate name in any of our
   examples and thus try to make sure users don't use it. If so, why is
   it there? Will it just lead to confusion when someone comes across a
   blueprint that uses the short-form name, which is thus different
   from all "official" examples?
2. For "output things in that format", their YAML blueprint is likely
   in version control (or blog, or whatever). We are thus not changing
   their blueprint. If they use a short-form name, then that will
   continue to be in version control. If the blueprint is added to an
   online catalog, it will continue to use that short-form name
   (because we'd show in the catalog the exact blueprint from their git
   repo).

---
Alex says "It also gives us an easy way to update a blueprint wherethings are deprecated/changed."
I don't follow - are we talking about solving different problems, oris your vision of PR #363 that we eventually replace the EntitySpecand ConfigKey classes, and the `brooklyn.parameters` as well?
Let's take a concrete use-cases. (let's not argue/discuss thespecific names, and instead focus on the use-cases):
Someone writes a blueprint with `enricher.sourceSensor: cpuUsage`. Wedeprecate "enricher.sourceSensor", preferring the name "sourceSensor".
The desired behaviour (IMO) is that:

1. We continue to support both names for X releases/months.
2. The blueprint author is warned about use of the deprecated name
   (next time they validate, or next time they deploy).
3. The entity's type show the new name and deprecated name(s). This is
   also included in auto-generated docs (similar to auto-generated
   javadoc), and is available for Brooklyn's YAML composer to give
   warnings (either while editing, or when the blueprint is submitted).
I'm guessing what you mean by "easy way to update a blueprint" is forthe persisted state: to switch the name that is written to thepersisted state, so that it uses the new name. That is probably good,but we should think carefully about implications for rolling back toolder Brooklyn versions.
---
For comparing "long-name syntax" versus short "flag name" with CLIs...
CLIs usually follow a very specific convention, e.g. `diff -w` and`diff --ignore-all-space`; except for java which uses single "-" forboth short and long form - a very bad thing in my opinion :-)
Some CLIs (like `br`) accept long and short forms (e.g. `brapplication` or `br app`). This is ok because the context is neverambiguous - you never pass "application" or "app" to a differentcommand, expecting different behaviour / ambiguity for it theninvoking `br`.
---
If we conclude that aliases are sometimes a good idea, we shouldagree when and how they should be used (e.g. is it primarily forshort-form; is it for multiple sensible names; is it for supportingcamel-case versus dot or underscore forms; etc).
Unfortunately our aliases are massively over used in Brooklyn (in myopinion), in an ad hoc manner. Most (if not all) should be deprecated.
Aled


On 22/11/2016 22:31, Alex Heneveld wrote:
Hi Aled -
There are a few more things that I think need to be consideredhere. Also, combining your proposals.
Firstly -- throughout Brooklyn we use the long-name syntax as aninternal / formal name of a key to prevent ambiguity, and a short"flag name" to make it easy for a user to write. This is sometimesuseful where you set a config key at the root, using the formalname, so that it is inherited at a specific descendant. If we don'tneed to rely on inheritance this argument goes away somewhat but Idon't think we're there yet.
Secondly -- what problem are we trying to solve? I think we shouldbe dismissive of proposals that don't solve a problem. Yours doesbut it doesn't say what that problem is. I think the problem isthat having multiple ways to do the same thing can be confusing,especially if docs and examples are inconsistent.
I think a better solution that that problem is a clear *preferred*way -- a canonical form if you like -- for any blueprint, and theability to output things in that format.
This means users looking at our docs and examples -- or anythingthat uses canonical forms -- will see a consistent style. Iteliminates some, if not all, of the confusion which is the problem.
It also gives us an easy way to update a blueprint where things aredeprecated/changed.
I'd much prefer going that route than the proposal you suggest, andthen deciding after that whether deprecating all aliases is theright thing. (Given the short/long distinction I'd prefer the ideathat "all-but-one" alias might be deprecated in most cases. But I'malso unsure that with a canonical form and good tooling, aliasesmight actually be a good thing. They are commonplace in more textinteractive scripting -- which this approaches, as opposed to themethod names in Aled's proposal. Think of CLI arguments and ofcourse the text adventure games of our youth...
(As you know this is largely implemented in #363, at which pointdeprecated @SetFromFlag and moving to preferred aliases becomessimple, and optionally saying that all or any non-preferred alias isdeprecated.)
--A

[1] https://github.com/apache/brooklyn-server/pull/363


On 22/11/2016 21:48, Aled Sage wrote:
Hi all,
TL;DR: aliases for config keys should be deprecated. Each configkey should have only one proper name, with other names deprecated.
We should change this *after* releasing 0.10.0, to decrease risk.
(This was mentioned in the email thread "[PROPOSAL] Deprecate@SetFromFlag").
_*Current Situation*_
When defining a config keys in Java, one can add an annotation like:

   @SetFromFlag("version")
   ConfigKey<String> SUGGESTED_VERSION =
   newStringConfigKey("install.version", "Suggested version");
This alternative name specified in @SetFromFlag is respected insome situations, but not others.
_*Requirements*_
The desire to support multiple names can be split into threedifferent use-cases:
1. Backwards compatibility (e.g. because we already support two names,
   so need to keep doing that; or because we want to rename a config
   key, such as correcting its spelling).
This use-case could be reworded as the need to supportdeprecated names.It is covered in the email thread "[PROPOSAL] Deprecate@SetFromFlag".
2. Aliases (i.e. a deliberate desire to support different names,
   because those different names are seen as good).
3. Hints on blueprint validation in a composer (e.g.
   "environment.variables not valid; did you mean env?")

_*Proposal: Don't Use Aliases For Config Keys*_
I propose that we deprecate support for use-case (2) above: use ofaliases.
The use of aliases leads to confusion about what the differentnames mean. When someone is looking at examples, it's unclearwhether they mean the same thing, or if one is valid but the otheris not. There is a scary amount of folk lore about config key namesalready!
Example blueprints have a tendency to proliferate: a blueprintwritten within a company adopting Brooklyn is often used as thebasis for other blueprints. If we support an alias without a veryobvious deprecation warning in the YAML composer, then use of thatalias will spread.
---
Note that this is a separate discussion from whether our existingnames are right! There are probably a lot of names we shoulddeprecate and improve.
_*Proposal: Guidelines for "deprecated"*_
For use-case (1) above, i.e. deprecated names, we should treat thatin a similar way to Java deprecated methods.
We should *not* add a deprecated name just because we think it's anice alternative name. We should only add deprecated names when itis an undesirable name that we need to support for backwardscompatibility.
For example, if someone submitted a pull request with three methodsthat all did the same thing, then I'd reject that PR - e.g.sort(collection), arrange(collection) and order(collection).
_*Proposal: Hints for Names*_
There is a compelling argument for providing hints for incorrectnames, particularly when using an online YAML composer or whenvalidating a YAML blueprint.
For example, if someone uses "environment.variables" but the realname is "env", then a validation warning can be shown with an errormessage proposing the correct name.
This could be achieved by providing "close names". If the namematches another config key, then that would be used. Otherwise, ifthe name matches a "close name" of a config key, then it would showthe validation warning. Note that it is a warning rather than anerror because of the rules for config inheritance: it could be thatthe config key will be inherited by children that will understandthe given name.
We could have a "strict" mode that treated such warnings as errors(sounds like a topic for a different email thread!).
We could do some similar automatic checks for close matches, e.g.to warn if "installCommand" is used instead of "install.command".
To me, it feels like "hints" is stage two - i.e. lower prioritythan agreeing each config key should have a single definitive name,and deprecating the other names.

Re: [PROPOSAL] Deprecate config key aliases / SetFromFlag

Reply via email to