Re: [Metamath] Philosophy and goals for set.mm

OlivierBinda Fri, 24 Jan 2020 00:56:25 -0800

I'm just a newbie (indirect) metamathematician, so my opinion (from the point of view of a Metamath tool writer) might not be worth much:


- Simplicity is great (one of the best features of Metamath)

- Deduplication is awesome (it reduces the memory and time cost of writing proofs)

There has been a debate lately about having underscores in ids. My (tool writer) opinion is that this debate should not happen:

In software development, there is a generally held opinion that ids should be opaque tokens.

I fear that you are mixing 2 roles:
- ids that should uniquely determine a theorem forever
- (indexed) fields that help humans find theorems in the set.mm database

This happens because Metamath was designed to be easy to work with from a basic text editor with search/replace capabilities.

This worked great (Metamath is successful after all), but it is probably not the optimal setup for Metamath:

An illustration: there are plenty of languages that tried to be the next Java (Groovy, Scala, C#, Kotlin), but the one that is really succeeding right now is Kotlin, because its designers realized that

- they had to design a better language than Java

- they had to write great tools alongside it to make it easy/enticing for developers to adopt/migrate to their new platform.

So far, all Android developers (we are talking millions of developers) have switched from Java to Kotlin. And Kotlin is still aiming to take over other traditional spaces of software development.


My point and my opinion is :

Metamath is really awesome, but if we developed tooling for it, alongside it, that would remove some of its constraints and maybe make it even better.


I think that people should not have to work in metamath in text editors,

computers should provide the help they need when it comes to displaying, looking for theorems, etc.

For example, why isn't there a simple text search tool (that could interact with mmj2 or other software) that allows people to input some string (say "( a + b ) =" or "( a in CC )") and that returns a theorem id?
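A minimal sketch of what such a lookup could look like, assuming statements are stored as whitespace-separated Metamath tokens. The ids (`T0001`, ...) and the three statements are made up for illustration and are not set.mm labels; the matching is purely literal, so a real tool would also need to unify variable names (e.g. treat "a" and "A" alike), which this sketch does not attempt.

```python
from collections import defaultdict

# Hypothetical mini-database: opaque id -> statement as a token string.
STATEMENTS = {
    "T0001": "|- ( A e. CC -> ( A + 0 ) = A )",
    "T0002": "|- ( ( A e. CC /\\ B e. CC ) -> ( A + B ) = ( B + A ) )",
    "T0003": "|- ( A e. CC -> ( A x. 1 ) = A )",
}


def build_index(statements):
    """Inverted index: token -> set of ids whose statement contains it."""
    index = defaultdict(set)
    for uid, text in statements.items():
        for token in text.split():
            index[token].add(uid)
    return index


def contains_run(tokens, query):
    """True if `query` occurs in `tokens` as a contiguous run."""
    n = len(query)
    return any(tokens[i:i + n] == query for i in range(len(tokens) - n + 1))


def search(query, statements, index):
    """Return the sorted ids of statements containing the query tokens in order."""
    q = query.split()
    if not q:
        return []
    # Cheap filter first: the statement must contain every query token.
    candidates = set.intersection(*(index.get(t, set()) for t in q))
    # Then confirm that the tokens appear next to each other, in order.
    return sorted(u for u in candidates if contains_run(statements[u].split(), q))


INDEX = build_index(STATEMENTS)
print(search("( A + B ) =", STATEMENTS, INDEX))
print(search("A e. CC", STATEMENTS, INDEX))
```

The index only narrows the candidate set; the final check is a plain scan, which keeps the sketch short at the cost of speed on a large database.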

Theorems could be hashed and indexed in many ways, with customized settings (so that people who would rather have a_in_CC than aInCC could be happy).
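As a rough sketch of the "customized settings" idea: if the stored id is opaque and the descriptive name is kept as separate parts, each reader can pick a rendering style without touching the database. The id `T0042`, the `NAME_PARTS` table and the style names are assumptions made up for this illustration.

```python
# Hypothetical table: opaque id -> descriptive name parts (not real set.mm data).
NAME_PARTS = {"T0042": ("a", "in", "CC")}


def render_label(words, style):
    """Render name parts in the reader's preferred style."""
    if not words:
        return ""
    if style == "snake":
        return "_".join(words)
    if style == "camel":
        # Capitalize only the first letter of each later part; keep the rest as-is.
        return words[0] + "".join(w[:1].upper() + w[1:] for w in words[1:])
    raise ValueError(f"unknown style: {style}")


def display_name(uid, style):
    """Look up the opaque id and render its name; the id itself never changes."""
    return render_label(NAME_PARTS[uid], style)


print(display_name("T0042", "snake"))  # a_in_CC
print(display_name("T0042", "camel"))  # aInCC
```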

People should NOT have to learn and remember theorem ids; this is a useless skill in real life,

and it prevents more people from working with Metamath.

Also, releasing the constraint that Metamath should be used in text editors or should be nice to human eyes may allow us to adopt a syntax aimed at computers that ensures coherency, is easier to parse, avoids pitfalls, allows definitions that are not axioms...

It would be the job of the tools to display the result in a nice manner for humans, and to make working with Metamath nice for humans...


Well, just a rant :)

As a side note, I managed to use antlr-kotlin to port the ANTLR Metamath parser I was using to Kotlin multiplatform. It was the only roadblock preventing releases of Mephistolus for the browser/android/ios/linux/windows/mac...

I'm going to work on the browser/JavaScript target now.

Best regards,
Olivier

On 24/01/2020 at 05:58, Norman Megill wrote:

Thierry, Alexander, and Benoit have asked for clarification of the goals of set.mm. Here are some of my opinions. I am moving the discussion in https://github.com/metamath/set.mm/issues/1431 to here for wider readership.
*1.* There is no "goal" per se for set.mm. People work on what they are interested in. Typically work starts off in mathboxes, where people pretty much have freedom to do whatever they want. Two situations that typically will result in the work being moved to the main part of set.mm are (1) more than one person wants to use or develop the work and (2) the work is an mm100 theorem. There are other factors, though, that I'll discuss. And in any case it is a judgment call by me and others that may not be perfect. Just because we choose not to import from a mathbox right away doesn't mean the value of the work is less, it just means that in our possibly flawed judgment it either doesn't quite fit in with the set.mm philosophy or it hasn't found its place there yet.
So what I will primarily discuss is what I see as the "philosophy" that has been generally used so far in set.mm. Hopefully that can guide the goals that people set for themselves as they contribute to set.mm.
*2.* A very important thing to me is the philosophy of striving for simplicity wherever possible. This particularly impacts what definitions we choose.
*"Rules" for introducing new definitions*
Definitions are at the heart of achieving simplicity. While developing set.mm, two rules of thumb that evolved in my own work were (1) there shouldn't be two definitions that mean almost the same thing and (2) definitions shouldn't be introduced unless there is an actual need in the work at hand. Maybe I should add (3), don't formally define something that doesn't help proofs and can be stated simply in English in a theorem's comment.
Concerning the 1st rule, very early on, when I was starting to work with ordinals, I noticed that many books would redefine "e." (epsilon) as "<" and "C_" (subset) as "<_" (less than or equal) since those were the ordering roles they played in von Neumann ordinal theory. I figured since books did it, it must be important, so I also did that in the early set.mm. (I may have even used those symbols because reals were a distant vision at that point.) Books casually interchanged them in theorems and proofs, but formalization meant I might have 2 or more versions of many theorems. It was a constant nuisance to have to convert back and forth inside of proofs, making proofs longer. I eventually decided to use just "e." and "C_", and it made formalizations much more pleasant with fewer theorems and shorter proofs, sometimes providing more general theorems that could be used outside of ordinal theory. I never missed having the ordering symbols to remind me of their roles. I liked the feeling of being closer to the underlying set theory rather than being an abstraction layer away from it. If the "less than" role of "e." was important for human understanding in a particular case, I could simply use the English language "less than" in the comment (3rd rule) while continuing to use "e." in the $p statement.
As for the 2nd rule, there are often different choices that can be made in the details of how something is defined. It is sometimes hard or impossible to anticipate what choice is optimal until we actually start using the definition for serious work. There is also the question of whether the definition is even necessary. More than once I've found that a definition I thought might be needed (for example, one that was used in the literature proof) actually wasn't. Sometimes, like the "e." vs. "<" case above, the definition was even a hindrance and made proofs longer. Other times an intermediate definition of my own invention that isn't in the literature turned out to be advantageous for shorter proofs. It can be best to let the work drive the need for the definition and its precise details.
Another example of a literature definition that we purposely don't use, and that illustrates the 3rd rule, is Takeuti/Zaring's Russell class with symbol "Ru", which they formally define as "{ x | x e/ x }". The only theorem that uses it (in set.mm and in T/Z) is ~ ru. For the purposes of set.mm, it is wasteful and pointless since we can just define the Russell class verbally in the comment of ~ ru.
A problem that has arisen in the past is where a person has added a set of definitions from the literature, proved some simple property theorems, then is disappointed that the work isn't imported into the main set.mm. Sometimes we do and sometimes we don't, but the main issue is whether the 2nd rule above is being followed. Without knowing exactly how the definition is going to be applied, very often it won't be optimal and will have to be adjusted, and sometimes it is more efficient just to start from scratch with a definition in the form that is needed. I prefer to see some "serious" theorems proved from a definition before importing it.
As a somewhat crude example of why a definition in isolation may not be optimal, suppose someone defined sine as "$a |- sin A = ..." (not a straw man; this kind of thing has been attempted in the past), i.e. a 1-place function-like "sin A" rather than a stand-alone object "sin". While this can be used to prove a few things, it is very limiting since the symbol "sin" in isolation has no meaning, and we can't prove e.g. "sin : CC --> CC". It is an easy mistake to make if you don't understand the set-theoretical picture but are blindly copying a textbook definition. Here it is pretty obvious, but in general such issues can be subtle and may depend on the application. Also, there are often multiple ways to define something, some of which are easy to start from and others which would require a lot of background development. The best choice can't always be anticipated without knowing what background will be available.
*3.* Let me comment on the recent proposal to import df-mgm, df-asslaw, df-sgrp. These are my opinions, and correct me if I am wrong.
df-mgm: A magma is nothing but an empty shell with a binary operation. There are no significant theorems that follow from it. All the properties of binary operations that we have needed have already been proven. If additional properties are needed in the future it seems it would be better to state them directly because they will be more useful generally. Just because it appears in Bourbaki or other texts is not to me a justification: it also needs to be useful and be driven by an actual need for it. There are many definitions in textbooks that have turned out to be unnecessary.
df-asslaw: All this is doing is encapsulating the class of associative operations. The only thing that can be derived from it AFAIK is the associative law. I don't see how it would benefit proofs since additional work is needed to convert to and from it. I don't see how it helps human understanding because it is easier for a human to see "show the associative law" in the description of a theorem than be asked to learn yet another definition (an example of the 3rd rule).
df-sgrp: This is basically df-asslaw converted to an extensible structure. So in essence it duplicates df-asslaw (or vice versa depending on your point of view), violating the first rule (no redundancy). More importantly, AFAIK there is nothing that can be derived from it except the associative law, unlike say df-grp where deep and non-obvious theorems are possible. Yes, it can be used as a "layer" on top of which we can slightly simplify df-mnd and df-rng, but is it really worth having an entire, otherwise useless definitional layer just to avoid having to state "(x+y)+z=x+(y+z)"? The fact that it simplifies two definitions is a slight positive, but I'm still torn about it. It doesn't quite pay for itself in terms of reducing set.mm size, and it increases the burden on the reader who has to learn another definition and drill down another layer to understand df-mnd and df-rng. Personally I'd prefer to see "(x+y)+z=x+(y+z)" directly because it is immediately obvious without having to look at a deeper layer. In the case of the description in df-rng0, it would be more direct and less obscure to state "associative operation" instead of "semigroup operation", since most people (at the level of studying that definition) would know what "associative" means, but fewer would know what "semigroup" means.
I can't say that these will never be imported. Possibly something will drive a need for them, maybe something related to category theory where even a magma might be a useful object. But I would want to see the definitions be driven by that need when it arises; we don't even know yet whether the present extensible structure form can be adapted for that purpose. It is fine to store them in a mathbox indefinitely, but I'd like to see a true need driving their use before importing.
*4.* As for adding an exhaustive list of all possible definitions one can find in Bourbaki or whatever, as someone suggested, I don't think something like that belongs in set.mm, for all the reasons above. There are other resources that already list these (Wikipedia, Planetmath, nLab). Their precise formalization will depend on the context in which they are needed in set.mm, which is unknown until they are used. In isolation (perhaps with a couple of property theorems that basically check syntax) there is no guarantee that they are correct, even in set.mm. To restate, I think the philosophy should be that definitions should be added by need, not as busywork. Adding a definition when needed is one of the smallest parts of building a series of significant proofs. It doesn't have to be done in advance, and I don't think it is generally productive or useful to do so.
*5.* Finally, let me touch on the issue of > (greater than).
There are many non-symmetrical relations that use reversed symbols to swap arguments: "B contains A" with reversed epsilon, "B includes A" with reversed subset symbol, "Q if P" with reversed arrow, etc. I've seen all of these in the literature. If we really feel the reader will encounter them and expect set.mm to explain their meaning (which is likely explained in their textbook anyway), we could mention informally the reversed symbol usage when we introduce the forward symbol. But we don't add $a's for them because we will never use them. That is because either we would have to add a huge number of theorems containing all possible forward and reversed combinations of the symbols, or we would have to constantly convert between them inside of proofs. Both of those are contrary to a philosophy of simplicity.
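To give a feel for the "huge number of theorems" remark, here is a small sketch that enumerates the spellings of a single theorem once each relation occurrence may be written forward or reversed. Transitivity of "<" is used as the example, and the plain-English "and"/"implies" rendering is a simplification; both are assumptions of this illustration rather than anything taken from set.mm.

```python
from itertools import product


def render(lhs, rhs, flipped):
    """Write 'lhs < rhs' either forward or with the reversed symbol."""
    return f"{rhs} > {lhs}" if flipped else f"{lhs} < {rhs}"


def variants(relations):
    """All spellings of one theorem.

    `relations` is a list of (lhs, rhs) pairs meaning lhs < rhs;
    the last pair is the conclusion, the others are hypotheses.
    """
    out = []
    # Each occurrence is independently forward or reversed: 2**n combinations.
    for flips in product([False, True], repeat=len(relations)):
        parts = [render(l, r, f) for (l, r), f in zip(relations, flips)]
        *hyps, concl = parts
        hyps_text = " and ".join(hyps)
        out.append(f"{hyps_text} implies {concl}")
    return out


transitivity = [("A", "B"), ("B", "C"), ("A", "C")]
all_forms = variants(transitivity)
print(len(all_forms))
for form in all_forms:
    print(form)
```

With three occurrences of the relation there are already eight spellings of the same fact, and the count doubles with every further occurrence.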
IMO the same should be done with >, mentioning what it means in the description for <. Introducing a formal $a statement for > that will never be used is unnecessary and wasteful of resources. If we want to be excessively pedantic, we could mention in the description for < that the formal definition would be "|- > = `' <", although that seems less intuitive than simply saying that "in theorem descriptions, we occasionally use the terminology 'A is greater than B' to mean 'B is less than A'." A grade school student can (and does) easily understand that.
Basically, ">" violates all 3 "rules" for new definitions I proposed above.
Norm
--
You received this message because you are subscribed to the Google Groups "Metamath" group.
To unsubscribe from this group and stop receiving emails from it, send an email to [email protected].
To view this discussion on the web visit https://groups.google.com/d/msgid/metamath/6e9144b6-5b8f-488d-bf19-83e5a2a250e0%40googlegroups.com.


