The relevance of category and topos theory

Tim May Sun, 07 Jul 2002 11:18:20 -0700


On Sunday, July 7, 2002, at 12:23  AM, Wei Dai wrote:


> Hi Tim, it's really interesting to see you here. (For those who don't
> know, I knew Tim from the cypherpunks mailing list. Hal Finney was an
> active member of the list as well. See
> http://www.activism.net/cypherpunk/crypto-anarchy.html if you're 
> wondering
> what a cypherpunk is.) Two of the most prominent cypherpunks I know are
> now on my Everything mailing list. I wonder what that means... Anyway,
> welcome!

Thanks. One of my motivations here is to a) learn, b) have a chance to 
explain what I have learned (which helps to learn), and c) avoid 
politics and policies and similar issues completely. (Unless there is 
talk of funding a National Everything Initiative, I think it's safe to 
say that politics won't enter!). The first couple of years, especially 
the first year, of Cypherpunks was heavy on learning and explaining, but 
politics was of course important. The last several years have been 
recyclings of older ideas...this can happen to any list, of course.

I've known of the Everything list, and of course the 
Everett/DeWitt/Niven/Wheeler/Egan/Tegmark ideas for a while...I remember 
reading Larry Niven's "All the Myriad Ways" around 1970 or so. We used 
to sit around in the early 70s debating the Everett model, which a 
couple of science fiction writers were making much of, and which had 
gained new popularity after Bryce DeWitt dusted off the idea and began 
publishing a lot on it. (DeWitt assigned much credit to his student, RN 
Graham, and even called the MWI the "Everett-Wheeler-Graham" theory. Way 
too charitable to Graham, I think.)

I learned of your list a while back, but wasn't super-interested in what 
I thought (and partially still think) is a fanciful idea, akin to the 
late David Lewis' "plurality of worlds" philosophy (that everything we 
can imagine must have reality). Saul Kripke's "possible worlds" work was 
more interesting, as it is closely linked to linguistics and AI and 
predictions about the future ("If Oracle were to announce bad earnings 
tomorrow, then this is what would probably happen," a possible worlds 
"story" which is of course very close to some discussions of alternate 
_presents_. In fact, no different, except more practical.)

Enough of this digression. I'll answer your question:


>
> I remember your post on the cypherpunks list about category theory, 
> but I
> have to admit I didn't pay it much attention since it didn't seem very
> relevent at the time.

I posted to that list just to let folks know I was exploring a new area. 
And because there actually be implications for areas of interest...I 
mentioned these in that post.

>  I guess this is my second chance to learn about
> category theory, so there are some questions for you.
>
> Suppose I had the time for only one book, which would you recommend?

I would start with Lee Smolin's "Three Roads to Quantum Gravity." This 
is not about category theory, but provides a lot of the motivation. He 
discusses many of the points that I allude to. In bullet form:

-- that we are embedded in the universe, that there is no "omniscient 
observer who can see all of space-time"

-- that certain pieces of information are forever beyond our ability to 
see. These turn out to be important for why accelerating objects see a 
"temperature" of space, isomorphic to the temperature seen by a 
spacecraft hovering over a black hole expending fuel at the same rate. 
(I can explain separately if there's interest.)

-- that space appears to be discrete at the Planck scale (a la the "It 
from bit" discreteness outlined by Wheeler 30 years ago, later by 
Fredkin, Toffoli, and others, and more recently by Wolfram)--this 
discreteness has not been proved, and may not be testable for many 
decades, but all three of the routes Smolin outlines essentially predict 
that space-time is not an infinitely differentiable (smooth) manifold 
made of real numbers: the holographic/Crane/Susskind route, the loop 
gravity/spin foam route, and the currently much popularized 
string/M-brane Schwartz/Witten/etc. route.

-- that _relationships_ are more important than objects (which fits very 
closely with category theory, as I'll explain below)

-- that the proper logic of understanding cosmology, given some or all 
of the above points, is not the "omniscient" logic of 
Aristotelian/Boolean logic ("A or not-A, nothing in between"), but a 
more general form of Intuitionist (bad political connotations for many!) 
Brouwer/Heyting logic, which also happens to be the natural logic of a 
topos, a category with certain properties of logic attached to it.

Smolin spends a few pages discussing topos theory, but in such light 
detail that anyone not tuned in to spotting the phrase "topos theory" 
might even miss it. He refers to Fotini Markopoulou, a Greek woman who 
collaborates a lot with him and with the other biggies of the "loop 
gravity/spin foam" school, e.g., Rovelli, Ashketar, Baez, etc.

If you want a straight intro to category theory, with none of the stuff 
about cosmology, space-time, and such, the Lawvere and Schanuel book 
"Conceptual Mathematics: A first introduction to categories" is the book 
I recommend.

But perhaps before reading a book, surfing the Web is better. There are 
many tutorials and primers, and one can read a bunch of them to get the 
lay of the land. I've mentioned John Baez's site a couple of times. 
Chris Hillman has a good "Categorical Primer," but it's incomplete. Some 
authors, like Barr and Wells, have even put their books up on the Web.

Reading any of the main books will not show many _apparent_ links to 
Tegmark, Schmidhuber, etc. The same applies to reading Kleene's 1952 
opus, "Metamathematics," or Lang's "Algebra," or any math book. None of 
these will appear to be closely relate tp theories of everything, or 
multiverses. One has to read between the lines. For example, when 
reading about categories, think of Tegmark's diagram showing links 
between "Abelian Fields," "Manifolds with Tensors Fields," etc.

A categorical approach is cleaner and shows the morphisms more clearly.

Take a look at category theory for about 2 days (of 2-3 hours a day) and 
then look at Tegmark's paper again.

(This is just for the formalisms he talks about, not the underlying 
philosophical point.)



> Also,
> can you elaborate a bit more on the motivation behind category theory? 
> Why
> was it invented, and what problems does it solve?

OK, on to some history.

Mathematics had two great waves of consolidation and formalization in 
the past century. Around 1920 much had been done with symmetries, 
embodied in group theory. Much was known about integers and even the 
real number line, embodied in the work of Stephen Dedekind in the 19th 
century. Also, G. Peano, etc. This was mostly the algebra of rings and 
fields. (On mostly separate tracks, a lot of work in topology, logic, 
set theory.)

Two associates/students of David Hilbert largely gave us our modern 
outlook on math in the 1920s: Emil Artin and Emmy Noether. In 
influential lectures in Germany during the 1920s they consolidated all 
of the bits and pieces about groups, rings, fields, modules, and vector 
spaces into a coherent, axiomatized whole. Others in this milieu were 
Herman Weyl, John von Neumann (different country, but still same 
milieu), David Birkhoff (in the U.S.), and others. One of those who 
transcribed Artin's and Noether's lectures was van der Waerden, who 
wrote an enormously influential book, "Moderne Algebra," published 
around 1930-32 in German. Van der Waerden established the 
"groups-rings-fields" structure of nearly all of the abstract algebra 
books to come later, such as G. Birkhoff (David's son) and Mac Lane's 
1941 text in English, Jacobson's early 50s text, Mac Lane and Birkhoff's 
more advanced text, Herstein's "Topics in Algebra," and so on. One of 
Emil Artin's students (both Artin and Noether moved to the U.S. in the 
early 1930s, part of the enormous wave of refugees from Germany and 
Europe) was Serge Lang, author of a dozen or more "classics." (Warning: 
Lang's books are dry, but complete.)

OK, so that was the first great wave of consolidation in "real" 
mathematics. (By "real" I mean the stuff mathematicians actually use 
every day...there were of course consolidations in foundational areas 
happening at about the same time, via Russell, Whitehead, Godel, Curry, 
Church, Kleene, etc.)

This was the "abstract math" we all know and love.

Around 1940 there was much work on applying algebra to topology, e.g., 
counting holes in surfaces, seeing which loops drawn on a surface could 
be contracted to points and which could not. Two of those working on 
this were Samuel Eilenberg, a young emigre from Poland (dubbed 
"S-squared, P-squared, for "Smart Sammy the Polish Prodigy") and an 
equally young Saunders Mac Lane, the same as the author of the Birkhoff 
and Mac Lane basic text on algebra.

The combination of algebra and topology is of course algebraic topology.

What Eilenberg and Mac Lane realized is that certain _structures_ were 
present in both algebra and topology, and that _structure-preserving 
maps_  were showing up all over the place. For example, deformations of 
an object would have the same structure as algebraic transformations. 
Even examples like counting holes in surfaces were in a deep sense 
isomorphic to counting things in purely symbolic or algebraic structures.

They began thinking about what the abstract ideas were, and came up 
with  the core ideas:

-- that maps, or arrows, or morphisms go between objects

-- that the arrrows themselves can be studied, and that arrows between 
the arrows (morphisms of morphisms) are interesting to look at. (The 
arrows between arrows, or the morphisms between morphisms, are usually 
called "functors.")

(Digression: For example, look at _languages_. Words in English are 
mapped into their plural form in various ways, usually by adding an "s." 
Sometimes in other ways ("oxen"). The morphisms between words can be 
compared (mapped) to morphisms between words in other languages, e.g., 
Chinese, German. There are functors going between plural formation in 
English and plural formation in German. And between other structures in 
each of these languages. In this sense, the "structures" of different 
languages can be diagrammed and compared in a more interesting way than 
simply talking about them or just by compiling descriptions.)

-- that we can often take the set of objects and morphisms and view them 
as a "picture" (a model, sort of) of relationships

-- that "natural transformations" are ways of "sliding" this picture 
into other categories, other domains

-- critical to much of category theory is the analysis of diagrams, 
showing arrows from A to B, or A --> B, arrows from C to D, etc., and 
then figuring out what is needed to make such diagrams commute (as an 
example). This leads to things I won't try to explain here (for space, 
and because I'm not the best person to look to for such explanations) 
such as "pushouts" and "pullbacks." And it turns out that these diagrams 
are closely linked to physics issues (!). (If you turn on your morphing 
engines and begin playing with the words, you probably are already 
speculating that maybe Feynman diagrams are a kind of category theory 
diagram. And you would be right. See John Baez's "From Causal Sets to 
Feynman Diagrams" for more.)

(Digression: By talking to you folks, by attempting to explain these 
ideas, I am essentially trying to take the picture which is inside _my_ 
head and plant some semblance of it in _your_ head. This is a kind of 
natural transformation, albeit much more complicated and fuzzier than 
most nice, formal, pristine examples in math!)

So in 1945, delayed for 3 years by the war, Eilenberg and Mac Lane 
published their influential and very long paper on what they called 
"categories." A category was like a set, except made up of objects and 
arrrows between the objects. Ordinary sets formed the category SET. 
Vector spaces formed the category VECT, and so on, for essentially all 
objects and relationships known to mathematics. And of course there are 
then ways to map one category into another, or to compare two categories 
(actually mapping and asking about the mapping are of course essentially 
the same thing, with just a change of direction or emphasis).

Eilenberg and Mac Lane used category theory as a concise and unifying 
notation for talking about mathematics, especially for algebraic 
topology. Use of category theory for algebraic topology (particularly an 
important branch called "homology theory") became widespread in the late 
40s and ever since.

Aside: They coined the term "category." Perhaps they should have called 
them something like "structures." Then category theory would be 
understand at a glance (via those morphisms in our everyday language, 
those patterns!) as "structure theory." Or as "pattern theory." And then 
probably more people would have understand why it would likely to be 
very important.

But much more was to come.

In the late 1950s a couple of important extensions (obscure pun not 
intended) of Eilenberg and Mac Lane's "notational language" happened. 
Daniel Kan discovered the "Kan extension" and proved something called 
the "adjoint functor theorem." Now physicists have used "adjoints" and 
"self-adjoint" concepts for years. When the category theory and algebra 
people showed the power of their diagrams, physicists began to take 
notice.

And at around that time the great mathematician Grothendieck developed 
some notions which led to a particular kind of category called a topos. 
(No space in this already too-long article to try to explain what a 
topos is.) William (Bill) Lawvere was a young student of Eilenberg's. He 
was convinced that topos theory could provide a foundation for 
mathematics equvalient in power to that of set theory. Instead of sets 
consisting of points and axioms about inclusion we would have objects 
(not necessarily consisting of points) and arrows (morphisms). According 
to the story, Mac Lane warned him away from attempting this. The story 
goes that Eilenberg and Mac Lane were flying down to consult for the 
Pentagon in 1963 and Eilenberg handed Mac Lane the recently completed 
thesis of Lawvere. Bill Lawvere had largely established that toposes 
could form the basis of all of mathematics. (Some worked needed to be 
completed, and this was largely done by 1970.)

Meanwhile, a couple of major theorems were solved in the 1960s with the 
use of category theory: the Weill Conjecture and Paul Cohen's work on 
"forcing" to

Let me give a description I found of just one course in topos theory. 
This will mention a lot of buzzwords, including folks like Saul Kripke 
(logician, "possible worlds"), Grothendieck, etc. Don't expect to grok 
this from a course summary (!), but I include it to show some of the 
links:

http://www.math.uu.se/~palmgren/topos-eng.html

"Topos Theory, spring term 1999

"A graduate course (6 course points) in mathematical logic.
------------------------------------------------------------------------

"Topos theory grew out of the observation that the category of sheaves 
over a fixed topological space forms a universe of "continuously 
variable sets" which obeys the laws of intuitionistic logic. These sheaf 
models, or Grothendieck toposes, turn out to be generalisations of 
Kripke and Beth models (which are fundamental for various non-classical 
logics) as well as Cohen's forcing models for set theory. The notion of 
topos was subsequently extended and given an elementary axiomatisation 
by Lawvere and Tierney, and shown to correspond to a certain higher 
order intuitionistic logic. Various logics and type theories have been 
given categorical characterisations, which are of importance for the 
mathematical foundations for programming languages. One of the most 
interesting aspects of toposes is that they can provide natural models 
of certain theories that lack classical models, viz. synthetic 
differential geometry.

"This graduate course offers an introduction to topos theory and 
categorical logic. In particular the following topics will be covered: 
Categorical logic: relation between logics, type theories and 
categories. Generalised topologies, including formal topologies. 
Sheaves. Pretoposes and toposes. Beth-Kripke-Joyal semantics. Boolean 
toposes and Cohen forcing. Barr's theorem and Diaconescu covers. 
Geometric morphisms. Classifying toposes. Sheaf models of infinitesimal 
analysis.

"We will assume some familiarity with basic category theory, such as is 
obtained in courses in domain theory or algebra. The course will be 
given English, in case someone requests this. "

(Sorry for the quoted material, but it sometimes helps to see someone 
else talking about something. At least I find it does.)


I'd best move on to answering or commenting the rest of Wei Dai's 
questions:

>  What's the relationship
> between category theory and the idea that all possible universes exists?

I believe it provides a natural language for talking about time-varying 
sets and "sheaves" (Jargon Alert: the "sheaf" in mathematics may not 
match exactly what one pictures a "sheaf of histories" to look like. But 
mathematicians usually pick names that bear _some_ resemblance to 
ordinary things, e.g., fibers, fiber bundles, sheaves, pre-sheaves, 
partially ordered sets, spaces, etc.)

Personally, I'm not (yet) "taking seriously" either the David Lewis 
"plurality of worlds" or Max Tegmark "everything" or Greg Egan "all 
topologies model" ideas. At least not yet. I need to learn a lot more of 
the language first.

(Without a good language, we end up just _talking_ and _speculating_. 
I'm finding my category theory reading is giving me a welcome new 
perspective on issues I've long been fascinated with, e.g., if a single 
atom were to have been moved on Sirius a million years ago, would the 
world around us be different? And how much different? The cascade of 
changes has what topology? Is it the Pachinko topology?)

My strong hunch is that the universe at these levels (nature of 
space-time) will need all of the mathematical tools we can muster. Now, 
I'm not saying one needs to be an expert at group theory, for example. 
Or that one needs to know all of algebraic topology.  But it's pretty 
clear to me that mathematics has historically been our most important 
tool for all of the interesting branches of physics. No one can get far 
in relativity or quantum mechanics without mastering some interesting 
math.

For me, category theory has been a stunning window on things. I regret 
that it is not taught, or even mentioned, to most physicists.  (This may 
be changing, with the books by folks like Robert Geroch, who starts with 
category theory and covers much of modern physics from this perspective, 
and the progress by the string theory and quantum gravity communities.)


> Does it help understand or formalize the notion of "all possible
> universes"? I know in logic there is the concept of a categorical theory
> meaning all models of the theory are isomorphic. Does that have anything
> to do with category theory?

Yes, there are deep and important connections. Models form a category. 
The book I mentioned by Paul Taylor, "Practical Foundations of 
Mathematics," is very good on these issues.

In my view, category theory (and topos theory) represents the "modern" 
way of looking at a lot of seemingly unrelated areas.

As we know, as Hal Finney and several of us used to discuss about ten 
years ago on the Extropians list, Chaitin's formulation of algorithmic 
information theory gives us a much more understandable and 
comprehensible proof of Godel's Theorem than Godel himself gave! (For 
the best explanation of this, and why this is so, either see Greg 
Chaitin's own papers and books or the wonderful summaries by Rudy Rucker 
in his "Mind Tools" book.

These modern viewpoints are much more comprehensible than the classics. 
Which is not surprising. Shoulders of giants and all that.

The same is true of category theory. It's a relentlessly modern approach 
to seeing the similarities that pervade mathematics and physics. Whether 
it answers the question about whether lots of other universes exist is 
doubtful...I'm not convinced we'll know the answer to that question in 
the year 3000.

But it's a powerful and elegant approach, with perhaps a slightly 
misleading name, and it looks to me to be the right language for talking 
about the world around us and possibly the worlds we cannot directly see.

I agree with Lee Smolin that topos logic is not just the logic of 
cosmology, but also the logic of our everyday world of limited 
information, bounded rationality, Bayesian decision making, and 
information horizons. Even if this is not useful for answering questions 
about "the Everything theory," because we may need to wait 600 or 6000 
years for experimental tests to become feasible, I believe this outlook 
will be of great utility in many areas.

I'll keep you all posted!

--Tim May

The relevance of category and topos theory

Reply via email to