This year we give thanks for an idea that is central to our modern understanding of the forces of nature: gauge symmetry. (We’ve previously given thanks for the Standard Model Lagrangian, Hubble’s Law, the Spin-Statistics Theorem, conservation of momentum, effective field theory, and the error bar.)
When you write a popular book, some of the biggest decisions you are faced with involve choosing which interesting but difficult concepts to tackle, and which to simply put aside. In The Particle at the End of the Universe, I faced this question when it came to the concept of gauge symmetries, and in particular their relationship to the forces of nature. It’s a simple relationship to summarize: the standard four “forces of nature” all arise directly from gauge symmetries. And the Higgs field is interesting because it serves to hide some of those symmetries from us. So in the end, recognizing that it’s a subtle topic and the discussion might prove unsatisfying, I bit the bullet and tried my best to explain why this kind of symmetry leads directly to what we think of as a force. Part of that involved explaining what a “connection” is in this context, which I’m not sure anyone has ever tried before in a popular book. And likely nobody ever will try again! (Corrections welcome in comments.)
Physicists and mathematicians define a “symmetry” as “a transformation we can do to a system that leaves its essential features unchanged.” A circle has a lot of symmetry, as we can rotate it around the middle by any angle, and after the rotation it remains the same circle. We can also reflect it around an axis down the middle. A square, by contrast, has some symmetry, but less — we can reflect it around the middle, or rotate by some number of 90-degree angles, but if we rotated it by an angle that wasn’t a multiple of 90 degrees we wouldn’t get the same square back. A random scribble doesn’t have any symmetry at all; anything we do to it will change its appearance.
That’s not too hard to swallow. One layer of abstraction is to leap from symmetries of a tangible physical object like a circle to something a bit more conceptual, like “the laws of physics.” But it’s a leap well worth making! The laws of physics as we experience them here on Earth are, like the circle, invariant under rotations. We can do an experiment — say, the Cavendish experiment to measure the strength of gravity between two test bodies — in some given laboratory configuration. Then we can take the entire laboratory, rotate it by a fixed angle, and do the experiment again. If you do it right, you will get the same result, up to experimental errors. (Note that the Cavendish experiment is wickedly hard, so don’t try this at home unless you’re really up to it.) Likewise for other kinds of experiments, like measuring the charge of the electron. The laws of physics are invariant under rotations: you can rotate your experiment and get the same result, just like rotating the circle leaves you with the same geometrical figure.
Now to kick it up an additional notch, imagine you have a friend located in the lab down the hall, doing the same experiment. They will get the same results that you do for the strength of gravity or charge of the electron. That’s due to another symmetry — the laws of physics are invariant under translations (changes of position). And, of course, the invariance under rotations still holds; if anyone were crazy enough to pick up both labs at once, rotate the whole building by some fixed amount, put them back down, and do the experiments again, we would once again expect the same answer.
Your intuition tells you that there’s more to it than that, and your intuition is right. We don’t have to pick up the whole building with both labs inside; we should be able to rotate the apparatus in just one of the buildings, leaving the other one unchanged, and still get the same experimental results. But notice that this isn’t a single rotation of the whole world, as in our previous examples; now we’re rotating the two experiments separately, so their orientation changes with respect to each other.
That’s a gauge symmetry: when a symmetry transformation can be separately carried out at different points in space. Gauge symmetries are sometimes called local symmetries, since we can do them independently (locally) at every point; they are to be contrasted with global symmetries, which need to be done in a uniform way all over the place. It can be confusing, because “local” sounds like it’s less than “global,” whereas really a local/gauge symmetry represents enormously more symmetry than a mere global symmetry — infinitely more, since the transformations can happen completely independently at every point.
Fair enough, and hopefully it all makes sense. Here’s the subtle point: how do you know if one laboratory has been rotated with respect to another one? How are you able to compare the orientations of laboratories at different locations?
Doesn’t sound like it’s too difficult a question; you can use some surveying equipment, or for that matter just look at the other experiment if they’re close enough together. But while doing that you are taking advantage of the structure of space itself, something so fundamental that we typically don’t even notice it’s there. In particular, we have the means for comparing locations and orientations of distant circumstances, by traveling back and forth between them or sending signals of some sort. As we travel (or signals propagate), we are able to keep track of the location and orientation of the circumstances we left behind. Pretty amazing, when you think about it.
In order to compare things that are set up at different locations, what we are implicitly relying on is a field that stretches between the locations. The mathematical name for the kind of field we need is a connection, because it helps connect what’s going on at different points. In physics it’s called a gauge field, because Hermann Weyl introduced an (unhelpful) analogy with the “gauge” measuring the distance between rails on railroad tracks.
You might think of a gauge field as a latticework of invisible lines running through the universe, keeping track of what counts as “staying parallel” and “moving on a straight line” as we travel through space. But it’s a venerable principle of quantum field theory that, once you have a field, that field can have its own dynamics — it can bend and twist through space, typically in response to other fields that it interacts with. And when your gauge field starts twisting, you feel it as a force of nature.
Think of you and your friend doing separate experiments. If you were just in different rooms in the same building, you can travel between on a flat floor, and you aren’t feeling any forces. But if you’re doing your experiments outdoors on a rolling hillside, the ground beneath your feet pushes you back and forth as you walk over the hills. In this case, the structure of the ground itself defines a connection field, and its curvature gives rise to a force.
That’s literally a down-to-Earth example. More fundamentally, there is a connection field on spacetime itself, which tells us how to walk on straight lines (geodesics) and compare orientations at different points. And this connection can be curved, and that curvature gives rise to a force of nature, one we call “gravity.” We’ve just invented the theory of general relativity.
General relativity is based on a rather straightforward set of symmetries: the rotations and translations we’ve already mentioned, plus “boosts” relating frames of reference moving with respect to each other. (All told, the Poincaré group.) What about the other forces — electromagnetism and the strong and weak nuclear forces? Nothing nearly so tangible, I’m afraid. These are all based on “internal” symmetries — they don’t transform things within space, but rather rotate different fields into each other. For example, you may have heard that quarks come in three different colors: red, green, and blue. It doesn’t matter what color you call a particular quark; therefore, there is a symmetry in which you rotate different colors into each other. Mathematically it takes the structure of the group SU(3), and the gauge field associated with it gives rise to the strong interactions. Electromagnetism and the weak interactions follow a simple pattern. Gluons, photons, and W/Z bosons all arise from different kinds of connection fields relating the symmetry transformations at different points in space.
Electromagnetism, indeed, was the first force for which we were able to understand that it was based on a gauge symmetry. General relativity was next, but interestingly the fact that GR is based directly on spacetime symmetries rather than internal symmetries actually makes it something of a special case, so the connection (pardon the pun) wasn’t as obvious. (Although it’s right there in my GR book.) It was Yang and Mills in the 1950’s who took the bold step of suggesting that gauge theories might be at the heart of the nuclear forces as well, although similar notions had been contemplated before.
The reason why the Yang-Mills idea wasn’t tried earlier, and didn’t catch on right away, is that forces based on gauge symmetries seem at first blush to have a universal and immediately-noticeable feature: they stretch over infinitely long ranges. That is the case for both general relativity and electromagnetism, and the mathematical structure of connection fields seems to imply that is should always be true. (This is a statement I could not for the life of me think of how to justify at a hand-waving level — anyone have any ideas?) In particle-physics language, the boson particle you get by quantizing the gauge field should be massless, like the photon and the graviton. But the nuclear forces are manifestly short-range, so the idea wasn’t immediately successful.
The answer to this dilemma is a little something called … the Higgs mechanism! By introducing yet another field (the Higgs field) that has a nonzero value everywhere in space, you can give the gauge bosons a mass in a way that is completely compatible with the mathematics. It’s the triumph of that idea has been seemingly vindicated by the discovery of the Higgs boson.
Interestingly, it turns out that Yang-Mills theories don’t have to give rise to long-range forces even if the bosons do stay massless. Imagine there were no Higgs field (and also no other effect that led to spontaneous symmetry breaking), so that the W and Z bosons of the weak interactions (or their pre-symmetry-breaking precursors) remained exactly massless. Unlike the photon, these bosons interact directly with each other, and at low energies those interactions would become very strong. Sufficiently strong that weakly-interacting particles would be confined, and the weak force wouldn’t be able to stretch over long distances. This is, of course, exactly what does happen with the strong nuclear force; gluons are massless, but the strong force is confined and therefore short-range. Perhaps we’re lucky that the physics of confinement wasn’t discovered until after the Higgs mechanism, or the latter might have taken a long time to figure out.