I’ve heard Erik Meijer quoted as saying the world is imperative and mutable, and so we should embrace side-effects1. I’ve also heard what I interpret as pretty much the opposite from Rich Hickey2. So, which is it?
I’m not sure, but maybe a story about dinosaurs will help.
Somehow we all escaped with our lives
It was a brisk winter’s night. I had just parked my car down at The Rocks, a beautiful part of Sydney right on the Harbour, and had started hurrying down George Street to attend the monthly Drop Bear Awareness Society meetup. I had only gone a couple of hundred metres when hordes of people started flooding out of Circular Quay directly across the road from me, accompanied by yells, screams and sirens. I was swept along with the crowd, herded by police and emergency services into Town Hall station and onto waiting trains. As my train pulled away from the platform I only knew one thing for certain; people were talking about a “monster” with a disturbing amount of credulity.
Turns out that a 50 metre high dinosaur-like monster had chosen that day to emerge from the depths of Sydney Harbour and take a brief stroll through the CBD. It was apparently on land for about 10 minutes, after which time it had grown bored and decided go head to Melbourne where it could destroy something more tasteful.
I returned to my car the next day. It was flattened; the imprint of a giant footprint clearly visible in the twisted wreckage. But I wasn’t going to give up on my beloved car. It took weeks of extensive work at the smash repairer, but I finally got my car back, pretty much as good as new.
State, mutations and time
There are two ways of looking at this story (three if you count “as an unimaginative, poorly-told piece of tripe”). One is from the perspective of mutable state, as commonly associated with imperative programming3. I had a working car, then the car’s state changed to “smashed by monstrous dinosaur”, then it changed back to working. It was the same car, its state was just changed, and rather forcibly at that. We can even write a program to express what happened:
The SmashWith
and Repair
methods have side-effects; they mutate the State
property of the car as the program executes. One interesting result of this is that the State
property is always tied to the concept of “now”; its value is inextricably linked to the point in time at which it is called. We have several states in this program, but whenever we talk about state, we are only referring to the current state.
I tend to think of the concept of time being an implied part of this representation. Any state change requires a change in time, but this representation never refers to time explicitly. The time is always “now”, and evaluating the next statement is always done in that context. From the perspective of “now”, this representation is completely valid.
For programming in particular though, this perspective is not without its problems. Because time is implicit in this representation, “now” changes from instruction to instruction, and so it is possible for operations to interfere. Any contention for the state between operations, such as concurrent access (multi-threading) or temporal coupling (Repair
only makes sense when the car is smashed after a call to SmashWith
) can result in an inconsistent state. To reason about a program, we also have to mentally evaluate it by stepping through all the instances of “now” that can occur.
There is another, equally valid way of thinking about this story. In this telling we make the time variable explicit. The state of the car is a function of time:
Here we still have state changes, but they aren’t destructive. Just because my car is now fixed does not mean that is was never smashed by a giant, 50 metre high mostrosaur. That state was still perfectly valid, it was just at a different time. Because time is an explicit input it is not possible to get interfering states. The output depends purely on the input, and is defined for all values of t
.
One effect of treating state as a function of time is that data necessarily becomes immutable. Once we have data for some time t
, that data is never going to change; we can’t change the past. A change in state over time will result in a new piece of data.
And just because we can explicitly handle time does not mean we have to expose it. Immutable data is enough to ensure we don’t interfere with previous states:
One thing that may seem troubling is we now have different instances of data representing the one car, but we can associate these using identity (the car’s VIN, make, model, etc.) rather than by reference as per the mutable-imperative approach. The state of a car with a specific identity can be considered a function of time and identity with an immutable data representation as the output. Again, all these states are valid, they actually occurred in the story, and we just need to be explicit on which moment of reality we need to work with.
Imperative world?
So the world is imperative, full of mutations and side-effects. But it seems equally valid to say that the world is a function of time (and other variables) that outputs immutable states. Perhaps both perspectives are two sides of the same coin, just separated by their treatment of time.
To me the potential equivalence of these perspectives matches my admittedly-basic-and-quite-probably-wrong interpretation of the Church-Turing thesis: all computable programs are computable by both a Turing machine and using the λ-calculus. A Turing machine relies on mutable state, λ-calculus on pure functions. They’re all able to express the same set of programs, and they can all represent “real world” information and calculations.
The question for programmers is which perspective on state makes programs easier to reason about and work with. And if the answer varies from problem to problem, what particular aspects of a problem make it well-suited to that perspective?
Intuition
One thing to consider before you answer is how much of our opinion is swayed by what John Backus referred to as “the primacy of the von Neumann computer”4.
Most of us have been trained from our very first experience with a computer (Assembler? C? C64 Basic? Java?) until now to think about programs in terms of the von Neumann architecture, where we process an instruction at a time and store to and load from mutable registers. In evaluating these perspectives we need to overcome decades of language and architecture bias. Both perspectives are equally valid when talking about the world around us, both can express the entire set of computable programs, and yet most platforms and languages emphasise mutability.
So it seems reasonable to be a little suspect of both our intuition and of the status quo in this case. Intuitively we understand that things change their state, but logically we also recognise that state depends on time, and a new state at a particular time does not destroy the state at the previous time.
Conclusion
In summary, don’t park right next to a large, city-side harbour unless you have comprehensive car insurance.
And let’s challenge ourselves about the idea that mutability is somehow essential to programming and to expressing concepts from the “real world”.
But mainly the parking thing.
Erik Meijer from GOTO Night August 23, 2012 in Chicago: one, two, three↩
Imperative programs are just sequences of instructions and can also be done without mutable state, although some definitions include mutability as an essential feature.↩
“Can Programming Be Liberated from the von Neumann Style? A Functional Style and Its Algebra of Programs”, John Backus, Communications of the ACM August 1978. PDF link↩