We have spent eleven chapters moving information around. But how much is there? Before quantum, a beautifully simple idea answers it: information is surprise — and surprise, it turns out, is a number you can compute.
“The sun rose this morning” tells you almost nothing — you already knew it would. “It snowed in the desert” tells you a lot. The rule behind the feeling: a rare event carries more information than a common one. If something is certain, learning it gives you zero bits; the less likely it was, the more bits land when it happens. Information isn’t about the content of a message — it’s about how much it narrowed down what you didn’t know.
Play Twenty Questions. Each good yes/no question halves the remaining possibilities. To pin down one of 8 equally-likely things you need 3 questions (8 → 4 → 2 → 1), and log₂8 = 3. Entropy is exactly that: the average number of yes/no questions you’d need to nail down the answer. A fair coin needs 1 question; a coin you already know is two-headed needs 0.
Slide the coin from fair toward loaded. A fair coin is the most uncertain — every flip is worth a full bit. Bend it, and each flip becomes more predictable, so it carries less information. At the extreme of a two-headed coin, a flip tells you nothing at all.
Give each outcome a surprise of log₂(1/p) = −log₂p bits — big when p is tiny, zero when p = 1. The entropy is just the surprise you expect on average — each outcome’s surprise, weighted by how often it shows up:
The unit is the bit: the answer to one ideal yes/no question. Entropy is, quite literally, the average number of yes/no questions you’d need to pin down the outcome — and the log base 2 is what makes that count come out in bits rather than nats or digits.
“A long, detailed message must carry more information.” Not necessarily. Entropy measures uncertainty removed, not length or importance. A thousand-page book that only ever says “yes” carries one bit; a single sharp, unexpected answer can carry many. Information lives in what you didn’t already know.
Hold onto this picture. In the next chapter we feed exactly this formula a quantum object — a density matrix’s eigenvalues — and out comes the von Neumann entropy, the quantum measure of how mixed a state really is.