My favourite pedagogical principle: examples first!

This post is about a very simple idea that can dramatically improve the readability of just about anything, though I shall restrict my discussion to the question of how to write clearly about mathematics. The idea is more or less there in the title: present examples before you discuss general concepts. Before I go any further, I want to make very clear what the point is here. It is not the extremely obvious point that it is good to illustrate what you are saying with examples. Rather, it is to do with where those examples should appear in the exposition. So the emphasis is on the word “first” rather than on the word “examples”.

If this too seems pretty obvious, I invite you to consider how common it is to do the opposite. Open a textbook about some general concept in mathematics — Banach algebras, say — and the chances are very high that it will start with a formal definition of Banach algebras and only then give you a few examples. I myself became consciously aware of the principle as a result of editing the Princeton Companion to Mathematics: over and over again I found that I could make an article clearer by putting the authors’ well-chosen examples earlier in their discussion.

Why should it be better to do it that way round? Well, if a general definition is at all complex, then you will have quite a lot to hold in your head. This can be difficult, but it is much easier if the various aspects of the definition can be related to an example with which you are familiar. Then the words of the definition cease to be free-floating, so to speak, and instead become labels that you can attach to bits of your mental picture of the example.

By now the alert reader will have noticed that I have not practised what I have preached. So let’s forget all about the discussion so far and start again, this time doing things properly.

My favourite pedagogical principle: examples first!

Which of the following two explanations do you find clearer and easier to read? They are intended to introduce the concept of a field to a reader who knows what a binary operation is and knows basic definitions such as those of commutativity and identity elements.

Explanation 1. A field is a set X together with two binary operations, for which one conventionally uses the notation of addition and multiplication, that has the following properties. Both operations are commutative and associative and have identity elements. Every element of $X$ has an inverse under the operation +, and every element other than 0 (the name given to the identity of the operation +) has an inverse under $\times$ as well. Finally, we have the rule known as the distributive law: $x(y+z)=xy+xz$ for every $x$ , $y$ and $z$ in $X$ .

If we interpret + and $\times$ to be the usual operations of addition and multiplication, then we readily see that the familiar number systems $\mathbb{Q}$ , $\mathbb{R}$ and $\mathbb{C}$ are fields. (If one goes right back to first principles, then these statements cease to be obvious, but we shall take facts such as the commutativity of multiplication of complex numbers as already established.) By contrast, the number systems $\mathbb{N}$ and $\mathbb{Z}$ are not fields, since there is no additive identity in $\mathbb{N}$ and not every element of $\mathbb{Z}$ has a multiplicative inverse. Less obvious examples of fields are number fields (subfields of $\mathbb{C}$ that contain $\mathbb{Q}$ ) such as the field of all complex numbers of the form $a+b\sqrt{-3}$ where $a$ and $b$ are rational, which is denoted $\mathbb{Q}(\sqrt{-3})$ . (All the field properties are very easily verified, with the exception of the existence of multiplicative inverses: but even that is a simple exercise.) Another important source of examples is the collection of finite fields, of which the simplest cases are obtained by taking a prime p and the set of all integers modulo p. (Here again the only field axiom that is not almost trivial to verify is the existence of multiplicative inverses — for that one needs Euclid’s algorithm.)

Explanation 2. The five main number systems, $\mathbb{N}$ , $\mathbb{Z}$ , $\mathbb{Q}$ , $\mathbb{R}$ and $\mathbb{C}$ , though different from each other, have many features in common. For example, if $X$ is one of these number systems and $x$ and $y$ are numbers in $X$ , then one can add or multiply x and y together. One may also be able to subtract $y$ from $x$ or divide $x$ by $y$ , but this is not always possible, at least if one wants to stay in the same number system. For example, if we are confined to $\mathbb{N}$ , then we cannot subtract 5 from 3, and if we are confined to $\mathbb{Z}$ then we cannot divide 5 by 3.

It is noticeable that there are far fewer problems of this kind in $\mathbb{Q}$ , $\mathbb{R}$ and $\mathbb{C}$ than there are in $\mathbb{N}$ and $\mathbb{Z}$ . In these larger number systems subtraction is always possible (as it is in $\mathbb{Z}$ ), and so is division, provided only that one does not try to divide by 0.

Returning to the properties that these number systems share, we notice that in all five of them addition and multiplication are commutative and associative, and they obey the distributive law: $x(y+z)=xy+xz$ for every $x$ , $y$ and $z$ in the number system in question.

A field is a mathematical structure that has the basic properties of the larger number systems $\mathbb{Q}$ , $\mathbb{R}$ and $\mathbb{C}$ . In other words, it is a set $X$ on which two binary operations, which we think of as addition and multiplication and therefore denote using conventional notation for these operations, are defined. These operations must both be commutative and associative, and they must obey the distributive law. Also, both operations must have identities, every element must have an inverse under the additive operation, and every element other than 0 (which is defined more formally as the additive identity) must have an inverse under the multiplicative operation as well. Once we have these inverses, we can easily define subtraction and division: $x-y$ is the sum of $x$ and the additive inverse of $y$ and $x/y$ is the product of $x$ and the multiplicative inverse of $y$ .

Thus, a field is basically an algebraic structure that “behaves like $\mathbb{Q}$ , $\mathbb{R}$ or $\mathbb{C}$ ,” in the sense that it has two binary operations that obey the algebraic rules that one observes in those number systems.

The concept of a field is important because besides $\mathbb{Q}$ , $\mathbb{R}$ and $\mathbb{C}$ there are some less obvious examples that play a central role in number theory. Most notable are number fields and finite fields. The former are fields such as $\mathbb{Q}(\sqrt{-3})$ , which consists of all complex numbers of the form $a+b\sqrt{-3}$ where $a$ and $b$ are rational. (In general a number field is a subfield of $\mathbb{C}$ that contains $\mathbb{Q}$ .) In a number field it tends to be very easy to verify all the field properties, with the exception of the existence of multiplicative inverses: but even that is usually a simple exercise. The simplest examples of finite fields are obtained by taking the set of all integers modulo a prime p. Here again the only field axiom that is not almost trivial to verify is the existence of multiplicative inverses — for that one needs Euclid’s algorithm. End of explanation 2.

I hope very much that you found the second explanation vastly preferable, or that if you didn’t, then at least you found that it had some quality of speaking directly to the reader that the first explanation lacked. What is the main difference between the two explanations? The content is more or less the same. But there is an important difference in the way that content is organized: in the first explanation the abstract definition of a field is given first and is then followed by some examples, whereas the second starts with the examples (or at least some of them) and uses them as a springboard for a more general discussion. Why should it be an advantage to put the examples first? Well, try to imagine the reaction of a reader who does not know what a field is. At the beginning of the first explanation she [I decided on the sex of the reader by tossing a coin, by the way] is presented with a list that is not related to her previous mathematical experience. Therefore, it is extremely forgettable. Probably she will go on to read about the examples and then look back at the definition to check that it really does apply to them — a clear sign that the order is pedagogically unnatural. By contrast, if she reads the second explanation then the field axioms are describing something, namely her mental picture of a few fields that she already knows. So she has no need to commit anything to memory or to look back on parts of the text that she has not fully understood.

The comparison may seem unfair because the second explanation was longer, and spent a bit longer discussing the number systems. But that was such a natural thing to do when one started the discussion with the number systems that I think of it as almost a consequence of the policy of putting the examples first.

Back to the top level of this post.

And so, which one of those two explanations of the pedagogical principle did you prefer? I hope very much that you preferred the second. It was certainly much easier for me to explain why I think it is better to put examples first when I had an example to use to illustrate what I was talking about. (I’m referring here to the theory that the examples give you a mental picture of the concept that allows you to treat the abstract definition as a set of labels attached to concepts you already know rather than as a set of meaningless words with relationships that you just have to learn off by heart.)

When this principle occurred to me, I realized that I had sometimes put it into practice, but I had never been fully conscious of what I was doing. Now that I am, I always either put examples first, or make a conscious decision not to (perhaps because I judge that the reader can cope without). If you are not already conscious of what you are doing in this way, then try thinking about it for a while: if this post does not persuade you, then your own experience surely will — both of your own writing and of other people’s.

This entry was posted on October 19, 2007 at 9:03 am and is filed under Mathematical pedagogy. You can follow any responses to this entry through the RSS 2.0 feed. You can leave a response, or trackback from your own site.

66 Responses to “My favourite pedagogical principle: examples first!”

Adam Chmaj Says:
October 19, 2007 at 9:21 am | Reply
Dear Tim,
Going back to the previous discussion about different proofs being
essentially the same, what do you think about this work:
J. Andreasen, B. Jensen and R. Poulsen, Eight valuation methods
in financial mathematics: the Black-Scholes formula as an example,
Math. Sci. 23 (1998), 18-40.
a small Says:
October 19, 2007 at 9:32 am | Reply
Regarding the form of your RSS feed, could you include in it the complete entries ? It would make it easier for those who use aggregators (see recent example of T. Tao). Thanks.
gowers Says:
October 19, 2007 at 9:39 am | Reply
I’m afraid I need my hand held here. I don’t know what an RSS feed is, or an aggregator …
Anonymous Says:
October 19, 2007 at 10:06 am | Reply
Considering examples first is like doing agriculture, considering general definitions first is like doing philosophy. What do you like more?
Adam Chmaj Says:
October 19, 2007 at 10:38 am | Reply
Natural methods lead to better life. They are based on examples,
which are used to construct mathematical objects without algebraic
axioms. T. M. Apostol, Mathematical Analysis, explains it well on page 1:
“Several-methods are used to introduce real numbers. One method
starts with the positive integers 1,2,3,… as undefined concepts and
uses them to build a larger system, the positive rational numbers
(quotients of positive integers), their negatives, and zero. The rational
numbers, in turn, are then used to construct the irrational numbers,
real numbers like \sqrt{2} and \pi which are not rational. The rational
and irrational numbers together constitute the real-number system. “
gowers Says:
October 19, 2007 at 12:52 pm | Reply
In reply to Anonymous: I find that some philosophers are far more readable than others. By some strange coincidence the ones who are easiest to read (amongst others, Daniel Dennett and Saul Kripke spring to mind) very often begin their general discussions with extremely well-chosen examples.
Terence Tao Says:
October 19, 2007 at 3:19 pm | Reply
Dear Tim,

(Heh, let me try writing this comment following your principle.)

I read several blogs, online news web pages, and a couple other pages daily. But instead of visiting each of them regularly to check for updates, I instead use Google Reader, which “scrapes” these pages for me and combines (or “aggregates”) them into a single web page so that I can see summaries of all the new articles at once; I find this vastly more convenient. This is possible because all modern web pages with regularly updating content produce something called an “RSS feed”, which is in a format that can be automatically extracted by aggregators such as Google Reader.

You get to choose exactly what goes into the RSS feed. The WordPress default is to only summarise the first four lines or so of each article. But you can instead place the entire article (or more precisely, the article before any break point you placed to truncate the article) by going to the Options menu in your dashboard, then Reading, then Syndication Feeds, then show “Full Text” instead of “Summary”.
Terence Tao Says:
October 19, 2007 at 3:28 pm | Reply
Incidentally, returning to the main topic, there is also a dual version of the principle to write examples first, which is to read (or think of) examples first. I think at some point during graduate school I realised that a totally linear approach to reading a maths paper (reading and digesting line n before moving on to line n+1) was usually not the optimal way to understand what the paper was about, because by doing so one had to juggle a number of partly understood and non-internalised concepts, high-level strategies, and low-level tactics in one’s head all at once. Nowadays, when I read the statement of a theorem or lemma, I try to find the simplest case of that statement that is not immediately obvious to me, or is obvious but not by the methods that are going into the current proof, or which I already know to follow from the methods in the proof. In the first two cases I can then go into the guts of the proof with the example in mind and in relatively short order find out the place where something I didn’t know about before is happening; in the third case I can then guess a rough outline of how the proof can go, which greatly facilitates the actual reading of the proof (and in fact in many cases I can skip that proof for the time being and instead focus on other parts of the paper).
gowers Says:
October 19, 2007 at 3:53 pm | Reply
Terry, you’ve done it again: while I was consciously aware of the principle that a purely linear reading of a paper was a bad idea, I had not clearly enunciated to myself the idea that you’ve just put forward. Maybe if I try to put it into practice I’ll become a more efficient reader of papers (which would not be hard — at the moment if I get through a paper quickly it is usually because for other reasons I have thought to some extent about similar ideas, so it always feels like a happy accident).

I’m now confused by the earlier request concerning RSS feeds, since I looked at my options and I do seem to be giving the full text, up to the “more …” cutoff. A Small, are you asking for the whole thing, even beyond that point? Or is my RSS feed not in fact doing what I think it is? And while I’m talking about this kind of thing, I’ve been having trouble editing recently: I find that I make changes and click “Save”, only to find that the changes have not been made. In one extreme case, I almost lost the entirety of the above post — I saved it only by the desperate measure of copying a previewed version from a Mozilla window into an emacs window and changing some of the resulting funny symbols. Now I find myself unable to make any further changes — hence the inconsistent fonts. Am I running into some known problem?
JSE Says:
October 19, 2007 at 4:05 pm | Reply
This point was first brought home to me in an excellent presentation by Felder and Brent. But what they said was the following: some students respond better to examples followed by general definitions, and others to general definitions followed by examples. The first group is more numerous, but their recommendation was to use both strategies, but to be quite transparent about the choice being made — i.e. one says explicity, “OK, now I’m going to give a couple of examples to give you some idea of what I mean by the word “field,” but never fear, a formal definition will follow” or “OK, let me write down a formal definition, if it seems unmotivated and weird, never fear, examples will follow.” I’ve found this to be very good advice, at least for lecturing. I’m not sure just what I do in mathematical writing. For the PCM article I dealt with this problem by banishing precise definitions altogether!
Jason Dyer Says:
October 19, 2007 at 4:52 pm | Reply
I love examples, but not ncessarily as labels for my mental picture. They let me understand the motivation behind a formal definition. Taking your fields example, I don’t necessarily need the example as a concrete analogy; what I do need to know is why the particular properties of a field were chosen and not some other properties. (Your second version does a lovely job of encapsulating this.)

Also, thank you Terry for mentioning out-of-order reading. I’ve always tended to that but I always felt “bad” for doing so, like I was compensating for not being smart enough or some such.
Mark Meckes Says:
October 19, 2007 at 5:01 pm | Reply
I’m teaching abstract algebra for the first time this term, and I realize now that I’ve been following the principle of examples first sometimes, but not consistently. Now that my attention has been drawn to it I’ll try to do so more often. Likewise, I’ll try always reading math papers as Terry suggests, which I have done somtimes but not always.

Regarding your coin flip to choose the gender of a hypothetical reader, if you want your random choice to reflect gender representation in the general population, then according to this paper, you want to make sure you choose female when the same side lands up that started facing up.
gregknese Says:
October 19, 2007 at 5:23 pm | Reply
Maybe if writers wrote papers using this pedagogical principle, then readers wouldn’t have to read papers out of order…
Reid Says:
October 19, 2007 at 11:31 pm | Reply
Your first example is very misleading (including very poor wording). Quite frankly, I’ve *never* seen a Field defined like that; it’s normally in an ordered list and better defined which makes it *far* easier to read and understand (e.g. Nering).

Also, I’ve attended classes that have used both styles and have spoken to fellow students on this topic. With *one* exception, we /all/ preferred general first, examples latter. For me, when I get examples first, my first thought is along the lines of, “What are you getting at? Get to the point!”

This is actually one of my main complaints about Gallian. Namely, that he puts a lot of things in examples that should be listed as definitions/theorems/etc. It is really frustrating when you get to the questions at the end of a chapter and run into things that “weren’t defined”. Yet, when you go back it was, but wasn’t listed as such, and/or was in an example. Quite frankly, I find this just plain lazy on his part.

Though I’ll concede that example first *might* be more appropriate for first year class(es), in upper level classes, IMO, this is encroaching on an insult to the students intelligence.

I’ve said it to many people before, and given this applies here, I’ll say it again:

This is University, playtime’s over, it’s time for some real work.

i.e. If the student cannot be trusted to understand in a manner which is fundamental to mathematics, they don’t belong in the class and should not be catered to. I know that sounds harsh, but this is the first step on a slippery slope of dumbing down the curriculum which I see going on in a disturbing number of Universities (I’ve actually seen where it leads as well – an embarrassment of a curriculum).
toomuchcoffeeman Says:
October 20, 2007 at 12:17 am | Reply
This is University, playtime’s over, it’s time for some real work.

i.e. If the student cannot be trusted to understand in a manner which is fundamental to mathematics, they don’t belong in the class and should not be catered to.

With due respect I think that’s somewhat simplistic, no? At some point one is surely trying to educate one’s audience mathematically, rather than to sieve for the fastest and most confident.

I think most of us all have subjective views on how slow and cautious a mathematical exposition can get before it becomes tedious, and how terse and high-flown it can be before it becomes intimidating and opaque. My own view is that the latter extreme is too common, but people with other experiences might well take the other view.

Declaration of interest: I think “Banach algebras” are a perfect example of a definition that is completely uninformative for a novice, and where an approach of initial illustration through examples could be very interesting, if less concise than just getting on with the main definition.
bbs Says:
October 20, 2007 at 3:05 am | Reply
this is my favorite pedagogical principle too. in my view, all subject specific issues aside, it conveys “the right attitude” toward abstraction.

the point of abstraction is to focus on what a mathematical object “does,” not what it “is.” it is only an accident of language that definitions must begin “a [mathematical object] _is_…”

if examples do not come first, students are given the idea that the point of a formal, abstract definition is to tell us what something “really is,” and that examples are secondary (or diversions to make weaker students feel less scared of what the abstract thing “really is,” the way plugging in numbers might have made them feel better about basic algebra).

this is a misconception, and a misunderstanding of how mathematics is actually done. from calculus to quantum groups, examples came first.

i like to point out to my students that if they look in books other than the course text, they might well find other lists of axioms characterizing the subject matter (vector spaces, or real numbers, or groups, or rings, or fields), but they will always find the same examples. there is a reason why there is no mathematical version of IUPAC that standardizes lists of axioms, or terminology, or symbols for use by the community: the choices we make in what our abstractions “are” don’t matter as much as what the abstractions “do.” the more we put examples first, the better we get this message across.
James Says:
October 20, 2007 at 10:04 am | Reply
It might be the opposite of what you’d expect, but I often find it easier to read non-linearly when the paper is written in a structured Bourbaki-like style. It is the non-structurally written papers that I have to read linearly. Because you have paragraph after paragraph of text, it’s often not clear without reading them which ones you can afford to skip. I think one of the most common errors writers make (including myself, I have to admit) is assuming that people will actually read your paper linearly, rather than skipping around and zooming in on the interesting parts. In a modularly written paper, it doesn’t really matter whether the examples come before or after the definition — it is clear at a glance which is which and the readers can read them in whichever order they want. Of course this doesn’t apply to lectures at all.
Ponder Stibbons Says:
October 20, 2007 at 2:14 pm | Reply
JSE wrote:
…their recommendation was to use both strategies, but to be quite transparent about the choice being made — i.e. one says explicity, “OK, now I’m going to give a couple of examples to give you some idea of what I mean by the word “field,” but never fear, a formal definition will follow” or “OK, let me write down a formal definition, if it seems unmotivated and weird, never fear, examples will follow.” I’ve found this to be very good advice, at least for lecturing.

I concur. I’m much more of an ‘examples first’ person, but I find that I find ‘definitions first’ much less bewildering if it’s done in the way you describe.
Adam Chmaj Says:
October 20, 2007 at 5:24 pm | Reply
One of my favourite examples is the one which gives
a working knowledge of the Riemann integral, i.e.,
the passage to the limit from Fourier series to the
Fourier integral.
Emmanuel Kowalski Says:
October 20, 2007 at 6:47 pm | Reply
Rather than “examples” only, I find it useful to think of “motivation”, which may be slightly more general and a bit different. When introducing a new type of object (groups, fields, etc), then motivation is naturally associated with examples, but we may also need to motivate properties (say, completeness of a metric space), particular techniques (e.g., why should we learn about representing functors if we want to solve diophantine equations? why can iot be useful to define structures axiomatically?), particular problems (why should we care about the Riemann Hypothesis?), etc.

I also think examples/motivations are crucial features in spreading various aspects of mathematics within the research community itself; when we need to learn or use some unfamiliar area of mathematics, and all standard references concerning this are written at the level of the specialist and/or with unfamilar terminology without “links” to an outside world, one may well be placed in the situation of staring at a result which answers a given question while being entirely unable to say why this is the case…

Two further remarks:

(1) When teaching, the background of the students may be taken into account, and in particular the evolution of their knowledge: after the students understand (say) the usefulness of groups and fields, and start to appreciate their axiomatic definition, it becomes much safer to introduce rings by starting with the similar definition, and then discussing examples. On the other hand, if one goes on to define topological spaces (or measure spaces) after groups and fields, then the style of the axioms is different enough that some motivation becomes very important again.

(2) Often I think that looking at the history of the object/property/problem can give a useful clue as to how to motivate it, and also whether there is a chance of doing a really good job
considering the time available. For instance, Galois theory can be grown “organically” by following the history, but it will take an enormous amount of time, and the early versions are likely to be essentially incomprehensible even for the teacher. Seeing the tremendous clarity arising from the more abstract viewpoint is a wonderful experience, but most students would be discouraged before — though it is also clear that by making the student see the problems, and how difficult they seemed for centuries, one could lead them to a proper appreciation of this clarity…
As another example, a very good motivation for developping algebraic number theory is to try to solve Fermat’s equation the way it was attempted before the theory was well-understood, and exhibit the problem of the failure of unique factorization. But then if ideals have not already been introduced in a basic ring-theory class, it seems hard to grow the motivation for their introduction in a reasonable amount of time.
Adam Chmaj Says:
October 20, 2007 at 7:24 pm | Reply
The Riemann Hypothesis is the ultimate professional challenge in
the algebraic scientific system based on the i^2=-1 definition.
I doubt that it can be solved by someone who e.g., does not
understand basic calculus of variations concepts.
Tal Says:
October 21, 2007 at 5:33 am | Reply
Examples first – that’s how it really happened.
Fields, groups, Banach algebras were all there and were all
used before they were defined as such.

Although when studying and teaching I prefer the classical textbook method –
starting with a definition, and then giving some examples –
I think that the really important principle is to keep in mind the importance
of examples, and their roles, not just as “examples”, but also as precursors of the theory and the reason for it.

When I was an undergraduate I got the impression that mathematics is the following activity:
“make up some definition of a structure and see what comes out of it”.
Today I think quite differently (although such an activity can be described as mathematics). Regarding Reid’s comment, this would be more like what I would call “play”.

However, we do not always have centuries to learn or to teach mathematics as it happened. I find the abstract approach, including definitions first, very concise and clear (and interesting and beautiful).
Kenny Says:
October 21, 2007 at 7:07 am | Reply
I think in cases where the concept is already somewhat familiar to the student, putting the examples first can get in the way. In particular, by giving examples first, students can get the wrong idea of which aspects to generalize – for instance, the examples-first definition of a field might lead students to think that all fields have characteristic 0.

Of course, if it’s a new topic for the students, then examples first is almost certainly better. But I’d agree with Emmanuel Kowalski – the really general issue here is motivation, rather than examples in particular. But most motivation comes from examples and counterexamples.

On a related topic, my friend Noah once (approximately) said over tea that for some of the crazy convoluted definitions that mathematicians end up working with, it would be better if people just gave the examples and counterexamples, and then told the reader to come up with the definition, rather than giving the definition itself. In many cases, there’s no way one can follow the definition without knowing exactly the sorts of examples and counterexamples it’s meant to deal with, and once you know, the definition itself isn’t very useful. I suppose it’s still necessary at some point to prove general theorems about the concept, but that can come much later.
Emmanuel Kowalski Says:
October 21, 2007 at 8:57 am | Reply
“Examples getting in the way” is actually often problematic in research also: I think it’s in his memoirs that André Weil mentions that it took him (and others) a long time to embrace the use of the Zariski topology in algebraic geometry because it looked so wildly different from the well-accepted topologies of the time, in particular by being non-Hausdorff.
Adam Chmaj Says:
October 21, 2007 at 9:49 am | Reply
I think it doesn’t hurt to keep our activities in broader perspective.
E.g., it seems to me that research in mathematics is stimulated mainly
by neat open problems, e.g., FLT or TGC. However, it would be safe
if claims of solutions of such questions were either verified by other
research areas or given alternate proofs.
Predigested Formalisms, Spoonfeeding of « The truth makes me fret. Says:
October 21, 2007 at 3:28 pm | Reply
[…] just because it’s also about pedagogy and because I have said similar things: Timothy Gowers explains why “examples first” is his favourite pedagogical principle. I couldn’t agree […]
Terence Tao Says:
October 21, 2007 at 10:32 pm | Reply
Incidentally, regarding the RSS feeds, it seems that your feed (and mine) are now giving full summaries. Perhaps it takes a bit of time for the change to propagate.

I find, when reading papers, that even just having a single sentence of motivation or examples before plunging into a technical definition, or a single sentence attempting an informal definition before setting up a lengthy example, helps tremendously. (The same is doubly true for listening to mathematical talks.) Even just a promise to explain things more later, after the current necessary technicalities, is reassuring. Any further up-front motivation or explanation beyond that sentence is of course also helpful, but there is a law of diminishing returns and eventually one runs up against the issue of reader impatience mentioned earlier.

But one fairly cheap thing to do (and which already occurs in many papers) is to present theorems or definitions in their model cases first, before stating the general case. (This also provides a natural place to discuss previous literature and partial results.) The temptation to state Main Theorem 1 in maximal generality on page 1 of a paper, and bury the interesting model cases of that theorem in the final remarks section of the paper, should be resisted.
Adam Chmaj Says:
October 22, 2007 at 9:48 am | Reply
Dear Terry,
I disagree that presenting theorems in their model cases first
is fairly cheap. E.g., don’t you think students should first read
van der Corput’s work on length three arithmetic progressions
in prime numbers, before diving into the the Green-Tao publication?
Nicolas B. Says:
October 22, 2007 at 5:37 pm | Reply
Of course Explanation 1 is vastly superior.
The examples are useless and can only confuse any mind endowed with a modicum of aesthetic sense.
Why don’t you immerse yourself in my treatise and emulate my pedagogy?
Nicolas B.
Doug Says:
October 23, 2007 at 8:39 am | Reply
I am attempting to use, but may be misusing your examples first concept.

In the following Euclidean geometric representation
OA=OB=OB1=BC=B1C1
and
AB=AB1=OC=OC1
with
semicircle of radius OA through B,A,B1 not drawn
and
mirror reflection of this semicircle and in-circum-scribing squares not drawn.

Investigated is E=mc^2 from a dynamic programming perspective in an attempt to understand how Einstein derived this equation.

| B . _ . _ . _ . _ . _ . _ . _ . _ C
|
|
|
|
|
|
|
|
|_____________________ A
| O
|
|
|
|
|
|
|
| _ . _ . _ . _ . _ . _ . _ . _ . _ C1
| B1

Let OA=a=MAX(a) when MAX(a)=c^2

Then OA=(1/2)*v^2=(1/2)*MAX(v^2) when v=c as v->c

and

AB=c*sqrt(2) from Pythagorean theorem

Were the entire circle and both the inscribed and circumscribed squares drawn, then this would be a step in Archimedes attempt to use polygons to determine PI.

It would seem that:
1 – Newton’s F=ma is represented by a vector OA.

2 – Einstein’s E=mc^2 is represented by the area of the square OACBO.

3 – The force F of Einstein E would be represented by a vector OC.

4 – That OC=AB in absolute value, with AB as a relative speed or velocity.

5 – The maximum E occurs at v=c*sqrt(2) or when vectors of force are orthogonal.

6 – Einstein allowed for 2*OB=BB1=2c as a maximum relative speed or velocity when vectors are in 180 degree opposition.

Questions:
a – This geometric representation, working backward from the perspective of dynamic programming concepts [Richard Bellman IEEE “systems analysis and engineering topic”], appears to suggest Einstein was aware that relative speed or velocity could exceed v->c up to v->2c?

b – Is E=m*MAX(a) with MAX(a)=c^2, as a constant extrema, more accurate than E=mc^2, with c as a constant?

c – Could this geometric representation become a geometric proof?

d – Are concepts of Euclid, Pythagoras, Archimedes, Newton, Einstein and Bellman unified through this representation?

Note that I am also using [correctly?] the “amplitwist” concept of Tristan Needham, ‘Visual Complex Analysis’ in visualizing the transformation of the outside of the circle to the inside and vice versa.

http://www.usfca.edu/vca/
Adam Chmaj Says:
October 23, 2007 at 5:23 pm | Reply
Dear Doug,
What is wrong with this “proof” of E=mc^2:
http://www.drphysics.com/syllabus/energy/energy.html
Gil Kalai Says:
October 23, 2007 at 6:12 pm | Reply
Dear Tim and dear all,

In the first class of the first year in university we learned the definition of a field as the first thing in linear algebra. In the parallel course in calculus (or “infinitesimal calculus”) the first task was to define the real numbers using Dedekind cuts. So it is fair to say that the only example we had was the field of rational numbers. The main point in both courses was to introduce how rigorous mathematics definitions and proofs should go. (Later in linear algebra another implicit point was that the specific field does not matter much to many of the topics.)

Real numbers came only a couple weeks later (in the calculus course). While some of us heard about complex numbers they were not defined and considered in our classes until even later. We became familiar with general finite fields or Q(-sqrt 3) much later (not in the first year). I think the example of fields of prime orders was considered in the first few weeks. (But well after fields were defined.)

So, this is an experience which goes contrary to Tim’s pedagogical advice. Overall, I am not sure that in the context of first year mathematics studies, going first with examples would be better. Developing some of these examples is more difficult than the basic definitions and propositions regarding fields, especially if you insist on rigorous presentation.

In summary, while I like examples very much and wish always we had more of them, I am not convinced by Tim’s general pedagogical advice (which indeed sounds appealing.) Maybe more examples or some theory supporting the approach can help.
John Armstrong Says:
October 23, 2007 at 6:15 pm | Reply
Doug, if you want to “attempt to understand how Einstein derived this equation”, why don’t you read the book, The Principle of Relativity wherein Einstein himself tries to explain to a general audience how he derived it? I’ll tell you this: it looks nothing like what you’re saying here.
gowers Says:
October 23, 2007 at 9:43 pm | Reply
I’m planning another post on this question, but a brief answer to Gil’s point is that it isn’t necessary to have a rigorous definition of the reals in order to understand them as an example of a field: you just need to be familiar with the idea that the various binary operations have “obvious” properties. In other words, I think your qualification “especially if you insist on rigorous presentation” is important: I don’t. (In case that is misunderstood, I do of course think that a rigorous presentation of the real numbers is important — all I’m saying is that it’s not important if all you want them for is an example to give people an idea of what a field is before you define it. When I lectured Cambridge undergraduates on fields and gave the reals as an example, well before they had seen them rigorously presented, there wasn’t the slightest hint of anxiety in my audience. I think, incidentally, that I gave the general definition first in that course, so in a way this question is orthogonal to the point at issue. Your remarks make me curious to know whether, before the reals had been rigorously defined in the calculus course, your linear algebra course avoided discussing vector spaces over the reals — that would strike me as a very odd approach.)

In the next post I’ll see if I can support my case with some more examples, as you suggest, and I’ll also try to come up with more refined advice: I agree with many of the comments here that the examples-first approach should not be applied universally, but I remain convinced that it should be applied much more than it is and will try to specify general circumstances where it should.
Horace Says:
October 24, 2007 at 4:45 am | Reply
What are some other effective pedagogical principles that are not immediately obvious?
John Armstrong Says:
October 24, 2007 at 5:55 am | Reply
Horace: one of my favorites is that you can teach a lot of interesting theory to low-level classes as long as you don’t say the big scary names. Whenever I teach multivariable calculus, I can communicate the basic ideas of DeRham cohomology as long as I don’t say “DeRham” or “cohomology”.
Mathematics and writing Says:
October 24, 2007 at 4:18 pm | Reply
[…] My favourite pedagogical example: examples first […]
Gil Kalai Says:
October 24, 2007 at 4:49 pm | Reply
Well, I double checked and looked at the Hebrew text book (lecture notes) by Amitsur who was also the lecturer of my first year linear algebra course. The book starts with a sentece like:”Let’s examine the properties of the system of real numbers” and then gives the axioms of a field and move directly to describe several other examples: the rationals, a+sqrt 2 b where a and b rationals; integers modulo n with product and multiplication stating that for n prime this is a field; and the complex numbers. So real numbers were taken for granted and plenty of examples were given. One example – the reals was the prototype for the axioms and several others follows immediately afterwards.

My description above was not so accurate.

I also looked at the earlier Hebrew text book (again lecture notes) by Levitzki but it started from the notion of groups. So it was a different concept

BTW Levitzki and Amitsur (who was his Ph. D.) student proved together a beautiful theorem about commutativity of n by n matrices. Take 2n such matrices and look at all (2n)! products for all permutations and sum them up with alternating coefficients according to the sign of the permutation. The theorem sais that you will always get 0.

(Amitzur later told me that the Cayley Hamilton theorem that f(A) =0 when f is the characteristic polynomial of A, a theorem we learned at the same course, is in a sense the “mother” of the Amitsur-Levitzki identity and all other combinatorial identities for non commutative rings.)
Beans Says:
October 27, 2007 at 2:00 pm | Reply
I have yet to study fields formally, but do have a ‘small’ idea about them. As a student, I feel that in some cases definitions are necessary. If your explanations were part of my notes, then in both of them I sort of skimmed/skipped the last part and only recall the word division. (If I am being honest that is!)

However, I agree that having the example first i.e. the sets X which are fields, enabled me to better relate to the definition because I already know the rules of them sets.

I am thinking back to my first year and partially agree with you. In my case, in most instances I prefer definitions first, and then examples straight after. I then try to make sense of the defintion with the example in mind. For example, for the formal definition of a limit, I feel it was important to have the definition first and then examples next.

I think it is up to the lecturer to decide whether examples are necessary before or after. (When we were told about the division algorithm, we were given examples beforehand and the defintion after.) I feel one of my lecturer does this brilliantly.
gowers Says:
October 27, 2007 at 2:21 pm | Reply
Beans, I think the case of limits is an interesting one, and may depend quite a bit on the aptitude of the student. For students who have difficulty with the concept, perhaps because they are completely unfamiliar with definitions that involve quantifiers, I think it is quite a good idea to give an example, such as the limit of the sequence 1, 1.4, 1.41, 1.414, … being the square root of 2, and then challenge them to say precisely what that means. It is quite likely that they will make guesses that are wrong. For instance, they might say, “It gets closer and closer to the square root of 2,” which suggests the definition that $x_n$ tends to $a$ if the sequence $\null |x_n-a|$ is decreasing (or perhaps strictly decreasing). Then one can give examples that show that this is an inadequate definition (for example, by that definition the above sequence would converge to any number greater than or equal to the square root of 2, and the sequence 1,0,1/2,0,1/3,0,… clearly converges to 0 without actually getting closer and closer at each step). After a bit of that one arrives at the right idea, which can either be presented informally first — however close you want to be, eventually you are always at least that close — or expressed straight away as a formal definition.

Of course, it would be possible to start with the formal definition and then say, “If you think that looks strange, let’s consider some examples that show why it is as it is.” In this case I don’t feel that one way round is significantly better than the other, though in some ways the motivation-first is a better reflection of how one comes up with a definition. And it also does what Deane Yang suggests in the second paragraph of this comment.
beans Says:
October 27, 2007 at 5:31 pm | Reply
Actually I am sorry but I had confused my notes! The lecturer did precisely what you wrote, but it was with two other functions! I will have to retract my previous statement, since whether or not I prefer examples first depends on the course. Indeed, motivation was the key word. In analysis, in the first lecture this semester, we were given the motivation which set the scene rather nicely. That is the very same thing which I have been desperate for in my applied course.

Quite strangely, I myself wrote a rather long post about what Deane Yang commented about! I feel that the lecturers tell us a story, and they each have their own unique way of doing so. And as with books, you sometimes prefer one over the other. One fantastic lecturer gives us definitions first and then examples straight after and I follow that. Hmm, I think each theorem, definition etc deserves its own attention. Had the Intermediate Value Theorem not been introduced by a diagram, I would have had great difficulty in understanding and following its proof.
anon Says:
October 27, 2007 at 8:06 pm | Reply
But, the ‘closer to the limit’ idea works with upper and lower subsequences. Then one can tie it up by saying that if both these subsequences get ‘closer to the limit’ then the sequence itself is said to converge.
I wonder if sequences could be taught that way — by first doing LimSup and LimInf and then then limits. That way the student is first introduced to non-decreasing or non-increasing sequences where the ‘closer to the limit’ intuition can work.
Anonymous Says:
October 28, 2007 at 12:30 pm | Reply
Reid,

The fields you’re referring to (i.e. those in an ordered list) seem to be these, whereas Tim’s fields are those.

This is one of the many instances where people working in different fields (pun not intended) turn a word of common English (or any other language) into a technical term in different ways that are often unrelated to each other.
Gil Kalai Says:
November 2, 2007 at 12:16 am | Reply
The examples put forward in the discussions: How to teach the concept of a field and the example regarding “how large is your orbit, x?” and several additional examples put forward in the discussion like teaching limits and teaching Sylow’s theorems are excellent examples. Not because they immediately lead you to accept with enthusiasm Tim’s suggestion but because they can be very useful in examining the “examples first” suggestion. When it comes to pedagogical claims or to other forms of advice we are not able to prove things, so our best shot in examining them is to look at them skeptically. (I hope my earlier comment while not enthusiastic did not come across as hostile.) So this is not special to blogs, it is special to moving away from mathematics.

Regarding ‘fields’ I like the way Amitsur do it basing the definition of a field on trying to examine properties of the real numbers; I see no problem in mentioning Q and C as “positive” examples and N and Z as “negative” examples before giving the formal definition provided the students already know these examples. I am not sure it makes a big difference. If the main objective is to define “fields” and the students need an introduction to the complex numbers I would not suggest to give this example first but to wait after fields are defined.

Actually my own style/taste of teaching is in the direction of examples-first plenty of preliminary chat, stories, philosophy, non linear development and plenty apropos and even dubious humor. It is perhaps a good pedagogical principle that people should follow, within reason, their taste and style. But I think people should be especially skeptical about transforming their tastes into strong principles.

Apropos Amitsur, while I do not remember him giving much chat and non linear stories at class I do remember we had many chats in our faculty club (Belgium house) and we even tried once to work on a problem: Take a non Papussian projective space (3-dimensional), say a projective space
over the Quaternion. Is it possible to find there seven pairs of lines e1, e2, … e7 ; f1 f2 …f7 so that ei is disjoint from fi for every i and ei intersects fj whenever i and j are different. Such a configuration is impossible for a Papussian projective space (over a field).
As far as I know, this problem is still open.
John Armstrong Says:
November 2, 2007 at 12:43 am | Reply
I think people should be especially skeptical about transforming their tastes into strong principles.

I second this. As one much wiser than I has said, “It is my firm belief that it is a mistake to hold firm beliefs.”
gowers Says:
November 2, 2007 at 10:00 am | Reply
A quick remark in response to the last two comments. Although for rhetorical purposes I may have made it seem as though I was holding to a certain belief in a dogmatic way, that’s not really the point of what I am saying. I do, as it happens, think that there are many many examples of expositions in the public domain that could be made more accessible if an examples-first approach was used. But, as has already been mentioned, there are two separate (though related) issues that one can distinguish: what is achieved by putting examples first, and is that something that one wants to achieve?

Although my answer to the second question is very often yes, it’s really my answer to the first question that I’d like to persuade people of. I may even end up writing “Examples first III” to deal with it at greater length, but a brief summary of my view is this. It depends on a distinction between “direct” memory (the sort you’d need to remember digits of $\pi$ , say) and “derived” memory (the sort you’d need to reconstruct a proof of a theorem from one key idea using your years of mathematical training). One of the difficulties of reading mathematics linearly is that you sometimes have to rely on direct memory because you don’t yet have anything from which to derive a memory. This would be true if a proof had steps that appeared to be arbitrary, or a complicated lemma was stated and you didn’t know how it was going to be applied, or a definition was given that turned out later to be exactly what you needed. Putting examples (and other kinds of motivation) first is a way of reducing the reliance of the reader on direct memory, since it sometimes makes it possible to derive the memory instead. (E.g., in the fields example, one could derive the list of axioms, or at least get a long way towards doing so, from the simple idea that they are the properties that hold of addition and multiplication in the rationals.) It’s this factual point that I’m mainly trying to push. (By “factual” I mean that it makes a statement about the world that could be true or false.) The normative point (this is how we ought to present mathematics) is secondary — of course, it’s no secret what my views are here too, but I don’t hold them in a rigid way and can think of situations where presenting examples first would not be helpful.
Atdotde Says:
November 6, 2007 at 6:29 pm | Reply
An example for example

Tim Gowers has two very interesting posts on using examples early on in a mathematical exposition of a subject. I can only second that and say that this is my favorite way of understanding mathematical concepts: Try to think through the simplest non-…
Anonymous Says:
November 10, 2007 at 6:52 am | Reply
I can’t be sure, but I think Reid has the same definition in mind; he means that the field axioms be listed out, as in

(i) a,b in F then a+b in F
(ii)…etc

Whether this is put first or second is perhaps neither here nor there; the central thing, from my perspective, is to emphasize that we care about fields because they clarify a lot of things about N, Z, Q, and C (and perhaps even more to the point, they distinguish Z_n from Z_p, both of which can be defined without a recourse to abstraction). This may seem like an obvious statement, but I think a good deal of university mathematics is unfortunately written from the perpsective that this would ‘dumb-down’ the material.

So I can’t agree with the sentiment that “This is University, playtime’s over, it’s time for some real work.” In my personal experience I have been able to learn math much more quickly (and deeply) if I view it as play, rather than ‘real work.’ In the former instance, the subject itself motivates me; in the second, it’s not the math, but just a desire to impress people or just do what’s expected. And there are easier ways to do these thing than to learn about tensor products, etc. From an even broader viewpoint, surely less sterile math will be developed if people learn in their undergraduate days to pursue abstractions because they are useful and clarifying, rather than pursue the abstractions because someone of authority has written them down.
Arnold Lebow Says:
March 18, 2008 at 12:06 am | Reply
Problems first is even better. A problem can motivate an example and a definition.
Mathematics Books for Non-Mathematicians « The Number Warrior Says:
March 18, 2008 at 8:42 pm | Reply
[…] I’m reminded of Timothy Gowers and his principle of “examples first”. […]
I hate axioms « Annoying Precision Says:
June 27, 2009 at 10:13 pm | Reply
[…] axioms before examples. On this point I agree wholeheartedly with Tim Gowers when he says to put examples first, as I think it places the emphasis on the empirical side of mathematics, as well as on the correct […]
Gil Says:
August 13, 2009 at 11:15 am | Reply
Oded Goldreich has a different view on examples (at all) His
“SHORT OPINION NR 1” reads:
“ON EXAMPLES.
If a clear explanation is provided, then no examples are needed;
otherwise no examples will help…”

http://www.wisdom.weizmann.ac.il/~oded/etc.txt
- gowers Says:
  August 13, 2009 at 12:53 pm
  The fallacy, in my view, is a very simple one. If a clear explanation is provided, then no examples are needed, but they certainly help. (Strictly speaking, that’s consistent with what Oded says, but not with his implied conclusion that there’s no point in examples.)
  
  A different way of disagreeing would be this: examples are often what make a clear explantion clear.
esmeyny Says:
June 9, 2010 at 9:43 am | Reply
Of course, I agree, but I have to add the important principle:

Clearly mark the point where you leave the example and start the general definition (or proof).
In your example, this would be the point “in other words”, and there should be a paragraph (and properties should be numbered and structured as a list, too).

I had a professor who would calculate with examples and suddenly claim to have proved the theorem. I started to understand *why* students could have been happy with the introduction of the “definition, theorem, proof” style.
On writing [excerpt from Terry Tao's blog] « (郝成春) Chengchun Hao's HomePage Says:
July 10, 2010 at 2:51 am | Reply
[…] Gowers on “writing examples first!” (see also this followup […]
Cristi Says:
November 30, 2010 at 9:26 pm | Reply
This is a good idea. If we compare with programming techniques (I saw Terry Tao does this sometimes), I would choose “test driven development”. It is good to have an example, a toy model, something on which to test the results and to be inspired with new ideas.
Wayne Ariola Says:
November 30, 2010 at 10:34 pm | Reply
We have been exploring TDD for quite sometime. We think it truly increases productivity. Here is a paper we wrote on this topic:

http://bit.ly/e1RP2D
On writing « mathTHÍCHinTOÁNmyHỌCbrain Says:
December 30, 2010 at 3:21 am | Reply
[…] Gowers on “writing examples first!” (see also this followup […]
Rajnikant Sinha Says:
July 21, 2013 at 7:57 am | Reply
How to read a paper in maths is shortest duration? This vital question is directly related to the intention of the paper author. If the author has good intention, he will write proofs of the theorems in a natural development manner. There will be no gap in the argument, If some gap occurs, the author should supply argument in sufficient detail at the same place with the heading “Reason” like (Reason: ….). Unfortunately, except Euclid, and Rajnikant Sinha, no author in Maths takes the bother to write everything like, in Chemistry, and Biology. Because of riddle-type writing in maths, it is going to become complicated even for so-called experts. The day “write detailed proof” practice is followed in text books and periodicals, there will be a boom in maths learners and research.
Examples First: A Math Pedagogy | Anthony Bonifonte Says:
January 31, 2014 at 4:12 am | Reply
[…] https://gowers.wordpress.com/2007/10/19/my-favourite-pedagogical-principle-examples-first/ […]
isomorphismes Says:
November 20, 2014 at 6:11 pm | Reply
You are absolutely, totally right. Along with that I would say “Motivating questions / payoff” as well as the examples up front.
Examples first – drossbucket Says:
February 14, 2017 at 5:59 pm | Reply
[…] the idea I personally like to think about is an approach I call ‘examples first’, after these two blog posts by Timothy Gowers. (The second one has an absolutely epic comment thread – […]
How to write a paper – Ju Says:
July 12, 2017 at 5:27 am | Reply
[…] Gowers on “writing examples first!” (see also this followup […]
m88 Says:
May 7, 2020 at 12:35 pm | Reply
m88

My favourite pedagogical principle: examples first! | Gowers's Weblog
Tronsr.org site Says:
February 18, 2021 at 3:35 pm | Reply
Tronsr.org site

My favourite pedagogical principle: examples first! | Gowers's Weblog
On Plato’s Republic, 1 « Polytropy Says:
August 15, 2021 at 4:34 am | Reply
[…] 2007 blog post of Timothy Gowers recommends not giving definitions until you have shown your listeners some […]