I was shocked and saddened to hear about a week ago that Ted Odell, a mathematician to whom I owe a lot, died suddenly on January 9th of a heart attack while he was travelling to this year’s joint AMS/MAA meeting in San Diego. He was 65, but seemed a lot younger.
Ted was a world leader in Banach space theory, and in particular in the infinite-dimensional theory. The wry and slightly enigmatic smile you see in the photo was extremely characteristic: if I imagine Ted, I automatically imagine him with exactly that smile. Less clear from the photo, though perhaps it can be guessed from the camera angle, is that he was extremely tall: he belonged to a handful of mathematicians I know who make me feel short (Tom Sanders and Alex Scott being two others).
I first met Ted when I went to my first ever conference, in Strobl am Wolfgangsee in Austria in 1989. I can’t remember how it came about, but I ended up chatting to him, and he started explaining to me in a wonderfully clear way — the kind of explanation you just can’t get from a textbook — how Tsirelson’s space worked. I read in an obituary by András Zsak (which starts on page 30 of this issue of the LMS newsletter) that Ted had a reputation for being kind and encouraging to young mathematicians. He certainly was to me at this conference, taking the time to give this explanation to a graduate student about whom he knew nothing. Most of the next section describes an argument that he sketched out for me on one of those yellow pads of paper that seem to be standard in US maths departments. (I think I’ve still got the yellow sheets that he let me keep, but I’ve no idea where they are.)
At the time, I was very interested in the so-called distortion problem for Banach spaces. I shall have a lot more to say about that later, but let me briefly formulate the notion of distortion here. Two norms and on the same space are said to be –equivalent if there are constants and with such that for every . After rescaling, one can assume that they satisfy the inequalities , so for convenience I’ll do that. We call the norm a –distortion of if the above inequality holds, and if it cannot be improved by passing to an infinite-dimensional subspace. That is, for every , and for every subspace of and every you can find such that and .
It is far from obvious that there exists a constant and a norm that can be -distorted. If you want to get a sense of the difficulty of this problem, then you might like to spend half an hour thinking about it: if you haven’t already seen the answer, then you will get absolutely nowhere.
Of course, I’ve sort of given away that Tsirelson’s space provides an example, but even with that huge clue the problem isn’t trivial, since you have to come up with the -distortion. It was this that Ted explained to me.
I’m not going to give a full explanation with all its details, but I’ll try at least to convey the basic idea, starting with the definition of Tsirelson’s space (or rather, the dual of the space that Tsirelson originally defined, which, as was observed by Figiel and Johnson, is easier to work with). To define the norm, which is on a space of sequences, we need the following notation. If is a sequence and , then is the sequence that takes the value when and otherwise. In other words, we write for the coordinate projection associated with the set . If , we write to mean that : that is, ends before starts. Finally, we say that a sequence of subsets of is admissible if . (I write when formally I should write .) That is, a sequence of sets is admissible if each set in the sequence starts after the previous one has finished, and if the minimum of the first set is bigger than the number of sets in the sequence.
We now define the norm of a sequence using the following amazing formula. Let be the set of admissible sequences of sets.
In words, the idea is this. With each you want to associate a non-negative real number, and you want to make that number as large as possible. You have two options. Either you can settle for the norm of , or you can chop up into pieces (discarding everything up to the th coordinate), play the same game with each piece, add up the results, and divide by .
Note that if is finitely supported, then there is nothing to be gained by the chopping up if the support of is entirely contained in one of the pieces, since then you’ll simply have divided by unnecessarily. So the game terminates, and if you play it optimally, then you get the norm of .
How does one say anything about a norm that is defined by an implicit formula such as the one above? This is what Ted taught me. A first observation is that it is possible to find sequences of unit vectors that approximate the unit vector basis of as closely as you like. (The notation means that the support of ends before the support of starts.) The proof is simple. If the support of begins after , then the supports of themselves form an admissible sequence. It follows from the implicit definition that for any sequence of real numbers.
A nice inductive argument of R. C. James allows one to improve this -equivalence to a -equivalence. Let be any norm on such that for every . Let (for convenience I’ll assume that is a perfect square) and divide the standard unit vector basis into blocks of size . Then one of two things is guaranteed to happen.
(i) For every , the th block generates a vector such that .
(ii) For some , every vector generated by the unit vectors in the th block satisfies .
In the second case, we have obviously improved the equivalence to on a subspace of dimension . But in the first case we have as well, because is at least by hypothesis and at most by the triangle inequality.
By iterating this argument a few times, we can make the equivalence as good as we like.
Now comes Ted’s nice idea. If we want a -distortable space, then we can take Tsirelson’s space as defined above (actually, the normal thing to do was to take , but Ted wanted to take a general in order to show that -distortions existed for any ) and define an equivalent norm as follows:
Here, I’m being a bit sloppy about constants: and are basically the same, but for the chopping up in the above definition I want an integer. The idea is that you take a vector in Tsirelson’s space and you’re allowed to increase its norm by cutting it up into pieces and adding the norms of those pieces.
It is easy to show that this is an -equivalent norm on Tsirelson’s space. It is at least as big as the original norm. But also, since for every , it follows by the triangle inequality that . To show that we have a distortion, we need to show that sometimes chopping up a vector and adding the norms of its bits doesn’t help us, and sometimes it does. And we need to show that that is the case inside every subspace of Tsirelson’s space.
Here’s where I’ll get a bit sketchier. In any subspace we can find unit vectors that generate a subspace that’s -close to . If we take such vectors and define to be their average , then we obtain a vector that can’t be improved much by chopping, at least if is much bigger than . The reason is that, apart from a few edge effects, we’ve basically chopped up into sums of consecutive blocks of the , and since the generate something very close to a copy of , is very close to the sum of the norms of those sums. So the chopping didn’t achieve anything.
How about a vector where chopping does help? The technique here goes a long way towards explaining the point of the strange definition of Tsirelson’s space. Let’s call a vector of the kind I defined in the previous paragraph an -average. We now inductively construct (inside some arbitrary subspace) a sequence of -averages as follows. Once we’ve constructed , we pick to be much larger than the maximum of the support of and we let be an -average that starts after finishes.
It turns out that if we want to calculate the norm of the vector in Tsirelson’s space, we can’t do much better than the obvious decomposition into , which tells us that the norm is roughly .
Why can’t we do better? Well, suppose we chop into pieces that start after . Then for each such that we know that the maximum of the support of is smaller than , so we’ve completely missed out in the chopping up of . And for each we know that is much greater than , so there is nothing to be gained by chopping up (or any later ).
But if is approximately , then , since is at least .
What we have done with those calculations is show that every subspace contains vectors of two different kinds. One kind is averages, and the other is sums of averages for a sequence that grows very rapidly. For the first kind of vector , and are roughly equal, whereas for the second kind of vector , . So we have shown that Tsirelson’s space, defined with constant , is -distortable (or perhaps it’s safer to say -distortable just to provide some elbow room in the calculations).
This shows that for every there exists a -distortable Banach space. But that raises a very natural question: can we reverse the quantifiers and find a Banach space that is -distortable for every ? This question was answered by Thomas Schlumprecht, who had a very close collaboration with Ted over many years, which resulted in a large number of beautiful papers. Let me briefly give the definition of Schlumprecht’s space, the first known arbitrarily distortable space.
The philosophy behind Tsirelson’s space is that since the more pieces you chop a vector into, the bigger you can make the sum of the norms of the pieces, you need some kind of “penalty” associated with the number of pieces you chop into. The penalty is that if you want to chop into pieces, then you must sacrifice the first coordinates of your vector. Schlumprecht’s idea was to impose a different kind of penalty: instead of taking a fixed constant , you let grow with the number of pieces you chop into. Many functions would do for this purpose, but the function had some properties that made it convenient in calculations. Accordingly, Schlumprecht’s space is defined by the following formula.
Because grows slowly, every subspace of Schlumprecht’s space contains averages, by the James argument. But now if you take sums of averages with rapidly increasing , you can get the gain from chopping up into pieces to grow without limit as grows.
Incidentally, Tsirelson first constructed his space as an example of an infinite-dimensional space that does not have a subspace isomorphic to or for any . The fact that it contains copies of in every subspace and that these copies are generated by disjointly supported vectors rules out everything except . And is ruled out because if it contained a copy of then that subspace wouldn’t be distortable.
The distortion problem.
I now want to describe a problem that was for several years my favourite problem in mathematics. The problem is very simple indeed to state, given the definitions in the previous section.
Problem. If , is -distortable for some ?
We have seen above that there are distortable spaces. But what about the spaces we know best, the -spaces? An infinitary analogue of the argument of R. C. James that was used above to obtain -averages can be used to prove that is not distortable, and an argument along similar lines shows that is not distortable. But what about a Hilbert space, say?
The fact that Tsirelson’s space can be distorted offers little encouragement, because it relied on the general idea that there are two different kinds of vectors (in the strong sense that they can be found inside every subspace). A Hilbert space is so homogeneous that it looks hard, and perhaps impossible, to identify two different types of vectors.
I came to believe strongly that Hilbert spaces were not distortable. My reason was that there is a simple reformulation of the distortion problem that turns it into a problem in Ramsey theory that is very similar to known Ramsey theorems that have positive answers. Here is the reformulation.
Problem. Let and let the unit sphere of be coloured with finitely many colours. Must there exist an infinite-dimensional subspace and one of the colours such that every point in the unit sphere of is within of a point with that colour?
This is slightly different from a normal Ramsey theorem in that we don’t find an infinite-dimensional subspace that’s entirely of one colour. Instead, we expand the colours by and then try to find a subspace that lives inside one of the (expanded) colours. This is necessary, since otherwise one could colour a vector red if its first non-zero coordinate is positive and blue if it is negative. But the problem above is a natural “analytic” Ramsey problem.
To see the connection, observe that if you have a -distorted norm on the Hilbert space, then you can have one colour for vectors where the distorted norm is less than times the usual norm, and one for the rest. Since every subspace contains vectors that have norms roughly equal to the usual norm and other vectors that have norms roughly equal to times the usual norm, this gives us a counterexample to the Ramsey-type problem. In the other direction, if you have a two-colouring and no subspace is contained in the -expansion of either colour, then you can construct a distorted norm. I leave this as an exercise.
Why did I believe so strongly that this Ramsey problem had a positive solution? It is because of a beautiful theorem in Ramsey theory due to Neil Hindman. In its original formulation, it states the following.
Theorem. Let be coloured with finitely many colours. Then there exists an infinite sequence such that has the same colour for every non-empty finite subset of .
There is also a “finite-unions” version, which says the following.
Theorem. Let the finite subsets of be coloured with finitely many colours. Then there exist subsets such that has the same colour for every non-empty finite subset of .
Here the notation means that the maximum element of is less than the minimum element of .
To see that the sums version implies the unions version, think of each subset of as the binary expansion of a positive integer. That is, the subset corresponds to the positive integer . Given a colouring of the finite subsets of , pull it over to the integers using this correspondence, and apply the first version of Hindman’s theorem. This doesn’t quite finish things off, because it gives us some sets that may well not satisfy the condition . However, given any infinite set of integers and any , we can always find a sum of distinct elements of that is divisible by . This follows easily from the pigeonhole principle: you just find integers that are the same mod and add them up. Therefore, using the set that Hindman’s theorem has given us, we can find finite sums that correspond to sets with arbitrarily large minimum, from which it follows that we can find a sequence of (disjoint) finite sums that correspond to sets .
The finite-unions version of the theorem can in turn be rephrased in terms of 01-sequences. If and are two 01-sequences, write to mean that the last non-zero entry of comes before the first non-zero entry of . Then the finite-unions version of the theorem is trivially equivalent to the following statement.
Theorem. Let the finitely supported 01-sequences be coloured with finitely many colours. Then there exist sequences such that has the same colour for every non-empty finite subset of .
This is what got me excited. It seems very much like a combinatorial analogue of a positive solution to the distortion problem: a sequence space is replaced by a set of 01-sequences, but that shouldn’t be a huge deal.
I did in fact manage to pursue these thoughts to prove a theorem in the space that most resembles the space of finitely supported 01-sequences, namely (the space of real sequences that tend to zero, with the norm). It states the following.
Theorem. Let and let the unit sphere of be coloured with finitely many colours. Then there exists an infinite-dimensional subspace that lies entirely in the -expansion of one of the colours.
For what it’s worth, the theorem above is the one result of mine that appears in an Elsevier journal.
There are two things that make particularly combinatorial. One is that it can be discretized very naturally and easily: you just take all points with coordinates that are multiples of for some . The other is that if you add two disjointly supported vectors, then you basically just concatenate them. I won’t say much about the proof here — just that it used ultrafilters (as does one of the nicest proofs of Hindman’s theorem) and it involved a subtlety, in that the obvious discrete problem that you might be tempted to write down has a negative answer because of the sign-of-first-non-zero-coordinate example. So even for the discrete problem one has to prove an approximate result — the doesn’t just come from the approximation of the continuous problem by the discrete one.
Try as I might, I just couldn’t find a way of doing without the combinatorial crutch that I had in . And yet the homogeneity of and the fact that it was a very beautiful and natural sequence space made it impossible for me to give up the dream that there was some vast analytic generalization of Hindman’s theorem waiting to be proved.
What did finally cause me to give up that dream was the announcement in I would guess 1993 (since the paper was published in 1994) that Odell and Schlumprecht had constructed a counterexample to the distortion problem. Very surprisingly (to me at least), they deduced that every with is arbitrarily distortable from the fact that Schlumprecht’s space is arbitrarily distortable. To do this, they had to apply a very clever non-linear map to take the colourings that correspond to distortions of Schlumprecht’s space and convert them into colourings that correspond to distortions of . (I think I may be oversimplifying here — it was a long time ago and I don’t remember much more than I am writing here.)
Another way of stating the remarkable result that Odell and Schlumprecht proved is the following. I’ll state it for because there it is nicest.
Theorem. For every there exist subsets and of the unit sphere of with the following properties.
(i) Every infinite dimensional subspace of contains a point in and a point in .
(ii) for every and .
In other words, the two sets and are almost orthogonal to each other and yet both intersect every single infinite-dimensional subspace. Even getting the inner product to be at most is formidably difficult, or at least seems so: it would be fascinating if somebody could come up with a fundamentally different construction.
For me, the whole episode had some important lessons for how mathematical research works. I became obsessed with the distortion problem, and ultimately failed to solve it. And yet, out of that obsession, and also from what I learned during that conversation with Ted Odell, came an understanding of distortability that ended up yielding me a number of other results. On the other hand, I was a little too convinced that the problem had a positive answer, and that meant that there was no chance of my solving it — though the solution of Odell and Schlumprecht was so ingenious that I don’t think I would have solved it even if I had gone all out to find a counterexample.
I have tried in this post to give some idea of what Ted Odell contributed to mathematics, not by going through his numerous mathematical achievements one by one, but in a more indirect way by conveying the impact he had on me personally. Many other people will have different stories to tell, but the themes that will run through them are that he was a wonderful mathematician, generous with his ideas, and very supportive to younger mathematicians.
I’d like to end by mentioning a beautiful problem that arises very naturally from the above discussion and that is, as far as I know, still unsolved. It is all that remains of my hopes for proving an analytic Ramsey theorem for a non-combinatorial Banach space.
Problem. Does there exist a distortable Banach space that is not arbitrarily distortable? In particular, if Tsirelson’s space is defined with constant , can it be distorted by substantially more than ?