<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Gowers&#039;s Weblog</title>
	<atom:link href="http://gowers.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://gowers.wordpress.com</link>
	<description>Mathematics related discussions</description>
	<lastBuildDate>Sat, 18 May 2013 08:52:46 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='gowers.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>Gowers&#039;s Weblog</title>
		<link>http://gowers.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://gowers.wordpress.com/osd.xml" title="Gowers&#039;s Weblog" />
	<atom:link rel='hub' href='http://gowers.wordpress.com/?pushpress=hub'/>
		<item>
		<title>Answers, results of polls, and a brief description of the program</title>
		<link>http://gowers.wordpress.com/2013/04/14/answers-results-of-polls-and-a-brief-description-of-the-program/</link>
		<comments>http://gowers.wordpress.com/2013/04/14/answers-results-of-polls-and-a-brief-description-of-the-program/#comments</comments>
		<pubDate>Sun, 14 Apr 2013 17:32:58 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[Computing]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4915</guid>
		<description><![CDATA[I have now closed the polls in the second mathematical writing experiment. Here are the results. I have also published the comments on the first and second experiments, which shed further light on the results and on (some) people&#8217;s reasons for voting the way they did. The results were complicated slightly by the fact that [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4915&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>I have now closed the polls in the second mathematical writing experiment. Here are the results. I have also published the comments on the first and second experiments, which shed further light on the results and on (some) people&#8217;s reasons for voting the way they did.</p>
<p>The results were complicated slightly by the fact that after a couple of days the post was linked to from the front page of Hacker News, and suddenly the number of people who voted more than tripled in a few hours. Furthermore, there was a bit of discussion about the polls, so it is not clear that the votes were completely independent. Also, the profile of an average Hacker News reader is probably somewhat different from the profile of an average reader of this blog.</p>
<p>Fortunately, I had recorded the votes shortly before this happened, so below we (&#8220;we&#8221; means Mohan Ganesalingam and I) present both sets of results. As it happens, the proportions didn&#8217;t change too much. We begin with some bar charts. U stands for &#8220;undergraduate&#8221;, G for &#8220;graduate&#8221;, C for &#8220;computer&#8221; and D for &#8220;don&#8217;t know&#8221;. The portion coloured in blue represents people who claimed to be sure that they were correct, and the portion coloured in red represents those who were unsure. </p>
<p>At the end of the post we give the exact numbers. </p>
<p>A brief remark before we present the results is that none of the three &#8220;contestants&#8221; was explicitly trying to write proofs that would appear as human as possible. The two humans were asked to provide proofs and not told why. And the program was designed to produce passable write-ups, but not to fool people into thinking that those write-ups were written by a human. (There are some easy improvements that could have been made if we had intended to do that, but we did not originally envisage carrying out this second experiment.)</p>
<p>The general picture can be summarized as follows.</p>
<p>1. The computer was typically identified by around 50% of all those who voted.</p>
<p>2. Typically around half of those were confident that they were correct, and half not so confident.</p>
<p>3. A non-negligible percentage of respondents claimed to be sure that a write-up that was not by the computer was by the computer.</p>
<h3>The results in bar-chart form</h3>
<p><a href="http://gowers.files.wordpress.com/2013/04/whichisthecomputer.png"><img src="http://gowers.files.wordpress.com/2013/04/whichisthecomputer.png?w=410&#038;h=600" alt="whichisthecomputer" width="410" height="600" class="aligncenter size-medium wp-image-4917" /></a></p>
<h3>Mohan Ganesalingam</h3>
<p>There is quite a lot to say about these results, and also about the program. Most of the rest of this post is written jointly with my collaborator Mohan Ganesalingam (as indeed were the posts with the two experiments &#8212; we chose the wording very carefully), but first let me say a little about who Mohan is.<br />
<span id="more-4915"></span></p>
<p>Officially he is a computer scientist who got a PhD in computational linguistics about three years ago. However, as an undergraduate he was a mathematician, and not just any old mathematician: he was Senior Wrangler in his year, <a href="http://en.wikipedia.org/wiki/Senior_Wrangler_(University_of_Cambridge)">which puts him in some illustrious company</a>, and he could certainly have gone on to do research in mathematics if he had been so inclined. However, he took a different and more interesting path, beginning with an MA in Anglo-Saxon, where he won a university prize for the best results in the Cambridge English Faculty, and continuing with a PhD in computer science, where he gave a complete analysis of formal mathematical language. Rather than say what &#8220;complete analysis&#8221; means, I&#8217;ll just say that with Tom Barnet-Lamb (another former Senior Wrangler, who will be known to algebraic number theorists for some major recent papers with Toby Gee, David Geraghty and Richard Taylor &#8212; I suppose I should mention that Toby Gee is yet another former Senior Wrangler; I don&#8217;t know about Richard Taylor) he has written a program that can take as its input mathematical text such as</p>
<p>&#8220;A closed subset of a compact set is compact.&#8221; </p>
<p>and convert it into a logical format such as</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cforall+x_1%2Cx_2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;forall x_1,x_2' title='&#92;forall x_1,x_2' class='latex' /> set<img src='http://s0.wp.com/latex.php?latex=%28x_1%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(x_1)' title='(x_1)' class='latex' /><br />
&#8230;&#8230;&#8230;&#8230;&#8230;compact<img src='http://s0.wp.com/latex.php?latex=%28x_1%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(x_1)' title='(x_1)' class='latex' /><br />
&#8230;&#8230;&#8230;&#8230;&#8230;subset_of<img src='http://s0.wp.com/latex.php?latex=%28x_2%2Cx_1%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(x_2,x_1)' title='(x_2,x_1)' class='latex' /><br />
&#8230;&#8230;&#8230;&#8230;&#8230;closed_in<img src='http://s0.wp.com/latex.php?latex=%28x_2%2C-%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(x_2,-)' title='(x_2,-)' class='latex' /><br />
&#8230;&#8230;&#8230;&#8230;&#8230;<img src='http://s0.wp.com/latex.php?latex=%5Cimplies&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;implies' title='&#92;implies' class='latex' /><br />
&#8230;&#8230;&#8230;&#8230;&#8230;compact<img src='http://s0.wp.com/latex.php?latex=%28x_2%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(x_2)' title='(x_2)' class='latex' /> </p>
<p>Mohan and Tom were interested in mathematical language primarily from the point of view of linguistics: the language that mathematicians use is simple enough to be tractable, but complex enough to involve many of the interesting difficulties that linguists face when studying natural language. However, as will be obvious, their program has many interesting potential applications.</p>
<p>Regular readers of this blog and, before the blog existed, of my web page may be aware that for a long time I have believed that at least the easy parts of mathematics could be done by a computer without the need for major leaps forward in artificial intelligence. I always thought that I would love to be able to do more than just say, &#8220;This surely ought to be possible,&#8221; but knew that I would never get round to doing it on my own. When Mohan became a research fellow at Trinity College, it was clear that if I ever wanted to take part in producing an actual program, I had an opportunity that was unlikely to be repeated, since Mohan was unbelievably well qualified for the project. Better still, he too was very interested in the idea of writing a theorem-proving program, and his ideas about how such a program should work were very similar to mine.</p>
<h3>How the collaboration has worked so far</h3>
<p>So we started collaborating about three years ago (shortly before Mohan got his PhD). Initially, most of the collaboration was by email, but after a while we moved over to what one might call &#8220;Duomath,&#8221; by which I mean a private blog where we wrote posts and commented on each other&#8217;s posts. At the last count, that blog had reached over a million words. We also met regularly for face-to-face conversations about where things were going.</p>
<p>During the three years of our collaboration, we talked about &#8220;the program&#8221; where what we meant was an algorithm that we had specified fairly precisely but not actually written as a program. We ended up radically redesigning this &#8220;program&#8221; at least twice: typically it would work well to start with, but then we would find ourselves dissatisfied with certain aspects of it &#8212; for example, sometimes there would be a rule governing its behaviour that we chose not because we had a good theoretical justification for it, but because it seemed to deal well with all the example problems that we were considering, and then we would think of examples where the rule led to &#8220;stupid&#8221; behaviour on the part of the program. But towards the end of 2012, Mohan, who was coming to the end of his fellowship and therefore badly needed publications, insisted that we should write an actual program and not be too perfectionist about it. We started work in January, and after ten days the program was up and running. We then worked on the writing-up side of things for another ten days, so the whole thing took three weeks. I say this to emphasize that the hard work had gone on during the preceding three years. </p>
<p>Another point we are keen to make is that we put almost no effort into the write-up part of the program. That is, we spent those 10 days on it, but the three years of preparation were devoted exclusively to the problem-solving part. The reason we care about this point is that we want to be able to claim that our program tackles problems in a human way. The fact that rather crude methods could be used to convert the program&#8217;s main output into a human-style write-up helps us to back up that claim.</p>
<p>It is also worth repeating that when we created the write-up part of the program, we were not doing so with the second experiment in mind: that is, we were not planning to ask people to see whether they could distinguish between human write-ups and our program&#8217;s write-ups. However, after the first experiment we felt an obligation to do so. If we <em>had</em> intended to make our write-ups indistinguishable from human write-ups, then at the very least we would made them more varied &#8212; for example, not ending them all in the same way with the words &#8220;we are done&#8221;. So the fact that a lot of people still got the answer wrong took us by surprise.</p>
<h3>A rough idea of how the program works</h3>
<p>To begin with we would like to be clear that nothing in this program is radically new: almost all the techniques we use would, for instance, have been familiar to <a href="http://en.wikipedia.org/wiki/Woody_Bledsoe">Woody Bledsoe</a>, a pioneer in the use of &#8220;human methods&#8221; in automatic theorem proving. We believe that the way we have come up with the program <em>is</em> new, but for somewhat subtle reasons that we will not go into in this blog post. One effect of this is that it is easy for us to produce human-readable write-ups, which as far as we are aware has not been done before. We hope that in the not too distant future our new approach will also be rewarded by a program that can solve problems that no previous program has managed to solve, but we do not make that claim of the program we have so far.</p>
<p>Broadly speaking, what the program does is start with a collection of statements, some of which we call <em>hypotheses</em> (that is, things it is allowed to assume) and some of which we call <em>targets</em> (that is, things it is trying to prove). It then continually modifies this collection of statements until it reaches an obvious stopping point. Depending on precisely how we decide to implement the algorithm, this stopping point could be when it has no targets left, or when it has replaced all targets by DONE, or when all targets are also present as hypotheses: at any rate, it stops when the problem is clearly solved.</p>
<p>Of course, it won&#8217;t solve every problem it is given, so another major reason for it to halt is that it can&#8217;t see anything to do.</p>
<p>Our main priority when writing the program was that the steps it took should be ones that a human would naturally take. One could express that in the form of a slogan: the program should not do silly things.</p>
<p>One thing that humans do <em>not</em> do is rewrite all their statements in low-level language and work directly from the axioms of set theory. So our program doesn&#8217;t do that either. It is happy to deal with high-level concepts, and it also has a (so far very small) library of statements that it is allowed to assume.</p>
<h4>A worked example</h4>
<p>If we give an example of how the program works through a problem, it will greatly clarify what we have just said. Let us take Problem 2 from the list, the one that asks you to show that if <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is continuous and <img src='http://s0.wp.com/latex.php?latex=U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='U' title='U' class='latex' /> is open then <img src='http://s0.wp.com/latex.php?latex=f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f^{-1}(U)' title='f^{-1}(U)' class='latex' /> is open. The initial input to the program would be this.</p>
<p><img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is continuous<br />
<img src='http://s0.wp.com/latex.php?latex=U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='U' title='U' class='latex' /> is open<br />
&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;<br />
<img src='http://s0.wp.com/latex.php?latex=f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f^{-1}(U)' title='f^{-1}(U)' class='latex' /> is open</p>
<p>There would also be some &#8220;background information&#8221; saying that <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Y' title='Y' class='latex' /> are metric spaces and that <img src='http://s0.wp.com/latex.php?latex=f%3AX%5Cto+Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f:X&#92;to Y' title='f:X&#92;to Y' class='latex' />. And the program has a library that includes things like the epsilon-delta definition of continuity and the definition of open sets. Many people commented on the fact that the program didn&#8217;t use open balls. This is not some fundamental fact about the program &#8212; we could have got it to reason using open balls by changing not the program but the data it worked with. To do this, we would have had to tell it the open-balls definitions of continuity and open sets, and a few basic facts about open balls such as that if <img src='http://s0.wp.com/latex.php?latex=r%5Cleq+s&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r&#92;leq s' title='r&#92;leq s' class='latex' /> then <img src='http://s0.wp.com/latex.php?latex=B_r%28x%29%5Csubset+B_s%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_r(x)&#92;subset B_s(x)' title='B_r(x)&#92;subset B_s(x)' class='latex' />. </p>
<p>However, let us accept that in this instance the program is aiming to behave like a human who wants to go back to epsilons and deltas. In that case, the natural first step is to expand the target &#8212; that is, to say in lower-level language what needs to be proved if we want to show that <img src='http://s0.wp.com/latex.php?latex=f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f^{-1}(U)' title='f^{-1}(U)' class='latex' /> is open. The answer is that it needs to show that if <img src='http://s0.wp.com/latex.php?latex=x%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in f^{-1}(U)' title='x&#92;in f^{-1}(U)' class='latex' />, then there is some <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta&gt;0' title='&#92;delta&gt;0' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=y%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in f^{-1}(U)' title='y&#92;in f^{-1}(U)' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;delta' title='d(x,y)&lt;&#92;delta' class='latex' />. So the program moves to the following collection of statements. (Of course, the way it does that can be broken up further, but it is sufficiently obvious that that is a purely mechanical process that we will not go into that level of detail.)</p>
<p><img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is continuous<br />
<img src='http://s0.wp.com/latex.php?latex=U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='U' title='U' class='latex' /> is open<br />
<img src='http://s0.wp.com/latex.php?latex=x%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in f^{-1}(U)' title='x&#92;in f^{-1}(U)' class='latex' /><br />
&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;<br />
<img src='http://s0.wp.com/latex.php?latex=%5Cexists+%5Cdelta%5C+%5Cforall+y%5C+d%28x%2Cy%29%3C%5Cdelta%5Cimplies+y%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;exists &#92;delta&#92; &#92;forall y&#92; d(x,y)&lt;&#92;delta&#92;implies y&#92;in f^{-1}(U)' title='&#92;exists &#92;delta&#92; &#92;forall y&#92; d(x,y)&lt;&#92;delta&#92;implies y&#92;in f^{-1}(U)' class='latex' /> </p>
<p>Humans typically rewrite (or rethink) statements of the form <img src='http://s0.wp.com/latex.php?latex=a%5Cin+f%5E%7B-1%7D%28B%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a&#92;in f^{-1}(B)' title='a&#92;in f^{-1}(B)' class='latex' /> in the form <img src='http://s0.wp.com/latex.php?latex=f%28a%29%5Cin+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(a)&#92;in B' title='f(a)&#92;in B' class='latex' />. We therefore have a rule that says that statements of the form <img src='http://s0.wp.com/latex.php?latex=a%5Cin+f%5E%7B-1%7D%28B%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a&#92;in f^{-1}(B)' title='a&#92;in f^{-1}(B)' class='latex' /> should be rewritten in that way. So the program quickly does that.</p>
<p><img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is continuous<br />
<img src='http://s0.wp.com/latex.php?latex=U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='U' title='U' class='latex' /> is open<br />
<img src='http://s0.wp.com/latex.php?latex=f%28x%29%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(x)&#92;in U' title='f(x)&#92;in U' class='latex' /><br />
&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;<br />
<img src='http://s0.wp.com/latex.php?latex=%5Cexists+%5Cdelta%5C+%5Cforall+y%5C+d%28x%2Cy%29%3C%5Cdelta%5Cimplies+y%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;exists &#92;delta&#92; &#92;forall y&#92; d(x,y)&lt;&#92;delta&#92;implies y&#92;in f^{-1}(U)' title='&#92;exists &#92;delta&#92; &#92;forall y&#92; d(x,y)&lt;&#92;delta&#92;implies y&#92;in f^{-1}(U)' class='latex' /> </p>
<p>However, it doesn&#8217;t do it to the substatement <img src='http://s0.wp.com/latex.php?latex=y%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in f^{-1}(U)' title='y&#92;in f^{-1}(U)' class='latex' /> because in our judgment a typical human wouldn&#8217;t be thinking about that yet. Instead, it spots that the statement <img src='http://s0.wp.com/latex.php?latex=f%28x%29%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(x)&#92;in U' title='f(x)&#92;in U' class='latex' /> is exactly the kind of &#8220;input&#8221; that is demanded by the statement &#8220;<img src='http://s0.wp.com/latex.php?latex=U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='U' title='U' class='latex' /> is open&#8221;, and it goes ahead and applies that statement.</p>
<p><img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is continuous<br />
<img src='http://s0.wp.com/latex.php?latex=%5Cforall+u%5C+d%28f%28x%29%2Cu%29%3C%5Ceta%5Cimplies+u%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;forall u&#92; d(f(x),u)&lt;&#92;eta&#92;implies u&#92;in U' title='&#92;forall u&#92; d(f(x),u)&lt;&#92;eta&#92;implies u&#92;in U' class='latex' /><br />
&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;<br />
<img src='http://s0.wp.com/latex.php?latex=%5Cexists+%5Cdelta%5C+%5Cforall+y%5C+d%28x%2Cy%29%3C%5Cdelta%5Cimplies+y%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;exists &#92;delta&#92; &#92;forall y&#92; d(x,y)&lt;&#92;delta&#92;implies y&#92;in f^{-1}(U)' title='&#92;exists &#92;delta&#92; &#92;forall y&#92; d(x,y)&lt;&#92;delta&#92;implies y&#92;in f^{-1}(U)' class='latex' /> </p>
<p>A few comments are in order here. First, one might think that the new hypothesis should begin with <img src='http://s0.wp.com/latex.php?latex=%5Cexists+%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;exists &#92;eta' title='&#92;exists &#92;eta' class='latex' />, but the program automatically applies an operation that we call &#8220;peeling&#8221; to get rid of that existential quantifier. This operation corresponds to the kind of reasoning that humans do without thinking: we establish a statement like <img src='http://s0.wp.com/latex.php?latex=%5Cexists+%5Ceta%5C+P%28%5Ceta%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;exists &#92;eta&#92; P(&#92;eta)' title='&#92;exists &#92;eta&#92; P(&#92;eta)' class='latex' /> and from that moment we refer to <img src='http://s0.wp.com/latex.php?latex=%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;eta' title='&#92;eta' class='latex' /> as though it has been chosen, without bothering to say, &#8220;Let <img src='http://s0.wp.com/latex.php?latex=%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;eta' title='&#92;eta' class='latex' /> be such that <img src='http://s0.wp.com/latex.php?latex=P%28%5Ceta%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P(&#92;eta)' title='P(&#92;eta)' class='latex' />.&#8221; We want the program to behave in a human way, so it too quietly gets rid of existential quantifiers that begin hypotheses. (In a similar way it &#8220;peels&#8221; universal quantifiers in front of targets. This corresponds to the human practice of taking a target of the form <img src='http://s0.wp.com/latex.php?latex=%5Cforall+x%5C+P%28x%29%5Cimplies+Q%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;forall x&#92; P(x)&#92;implies Q(x)' title='&#92;forall x&#92; P(x)&#92;implies Q(x)' class='latex' />, saying, &#8220;Let <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> be such that <img src='http://s0.wp.com/latex.php?latex=P%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P(x)' title='P(x)' class='latex' />,&#8221; and concentrating on proving <img src='http://s0.wp.com/latex.php?latex=Q%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Q(x)' title='Q(x)' class='latex' />.) </p>
<p>Another point to mention is that the hypotheses <img src='http://s0.wp.com/latex.php?latex=x%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in U' title='x&#92;in U' class='latex' /> and &#8220;<img src='http://s0.wp.com/latex.php?latex=U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='U' title='U' class='latex' /> is open&#8221; have been perceived by the program as &#8220;used up&#8221; and therefore deleted. This is because a typical human would be aware that these statements are very unlikely to be needed again. Deletion of hypotheses is not an entirely straightforward matter, because there are many circumstances in which a statement is used more than once. However, we have developed some rules that do a reasonable job of deleting hypotheses when a human would think they were obviously used up. (These rules work OK for now, but will undoubtedly need to be replaced by more sophisticated rules when we start to tackle harder problems.)</p>
<p>Deleting hypotheses is not something that the program needs in order to solve these problems successfully. This is an example of how our aims are different from conventional automatic theorem provers: we are very keen to model human mathematicians, so we have thought hard about how it is that they are aware that statements are &#8220;used up&#8221;, even though the advantage of doing so (in terms of time saved by not looking at those statements again when searching for further steps to take) is negligible.</p>
<p>The program is now slightly stuck, so it performs an operation we call &#8220;suspension&#8221;: it pretends that it has chosen <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' />, so that it can get inside the existential quantifier that begins the target and start breaking up the conditional statement inside. To indicate that <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> is yet to be chosen, it is marked with a bullet. (Actually, this is not precisely what the program does, but it is similar, and to explain what the program does that is slightly different from this would take us to an unnecessary level of detail.) After suspension, the problem looks like this.</p>
<p><img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is continuous<br />
<img src='http://s0.wp.com/latex.php?latex=%5Cforall+u%5C+d%28f%28x%29%2Cu%29%3C%5Ceta%5Cimplies+u%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;forall u&#92; d(f(x),u)&lt;&#92;eta&#92;implies u&#92;in U' title='&#92;forall u&#92; d(f(x),u)&lt;&#92;eta&#92;implies u&#92;in U' class='latex' /><br />
&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8211;<br />
<img src='http://s0.wp.com/latex.php?latex=%5Cforall+y%5C+d%28x%2Cy%29%3C%5Cdelta%5E%5Cbullet%5Cimplies+y%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;forall y&#92; d(x,y)&lt;&#92;delta^&#92;bullet&#92;implies y&#92;in f^{-1}(U)' title='&#92;forall y&#92; d(x,y)&lt;&#92;delta^&#92;bullet&#92;implies y&#92;in f^{-1}(U)' class='latex' /></p>
<p>However, the program immediately peels <img src='http://s0.wp.com/latex.php?latex=y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y' title='y' class='latex' /> and splits up the conditional, so we get to this.</p>
<p><img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is continuous<br />
<img src='http://s0.wp.com/latex.php?latex=%5Cforall+u%5C+d%28f%28x%29%2Cu%29%3C%5Ceta%5Cimplies+u%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;forall u&#92; d(f(x),u)&lt;&#92;eta&#92;implies u&#92;in U' title='&#92;forall u&#92; d(f(x),u)&lt;&#92;eta&#92;implies u&#92;in U' class='latex' /><br />
<img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Cdelta%5E%5Cbullet%5B%5Coverline+y%5D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;delta^&#92;bullet[&#92;overline y]' title='d(x,y)&lt;&#92;delta^&#92;bullet[&#92;overline y]' class='latex' /><br />
&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;<br />
<img src='http://s0.wp.com/latex.php?latex=y%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in f^{-1}(U)' title='y&#92;in f^{-1}(U)' class='latex' /></p>
<p>It is important that, like a human, the program should be aware that <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> is not allowed to depend on <img src='http://s0.wp.com/latex.php?latex=y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y' title='y' class='latex' />. That is what is indicated by the notation <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%5E%5Cbullet%5B%5Coverline+y%5D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta^&#92;bullet[&#92;overline y]' title='&#92;delta^&#92;bullet[&#92;overline y]' class='latex' />.</p>
<p>Now that the statement <img src='http://s0.wp.com/latex.php?latex=y%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in f^{-1}(U)' title='y&#92;in f^{-1}(U)' class='latex' /> has been isolated, it is automatically rewritten.</p>
<p><img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is continuous<br />
<img src='http://s0.wp.com/latex.php?latex=%5Cforall+u%5C+d%28f%28x%29%2Cu%29%3C%5Ceta%5Cimplies+u%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;forall u&#92; d(f(x),u)&lt;&#92;eta&#92;implies u&#92;in U' title='&#92;forall u&#92; d(f(x),u)&lt;&#92;eta&#92;implies u&#92;in U' class='latex' /><br />
<img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Cdelta%5E%5Cbullet%5B%5Coverline+y%5D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;delta^&#92;bullet[&#92;overline y]' title='d(x,y)&lt;&#92;delta^&#92;bullet[&#92;overline y]' class='latex' /><br />
&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;<br />
<img src='http://s0.wp.com/latex.php?latex=f%28y%29%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(y)&#92;in U' title='f(y)&#92;in U' class='latex' /></p>
<p>The program is now in a position to do some &#8220;backwards reasoning&#8221;: that is, replacement of a target <img src='http://s0.wp.com/latex.php?latex=Q%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Q(x)' title='Q(x)' class='latex' /> by a new target <img src='http://s0.wp.com/latex.php?latex=P%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P(x)' title='P(x)' class='latex' /> when there is a hypothesis <img src='http://s0.wp.com/latex.php?latex=%5Cforall+u%5C+P%28u%29%5Cimplies+Q%28u%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;forall u&#92; P(u)&#92;implies Q(u)' title='&#92;forall u&#92; P(u)&#92;implies Q(u)' class='latex' />. So it does it and deletes the second hypothesis.</p>
<p><img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is continuous<br />
<img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Cdelta%5E%5Cbullet%5B%5Coverline+y%5D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;delta^&#92;bullet[&#92;overline y]' title='d(x,y)&lt;&#92;delta^&#92;bullet[&#92;overline y]' class='latex' /><br />
&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;<br />
<img src='http://s0.wp.com/latex.php?latex=d%28f%28x%29%2Cf%28y%29%29%3C%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(f(x),f(y))&lt;&#92;eta' title='d(f(x),f(y))&lt;&#92;eta' class='latex' /></p>
<p>Now it can do some more backwards reasoning (of a slightly more complicated nature because there are more quantifiers involved) using the continuity hypothesis, so it does that. It knows the expansion of &#8220;<img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is continuous&#8221; in terms of epsilons and deltas and uses that and a matching algorithm to spot that it can do this, but it doesn&#8217;t actually expand the definition, since an experienced human normally wouldn&#8217;t bother to do so.</p>
<p><img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Cdelta%5E%5Cbullet%5B%5Coverline+y%5D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;delta^&#92;bullet[&#92;overline y]' title='d(x,y)&lt;&#92;delta^&#92;bullet[&#92;overline y]' class='latex' /><br />
&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;-<br />
<img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Ctheta%5Bx%2C%5Ceta%5D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;theta[x,&#92;eta]' title='d(x,y)&lt;&#92;theta[x,&#92;eta]' class='latex' /></p>
<p>Here the notation <img src='http://s0.wp.com/latex.php?latex=%5Ctheta%5Bx%2C%5Ceta%5D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;theta[x,&#92;eta]' title='&#92;theta[x,&#92;eta]' class='latex' /> indicates that <img src='http://s0.wp.com/latex.php?latex=%5Ctheta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;theta' title='&#92;theta' class='latex' /> depends on <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;eta' title='&#92;eta' class='latex' />. (It seems that humans are more conscious of dependencies for variables that arise in certain ways and <em>in</em>dependencies for variables that arise in other ways, and that turns out to be convenient when writing the program too.)</p>
<p>Finally, the program notes that it can get the one remaining hypothesis to match the target if it sets <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> equal to <img src='http://s0.wp.com/latex.php?latex=%5Ctheta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;theta' title='&#92;theta' class='latex' />. It also checks that <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> is allowed to depend on <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;eta' title='&#92;eta' class='latex' />, which it is. (All that happens here is that it checks that the list of variables that <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> must not depend on is disjoint from the list of variables that <img src='http://s0.wp.com/latex.php?latex=%5Ctheta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;theta' title='&#92;theta' class='latex' /> depends on.)</p>
<h4>Writing up</h4>
<p>We stated earlier that the writing-up part of the program is quite crude. The main properties that the writing up has (neither of which we shall attempt to define precisely) are that it is <em>local</em> and <em>faithful</em>. The second property means that the sentences that appear in the write-up are basically translations of things that the program does in the same order, with none of the program&#8217;s working hidden, and the first means that the each sentence of the write-up basically depends just on one step, or perhaps a sequence of one or two adjacent steps, that the program takes.</p>
<p>So the first thing the write-up algorithm does is associate with each step a translation. For example, if the step is to go from</p>
<p><img src='http://s0.wp.com/latex.php?latex=P%28x%2CZ%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P(x,Z)' title='P(x,Z)' class='latex' /><br />
<img src='http://s0.wp.com/latex.php?latex=Z&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Z' title='Z' class='latex' /> has property <img src='http://s0.wp.com/latex.php?latex=R&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='R' title='R' class='latex' /></p>
<p>to </p>
<p><img src='http://s0.wp.com/latex.php?latex=Q%28x%2Cz%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Q(x,z)' title='Q(x,z)' class='latex' /></p>
<p>where the statement &#8220;<img src='http://s0.wp.com/latex.php?latex=Z&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Z' title='Z' class='latex' /> has property <img src='http://s0.wp.com/latex.php?latex=R&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='R' title='R' class='latex' />&#8221; expands to </p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cforall+u%5C+P%28u%2CZ%29%5Cimplies+Q%28u%2CZ%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;forall u&#92; P(u,Z)&#92;implies Q(u,Z)' title='&#92;forall u&#92; P(u,Z)&#92;implies Q(u,Z)' class='latex' /></p>
<p>then the write-up algorithm will translate that into something like, &#8220;Since <img src='http://s0.wp.com/latex.php?latex=P%28x%2CZ%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P(x,Z)' title='P(x,Z)' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=Z&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Z' title='Z' class='latex' /> has property <img src='http://s0.wp.com/latex.php?latex=R&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='R' title='R' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=Q%28x%2CZ%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Q(x,Z)' title='Q(x,Z)' class='latex' />.&#8221;</p>
<p>At this point one could get a horrible but just about comprehensible write-up by simply concatenating all these individual translations. However, it would have several defects. Most notably, it would be repetitive in a way that no human would. For instance, it would say things like this: &#8220;Let <img src='http://s0.wp.com/latex.php?latex=x%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in f^{-1}(U)' title='x&#92;in f^{-1}(U)' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=x%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in f^{-1}(U)' title='x&#92;in f^{-1}(U)' class='latex' />, it follows that <img src='http://s0.wp.com/latex.php?latex=f%28x%29%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(x)&#92;in U' title='f(x)&#92;in U' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=f%28x%29%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(x)&#92;in U' title='f(x)&#92;in U' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='U' title='U' class='latex' /> is open, it follows that there exists <img src='http://s0.wp.com/latex.php?latex=%5Ceta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;eta&gt;0' title='&#92;eta&gt;0' class='latex' /> such that &#8230;&#8221; </p>
<p>We deal with that by removing hypotheses from sentences when they have only just been established and are therefore already &#8220;ringing in the ears&#8221; of the reader. So the above fragment might be reduced to &#8220;Let <img src='http://s0.wp.com/latex.php?latex=x%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in f^{-1}(U)' title='x&#92;in f^{-1}(U)' class='latex' />. Then <img src='http://s0.wp.com/latex.php?latex=f%28x%29%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(x)&#92;in U' title='f(x)&#92;in U' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='U' title='U' class='latex' /> is open, it follows that there exists <img src='http://s0.wp.com/latex.php?latex=%5Ceta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;eta&gt;0' title='&#92;eta&gt;0' class='latex' /> such that &#8230;&#8221; Actually, it is not reduced to exactly that, but we are already pretty close.</p>
<p>Other tweaks we performed were messing about with the rules for how to say that one thing follows from another (do we say &#8220;Since <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=Q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Q' title='Q' class='latex' />,&#8221; or &#8220;<img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' />. Therefore <img src='http://s0.wp.com/latex.php?latex=Q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Q' title='Q' class='latex' />.&#8221; or &#8220;We know that <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' />. It follows that <img src='http://s0.wp.com/latex.php?latex=Q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Q' title='Q' class='latex' />.&#8221; or what?) until the write-ups that the program produced were not too repetitive and had the right sort of flow. By &#8220;flow&#8221; we mean that sometimes one has a chain of forward steps, and one wants to convey that one is chugging along through the proof, whereas at other times the forwards reasoning gets stuck and one has to turn to backwards reasoning and there should be more of a feeling of a new idea coming in or some almost forgotten hypothesis being resurrected. Words like &#8220;but&#8221; and &#8220;now&#8221; help to convey these kinds of things.</p>
<p>And that is more or less all there was to it. We have not (yet) done a proper job on the write-up algorithm, in the sense that we have not sat down and carefully analysed things like what it is that makes humans decide to write &#8220;but&#8221; or &#8220;it follows that&#8221; when they do. As a result, if you give the program new problems to solve, it tends to produce write-ups with slightly weird wording in places. So in that sense it is not fully robust: we have shown you the output that looks nice, but if you had a chance to choose your own input you would immediately produce output that looks slightly stilted in the ways just described, though it would be comprehensible and would still be English rather than some special-purpose formal language. </p>
<h4>How the program chooses what to do next</h4>
<p>The basic method we use is extremely simple. We give the program an ordered list of operations it can do, many of which were used in the worked example above. It then behaves greedily: that is, it repeatedly chooses the first operation it can do. The word &#8220;greedily&#8221; is a slight exaggeration though: for one thing we have chosen the ordering quite carefully, and for another some of the moves have constraints that must be satisfied if they are to be done. </p>
<p>We do not expect this very rudimentary architecture to survive for ever, but it works well for routine problems. (In a moment we shall say more precisely what we mean by &#8220;routine&#8221;.)</p>
<h3>Constraints on the program</h3>
<p>Going back to the proof-finding aspect of the program, there are three main constraints that we have imposed on ourselves. The first two are concerned with making it behave in a human way.</p>
<p><strong>1.</strong> <em>The program should not find a step hard if an experienced human finds it the obvious thing to do.</em></p>
<p><strong>2.</strong> <em>The program should not be willing to do a step if an experienced human would be very reluctant to do it.</em></p>
<p>The point of the first constraint is clear: if a human finds a step obvious but the program finds it only after some big search with a lot of backtracking, say, then we have not understood something about how humans do mathematics. (That is not quite true: it could be that we understand how humans do it, but find it hard to model &#8212; that may well be the case, for instance, if visualization plays a big role, but we are avoiding that problem for the time being.)</p>
<p>We (informally) define a problem to be <em>routine</em> if an experienced human can solve the problem easily without backtracking. Combining this definition with the first constraint above, we obtain the following consequence.</p>
<p><em>The program should be able to solve routine problems without backtracking.</em></p>
<p>A big advantage of this no-backtracking principle is that if you give the program a statement that it cannot prove (possibly because it is false), then it does a few sensible steps and gets stuck, rather than producing pages and pages of garbage.</p>
<p>As for the second constraint, the kind of thing we want to rule out is an undue willingness to use brute force, whether that means big messy calculations or large searches. This &#8220;undue willingness&#8221; often arises when one tries to produce methods that are too general. To give a very simple example, a pocket calculator when asked to multiply 30 by 70 will probably use the same method that it uses when asked to multiply 29 by 73. To give a slightly less simple example, if an algebra package was given the equation </p>
<p><img src='http://s0.wp.com/latex.php?latex=%28x%2B1%29%5E3%3D3%28x%2B1%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(x+1)^3=3(x+1)' title='(x+1)^3=3(x+1)' class='latex' /></p>
<p>to solve, it might well expand the brackets and collect terms before starting. If it did so, then even if it had no trouble with the resulting calculations it would fall foul of our second constraint, since a human would instantly see that the equation was easier to solve in the form in which it is already presented.</p>
<p>The third constraint has a rather different justification. It is intended to stop us &#8220;cheating&#8221; by hardwiring solutions of problems into the program itself.</p>
<p><strong>3.</strong> <em>The methods that the program uses should not be specific to any domain of mathematics. Any domain-specific behaviour should be the result of </em>data<em> that the program is given to work with.</em></p>
<p>Why impose any constraints at all? Surely if one is trying to produce a program that can prove as many theorems as possible, the fewer constraints there are the better? </p>
<p>The simple answer to that is that we are <em>not</em> trying to produce a program that can prove as many theorems as possible. Rather, we are trying with our program to build a foundation for more elaborate programs. We strongly believe that if we can create a program that solves easy problems the &#8220;right&#8221; way, then it will be much easier to extend it to programs that can solve harder problems.</p>
<h3>A few things that the program cannot (yet) do</h3>
<p>There are many very important things that the program cannot do. Here we list just a few of our immediate targets.</p>
<h4>1. Second-order quantification</h4>
<p>We said in the previous post that the program cannot solve non-routine problems and in fact cannot solve more than a small class of routine problems. However, during the last three years we have worked out rather more than is exhibited in this program. For example, we are close to being able to write a program that can handle second-order quantification. That will allow it to solve two major classes of problems: ones that involve producing sequences, and ones that involve producing open covers. A target problem that we essentially know how to do is the proof that a compact subset of a Hausdorff topological space is closed.</p>
<h4>2. Algebra</h4>
<p>At the moment, the level of granularity at which the program works is that of <em>statements</em>. For example, if it has a statement <img src='http://s0.wp.com/latex.php?latex=P%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P(x)' title='P(x)' class='latex' /> and a statement <img src='http://s0.wp.com/latex.php?latex=%5Cforall+u%5C+P%28u%29%5Cimplies+Q%28u%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;forall u&#92; P(u)&#92;implies Q(u)' title='&#92;forall u&#92; P(u)&#92;implies Q(u)' class='latex' />, then it notes the match between <img src='http://s0.wp.com/latex.php?latex=P%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P(x)' title='P(x)' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=P%28u%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P(u)' title='P(u)' class='latex' /> and generates the statement <img src='http://s0.wp.com/latex.php?latex=Q%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Q(x)' title='Q(x)' class='latex' />. However, a great deal of mathematical reasoning, especially in algebra, takes place at the level of <em>terms</em>: you have a term that you want to make into another term by performing various allowable operations. Our program does not yet do this, and therefore there is a large class of routine problems that it is not yet capable of solving. </p>
<p>Dealing with equality is a well-known difficulty in the automatic theorem proving literature. In many approaches the problem is that without big restrictions on when you can substitute one equal quantity for another, a program is liable to make many irrelevant observations. It seems, however, that many of the ideas that have gone into our program so far can be modified to deal with term rewriting for sufficiently routine problems. We hope to make an attempt in this direction in the not too distant future.</p>
<p>One option that we do not wish to pursue is using a standard package such as Maple or Mathematica to do algebraic manipulations for us. This would be completely contrary to the spirit of our program, since it would not be modelling the way humans rewrite terms while solving routine problems. In the short term, it might be taking the easy way out, but in the longer term we would pay for not adequately understanding what human mathematicians do. (Of course, sometimes even humans have to do very complicated calculations. In such situations, a program could always say, just as a human might, &#8220;This calculation looks as though the answer would be useful, but it&#8217;s very complicated. I think I&#8217;ll use an algebra package.&#8221; But this would be regarded as an extremely costly option.)</p>
<h4>3. Non-trivial existence problems</h4>
<p>If the program needs to prove a statement that begins with an existential quantifier, then at the moment it has three basic methods, the first of which is a trivial case of the second. For the second, recall that suspension is the operation of putting a bullet on a variable to indicate that you are postponing the decision about what to substitute for it.</p>
<ol>
<li>Direct substitution: it needs to prove a statement of the form <img src='http://s0.wp.com/latex.php?latex=%5Cexists+u%5C+P%28u%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;exists u&#92; P(u)' title='&#92;exists u&#92; P(u)' class='latex' /> and it has a hypothesis of the form <img src='http://s0.wp.com/latex.php?latex=P%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P(x)' title='P(x)' class='latex' />, so it sets <img src='http://s0.wp.com/latex.php?latex=u%3Dx&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u=x' title='u=x' class='latex' />.</li>
<li>Direct substitution after suspension and simplification: it needs to prove a statement of the form <img src='http://s0.wp.com/latex.php?latex=%5Cexists+u%5C+P%28u%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;exists u&#92; P(u)' title='&#92;exists u&#92; P(u)' class='latex' />, suspends the variable <img src='http://s0.wp.com/latex.php?latex=u&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u' title='u' class='latex' />, carries out some steps, and then at a later stage finds that it can substitute directly for <img src='http://s0.wp.com/latex.php?latex=u&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u' title='u' class='latex' />.</li>
<li>Use of the library: it has a few &#8220;standard&#8221; solutions in the library, such as the fact that if <img src='http://s0.wp.com/latex.php?latex=%5Calpha&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;alpha' title='&#92;alpha' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5Cbeta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;beta' title='&#92;beta' class='latex' /> are positive you want a positive number that is at most as big as <img src='http://s0.wp.com/latex.php?latex=%5Calpha&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;alpha' title='&#92;alpha' class='latex' /> and at most as big as <img src='http://s0.wp.com/latex.php?latex=%5Cbeta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;beta' title='&#92;beta' class='latex' />, then you can take <img src='http://s0.wp.com/latex.php?latex=%5Cmin%5C%7B%5Calpha%2C%5Cbeta%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;min&#92;{&#92;alpha,&#92;beta&#92;}' title='&#92;min&#92;{&#92;alpha,&#92;beta&#92;}' class='latex' />. (It used that at the end of its discovery of a proof that the intersection of two open sets is open.)</li>
</ol>
<p>It is surprising how far one can get with such simple methods, but there are many other common techniques. For example, a method that works for some simple problems in group theory is looking through a list of a few very basic examples of groups to see whether any of them happens to have the desired property. Another method that works in many situations is what one might call &#8220;guess and adjust&#8221;. That is, you begin with an example that you don&#8217;t expect to work, but you hope is in the right ball park. You then try to identify what needs to be changed to make it work. And many examples are not built or even approximately built in one go but are &#8220;constructed&#8221; using basic building blocks and methods for combining or extending them. We would certainly like to introduce more advanced techniques like these, but for now there are more urgent priorities.</p>
<h4>4. Backtracking</h4>
<p>We have already mentioned that our program does not backtrack. This is a different kind of inability from the ones above, since it does not concern routine problems (by our definition of &#8220;routine&#8221;). Quite clearly, humans backtrack constantly when they are solving non-routine problems. We would not be interested in this project if we did not hope eventually to tackle such problems, but that will require a great deal of thought: we will need to find a way of allowing the program to backtrack without indulging in &#8220;unreasonable&#8221; searches. Probably we will start by allowing it, under very controlled circumstances, to try slightly risky steps that will either be confirmed as good almost immediately or be taken back. More or less equivalently, we might occasionally allow the program to look a couple of steps ahead. An example of the kind of problem where this could be useful is in proving the cancellation law in group theory. We are given that <img src='http://s0.wp.com/latex.php?latex=ax%3Day&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='ax=ay' title='ax=ay' class='latex' /> and want to prove that <img src='http://s0.wp.com/latex.php?latex=x%3Dy&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x=y' title='x=y' class='latex' />. Multiplying both sides on the left by <img src='http://s0.wp.com/latex.php?latex=a%5E%7B-1%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a^{-1}' title='a^{-1}' class='latex' /> makes the expressions more complicated, but if we look slightly further ahead we see that we get exactly what we want. (This is, however, a rather tricky example: it seems that what makes it easy for humans is the analogy with cancellation laws in arithmetic.)</p>
<h3>The numbers in the polls</h3>
<p>The numbers in brackets represent the numbers before the Hacker-News invasion. I have now made the results of the polls visible, so you can check that the main set of results are as we say they are.</p>
<p>Problem 1.  (a=undergraduate, b=graduate, c=computer)</p>
<p>The computer-generated output is definitely (a): 123 votes (36)<br />
I think the computer-generated output is (a) but am not certain: 156 votes (48)<br />
The computer-generated output is definitely (b): 83 votes (12)<br />
I think the computer-generated output is (b) but am not certain: 139 votes (27)<br />
The computer-generated output is definitely (c): 266 votes (71)<br />
I think the computer-generated output is (c) but am not certain:  362 votes (100)<br />
I have no idea which write-up was computer generated: 131 votes (11)</p>
<p>Problem 2. (a=computer, b=undergraduate, c=graduate)</p>
<p>The computer-generated output is definitely (a): 140 votes (45)<br />
I think the computer-generated output is (a) but am not certain:  133 votes (55)<br />
The computer-generated output is definitely (b): 42 votes (13)<br />
I think the computer-generated output is (b) but am not certain: 75 votes (32)<br />
The computer-generated output is definitely (c): 10 votes (4)<br />
I think the computer-generated output is (c) but am not certain: 34 votes (14)<br />
I have no idea which write-up was computer generated: 42 votes (13)</p>
<p>Problem 3. (a=graduate, b=computer, c=undergraduate)</p>
<p>The computer-generated output is definitely (a): 24 votes (9)<br />
I think the computer-generated output is (a) but am not certain: 39 votes (18)<br />
The computer-generated output is definitely (b): 72 votes (17)<br />
I think the computer-generated output is (b) but am not certain: 89 votes (32)<br />
The computer-generated output is definitely (c): 42 votes (19)<br />
I think the computer-generated output is (c) but am not certain: 63  votes (36)<br />
I have no idea which write-up was computer generated: 48 votes (21)</p>
<p>Problem 4. (a=undergraduate, b=graduate, c=computer)</p>
<p>The computer-generated output is definitely (a): 27 votes (12)<br />
I think the computer-generated output is (a) but am not certain: 52 votes (29)<br />
The computer-generated output is definitely (b): 12 votes (4)<br />
I think the computer-generated output is (b) but am not certain: 32  votes (16)<br />
The computer-generated output is definitely (c): 82 votes (22)<br />
I think the computer-generated output is (c) but am not certain: 79 votes (41)<br />
I have no idea which write-up was computer generated: 38 votes (16)</p>
<p>Problem 5. (a=graduate, b=undergraduate, c=computer)</p>
<p>The computer-generated output is definitely (a): 12 votes (5)<br />
I think the computer-generated output is (a) but am not certain: 17  votes (7)<br />
The computer-generated output is definitely (b): 19 votes (6)<br />
I think the computer-generated output is (b) but am not certain:  46 votes (23)<br />
The computer-generated output is definitely (c) 110 votes (41)<br />
I think the computer-generated output is (c) but am not certain: 115 votes (52)<br />
I have no idea which write-up was computer generated: 49 votes (15)</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4915/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4915/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4915&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2013/04/14/answers-results-of-polls-and-a-brief-description-of-the-program/feed/</wfw:commentRss>
		<slash:comments>33</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>

		<media:content url="http://gowers.files.wordpress.com/2013/04/whichisthecomputer.png?w=205" medium="image">
			<media:title type="html">whichisthecomputer</media:title>
		</media:content>
	</item>
		<item>
		<title>A second experiment concerning mathematical writing.</title>
		<link>http://gowers.wordpress.com/2013/04/02/a-second-experiment-concerning-mathematical-writing/</link>
		<comments>http://gowers.wordpress.com/2013/04/02/a-second-experiment-concerning-mathematical-writing/#comments</comments>
		<pubDate>Tue, 02 Apr 2013 19:10:40 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[Computing]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4902</guid>
		<description><![CDATA[The time has come to reveal what the experiment in the previous post was about. As with many experiments (the most famous probably being that of Stanley Milgram about obedience to authority), its real purpose was not its ostensive purpose. Over the last three years, I have been collaborating with Mohan Ganesalingam, a computer scientist, [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4902&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>The time has come to reveal what the experiment in the previous post was about. As with many experiments (the most famous probably being that of Stanley Milgram about obedience to authority), its real purpose was not its ostensive purpose.</p>
<p>Over the last three years, I have been collaborating with Mohan Ganesalingam, a computer scientist, linguist and mathematician (and amazingly good at all three) on a project to write programs that can solve mathematical problems. We have recently produced our first program. It is rather rudimentary: the only problems it can solve are ones that mathematicians would describe as very routine and not requiring any ideas, and even within that class of problems there are considerable restrictions on what it can do; we plan to write more sophisticated programs in the future. However, each of the problems in the previous post belongs to the class of problems that it <em>can</em> solve, and for each problem one write-up was by an undergraduate mathematician, one by a mathematics PhD student and one by our program. (To be clear, the program was given the problems and produced the proofs and the write-ups with no further help. I will have more to say about how it works in future posts.) We wanted to see whether anybody would suspect that not all the write-ups were human-generated. Nobody gave the slightest hint that they did.</p>
<p>Of course, there is a world of difference between not noticing a difference that you have not been told to look out for, and being unable to detect that difference at all. Our aim was merely to be able to back up a claim that our program produces passable human-style output, so we did not want to subject that output to full Turing-test-style scrutiny, but you may, if you were kind enough to participate in the experiment, feel slightly cheated. Indeed, in a certain sense you <em>were</em> cheated &#8212; that was the whole point. It seems only fair to give you the chance to judge the write-ups again now that you know how they were produced. For each problem I have created a poll, and each poll has seven possible answers. These are:</p>
<p>The computer-generated output is definitely (a).<br />
I think the computer-generated output is (a) but am not certain.<br />
The computer-generated output is definitely (b).<br />
I think the computer-generated output is (b) but am not certain.<br />
The computer-generated output is definitely (c).<br />
I think the computer-generated output is (c) but am not certain.<br />
I have no idea which write-up was computer generated.</p>
<p>I would also be interested in comments about how you came to your judgments. All comments on both experiments and all votes in the polls will be kept private until I decide that it is time to finish the second experiment. A small remark is that I transcribed by hand all the write-ups into a form suitable for WordPress, so the existence of a typo in a write-up is not a trivial proof that it was by a human.</p>
<p>If you did not participate in the first experiment but nevertheless want to try this one, that&#8217;s fine. [<strong>Update:</strong> I have now closed the polls. Very soon Mohan and I will post the results and a discussion of them. <strong>Further update:</strong> The results now appear below. They appear displayed in possibly a more convenient way in <a href="http://gowers.wordpress.com/2013/04/14/answers-results-of-polls-and-a-brief-description-of-the-program/">this post, which also contains a discussion of how the program works</a>.]<br />
<span id="more-4902"></span></p>
<p><strong>Problem 1.</strong> Let <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' /> be open sets in a metric space. Then <img src='http://s0.wp.com/latex.php?latex=A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A&#92;cap B' title='A&#92;cap B' class='latex' /> is open.</p>
<p><strong>1(a)</strong> We want to show that for all <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=A+%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A &#92;cap B' title='A &#92;cap B' class='latex' />, there exists <img src='http://s0.wp.com/latex.php?latex=r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r' title='r' class='latex' /> such that the open ball <img src='http://s0.wp.com/latex.php?latex=B_x%28r%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_x(r)' title='B_x(r)' class='latex' /> is contained in <img src='http://s0.wp.com/latex.php?latex=A+%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A &#92;cap B' title='A &#92;cap B' class='latex' />.</p>
<p>Let <img src='http://s0.wp.com/latex.php?latex=x+%5Cin+A+%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x &#92;in A &#92;cap B' title='x &#92;in A &#92;cap B' class='latex' />. <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> is in <img src='http://s0.wp.com/latex.php?latex=A+%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A &#92;cap B' title='A &#92;cap B' class='latex' />, so <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> is in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> is in <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' /> are open, there exist <img src='http://s0.wp.com/latex.php?latex=a%2C+b&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a, b' title='a, b' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=B_a%28x%29+%5Csubset+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_a(x) &#92;subset A' title='B_a(x) &#92;subset A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=B_b%28x%29+%5Csubset+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_b(x) &#92;subset B' title='B_b(x) &#92;subset B' class='latex' />. Let <img src='http://s0.wp.com/latex.php?latex=r+%3D+%5Cmin%5C%7Ba%2C+b%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r = &#92;min&#92;{a, b&#92;}' title='r = &#92;min&#92;{a, b&#92;}' class='latex' />. Then <img src='http://s0.wp.com/latex.php?latex=B_r%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_r(x)' title='B_r(x)' class='latex' /> is contained in <img src='http://s0.wp.com/latex.php?latex=B_a%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_a(x)' title='B_a(x)' class='latex' />, so it&#8217;s contained in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' />. Similarly, <img src='http://s0.wp.com/latex.php?latex=B_r%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_r(x)' title='B_r(x)' class='latex' /> is contained in <img src='http://s0.wp.com/latex.php?latex=B_b%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_b(x)' title='B_b(x)' class='latex' />, so it&#8217;s contained in <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=B_r%28x%29+%5Csubset+A+%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_r(x) &#92;subset A &#92;cap B' title='B_r(x) &#92;subset A &#92;cap B' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=A+%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A &#92;cap B' title='A &#92;cap B' class='latex' /> is open.</p>
<p><strong>1(b)</strong> For arbitrary <img src='http://s0.wp.com/latex.php?latex=x%5Cin+X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in X' title='x&#92;in X' class='latex' />, let <img src='http://s0.wp.com/latex.php?latex=B_r%28x%29%3A%3D%5C%7By%5Cin+X%7Cd%28y%2Cx%29%3Cr%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_r(x):=&#92;{y&#92;in X|d(y,x)&lt;r&#92;}' title='B_r(x):=&#92;{y&#92;in X|d(y,x)&lt;r&#92;}' class='latex' />. Consider an arbitrary <img src='http://s0.wp.com/latex.php?latex=x%5Cin+A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in A&#92;cap B' title='x&#92;in A&#92;cap B' class='latex' />. As <img src='http://s0.wp.com/latex.php?latex=A%2CB&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A,B' title='A,B' class='latex' /> are open there are <img src='http://s0.wp.com/latex.php?latex=r%2Cr%27%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r,r&#039;&gt;0' title='r,r&#039;&gt;0' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=B_r%28x%29%5Csubset+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_r(x)&#92;subset A' title='B_r(x)&#92;subset A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=B_%7Br%27%7D%28x%29%5Csubset+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_{r&#039;}(x)&#92;subset B' title='B_{r&#039;}(x)&#92;subset B' class='latex' />. Take <img src='http://s0.wp.com/latex.php?latex=R%3D%5Cmin%28r%2Cr%27%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='R=&#92;min(r,r&#039;)' title='R=&#92;min(r,r&#039;)' class='latex' />. Then <img src='http://s0.wp.com/latex.php?latex=B_R%28x%29%5Csubset+B_r%28x%29%5Csubset+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_R(x)&#92;subset B_r(x)&#92;subset A' title='B_R(x)&#92;subset B_r(x)&#92;subset A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=B_R%28x%29%5Csubset+B_%7Br%27%7D%28x%29%5Csubset+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_R(x)&#92;subset B_{r&#039;}(x)&#92;subset B' title='B_R(x)&#92;subset B_{r&#039;}(x)&#92;subset B' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=B_R%28x%29%5Csubset+A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_R(x)&#92;subset A&#92;cap B' title='B_R(x)&#92;subset A&#92;cap B' class='latex' />. We&#8217;ve proved that for any <img src='http://s0.wp.com/latex.php?latex=x%5Cin+A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in A&#92;cap B' title='x&#92;in A&#92;cap B' class='latex' /> there is an open ball (<img src='http://s0.wp.com/latex.php?latex=B_R%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_R(x)' title='B_R(x)' class='latex' /> in this case) that contains <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> and is inside <img src='http://s0.wp.com/latex.php?latex=A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A&#92;cap B' title='A&#92;cap B' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A&#92;cap B' title='A&#92;cap B' class='latex' /> is open.</p>
<p><strong>1(c)</strong> Let <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> be an element of <img src='http://s0.wp.com/latex.php?latex=A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A&#92;cap B' title='A&#92;cap B' class='latex' />. Then <img src='http://s0.wp.com/latex.php?latex=x%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in A' title='x&#92;in A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=x%5Cin+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in B' title='x&#92;in B' class='latex' />. Therefore, since <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> is open, there exists <img src='http://s0.wp.com/latex.php?latex=%5Ceta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;eta&gt;0' title='&#92;eta&gt;0' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=u%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u&#92;in A' title='u&#92;in A' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cu%29%3C%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,u)&lt;&#92;eta' title='d(x,u)&lt;&#92;eta' class='latex' /> and since <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' /> is open, there exists <img src='http://s0.wp.com/latex.php?latex=%5Ctheta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;theta&gt;0' title='&#92;theta&gt;0' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=v%5Cin+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v&#92;in B' title='v&#92;in B' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cv%29%3C%5Ctheta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,v)&lt;&#92;theta' title='d(x,v)&lt;&#92;theta' class='latex' />. We would like to find <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta&gt;0' title='&#92;delta&gt;0' class='latex' /> s.t. <img src='http://s0.wp.com/latex.php?latex=y%5Cin+A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in A&#92;cap B' title='y&#92;in A&#92;cap B' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;delta' title='d(x,y)&lt;&#92;delta' class='latex' />. But <img src='http://s0.wp.com/latex.php?latex=y%5Cin+A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in A&#92;cap B' title='y&#92;in A&#92;cap B' class='latex' /> if and only if <img src='http://s0.wp.com/latex.php?latex=y%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in A' title='y&#92;in A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=y%5Cin+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in B' title='y&#92;in B' class='latex' />. We know that <img src='http://s0.wp.com/latex.php?latex=y%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in A' title='y&#92;in A' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;eta' title='d(x,y)&lt;&#92;eta' class='latex' /> and that <img src='http://s0.wp.com/latex.php?latex=y%5Cin+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in B' title='y&#92;in B' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Ctheta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;theta' title='d(x,y)&lt;&#92;theta' class='latex' />. Assume now that <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;delta' title='d(x,y)&lt;&#92;delta' class='latex' />. Then <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;eta' title='d(x,y)&lt;&#92;eta' class='latex' /> if <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%5Cleq%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta&#92;leq&#92;eta' title='&#92;delta&#92;leq&#92;eta' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Ctheta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;theta' title='d(x,y)&lt;&#92;theta' class='latex' /> if <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%5Cleq%5Ctheta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta&#92;leq&#92;theta' title='&#92;delta&#92;leq&#92;theta' class='latex' />. We may therefore take <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%3D%5Cmin%5C%7B%5Ceta%2C%5Ctheta%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta=&#92;min&#92;{&#92;eta,&#92;theta&#92;}' title='&#92;delta=&#92;min&#92;{&#92;eta,&#92;theta&#92;}' class='latex' /> and we are done.</p>
<a name="pd_a_7008584"></a>
<div class="PDS_Poll" id="PDI_container7008584" data-settings="{&quot;url&quot;:&quot;http:\/\/static.polldaddy.com\/p\/7008584.js&quot;}" style="display:inline-block;"></div>
<div id="PD_superContainer"></div>
<noscript><a href="http://polldaddy.com/poll/7008584">Take Our Poll</a></noscript>
<p><strong>Problem 2.</strong> Let <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Y' title='Y' class='latex' /> be metric spaces, let <img src='http://s0.wp.com/latex.php?latex=f%3AX-%3EY&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f:X-&gt;Y' title='f:X-&gt;Y' class='latex' /> be continuous, and let <img src='http://s0.wp.com/latex.php?latex=U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='U' title='U' class='latex' />  be an open subset of <img src='http://s0.wp.com/latex.php?latex=Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Y' title='Y' class='latex' />. Then <img src='http://s0.wp.com/latex.php?latex=f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f^{-1}(U)' title='f^{-1}(U)' class='latex' /> is an open subset of <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' />.</p>
<p><strong>2(a)</strong> Let <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> be an element of <img src='http://s0.wp.com/latex.php?latex=f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f^{-1}(U)' title='f^{-1}(U)' class='latex' />. Then <img src='http://s0.wp.com/latex.php?latex=f%28x%29%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(x)&#92;in U' title='f(x)&#92;in U' class='latex' />. Therefore, since <img src='http://s0.wp.com/latex.php?latex=U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='U' title='U' class='latex' /> is open, there exists <img src='http://s0.wp.com/latex.php?latex=%5Ceta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;eta&gt;0' title='&#92;eta&gt;0' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=u%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u&#92;in U' title='u&#92;in U' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28f%28x%29%2Cu%29%3C%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(f(x),u)&lt;&#92;eta' title='d(f(x),u)&lt;&#92;eta' class='latex' />. We would like to find <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta&gt;0' title='&#92;delta&gt;0' class='latex' /> s.t. <img src='http://s0.wp.com/latex.php?latex=y%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in f^{-1}(U)' title='y&#92;in f^{-1}(U)' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;delta' title='d(x,y)&lt;&#92;delta' class='latex' />. But <img src='http://s0.wp.com/latex.php?latex=y%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in f^{-1}(U)' title='y&#92;in f^{-1}(U)' class='latex' /> if and only if <img src='http://s0.wp.com/latex.php?latex=f%28y%29%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(y)&#92;in U' title='f(y)&#92;in U' class='latex' />. We know that <img src='http://s0.wp.com/latex.php?latex=f%28y%29%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(y)&#92;in U' title='f(y)&#92;in U' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28f%28x%29%2Cf%28y%29%29%3C%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(f(x),f(y))&lt;&#92;eta' title='d(f(x),f(y))&lt;&#92;eta' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is continuous, there exists <img src='http://s0.wp.com/latex.php?latex=%5Ctheta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;theta&gt;0' title='&#92;theta&gt;0' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=d%28f%28x%29%2Cf%28y%29%29%3C%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(f(x),f(y))&lt;&#92;eta' title='d(f(x),f(y))&lt;&#92;eta' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Ctheta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;theta' title='d(x,y)&lt;&#92;theta' class='latex' />. Therefore, setting <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%3D%5Ctheta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta=&#92;theta' title='&#92;delta=&#92;theta' class='latex' />, we are done.</p>
<p><strong>2(b)</strong> Let <img src='http://s0.wp.com/latex.php?latex=x+%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x &#92;in f^{-1}(U)' title='x &#92;in f^{-1}(U)' class='latex' />. We seek <img src='http://s0.wp.com/latex.php?latex=r+%3E+0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r &gt; 0' title='r &gt; 0' class='latex' /> such that the open ball <img src='http://s0.wp.com/latex.php?latex=B_x%28r%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_x(r)' title='B_x(r)' class='latex' /> is contained in <img src='http://s0.wp.com/latex.php?latex=f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f^{-1}(U)' title='f^{-1}(U)' class='latex' />.</p>
<p><img src='http://s0.wp.com/latex.php?latex=x+%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x &#92;in f^{-1}(U)' title='x &#92;in f^{-1}(U)' class='latex' />, so <img src='http://s0.wp.com/latex.php?latex=f%28x%29+%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(x) &#92;in U' title='f(x) &#92;in U' class='latex' />. <img src='http://s0.wp.com/latex.php?latex=U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='U' title='U' class='latex' /> is open, so we know that for some <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon+%3E+0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon &gt; 0' title='&#92;epsilon &gt; 0' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=B_%7B%5Cepsilon%7D%28f%28x%29%29+%5Csubset+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_{&#92;epsilon}(f(x)) &#92;subset U' title='B_{&#92;epsilon}(f(x)) &#92;subset U' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is continuous, there exists <img src='http://s0.wp.com/latex.php?latex=%5Cdelta+%3E+0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta &gt; 0' title='&#92;delta &gt; 0' class='latex' /> such that for all <img src='http://s0.wp.com/latex.php?latex=z+%5Cin+X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='z &#92;in X' title='z &#92;in X' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=d%28x%2C+z%29+%3C+%5Cdelta%5Cimplies+d%28f%28x%29%2C+f%28z%29%29+%3C+%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x, z) &lt; &#92;delta&#92;implies d(f(x), f(z)) &lt; &#92;epsilon' title='d(x, z) &lt; &#92;delta&#92;implies d(f(x), f(z)) &lt; &#92;epsilon' class='latex' />; i.e., <img src='http://s0.wp.com/latex.php?latex=f%28z%29+%5Cin+B_%7B%5Cepsilon%7D%28f%28x%29%29+%5Csubset+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(z) &#92;in B_{&#92;epsilon}(f(x)) &#92;subset U' title='f(z) &#92;in B_{&#92;epsilon}(f(x)) &#92;subset U' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=z+%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='z &#92;in f^{-1}(U)' title='z &#92;in f^{-1}(U)' class='latex' /> if <img src='http://s0.wp.com/latex.php?latex=d%28x%2C+z%29+%3C+%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x, z) &lt; &#92;delta' title='d(x, z) &lt; &#92;delta' class='latex' />; i.e., <img src='http://s0.wp.com/latex.php?latex=B_%7B%5Cdelta%7D%28x%29+%5Csubset+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_{&#92;delta}(x) &#92;subset f^{-1}(U)' title='B_{&#92;delta}(x) &#92;subset f^{-1}(U)' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f^{-1}(U)' title='f^{-1}(U)' class='latex' /> is open.</p>
<p><strong>2(c)</strong> Take any <img src='http://s0.wp.com/latex.php?latex=x%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in f^{-1}(U)' title='x&#92;in f^{-1}(U)' class='latex' />. We have <img src='http://s0.wp.com/latex.php?latex=f%28x%29%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(x)&#92;in U' title='f(x)&#92;in U' class='latex' />. As <img src='http://s0.wp.com/latex.php?latex=U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='U' title='U' class='latex' /> is open, there is an open ball <img src='http://s0.wp.com/latex.php?latex=B_%5Cepsilon%28f%28x%29%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_&#92;epsilon(f(x))' title='B_&#92;epsilon(f(x))' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='U' title='U' class='latex' />. Because <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is continuous, there is some <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta&gt;0' title='&#92;delta&gt;0' class='latex' /> such that for any <img src='http://s0.wp.com/latex.php?latex=y%5Cin+B_%5Cdelta%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in B_&#92;delta(x)' title='y&#92;in B_&#92;delta(x)' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=f%28y%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(y)' title='f(y)' class='latex' /> belongs to <img src='http://s0.wp.com/latex.php?latex=B_%5Cepsilon%28f%28x%29%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_&#92;epsilon(f(x))' title='B_&#92;epsilon(f(x))' class='latex' />. Hence, for such <img src='http://s0.wp.com/latex.php?latex=y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y' title='y' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=f%28y%29%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(y)&#92;in U' title='f(y)&#92;in U' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=y%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in f^{-1}(U)' title='y&#92;in f^{-1}(U)' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=B_%5Cdelta%28x%29%5Csubset+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_&#92;delta(x)&#92;subset f^{-1}(U)' title='B_&#92;delta(x)&#92;subset f^{-1}(U)' class='latex' />. We&#8217;ve proved that every point in <img src='http://s0.wp.com/latex.php?latex=f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f^{-1}(U)' title='f^{-1}(U)' class='latex' /> has an open ball neighbourhood. So <img src='http://s0.wp.com/latex.php?latex=f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f^{-1}(U)' title='f^{-1}(U)' class='latex' /> is open.</p>
<a name="pd_a_7008585"></a>
<div class="PDS_Poll" id="PDI_container7008585" data-settings="{&quot;url&quot;:&quot;http:\/\/static.polldaddy.com\/p\/7008585.js&quot;}" style="display:inline-block;"></div>
<div id="PD_superContainer"></div>
<noscript><a href="http://polldaddy.com/poll/7008585">Take Our Poll</a></noscript>
<p><strong>Problem 3.</strong> Let <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> be a complete metric space and let <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> be a closed subset of  <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' />. Then <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> is complete.</p>
<p><strong>3(a)</strong> Consider an arbitrary Cauchy sequence <img src='http://s0.wp.com/latex.php?latex=%28x_n%29_%7B%5C%7Bn%5Cin%5Cmathbb%7BN%7D%5C%7D%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(x_n)_{&#92;{n&#92;in&#92;mathbb{N}&#92;}}' title='(x_n)_{&#92;{n&#92;in&#92;mathbb{N}&#92;}}' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' />. As <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> is complete, <img src='http://s0.wp.com/latex.php?latex=%28x_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(x_n)' title='(x_n)' class='latex' /> has a limit in <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' />. Suppose <img src='http://s0.wp.com/latex.php?latex=%5Clim_%7Bn%5Cto%5Cinfty%7Dx_n%3Dx&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;lim_{n&#92;to&#92;infty}x_n=x' title='&#92;lim_{n&#92;to&#92;infty}x_n=x' class='latex' />. Because <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> is closed, <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> belongs to <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' />. We&#8217;ve proved that every Cauchy sequence in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> has a limit point in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> is complete.</p>
<p><strong>3(b)</strong> Let <img src='http://s0.wp.com/latex.php?latex=%28a_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a_n)' title='(a_n)' class='latex' /> be a Cauchy sequence in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' />. Then, since <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> is complete, we have that <img src='http://s0.wp.com/latex.php?latex=%28a_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a_n)' title='(a_n)' class='latex' /> converges. That is, there exists <img src='http://s0.wp.com/latex.php?latex=a&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a' title='a' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=a_n%5Cto+a&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a_n&#92;to a' title='a_n&#92;to a' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> is closed in <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%28a_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a_n)' title='(a_n)' class='latex' /> is a sequence in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=a_n%5Cto+a&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a_n&#92;to a' title='a_n&#92;to a' class='latex' />, we have that <img src='http://s0.wp.com/latex.php?latex=a%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a&#92;in A' title='a&#92;in A' class='latex' />. Thus <img src='http://s0.wp.com/latex.php?latex=%28a_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a_n)' title='(a_n)' class='latex' /> converges in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> and we are done.</p>
<p><strong>3(c)</strong> Let <img src='http://s0.wp.com/latex.php?latex=%28a_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a_n)' title='(a_n)' class='latex' /> be a Cauchy sequence in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' />. We want to show that <img src='http://s0.wp.com/latex.php?latex=%28a_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a_n)' title='(a_n)' class='latex' /> tends to a limit in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' />.</p>
<p>Since <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> is a subset of <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%28a_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a_n)' title='(a_n)' class='latex' /> is a Cauchy sequence in <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> is complete, <img src='http://s0.wp.com/latex.php?latex=a_n+%5Cto+a&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a_n &#92;to a' title='a_n &#92;to a' class='latex' />, for some <img src='http://s0.wp.com/latex.php?latex=a+%5Cin+X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a &#92;in X' title='a &#92;in X' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> is a closed subset of <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' />, it must contain all its limit points, so <img src='http://s0.wp.com/latex.php?latex=a+%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a &#92;in A' title='a &#92;in A' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=a_n+%5Cto+a&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a_n &#92;to a' title='a_n &#92;to a' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> is complete.</p>
<a name="pd_a_7008591"></a>
<div class="PDS_Poll" id="PDI_container7008591" data-settings="{&quot;url&quot;:&quot;http:\/\/static.polldaddy.com\/p\/7008591.js&quot;}" style="display:inline-block;"></div>
<div id="PD_superContainer"></div>
<noscript><a href="http://polldaddy.com/poll/7008591">Take Our Poll</a></noscript>
<p><strong>Problem 4.</strong> Let <img src='http://s0.wp.com/latex.php?latex=X%2C+Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X, Y' title='X, Y' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=Z&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Z' title='Z' class='latex' /> be metric spaces and let <img src='http://s0.wp.com/latex.php?latex=f%3AX%5Cto+Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f:X&#92;to Y' title='f:X&#92;to Y' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=g%3AY%5Cto+Z&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g:Y&#92;to Z' title='g:Y&#92;to Z' class='latex' /> be  continuous. Then the composition <img src='http://s0.wp.com/latex.php?latex=g%5Ccirc+f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g&#92;circ f' title='g&#92;circ f' class='latex' /> is continuous.</p>
<p><strong>4(a)</strong> Let <img src='http://s0.wp.com/latex.php?latex=x+%5Cin+X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x &#92;in X' title='x &#92;in X' class='latex' />, and let <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon+%3E+0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon &gt; 0' title='&#92;epsilon &gt; 0' class='latex' />. We need to show that there exists <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta&gt;0' title='&#92;delta&gt;0' class='latex' /> such that for all <img src='http://s0.wp.com/latex.php?latex=p%5Cin+X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p&#92;in X' title='p&#92;in X' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=d%28x%2C+p%29+%3C+%5Cdelta%5Cimplies+d%28g%5Ccirc+f%28x%29%2C+g%5Ccirc+f%28p%29%29+%3C+%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x, p) &lt; &#92;delta&#92;implies d(g&#92;circ f(x), g&#92;circ f(p)) &lt; &#92;epsilon' title='d(x, p) &lt; &#92;delta&#92;implies d(g&#92;circ f(x), g&#92;circ f(p)) &lt; &#92;epsilon' class='latex' />.</p>
<p><img src='http://s0.wp.com/latex.php?latex=g&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g' title='g' class='latex' /> is continuous, so there exists <img src='http://s0.wp.com/latex.php?latex=%5Cgamma+%3E+0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;gamma &gt; 0' title='&#92;gamma &gt; 0' class='latex' /> such that for all <img src='http://s0.wp.com/latex.php?latex=y+%5Cin+Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y &#92;in Y' title='y &#92;in Y' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=d%28y%2C+f%28x%29%29+%3C+%5Cgamma%5Cimplies+d%28g%28y%29%2C+g%5Ccirc+f%28x%29%29+%3C+%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(y, f(x)) &lt; &#92;gamma&#92;implies d(g(y), g&#92;circ f(x)) &lt; &#92;epsilon' title='d(y, f(x)) &lt; &#92;gamma&#92;implies d(g(y), g&#92;circ f(x)) &lt; &#92;epsilon' class='latex' />. <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is continuous, so there exists <img src='http://s0.wp.com/latex.php?latex=%5Cdelta+%3E+0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta &gt; 0' title='&#92;delta &gt; 0' class='latex' /> such that for all <img src='http://s0.wp.com/latex.php?latex=p%5Cin+X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p&#92;in X' title='p&#92;in X' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=d%28x%2C+p%29+%3C+%5Cdelta%5Cimplies+d%28f%28x%29%2C+f%28p%29%29+%3C+%5Cgamma&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x, p) &lt; &#92;delta&#92;implies d(f(x), f(p)) &lt; &#92;gamma' title='d(x, p) &lt; &#92;delta&#92;implies d(f(x), f(p)) &lt; &#92;gamma' class='latex' />. But then <img src='http://s0.wp.com/latex.php?latex=d%28g%5Ccirc+f%28p%29%2C+g%5Ccirc+f%28x%29%29+%3C%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(g&#92;circ f(p), g&#92;circ f(x)) &lt;&#92;epsilon' title='d(g&#92;circ f(p), g&#92;circ f(x)) &lt;&#92;epsilon' class='latex' />, as desired.  So <img src='http://s0.wp.com/latex.php?latex=g%5Ccirc+f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g&#92;circ f' title='g&#92;circ f' class='latex' /> is continuous.  </p>
<p><strong>4(b)</strong> Take an arbitrary <img src='http://s0.wp.com/latex.php?latex=x%5Cin+X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in X' title='x&#92;in X' class='latex' />. Let <img src='http://s0.wp.com/latex.php?latex=y%3Df%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y=f(x)' title='y=f(x)' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=z%3Dg%28y%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='z=g(y)' title='z=g(y)' class='latex' />. Using continuity of <img src='http://s0.wp.com/latex.php?latex=g&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g' title='g' class='latex' />, for any <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon&gt;0' title='&#92;epsilon&gt;0' class='latex' />, there is some <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon%27&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon&#039;' title='&#92;epsilon&#039;' class='latex' /> such that if <img src='http://s0.wp.com/latex.php?latex=d%28y%27%2Cy%29%3C%5Cepsilon%27&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(y&#039;,y)&lt;&#92;epsilon&#039;' title='d(y&#039;,y)&lt;&#92;epsilon&#039;' class='latex' /> (for <img src='http://s0.wp.com/latex.php?latex=y%27%5Cin+Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#039;&#92;in Y' title='y&#039;&#92;in Y' class='latex' />), then <img src='http://s0.wp.com/latex.php?latex=d%28f%28y%27%29%2Cz%29%3C%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(f(y&#039;),z)&lt;&#92;epsilon' title='d(f(y&#039;),z)&lt;&#92;epsilon' class='latex' />. As <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is continuous, there is some <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta&gt;0' title='&#92;delta&gt;0' class='latex' /> such that if <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cx%27%29%3C%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,x&#039;)&lt;&#92;delta' title='d(x,x&#039;)&lt;&#92;delta' class='latex' /> (for <img src='http://s0.wp.com/latex.php?latex=x%27%5Cin+X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#039;&#92;in X' title='x&#039;&#92;in X' class='latex' />), then <img src='http://s0.wp.com/latex.php?latex=d%28f%28x%27%29%2Cy%29%3C%5Cepsilon%27&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(f(x&#039;),y)&lt;&#92;epsilon&#039;' title='d(f(x&#039;),y)&lt;&#92;epsilon&#039;' class='latex' />. So for any <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon&gt;0' title='&#92;epsilon&gt;0' class='latex' /> we&#8217;ve found <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta&gt;0' title='&#92;delta&gt;0' class='latex' /> such that if <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cx%27%29%3C%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,x&#039;)&lt;&#92;delta' title='d(x,x&#039;)&lt;&#92;delta' class='latex' />, then <img src='http://s0.wp.com/latex.php?latex=d%28f%28x%29%2Cf%28x%27%29%29%3C%5Cepsilon%27&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(f(x),f(x&#039;))&lt;&#92;epsilon&#039;' title='d(f(x),f(x&#039;))&lt;&#92;epsilon&#039;' class='latex' /> and therefore <img src='http://s0.wp.com/latex.php?latex=d%28g%5Ccirc+f%28x%29%2Cg%5Ccirc+f%28x%27%29%29%3C%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(g&#92;circ f(x),g&#92;circ f(x&#039;))&lt;&#92;epsilon' title='d(g&#92;circ f(x),g&#92;circ f(x&#039;))&lt;&#92;epsilon' class='latex' />. Hence <img src='http://s0.wp.com/latex.php?latex=g%5Ccirc+f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g&#92;circ f' title='g&#92;circ f' class='latex' /> is continuous.</p>
<p><strong>4(c)</strong> Take <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon&gt;0' title='&#92;epsilon&gt;0' class='latex' />. We would like to find <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta&gt;0' title='&#92;delta&gt;0' class='latex' /> s.t. <img src='http://s0.wp.com/latex.php?latex=d%28g%28f%28x%29%29%2Cg%28f%28y%29%29%29%3C%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(g(f(x)),g(f(y)))&lt;&#92;epsilon' title='d(g(f(x)),g(f(y)))&lt;&#92;epsilon' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;delta' title='d(x,y)&lt;&#92;delta' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=g&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g' title='g' class='latex' /> is continuous, there exists <img src='http://s0.wp.com/latex.php?latex=%5Ceta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;eta&gt;0' title='&#92;eta&gt;0' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=d%28g%28f%28x%29%29%2Cg%28f%28y%29%29%29%3C%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(g(f(x)),g(f(y)))&lt;&#92;epsilon' title='d(g(f(x)),g(f(y)))&lt;&#92;epsilon' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28f%28x%29%2Cf%28y%29%29%3C%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(f(x),f(y))&lt;&#92;eta' title='d(f(x),f(y))&lt;&#92;eta' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is continuous, there exists <img src='http://s0.wp.com/latex.php?latex=%5Ctheta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;theta&gt;0' title='&#92;theta&gt;0' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=d%28f%28x%29%2Cf%28y%29%29%3C%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(f(x),f(y))&lt;&#92;eta' title='d(f(x),f(y))&lt;&#92;eta' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Ctheta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;theta' title='d(x,y)&lt;&#92;theta' class='latex' />. Therefore, setting <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%3D%5Ctheta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta=&#92;theta' title='&#92;delta=&#92;theta' class='latex' />, we are done.</p>
<a name="pd_a_7008598"></a>
<div class="PDS_Poll" id="PDI_container7008598" data-settings="{&quot;url&quot;:&quot;http:\/\/static.polldaddy.com\/p\/7008598.js&quot;}" style="display:inline-block;"></div>
<div id="PD_superContainer"></div>
<noscript><a href="http://polldaddy.com/poll/7008598">Take Our Poll</a></noscript>
<p><strong>Problem 5.</strong>  Let <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Y' title='Y' class='latex' /> be sets, let <img src='http://s0.wp.com/latex.php?latex=f%3AX%5Cto+Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f:X&#92;to Y' title='f:X&#92;to Y' class='latex' /> be an injection and let <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' /> be  subsets of <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' />. Then <img src='http://s0.wp.com/latex.php?latex=f%28A%29%5Ccap+f%28B%29+%5Csubset+f%28A%5Ccap+B%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(A)&#92;cap f(B) &#92;subset f(A&#92;cap B)' title='f(A)&#92;cap f(B) &#92;subset f(A&#92;cap B)' class='latex' />.</p>
<p><strong>5(a)</strong> Take <img src='http://s0.wp.com/latex.php?latex=x%5Cin+f%28A%29%5Ccap+f%28B%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in f(A)&#92;cap f(B)' title='x&#92;in f(A)&#92;cap f(B)' class='latex' />. So there is some <img src='http://s0.wp.com/latex.php?latex=y%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in A' title='y&#92;in A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=z%5Cin+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='z&#92;in B' title='z&#92;in B' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=f%28y%29%3Df%28z%29%3Dx&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(y)=f(z)=x' title='f(y)=f(z)=x' class='latex' />. As <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is injective, <img src='http://s0.wp.com/latex.php?latex=y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y' title='y' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=z&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='z' title='z' class='latex' /> are equal. So <img src='http://s0.wp.com/latex.php?latex=y%5Cin+A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in A&#92;cap B' title='y&#92;in A&#92;cap B' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=x%3Df%28y%29%5Cin+f%28A%5Ccap+B%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x=f(y)&#92;in f(A&#92;cap B)' title='x=f(y)&#92;in f(A&#92;cap B)' class='latex' />.</p>
<p><strong>5(b)</strong> Suppose <img src='http://s0.wp.com/latex.php?latex=y+%5Cin+f%28A%29+%5Ccap+f%28B%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y &#92;in f(A) &#92;cap f(B)' title='y &#92;in f(A) &#92;cap f(B)' class='latex' />.  Then, for some <img src='http://s0.wp.com/latex.php?latex=a+%5Cin+A%2C+b+%5Cin+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a &#92;in A, b &#92;in B' title='a &#92;in A, b &#92;in B' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=y+%3D+f%28a%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y = f(a)' title='y = f(a)' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=y+%3D+f%28b%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y = f(b)' title='y = f(b)' class='latex' />.  So <img src='http://s0.wp.com/latex.php?latex=f%28a%29+%3D+f%28b%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(a) = f(b)' title='f(a) = f(b)' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is injective, <img src='http://s0.wp.com/latex.php?latex=a+%3D+b&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a = b' title='a = b' class='latex' />, so <img src='http://s0.wp.com/latex.php?latex=a+%5Cin+A+%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a &#92;in A &#92;cap B' title='a &#92;in A &#92;cap B' class='latex' />, so <img src='http://s0.wp.com/latex.php?latex=y+%5Cin+f%28A+%5Ccap+B%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y &#92;in f(A &#92;cap B)' title='y &#92;in f(A &#92;cap B)' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=f%28A%29+%5Ccap+f%28B%29+%5Csubset+f%28A+%5Ccap+B%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(A) &#92;cap f(B) &#92;subset f(A &#92;cap B)' title='f(A) &#92;cap f(B) &#92;subset f(A &#92;cap B)' class='latex' />.  </p>
<p><strong>5(c)</strong> Let <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> be an element of <img src='http://s0.wp.com/latex.php?latex=f%28A%29%5Ccap+f%28B%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(A)&#92;cap f(B)' title='f(A)&#92;cap f(B)' class='latex' />. Then <img src='http://s0.wp.com/latex.php?latex=x%5Cin+f%28A%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in f(A)' title='x&#92;in f(A)' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=x%5Cin+f%28B%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in f(B)' title='x&#92;in f(B)' class='latex' />. That is, there exists <img src='http://s0.wp.com/latex.php?latex=y%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in A' title='y&#92;in A' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=f%28y%29%3Dx&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(y)=x' title='f(y)=x' class='latex' /> and there exists <img src='http://s0.wp.com/latex.php?latex=z%5Cin+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='z&#92;in B' title='z&#92;in B' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=f%28z%29%3Dx&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(z)=x' title='f(z)=x' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is an injection, <img src='http://s0.wp.com/latex.php?latex=f%28y%29%3Dx&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(y)=x' title='f(y)=x' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=f%28z%29%3Dx&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(z)=x' title='f(z)=x' class='latex' />, we have that <img src='http://s0.wp.com/latex.php?latex=y%3Dz&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y=z' title='y=z' class='latex' />. We would like to find <img src='http://s0.wp.com/latex.php?latex=u%5Cin+A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u&#92;in A&#92;cap B' title='u&#92;in A&#92;cap B' class='latex' /> s.t. <img src='http://s0.wp.com/latex.php?latex=f%28u%29%3Dx&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(u)=x' title='f(u)=x' class='latex' />. But <img src='http://s0.wp.com/latex.php?latex=u%5Cin+A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u&#92;in A&#92;cap B' title='u&#92;in A&#92;cap B' class='latex' /> if and only if <img src='http://s0.wp.com/latex.php?latex=u%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u&#92;in A' title='u&#92;in A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=u%5Cin+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u&#92;in B' title='u&#92;in B' class='latex' />. Therefore, setting <img src='http://s0.wp.com/latex.php?latex=u%3Dy&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u=y' title='u=y' class='latex' />, we are done.</p>
<a name="pd_a_7008605"></a>
<div class="PDS_Poll" id="PDI_container7008605" data-settings="{&quot;url&quot;:&quot;http:\/\/static.polldaddy.com\/p\/7008605.js&quot;}" style="display:inline-block;"></div>
<div id="PD_superContainer"></div>
<noscript><a href="http://polldaddy.com/poll/7008605">Take Our Poll</a></noscript>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4902/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4902/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4902&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2013/04/02/a-second-experiment-concerning-mathematical-writing/feed/</wfw:commentRss>
		<slash:comments>47</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>Another test</title>
		<link>http://gowers.wordpress.com/2013/04/02/another-test/</link>
		<comments>http://gowers.wordpress.com/2013/04/02/another-test/#comments</comments>
		<pubDate>Tue, 02 Apr 2013 12:37:22 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4896</guid>
		<description><![CDATA[This time I want to test whether I can have polls where the results are not visible until the poll closes. So if you have a few seconds to vote, that would be very helpful. If the facility works, then my next post will include some secret ballots.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4896&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>This time I want to test whether I can have polls where the results are not visible until the poll closes. So if you have a few seconds to vote, that would be very helpful. If the facility works, then my next post will include some secret ballots.</p>
<a name="pd_a_7008553"></a>
<div class="PDS_Poll" id="PDI_container7008553" data-settings="{&quot;url&quot;:&quot;http:\/\/static.polldaddy.com\/p\/7008553.js&quot;}" style="display:inline-block;"></div>
<div id="PD_superContainer"></div>
<noscript><a href="http://polldaddy.com/poll/7008553">Take Our Poll</a></noscript>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4896/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4896/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4896&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2013/04/02/another-test/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>An experiment concerning mathematical writing</title>
		<link>http://gowers.wordpress.com/2013/03/25/an-experiment-concerning-mathematical-writing/</link>
		<comments>http://gowers.wordpress.com/2013/03/25/an-experiment-concerning-mathematical-writing/#comments</comments>
		<pubDate>Mon, 25 Mar 2013 07:30:17 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4861</guid>
		<description><![CDATA[Update: comments on this post are now closed, since my latest post would compromise any further contributions to the experiment. Most of this post consists of write-ups of proofs of five simple propositions about metric spaces. There are three write-ups per proof, and I would be very grateful for any comments that you might have. [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4861&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><strong>Update: comments on this post are now closed, since my latest post would compromise any further contributions to the experiment.</strong></p>
<p>Most of this post consists of write-ups of proofs of five simple propositions about metric spaces. There are three write-ups per proof, and I would be very grateful for any comments that you might have. If you would like to participate in the experiment, then please state your level of mathematical experience (the main thing I need to know is whether you yourself have studied the basic theory of metric spaces) and then make any comments/observations you wish on the write-ups. The more you say, the more useful it will be (within reason). I am particularly interested in comparisons and preferences. For each proof, the order of the three write-ups has been chosen randomly and independently.</p>
<p>It would also be useful if you could rate each of the 15 write-ups for clarity and style. So that everyone rates in the same way, I suggest the following rating systems.</p>
<p>Clarity.</p>
<p>-2 very hard to understand<br />
-1 hard to understand<br />
0 neither particularly hard nor particularly easy<br />
1 easy to understand<br />
2 very easy to understand</p>
<p>Style.</p>
<p>-2 very badly written<br />
-1 badly written<br />
0 neither badly written nor well written<br />
1 well written<br />
2 very well written</p>
<p>I stress that ratings should not be regarded as a substitute for comments and observations, or vice versa. What I really need is <em>both</em> comments <em>and</em> numerical ratings. </p>
<p>I do not want people to be influenced by the answers that other people give, so all comments on this post will go to my moderation queue. When I have enough data for the experiment, probably in a week or so, I will publish all the comments (unless for some reason you specifically request that your comment should not be published).</p>
<p>The more people who participate, the more reliable the results of the experiment will be. I realize that it may take a little time, so thank you very much in advance to everybody who agrees to help. (Update 26th March: I now have over 30 responses; they have been very helpful indeed, so I am extremely grateful for those. If they keep coming in at a similar rate over the next few days it will be wonderful.)<br />
<span id="more-4861"></span></p>
<h3>The write-ups.</h3>
<p><strong>Problem 1.</strong> Let <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' /> be open sets in a metric space. Then <img src='http://s0.wp.com/latex.php?latex=A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A&#92;cap B' title='A&#92;cap B' class='latex' /> is open.</p>
<p><strong>1(a)</strong> We want to show that for all <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=A+%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A &#92;cap B' title='A &#92;cap B' class='latex' />, there exists <img src='http://s0.wp.com/latex.php?latex=r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r' title='r' class='latex' /> such that the open ball <img src='http://s0.wp.com/latex.php?latex=B_x%28r%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_x(r)' title='B_x(r)' class='latex' /> is contained in <img src='http://s0.wp.com/latex.php?latex=A+%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A &#92;cap B' title='A &#92;cap B' class='latex' />.</p>
<p>Let <img src='http://s0.wp.com/latex.php?latex=x+%5Cin+A+%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x &#92;in A &#92;cap B' title='x &#92;in A &#92;cap B' class='latex' />. <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> is in <img src='http://s0.wp.com/latex.php?latex=A+%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A &#92;cap B' title='A &#92;cap B' class='latex' />, so <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> is in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> is in <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' /> are open, there exist <img src='http://s0.wp.com/latex.php?latex=a%2C+b&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a, b' title='a, b' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=B_a%28x%29+%5Csubset+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_a(x) &#92;subset A' title='B_a(x) &#92;subset A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=B_b%28x%29+%5Csubset+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_b(x) &#92;subset B' title='B_b(x) &#92;subset B' class='latex' />. Let <img src='http://s0.wp.com/latex.php?latex=r+%3D+%5Cmin%5C%7Ba%2C+b%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r = &#92;min&#92;{a, b&#92;}' title='r = &#92;min&#92;{a, b&#92;}' class='latex' />. Then <img src='http://s0.wp.com/latex.php?latex=B_r%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_r(x)' title='B_r(x)' class='latex' /> is contained in <img src='http://s0.wp.com/latex.php?latex=B_a%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_a(x)' title='B_a(x)' class='latex' />, so it&#8217;s contained in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' />. Similarly, <img src='http://s0.wp.com/latex.php?latex=B_r%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_r(x)' title='B_r(x)' class='latex' /> is contained in <img src='http://s0.wp.com/latex.php?latex=B_b%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_b(x)' title='B_b(x)' class='latex' />, so it&#8217;s contained in <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=B_r%28x%29+%5Csubset+A+%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_r(x) &#92;subset A &#92;cap B' title='B_r(x) &#92;subset A &#92;cap B' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=A+%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A &#92;cap B' title='A &#92;cap B' class='latex' /> is open.</p>
<p><strong>1(b)</strong> For arbitrary <img src='http://s0.wp.com/latex.php?latex=x%5Cin+X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in X' title='x&#92;in X' class='latex' />, let <img src='http://s0.wp.com/latex.php?latex=B_r%28x%29%3A%3D%5C%7By%5Cin+X%7Cd%28y%2Cx%29%3Cr%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_r(x):=&#92;{y&#92;in X|d(y,x)&lt;r&#92;}' title='B_r(x):=&#92;{y&#92;in X|d(y,x)&lt;r&#92;}' class='latex' />. Consider an arbitrary <img src='http://s0.wp.com/latex.php?latex=x%5Cin+A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in A&#92;cap B' title='x&#92;in A&#92;cap B' class='latex' />. As <img src='http://s0.wp.com/latex.php?latex=A%2CB&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A,B' title='A,B' class='latex' /> are open there are <img src='http://s0.wp.com/latex.php?latex=r%2Cr%27%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r,r&#039;&gt;0' title='r,r&#039;&gt;0' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=B_r%28x%29%5Csubset+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_r(x)&#92;subset A' title='B_r(x)&#92;subset A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=B_%7Br%27%7D%28x%29%5Csubset+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_{r&#039;}(x)&#92;subset B' title='B_{r&#039;}(x)&#92;subset B' class='latex' />. Take <img src='http://s0.wp.com/latex.php?latex=R%3D%5Cmin%28r%2Cr%27%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='R=&#92;min(r,r&#039;)' title='R=&#92;min(r,r&#039;)' class='latex' />. Then <img src='http://s0.wp.com/latex.php?latex=B_R%28x%29%5Csubset+B_r%28x%29%5Csubset+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_R(x)&#92;subset B_r(x)&#92;subset A' title='B_R(x)&#92;subset B_r(x)&#92;subset A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=B_R%28x%29%5Csubset+B_%7Br%27%7D%28x%29%5Csubset+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_R(x)&#92;subset B_{r&#039;}(x)&#92;subset B' title='B_R(x)&#92;subset B_{r&#039;}(x)&#92;subset B' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=B_R%28x%29%5Csubset+A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_R(x)&#92;subset A&#92;cap B' title='B_R(x)&#92;subset A&#92;cap B' class='latex' />. We&#8217;ve proved that for any <img src='http://s0.wp.com/latex.php?latex=x%5Cin+A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in A&#92;cap B' title='x&#92;in A&#92;cap B' class='latex' /> there is an open ball (<img src='http://s0.wp.com/latex.php?latex=B_R%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_R(x)' title='B_R(x)' class='latex' /> in this case) that contains <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> and is inside <img src='http://s0.wp.com/latex.php?latex=A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A&#92;cap B' title='A&#92;cap B' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A&#92;cap B' title='A&#92;cap B' class='latex' /> is open.</p>
<p><strong>1(c)</strong> Let <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> be an element of <img src='http://s0.wp.com/latex.php?latex=A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A&#92;cap B' title='A&#92;cap B' class='latex' />. Then <img src='http://s0.wp.com/latex.php?latex=x%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in A' title='x&#92;in A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=x%5Cin+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in B' title='x&#92;in B' class='latex' />. Therefore, since <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> is open, there exists <img src='http://s0.wp.com/latex.php?latex=%5Ceta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;eta&gt;0' title='&#92;eta&gt;0' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=u%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u&#92;in A' title='u&#92;in A' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cu%29%3C%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,u)&lt;&#92;eta' title='d(x,u)&lt;&#92;eta' class='latex' /> and since <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' /> is open, there exists <img src='http://s0.wp.com/latex.php?latex=%5Ctheta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;theta&gt;0' title='&#92;theta&gt;0' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=v%5Cin+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v&#92;in B' title='v&#92;in B' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cv%29%3C%5Ctheta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,v)&lt;&#92;theta' title='d(x,v)&lt;&#92;theta' class='latex' />. We would like to find <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta&gt;0' title='&#92;delta&gt;0' class='latex' /> s.t. <img src='http://s0.wp.com/latex.php?latex=y%5Cin+A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in A&#92;cap B' title='y&#92;in A&#92;cap B' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;delta' title='d(x,y)&lt;&#92;delta' class='latex' />. But <img src='http://s0.wp.com/latex.php?latex=y%5Cin+A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in A&#92;cap B' title='y&#92;in A&#92;cap B' class='latex' /> if and only if <img src='http://s0.wp.com/latex.php?latex=y%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in A' title='y&#92;in A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=y%5Cin+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in B' title='y&#92;in B' class='latex' />. We know that <img src='http://s0.wp.com/latex.php?latex=y%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in A' title='y&#92;in A' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;eta' title='d(x,y)&lt;&#92;eta' class='latex' /> and that <img src='http://s0.wp.com/latex.php?latex=y%5Cin+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in B' title='y&#92;in B' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Ctheta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;theta' title='d(x,y)&lt;&#92;theta' class='latex' />. Assume now that <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;delta' title='d(x,y)&lt;&#92;delta' class='latex' />. Then <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;eta' title='d(x,y)&lt;&#92;eta' class='latex' /> if <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%5Cleq%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta&#92;leq&#92;eta' title='&#92;delta&#92;leq&#92;eta' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Ctheta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;theta' title='d(x,y)&lt;&#92;theta' class='latex' /> if <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%5Cleq%5Ctheta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta&#92;leq&#92;theta' title='&#92;delta&#92;leq&#92;theta' class='latex' />. We may therefore take <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%3D%5Cmin%5C%7B%5Ceta%2C%5Ctheta%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta=&#92;min&#92;{&#92;eta,&#92;theta&#92;}' title='&#92;delta=&#92;min&#92;{&#92;eta,&#92;theta&#92;}' class='latex' /> and we are done.</p>
<hr />
<p><strong>Problem 2.</strong> Let <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Y' title='Y' class='latex' /> be metric spaces, let <img src='http://s0.wp.com/latex.php?latex=f%3AX-%3EY&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f:X-&gt;Y' title='f:X-&gt;Y' class='latex' /> be continuous, and let <img src='http://s0.wp.com/latex.php?latex=U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='U' title='U' class='latex' />  be an open subset of <img src='http://s0.wp.com/latex.php?latex=Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Y' title='Y' class='latex' />. Then <img src='http://s0.wp.com/latex.php?latex=f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f^{-1}(U)' title='f^{-1}(U)' class='latex' /> is an open subset of <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' />.</p>
<p><strong>2(a)</strong> Let <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> be an element of <img src='http://s0.wp.com/latex.php?latex=f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f^{-1}(U)' title='f^{-1}(U)' class='latex' />. Then <img src='http://s0.wp.com/latex.php?latex=f%28x%29%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(x)&#92;in U' title='f(x)&#92;in U' class='latex' />. Therefore, since <img src='http://s0.wp.com/latex.php?latex=U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='U' title='U' class='latex' /> is open, there exists <img src='http://s0.wp.com/latex.php?latex=%5Ceta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;eta&gt;0' title='&#92;eta&gt;0' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=u%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u&#92;in U' title='u&#92;in U' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28f%28x%29%2Cu%29%3C%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(f(x),u)&lt;&#92;eta' title='d(f(x),u)&lt;&#92;eta' class='latex' />. We would like to find <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta&gt;0' title='&#92;delta&gt;0' class='latex' /> s.t. <img src='http://s0.wp.com/latex.php?latex=y%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in f^{-1}(U)' title='y&#92;in f^{-1}(U)' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;delta' title='d(x,y)&lt;&#92;delta' class='latex' />. But <img src='http://s0.wp.com/latex.php?latex=y%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in f^{-1}(U)' title='y&#92;in f^{-1}(U)' class='latex' /> if and only if <img src='http://s0.wp.com/latex.php?latex=f%28y%29%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(y)&#92;in U' title='f(y)&#92;in U' class='latex' />. We know that <img src='http://s0.wp.com/latex.php?latex=f%28y%29%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(y)&#92;in U' title='f(y)&#92;in U' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28f%28x%29%2Cf%28y%29%29%3C%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(f(x),f(y))&lt;&#92;eta' title='d(f(x),f(y))&lt;&#92;eta' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is continuous, there exists <img src='http://s0.wp.com/latex.php?latex=%5Ctheta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;theta&gt;0' title='&#92;theta&gt;0' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=d%28f%28x%29%2Cf%28y%29%29%3C%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(f(x),f(y))&lt;&#92;eta' title='d(f(x),f(y))&lt;&#92;eta' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Ctheta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;theta' title='d(x,y)&lt;&#92;theta' class='latex' />. Therefore, setting <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%3D%5Ctheta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta=&#92;theta' title='&#92;delta=&#92;theta' class='latex' />, we are done.</p>
<p><strong>2(b)</strong> Let <img src='http://s0.wp.com/latex.php?latex=x+%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x &#92;in f^{-1}(U)' title='x &#92;in f^{-1}(U)' class='latex' />. We seek <img src='http://s0.wp.com/latex.php?latex=r+%3E+0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r &gt; 0' title='r &gt; 0' class='latex' /> such that the open ball <img src='http://s0.wp.com/latex.php?latex=B_x%28r%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_x(r)' title='B_x(r)' class='latex' /> is contained in <img src='http://s0.wp.com/latex.php?latex=f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f^{-1}(U)' title='f^{-1}(U)' class='latex' />.</p>
<p><img src='http://s0.wp.com/latex.php?latex=x+%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x &#92;in f^{-1}(U)' title='x &#92;in f^{-1}(U)' class='latex' />, so <img src='http://s0.wp.com/latex.php?latex=f%28x%29+%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(x) &#92;in U' title='f(x) &#92;in U' class='latex' />. <img src='http://s0.wp.com/latex.php?latex=U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='U' title='U' class='latex' /> is open, so we know that for some <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon+%3E+0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon &gt; 0' title='&#92;epsilon &gt; 0' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=B_%7B%5Cepsilon%7D%28f%28x%29%29+%5Csubset+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_{&#92;epsilon}(f(x)) &#92;subset U' title='B_{&#92;epsilon}(f(x)) &#92;subset U' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is continuous, there exists <img src='http://s0.wp.com/latex.php?latex=%5Cdelta+%3E+0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta &gt; 0' title='&#92;delta &gt; 0' class='latex' /> such that for all <img src='http://s0.wp.com/latex.php?latex=z+%5Cin+X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='z &#92;in X' title='z &#92;in X' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=d%28x%2C+z%29+%3C+%5Cdelta%5Cimplies+d%28f%28x%29%2C+f%28z%29%29+%3C+%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x, z) &lt; &#92;delta&#92;implies d(f(x), f(z)) &lt; &#92;epsilon' title='d(x, z) &lt; &#92;delta&#92;implies d(f(x), f(z)) &lt; &#92;epsilon' class='latex' />; i.e., <img src='http://s0.wp.com/latex.php?latex=f%28z%29+%5Cin+B_%7B%5Cepsilon%7D%28f%28x%29%29+%5Csubset+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(z) &#92;in B_{&#92;epsilon}(f(x)) &#92;subset U' title='f(z) &#92;in B_{&#92;epsilon}(f(x)) &#92;subset U' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=z+%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='z &#92;in f^{-1}(U)' title='z &#92;in f^{-1}(U)' class='latex' /> if <img src='http://s0.wp.com/latex.php?latex=d%28x%2C+z%29+%3C+%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x, z) &lt; &#92;delta' title='d(x, z) &lt; &#92;delta' class='latex' />; i.e., <img src='http://s0.wp.com/latex.php?latex=B_%7B%5Cdelta%7D%28x%29+%5Csubset+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_{&#92;delta}(x) &#92;subset f^{-1}(U)' title='B_{&#92;delta}(x) &#92;subset f^{-1}(U)' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f^{-1}(U)' title='f^{-1}(U)' class='latex' /> is open.</p>
<p><strong>2(c)</strong> Take any <img src='http://s0.wp.com/latex.php?latex=x%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in f^{-1}(U)' title='x&#92;in f^{-1}(U)' class='latex' />. We have <img src='http://s0.wp.com/latex.php?latex=f%28x%29%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(x)&#92;in U' title='f(x)&#92;in U' class='latex' />. As <img src='http://s0.wp.com/latex.php?latex=U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='U' title='U' class='latex' /> is open, there is an open ball <img src='http://s0.wp.com/latex.php?latex=B_%5Cepsilon%28f%28x%29%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_&#92;epsilon(f(x))' title='B_&#92;epsilon(f(x))' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='U' title='U' class='latex' />. Because <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is continuous, there is some <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta&gt;0' title='&#92;delta&gt;0' class='latex' /> such that for any <img src='http://s0.wp.com/latex.php?latex=y%5Cin+B_%5Cdelta%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in B_&#92;delta(x)' title='y&#92;in B_&#92;delta(x)' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=f%28y%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(y)' title='f(y)' class='latex' /> belongs to <img src='http://s0.wp.com/latex.php?latex=B_%5Cepsilon%28f%28x%29%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_&#92;epsilon(f(x))' title='B_&#92;epsilon(f(x))' class='latex' />. Hence, for such <img src='http://s0.wp.com/latex.php?latex=y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y' title='y' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=f%28y%29%5Cin+U&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(y)&#92;in U' title='f(y)&#92;in U' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=y%5Cin+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in f^{-1}(U)' title='y&#92;in f^{-1}(U)' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=B_%5Cdelta%28x%29%5Csubset+f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B_&#92;delta(x)&#92;subset f^{-1}(U)' title='B_&#92;delta(x)&#92;subset f^{-1}(U)' class='latex' />. We&#8217;ve proved that every point in <img src='http://s0.wp.com/latex.php?latex=f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f^{-1}(U)' title='f^{-1}(U)' class='latex' /> has an open ball neighbourhood. So <img src='http://s0.wp.com/latex.php?latex=f%5E%7B-1%7D%28U%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f^{-1}(U)' title='f^{-1}(U)' class='latex' /> is open.</p>
<hr />
<p><strong>Problem 3.</strong> Let <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> be a complete metric space and let <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> be a closed subset of  <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' />. Then <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> is complete.</p>
<p><strong>3(a)</strong> Consider an arbitrary Cauchy sequence <img src='http://s0.wp.com/latex.php?latex=%28x_n%29_%7B%5C%7Bn%5Cin%5Cmathbb%7BN%7D%5C%7D%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(x_n)_{&#92;{n&#92;in&#92;mathbb{N}&#92;}}' title='(x_n)_{&#92;{n&#92;in&#92;mathbb{N}&#92;}}' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' />. As <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> is complete, <img src='http://s0.wp.com/latex.php?latex=%28x_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(x_n)' title='(x_n)' class='latex' /> has a limit in <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' />. Suppose <img src='http://s0.wp.com/latex.php?latex=%5Clim_%7Bn%5Cto%5Cinfty%7Dx_n%3Dx&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;lim_{n&#92;to&#92;infty}x_n=x' title='&#92;lim_{n&#92;to&#92;infty}x_n=x' class='latex' />. Because <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> is closed, <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> belongs to <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' />. We&#8217;ve proved that every Cauchy sequence in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> has a limit point in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> is complete.</p>
<p><strong>3(b)</strong> Let <img src='http://s0.wp.com/latex.php?latex=%28a_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a_n)' title='(a_n)' class='latex' /> be a Cauchy sequence in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' />. Then, since <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> is complete, we have that <img src='http://s0.wp.com/latex.php?latex=%28a_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a_n)' title='(a_n)' class='latex' /> converges. That is, there exists <img src='http://s0.wp.com/latex.php?latex=a&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a' title='a' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=a_n%5Cto+a&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a_n&#92;to a' title='a_n&#92;to a' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> is closed in <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%28a_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a_n)' title='(a_n)' class='latex' /> is a sequence in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=a_n%5Cto+a&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a_n&#92;to a' title='a_n&#92;to a' class='latex' />, we have that <img src='http://s0.wp.com/latex.php?latex=a%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a&#92;in A' title='a&#92;in A' class='latex' />. Thus <img src='http://s0.wp.com/latex.php?latex=%28a_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a_n)' title='(a_n)' class='latex' /> converges in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> and we are done.</p>
<p><strong>3(c)</strong> Let <img src='http://s0.wp.com/latex.php?latex=%28a_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a_n)' title='(a_n)' class='latex' /> be a Cauchy sequence in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' />. We want to show that <img src='http://s0.wp.com/latex.php?latex=%28a_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a_n)' title='(a_n)' class='latex' /> tends to a limit in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' />.</p>
<p>Since <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> is a subset of <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%28a_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a_n)' title='(a_n)' class='latex' /> is a Cauchy sequence in <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> is complete, <img src='http://s0.wp.com/latex.php?latex=a_n+%5Cto+a&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a_n &#92;to a' title='a_n &#92;to a' class='latex' />, for some <img src='http://s0.wp.com/latex.php?latex=a+%5Cin+X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a &#92;in X' title='a &#92;in X' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> is a closed subset of <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' />, it must contain all its limit points, so <img src='http://s0.wp.com/latex.php?latex=a+%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a &#92;in A' title='a &#92;in A' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=a_n+%5Cto+a&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a_n &#92;to a' title='a_n &#92;to a' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> is complete.</p>
<hr />
<p><strong>Problem 4.</strong> Let <img src='http://s0.wp.com/latex.php?latex=X%2C+Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X, Y' title='X, Y' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=Z&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Z' title='Z' class='latex' /> be metric spaces and let <img src='http://s0.wp.com/latex.php?latex=f%3AX%5Cto+Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f:X&#92;to Y' title='f:X&#92;to Y' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=g%3AY%5Cto+Z&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g:Y&#92;to Z' title='g:Y&#92;to Z' class='latex' /> be  continuous. Then the composition <img src='http://s0.wp.com/latex.php?latex=g%5Ccirc+f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g&#92;circ f' title='g&#92;circ f' class='latex' /> is continuous.</p>
<p><strong>4(a)</strong> Let <img src='http://s0.wp.com/latex.php?latex=x+%5Cin+X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x &#92;in X' title='x &#92;in X' class='latex' />, and let <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon+%3E+0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon &gt; 0' title='&#92;epsilon &gt; 0' class='latex' />. We need to show that there exists <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta&gt;0' title='&#92;delta&gt;0' class='latex' /> such that for all <img src='http://s0.wp.com/latex.php?latex=p%5Cin+X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p&#92;in X' title='p&#92;in X' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=d%28x%2C+p%29+%3C+%5Cdelta%5Cimplies+d%28g%5Ccirc+f%28x%29%2C+g%5Ccirc+f%28p%29%29+%3C+%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x, p) &lt; &#92;delta&#92;implies d(g&#92;circ f(x), g&#92;circ f(p)) &lt; &#92;epsilon' title='d(x, p) &lt; &#92;delta&#92;implies d(g&#92;circ f(x), g&#92;circ f(p)) &lt; &#92;epsilon' class='latex' />.</p>
<p><img src='http://s0.wp.com/latex.php?latex=g&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g' title='g' class='latex' /> is continuous, so there exists <img src='http://s0.wp.com/latex.php?latex=%5Cgamma+%3E+0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;gamma &gt; 0' title='&#92;gamma &gt; 0' class='latex' /> such that for all <img src='http://s0.wp.com/latex.php?latex=y+%5Cin+Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y &#92;in Y' title='y &#92;in Y' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=d%28y%2C+f%28x%29%29+%3C+%5Cgamma%5Cimplies+d%28g%28y%29%2C+g%5Ccirc+f%28x%29%29+%3C+%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(y, f(x)) &lt; &#92;gamma&#92;implies d(g(y), g&#92;circ f(x)) &lt; &#92;epsilon' title='d(y, f(x)) &lt; &#92;gamma&#92;implies d(g(y), g&#92;circ f(x)) &lt; &#92;epsilon' class='latex' />. <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is continuous, so there exists <img src='http://s0.wp.com/latex.php?latex=%5Cdelta+%3E+0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta &gt; 0' title='&#92;delta &gt; 0' class='latex' /> such that for all <img src='http://s0.wp.com/latex.php?latex=p%5Cin+X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p&#92;in X' title='p&#92;in X' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=d%28x%2C+p%29+%3C+%5Cdelta%5Cimplies+d%28f%28x%29%2C+f%28p%29%29+%3C+%5Cgamma&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x, p) &lt; &#92;delta&#92;implies d(f(x), f(p)) &lt; &#92;gamma' title='d(x, p) &lt; &#92;delta&#92;implies d(f(x), f(p)) &lt; &#92;gamma' class='latex' />. But then <img src='http://s0.wp.com/latex.php?latex=d%28g%5Ccirc+f%28p%29%2C+g%5Ccirc+f%28x%29%29+%3C%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(g&#92;circ f(p), g&#92;circ f(x)) &lt;&#92;epsilon' title='d(g&#92;circ f(p), g&#92;circ f(x)) &lt;&#92;epsilon' class='latex' />, as desired.  So <img src='http://s0.wp.com/latex.php?latex=g%5Ccirc+f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g&#92;circ f' title='g&#92;circ f' class='latex' /> is continuous.  </p>
<p><strong>4(b)</strong> Take an arbitrary <img src='http://s0.wp.com/latex.php?latex=x%5Cin+X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in X' title='x&#92;in X' class='latex' />. Let <img src='http://s0.wp.com/latex.php?latex=y%3Df%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y=f(x)' title='y=f(x)' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=z%3Dg%28y%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='z=g(y)' title='z=g(y)' class='latex' />. Using continuity of <img src='http://s0.wp.com/latex.php?latex=g&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g' title='g' class='latex' />, for any <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon&gt;0' title='&#92;epsilon&gt;0' class='latex' />, there is some <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon%27&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon&#039;' title='&#92;epsilon&#039;' class='latex' /> such that if <img src='http://s0.wp.com/latex.php?latex=d%28y%27%2Cy%29%3C%5Cepsilon%27&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(y&#039;,y)&lt;&#92;epsilon&#039;' title='d(y&#039;,y)&lt;&#92;epsilon&#039;' class='latex' /> (for <img src='http://s0.wp.com/latex.php?latex=y%27%5Cin+Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#039;&#92;in Y' title='y&#039;&#92;in Y' class='latex' />), then <img src='http://s0.wp.com/latex.php?latex=d%28f%28y%27%29%2Cz%29%3C%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(f(y&#039;),z)&lt;&#92;epsilon' title='d(f(y&#039;),z)&lt;&#92;epsilon' class='latex' />. As <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is continuous, there is some <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta&gt;0' title='&#92;delta&gt;0' class='latex' /> such that if <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cx%27%29%3C%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,x&#039;)&lt;&#92;delta' title='d(x,x&#039;)&lt;&#92;delta' class='latex' /> (for <img src='http://s0.wp.com/latex.php?latex=x%27%5Cin+X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#039;&#92;in X' title='x&#039;&#92;in X' class='latex' />), then <img src='http://s0.wp.com/latex.php?latex=d%28f%28x%27%29%2Cy%29%3C%5Cepsilon%27&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(f(x&#039;),y)&lt;&#92;epsilon&#039;' title='d(f(x&#039;),y)&lt;&#92;epsilon&#039;' class='latex' />. So for any <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon&gt;0' title='&#92;epsilon&gt;0' class='latex' /> we&#8217;ve found <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta&gt;0' title='&#92;delta&gt;0' class='latex' /> such that if <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cx%27%29%3C%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,x&#039;)&lt;&#92;delta' title='d(x,x&#039;)&lt;&#92;delta' class='latex' />, then <img src='http://s0.wp.com/latex.php?latex=d%28f%28x%29%2Cf%28x%27%29%29%3C%5Cepsilon%27&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(f(x),f(x&#039;))&lt;&#92;epsilon&#039;' title='d(f(x),f(x&#039;))&lt;&#92;epsilon&#039;' class='latex' /> and therefore <img src='http://s0.wp.com/latex.php?latex=d%28g%5Ccirc+f%28x%29%2Cg%5Ccirc+f%28x%27%29%29%3C%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(g&#92;circ f(x),g&#92;circ f(x&#039;))&lt;&#92;epsilon' title='d(g&#92;circ f(x),g&#92;circ f(x&#039;))&lt;&#92;epsilon' class='latex' />. Hence <img src='http://s0.wp.com/latex.php?latex=g%5Ccirc+f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g&#92;circ f' title='g&#92;circ f' class='latex' /> is continuous.</p>
<p><strong>4(c)</strong> Take <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon&gt;0' title='&#92;epsilon&gt;0' class='latex' />. We would like to find <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta&gt;0' title='&#92;delta&gt;0' class='latex' /> s.t. <img src='http://s0.wp.com/latex.php?latex=d%28g%28f%28x%29%29%2Cg%28f%28y%29%29%29%3C%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(g(f(x)),g(f(y)))&lt;&#92;epsilon' title='d(g(f(x)),g(f(y)))&lt;&#92;epsilon' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;delta' title='d(x,y)&lt;&#92;delta' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=g&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g' title='g' class='latex' /> is continuous, there exists <img src='http://s0.wp.com/latex.php?latex=%5Ceta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;eta&gt;0' title='&#92;eta&gt;0' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=d%28g%28f%28x%29%29%2Cg%28f%28y%29%29%29%3C%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(g(f(x)),g(f(y)))&lt;&#92;epsilon' title='d(g(f(x)),g(f(y)))&lt;&#92;epsilon' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28f%28x%29%2Cf%28y%29%29%3C%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(f(x),f(y))&lt;&#92;eta' title='d(f(x),f(y))&lt;&#92;eta' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is continuous, there exists <img src='http://s0.wp.com/latex.php?latex=%5Ctheta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;theta&gt;0' title='&#92;theta&gt;0' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=d%28f%28x%29%2Cf%28y%29%29%3C%5Ceta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(f(x),f(y))&lt;&#92;eta' title='d(f(x),f(y))&lt;&#92;eta' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=d%28x%2Cy%29%3C%5Ctheta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(x,y)&lt;&#92;theta' title='d(x,y)&lt;&#92;theta' class='latex' />. Therefore, setting <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%3D%5Ctheta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta=&#92;theta' title='&#92;delta=&#92;theta' class='latex' />, we are done.</p>
<hr />
<p><strong>Problem 5.</strong>  Let <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Y' title='Y' class='latex' /> be sets, let <img src='http://s0.wp.com/latex.php?latex=f%3AX%5Cto+Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f:X&#92;to Y' title='f:X&#92;to Y' class='latex' /> be an injection and let <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' /> be  subsets of <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' />. Then <img src='http://s0.wp.com/latex.php?latex=f%28A%29%5Ccap+f%28B%29+%5Csubset+f%28A%5Ccap+B%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(A)&#92;cap f(B) &#92;subset f(A&#92;cap B)' title='f(A)&#92;cap f(B) &#92;subset f(A&#92;cap B)' class='latex' />.</p>
<p><strong>5(a)</strong> Take <img src='http://s0.wp.com/latex.php?latex=x%5Cin+f%28A%29%5Ccap+f%28B%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in f(A)&#92;cap f(B)' title='x&#92;in f(A)&#92;cap f(B)' class='latex' />. So there is some <img src='http://s0.wp.com/latex.php?latex=y%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in A' title='y&#92;in A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=z%5Cin+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='z&#92;in B' title='z&#92;in B' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=f%28y%29%3Df%28z%29%3Dx&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(y)=f(z)=x' title='f(y)=f(z)=x' class='latex' />. As <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is injective, <img src='http://s0.wp.com/latex.php?latex=y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y' title='y' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=z&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='z' title='z' class='latex' /> are equal. So <img src='http://s0.wp.com/latex.php?latex=y%5Cin+A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in A&#92;cap B' title='y&#92;in A&#92;cap B' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=x%3Df%28y%29%5Cin+f%28A%5Ccap+B%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x=f(y)&#92;in f(A&#92;cap B)' title='x=f(y)&#92;in f(A&#92;cap B)' class='latex' />.</p>
<p><strong>5(b)</strong> Suppose <img src='http://s0.wp.com/latex.php?latex=y+%5Cin+f%28A%29+%5Ccap+f%28B%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y &#92;in f(A) &#92;cap f(B)' title='y &#92;in f(A) &#92;cap f(B)' class='latex' />.  Then, for some <img src='http://s0.wp.com/latex.php?latex=a+%5Cin+A%2C+b+%5Cin+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a &#92;in A, b &#92;in B' title='a &#92;in A, b &#92;in B' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=y+%3D+f%28a%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y = f(a)' title='y = f(a)' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=y+%3D+f%28b%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y = f(b)' title='y = f(b)' class='latex' />.  So <img src='http://s0.wp.com/latex.php?latex=f%28a%29+%3D+f%28b%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(a) = f(b)' title='f(a) = f(b)' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is injective, <img src='http://s0.wp.com/latex.php?latex=a+%3D+b&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a = b' title='a = b' class='latex' />, so <img src='http://s0.wp.com/latex.php?latex=a+%5Cin+A+%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a &#92;in A &#92;cap B' title='a &#92;in A &#92;cap B' class='latex' />, so <img src='http://s0.wp.com/latex.php?latex=y+%5Cin+f%28A+%5Ccap+B%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y &#92;in f(A &#92;cap B)' title='y &#92;in f(A &#92;cap B)' class='latex' />. So <img src='http://s0.wp.com/latex.php?latex=f%28A%29+%5Ccap+f%28B%29+%5Csubset+f%28A+%5Ccap+B%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(A) &#92;cap f(B) &#92;subset f(A &#92;cap B)' title='f(A) &#92;cap f(B) &#92;subset f(A &#92;cap B)' class='latex' />.  </p>
<p><strong>5(c)</strong> Let <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> be an element of <img src='http://s0.wp.com/latex.php?latex=f%28A%29%5Ccap+f%28B%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(A)&#92;cap f(B)' title='f(A)&#92;cap f(B)' class='latex' />. Then <img src='http://s0.wp.com/latex.php?latex=x%5Cin+f%28A%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in f(A)' title='x&#92;in f(A)' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=x%5Cin+f%28B%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in f(B)' title='x&#92;in f(B)' class='latex' />. That is, there exists <img src='http://s0.wp.com/latex.php?latex=y%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in A' title='y&#92;in A' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=f%28y%29%3Dx&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(y)=x' title='f(y)=x' class='latex' /> and there exists <img src='http://s0.wp.com/latex.php?latex=z%5Cin+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='z&#92;in B' title='z&#92;in B' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=f%28z%29%3Dx&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(z)=x' title='f(z)=x' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is an injection, <img src='http://s0.wp.com/latex.php?latex=f%28y%29%3Dx&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(y)=x' title='f(y)=x' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=f%28z%29%3Dx&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(z)=x' title='f(z)=x' class='latex' />, we have that <img src='http://s0.wp.com/latex.php?latex=y%3Dz&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y=z' title='y=z' class='latex' />. We would like to find <img src='http://s0.wp.com/latex.php?latex=u%5Cin+A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u&#92;in A&#92;cap B' title='u&#92;in A&#92;cap B' class='latex' /> s.t. <img src='http://s0.wp.com/latex.php?latex=f%28u%29%3Dx&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(u)=x' title='f(u)=x' class='latex' />. But <img src='http://s0.wp.com/latex.php?latex=u%5Cin+A%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u&#92;in A&#92;cap B' title='u&#92;in A&#92;cap B' class='latex' /> if and only if <img src='http://s0.wp.com/latex.php?latex=u%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u&#92;in A' title='u&#92;in A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=u%5Cin+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u&#92;in B' title='u&#92;in B' class='latex' />. Therefore, setting <img src='http://s0.wp.com/latex.php?latex=u%3Dy&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u=y' title='u=y' class='latex' />, we are done.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4861/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4861/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4861&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2013/03/25/an-experiment-concerning-mathematical-writing/feed/</wfw:commentRss>
		<slash:comments>64</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>Test</title>
		<link>http://gowers.wordpress.com/2013/03/24/test/</link>
		<comments>http://gowers.wordpress.com/2013/03/24/test/#comments</comments>
		<pubDate>Sun, 24 Mar 2013 18:24:29 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4879</guid>
		<description><![CDATA[I am testing the WordPress feature that allows me to moderate all comments before allowing them to appear. This has nothing to do with the discussion on the last post. Rather, I want to be sure that the feature works before my next post, where it will be important for people to comment without seeing [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4879&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>I am testing the WordPress feature that allows me to moderate all comments before allowing them to appear. This has nothing to do with the discussion on the last post. Rather, I want to be sure that the feature works before my next post, where it will be important for people to comment without seeing what others have said. So if someone could make a quick comment on this post, that would be helpful. Once I&#8217;m sure the feature is working, I&#8217;ll put up the post for which it matters.</p>
<p>In an ideal world, I would use the feature just for that post. However, as far as I can tell, my only options are allowing all comments, moderating all comments, or disabling comments completely on individual posts. Sending comments to the moderation queue on a post-by-post basis doesn&#8217;t seem to be possible, but if anyone knows a way, then I&#8217;d be very pleased to hear about it. Assuming there isn&#8217;t a way, then for a short while, all comments on this blog will be moderated, but I will try to approve comments on other posts regularly, so I hope this won&#8217;t be too annoying.</p>
<p><strong>Update.</strong> Good job I did this test. I changed the relevant setting but didn&#8217;t click &#8220;Save settings&#8221;. Hence the three comments below.</p>
<p><strong>Further update.</strong> OK, now it seems to be working just fine. Many thanks to those who sent test comments. I&#8217;ll put up the new post later this evening (British time).</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4879/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4879/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4879&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2013/03/24/test/feed/</wfw:commentRss>
		<slash:comments>21</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>Pierre Deligne wins the 2013 Abel Prize</title>
		<link>http://gowers.wordpress.com/2013/03/20/pierre-deligne-wins-the-2013-abel-prize/</link>
		<comments>http://gowers.wordpress.com/2013/03/20/pierre-deligne-wins-the-2013-abel-prize/#comments</comments>
		<pubDate>Wed, 20 Mar 2013 12:18:25 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[News]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4851</guid>
		<description><![CDATA[I have just finished presenting the work of this year&#8217;s Abel Prize winner, who is Pierre Deligne. In due course, the talk will appear on the Abel Prize website. As in the last two years, I have also prepared a written version of the talk, which goes into more detail. However, even the written version [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4851&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>I have just finished presenting the work of this year&#8217;s Abel Prize winner, who is Pierre Deligne. In due course, the talk will appear on the Abel Prize website. As in the last two years, I have also prepared a <a href="http://gowers.files.wordpress.com/2013/03/peterd.pdf">written version of the talk</a>, which goes into more detail. However, even the written version leaves a lot out. It was intended for a general &#8212; that is, not necessarily mathematical &#8212; audience, though I had to assume at least some maths. If your level of mathematical experience means that you find it too elementary, then I have three recommendations for further reading. I found <a href="http://home.iiserbhopal.ac.in/~kashyap/tau.pdf">these slides of Kumar Murty</a> about Ramanujan&#8217;s tau function helpful and interesting. I also very much like <a href="http://www.math.ucdavis.edu/~osserman/math/pcm.pdf">Brian Osserman&#8217;s article on the Weil conjectures</a>, written for the Princeton Companion to Mathematics. Finally, Nick Katz did the laudatio for Deligne&#8217;s Fields Medal and wrote <a href="http://ada00.math.uni-bielefeld.de/ICM/ICM1978.1/Main/icm1978.1.0047.0052.ocr.pdf">an excellent article on his work</a>. (Another article that I stumbled on only recently that looks incredibly nice, which is not about Deligne, though it mentions him, but which sheds interesting light on some of Deligne&#8217;s work is <a href="http://www.math.harvard.edu/~mazur/preprints/meaning_error.pdf">Finding Meaning in Error Terms</a>, by Barry Mazur. So far I have just skimmed through some of it, but I think I&#8217;ll be going back to read it in more detail.) </p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4851/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4851/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4851&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2013/03/20/pierre-deligne-wins-the-2013-abel-prize/feed/</wfw:commentRss>
		<slash:comments>149</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>The work of Endre Szemer&#233;di</title>
		<link>http://gowers.wordpress.com/2013/03/08/the-work-of-endre-szemerdi/</link>
		<comments>http://gowers.wordpress.com/2013/03/08/the-work-of-endre-szemerdi/#comments</comments>
		<pubDate>Fri, 08 Mar 2013 11:25:58 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[Straight maths]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4844</guid>
		<description><![CDATA[A few years ago, Springer published a book called, The Abel Prize: 2003-2007 The First Five Years. A brief calculation will reveal that a second volume ought to be due soon, and that is indeed the case. I was asked to write the article about Endre Szemer&#233;di, the 2012 winner, which I have just finished. [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4844&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>A few years ago, Springer published a book called, <a href="http://www.springer.com/mathematics/history+of+mathematics/book/978-3-642-01372-0">The Abel Prize: 2003-2007 The First Five Years</a>. A brief calculation will reveal that a second volume ought to be due soon, and that is indeed the case. I was asked to write the article about Endre Szemer&eacute;di, the 2012 winner, which I have just finished. I am glad to say that Springer&#8217;s policy with regard to this book is extremely enlightened: not only am I allowed to post my article as a preprint, but the entire book will be posted on the Norwegian Academy of Sciences website and will be freely accessible. </p>
<p>I was told to write the article as I pleased &#8212; the articles in the first volume are very different in style from each other &#8212; so I went for a style that was not unlike what I might have written if I had wanted to present several of Szemer&eacute;di&#8217;s results in a series of blog posts. That is, I&#8217;ve tried to explain the ideas, and when the going gets tough I have skipped the details. So it seems appropriate to post the article on this blog. </p>
<p>If you look at it and happen to notice any typos, false statements, wrong emphases, etc., I think it isn&#8217;t too late to make changes, so I&#8217;d be grateful to hear about them. <a href="http://gowers.files.wordpress.com/2013/03/szemeredi.pdf">Here is the article.</a> </p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4844/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4844/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4844&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2013/03/08/the-work-of-endre-szemerdi/feed/</wfw:commentRss>
		<slash:comments>21</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>Whither Polymath?</title>
		<link>http://gowers.wordpress.com/2013/02/28/whither-polymath/</link>
		<comments>http://gowers.wordpress.com/2013/02/28/whither-polymath/#comments</comments>
		<pubDate>Thu, 28 Feb 2013 13:19:04 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[polymath]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4836</guid>
		<description><![CDATA[Over at the Polymath blog, Gil Kalai recently proposed a discussion about possible future Polymath projects. This post is partly to direct you to that discussion in case you haven&#8217;t noticed it and might have ideas to contribute, and partly to start a specific Polymathematical conversation. I don&#8217;t call it a Polymath project, but rather [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4836&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Over at the Polymath blog, Gil Kalai recently <a href="http://polymathprojects.org/2013/02/14/next-polymath-projects-what-when-where/">proposed a discussion</a> about possible future Polymath projects. This post is partly to direct you to that discussion in case you haven&#8217;t noticed it and might have ideas to contribute, and partly to start a specific Polymathematical conversation. I don&#8217;t call it a Polymath project, but rather an idea I&#8217;d like to discuss that might or might not become the basis for a nice project. One thing that Gil and others have said is that it would be a good idea to experiment with various different levels of difficulty and importance of problem. Perhaps one way of getting a Polymath project to take off is to tackle a problem that isn&#8217;t necessarily all that hard or important, but is nevertheless sufficiently interesting to appeal to a critical mass of people. That is very much the spirit of this post. </p>
<p>Before I go any further, I should say that the topic in question is one about which I am not an expert, so it may well be that the answer to the question I&#8217;m about to ask is already known. I could I suppose try to find out on Mathoverflow, but I&#8217;m not sure I can formulate the question precisely enough to make a suitable Mathoverflow question, so instead I&#8217;m doing it here. This has the added advantage that if the question does seem suitable, then any discussion of it that there might be will take place where I would want any continuation of the discussion to take place.<br />
<span id="more-4836"></span></p>
<h3>Fast parallel sorting.</h3>
<p>I am in the middle of writing an article about Szemer&eacute;di&#8217;s mathematical work (for a book about Abel Prize winners), and one of his results that I am writing about is a famous parallel sorting network, discovered with Ajtai and Koml&oacute;s, that sorts <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> objects in <img src='http://s0.wp.com/latex.php?latex=C%5Clog+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C&#92;log n' title='C&#92;log n' class='latex' /> rounds. This is a gorgeous result, but unfortunately the constant <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' /> is quite large, and remains so after subsequent improvements, which means that in practice their sorting network does not work as well as a simpler one that runs in time <img src='http://s0.wp.com/latex.php?latex=C%28%5Clog+n%29%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C(&#92;log n)^2' title='C(&#92;log n)^2' class='latex' /> for a much smaller <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />. </p>
<p>The thought I would like to explore is not a way of solving that problem &#8212; obtaining the AKS result with a reasonable constant &#8212; but it is closely related. I have a rough idea for a randomized method of sorting, and I&#8217;d like to know whether it can be made to work. If it gave a good constant that would be great, but I think even proving that it works with <em>any</em> constant would be nice, unless it has been done already &#8212; in which case congratulations to whoever did it and I very much like your result.</p>
<p>The rough idea is this. As with any fast parallel sorting method, the sorting should take place in rounds, where each round consists in partitioning all the pairs (or a constant fraction of them, but there isn&#8217;t much to be lost by doing extra comparisons) and swapping the objects round if the one that should come before the other in fact comes after it. </p>
<p>An obvious thing to try is a random partition. So you start with <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> objects arranged in an arbitrary order. Your target is to rearrange them so that they are arranged according to some linear ordering that you don&#8217;t know, and all you are allowed to do is pairwise comparisons. To visualize it, I like to imagine that the objects are <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> rocks that all look fairly similar, that you want to lay them out in a line going from lightest to heaviest, and that all you have to help you is a set of <img src='http://s0.wp.com/latex.php?latex=n%2F2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n/2' title='n/2' class='latex' /> balances that can fit at most one rock on each side. </p>
<p>With this picture, the random method would be to partition the rocks randomly into pairs and do the corresponding <img src='http://s0.wp.com/latex.php?latex=n%2F2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n/2' title='n/2' class='latex' /> comparisons. When you have compared rocks <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y' title='y' class='latex' />, you put them back in the same two places in the line, but with the lighter one to the left of the heavier one (so they may have been switched round). </p>
<p>What happens if we do <img src='http://s0.wp.com/latex.php?latex=C%5Clog+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C&#92;log n' title='C&#92;log n' class='latex' /> random comparisons of this kind? The answer is that it doesn&#8217;t work very well at all. To see why not, suppose that the ordering is approximately correct, in the sense that for every <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' /> the <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' />th lightest rock is in roughly the <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' />th place. (To be more concrete, we could say that it is in the <img src='http://s0.wp.com/latex.php?latex=r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r' title='r' class='latex' />th place and that <img src='http://s0.wp.com/latex.php?latex=%7Cr-m%7C%5Cleq%5Csqrt%7Bn%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|r-m|&#92;leq&#92;sqrt{n}' title='|r-m|&#92;leq&#92;sqrt{n}' class='latex' /> or something like that.) Then when we do a random comparison, the probability that we move any given rock is extremely small (in the concrete case around <img src='http://s0.wp.com/latex.php?latex=n%5E%7B-1%2F2%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n^{-1/2}' title='n^{-1/2}' class='latex' /> at most). Basically, we are wasting our time comparing rocks when we are pretty certain of the answer in advance.</p>
<p>This suggests an adjustment to the naive random strategy, which is to have a succession of random rounds, but to make them gradually more and more &#8220;localized&#8221;. (Something like this is quite similar to what Ajtai, Koml&oacute;s and Szemer&eacute;di do, so this idea doesn&#8217;t come out of nowhere. But it is also very natural.) That is, at each stage, we would choose a distance scale <img src='http://s0.wp.com/latex.php?latex=d&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d' title='d' class='latex' /> and pair up the rocks randomly in a way that favours distances that are of order of magnitude <img src='http://s0.wp.com/latex.php?latex=d&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d' title='d' class='latex' />, and we would make <img src='http://s0.wp.com/latex.php?latex=d&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d' title='d' class='latex' /> shrink exponentially quickly as we proceed.</p>
<p>The thing that makes it a bit complicated (and again this has strong echoes in the AKS proof) is that if you just do a constant number of rounds at one distance scale, then a few points will get &#8220;left behind&#8221; &#8212; by sheer bad luck they just don&#8217;t get compared with anything useful. So the challenge is to prove that as the distance scale shrinks, the points that are far from where they should be get moved with such high probability that they &#8220;catch up&#8221; with the other points that they should be close to. </p>
<p>It seems to me at least possible that a purely random strategy with shrinking distance scales could sort <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> objects in <img src='http://s0.wp.com/latex.php?latex=C%5Clog+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C&#92;log n' title='C&#92;log n' class='latex' /> rounds, and that if one devised the random partitions in a particularly nice way then the proof might even be rather nice &#8212; some kind of random walk with drift might be taking place for each rock. </p>
<p>So my questions are these.</p>
<p>(i) Has something like this been done already?</p>
<p>(ii) If not, is anyone interested in exploring the possibility?</p>
<p>A final remark is that something that contributed a great deal to the health of the discussion about the Erd&#337;s discrepancy problem was the possibility of doing computer experiments. That would apply to some extent here too: one could devise an algorithm along the above lines and just observe experimentally how it does, whether points catch up after getting left behind, etc.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4836/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4836/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4836&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2013/02/28/whither-polymath/feed/</wfw:commentRss>
		<slash:comments>38</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>Ted Odell</title>
		<link>http://gowers.wordpress.com/2013/02/10/ted-odell/</link>
		<comments>http://gowers.wordpress.com/2013/02/10/ted-odell/#comments</comments>
		<pubDate>Sun, 10 Feb 2013 12:11:18 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[Straight maths]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4796</guid>
		<description><![CDATA[I was shocked and saddened to hear about a week ago that Ted Odell, a mathematician to whom I owe a lot, died suddenly on January 9th of a heart attack while he was travelling to this year&#8217;s joint AMS/MAA meeting in San Diego. He was 65, but seemed a lot younger. Ted was a [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4796&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gowers.files.wordpress.com/2013/02/tedodell.png"><img src="http://gowers.files.wordpress.com/2013/02/tedodell.png?w=214&#038;h=300" alt="tedodell" width="214" height="300" class="alignright size-medium wp-image-4797" /></a></p>
<p>I was shocked and saddened to hear about a week ago that Ted Odell, a mathematician to whom I owe a lot, died suddenly on January 9th of a heart attack while he was travelling to this year&#8217;s joint AMS/MAA meeting in San Diego. He was 65, but seemed a lot younger.</p>
<p>Ted was a world leader in Banach space theory, and in particular in the infinite-dimensional theory. The wry and slightly enigmatic smile you see in the photo was extremely characteristic: if I imagine Ted, I automatically imagine him with exactly that smile. Less clear from the photo, though perhaps it can be guessed from the camera angle, is that he was extremely tall: he belonged to a handful of mathematicians I know who make me feel short (Tom Sanders and Alex Scott being two others). </p>
<p>I first met Ted when I went to my first ever conference, in Strobl am Wolfgangsee in Austria in 1989. I can&#8217;t remember how it came about, but I ended up chatting to him, and he started explaining to me in a wonderfully clear way &#8212; the kind of explanation you just can&#8217;t get from a textbook &#8212; how Tsirelson&#8217;s space worked. I read in an obituary by Andr&aacute;s Zsak (which starts on page 30 of <a href="http://newsletter.lms.ac.uk/423/423_issue.pdf">this issue of the LMS newsletter</a>) that Ted had a reputation for being kind and encouraging to young mathematicians. He certainly was to me at this conference, taking the time to give this explanation to a graduate student about whom he knew nothing. Most of the next section describes an argument that he sketched out for me on one of those yellow pads of paper that seem to be standard in US maths departments. (I think I&#8217;ve still got the yellow sheets that he let me keep, but I&#8217;ve no idea where they are.)<br />
<span id="more-4796"></span></p>
<h3>Tsirelson&#8217;s space</h3>
<p>At the time, I was very interested in the so-called distortion problem for Banach spaces. I shall have a lot more to say about that later, but let me briefly formulate the notion of distortion here. Two norms <img src='http://s0.wp.com/latex.php?latex=%7C.%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|.|' title='|.|' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5C%7C.%5C%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;|.&#92;|' title='&#92;|.&#92;|' class='latex' /> on the same space <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> are said to be <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />-<em>equivalent</em> if there are constants <img src='http://s0.wp.com/latex.php?latex=a&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a' title='a' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=A%2Fa%5Cleq+C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A/a&#92;leq C' title='A/a&#92;leq C' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=a%5C%7Cx%5C%7C%5Cleq+%7Cx%7C%5Cleq+A%5C%7Cx%5C%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a&#92;|x&#92;|&#92;leq |x|&#92;leq A&#92;|x&#92;|' title='a&#92;|x&#92;|&#92;leq |x|&#92;leq A&#92;|x&#92;|' class='latex' /> for every <img src='http://s0.wp.com/latex.php?latex=x%5Cin+X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in X' title='x&#92;in X' class='latex' />. After rescaling, one can assume that they satisfy the inequalities <img src='http://s0.wp.com/latex.php?latex=%5C%7Cx%5C%7C%5Cleq+%7Cx%7C%5Cleq+C%5C%7Cx%5C%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;|x&#92;|&#92;leq |x|&#92;leq C&#92;|x&#92;|' title='&#92;|x&#92;|&#92;leq |x|&#92;leq C&#92;|x&#92;|' class='latex' />, so for convenience I&#8217;ll do that. We call the norm <img src='http://s0.wp.com/latex.php?latex=%7C.%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|.|' title='|.|' class='latex' /> a <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />-<em>distortion</em> of <img src='http://s0.wp.com/latex.php?latex=%5C%7C.%5C%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;|.&#92;|' title='&#92;|.&#92;|' class='latex' /> if the above inequality holds, and if it cannot be improved by passing to an infinite-dimensional subspace. That is, <img src='http://s0.wp.com/latex.php?latex=%5C%7Cx%5C%7C%5Cleq+%7Cx%7C%5Cleq+C%5C%7Cx%5C%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;|x&#92;|&#92;leq |x|&#92;leq C&#92;|x&#92;|' title='&#92;|x&#92;|&#92;leq |x|&#92;leq C&#92;|x&#92;|' class='latex' /> for every <img src='http://s0.wp.com/latex.php?latex=x%5Cin+X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in X' title='x&#92;in X' class='latex' />, and for every subspace <img src='http://s0.wp.com/latex.php?latex=Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Y' title='Y' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> and every <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon&gt;0' title='&#92;epsilon&gt;0' class='latex' /> you can find <img src='http://s0.wp.com/latex.php?latex=y%2Cz%5Cin+Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y,z&#92;in Y' title='y,z&#92;in Y' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%7Cy%7C%5Cleq%281%2B%5Cepsilon%29%5C%7Cy%5C%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|y|&#92;leq(1+&#92;epsilon)&#92;|y&#92;|' title='|y|&#92;leq(1+&#92;epsilon)&#92;|y&#92;|' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7Cz%7C%5Cgeq%28C-%5Cepsilon%29%5C%7Cz%5C%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|z|&#92;geq(C-&#92;epsilon)&#92;|z&#92;|' title='|z|&#92;geq(C-&#92;epsilon)&#92;|z&#92;|' class='latex' />.</p>
<p>It is far from obvious that there exists a constant <img src='http://s0.wp.com/latex.php?latex=C%3E1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C&gt;1' title='C&gt;1' class='latex' /> and a norm that can be <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />-distorted. If you want to get a sense of the difficulty of this problem, then you might like to spend half an hour thinking about it: if you haven&#8217;t already seen the answer, then you will get absolutely nowhere. </p>
<p>Of course, I&#8217;ve sort of given away that Tsirelson&#8217;s space provides an example, but even with that huge clue the problem isn&#8217;t trivial, since you have to come up with the <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />-distortion. It was this that Ted explained to me.</p>
<p>I&#8217;m not going to give a full explanation with all its details, but I&#8217;ll try at least to convey the basic idea, starting with the definition of Tsirelson&#8217;s space (or rather, the dual of the space that Tsirelson originally defined, which, as was observed by Figiel and Johnson, is easier to work with). To define the norm, which is on a space of sequences, we need the following notation. If <img src='http://s0.wp.com/latex.php?latex=a%3D%28a_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a=(a_n)' title='a=(a_n)' class='latex' /> is a sequence and <img src='http://s0.wp.com/latex.php?latex=E%5Csubset%5Cmathbb%7BN%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='E&#92;subset&#92;mathbb{N}' title='E&#92;subset&#92;mathbb{N}' class='latex' />, then <img src='http://s0.wp.com/latex.php?latex=Ea&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Ea' title='Ea' class='latex' /> is the sequence that takes the value <img src='http://s0.wp.com/latex.php?latex=a_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a_n' title='a_n' class='latex' /> when <img src='http://s0.wp.com/latex.php?latex=n%5Cin+E&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n&#92;in E' title='n&#92;in E' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='0' title='0' class='latex' /> otherwise. In other words, we write <img src='http://s0.wp.com/latex.php?latex=E&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='E' title='E' class='latex' /> for the coordinate projection associated with the set <img src='http://s0.wp.com/latex.php?latex=E&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='E' title='E' class='latex' />. If <img src='http://s0.wp.com/latex.php?latex=E%2CF%5Csubset%5Cmathbb%7BN%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='E,F&#92;subset&#92;mathbb{N}' title='E,F&#92;subset&#92;mathbb{N}' class='latex' />, we write <img src='http://s0.wp.com/latex.php?latex=E%3CF&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='E&lt;F' title='E&lt;F' class='latex' /> to mean that <img src='http://s0.wp.com/latex.php?latex=%5Cmax+E%3C%5Cmin+F&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;max E&lt;&#92;min F' title='&#92;max E&lt;&#92;min F' class='latex' />: that is, <img src='http://s0.wp.com/latex.php?latex=E&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='E' title='E' class='latex' /> ends before <img src='http://s0.wp.com/latex.php?latex=F&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='F' title='F' class='latex' /> starts. Finally, we say that a sequence <img src='http://s0.wp.com/latex.php?latex=%28E_1%2C%5Cdots%2CE_k%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(E_1,&#92;dots,E_k)' title='(E_1,&#92;dots,E_k)' class='latex' /> of subsets of <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BN%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{N}' title='&#92;mathbb{N}' class='latex' /> is <em>admissible</em> if <img src='http://s0.wp.com/latex.php?latex=k%3CE_1%3C%5Cdots%3CE_k&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='k&lt;E_1&lt;&#92;dots&lt;E_k' title='k&lt;E_1&lt;&#92;dots&lt;E_k' class='latex' />. (I write <img src='http://s0.wp.com/latex.php?latex=k%3CE_1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='k&lt;E_1' title='k&lt;E_1' class='latex' /> when formally I should write <img src='http://s0.wp.com/latex.php?latex=%5C%7Bk%5C%7D%3CE_1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{k&#92;}&lt;E_1' title='&#92;{k&#92;}&lt;E_1' class='latex' />.) That is, a sequence of sets is admissible if each set in the sequence starts after the previous one has finished, and if the minimum of the first set is bigger than the number of sets in the sequence. </p>
<p>We now define the norm of a sequence <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> using the following amazing formula. Let <img src='http://s0.wp.com/latex.php?latex=%5Cmathcal%7BE%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathcal{E}' title='&#92;mathcal{E}' class='latex' /> be the set of admissible sequences of sets.</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5C%7Cx%5C%7C%3D%5C%7Cx%5C%7C_%5Cinfty%5Cvee%5Cmax%5C%7BC%5E%7B-1%7D%5Csum_%7Bi%3D1%7D%5Ek%5C%7CE_ix%5C%7C%3Ak%5Cgeq+2%2C+%28E_1%2C%5Cdots%2CE_k%29%5Cin%5Cmathcal%7BE%7D%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;displaystyle &#92;|x&#92;|=&#92;|x&#92;|_&#92;infty&#92;vee&#92;max&#92;{C^{-1}&#92;sum_{i=1}^k&#92;|E_ix&#92;|:k&#92;geq 2, (E_1,&#92;dots,E_k)&#92;in&#92;mathcal{E}&#92;}' title='&#92;displaystyle &#92;|x&#92;|=&#92;|x&#92;|_&#92;infty&#92;vee&#92;max&#92;{C^{-1}&#92;sum_{i=1}^k&#92;|E_ix&#92;|:k&#92;geq 2, (E_1,&#92;dots,E_k)&#92;in&#92;mathcal{E}&#92;}' class='latex' /></p>
<p>In words, the idea is this. With each <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> you want to associate a non-negative real number, and you want to make that number as large as possible. You have two options. Either you can settle for the <img src='http://s0.wp.com/latex.php?latex=%5Cell_%5Cinfty&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_&#92;infty' title='&#92;ell_&#92;infty' class='latex' /> norm of <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' />, or you can chop up <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> into pieces <img src='http://s0.wp.com/latex.php?latex=E_1x%2C%5Cdots%2CE_kx&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='E_1x,&#92;dots,E_kx' title='E_1x,&#92;dots,E_kx' class='latex' /> (discarding everything up to the <img src='http://s0.wp.com/latex.php?latex=k&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='k' title='k' class='latex' />th coordinate), play the same game with each piece, add up the results, and divide by <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />. </p>
<p>Note that if <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> is finitely supported, then there is nothing to be gained by the chopping up if the support of <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> is entirely contained in one of the pieces, since then you&#8217;ll simply have divided by <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' /> unnecessarily. So the game terminates, and if you play it optimally, then you get the norm of <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' />.</p>
<p>How does one say anything about a norm that is defined by an implicit formula such as the one above? This is what Ted taught me. A first observation is that it is possible to find sequences of unit vectors <img src='http://s0.wp.com/latex.php?latex=x_1%3C%5Cdots%3Cx_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_1&lt;&#92;dots&lt;x_n' title='x_1&lt;&#92;dots&lt;x_n' class='latex' /> that approximate the unit vector basis of <img src='http://s0.wp.com/latex.php?latex=%5Cell_1%5En&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_1^n' title='&#92;ell_1^n' class='latex' /> as closely as you like. (The notation <img src='http://s0.wp.com/latex.php?latex=x%3Cy&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&lt;y' title='x&lt;y' class='latex' /> means that the support of <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> ends before the support of <img src='http://s0.wp.com/latex.php?latex=y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y' title='y' class='latex' /> starts.) The proof is simple. If the support of <img src='http://s0.wp.com/latex.php?latex=x_1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_1' title='x_1' class='latex' /> begins after <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' />, then the supports of <img src='http://s0.wp.com/latex.php?latex=x_1%2C%5Cdots%2Cx_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_1,&#92;dots,x_n' title='x_1,&#92;dots,x_n' class='latex' /> themselves form an admissible sequence. It follows from the implicit definition that <img src='http://s0.wp.com/latex.php?latex=%5C%7Ca_1x_1%2B%5Cdots%2Ba_nx_n%5C%7C%5Cgeq+C%5E%7B-1%7D%5Csum_%7Bi%3D1%7D%5En%7Ca_i%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;|a_1x_1+&#92;dots+a_nx_n&#92;|&#92;geq C^{-1}&#92;sum_{i=1}^n|a_i|' title='&#92;|a_1x_1+&#92;dots+a_nx_n&#92;|&#92;geq C^{-1}&#92;sum_{i=1}^n|a_i|' class='latex' /> for any sequence <img src='http://s0.wp.com/latex.php?latex=a_1%2C%5Cdots%2Ca_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a_1,&#92;dots,a_n' title='a_1,&#92;dots,a_n' class='latex' /> of real numbers.</p>
<p>A nice inductive argument of R. C. James allows one to improve this <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />-equivalence to a <img src='http://s0.wp.com/latex.php?latex=%281%2B%5Cepsilon%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(1+&#92;epsilon)' title='(1+&#92;epsilon)' class='latex' />-equivalence. Let <img src='http://s0.wp.com/latex.php?latex=%7C.%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|.|' title='|.|' class='latex' /> be any norm on <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BR%7D%5En&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{R}^n' title='&#92;mathbb{R}^n' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%5C%7Cx%5C%7C_1%5Cleq+%7Cx%7C%5Cleq+C%5C%7Cx%5C%7C_1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;|x&#92;|_1&#92;leq |x|&#92;leq C&#92;|x&#92;|_1' title='&#92;|x&#92;|_1&#92;leq |x|&#92;leq C&#92;|x&#92;|_1' class='latex' /> for every <img src='http://s0.wp.com/latex.php?latex=x%5Cin%5Cmathbb%7BR%7D%5En&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in&#92;mathbb{R}^n' title='x&#92;in&#92;mathbb{R}^n' class='latex' />. Let <img src='http://s0.wp.com/latex.php?latex=m%3D%5Csqrt%7Bn%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m=&#92;sqrt{n}' title='m=&#92;sqrt{n}' class='latex' /> (for convenience I&#8217;ll assume that <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> is a perfect square) and divide the standard unit vector basis <img src='http://s0.wp.com/latex.php?latex=e_1%2C%5Cdots%2Ce_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e_1,&#92;dots,e_n' title='e_1,&#92;dots,e_n' class='latex' /> into blocks of size <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' />. Then one of two things is guaranteed to happen.</p>
<p>(i) For every <img src='http://s0.wp.com/latex.php?latex=i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='i' title='i' class='latex' />, the <img src='http://s0.wp.com/latex.php?latex=i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='i' title='i' class='latex' />th block generates a vector <img src='http://s0.wp.com/latex.php?latex=x_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_i' title='x_i' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%7Cx_i%7C%5Cleq%5Csqrt%7BC%7D%5C%7Cx_i%5C%7C_1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|x_i|&#92;leq&#92;sqrt{C}&#92;|x_i&#92;|_1' title='|x_i|&#92;leq&#92;sqrt{C}&#92;|x_i&#92;|_1' class='latex' />.</p>
<p>(ii) For some <img src='http://s0.wp.com/latex.php?latex=i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='i' title='i' class='latex' />, every vector <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> generated by the unit vectors in the <img src='http://s0.wp.com/latex.php?latex=i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='i' title='i' class='latex' />th block satisfies <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7BC%7D%5C%7Cx%5C%7C_1%5Cleq+%7Cx%7C%5Cleq+C%5C%7Cx%5C%7C_1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{C}&#92;|x&#92;|_1&#92;leq |x|&#92;leq C&#92;|x&#92;|_1' title='&#92;sqrt{C}&#92;|x&#92;|_1&#92;leq |x|&#92;leq C&#92;|x&#92;|_1' class='latex' />.</p>
<p>In the second case, we have obviously improved the equivalence to <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7BC%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{C}' title='&#92;sqrt{C}' class='latex' /> on a subspace of dimension <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7Bn%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{n}' title='&#92;sqrt{n}' class='latex' />. But in the first case we have as well, because <img src='http://s0.wp.com/latex.php?latex=%7C%5Csum_%7Bi%3D1%7D%5Ema_ix_i%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|&#92;sum_{i=1}^ma_ix_i|' title='|&#92;sum_{i=1}^ma_ix_i|' class='latex' /> is at least <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bi%3D1%7D%5Em%7Ca_i%7C%5C%7Cx_i%5C%7C_1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{i=1}^m|a_i|&#92;|x_i&#92;|_1' title='&#92;sum_{i=1}^m|a_i|&#92;|x_i&#92;|_1' class='latex' /> by hypothesis and at most <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7BC%7D%5Csum_%7Bi%3D1%7D%5Em%7Ca_i%7C%5C%7Cx_i%5C%7C_1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{C}&#92;sum_{i=1}^m|a_i|&#92;|x_i&#92;|_1' title='&#92;sqrt{C}&#92;sum_{i=1}^m|a_i|&#92;|x_i&#92;|_1' class='latex' /> by the triangle inequality.</p>
<p>By iterating this argument a few times, we can make the equivalence as good as we like. </p>
<p>Now comes Ted&#8217;s nice idea. If we want a <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />-distortable space, then we can take Tsirelson&#8217;s space as defined above (actually, the normal thing to do was to take <img src='http://s0.wp.com/latex.php?latex=C%3D2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C=2' title='C=2' class='latex' />, but Ted wanted to take a general <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' /> in order to show that <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />-distortions existed for any <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />) and define an equivalent norm as follows:</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%7Cx%7C%3D%5Cmax%5C%7B%5Csum_%7Bi%3D1%7D%5En%5C%7CE_ix%5C%7C%3AE_1%3C%5Cdots%3CE_n%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;displaystyle |x|=&#92;max&#92;{&#92;sum_{i=1}^n&#92;|E_ix&#92;|:E_1&lt;&#92;dots&lt;E_n&#92;}' title='&#92;displaystyle |x|=&#92;max&#92;{&#92;sum_{i=1}^n&#92;|E_ix&#92;|:E_1&lt;&#92;dots&lt;E_n&#92;}' class='latex' /></p>
<p>Here, I&#8217;m being a bit sloppy about constants: <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> are basically the same, but for the chopping up in the above definition I want an integer. The idea is that you take a vector in Tsirelson&#8217;s space and you&#8217;re allowed to increase its norm by cutting it up into <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> pieces and adding the norms of those pieces. </p>
<p>It is easy to show that this is an <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' />-equivalent norm on Tsirelson&#8217;s space. It is at least as big as the original norm. But also, since <img src='http://s0.wp.com/latex.php?latex=%5C%7CE_ix%5C%7C%5Cleq+%5C%7Cx%5C%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;|E_ix&#92;|&#92;leq &#92;|x&#92;|' title='&#92;|E_ix&#92;|&#92;leq &#92;|x&#92;|' class='latex' /> for every <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' />, it follows by the triangle inequality that <img src='http://s0.wp.com/latex.php?latex=%7Cx%7C%5Cleq+n%5C%7Cx%5C%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|x|&#92;leq n&#92;|x&#92;|' title='|x|&#92;leq n&#92;|x&#92;|' class='latex' />. To show that we have a distortion, we need to show that sometimes chopping up a vector and adding the norms of its bits doesn&#8217;t help us, and sometimes it does. And we need to show that that is the case inside every subspace of Tsirelson&#8217;s space.</p>
<p>Here&#8217;s where I&#8217;ll get a bit sketchier. In any subspace we can find unit vectors <img src='http://s0.wp.com/latex.php?latex=x_1%2C%5Cdots%2Cx_N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_1,&#92;dots,x_N' title='x_1,&#92;dots,x_N' class='latex' /> that generate a subspace that&#8217;s <img src='http://s0.wp.com/latex.php?latex=%281%2B%5Cepsilon%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(1+&#92;epsilon)' title='(1+&#92;epsilon)' class='latex' />-close to <img src='http://s0.wp.com/latex.php?latex=%5Cell_1%5En&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_1^n' title='&#92;ell_1^n' class='latex' />. If we take such vectors and define <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> to be their average <img src='http://s0.wp.com/latex.php?latex=N%5E%7B-1%7D%28x_1%2B%5Cdots%2Bx_N%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N^{-1}(x_1+&#92;dots+x_N)' title='N^{-1}(x_1+&#92;dots+x_N)' class='latex' />, then we obtain a vector that can&#8217;t be improved much by chopping, at least if <img src='http://s0.wp.com/latex.php?latex=N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N' title='N' class='latex' /> is much bigger than <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' />. The reason is that, apart from a few edge effects, we&#8217;ve basically chopped <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> up into sums of <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> consecutive blocks of the <img src='http://s0.wp.com/latex.php?latex=x_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_i' title='x_i' class='latex' />, and since the <img src='http://s0.wp.com/latex.php?latex=x_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_i' title='x_i' class='latex' /> generate something very close to a copy of <img src='http://s0.wp.com/latex.php?latex=%5Cell_1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_1' title='&#92;ell_1' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%5C%7Cx%5C%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;|x&#92;|' title='&#92;|x&#92;|' class='latex' /> is very close to the sum of the norms of those sums. So the chopping didn&#8217;t achieve anything.</p>
<p>How about a vector where chopping <em>does</em> help? The technique here goes a long way towards explaining the point of the strange definition of Tsirelson&#8217;s space. Let&#8217;s call a vector of the kind I defined in the previous paragraph an <em><img src='http://s0.wp.com/latex.php?latex=%5Cell_1%5EN&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_1^N' title='&#92;ell_1^N' class='latex' />-average</em>. We now inductively construct (inside some arbitrary subspace) a sequence <img src='http://s0.wp.com/latex.php?latex=x_1%3C%5Cdots%3Cx_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_1&lt;&#92;dots&lt;x_n' title='x_1&lt;&#92;dots&lt;x_n' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=%5Cell_1%5EN&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_1^N' title='&#92;ell_1^N' class='latex' />-averages as follows. Once we&#8217;ve constructed <img src='http://s0.wp.com/latex.php?latex=x_1%2C%5Cdots%2Cx_k&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_1,&#92;dots,x_k' title='x_1,&#92;dots,x_k' class='latex' />, we pick <img src='http://s0.wp.com/latex.php?latex=N_%7Bk%2B1%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N_{k+1}' title='N_{k+1}' class='latex' /> to be much larger than the maximum of the support of <img src='http://s0.wp.com/latex.php?latex=x_k&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_k' title='x_k' class='latex' /> and we let <img src='http://s0.wp.com/latex.php?latex=x_%7Bk%2B1%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_{k+1}' title='x_{k+1}' class='latex' /> be an <img src='http://s0.wp.com/latex.php?latex=%5Cell_1%5E%7BN_%7Bk%2B1%7D%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_1^{N_{k+1}}' title='&#92;ell_1^{N_{k+1}}' class='latex' />-average that starts after <img src='http://s0.wp.com/latex.php?latex=x_k&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_k' title='x_k' class='latex' /> finishes.</p>
<p>It turns out that if we want to calculate the norm of the vector <img src='http://s0.wp.com/latex.php?latex=x%3Dx_1%2B%5Cdots%2Bx_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x=x_1+&#92;dots+x_n' title='x=x_1+&#92;dots+x_n' class='latex' /> in Tsirelson&#8217;s space, we can&#8217;t do much better than the obvious decomposition into <img src='http://s0.wp.com/latex.php?latex=x_1%2C%5Cdots%2Cx_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_1,&#92;dots,x_n' title='x_1,&#92;dots,x_n' class='latex' />, which tells us that the norm is roughly <img src='http://s0.wp.com/latex.php?latex=C%5E%7B-1%7D%5Csum_%7Bi%3D1%7D%5En%5C%7Cx_i%5C%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C^{-1}&#92;sum_{i=1}^n&#92;|x_i&#92;|' title='C^{-1}&#92;sum_{i=1}^n&#92;|x_i&#92;|' class='latex' />. </p>
<p>Why can&#8217;t we do better? Well, suppose we chop into <img src='http://s0.wp.com/latex.php?latex=M&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='M' title='M' class='latex' /> pieces that start after <img src='http://s0.wp.com/latex.php?latex=M&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='M' title='M' class='latex' />. Then for each <img src='http://s0.wp.com/latex.php?latex=i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='i' title='i' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=N_i%5Cleq+M&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N_i&#92;leq M' title='N_i&#92;leq M' class='latex' /> we know that the maximum of the support of <img src='http://s0.wp.com/latex.php?latex=x_%7Bi-1%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_{i-1}' title='x_{i-1}' class='latex' /> is smaller than <img src='http://s0.wp.com/latex.php?latex=M&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='M' title='M' class='latex' />, so we&#8217;ve completely missed out <img src='http://s0.wp.com/latex.php?latex=x_1%2C%5Cdots%2Cx_%7Bi-1%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_1,&#92;dots,x_{i-1}' title='x_1,&#92;dots,x_{i-1}' class='latex' /> in the chopping up of <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' />. And for each <img src='http://s0.wp.com/latex.php?latex=N_i%5Cgeq+M&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N_i&#92;geq M' title='N_i&#92;geq M' class='latex' /> we know that <img src='http://s0.wp.com/latex.php?latex=N_%7Bi%2B1%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N_{i+1}' title='N_{i+1}' class='latex' /> is much greater than <img src='http://s0.wp.com/latex.php?latex=M&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='M' title='M' class='latex' />, so there is nothing to be gained by chopping up <img src='http://s0.wp.com/latex.php?latex=x_%7Bi%2B1%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_{i+1}' title='x_{i+1}' class='latex' /> (or any later <img src='http://s0.wp.com/latex.php?latex=x_j&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_j' title='x_j' class='latex' />). </p>
<p>But if <img src='http://s0.wp.com/latex.php?latex=%5C%7Cx%5C%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;|x&#92;|' title='&#92;|x&#92;|' class='latex' /> is approximately <img src='http://s0.wp.com/latex.php?latex=C%5E%7B-1%7D%5Csum_%7Bi%3D1%7D%5En%5C%7Cx_i%5C%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C^{-1}&#92;sum_{i=1}^n&#92;|x_i&#92;|' title='C^{-1}&#92;sum_{i=1}^n&#92;|x_i&#92;|' class='latex' />, then <img src='http://s0.wp.com/latex.php?latex=%7Cx%7C%5Capprox+C%5C%7Cx%5C%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|x|&#92;approx C&#92;|x&#92;|' title='|x|&#92;approx C&#92;|x&#92;|' class='latex' />, since <img src='http://s0.wp.com/latex.php?latex=%7Cx%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|x|' title='|x|' class='latex' /> is at least <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bi%3D1%7D%5En%5C%7Cx_i%5C%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{i=1}^n&#92;|x_i&#92;|' title='&#92;sum_{i=1}^n&#92;|x_i&#92;|' class='latex' />.</p>
<p>What we have done with those calculations is show that every subspace contains vectors of two different kinds. One kind is <img src='http://s0.wp.com/latex.php?latex=%5Cell_1%5EN&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_1^N' title='&#92;ell_1^N' class='latex' /> averages, and the other is sums of <img src='http://s0.wp.com/latex.php?latex=%5Cell_1%5E%7BN_i%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_1^{N_i}' title='&#92;ell_1^{N_i}' class='latex' /> averages for a sequence <img src='http://s0.wp.com/latex.php?latex=N_1%2C%5Cdots%2CN_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N_1,&#92;dots,N_n' title='N_1,&#92;dots,N_n' class='latex' /> that grows very rapidly. For the first kind of vector <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%5C%7Cx%5C%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;|x&#92;|' title='&#92;|x&#92;|' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7Cx%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|x|' title='|x|' class='latex' /> are roughly equal, whereas for the second kind of vector <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%5C%7Cx%5C%7C%5Capprox+C%7Cx%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;|x&#92;|&#92;approx C|x|' title='&#92;|x&#92;|&#92;approx C|x|' class='latex' />. So we have shown that Tsirelson&#8217;s space, defined with constant <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />, is <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />-distortable (or perhaps it&#8217;s safer to say <img src='http://s0.wp.com/latex.php?latex=C%2F2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C/2' title='C/2' class='latex' />-distortable just to provide some elbow room in the calculations). </p>
<p>This shows that for every <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' /> there exists a <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />-distortable Banach space. But that raises a very natural question: can we reverse the quantifiers and find a Banach space that is <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />-distortable for every <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />? This question was answered by Thomas Schlumprecht, who had a very close collaboration with Ted over many years, which resulted in a large number of beautiful papers. Let me briefly give the definition of Schlumprecht&#8217;s space, the first known arbitrarily distortable space.</p>
<p>The philosophy behind Tsirelson&#8217;s space is that since the more pieces you chop a vector into, the bigger you can make the sum of the norms of the pieces, you need some kind of &#8220;penalty&#8221; associated with the number of pieces you chop into. The penalty is that if you want to chop into <img src='http://s0.wp.com/latex.php?latex=N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N' title='N' class='latex' /> pieces, then you must sacrifice the first <img src='http://s0.wp.com/latex.php?latex=N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N' title='N' class='latex' /> coordinates of your vector. Schlumprecht&#8217;s idea was to impose a different kind of penalty: instead of taking a fixed constant <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />, you let <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' /> grow with the number of pieces you chop into. Many functions would do for this purpose, but the function <img src='http://s0.wp.com/latex.php?latex=f%28x%29%3D%5Clog_2%28x%2B1%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(x)=&#92;log_2(x+1)' title='f(x)=&#92;log_2(x+1)' class='latex' /> had some properties that made it convenient in calculations. Accordingly, Schlumprecht&#8217;s space is defined by the following formula.</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5C%7Cx%5C%7C%3D%5C%7Cx%5C%7C_%5Cinfty%5Cvee%5Cmax%5C%7B%5Cfrac+1%7Bf%28n%29%7D%5Csum_%7Bi%3D1%7D%5En%5C%7CE_ix%5C%7C%3An%5Cgeq+2%2C+E_1%3C%5Cdots%3CE_n%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;displaystyle &#92;|x&#92;|=&#92;|x&#92;|_&#92;infty&#92;vee&#92;max&#92;{&#92;frac 1{f(n)}&#92;sum_{i=1}^n&#92;|E_ix&#92;|:n&#92;geq 2, E_1&lt;&#92;dots&lt;E_n&#92;}' title='&#92;displaystyle &#92;|x&#92;|=&#92;|x&#92;|_&#92;infty&#92;vee&#92;max&#92;{&#92;frac 1{f(n)}&#92;sum_{i=1}^n&#92;|E_ix&#92;|:n&#92;geq 2, E_1&lt;&#92;dots&lt;E_n&#92;}' class='latex' /></p>
<p>Because <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> grows slowly, every subspace of Schlumprecht&#8217;s space contains <img src='http://s0.wp.com/latex.php?latex=%5Cell_1%5EN&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_1^N' title='&#92;ell_1^N' class='latex' /> averages, by the James argument. But now if you take sums of <img src='http://s0.wp.com/latex.php?latex=%5Cell_1%5E%7BN_i%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_1^{N_i}' title='&#92;ell_1^{N_i}' class='latex' /> averages with rapidly increasing <img src='http://s0.wp.com/latex.php?latex=i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='i' title='i' class='latex' />, you can get the gain from chopping up into <img src='http://s0.wp.com/latex.php?latex=M&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='M' title='M' class='latex' /> pieces to grow without limit as <img src='http://s0.wp.com/latex.php?latex=M&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='M' title='M' class='latex' /> grows.</p>
<p>Incidentally, Tsirelson first constructed his space as an example of an infinite-dimensional space that does not have a subspace isomorphic to <img src='http://s0.wp.com/latex.php?latex=c_0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='c_0' title='c_0' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=%5Cell_p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_p' title='&#92;ell_p' class='latex' /> for any <img src='http://s0.wp.com/latex.php?latex=1%5Cleq+p%3C%5Cinfty&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1&#92;leq p&lt;&#92;infty' title='1&#92;leq p&lt;&#92;infty' class='latex' />. The fact that it contains copies of <img src='http://s0.wp.com/latex.php?latex=%5Cell_1%5En&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_1^n' title='&#92;ell_1^n' class='latex' /> in every subspace and that these copies are generated by disjointly supported vectors rules out everything except <img src='http://s0.wp.com/latex.php?latex=%5Cell_1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_1' title='&#92;ell_1' class='latex' />. And <img src='http://s0.wp.com/latex.php?latex=%5Cell_1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_1' title='&#92;ell_1' class='latex' /> is ruled out because if it contained a copy of <img src='http://s0.wp.com/latex.php?latex=%5Cell_1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_1' title='&#92;ell_1' class='latex' /> then that subspace wouldn&#8217;t be distortable.</p>
<h3>The distortion problem.</h3>
<p>I now want to describe a problem that was for several years my favourite problem in mathematics. The problem is very simple indeed to state, given the definitions in the previous section.</p>
<p><strong>Problem.</strong> <em>If <img src='http://s0.wp.com/latex.php?latex=1%3Cp%3C%5Cinfty&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1&lt;p&lt;&#92;infty' title='1&lt;p&lt;&#92;infty' class='latex' />, is <img src='http://s0.wp.com/latex.php?latex=%5Cell_p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_p' title='&#92;ell_p' class='latex' /> <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />-distortable for some <img src='http://s0.wp.com/latex.php?latex=C%3E1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C&gt;1' title='C&gt;1' class='latex' />?</em></p>
<p>We have seen above that there are distortable spaces. But what about the spaces we know best, the <img src='http://s0.wp.com/latex.php?latex=%5Cell_p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_p' title='&#92;ell_p' class='latex' />-spaces? An infinitary analogue of the argument of R. C. James that was used above to obtain <img src='http://s0.wp.com/latex.php?latex=%5Cell_1%5EN&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_1^N' title='&#92;ell_1^N' class='latex' />-averages can be used to prove that <img src='http://s0.wp.com/latex.php?latex=%5Cell_1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_1' title='&#92;ell_1' class='latex' /> is not distortable, and an argument along similar lines shows that <img src='http://s0.wp.com/latex.php?latex=c_0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='c_0' title='c_0' class='latex' /> is not distortable. But what about a Hilbert space, say? </p>
<p>The fact that Tsirelson&#8217;s space can be distorted offers little encouragement, because it relied on the general idea that there are two different kinds of vectors (in the strong sense that they can be found inside every subspace). A Hilbert space is so homogeneous that it looks hard, and perhaps impossible, to identify two different types of vectors.</p>
<p>I came to believe strongly that Hilbert spaces were not distortable. My reason was that there is a simple reformulation of the distortion problem that turns it into a problem in Ramsey theory that is very similar to known Ramsey theorems that have positive answers. Here is the reformulation.</p>
<p><strong>Problem.</strong> <em>Let <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon&gt;0' title='&#92;epsilon&gt;0' class='latex' /> and let the unit sphere of <img src='http://s0.wp.com/latex.php?latex=%5Cell_2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_2' title='&#92;ell_2' class='latex' /> be coloured with finitely many colours. Must there exist an infinite-dimensional subspace <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> and one of the colours such that every point in the unit sphere of <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> is within <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon' title='&#92;epsilon' class='latex' /> of a point with that colour?</em></p>
<p>This is slightly different from a normal Ramsey theorem in that we don&#8217;t find an infinite-dimensional subspace that&#8217;s entirely of one colour. Instead, we expand the colours by <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon' title='&#92;epsilon' class='latex' /> and <em>then</em> try to find a subspace that lives inside one of the (expanded) colours. This is necessary, since otherwise one could colour a vector <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> red if its first non-zero coordinate is positive and blue if it is negative.  But the problem above is a natural &#8220;analytic&#8221; Ramsey problem. </p>
<p>To see the connection, observe that if you have a <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />-distorted norm on the Hilbert space, then you can have one colour for vectors where the distorted norm is less than <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7BC%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{C}' title='&#92;sqrt{C}' class='latex' /> times the usual norm, and one for the rest. Since every subspace contains vectors that have norms roughly equal to the usual norm and other vectors that have norms roughly equal to <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' /> times the usual norm, this gives us a counterexample to the Ramsey-type problem. In the other direction, if you have a two-colouring and no subspace is contained in the <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon' title='&#92;epsilon' class='latex' />-expansion of either colour, then you can construct a distorted norm. I leave this as an exercise.</p>
<p>Why did I believe so strongly that this Ramsey problem had a positive solution? It is because of a beautiful theorem in Ramsey theory due to Neil Hindman. In its original formulation, it states the following.</p>
<p><strong>Theorem.</strong> <em>Let <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BN%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{N}' title='&#92;mathbb{N}' class='latex' /> be coloured with finitely many colours. Then there exists an infinite sequence <img src='http://s0.wp.com/latex.php?latex=n_1%3Cn_2%3C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n_1&lt;n_2&lt;&#92;dots' title='n_1&lt;n_2&lt;&#92;dots' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bi%5Cin+A%7Dn_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{i&#92;in A}n_i' title='&#92;sum_{i&#92;in A}n_i' class='latex' /> has the same colour for every non-empty finite subset <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BN%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{N}' title='&#92;mathbb{N}' class='latex' />.</em></p>
<p>There is also a &#8220;finite-unions&#8221; version, which says the following.</p>
<p><strong>Theorem.</strong> <em>Let the finite subsets of <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BN%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{N}' title='&#92;mathbb{N}' class='latex' /> be coloured with finitely many colours. Then there exist subsets <img src='http://s0.wp.com/latex.php?latex=E_1%3CE_2%3C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='E_1&lt;E_2&lt;&#92;dots' title='E_1&lt;E_2&lt;&#92;dots' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%5Cbigcup_%7Bi%5Cin+A%7DE_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;bigcup_{i&#92;in A}E_i' title='&#92;bigcup_{i&#92;in A}E_i' class='latex' /> has the same colour for every non-empty finite subset <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BN%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{N}' title='&#92;mathbb{N}' class='latex' />.</em></p>
<p>Here the notation <img src='http://s0.wp.com/latex.php?latex=E%3CF&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='E&lt;F' title='E&lt;F' class='latex' /> means that the maximum element of <img src='http://s0.wp.com/latex.php?latex=E&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='E' title='E' class='latex' /> is less than the minimum element of <img src='http://s0.wp.com/latex.php?latex=F&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='F' title='F' class='latex' />.</p>
<p>To see that the sums version implies the unions version, think of each subset of <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BN%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{N}' title='&#92;mathbb{N}' class='latex' /> as the binary expansion of a positive integer. That is, the subset <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> corresponds to the positive integer <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bi%5Cin+A%7D2%5E%7Bi-1%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{i&#92;in A}2^{i-1}' title='&#92;sum_{i&#92;in A}2^{i-1}' class='latex' />. Given a colouring of the finite subsets of <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BN%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{N}' title='&#92;mathbb{N}' class='latex' />, pull it over to the integers using this correspondence, and apply the first version of Hindman&#8217;s theorem. This doesn&#8217;t quite finish things off, because it gives us some sets that may well not satisfy the condition <img src='http://s0.wp.com/latex.php?latex=E_1%3CE_2%3C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='E_1&lt;E_2&lt;&#92;dots' title='E_1&lt;E_2&lt;&#92;dots' class='latex' />. However, given any infinite set <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> of integers and any <img src='http://s0.wp.com/latex.php?latex=k&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='k' title='k' class='latex' />, we can always find a sum of distinct elements of <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> that is divisible by <img src='http://s0.wp.com/latex.php?latex=2%5Ek&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='2^k' title='2^k' class='latex' />. This follows easily from the pigeonhole principle: you just find <img src='http://s0.wp.com/latex.php?latex=2%5Ek&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='2^k' title='2^k' class='latex' /> integers that are the same mod <img src='http://s0.wp.com/latex.php?latex=2%5Ek&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='2^k' title='2^k' class='latex' /> and add them up. Therefore, using the set <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> that Hindman&#8217;s theorem has given us, we can find finite sums that correspond to sets <img src='http://s0.wp.com/latex.php?latex=E&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='E' title='E' class='latex' /> with arbitrarily large minimum, from which it follows that we can find a sequence of (disjoint) finite sums that correspond to sets <img src='http://s0.wp.com/latex.php?latex=E_1%3CE_2%3C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='E_1&lt;E_2&lt;&#92;dots' title='E_1&lt;E_2&lt;&#92;dots' class='latex' />.</p>
<p>The finite-unions version of the theorem can in turn be rephrased in terms of 01-sequences. If <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y' title='y' class='latex' /> are two 01-sequences, write <img src='http://s0.wp.com/latex.php?latex=x%3Cy&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&lt;y' title='x&lt;y' class='latex' /> to mean that the last non-zero entry of <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> comes before the first non-zero entry of <img src='http://s0.wp.com/latex.php?latex=y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y' title='y' class='latex' />. Then the finite-unions version of the theorem is trivially equivalent to the following statement.</p>
<p><strong>Theorem.</strong> Let the finitely supported 01-sequences be coloured with finitely many colours. Then there exist sequences <img src='http://s0.wp.com/latex.php?latex=x_1%3Cx_2%3C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_1&lt;x_2&lt;&#92;dots' title='x_1&lt;x_2&lt;&#92;dots' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bi%5Cin+A%7Dx_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{i&#92;in A}x_i' title='&#92;sum_{i&#92;in A}x_i' class='latex' /> has the same colour for every non-empty finite subset <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BN%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{N}' title='&#92;mathbb{N}' class='latex' />.</em></p>
<p>This is what got me excited. It seems very much like a combinatorial analogue of a positive solution to the distortion problem: a sequence space is replaced by a set of 01-sequences, but that shouldn&#8217;t be a huge deal.</p>
<p>I did in fact manage to pursue these thoughts to prove a theorem in the space that most resembles the space of finitely supported 01-sequences, namely <img src='http://s0.wp.com/latex.php?latex=c_0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='c_0' title='c_0' class='latex' /> (the space of real sequences that tend to zero, with the <img src='http://s0.wp.com/latex.php?latex=%5Cell_%5Cinfty&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_&#92;infty' title='&#92;ell_&#92;infty' class='latex' /> norm). It states the following.</p>
<p><strong>Theorem.</strong> <em>Let <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon&gt;0' title='&#92;epsilon&gt;0' class='latex' /> and let the unit sphere of <img src='http://s0.wp.com/latex.php?latex=c_0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='c_0' title='c_0' class='latex' /> be coloured with finitely many colours. Then there exists an infinite-dimensional subspace that lies entirely in the <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon' title='&#92;epsilon' class='latex' />-expansion of one of the colours.</em></p>
<p>For what it&#8217;s worth, the theorem above is the one result of mine that appears in an Elsevier journal. </p>
<p>There are two things that make <img src='http://s0.wp.com/latex.php?latex=c_0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='c_0' title='c_0' class='latex' /> particularly combinatorial. One is that it can be discretized very naturally and easily: you just take all points with coordinates that are multiples of <img src='http://s0.wp.com/latex.php?latex=1%2Fn&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1/n' title='1/n' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' />. The other is that if you add two disjointly supported vectors, then you basically just concatenate them. I won&#8217;t say much about the proof here &#8212; just that it used ultrafilters (as does one of the nicest proofs of Hindman&#8217;s theorem) and it involved a subtlety, in that the obvious discrete problem that you might be tempted to write down has a negative answer because of the sign-of-first-non-zero-coordinate example. So even for the discrete problem one has to prove an approximate result &#8212; the <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon' title='&#92;epsilon' class='latex' /> doesn&#8217;t just come from the approximation of the continuous problem by the discrete one.</p>
<p>Try as I might, I just couldn&#8217;t find a way of doing without the combinatorial crutch that I had in <img src='http://s0.wp.com/latex.php?latex=c_0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='c_0' title='c_0' class='latex' />. And yet the homogeneity of <img src='http://s0.wp.com/latex.php?latex=%5Cell_2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_2' title='&#92;ell_2' class='latex' /> and the fact that it was a very beautiful and natural sequence space made it impossible for me to give up the dream that there was some vast analytic generalization of Hindman&#8217;s theorem waiting to be proved. </p>
<p>What did finally cause me to give up that dream was the announcement in I would guess 1993 (since the paper was published in 1994) that Odell and Schlumprecht had constructed a counterexample to the distortion problem. Very surprisingly (to me at least), they deduced that every <img src='http://s0.wp.com/latex.php?latex=%5Cell_p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_p' title='&#92;ell_p' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=1%3Cp%3C%5Cinfty&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1&lt;p&lt;&#92;infty' title='1&lt;p&lt;&#92;infty' class='latex' /> is arbitrarily distortable from the fact that Schlumprecht&#8217;s space is arbitrarily distortable. To do this, they had to apply a very clever non-linear map to take the colourings that correspond to distortions of Schlumprecht&#8217;s space and convert them into colourings that correspond to distortions of <img src='http://s0.wp.com/latex.php?latex=%5Cell_p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_p' title='&#92;ell_p' class='latex' />. (I think I may be oversimplifying here &#8212; it was a long time ago and I don&#8217;t remember much more than I am writing here.) </p>
<p>Another way of stating the remarkable result that Odell and Schlumprecht proved is the following. I&#8217;ll state it for <img src='http://s0.wp.com/latex.php?latex=%5Cell_2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_2' title='&#92;ell_2' class='latex' /> because there it is nicest.</p>
<p><strong>Theorem.</strong> <em>For every <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon&gt;0' title='&#92;epsilon&gt;0' class='latex' /> there exist subsets <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' /> of the unit sphere of <img src='http://s0.wp.com/latex.php?latex=%5Cell_2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_2' title='&#92;ell_2' class='latex' /> with the following properties.</p>
<p>(i) Every infinite dimensional subspace of <img src='http://s0.wp.com/latex.php?latex=%5Cell_2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_2' title='&#92;ell_2' class='latex' /> contains a point in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> and a point in <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' />.</p>
<p>(ii) <img src='http://s0.wp.com/latex.php?latex=%7C%5Clangle+x%2Cy%5Crangle%7C+%3C%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|&#92;langle x,y&#92;rangle| &lt;&#92;epsilon' title='|&#92;langle x,y&#92;rangle| &lt;&#92;epsilon' class='latex' /> for every <img src='http://s0.wp.com/latex.php?latex=x%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;in A' title='x&#92;in A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=y%5Cin+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in B' title='y&#92;in B' class='latex' />. </em></p>
<p>In other words, the two sets <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' /> are almost orthogonal to each other and yet both intersect every single infinite-dimensional subspace. Even getting the inner product to be at most <img src='http://s0.wp.com/latex.php?latex=1-%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1-&#92;epsilon' title='1-&#92;epsilon' class='latex' /> is formidably difficult, or at least seems so: it would be fascinating if somebody could come up with a fundamentally different construction.</p>
<p>For me, the whole episode had some important lessons for how mathematical research works. I became obsessed with the distortion problem, and ultimately failed to solve it. And yet, out of that obsession, and also from what I learned during that conversation with Ted Odell, came an understanding of distortability that ended up yielding me a number of other results. On the other hand, I was a little too convinced that the problem had a positive answer, and that meant that there was no chance of my solving it &#8212; though the solution of Odell and Schlumprecht was so ingenious that I don&#8217;t think I would have solved it even if I had gone all out to find a counterexample. </p>
<h3>Conclusion.</h3>
<p>I have tried in this post to give some idea of what Ted Odell contributed to mathematics, not by going through his numerous mathematical achievements one by one, but in a more indirect way by conveying the impact he had on me personally. Many other people will have different stories to tell, but the themes that will run through them are that he was a wonderful mathematician, generous with his ideas, and very supportive to younger mathematicians.</p>
<p>I&#8217;d like to end by mentioning a beautiful problem that arises very naturally from the above discussion and that is, as far as I know, still unsolved. It is all that remains of my hopes for proving an analytic Ramsey theorem for a non-combinatorial Banach space.</p>
<p><strong>Problem.</strong> <em>Does there exist a distortable Banach space that is not arbitrarily distortable? In particular, if Tsirelson&#8217;s space is defined with constant <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />, can it be distorted by substantially more than <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />?</em></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4796/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4796/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4796&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2013/02/10/ted-odell/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>

		<media:content url="http://gowers.files.wordpress.com/2013/02/tedodell.png?w=214" medium="image">
			<media:title type="html">tedodell</media:title>
		</media:content>
	</item>
		<item>
		<title>The Elsevier boycott one year on</title>
		<link>http://gowers.wordpress.com/2013/01/28/the-elsevier-boycott-one-year-on/</link>
		<comments>http://gowers.wordpress.com/2013/01/28/the-elsevier-boycott-one-year-on/#comments</comments>
		<pubDate>Mon, 28 Jan 2013 14:29:15 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[Elsevier]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4768</guid>
		<description><![CDATA[A few days ago was the anniversary of the beginning of the Cost of Knowledge boycott of Elsevier. It seems a good moment to take stock of what the boycott has achieved and to think about what progress has or hasn&#8217;t been made since it started. This post is a short joint statement by many [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4768&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>A few days ago was the anniversary of the beginning of the <a href="http://thecostofknowledge.com/">Cost of Knowledge boycott</a> of Elsevier. It seems a good moment to take stock of what the boycott has achieved and to think about what progress has or hasn&#8217;t been made since it started. This post is a short joint statement by many of the people who signed the original Cost of Knowledge statement last year. At some point in the not too distant future I plan to write a longer post giving a more personal view.</p>
<p><strong>The Elsevier boycott: where do we now stand?</strong></p>
<p>In the first few months after the boycott started, the number of signatories grew very rapidly. The growth is now much slower, but this was to be expected: given that, for understandable reasons, no editorial boards of Elsevier journals were ready to take the drastic step of leaving Elsevier, it was inevitable that further progress would depend on the creation of new publication models, which takes time and work, much of it not in the public eye. We are very pleasantly surprised by how much progress of this kind there has already been, with the setting up of Forum of Mathematics, a major new open-access journal, and the recent announcement of the Episciences Project, a new platform for overlay journals. We are also pleased by the rapid progress made by the wider Open Access movement over the last year.</p>
<p>In one respect the boycott has been an unqualified success: it has helped to raise awareness of the concerns we have about academic publishing. This, we believe, will make it easier for new publishing initiatives to succeed, and we strongly encourage further experimentation. We believe that commercial publishers could in principle play a valuable role in the future of mathematical publishing, but we would prefer to see publishers as &#8220;service providers&#8221;: that is, mathematicians would control journals, publishers would provide services that mathematicians deemed necessary, and prices would be kept competitive since mathematicians would have the option of obtaining these services elsewhere.</p>
<p>We welcome the moves that Elsevier made last year in the months that followed the start of the boycott: the dropping of support for the Research Works Act, the fact that back issues for many journals have now been made available, a clear statement that authors can post preprints on the arXiv that take into account comments by referees, and some small price reductions. However, the fundamental problems remain. Elsevier still has a stranglehold over many of our libraries as a result of Big Deals (a.k.a. bundling) and this continues to do real damage, such as forcing them to cancel subscriptions to more independent journals and to reduce their spending on books. There has also been no improvement in transparency: it as hard as ever to know what libraries are paying for Big Deals. We therefore plan to continue boycotting Elsevier and encourage others to do the same.</p>
<p>The problem of expensive subscriptions will not be solved until more libraries are prepared to cancel subscriptions and Big Deals. To be an effective negotiating tactic this requires support from the community: we must indicate that we would be willing to put up with cancelling overly expensive subscriptions. The more papers are made freely available online (e.g., through the arXiv), the easier that will be. Many already are, and we regard it as a moral duty for mathematicians to make their papers available when publishers allow it. Unfortunately, since mathematics papers are bundled together with papers in other subjects, real progress on costs will depend on coordinated action by mathematicians and scientists, many of whom have very different publication practices. However, a statement by mathematicians that they would not be unduly inconvenienced by the cancelling of expensive subscriptions would be a powerful one.</p>
<p>We are well aware that the problems mentioned above are not confined to Elsevier. We believe that the boycott has been more successful as a result of focusing attention on Elsevier, but the problem is a wider one, and many of us privately try to avoid the other big commercial publishers. We realize that this is not easy for all researchers. When there are more alternatives available, it will become easier: we encourage people to support new ventures if they are in a position do so without undue risk to their careers.</p>
<p>We acknowledge that there are differing opinions about what an ideal publishing system would be like. In particular, the issue of article processing charges is a divisive one: some mathematicians are strongly opposed to them, while others think that there is no realistic alternative. We do not take a collective position on this, but we would point out that the debate is by no means confined to mathematicians: it has been going on in the Open Access community for many years. We note also that the advantages and disadvantages of article processing charges depend very much on the policies that journals have towards fee waivers: we strongly believe that editorial decisions should be independent of an author&#8217;s access to appropriate funds, and that fee-waiver policies should be designed to ensure this.</p>
<p>To summarize, we believe that the boycott has been a success and should be continued. Further success will take time and effort, but there are simple steps that we can all take: making our papers freely available, and supporting new and better publication models when they are set up.</p>
<p>Doug Arnold, John Baez, Folkmar Bornemann, Danny Calegari, Henry Cohn, Ingrid Daubechies, Jordan Ellenberg, Marie Farge, David Gabai, Timothy Gowers, Michael Harris, Fr&eacute;d&eacute;ric H&eacute; lein, Rolf Jeltsch, Rob Kirby, Vincent Lafforgue, Randall J. LeVeque, Peter Olver, Olof Sisask, Terence Tao, Richard Taylor, Nick Trefethen, Marie-France Vigneras, Wendelin Werner, G&uuml;nter M. Ziegler</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4768/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4768/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4768&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2013/01/28/the-elsevier-boycott-one-year-on/feed/</wfw:commentRss>
		<slash:comments>50</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>Why I&#8217;ve also joined the good guys</title>
		<link>http://gowers.wordpress.com/2013/01/16/why-ive-also-joined-the-good-guys/</link>
		<comments>http://gowers.wordpress.com/2013/01/16/why-ive-also-joined-the-good-guys/#comments</comments>
		<pubDate>Wed, 16 Jan 2013 17:42:59 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[Mathematics on the internet]]></category>
		<category><![CDATA[News]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4754</guid>
		<description><![CDATA[For some months now I have known of a very promising initiative that until recently I have been asked not to publicize too widely, because the people in charge of it did not have a good estimate for when it would actually come to fruition. But now those who know about it have been given [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4754&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>For some months now I have known of a very promising initiative that until recently I have been asked not to publicize too widely, because the people in charge of it did not have a good estimate for when it would actually come to fruition. But now those who know about it have been given the green light. The short version of what I want to say in this post is that a platform is to be created that will make it very easy to set up arXiv overlay journals. </p>
<p>What is an arXiv overlay journal? It is just like an electronic journal, except that instead of a website with lots of carefully formatted articles, all you get is a list of links to preprints on the arXiv. The idea is that the parts of the publication process that academics do voluntarily &#8212; editing and refereeing &#8212; are just as they are for traditional journals, and we do without the parts that cost money, such as copy-editing and typesetting.<br />
<span id="more-4754"></span></p>
<p>The organization setting up this platform is called the <a href="http://episciences.org/">Episciences Project</a>, and they are referring to the journals as epijournals, which I&#8217;ll do here, though epijournals will probably not use the word &#8220;epijournal&#8221; in their titles (since they will want to make clear that the stamp of quality that they confer is every bit as legitimate as the stamp of quality conferred by a traditional journal). They aim to make the software good enough that the administrative burden on editorial boards is no greater than it is for a traditional journal. If they succeed in that aim, then it should be possible for epijournals to be &#8220;Diamond&#8221; open access &#8212; free to read and free to publish. Certainly the intention is that there should be no charges of any kind, with the costs of maintaining the site met, if I understand correctly, by an organization called <a href="http://ccsd.cnrs.fr/">Centre pour la Communication Scientifique Directe</a> (CCSD) in collaboration with the <a href="http://www-fourier.ujf-grenoble.fr/?lang=fr">Institut Fourier at Grenoble University</a>.</p>
<p>One possibility being discussed, which I am very much in favour of, is each accepted article having not just a link to the arXiv but also a web page for (non-anonymous) comments and reviews. For example, the editor who accepts an article might wish to write a paragraph or two about why the article is interesting, a reader who spots a minor error might write explaining the error and how it can be fixed (if it can), and an expert in the area might write a review that could be very useful to hiring committees. </p>
<p>This may even go further, with comment pages being set up for other preprints and journal articles &#8212; not just the ones that have appeared in epijournals. </p>
<p>Apparently, the plan is for the whole thing to start this April. Because I have known about the project for some time, I have quietly sounded out a few people in additive combinatorics, and it seems that there is enough enthusiasm that we will be able to start an epijournal broadly in that area (with a title that is not yet decided, but that will definitely not be &#8220;The Epijournal of Additive Combinatorics&#8221;). I am also on a committee (actually, they call it an Epicommittee) that is discussing some of the details of what the platform should be like &#8212; any comments you might have will be read with interest. [Added later: now that he has said so on Google+, I feel I can add that Terence Tao is also on the Epicommittee, so he has joined the good guys too.]</p>
<p>One question that some people might have is why, when there are a number of initiatives out there, this one should be regarded as particularly promising and worth supporting. I don&#8217;t know enough to give a detailed answer to that, but my impression is that this initiative has significant institutional back-up, including funding, that makes it more likely to succeed. Also, it is being designed for mathematicians and with the needs of mathematicians very much in mind, though it may later expand into other subjects. </p>
<p>April is very soon, but I hope people reading this, especially people who are critical of FoM and would rather move straight to a more radically different publication model, will give serious thought to setting up epijournals or encouraging others to do so. Another possibility envisaged by the people running the project is that some existing journals might like to convert to epijournals, which would certainly be interesting if it happened. And finally, if and when people do start to set up epijournals, please support them: if an epijournal gets plenty of good papers, then it will be much easier for it to establish the kind of reputation that will impress hiring committees (though I hope that if post-publication comments and reviews take off, they will be seen to provide more useful information than what can be deduced from which journal a paper gets into). </p>
<p>The Episciences project will soon be releasing a statement about the project. When it has done so, I&#8217;ll provide the link here. </p>
<p>I&#8217;ve been slightly vague about who the people behind this project are, which is because I am not 100% sure. However, the initial approach came from Jean-Pierre Demailly, Ariane Rolland and Beno&icirc;t Kloeckner and subsequent emails have come from Jean-Pierre Demailly, so I think it&#8217;s them &#8212; my uncertainty is over whether there are other people I should be mentioning too. If I discover that there are, then I&#8217;ll add their names.</p>
<p>Added later: Beno&icirc;t Kloeckner makes the following comment below.</p>
<blockquote><p>I can clarify a bit the “epi-team” composition. Jean-Pierre Demailly tried to launch a similar project some years ago, but it had much less institutional support and did not work out. More recently, Ariane Rolland heard about this tentative and, having contact at CCSD, made them meet with Jean-Pierre. That’s the real beginning of the episciences project, which I joined a bit later. The names you should add are the people involved in the CCSD: Christine Berthaud, head of CCSD, Laurent Capelli who is coding the software right now, and Agnès Magron who is working on the communication with Ariane.</p></blockquote>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4754/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4754/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4754&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2013/01/16/why-ive-also-joined-the-good-guys/feed/</wfw:commentRss>
		<slash:comments>191</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>Why I&#8217;ve joined the bad guys</title>
		<link>http://gowers.wordpress.com/2013/01/14/why-ive-joined-the-bad-guys/</link>
		<comments>http://gowers.wordpress.com/2013/01/14/why-ive-joined-the-bad-guys/#comments</comments>
		<pubDate>Mon, 14 Jan 2013 15:40:02 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[Elsevier]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[Mathematics on the internet]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4540</guid>
		<description><![CDATA[A few months ago I was alerted by a pingback to the existence of a blog post by Orr Shalit entitled Worse than Elsevier which included the assertion that Terence Tao and I had &#8220;joined the bad guys&#8221;. That is an allusion to the fact that we are editors for Forum of Mathematics, CUP&#8217;s new [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4540&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>A few months ago I was alerted by a pingback to the existence of a blog post by Orr Shalit entitled <a href="http://noncommutativeanalysis.wordpress.com/2012/09/27/worse-than-elsevier/">Worse than Elsevier</a> which included the assertion that Terence Tao and I had &#8220;joined the bad guys&#8221;. That is an allusion to the fact that we are editors for <a href="http://journals.cambridge.org/action/displaySpecialPage?pageId=3896">Forum of Mathematics</a>, CUP&#8217;s new open-access journal. This post serves a dual purpose: to draw attention to the fact that Forum of Mathematics is now accepting submissions, and to counter some of the many objections that have been raised to it. In particular, I want to separate out the objections that are based on misconceptions from the objections that have real substance. Both kinds exist, and unfortunately they tend to get mixed up. </p>
<p>If you are not already familiar with this debate, the aspect of Forum of Mathematics that many people dislike is that it will be funded by means of article processing charges (which I shall abbreviate to APCs) rather than subscriptions. For the next three years, these charges will be waived, but after that there will be a charge of £500 per article. Let me now consider a number of objections that people have to APCs.<br />
<span id="more-4540"></span></p>
<h3>It is just plain wrong to ask authors to pay to get their articles published.</h3>
<p>There are many variants of this argument. For instance, an analogy is often drawn with vanity publishing: do we want vanity publishing for mathematical articles? </p>
<p>Let me begin with the &#8220;it is just plain wrong&#8221; part. A number of people have said that they find APCs morally repugnant. However, that on its own is not an argument. It reminds me of some objections to stem cell research. Many people feel that that is wrong, regardless of any benefits that it might bring. Usually their objections are on religious grounds, though I imagine that even some non-religious people just feel instinctively that stem-cell research is wrong. In the case of APCs, I very much doubt that there is a <em>religious</em> objection, so I think everybody can agree that merely saying, &#8220;I find the idea horrible,&#8221; is not an argument until one has given a <em>reason</em> for its being horrible. It is scarcely necessary to say this, but some people have advanced the &#8220;it is obviously immoral&#8221; argument, so I am briefly mentioning it.</p>
<p>To be fair to the people who have said that it is immoral, they <em>have</em> gone on to give further arguments, so all I&#8217;m saying is that instead of <em>starting</em> from the position that it is immoral (as some people have), we should start by discussing the benefits and harms and conclude that it is immoral only if we find that the harmful consequences are unacceptable.</p>
<p>Actually, I myself very strongly agree with the assertion that it is wrong for authors to pay to have their articles published. Why? The main one is that it gives an advantage to rich authors. When we judge the research output of other mathematicians, we pay some attention to the quality of the journals that they have published in. If it turned out that rich mathematicians could publish in better journals than poor mathematicians, we would be introducing a completely irrelevant criterion, wealth, when all that should matter is the quality of the mathematics. The fact that it would be giving an advantage to people who are already advantaged makes things even worse.</p>
<p>So what am I doing on the editorial board of a journal that will in due course have article processing charges? There is no inconsistency here, because <em>authors will not pay to publish in Forum of Mathematics</em>.</p>
<p>There is a misconception here, which I have unfortunately helped to perpetuate. In <a href="http://gowers.wordpress.com/2012/07/02/a-new-open-access-venture-from-cambridge-university-press/#more-4356">my previous post about Forum of Mathematics</a> I made a bad mistake, which was to suggest that APC stood for &#8220;author publication charge&#8221; rather than &#8220;article processing charge&#8221;. Other people often refer to this method of financing a journal as the author-pays model. But it isn&#8217;t. An article processing charge is what it says &#8212; a charge for the processing of an article. So to call it an author-pays model is incorrect unless the APC is met by the author.</p>
<p>Forum of Mathematics will not under any circumstances expect authors to meet APCs out of their own pockets, and I would refuse to be an editor if it did. (I imagine the same holds for all the other editors.) Of course, it is one thing to say that authors are not expected to pay, and another to make sure that that never happens. Let me describe the safeguards that will be put in place.</p>
<p>First of all, when you submit an article, there will be no mention of APCs. The article will be processed in the normal way &#8212; sent out to referees, discussed by editors, etc. &#8212; and a decision will be made whether to accept or reject it. Since APCs have not been mentioned, the decision will be completely independent of financial considerations. (Of course, it will often be easy to guess whether an author can pay. But the editors will have absolutely no incentive to take this information into account. And if there were ever any pressure from CUP to be a bit more lenient to authors who were likely to be able to pay, I imagine that the entire editorial board would immediately resign.)</p>
<p>If your article is accepted, and if your institution is set up to meet APCs (as an increasing number of institutions already are) or they are covered by a grant that you are on, then you will arrange for your APC to be paid. Otherwise, CUP will ask for a letter from your institution stating that they are unwilling to pay the charge. No justification for this is required &#8212; just confirmation that it is the case. If you are not affiliated with an institution, then the charge will be automatically waived. </p>
<p>In short, you yourself won&#8217;t pay anything, and you won&#8217;t be expected to go to huge trouble applying for money to cover the APC. Either there will be a system in place for covering the charge, or you will need to organize for a simple letter to be sent. The worst that can be said for the effect this will have on you is that it will involve a bit more bureaucracy. But I don&#8217;t see that it will be any more time-consuming than correcting galley proofs, say. And even the bureaucracy should gradually become less necessary, since after a while CUP should be able to deal directly with the institutions that meet APCs and will know which ones don&#8217;t.</p>
<p>So if you regard APCs as immoral because you are imagining authors having to pay out of their own pockets, or authors from rich institutions having an advantage over authors from poor institutions, or authors having to go round with a begging bowl when they get an article accepted, or authors managing to get worthless articles published by paying money to unscrupulous publishers, then what you are objecting to does not apply to Forum of Mathematics. Maybe it will apply to other journals, and maybe that will be a problem: that is a question I&#8217;ll come back to.</p>
<h3>What??!! How can it cost £500 to process an article?</h3>
<p>There are two questions here. One is whether £500 will be a fair reflection of the costs that CUP will incur when processing Forum articles. The other is whether what they provide for those costs is worth paying for. The first question has a simple answer: it will. The answer to the second question is much less obvious, for which reason I want to postpone discussing it until the part of this post that will deal with the more serious objections to Forum of Mathematics.</p>
<p>So how can the costs reach anything like £500? I&#8217;ll talk in general terms here, and not specifically about Forum of Mathematics. There are many things that an academic journal does to a paper once it has gone through the refereeing process and been accepted. It does copy-editing, typesetting, addition of metadata, and making sure the article appears on various bibliographic databases. (I repeat that in this section I am not discussing whether we want all these things.) A typical cost for all this is around $20 per page. That&#8217;s just a fact: if you go round and ask people who work for conventional maths journals what it costs them per page to process an article, that is the kind of figure you will get. </p>
<p>At this point, you can do some calculations yourself. If an average article is 25 pages, that&#8217;s already $500, which is approximately the same order of magnitude as £500. Then you have to take into account a number of other factors, such as that it costs money to handle papers that are then rejected (not all that much, but even arXiv needs $7 per paper), and there will probably be several of those per accepted paper, that fees will be waived for some articles, that there will be staff costs and overheads (such as part of the cost of heating the building used by the staff &#8212; things like that), and so on. </p>
<p>For that kind of reason, it is a straightforward empirical fact that £500 is the right order of magnitude for the costs per article incurred by a journal that operates in roughly the same way as a current conventional print journal.</p>
<h3>Forum of Mathematics is even worse than Elsevier.</h3>
<p>Let&#8217;s think about what you are committing yourself to if you agree with this. First, the cost to the academic community of an article published in Forum of Mathematics is £500. What is the cost of an article published by Elsevier. This is harder to judge, for various reasons, but it seems to be at least an order of magnitude higher. Let me quote Mike Taylor <a href="http://www.guardian.co.uk/science/blog/2012/apr/17/persistent-myths-open-access-scientific-publishing">writing in the Guardian a few months ago</a>.</p>
<blockquote><p>For Elsevier, the biggest of the barrier-based publishers, we can calculate the total cost per article as £1,605m subscription revenue divided by 240,000 articles per year = £6,689 per article. By contrast, the cost of publishing an article with a flagship open access journal such as PLoS ONE is $1,350 (£850), about one eighth as much. No one expects open access to eliminate costs. But we can expect it to dramatically reduce them, as well as making research universally and freely available.</p></blockquote>
<p>I actually think that the &#8220;real&#8221; cost of the arrangement (which I won&#8217;t attempt to define here) is higher still, because Elsevier&#8217;s bundling arrangements mean that libraries are paying for a lot of articles that they don&#8217;t really want. Or perhaps what I should say is that while the average cost may be £6,689 per article, we should think of it as quite a lot more than that for the articles we want, and quite a lot less than that for the articles that we don&#8217;t want.</p>
<p>But even if we accept the figure of £6,689 as it stands, that&#8217;s a lot more than £500. So to show that Forum of Mathematics is worse than Elsevier, you need to establish that it is worth paying £6,189 per article to avoid the harm associated with submitting an article to Forum of Mathematics. </p>
<p>Let&#8217;s remind ourselves what that harm was: it was a little piece of extra bureaucracy that has to be gone through when you submit a paper. It isn&#8217;t any of those things that people like to imagine such as hard-up graduate students being shut out from the journal, faculty being unable to publish because their universities won&#8217;t cover the fee, authors paying substantial sums out of their own pockets, people using money to buy prestige, etc. Those would all be very bad things, worth fighting against. But you aren&#8217;t fighting against them by fighting against FoM. </p>
<p>If you say that FoM is worse than Elsevier, then you are saying that an hour of your time (to give a generous estimate for how long you would need to write and follow up on an email requesting either payment or a letter refusing payment) is worth £6,189 to the academic community at large. </p>
<h3>Authors are doing a service to the world, so making them pay is ridiculous.</h3>
<p>First, let me repeat that <em>authors will not pay to publish in FoM</em>. But let&#8217;s think about what the service is that authors do to the world. In some cases, they prove results that fascinate other mathematicians and stimulate a great deal of further research. That is undoubtedly doing a service. But that service is already done the moment they put their paper on the arXiv or their home page (assuming they do). So why do they bother to publish? </p>
<p>As I think everybody agrees, now that we have the internet, the main function left for journals is providing a stamp of quality. There is a big question about whether we actually need journals for that, but that question is independent of the question of who benefits from the service provided by journals. It is not the reader, since readers can quite happily look at preprints. The main person who benefits from the stamp of quality is the author, who boosts his or her CV and has a better chance when applying for jobs and so on. There is also some benefit to hiring committees, who can look at a publication list and get a quick sense of whether an author is publishing in good journals. </p>
<p>If you feel that APCs are wrong because if anything you as an author should be paid for the wonderful research you have done, I would counter that (i) it is not <em>journals</em> who should be paying you &#8212; they are helping you to promote yourself, and (ii) if your research is good, then you <em>will</em> be rewarded for it, by having a better career than you would have had without it. </p>
<p>Let me now turn to some arguments that I think have more merit to them.</p>
<h3>Maybe a typical article costs around £500 to process under the current system, but do we need what we get for that money?</h3>
<p>This is a much more serious question. While I&#8217;m discussing it, let me also highlight another misconception, which is that the editors of FoM regard it as a blueprint for the future of all of mathematical publishing. Maybe some of them do, but I don&#8217;t. There are two more modest ways in which it could be part of the picture: it might exist in its current form only as a temporary measure until newer and cheaper methods of assessment are developed and become accepted, or it might be that it and a few other journals would persist with traditional methods of processing articles but the bulk of mathematical publishing would be done much more cheaply, with minimal typesetting, copy-editing etc. </p>
<p>If traditional methods of processing articles do cost something like £500, whereas merely having an editorial and refereeing process should cost much much less (but not quite nothing, since there will be administrative costs), what is the argument for spending that much on the copy-editing and typesetting of articles that people find perfectly readable in their preprint form? </p>
<p>To my mind, the main argument is that moving from the current system to a radically new system is difficult unless there is a smooth path from one to the other. Imagine, for example, that somebody sets up an editorial board that does nothing except ask referees to report on papers on the arXiv, &#8220;accept&#8221; the papers it regards as good enough, and list those papers, with links, on a website. It seems to many people, including me, that such a board is doing pretty well all that we need of a mathematics journal. But suppose that a board of that kind were to be established, with the stated aim of competing directly with Journal of Functional Analysis, and that you were a postdoc trying to improve your publication list with a view to getting a good job somewhere. Wouldn&#8217;t you feel that it was safer to submit your paper to Journal of Functional Analysis than to the new &#8220;journal&#8221; that people reading your CV might not have heard of or might not trust? </p>
<p>I very much hope that ventures such as that will be set up, will be successful, will be trusted, and will look good on people&#8217;s CVs. But I think that that will take time. Meanwhile, FoM provides an option that is enough like a conventional journal that an article published there will look every bit as good as an article published in the journals it is competing with, and that is also open access and much cheaper to the academic community than a subscription journal.</p>
<p>In my ideal world, would every maths journal be run like FoM? Not at all. But to get to the ideal world, I think that it is going to be easier to persist with journals that are pretty conventional (but much cheaper) at least for now.</p>
<p>There is another argument in favour of what publishers currently do, which is that they help your paper appear on citation indexes, they give you journals with impact factors, and so on. I hate all that stuff: the measures are incredibly crude and far less useful than a well-written reference. I think most mathematicians share my distaste. But a lot of other scientists don&#8217;t seem to, and there is a danger that if mathematicians are perceived as &#8220;not really publishing&#8221; any more, then they will not be understood or taken seriously in situations where they are competing with people from other subjects. </p>
<p>I wish that argument would go away, and I hope that one day it will, but that&#8217;s an even bigger battle than the battle for reasonably priced journals.</p>
<h3>I don&#8217;t want traditional-style journals with APCs. I want much more radical change.</h3>
<p>I basically agree with this, but as I argued in the previous section, I think that there is a case for having APCs at least as a transitional arrangement. There is another, and to my mind stronger, argument for this, which is that APC-based journals are much more vulnerable if a better model comes along. The faults of the current subscription model have been obvious for years, but it has been very hard to do anything about it because of bundling, which means that you can&#8217;t easily cancel subscriptions. (For a great description of the problem, try <a href="http://johncarlosbaez.wordpress.com/2012/10/17/elsevier-strangling-libraries-worldwide/">this blog post</a> of John Baez.) Suppose now that we lived in a world where all maths journals were open access and funded by article processing charges. And suppose that a lot of mathematicians decided that they were perfectly happy to publish in different ways &#8212; free electronic journals, arxiv overlay boards, or whatever. Then they could simply publish in those different ways. If you publish in a different way at the moment, your poor old library is still locked into all those expensive subscriptions, but if you publish in a different way in a world full of APC-based open access journals, then whoever would have had to pay the APC no longer has to.</p>
<p>I had a horrible fantasy the other day, when it occurred to me that publishers could try to reintroduce the bundling concept in connection with APCs. Suppose that Elsevier made an offer to a university that for a flat fee all academics at that university could publish free in Elsevier journals for the next five years. If the flat fee was set in such a way that the university expected to save money, then it would be a tempting offer. But what would happen then? The university would say to its academics, &#8220;If you have the choice between an Elsevier journal and a comparable journal published by someone else, please go for the Elsevier journal.&#8221; And once Elsevier (and other big publishers with similar arrangements) had driven the smaller journals out of business, it could start upping the fees, and it would be very difficult for new journals to compete. In other words, the major problem with subscription journals could be reborn in a new guise.</p>
<p>However, forewarned is forearmed. Now that we know that bundling arrangements, however tempting in the short term, are ruinous in the long term, we can tell our universities to have nothing to do with them. Any sign that a publisher is trying to introduce them can be met with widespread negative publicity. And I think that if this nightmare did eventually come to pass, mathematicians could have moved on to better publication systems before they were affected by it.</p>
<h3>Maybe FoM&#8217;s waiver policy is OK, but by associating yourself with FoM, you are indirectly conferring legitimacy on many journals with much worse policies.</h3>
<p>This is a danger, I&#8217;ll admit. However, I think that the right way to counter this danger is not to campaign against the principle of article processing charges itself, but to campaign for certain safeguards to apply to any journal that has such charges. Here is a possibly incomplete list of safeguards.</p>
<p>1. Editorial decisions should be completely independent of financial considerations. If the editors decide that a paper is good enough to be accepted, then it will be published. Ideally, editors should not know, when they handle a paper, whether the author has access to funds for article processing charges. [I say "ideally" simply because there will be situations where an author's institution's policy is known to an editor. For example, it seems that in the UK, as a result of government mandates, all universities will be obliged to have a pot of money for paying APCs, and an editor may well know that an author is British.]</p>
<p>2. Under no circumstances should there ever be any advantage to an author who is happy to pay an article processing charge out of his/her own pocket. </p>
<p>3. An author at an institution that is willing to pay article processing charges should not be at any advantage over an author at an institution that is not willing to pay article processing charges.</p>
<p>4. The article processing charges should be set at the level needed to cover reasonable costs of the publisher (including overheads and possibly a modest profit for the purposes of reinvestment). </p>
<p>As I have argued above, Forum of Mathematics has these four safeguards in place. The fourth one is perhaps less obviously essential than the others, for two reasons. One is that some institutions, such as learned societies, might want to make larger profits in order to support their activities (and perhaps replace lost subscription revenue). Another is that one might hope that market forces would operate more efficiently. If a subscription journal is outrageously expensive, bundling makes it hard to do anything about it, but if a journal charges outrageous APCs, it is easy (in many cases) to avoid publishing in that journal.</p>
<p>What I would like to see is (cautious) support for journals with safeguards like 1-4 in place, and strong criticism of journals that manifestly don&#8217;t &#8212; which in my case would probably include adding them to the list of journals that I am boycotting. </p>
<hr />
<p>As always, there is much more that I could say, but I think I&#8217;ll end it there. Before I finish, I would like to mention that this post will be followed soon by a companion post entitled &#8220;Why I&#8217;ve also joined the good guys.&#8221; If the idea of APCs still sticks in your craw, then you will find that post more to your taste. </p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4540/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4540/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4540&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2013/01/14/why-ive-joined-the-bad-guys/feed/</wfw:commentRss>
		<slash:comments>76</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>What maths A-level doesn&#8217;t necessarily give you</title>
		<link>http://gowers.wordpress.com/2012/11/20/what-maths-a-level-doesnt-necessarily-give-you/</link>
		<comments>http://gowers.wordpress.com/2012/11/20/what-maths-a-level-doesnt-necessarily-give-you/#comments</comments>
		<pubDate>Tue, 20 Nov 2012 13:16:38 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[Mathematical pedagogy]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4693</guid>
		<description><![CDATA[I had a mathematical conversation yesterday with a 17-year-old boy who is in his second year of doing maths A-level. Although a sample of size 1 should be treated with caution, I&#8217;m pretty sure that the boy in question, who is very intelligent and is expected to get at least an A grade, has been [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4693&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>I had a mathematical conversation yesterday with a 17-year-old boy who is in his second year of doing maths A-level. Although a sample of size 1 should be treated with caution, I&#8217;m pretty sure that the boy in question, who is very intelligent and is expected to get at least an A grade, has been taught as well as the vast majority of A-level mathematicians. If this is right, then what I discovered from talking to him was quite worrying.</p>
<p>The purpose of the conversation was to help him catch up with some work that he had missed through illness. The particular topics he wanted me to cover were integrating <img src='http://s0.wp.com/latex.php?latex=%5Clog+x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;log x' title='&#92;log x' class='latex' />, or <img src='http://s0.wp.com/latex.php?latex=%5Cln+x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ln x' title='&#92;ln x' class='latex' /> as he called it, and integration by parts. (Actually, after I had explained integration by parts to him, he told me that that hadn&#8217;t been what he had meant, but I don&#8217;t think any harm was done.) But as we were starting, he asked me why the derivative of <img src='http://s0.wp.com/latex.php?latex=e%5Ex&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e^x' title='e^x' class='latex' /> was <img src='http://s0.wp.com/latex.php?latex=e%5Ex&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e^x' title='e^x' class='latex' />, and what was special about <img src='http://s0.wp.com/latex.php?latex=e&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e' title='e' class='latex' />.<br />
<span id="more-4693"></span></p>
<p>That seemed like a good preliminary conversation to have, so I said, &#8220;OK, let&#8217;s try to differentiate <img src='http://s0.wp.com/latex.php?latex=e%5Ex&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e^x' title='e^x' class='latex' /> from first principles and see what happens.&#8221; He didn&#8217;t know what I meant by &#8220;from first principles&#8221; so I tried to give him a nudge, by saying, &#8220;If you didn&#8217;t know the derivative of <img src='http://s0.wp.com/latex.php?latex=e%5Ex&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e^x' title='e^x' class='latex' />, then how would you go about working it out?&#8221;</p>
<p>At this point, he suggested <img src='http://s0.wp.com/latex.php?latex=xe%5E%7Bx-1%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='xe^{x-1}' title='xe^{x-1}' class='latex' />. To be fair to him, he wasn&#8217;t saying that he knew that this was correct. Nevertheless, this was an interesting piece of cognitive dissonance, given that we were trying to understand why the derivative of <img src='http://s0.wp.com/latex.php?latex=e%5Ex&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e^x' title='e^x' class='latex' /> was <img src='http://s0.wp.com/latex.php?latex=e%5Ex&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e^x' title='e^x' class='latex' />. But what bothered me more than the fact that he said it was the fact that he couldn&#8217;t see why it wasn&#8217;t right. And what bothered me perhaps even more than that was the fact that he should think of &#8220;from first principles&#8221; as mechanically applying the <img src='http://s0.wp.com/latex.php?latex=x%5En%5Crightarrow+nx%5E%7Bn-1%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x^n&#92;rightarrow nx^{n-1}' title='x^n&#92;rightarrow nx^{n-1}' class='latex' /> rule.</p>
<p>In an effort to get past that, I asked, &#8220;Yes, but what does the derivative actually <em>mean</em>?&#8221; He had no answer. So I drew a graph of an arbitraryish function, labelled it <img src='http://s0.wp.com/latex.php?latex=y%3Df%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y=f(x)' title='y=f(x)' class='latex' />, drew a point on the curve, and asked him what the derivative meant. I think he did then say that it was the gradient of the curve at that point. (I don&#8217;t think he used the word &#8220;tangent&#8221;.) I asked how we could go about working that out. He suggested <img src='http://s0.wp.com/latex.php?latex=y%2Fx&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y/x' title='y/x' class='latex' />. I said, &#8220;So to work out the derivative you just divide by <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> &#8212; is that it?&#8221; He laughed and said no. </p>
<p>It was time to go back to basics, so I asked him how you work out the gradient of a straight line. He said, &#8220;Rise over run&#8221;, terminology that I either never knew or had completely forgotten &#8212; but the meaning was obvious. I then asked what the difficulty was when the line wasn&#8217;t straight, to which he replied that the gradient was changing all the time. So what could we do about that? He suggested taking a point not too far away and working out the slope of the line joining that to the point in question.</p>
<p>Now we were getting somewhere. I had already drawn a line segment going up from a point on the x-axis marked <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> until it hit the curve. I did the same for a line segment going up from a point marked <img src='http://s0.wp.com/latex.php?latex=x%2Bh&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x+h' title='x+h' class='latex' /> and asked what the rise and run were. He correctly got the answers <img src='http://s0.wp.com/latex.php?latex=f%28x%2Bh%29-f%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(x+h)-f(x)' title='f(x+h)-f(x)' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=h&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='h' title='h' class='latex' />. I then said that as <img src='http://s0.wp.com/latex.php?latex=h&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='h' title='h' class='latex' /> got smaller and smaller, the curve got more and more straight, so </p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Clim_%7Bh%5Crightarrow+0%7D%5Cfrac%7Bf%28x%2Bh%29-f%28x%29%7Dh&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;displaystyle &#92;lim_{h&#92;rightarrow 0}&#92;frac{f(x+h)-f(x)}h' title='&#92;displaystyle &#92;lim_{h&#92;rightarrow 0}&#92;frac{f(x+h)-f(x)}h' class='latex' /></p>
<p>was a formula for the derivative <img src='http://s0.wp.com/latex.php?latex=f%27%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f&#039;(x)' title='f&#039;(x)' class='latex' />. </p>
<p>&#8220;Have you really never seen that before?&#8221; I asked. He denied it, but when I pressed him, he eventually conceded that he probably had seen it, but that if so, then it would have been clearly flagged up as something that he didn&#8217;t need to know, because there wouldn&#8217;t be questions about it on the exam.</p>
<p>It&#8217;s <em>that</em> that bothered me enough to make me need to get it off my chest in the form of a blog post. It hardly needs spelling out what was wrong with the argument that his teacher gave (if his teacher did in fact give that argument &#8212; I cannot be certain about this, though the mere fact that that was the message that got across is bad enough) but I&#8217;m going to anyway. Let&#8217;s suppose that your aim is simply to do well at maths A-level and that there are no questions that test your familiarity with the formula for the derivative of an arbitrary (nice) function at an arbitrary point. Which is better?</p>
<p>1. Don&#8217;t make any effort to learn and understand the formula, but simply learn a few basic examples of derivatives (polynomials, exponentials, logs, trig functions) and rules for differentiating combinations (linearity, product rule, quotient rule, chain rule) and you should be able to differentiate anything that comes up in the exams.</p>
<p>2. Learn what the derivative means, derive the formula for the derivative of an arbitrary function at an arbitrary point, calculate a few derivatives from first principles, derive the product rule, quotient rule and chain rule, and then learn how to use them to differentiate combinations.</p>
<p>The answer is that if you are capable of doing 2, then 2 is far better. And the boy I was talking to was certainly capable of doing 2. Why is it better? Because (and this is something I plan to devote a blog post to at some point) memory works far better when you learn <em>networks of facts</em> rather than facts in isolation. If you don&#8217;t really understand what differentiation is all about, then the fact that the derivative of <img src='http://s0.wp.com/latex.php?latex=x%5E3&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x^3' title='x^3' class='latex' /> is <img src='http://s0.wp.com/latex.php?latex=3x%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='3x^2' title='3x^2' class='latex' /> is a completely different fact from the fact that the derivative of <img src='http://s0.wp.com/latex.php?latex=e%5Ex&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e^x' title='e^x' class='latex' /> is <img src='http://s0.wp.com/latex.php?latex=e%5Ex&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e^x' title='e^x' class='latex' />. But if you&#8217;ve derived them both from first principles (I&#8217;ll come back to what I said about <img src='http://s0.wp.com/latex.php?latex=e%5Ex&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e^x' title='e^x' class='latex' /> in a moment), then they are related: we have a process we do to the functions <img src='http://s0.wp.com/latex.php?latex=x%5E3&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x^3' title='x^3' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=e%5Ex&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e^x' title='e^x' class='latex' /> and this is what comes out. Of course, another reason is that if you forget something, you have a chance of rederiving it, but that&#8217;s a slightly different point. Your knowledge of a piece of maths is far more grounded if you know how it is derived, or at least have some memory of the derivation, even if you have no problem remembering the fact in question. Even if you forget the details of the derivations, just having seen them has a major effect on binding together the facts you know.</p>
<p>After I had explained differentiation in the abstract, I suggested that we should try differentiating <img src='http://s0.wp.com/latex.php?latex=x%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x^2' title='x^2' class='latex' /> from first principles. Or as I put it, &#8220;Let&#8217;s apply this formula in the case <img src='http://s0.wp.com/latex.php?latex=f%28x%29%3Dx%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(x)=x^2' title='f(x)=x^2' class='latex' />. What do we get then?&#8221; To my further dismay, he didn&#8217;t immediately know what to do. &#8220;If <img src='http://s0.wp.com/latex.php?latex=f%28x%29%3Dx%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(x)=x^2' title='f(x)=x^2' class='latex' />, then what is <img src='http://s0.wp.com/latex.php?latex=f%28x%2Bh%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(x+h)' title='f(x+h)' class='latex' />?&#8221; I asked. I can&#8217;t remember what his response was, but it wasn&#8217;t <img src='http://s0.wp.com/latex.php?latex=%28x%2Bh%29%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(x+h)^2' title='(x+h)^2' class='latex' />. He floundered and made wild guesses, not really understanding what I was asking. Again, something quite serious seemed not to have been done by his school, though I couldn&#8217;t give a precise diagnosis in this case &#8212; something along the lines of understanding the notion of a function well enough to talk about an abstract function <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> and see that it could have many instantiations.</p>
<p>Anyhow, once that point was cleared up (not necessarily for good, but for then anyway) we got through the differentiation of <img src='http://s0.wp.com/latex.php?latex=x%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x^2' title='x^2' class='latex' /> without further trouble. Again he denied having seen that derivation &#8212; maybe it was at that point that he said the thing about not needing to know it for the exam. </p>
<p>The general point here is of course that A-levels have got easier and schools have a natural tendency to teach to the test. If just one of those were true, it would be far less of a problem. I would have nothing against an easy A-level if people who were clever enough were given a much deeper understanding than the exam strictly required (though as I&#8217;ve argued above, for many people teaching to the test is misguided even on its own terms, since they will do a lot better on the exam if they have <em>not</em> been confined to what&#8217;s on the test), and I would not be too against teaching to the test if the test was hard enough. </p>
<p>How about differentiating <img src='http://s0.wp.com/latex.php?latex=e%5Ex&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e^x' title='e^x' class='latex' />? Well, after a couple of false starts we got to the expression</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Clim_%7Bh%5Cto+0%7D%5Cfrac+%7Be%5E%7Bx%2Bh%7D-e%5Ex%7Dh&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;displaystyle &#92;lim_{h&#92;to 0}&#92;frac {e^{x+h}-e^x}h' title='&#92;displaystyle &#92;lim_{h&#92;to 0}&#92;frac {e^{x+h}-e^x}h' class='latex' /></p>
<p>I asked what could be done with <img src='http://s0.wp.com/latex.php?latex=e%5E%7Bx%2Bh%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e^{x+h}' title='e^{x+h}' class='latex' />. Quite a bit of prompting was needed to get him to say <img src='http://s0.wp.com/latex.php?latex=e%5Exe%5Eh&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e^xe^h' title='e^xe^h' class='latex' />. Then I asked what could be done with <img src='http://s0.wp.com/latex.php?latex=e%5Exe%5Eh-e%5Ex&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e^xe^h-e^x' title='e^xe^h-e^x' class='latex' />. A lot of prompting was needed to get the answer to that question. (I had to ask what he would do with <img src='http://s0.wp.com/latex.php?latex=uv-u&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='uv-u' title='uv-u' class='latex' />.) Anyhow, we eventually got to</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+e%5Ex%5Clim_%7Bh%5Cto+0%7D%5Cfrac%7Be%5Eh-1%7Dh&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;displaystyle e^x&#92;lim_{h&#92;to 0}&#92;frac{e^h-1}h' title='&#92;displaystyle e^x&#92;lim_{h&#92;to 0}&#92;frac{e^h-1}h' class='latex' /></p>
<p>I decided just to point out that the last limit was the derivative of <img src='http://s0.wp.com/latex.php?latex=e%5Ex&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e^x' title='e^x' class='latex' /> at 0. I also pointed out that the entire argument so far worked just as well for the function <img src='http://s0.wp.com/latex.php?latex=a%5Ex&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a^x' title='a^x' class='latex' />, whatever the (positive) value of <img src='http://s0.wp.com/latex.php?latex=a&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a' title='a' class='latex' />. I then drew some pictures for different <img src='http://s0.wp.com/latex.php?latex=a%5Ex&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a^x' title='a^x' class='latex' />, pointing out that some of them crossed the y-axis with a slope less than 1 and others with a slope greater than 1 and that <img src='http://s0.wp.com/latex.php?latex=e%5Ex&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e^x' title='e^x' class='latex' /> was the one where it actually equalled 1. He asked me <em>why</em> the slope was exactly 1 for <img src='http://s0.wp.com/latex.php?latex=e%5Ex&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e^x' title='e^x' class='latex' />, which was a good opportunity for me to try to explain that that was getting things the wrong way round, and that <img src='http://s0.wp.com/latex.php?latex=e&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e' title='e' class='latex' /> was chosen precisely to make that work. (Of course, the question would make very good sense the way he asked it if we had already defined <img src='http://s0.wp.com/latex.php?latex=e&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e' title='e' class='latex' /> in some other way, but I&#8217;m not sure he had. Certainly, I remember this as the definition of <img src='http://s0.wp.com/latex.php?latex=e&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e' title='e' class='latex' /> when I was at school, and I remember feeling slightly uncomfortable about it.) </p>
<p>There was plenty more of our conversation, though not much more to say about it. I accidentally fell into a derivation of the product rule, which again I think he had not seen. That was part of my preparation for deriving the formula for integration by parts. When I had done that, I went through an example or two. One of the examples I tried was <img src='http://s0.wp.com/latex.php?latex=x%5Csin+x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;sin x' title='x&#92;sin x' class='latex' /> from <img src='http://s0.wp.com/latex.php?latex=0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='0' title='0' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%5Cpi%2F2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pi/2' title='&#92;pi/2' class='latex' />. (He thought <img src='http://s0.wp.com/latex.php?latex=%5Ccos%28%5Cpi%2F2%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;cos(&#92;pi/2)' title='&#92;cos(&#92;pi/2)' class='latex' /> was <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7B2%7D%2F2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{2}/2' title='&#92;sqrt{2}/2' class='latex' />, by the way, but was OK when we did it in degrees.) We got the answer <img src='http://s0.wp.com/latex.php?latex=%5Cint_0%5E%7B%5Cpi%2F2%7Dx%5Csin+xdx%3D1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;int_0^{&#92;pi/2}x&#92;sin xdx=1' title='&#92;int_0^{&#92;pi/2}x&#92;sin xdx=1' class='latex' />. I then felt annoyed not to be able to see why the answer had to be 1. I still haven&#8217;t got round to thinking about that. </p>
<p>I also discussed integrating <img src='http://s0.wp.com/latex.php?latex=%5Clog+x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;log x' title='&#92;log x' class='latex' /> by the method I call guess-and-adjust. You guess <img src='http://s0.wp.com/latex.php?latex=x%5Clog+x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;log x' title='x&#92;log x' class='latex' /> because one part of what the product rule gives you is correct and you might be able to deal with the other part. Differentiating you get <img src='http://s0.wp.com/latex.php?latex=%5Clog+x+%2B+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;log x + 1' title='&#92;log x + 1' class='latex' />. &#8220;What can we do to get rid of that 1?&#8221; He suggested <img src='http://s0.wp.com/latex.php?latex=x%5Clog+x+-1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;log x -1' title='x&#92;log x -1' class='latex' />. We tried that, saw why it didn&#8217;t work, and then got to the right answer.</p>
<p>He managed to integrate <img src='http://s0.wp.com/latex.php?latex=xe%5Ex&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='xe^x' title='xe^x' class='latex' /> between <img src='http://s0.wp.com/latex.php?latex=a&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a' title='a' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=b&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='b' title='b' class='latex' /> with no help at all, so I think he got the basic idea, though whether he&#8217;ll hold on to it I don&#8217;t know. </p>
<p>Another thing I discovered was that he was very shaky on the chain rule. When I asked him what the chain rule said, he didn&#8217;t know what I was talking about. Eventually I got a glimmer of recognition out of him by writing down <img src='http://s0.wp.com/latex.php?latex=%5Cfrac%7Bdz%7D%7Bdx%7D%3D%5Cfrac%7Bdz%7D%7Bdy%7D%5Cfrac%7Bdy%7D%7Bdx%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;frac{dz}{dx}=&#92;frac{dz}{dy}&#92;frac{dy}{dx}' title='&#92;frac{dz}{dx}=&#92;frac{dz}{dy}&#92;frac{dy}{dx}' class='latex' />. But the idea that if you want to differentiate <img src='http://s0.wp.com/latex.php?latex=e%5E%7Bx%5E3%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e^{x^3}' title='e^{x^3}' class='latex' /> you first pretend that <img src='http://s0.wp.com/latex.php?latex=x%5E3&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x^3' title='x^3' class='latex' /> is a single variable with respect to which you are differentiating and then correct what you&#8217;ve just done by multiplying by the derivative of <img src='http://s0.wp.com/latex.php?latex=x%5E3&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x^3' title='x^3' class='latex' /> was completely foreign to him. We looked at a few examples but they&#8217;ll need reinforcing at some point. It was yet another illustration of the general principle that if you forget about understanding what&#8217;s going on and concentrate on mechanical manipulations, you&#8217;ll forget how to do even the mechanical manipulations.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4693/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4693/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4693&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2012/11/20/what-maths-a-level-doesnt-necessarily-give-you/feed/</wfw:commentRss>
		<slash:comments>116</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>What actually happened</title>
		<link>http://gowers.wordpress.com/2012/11/09/what-actually-happened/</link>
		<comments>http://gowers.wordpress.com/2012/11/09/what-actually-happened/#comments</comments>
		<pubDate>Fri, 09 Nov 2012 09:12:32 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4626</guid>
		<description><![CDATA[The short version is that I&#8217;ve had the ablation (see previous post) and the surgeon who did it says that he has a good feeling about it. It&#8217;s taken till now to write this because, unlike most people who have ablations, I felt terrible for two days after it &#8212; with a headache (normal) and [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4626&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>The short version is that I&#8217;ve had the ablation (see previous post) and the surgeon who did it says that he has a good feeling about it.  It&#8217;s taken till now to write this because, unlike most people who have ablations, I felt terrible for two days after it &#8212; with a headache (normal) and a fever (less normal but not unheard of). The fever was not very high, but high enough to be unpleasant, and meant that the only thing I could bear to do was go to bed, except that on the second night after the operation I had to spend part of the night sitting up on a sofa because my chest hurt too much when I was horizontal. (That was normal, and nothing to worry about.) So today is the first day that I am well enough to do anything as strenuous as writing a blog post.<br />
<span id="more-4626"></span></p>
<p>One of the first things I found out when I arrived at the hospital was that I was having the ablation done under a general anaesthetic. I was quite surprised by this but the surgeon came and explained that he was expecting it to take quite a long time, and that he thought he would be able to do a better job if I was under a general anaesthetic. Since that was my top priority, I was happy to agree, though a little disappointed that I wouldn&#8217;t have an interesting and unusual experience that I was expecting to have. (That reasoning was conditional on my needing to have the ablation in the first place &#8212; obviously it&#8217;s not the sort of experience I would seek out otherwise.)</p>
<p>So I spent the morning being prepared in various ways. I had sticky pads attached to me so that I could be monitored, I had a canula inserted into my left wrist, and I was required to put on hilariously unsexy garments &#8212; long white socks with a hole at the end for the toes to stick out (which squeeze the leg hard and reduce the chance of an embolism), &#8220;paper pants&#8221;, which after the operation were half cut off, and a hospital gown that opened at the back. Another thing I had to do was discuss the risks and sign a consent form. The risks sounded worse than I thought &#8212; quite a long list, and quite a number of items being given as 1 to 2 percent. The most worrying was permanent damage to the phrenic nerve, a nerve that takes signals from the brain to the diaphragm. The registrar I spoke to wasn&#8217;t very specific, but I think that would have left me short of breath for the rest of my life. That was one of the 1 to 2 percent risks. I&#8217;ve just found a paper on the web that says that the risk is between 0.11% and 0.48% (whatever that means). Anyhow, I signed the form. I was later asked twice to confirm that it was my signature. </p>
<p>Eventually the time came and I was wheeled to the operating theatre. I was given a drug that made me feel pleasantly woozy, and after a couple of minutes of that I was out.</p>
<p>Next thing I knew, I was back in my room. My wife was there, and I thought to myself, &#8220;Great &#8212; survived that.&#8221; I can&#8217;t remember whether my surgeon was there too or whether he arrived soon afterwards. Anyhow, he told me it had gone well.</p>
<p>The next night was fairly awful, because they were worried that my blood pressure was low, so they came to measure it once an hour. I was fairly uncomfortable, so I had a succession of hours where I would be woken up, would get to sleep with some difficulty, and would be woken up again. So the next day I felt pretty awful, but assumed it was probably the night I&#8217;d had. The best part of the night was watching the moment the US election was called for Barack Obama &#8212; I was awake, and someone else was listening to it so loudly that I could hear what was going on, so there wasn&#8217;t much to lose by watching it myself. </p>
<p>The low point of the whole experience was when the registrar who had explained to me the risks (and also put in a canula with very shaky hands) said to me a rather matter-of-fact way that they had put a stent in one of my arteries. He explained to me that this meant I would be on a certain drug for the next year. I was quite surprised, to say the least. I asked why, and he explained that the artery in question was quite narrow. I said something like, &#8220;You mean it was dangerously narrow and they had to do it as an emergency procedure?&#8221; and he suddenly said, &#8220;Oh wait, hang on, I&#8217;ve got the wrong notes.&#8221; He came back with the right notes and helpfully explained to me that I had had an ablation. But for a couple of minutes I genuinely believed that I had had a stent fitted.</p>
<p>I was discharged in the early evening the next day, but then had another difficult night (mentioned above), a day in bed doing nothing, a better night, and now part of a morning sitting writing this. But I&#8217;m beginning to feel like going back to bed for a while. </p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4626/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4626/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4626&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2012/11/09/what-actually-happened/feed/</wfw:commentRss>
		<slash:comments>28</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>Mathematics meets real life</title>
		<link>http://gowers.wordpress.com/2012/11/05/mathematics-meets-real-life/</link>
		<comments>http://gowers.wordpress.com/2012/11/05/mathematics-meets-real-life/#comments</comments>
		<pubDate>Mon, 05 Nov 2012 16:31:04 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4616</guid>
		<description><![CDATA[I&#8217;ve been in two minds about whether to post this. On the one hand, I try to keep personal matters out of this blog &#8212; though there has been the occasional exception &#8212; but on the other hand I have a topic that fits quite nicely with some of what I&#8217;ve been writing about recently, [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4616&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>I&#8217;ve been in two minds about whether to post this. On the one hand, I try to keep personal matters out of this blog &#8212; though there has been the occasional exception &#8212; but on the other hand I have a topic that fits quite nicely with some of what I&#8217;ve been writing about recently, since it concerns a fairly important medical decision that I have had to make based on what felt like inadequate information. Since that is quite an interesting situation from a mathematical point of view, and even a philosophical point of view, and since most people have to make similar decisions at some point in their lives, I have opted to write the post.</p>
<p>The background is that over the last fifteen years or so I have had occasional bouts of <a href="http://en.wikipedia.org/wiki/Atrial_fibrillation">atrial fibrillation</a>, a condition that causes the heart to beat irregularly and not as strongly as it should. It is quite a common condition: I&#8217;ve just read that 2.3% of people over the age of 40 have it, and 5.9% of people over 65. Some people have no symptoms. I myself have mild symptoms &#8212; I can feel a slightly strange, and instantly recognisable, feeling in my chest, and I experience a few seconds of dizziness almost every time I stand up from a relaxed seated position &#8212; otherwise known as <a href="http://en.wikipedia.org/wiki/Orthostatic_hypotension">orthostatic hypotension</a>, which I often used to get anyway (as do many people).<br />
<span id="more-4616"></span></p>
<p>I would gladly live with those symptoms, but unfortunately that&#8217;s not all there is to it. When a heart is in atrial fibrillation, it is not beating as efficiently as it should, and a little pool of blood can form that doesn&#8217;t get pumped away. And if that happens, it can form a clot. And if your heart then goes back into sinus rhythm (that is, it starts to beat normally again), that clot can get pumped out into your bloodstream and wreak havoc: in particular, it can lead to a stroke. I know this all too well, because my father had a severe stroke for exactly that reason in 2001. </p>
<p>Until 2004, my bouts of atrial fibrillation, of which I think there were two, were several years apart and lasted a few hours each. But then in early 2004 I went into atrial fibrillation and didn&#8217;t come out anything like so quickly. To decrease the risk of stroke, I had injections of a blood-thinning drug called Heparin (or at least, that&#8217;s what it&#8217;s called in the UK) into my stomach, once a day. But then I was put on to <a href="http://en.wikipedia.org/wiki/Warfarin">Warfarin</a>, a drug that is used in rat poison. When a rat eats Warfarin, it goes off and has a haemhorrage and dies, but if you take just the right amount, you can thin your blood to the point where it is not dangerously thin, but atrial fibrillation is less likely to cause clots. The appropriate dose varies widely from person to person, so they have to increase it very gradually, testing your blood several times, until you reach the right INR (international normalized ratio), which roughly speaking means the ratio of the time your blood takes to clot to the time it would take to clot if you didn&#8217;t thin it. The recommended INR for people with atrial fibrillation is between 2 and 3. </p>
<p>When my atrial fibrillation ended in 2004, I was at an early stage of the process of getting the dose right. I asked my doctor whether that meant that I had in fact been more or less unprotected at the critical moment, and the answer was yes. So a completely standard procedure &#8212; coming off the Heparin before the correct Warfarin dose was established &#8212; had a very obvious defect. Fortunately, I didn&#8217;t have a stroke. (The probability was quite small, but even so.) That&#8217;s nothing compared with my father&#8217;s experience: he had his stroke after being advised by his cardiologist to come off Warfarin, a recommendation that other doctors told him later made no sense at all. </p>
<p>Let me try to fast forward to the present day. I have had quite long periods without AF, sometimes as long as two or three years, but I have also had two periods where I went into AF and it became pretty clear that I wasn&#8217;t going to come out of it again spontaneously. This, by the way, is very normal &#8212; once it starts, it gets gradually worse. Twice I had an electrical cardioversion: you go under a general anaesthetic and are given an electric shock that stops your heart, and when it starts again, if you are lucky it starts in sinus rhythm. When you have an electrical cardioversion, they try three times before giving up. I was lucky both times and went into sinus rhythm after just one shock. </p>
<p>The first cardioversion lasted me three years, apart from one 24-hour bout that ended of its own accord. The second cardioversion was in June, but in early September I went back into atrial fibrillation.</p>
<p>So what is the decision I have recently made? Well, back in 2004, when I first went to visit the person who is now my regular cardiologist, he told me about an operation called a <a href="http://en.wikipedia.org/wiki/Catheter_ablation">catheter ablation</a>. This is a procedure where the surgeon puts a wire into your leg and up an artery all the way to your heart. The tip of the wire then burns a bit of the surface of your heart, which causes it to form scar tissue that doesn&#8217;t conduct some faulty electrical signals that are responsible for the atrial fibrillation. At the time, my cardiologist said that it offered the prospect of a permanent cure for atrial fibrillation, but that it was probably best to wait, since the operation was relatively new and improving all the time. </p>
<p>We occasionally discussed the operation in the intervening years, and then in June he arranged an appointment with somebody who specializes in catheter ablations and performs them at Papworth Hospital, which is near Cambridge and is famous for being where the UK&#8217;s first successful heart transplant was carried out. This second specialist told me the following things (some of which I had read on the internet already).</p>
<p>1. I was probably progressing from &#8220;paroxysmal AF&#8221; (occasional bouts) to &#8220;permanent AF&#8221; (what it sounds like). </p>
<p>2. Catheter ablation is more effective against paroxysmal AF.</p>
<p>3. Now that more data is available, it has become clear that catheter ablation is not after all a permanent cure, but it might delay the progression of AF by five to ten years or something like that.</p>
<p>4. It is more effective the younger you are when you have it, with the decline in effectiveness quite high at my age, so in his opinion I should have it done sooner rather than later.</p>
<p>5. It has only a 60-70% chance of working at all.</p>
<p>6. It carries risks.</p>
<p>Most of that was pretty negative news, so I left the consultation not entirely sure what to do, though thinking I probably ought to have the operation. But the risks seemed pretty serious &#8212; about a 1% risk of a major complication &#8212; so I wanted to think a bit harder about them.</p>
<p>The two that bothered me most (and still do) are stroke and death. There are other serious things that can go wrong, but if their effects are temporary, then for me that puts them in a different league from a stroke, which could end my productive life, and death, which would end my life altogether. </p>
<p>The risk of death is put at one in a thousand, and this is where things get interesting. How worried should I be about a 0.1% risk? How do I even think about that question? Perhaps if my life expectancy from now on is around 30 years, I should think of this as an expected loss of 30/1000 years, or about 10 days. That doesn&#8217;t sound too bad &#8212; about as bad as having a particularly nasty attack of flu. But is it right to think about it in terms of expectations? I feel that the distribution is important: I would rather have a guaranteed loss of ten days than a 1/1000 chance of losing 30 years. </p>
<p>In the end, what convinced me that I shouldn&#8217;t worry too much about this risk was looking up what the risk of death is anyway over, say, the next year. I found <a href="http://www.medicine.ox.ac.uk/bandolier/booth/Risk/dyingage.html">on this site</a> that the average risk of death in the UK for a man between 45 and 54 is 1/279, much higher than 1/1000. So if I am worried about a 1/1000 mortality rate from an operation, I should be about as worried that I will die from some other cause over the next four months or so. And yet I don&#8217;t lose any sleep over that possibility. </p>
<p>But maybe the problem is that I am concentrating four months&#8217; risk into a few hours. Doesn&#8217;t that change everything? </p>
<p>Yes it would if I was planning to have lots of catheter ablations, but this is much more of a one-off event (though quite a few people have to have it done two or three times before it works). That makes a significant difference. For example, if aeroplane flights carried a 1/1000 mortality risk, that would be completely unacceptable, since some people take enough flights that all those risks would combine to create a near certainty of dying. So what matters in addition to the risk of the operation is the fact that I will have it at most a very small number of times. Maybe a rough rule of thumb is that I shouldn&#8217;t be too concerned, since on average my frequency of having this operation will be significantly less than once every four months.</p>
<p>Another possible counterargument is that for various reasons I am probably less likely than average for my age to die over the next year &#8212; by most people&#8217;s standards I am well off, I am in generally good health, I don&#8217;t smoke, and so on. However, I think that many of those factors also reduce my chances of major complications from a catheter ablation, so I&#8217;m inclined to guess that the validity of the rough calculation above is not hugely affected.</p>
<p>What about the risk of stroke? That brings me to something I haven&#8217;t yet mentioned. Even if a 1/1000 risk of death isn&#8217;t something to get too worked up about, one doesn&#8217;t want to take that risk unless there is some benefit from doing so. And because the benefit can be measured in terms of reducing risks, I am in the useful position of being able to compare like with like. In other words, it&#8217;s not like being asked whether I want to play Russian roulette for a million pounds, where I would have to weigh up a lot of money against a one in six chance of dying. It&#8217;s more like being asked to play Russian roulette (but with much better odds) once in order to avoid having to play it once a year for the next five to ten years, since the additional risk per year of having a stroke if you are in atrial fibrillation is comparable to the risk of having a stroke as a result of catheter ablation, even if one is taking Warfarin. (AF increases your annual stroke risk by a factor of about 5, but Warfarin divides that by about 3, or so I&#8217;ve read.) I can&#8217;t now find the figures I used. Again, the calculations were complicated by the fact that relative to many AF patients my risks of stroke are quite small, but again I think that applies to the risks as a result of the operation as well. As a precaution, it is standard practice to have weekly blood tests to make sure that one&#8217;s INR stays within the right range for a good long time before the operation, which mine has, so I have done what I can to minimize the stroke risk.</p>
<p>One final complication is that Warfarin carries its own risk: because it thins your blood, it increases the chances that you will have a brain haemorrhage, which can have very serious consequences. I think the extra risk may be something like 1% a year. Unfortunately, it isn&#8217;t considered safe to stop taking Warfarin after a successful catheter ablation, so this risk isn&#8217;t going to go away. But from the point of view of balancing the risks and benefits of the operation, that means that this particular risk doesn&#8217;t have to be taken into account, as it will be there either way.</p>
<p>In summary then, I&#8217;ve looked online at various statistics, none of which tell me exactly what I want to know &#8212; since they refer to populations that are more general than me, and in particular usually somewhat older than me (which is good news, since my risks should therefore be lower than average) &#8212; and concluded that probably the risk of having the operation is comparable to the risk associated with not having it. The fact that the last time I went into AF, which I&#8217;m in now, was only three months after I had had an electrical cardioversion was what finally persuaded me of that. And if I do have it and it works, then my quality of life will be improved, though not hugely, by my not being in AF. And it seems that when the doctors say that the risks of the operation are low, they are (in this instance) talking sense, since the risks are comparable to the background risk that everyone faces.</p>
<p>The operation is tomorrow. It takes a few hours and is done under a mixture of a local anaesthetic (in your leg where the wires go in) and light sedation. So I&#8217;ll be conscious. Assuming all goes well, it should be quite an interesting experience &#8212; I&#8217;ll report back on that when it&#8217;s over.</p>
<p>Two more small things. One is that AF itself is mathematically interesting: it seems that something causes the heart to go from a nice periodic rhythm into a more chaotic one, and it is not well understood why. The other is that I have tried to look things up in the medical literature in order to be able to assess the risks as well as I can. Last night I decided I wanted to look at a paper in the Lancet, that renowned medical journal published by Elsevier. Cambridge subscribes to Science Direct, Elsevier&#8217;s huge electronic package of all its science journals, so I in theory I should have been able to read the paper. I won&#8217;t go into details, but suffice it to say that even though I was entitled to read it, the system, for some mysterious reason, wouldn&#8217;t let me and kept giving annoying error messages. It reminded me why I&#8217;m in favour of getting rid of the subscription model for academic journals, and also of why I don&#8217;t like being told by Elsevier how much they have invested in Science Direct. </p>
<p>I&#8217;ll end by saying that this post is very much <em>not</em> intended to be a plea for sympathy. I know many people who have had much worse decisions (of the same general kind, but with far less favourable probabilities) to face than this one. In fact, it&#8217;s supposed to be the opposite of a plea for sympathy: more like an explanation of why a risk of 1/1000, which initially seems quite scary, is in fact not that scary after all. If, very much contrary to expectations, something goes badly wrong tomorrow, that will be the moment for sympathy. But the chances of that are very much smaller even than the chances that Mitt Romney will win the presidential election, something that as an avid <a href="http://fivethirtyeight.blogs.nytimes.com/author/nate-silver/">Nate Silver</a> reader I find highly encouraging.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4616/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4616/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4616&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2012/11/05/mathematics-meets-real-life/feed/</wfw:commentRss>
		<slash:comments>46</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>EDP27 &#8212; the modular version of Roth&#8217;s AP-discrepancy theorem</title>
		<link>http://gowers.wordpress.com/2012/09/19/edp27-the-modular-version-of-roths-ap-discrepancy-theorem/</link>
		<comments>http://gowers.wordpress.com/2012/09/19/edp27-the-modular-version-of-roths-ap-discrepancy-theorem/#comments</comments>
		<pubDate>Wed, 19 Sep 2012 09:25:50 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[polymath5]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4528</guid>
		<description><![CDATA[Recall from earlier posts Gil&#8217;s modular conjecture for HAPs. It states that if is large enough and is a function from to that never takes the value 0, then for every there exists a HAP such that mod . It is easy to see that this implies EDP, so it may well be very hard, [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4528&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Recall from earlier posts Gil&#8217;s modular conjecture for HAPs. It states that if <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> is large enough and <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is a function from <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C2%2C%5Cdots%2Cn%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{1,2,&#92;dots,n&#92;}' title='&#92;{1,2,&#92;dots,n&#92;}' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BZ%7D_p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{Z}_p' title='&#92;mathbb{Z}_p' class='latex' /> that never takes the value 0, then for every <img src='http://s0.wp.com/latex.php?latex=a&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a' title='a' class='latex' /> there exists a HAP <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bx%5Cin+P%7Df%28x%29%5Cequiv+a&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{x&#92;in P}f(x)&#92;equiv a' title='&#92;sum_{x&#92;in P}f(x)&#92;equiv a' class='latex' /> mod <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' />. It is easy to see that this implies EDP, so it may well be very hard, or even false. However, one can hold out a little hope that, as with some strengthenings of statements, it is in fact easier, because it is in some way more symmetrical and fundamental. Given that, it makes good sense, as Gil has suggested, to try to prove modular versions of known discrepancy theorems, in the hope of developing general techniques that can then be tried out on the modular EDP conjecture.</p>
<p>A very obvious candidate for a discrepancy theorem that we could try to modularize is Roth&#8217;s theorem, which asserts that for any <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' />-valued function <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> on <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C2%2C%5Cdots%2Cn%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{1,2,&#92;dots,n&#92;}' title='&#92;{1,2,&#92;dots,n&#92;}' class='latex' /> there exists an arithmetic progression <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%7C%5Csum_%7Bx%5Cin+P%7Df%28x%29%7C%5Cgeq+cn%5E%7B1%2F4%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|&#92;sum_{x&#92;in P}f(x)|&#92;geq cn^{1/4}' title='|&#92;sum_{x&#92;in P}f(x)|&#92;geq cn^{1/4}' class='latex' />. That gives rise to the following problem.</p>
<p><strong>Problem.</strong> <em>Let <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> be a prime. What is the smallest <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> such that for every function <img src='http://s0.wp.com/latex.php?latex=f%3A%5C%7B1%2C2%2C%5Cdots%2Cn%5C%7D%5Cto%5Cmathbb%7BZ%7D_p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f:&#92;{1,2,&#92;dots,n&#92;}&#92;to&#92;mathbb{Z}_p' title='f:&#92;{1,2,&#92;dots,n&#92;}&#92;to&#92;mathbb{Z}_p' class='latex' /> that never takes the value 0, every <img src='http://s0.wp.com/latex.php?latex=a%5Cin%5Cmathbb%7BZ%7D_p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a&#92;in&#92;mathbb{Z}_p' title='a&#92;in&#92;mathbb{Z}_p' class='latex' /> can be expressed as <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bx%5Cin+P%7Df%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{x&#92;in P}f(x)' title='&#92;sum_{x&#92;in P}f(x)' class='latex' /> for some arithmetic progression <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' />?</em></p>
<p>In this post I shall collect together a few simple observations about this question.<br />
<span id="more-4528"></span></p>
<p>1. We can at least prove that such an <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> exists. Indeed, by Szemer&eacute;di&#8217;s theorem, if <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> is large enough then there is an arithmetic progression <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> of length <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> on which <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is constant. Since <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> takes non-zero values and <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> is prime, the sums along initial segments of <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> run through all the numbers mod <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' />. </p>
<p>So the question is really asking whether the result can be proved with a reasonable bound for <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' />.</p>
<p>2. Roth&#8217;s discrepancy result tells us that if <img src='http://s0.wp.com/latex.php?latex=n%5Cgeq+Cp%5E4&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n&#92;geq Cp^4' title='n&#92;geq Cp^4' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> takes only the values <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' />, then <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> takes all values mod <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' />. That is because there is a progression <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%7C%5Csum_%7Bx%5Cin+P%7Df%28x%29%7C%5Cgeq+p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|&#92;sum_{x&#92;in P}f(x)|&#92;geq p' title='|&#92;sum_{x&#92;in P}f(x)|&#92;geq p' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BZ%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{Z}' title='&#92;mathbb{Z}' class='latex' />, and by what one might call the discrete intermediate value theorem, it therefore takes all values in <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BZ%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{Z}' title='&#92;mathbb{Z}' class='latex' /> between 0 and <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> or between <img src='http://s0.wp.com/latex.php?latex=0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='0' title='0' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=-p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='-p' title='-p' class='latex' />. This is reasonably compelling evidence that the conjecture is true with a decent bound, but of course the intermediate-value-theorem argument breaks down completely when <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> can take arbitrary non-zero values mod <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' />.</p>
<p>The known lower bounds for Roth&#8217;s discrepancy theorem for APs (which are equal to the upper bounds up to a constant) show that the best possible bound for the modular question is at least <img src='http://s0.wp.com/latex.php?latex=cp%5E4&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='cp^4' title='cp^4' class='latex' />.</p>
<p>3. To make the problem more symmetrical, and therefore potentially easier to handle, it might be a good idea to begin by tackling the following variant.</p>
<p><strong>Problem.</strong> <em>Let <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> be a prime. What is the smallest <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> such that for every function <img src='http://s0.wp.com/latex.php?latex=f%3A%5Cmathbb%7BZ%7D_n%5Cto%5Cmathbb%7BZ%7D_p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f:&#92;mathbb{Z}_n&#92;to&#92;mathbb{Z}_p' title='f:&#92;mathbb{Z}_n&#92;to&#92;mathbb{Z}_p' class='latex' /> that never takes the value 0, every <img src='http://s0.wp.com/latex.php?latex=a%5Cin%5Cmathbb%7BZ%7D_p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a&#92;in&#92;mathbb{Z}_p' title='a&#92;in&#92;mathbb{Z}_p' class='latex' /> can be expressed as <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bx%5Cin+P%7Df%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{x&#92;in P}f(x)' title='&#92;sum_{x&#92;in P}f(x)' class='latex' /> for some arithmetic progression <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BZ%7D_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{Z}_n' title='&#92;mathbb{Z}_n' class='latex' />?</em></p>
<p>The big difference here is that arithmetic progressions are allowed to &#8220;wrap around&#8221;. That means that for every arithmetic progression <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> and every <img src='http://s0.wp.com/latex.php?latex=r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r' title='r' class='latex' /> coprime to <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> (it might be convenient to insist that <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> is prime), the sets <img src='http://s0.wp.com/latex.php?latex=%5C%7Brx%3Ax%5Cin+P%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{rx:x&#92;in P&#92;}' title='&#92;{rx:x&#92;in P&#92;}' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5C%7Bx%2Br%3Ax%5Cin+P%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{x+r:x&#92;in P&#92;}' title='&#92;{x+r:x&#92;in P&#92;}' class='latex' /> are also progressions. I think the bounds in the integer case with <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' /> functions show that the best bound one could hope for for this modified problem are <img src='http://s0.wp.com/latex.php?latex=n%5Cgeq+cp%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n&#92;geq cp^2' title='n&#92;geq cp^2' class='latex' />, but I need to check that.</p>
<p>3. One natural approach to proving that a function takes all values mod <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> would be to attempt to show that it takes all values with approximately the same frequency. This would potentially allow us to bring in analytic tools. However, it is not in general true. For example, let <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> be the function that is <img src='http://s0.wp.com/latex.php?latex=-1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='-1' title='-1' class='latex' /> up to <img src='http://s0.wp.com/latex.php?latex=n%2F2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n/2' title='n/2' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1' title='1' class='latex' /> from <img src='http://s0.wp.com/latex.php?latex=n%2F2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n/2' title='n/2' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' />. Then a small calculation shows that if <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> is a mod-<img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> AP of common difference <img src='http://s0.wp.com/latex.php?latex=d&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d' title='d' class='latex' />, then <img src='http://s0.wp.com/latex.php?latex=%7C%5Csum_%7Bx%5Cin+P%7Df%28x%29%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|&#92;sum_{x&#92;in P}f(x)|' title='|&#92;sum_{x&#92;in P}f(x)|' class='latex' /> is never more than <img src='http://s0.wp.com/latex.php?latex=Cn%2F%7Cd%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Cn/|d|' title='Cn/|d|' class='latex' />. I think <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' /> can probably be taken to be 2, or maybe even 1. (By <img src='http://s0.wp.com/latex.php?latex=%7Cd%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|d|' title='|d|' class='latex' /> I mean the smallest modulus of any number congruent to <img src='http://s0.wp.com/latex.php?latex=d&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d' title='d' class='latex' /> mod <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' />.) Thus, for a positive proportion of common differences <img src='http://s0.wp.com/latex.php?latex=d&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d' title='d' class='latex' />, the sums along APs of common difference <img src='http://s0.wp.com/latex.php?latex=d&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d' title='d' class='latex' /> never exceed <img src='http://s0.wp.com/latex.php?latex=10&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='10' title='10' class='latex' />, say, in modulus. If <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> is large, this implies that the values of the sums <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bx%5Cin+P%7Df%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{x&#92;in P}f(x)' title='&#92;sum_{x&#92;in P}f(x)' class='latex' /> are very far from uniformly distributed, since they are concentrated around 0. On the other hand, <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is obviously not a counterexample to the conjecture (by which I mean the assertion that the modular statement holds with a good bound). </p>
<p>4. That observation does not rule out a proof that goes via showing that the values of sums along APs are approximately uniformly distributed, but it does demonstrate that in order to prove that, we would have to put some conditions on <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' />. That is, we would need a two-step argument along the following lines (reminiscent of proofs of Szemer&eacute;di&#8217;s theorem, though I would hope that this problem is easier). </p>
<p>(i) If <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is quasirandom, or at least &#8220;contains substantial quasirandomness&#8221;, then the values of <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bx%5Cin+P%7Df%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{x&#92;in P}f(x)' title='&#92;sum_{x&#92;in P}f(x)' class='latex' /> are approximately uniformly distributed mod <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' />.</p>
<p>(ii) If <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is not quasirandom, or better still &#8220;does not contain substantial quasirandomness&#8221;, then we can deduce in some other way (such as finding a long AP on which <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is constant, but that may be too much to ask) that the sums take all values.</p>
<p>5. The weaker the property we can get away with in (i), the better. However, in order to get started it would be good to find <em>any</em> quasirandomness property that implies that the values of <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bx%5Cin+P%7Df%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{x&#92;in P}f(x)' title='&#92;sum_{x&#92;in P}f(x)' class='latex' /> are approximately uniformly distributed. </p>
<p>6. One thing that needs deciding here is what we mean by a random progression <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' />. The simplest would be to fix some <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' /> and pick random mod-<img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> APs of length <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' />. These don&#8217;t work for the modular conjecture in general, since the constant function <img src='http://s0.wp.com/latex.php?latex=f%28x%29%5Cequiv+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(x)&#92;equiv 1' title='f(x)&#92;equiv 1' class='latex' /> is a counterexample, but they might work for functions with a suitable quasirandomness property. </p>
<p>7. The fact that we can&#8217;t fix the length of the progressions shows that the modular conjecture differs in an important way from Roth&#8217;s discrepancy theorem, where fixing the length is not a problem (especially in the mod-<img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> version). This dampens any hopes one might start off with for adapting the proof of Roth&#8217;s discrepancy theorem to cope with the modular version. But that might in the end be a good thing, since the point of the modular conjecture is to introduce new techniques that can then be brought to bear on EDP.</p>
<p>8. When we are looking for a suitable quasirandomness property, it is tempting to turn to Fourier analysis. However, the following (modification of a standard) example is a bit troubling. Let <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> be defined as follows. For each <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> we randomly decide whether to set <img src='http://s0.wp.com/latex.php?latex=f%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(x)' title='f(x)' class='latex' /> equal to <img src='http://s0.wp.com/latex.php?latex=%5Cpm+x%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm x^2' title='&#92;pm x^2' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=%5Cpm+3x%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 3x^2' title='&#92;pm 3x^2' class='latex' /> mod <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> (where <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> is the residue between 0 and <img src='http://s0.wp.com/latex.php?latex=n-1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n-1' title='n-1' class='latex' />). If we now choose a random mod-<img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> arithmetic progression <img src='http://s0.wp.com/latex.php?latex=P%3D%5C%7Ba%2Ca%2Bd%2Ca%2B2d%2Ca%2B3d%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P=&#92;{a,a+d,a+2d,a+3d&#92;}' title='P=&#92;{a,a+d,a+2d,a+3d&#92;}' class='latex' /> of length 4, there is an absolute constant <img src='http://s0.wp.com/latex.php?latex=%5Ctheta%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;theta&gt;0' title='&#92;theta&gt;0' class='latex' /> such that with probability at least <img src='http://s0.wp.com/latex.php?latex=%5Ctheta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;theta' title='&#92;theta' class='latex' /> <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> does not wrap around, and <img src='http://s0.wp.com/latex.php?latex=f%28a%29%3Da%5E2%2C+f%28a%2Bd%29%3D-3%28a%2Bd%29%5E2%2C+f%28a%2B2d%29%3D3%28a%2B2d%29%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(a)=a^2, f(a+d)=-3(a+d)^2, f(a+2d)=3(a+2d)^2' title='f(a)=a^2, f(a+d)=-3(a+d)^2, f(a+2d)=3(a+2d)^2' class='latex' />, and <img src='http://s0.wp.com/latex.php?latex=f%28a%2B3d%29%3D-%28a%2B3d%29%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(a+3d)=-(a+3d)^2' title='f(a+3d)=-(a+3d)^2' class='latex' />. When this happens <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bx%5Cin+P%7Df%28x%29%3D0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{x&#92;in P}f(x)=0' title='&#92;sum_{x&#92;in P}f(x)=0' class='latex' />. Therefore, the value 0 occurs much too often, but according to the natural Fourier definitions of quasirandomness (which we could get by looking at the function <img src='http://s0.wp.com/latex.php?latex=%5Cexp%282%5Cpi+if%28x%29%2Fp%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;exp(2&#92;pi if(x)/p)' title='&#92;exp(2&#92;pi if(x)/p)' class='latex' />) this function <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is highly quasirandom.</p>
<p>9. Of course, we are looking at much longer arithmetic progressions. However, it is difficult to think of proofs that work for long arithmetic progressions and fail for short ones. (It is mildly encouraging that at least the above source of examples seems to break down when the progressions become long &#8212; though that should be checked carefully.)</p>
<p>10. Hmm &#8230; just after writing that I thought of the following modified example. Instead of choosing <em>randomly</em> whether to set <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> equal to <img src='http://s0.wp.com/latex.php?latex=%5Cpm+x%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm x^2' title='&#92;pm x^2' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=%5Cpm+3x%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 3x^2' title='&#92;pm 3x^2' class='latex' />, let&#8217;s do it in the following highly deterministic way: if <img src='http://s0.wp.com/latex.php?latex=x%5Cequiv+0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;equiv 0' title='x&#92;equiv 0' class='latex' /> mod 4 we set <img src='http://s0.wp.com/latex.php?latex=f%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(x)' title='f(x)' class='latex' /> equal to <img src='http://s0.wp.com/latex.php?latex=x%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x^2' title='x^2' class='latex' />, if <img src='http://s0.wp.com/latex.php?latex=x%5Cequiv+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;equiv 1' title='x&#92;equiv 1' class='latex' /> mod 4 we set it equal to <img src='http://s0.wp.com/latex.php?latex=-3x%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='-3x^2' title='-3x^2' class='latex' />, if <img src='http://s0.wp.com/latex.php?latex=x%5Cequiv+2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;equiv 2' title='x&#92;equiv 2' class='latex' /> mod 4 we set it equal to <img src='http://s0.wp.com/latex.php?latex=3x%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='3x^2' title='3x^2' class='latex' />, and if <img src='http://s0.wp.com/latex.php?latex=x%5Cequiv+3&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;equiv 3' title='x&#92;equiv 3' class='latex' /> mod 4 we set it equal to <img src='http://s0.wp.com/latex.php?latex=-x%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='-x^2' title='-x^2' class='latex' />. In addition, let&#8217;s suppose that <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> is a multiple of <img src='http://s0.wp.com/latex.php?latex=4p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='4p' title='4p' class='latex' />, since I seem to need that to make this example work. (Even if we can&#8217;t get rid of this restriction somehow, it will still show that a proof that worked for prime values of <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> would have to make strong use of the fact that <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> was not a multiple of <img src='http://s0.wp.com/latex.php?latex=4p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='4p' title='4p' class='latex' /> &#8212; and there would be many other examples of a similar kind that would lead to similar requirements &#8212; which would be quite a big restriction on what it could look like.)</p>
<p>Then if <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' /> is a multiple of 4 and <img src='http://s0.wp.com/latex.php?latex=P%3D%5C%7Ba%2Ca%2Bd%2C%5Cdots%2Ca%2B%28m-1%29d%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P=&#92;{a,a+d,&#92;dots,a+(m-1)d&#92;}' title='P=&#92;{a,a+d,&#92;dots,a+(m-1)d&#92;}' class='latex' /> is a random mod-<img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> AP of length <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' />, there is a probability 1/16 that <img src='http://s0.wp.com/latex.php?latex=a%5Cequiv+0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a&#92;equiv 0' title='a&#92;equiv 0' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=d%5Cequiv+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d&#92;equiv 1' title='d&#92;equiv 1' class='latex' /> mod 4. (Note that because <img src='http://s0.wp.com/latex.php?latex=4%7Cn&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='4|n' title='4|n' class='latex' /> this makes sense independently of which residue we pick.) If that is the case, we can partition <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> into APs <img src='http://s0.wp.com/latex.php?latex=P_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P_i' title='P_i' class='latex' /> of length 4 such that the sum of <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> along each <img src='http://s0.wp.com/latex.php?latex=P_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P_i' title='P_i' class='latex' /> is zero. This shows that with probability at least 1/16 the sum <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bx%5Cin+P%7Df%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{x&#92;in P}f(x)' title='&#92;sum_{x&#92;in P}f(x)' class='latex' /> is 0 mod <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' />. </p>
<p>Is this function <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> quasirandom? According to many definitions, yes, since although we split into cases in a highly structured way, the behaviour of each case of <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> is highly quasirandom.</p>
<p>I mentioned above that there are many similar examples. To give just one illustration, we could take <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> as a multiple of <img src='http://s0.wp.com/latex.php?latex=5p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='5p' title='5p' class='latex' /> and let <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> take turns taking values <img src='http://s0.wp.com/latex.php?latex=x%5E3&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x^3' title='x^3' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=-4x%5E3&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='-4x^3' title='-4x^3' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=6x%5E3&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='6x^3' title='6x^3' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=-4x%5E3&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='-4x^3' title='-4x^3' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=x%5E3&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x^3' title='x^3' class='latex' />. Essentially the same argument would work, but now the probability would be <img src='http://s0.wp.com/latex.php?latex=1%2F25&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1/25' title='1/25' class='latex' />. (Actually, I now see that the probabilities can be doubled in both cases, since <img src='http://s0.wp.com/latex.php?latex=d%5Cequiv+-1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d&#92;equiv -1' title='d&#92;equiv -1' class='latex' /> works just as well as <img src='http://s0.wp.com/latex.php?latex=d%5Cequiv+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d&#92;equiv 1' title='d&#92;equiv 1' class='latex' />.)</p>
<hr />
<p>I&#8217;ll leave it there. I&#8217;ve mentioned some proof strategies, and also discussed some difficulties with them. Does anyone have other proof strategies to suggest? I haven&#8217;t mentioned using the polynomial method, but that is an obvious thing to try to do: that is, write down a polynomial that vanishes identically only if the modular conjecture for APs is true, and try to prove that it vanishes identically. It is certainly worth looking into that, though it is a little discouraging that the conjecture is false for APs of any fixed length, since that would necessarily make the polynomials less symmetrical. If the above strategy seems the most promising (or should that be least unpromising?) then does anyone have any thoughts about quasirandomness properties that might do the job? (Gil mentioned something in a recent comment &#8212; it would be good to have that thought in more detail.)</p>
<p>I&#8217;ll end by saying that even if proving the modular conjecture for mod-<img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> APs ended up having nothing much to do with EDP, it is a very nice problem on its own, and could make an excellent spin-off Polymath project.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4528/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4528/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4528&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2012/09/19/edp27-the-modular-version-of-roths-ap-discrepancy-theorem/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>EDP26 &#8212; three generalizations</title>
		<link>http://gowers.wordpress.com/2012/09/06/edp26-three-generalizations/</link>
		<comments>http://gowers.wordpress.com/2012/09/06/edp26-three-generalizations/#comments</comments>
		<pubDate>Thu, 06 Sep 2012 10:42:57 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[polymath5]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4512</guid>
		<description><![CDATA[This short post is designed as a possible way in to EDP for anyone who might be interested in participating but daunted by the idea of reading large amounts of material. One of the natural strategies for proving EDP is to try to formulate and prove stronger statements. At first that sounds paradoxical: isn&#8217;t it [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4512&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>This short post is designed as a possible way in to EDP for anyone who might be interested in participating but daunted by the idea of reading large amounts of material. One of the natural strategies for proving EDP is to try to formulate and prove stronger statements. At first that sounds paradoxical: isn&#8217;t it even harder to prove a stronger statement? But the answer to that question is often no. To give a slightly silly example, suppose you were asked to prove that for every <img src='http://s0.wp.com/latex.php?latex=c%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='c&gt;0' title='c&gt;0' class='latex' /> there exists <img src='http://s0.wp.com/latex.php?latex=N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N' title='N' class='latex' /> such that for every <img src='http://s0.wp.com/latex.php?latex=n%5Cgeq+N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n&#92;geq N' title='n&#92;geq N' class='latex' /> if <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> is odd and has at least <img src='http://s0.wp.com/latex.php?latex=c%5Clog+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='c&#92;log n' title='c&#92;log n' class='latex' /> prime factors (counted with multiplicity), then <img src='http://s0.wp.com/latex.php?latex=2%5E%7B%5Cphi%28n%29%7D%5Cequiv+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='2^{&#92;phi(n)}&#92;equiv 1' title='2^{&#92;phi(n)}&#92;equiv 1' class='latex' /> mod <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%5Cphi&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;phi' title='&#92;phi' class='latex' /> is Euler&#8217;s totient function. You could make the problem easier by proving Euler&#8217;s theorem, that <img src='http://s0.wp.com/latex.php?latex=a%5E%7B%5Cphi%28n%29%7D%5Cequiv+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a^{&#92;phi(n)}&#92;equiv 1' title='a^{&#92;phi(n)}&#92;equiv 1' class='latex' /> mod <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> for every <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> and every <img src='http://s0.wp.com/latex.php?latex=a&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a' title='a' class='latex' /> that is coprime to <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' />. You wouldn&#8217;t have as many hypotheses to use, but that&#8217;s good, since they can&#8217;t be used. Perhaps a better and more relevant example is when you generalize the class of numbers you are working with so as to allow a wider set of methods. For instance, suppose you want to prove that the largest possible product of three positive integers that add to 300 is at most <img src='http://s0.wp.com/latex.php?latex=10%5E6&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='10^6' title='10^6' class='latex' />. If you replace positive integers by positive reals, then you suddenly have available methods that you didn&#8217;t have before &#8212; for example, you could use compactness plus a lemma that says that if any two numbers are not equal then you can increase the product by replacing both of them by their average. (I&#8217;m not saying that&#8217;s the easiest proof &#8212; just that it&#8217;s a proof that you can&#8217;t do without first generalizing the statement.)<br />
<span id="more-4512"></span></p>
<p>In this post I want to mention three strengthenings of EDP. One of them I find interesting but not promising as a way of proving EDP, for reasons that I will explain. The other two look to me also very interesting and much more promising. All of them have been mentioned already, but the point of this post is to collect them in one convenient place.</p>
<p><strong>Restricting the allowable common differences.</strong></p>
<p>I know of no <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' />-sequence that has bounded discrepancy on all HAPs with common difference either a prime or a power of 2. The sequence <img src='http://s0.wp.com/latex.php?latex=1%2C1%2C-1%2C-1%2C1%2C1%2C-1%2C-1%2C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1,1,-1,-1,1,1,-1,-1,&#92;dots' title='1,1,-1,-1,1,1,-1,-1,&#92;dots' class='latex' /> has bounded discrepancy on all HAPs of prime common difference, but if you allow powers of 2 as well, then no periodic construction can work, since if the period <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' /> is <img src='http://s0.wp.com/latex.php?latex=2%5Eka&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='2^ka' title='2^ka' class='latex' /> for some odd integer <img src='http://s0.wp.com/latex.php?latex=a&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a' title='a' class='latex' />, then a HAP of common difference <img src='http://s0.wp.com/latex.php?latex=2%5Ek&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='2^k' title='2^k' class='latex' /> will have a bias of at least 1 in each interval of length <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' />, for parity reasons, and this bias will gradually accumulate.</p>
<p>One can defeat powers of 2 by using the Morse sequence <img src='http://s0.wp.com/latex.php?latex=1%2C-1%2C-1%2C1%2C-1%2C1%2C1%2C-1%2C-1%2C1%2C1%2C-1%2C1%2C-1%2C-1%2C1%2C-1%2C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1,-1,-1,1,-1,1,1,-1,-1,1,1,-1,1,-1,-1,1,-1,&#92;dots' title='1,-1,-1,1,-1,1,1,-1,-1,1,1,-1,1,-1,-1,1,-1,&#92;dots' class='latex' /> (if you haven&#8217;t seen it, then it&#8217;s a nice exercise to see how it is generated), but that has unbounded discrepancy on HAPs of odd prime length. (I can&#8217;t remember exactly why.)</p>
<p>Why do I not think that this potential strengthening of EDP is likely to be useful for attacking the problem? Well, one promising feature of EDP is that it seems to generalize to sequences taking other kinds of values, such as complex numbers of modulus 1 or even unit vectors in an arbitrary Hilbert space. However, something that I&#8217;ve only just noticed (though others may have spotted it ages ago) is that if one restricts to, say, prime-power common differences, then even the complex version of the problem becomes false. A very simple counterexample is the sequence <img src='http://s0.wp.com/latex.php?latex=%5Comega%2C%5Comega%5E2%2C%5Comega%5E3%2C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;omega,&#92;omega^2,&#92;omega^3,&#92;dots' title='&#92;omega,&#92;omega^2,&#92;omega^3,&#92;dots' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%5Comega%3D%5Cexp%28%5Cpi+i%2F3%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;omega=&#92;exp(&#92;pi i/3)' title='&#92;omega=&#92;exp(&#92;pi i/3)' class='latex' /> (that is, a primitive sixth root of 1). Then the sum along any HAP with common difference <img src='http://s0.wp.com/latex.php?latex=d&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d' title='d' class='latex' /> that isn&#8217;t a multiple of 6 will be a sum of a GP with common ratio <img src='http://s0.wp.com/latex.php?latex=%5Cexp%28d%5Cpi+i%2F3%29%5Cne+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;exp(d&#92;pi i/3)&#92;ne 1' title='&#92;exp(d&#92;pi i/3)&#92;ne 1' class='latex' />, and will therefore be (uniformly) bounded. In particular, this is true of HAPs with prime-power common difference.</p>
<p>This simple example also shows that a certain real generalization of EDP is false when you restrict the common differences in this way: you cannot prove unbounded discrepancy for sequences that take values in the set <img src='http://s0.wp.com/latex.php?latex=%5C%7B-2%2C-1%2C1%2C2%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{-2,-1,1,2&#92;}' title='&#92;{-2,-1,1,2&#92;}' class='latex' />. The counterexample is simply twice the real part of the example above: the sequence <img src='http://s0.wp.com/latex.php?latex=1%2C-1%2C-2%2C-1%2C1%2C2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1,-1,-2,-1,1,2' title='1,-1,-2,-1,1,2' class='latex' /> repeated over and over again. So if (and I think it&#8217;s a big if) EDP is true for HAPs with common differences that are either primes or powers of 2, then any proof must make pretty strong use of the fact that the sequence is a <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' />-valued sequence. This rules out a lot of promising techniques, so it appears to make the problem harder rather than easier.</p>
<p><strong>A discrepancy question about matrices.</strong></p>
<p>In my earlier post I mentioned what I called the non-symmetric vector-valued EDP. I have subsequently realized (though perhaps a better word is &#8220;remembered&#8221; since I must have been sort of aware of this at some point) that it is equivalent to another discrepancy statement that I now find more appealing. The statement is the following.</p>
<p><strong>Conjecture.</strong> Let <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> be a function from <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BN%7D%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{N}^2' title='&#92;mathbb{N}^2' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BR%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{R}' title='&#92;mathbb{R}' class='latex' /> and suppose that <img src='http://s0.wp.com/latex.php?latex=f%28n%2Cn%29%3D1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(n,n)=1' title='f(n,n)=1' class='latex' /> for every <img src='http://s0.wp.com/latex.php?latex=n%5Cin%5Cmathbb%7BN%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n&#92;in&#92;mathbb{N}' title='n&#92;in&#92;mathbb{N}' class='latex' />. Then for every real number <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' /> there exist HAPs <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=Q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Q' title='Q' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%7C%5Csum_%7Bm%5Cin+P%2Cn%5Cin+Q%7Df%28m%2Cn%29%7C%5Cgeq+C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|&#92;sum_{m&#92;in P,n&#92;in Q}f(m,n)|&#92;geq C' title='|&#92;sum_{m&#92;in P,n&#92;in Q}f(m,n)|&#92;geq C' class='latex' />. </p>
<p>If we take a function <img src='http://s0.wp.com/latex.php?latex=g%3A%5Cmathbb%7BN%7D%5Cto%5C%7B-1%2C1%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g:&#92;mathbb{N}&#92;to&#92;{-1,1&#92;}' title='g:&#92;mathbb{N}&#92;to&#92;{-1,1&#92;}' class='latex' /> and define <img src='http://s0.wp.com/latex.php?latex=f%28m%2Cn%29%3Dg%28m%29g%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(m,n)=g(m)g(n)' title='f(m,n)=g(m)g(n)' class='latex' />, then for every <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' /> the above conjecture, if true, gives us HAPs <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=Q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Q' title='Q' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%7C%5Csum_%7Bm%5Cin+P%7Dg%28m%29%7C%7C%5Csum_%7Bn%5Cin+Q%7Dg%28n%29%7C%5Cgeq+C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|&#92;sum_{m&#92;in P}g(m)||&#92;sum_{n&#92;in Q}g(n)|&#92;geq C' title='|&#92;sum_{m&#92;in P}g(m)||&#92;sum_{n&#92;in Q}g(n)|&#92;geq C' class='latex' />, which proves that the discrepancy of <img src='http://s0.wp.com/latex.php?latex=g&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g' title='g' class='latex' /> is at least <img src='http://s0.wp.com/latex.php?latex=C%5E%7B1%2F2%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C^{1/2}' title='C^{1/2}' class='latex' />. So the above conjecture implies EDP. But the class of functions of the form <img src='http://s0.wp.com/latex.php?latex=g%28m%29g%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g(m)g(n)' title='g(m)g(n)' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=g%3A%5Cmathbb%7BN%7D%5Cto%5C%7B-1%2C1%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g:&#92;mathbb{N}&#92;to&#92;{-1,1&#92;}' title='g:&#92;mathbb{N}&#92;to&#92;{-1,1&#92;}' class='latex' /> is a very small subset of the class of all functions <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> that take the value 1 along the diagonal, so this conjecture is very much stronger than EDP.</p>
<p>It is also equivalent to the statement that for every <img src='http://s0.wp.com/latex.php?latex=c%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='c&gt;0' title='c&gt;0' class='latex' /> there exists a diagonal matrix <img src='http://s0.wp.com/latex.php?latex=D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='D' title='D' class='latex' /> of trace at least 1 that can be written as a linear combination <img src='http://s0.wp.com/latex.php?latex=%5Csum_i%5Clambda_iu_i%5Cotimes+v_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_i&#92;lambda_iu_i&#92;otimes v_i' title='&#92;sum_i&#92;lambda_iu_i&#92;otimes v_i' class='latex' />, where for each <img src='http://s0.wp.com/latex.php?latex=i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='i' title='i' class='latex' /> <img src='http://s0.wp.com/latex.php?latex=u_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u_i' title='u_i' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=v_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v_i' title='v_i' class='latex' /> are characteristic functions of HAPs <img src='http://s0.wp.com/latex.php?latex=P_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P_i' title='P_i' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=Q_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Q_i' title='Q_i' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5Csum_i%7C%5Clambda_i%7C%5Cleq+c&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_i|&#92;lambda_i|&#92;leq c' title='&#92;sum_i|&#92;lambda_i|&#92;leq c' class='latex' />. (This is the statement that I discussed at length in my previous post.) In one direction this is easy: if a decomposition of that kind exists, then <img src='http://s0.wp.com/latex.php?latex=%5Csum_nf%28n%2Cn%29D%28n%2Cn%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_nf(n,n)D(n,n)' title='&#92;sum_nf(n,n)D(n,n)' class='latex' /> equals the trace of <img src='http://s0.wp.com/latex.php?latex=D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='D' title='D' class='latex' /> and is therefore greater than 1, but it also equals <img src='http://s0.wp.com/latex.php?latex=%5Csum_i%5Clambda_i%7C%5Csum_%7B%28m%2Cn%29%5Cin+P_i%5Ctimes+Q_i%7Df%28m%2Cn%29%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_i&#92;lambda_i|&#92;sum_{(m,n)&#92;in P_i&#92;times Q_i}f(m,n)|' title='&#92;sum_i&#92;lambda_i|&#92;sum_{(m,n)&#92;in P_i&#92;times Q_i}f(m,n)|' class='latex' />, which is at most <img src='http://s0.wp.com/latex.php?latex=c%5Cmax_i%7C%5Csum_%7B%28m%2Cn%29%5Cin+P_i%5Ctimes+Q_i%7Df%28m%2Cn%29%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='c&#92;max_i|&#92;sum_{(m,n)&#92;in P_i&#92;times Q_i}f(m,n)|' title='c&#92;max_i|&#92;sum_{(m,n)&#92;in P_i&#92;times Q_i}f(m,n)|' class='latex' />. It follows that there is some <img src='http://s0.wp.com/latex.php?latex=i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='i' title='i' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%7C%5Csum_%7B%28m%2Cn%29%5Cin+P_i%5Ctimes+Q_i%7Df%28m%2Cn%29%7C%5Cgeq+c%5E%7B-1%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|&#92;sum_{(m,n)&#92;in P_i&#92;times Q_i}f(m,n)|&#92;geq c^{-1}' title='|&#92;sum_{(m,n)&#92;in P_i&#92;times Q_i}f(m,n)|&#92;geq c^{-1}' class='latex' />. </p>
<p>In the other direction, if no such decomposition of a diagonal matrix exists, then the Hahn-Banach theorem gives us a linear functional that separates the class of diagonal matrices with trace at least 1 from the class of linear combinations of HAP products defined above. It is easy to check that this functional must be a diagonal matrix <img src='http://s0.wp.com/latex.php?latex=%28f%28m%2Cn%29%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(f(m,n))' title='(f(m,n))' class='latex' /> with a constant diagonal such that the value along the diagonal is at least 1 and such that <img src='http://s0.wp.com/latex.php?latex=%7C%5Csum_%7B%28m%2Cn%29%5Cin+P%5Ctimes+Q%7Df%28m%2Cn%29%7C%5Cleq+c%5E%7B-1%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|&#92;sum_{(m,n)&#92;in P&#92;times Q}f(m,n)|&#92;leq c^{-1}' title='|&#92;sum_{(m,n)&#92;in P&#92;times Q}f(m,n)|&#92;leq c^{-1}' class='latex' /> for any two HAPs <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=Q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Q' title='Q' class='latex' />. </p>
<p>The conjecture is so much stronger than EDP that I think it would be a mistake just to assume that it is true. My guess is that it <em>is</em> true, but I would be very interested if there was a counterexample (even if, like the complex sequence above, it is disappointingly simple &#8212; in fact, if there is a counterexample, then it seems quite likely that it will be fairly simple). And if there isn&#8217;t a counterexample, then the fact that it is so much stronger a conjecture than EDP does this time make me think that the result might be easier to prove than EDP itself. </p>
<p>Until very recently, I had been mainly interested in the dual version of the question (that is, the question about decomposing diagonal matrices), but now it seems to me that the matrix discrepancy question is worth thinking about directly. It is a clean question, and it has the big advantage over the original EDP question that it does not restrict values to the set <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C-1%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{1,-1&#92;}' title='&#92;{1,-1&#92;}' class='latex' />, so a number of methods can be used that cannot be used directly for EDP. For instance, linear programming can be used to get experimental results: Sasha Nikolov may be going to look into this.</p>
<p>Given any matrix <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' />, one can find vectors <img src='http://s0.wp.com/latex.php?latex=v_m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v_m' title='v_m' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=w_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='w_n' title='w_n' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=f%28m%2Cn%29%3D%5Clangle+v_m%2Cw_n%5Crangle&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(m,n)=&#92;langle v_m,w_n&#92;rangle' title='f(m,n)=&#92;langle v_m,w_n&#92;rangle' class='latex' />, so this matrix question is actually trivially equivalent to the non-symmetric vector-valued question I had formulated earlier. But expressing it in terms of vectors makes it harder to think about rather than easier, and that is what had previously put me off thinking about the question directly.</p>
<p>There is one class of matrices that is particularly worth mentioning I think. If EDP is true but this matrix question is false, then the best candidates for counterexamples to the matrix problem are probably matrices of high rank, and an obvious class of matrices that tend to have high rank is matrices that are constant on diagonals (that is, <a href="http://en.wikipedia.org/wiki/Toeplitz_matrix">Toeplitz matrices</a>).</p>
<p>Suppose, then, that our matrix <img src='http://s0.wp.com/latex.php?latex=f%28m%2Cn%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(m,n)' title='f(m,n)' class='latex' /> is defined to be <img src='http://s0.wp.com/latex.php?latex=g%28m-n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g(m-n)' title='g(m-n)' class='latex' /> for some function <img src='http://s0.wp.com/latex.php?latex=g%3A%5Cmathbb%7BZ%7D%5Cto%5Cmathbb%7BR%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g:&#92;mathbb{Z}&#92;to&#92;mathbb{R}' title='g:&#92;mathbb{Z}&#92;to&#92;mathbb{R}' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=g%280%29%3D1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g(0)=1' title='g(0)=1' class='latex' />. Then <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bm%5Cin+P%2Cn%5Cin+Q%7Df%28m%2Cn%29%3D%5Csum_xg%28x%29P%2A%28-Q%29%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{m&#92;in P,n&#92;in Q}f(m,n)=&#92;sum_xg(x)P*(-Q)(x)' title='&#92;sum_{m&#92;in P,n&#92;in Q}f(m,n)=&#92;sum_xg(x)P*(-Q)(x)' class='latex' />, where by <img src='http://s0.wp.com/latex.php?latex=P%2A%28-Q%29%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P*(-Q)(x)' title='P*(-Q)(x)' class='latex' /> I mean the number of ways of writing <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> as <img src='http://s0.wp.com/latex.php?latex=y-z&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y-z' title='y-z' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=y%5Cin+P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y&#92;in P' title='y&#92;in P' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=z%5Cin+Q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='z&#92;in Q' title='z&#92;in Q' class='latex' />. So we have a class of functions that I&#8217;ll call <em>HAP convolutions</em>, and we&#8217;d like to show that if <img src='http://s0.wp.com/latex.php?latex=g%3A%5Cmathbb%7BZ%7D%5Cto%5Cmathbb%7BR%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g:&#92;mathbb{Z}&#92;to&#92;mathbb{R}' title='g:&#92;mathbb{Z}&#92;to&#92;mathbb{R}' class='latex' /> is any function with <img src='http://s0.wp.com/latex.php?latex=g%280%29%3D1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g(0)=1' title='g(0)=1' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' /> is any constant, then there exists a HAP convolution <img src='http://s0.wp.com/latex.php?latex=%5Cphi&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;phi' title='&#92;phi' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%5Clangle+g%2C%5Cphi%5Crangle%5Cgeq+C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;langle g,&#92;phi&#92;rangle&#92;geq C' title='&#92;langle g,&#92;phi&#92;rangle&#92;geq C' class='latex' />. This is true if and only if there is an efficient way of writing the function that is 1 at 0 and 0 everywhere else as a linear combination of HAP convolutions. This is another question that could be investigated using linear programming, and perhaps since it concerns functions of just one variable we could get more extensive results than we could for the general matrix problem.</p>
<p><strong>The modular conjecture.</strong></p>
<p>In his most recent guest post, Gil Kalai reformulates EDP as a question about sums mod <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' />. EDP is trivially equivalent to the following assertion.</p>
<p><strong>Conjecture.</strong> For every <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> there exists <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> such that for every <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' /> sequence <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon' title='&#92;epsilon' class='latex' /> of length <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> and every <img src='http://s0.wp.com/latex.php?latex=r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r' title='r' class='latex' /> there exists a HAP <img src='http://s0.wp.com/latex.php?latex=P%5Csubset%5C%7B1%2C2%5Cdots%2Cn%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P&#92;subset&#92;{1,2&#92;dots,n&#92;}' title='P&#92;subset&#92;{1,2&#92;dots,n&#92;}' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bx%5Cin+P%7D%5Cepsilon%28x%29%5Cequiv+r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{x&#92;in P}&#92;epsilon(x)&#92;equiv r' title='&#92;sum_{x&#92;in P}&#92;epsilon(x)&#92;equiv r' class='latex' /> mod <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' />. </p>
<p>At first that doesn&#8217;t look like a very interesting reformulation since it is too obviously equivalent to EDP. But what makes it interesting is that it has a very natural generalization that doesn&#8217;t have any obvious counterexamples.</p>
<p><strong>Stronger Conjecture.</strong> For every <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> there exists <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> such that for every sequence <img src='http://s0.wp.com/latex.php?latex=s&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s' title='s' class='latex' /> of length <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> of non-zero numbers mod <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> and every <img src='http://s0.wp.com/latex.php?latex=r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r' title='r' class='latex' /> there exists a HAP <img src='http://s0.wp.com/latex.php?latex=P%5Csubset%5C%7B1%2C2%5Cdots%2Cn%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P&#92;subset&#92;{1,2&#92;dots,n&#92;}' title='P&#92;subset&#92;{1,2&#92;dots,n&#92;}' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bx%5Cin+P%7Ds%28x%29%5Cequiv+r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{x&#92;in P}s(x)&#92;equiv r' title='&#92;sum_{x&#92;in P}s(x)&#92;equiv r' class='latex' /> mod <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' />. </p>
<p>In other words, we replace the condition that the sequence takes values <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' /> by the much weaker condition that it is never zero. Gil calls this the <em>modular conjecture</em>. (He also presented it in a comment on a much earlier EDP post.) </p>
<p>As Gil points out in his post, one can write down a polynomial that is identically zero (mod <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' />) if and only if the modular conjecture is true for <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' />. It is tempting to try to prove that it is zero by analysing its coefficients. More generally, this approach to the problem appears to open the door to a number of algebraic methods.</p>
<p>If you want to prove EDP this way, you have to solve the conjecture for some non-zero <img src='http://s0.wp.com/latex.php?latex=r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r' title='r' class='latex' />. (Since <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> is prime, if you can show it for one then you&#8217;ve shown it for all.) However, the problem with <img src='http://s0.wp.com/latex.php?latex=r%3D0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r=0' title='r=0' class='latex' /> is interesting in its own right, and in particular it seems to be interestingly different from the problem with non-zero <img src='http://s0.wp.com/latex.php?latex=r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r' title='r' class='latex' />. At the time of writing, I don&#8217;t see any way of modifying the EDP examples to obtain exponentially long sequences mod <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> that avoid zero sums &#8212; it seems to me that the upper bound on the length could be significantly smaller. That would be interesting, as it would place constraints on what a proof could look like for non-zero <img src='http://s0.wp.com/latex.php?latex=r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r' title='r' class='latex' />. </p>
<p><strong>A fourth strengthening.</strong></p>
<p>This isn&#8217;t really meant as part of the body of the post, but more of an afterthought. A strengthening of EDP that has been considered since very early on in the project is where you replace a <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' /> sequence by a sequence of unit vectors in a Hilbert space. To be precise, one looks at the following statement.</p>
<p><strong>Conjecture.</strong> Let <img src='http://s0.wp.com/latex.php?latex=v_1%2Cv_2%2C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v_1,v_2,&#92;dots' title='v_1,v_2,&#92;dots' class='latex' /> be a sequence of unit vectors in a Hilbert space and let <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' /> be a real number. Then there exists a HAP <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%5C%7C%5Csum_%7Bn%5Cin+P%7Dv_n%5C%7C%5Cgeq+C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;|&#92;sum_{n&#92;in P}v_n&#92;|&#92;geq C' title='&#92;|&#92;sum_{n&#92;in P}v_n&#92;|&#92;geq C' class='latex' />. </p>
<p>There is something slightly curious about this conjecture, which is that it is very hard to see how having infinitely many dimensions to play with could help one find a counterexample. In some sense, if you use too many dimensions, then it ought to be the case that there will be a HAP <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> such that the vectors <img src='http://s0.wp.com/latex.php?latex=v_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v_n' title='v_n' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=n%5Cin+P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n&#92;in P' title='n&#92;in P' class='latex' /> are pointing in lots of different directions and therefore not cancelling. That makes me wonder whether one can prove that if there is a counterexample to the above conjecture, then there must be a counterexample for some finite-dimensional Hilbert space. Or if that is too much to ask, perhaps there might have to be a counterexample where all the <img src='http://s0.wp.com/latex.php?latex=v_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v_i' title='v_i' class='latex' /> live in some compact set. I think it would be very interesting to think about whether the vague intuition I have just expressed can be made precise. As it stands, it is of course not even close to a proper argument. (I should stress one aspect of what I am saying. If there is any vector sequence <img src='http://s0.wp.com/latex.php?latex=%28v_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(v_n)' title='(v_n)' class='latex' /> of bounded discrepancy, then one can arbitrarily modify <img src='http://s0.wp.com/latex.php?latex=v_p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v_p' title='v_p' class='latex' /> for each prime <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> and the sequence will still have bounded discrepancy. So I&#8217;m not suggesting that every bounded-discrepancy sequence lives, or almost lives, in finite dimensions, but that it can be used to construct one that does.)</p>
<p>As I write that, another question occurs to me. For this one I don&#8217;t even have a vague intuitive argument. Let&#8217;s suppose that we can find a counterexample <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> to the matrix discrepancy question earlier. Suppose also that <img src='http://s0.wp.com/latex.php?latex=f%28m%2Cn%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(m,n)' title='f(m,n)' class='latex' /> takes the form <img src='http://s0.wp.com/latex.php?latex=%5Clangle+v_m%2Cw_n%5Crangle&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;langle v_m,w_n&#92;rangle' title='&#92;langle v_m,w_n&#92;rangle' class='latex' /> for some (not necessarily unit) vectors <img src='http://s0.wp.com/latex.php?latex=v_m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v_m' title='v_m' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=w_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='w_n' title='w_n' class='latex' /> in a Hilbert space. Must there be an example where the <img src='http://s0.wp.com/latex.php?latex=v_m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v_m' title='v_m' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=w_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='w_n' title='w_n' class='latex' /> lie in a finite-dimensional Hilbert space, or at least in a compact subset of a Hilbert space? </p>
<p><strong>A combined generalization.</strong></p>
<p>A second afterthought is that the matrix question has a modular version that might be of some interest. Let <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> be a prime and let <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> be a function from <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BN%7D%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{N}^2' title='&#92;mathbb{N}^2' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BZ%7D_p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{Z}_p' title='&#92;mathbb{Z}_p' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=f%28n%2Cn%29%3D1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(n,n)=1' title='f(n,n)=1' class='latex' /> for every <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' />. Must the sums of <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> on products <img src='http://s0.wp.com/latex.php?latex=P%5Ctimes+Q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P&#92;times Q' title='P&#92;times Q' class='latex' /> take all possible values mod <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' />? What if we merely ask for <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> to take non-zero values on the diagonal?</p>
<p>If the strong modular conjecture is false, then we can turn a counterexample <img src='http://s0.wp.com/latex.php?latex=%28x_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(x_n)' title='(x_n)' class='latex' /> into a diagonal matrix in the obvious way and we get a counterexample to the second question. Indeed, suppose that <img src='http://s0.wp.com/latex.php?latex=f%28n%2Cn%29%3Dx_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(n,n)=x_n' title='f(n,n)=x_n' class='latex' /> for every <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=f%28m%2Cn%29%3D0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(m,n)=0' title='f(m,n)=0' class='latex' /> when <img src='http://s0.wp.com/latex.php?latex=m%5Cne+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m&#92;ne n' title='m&#92;ne n' class='latex' />. Then <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bm%5Cin+P%2Cn%5Cin+Q%7Df%28m%2Cn%29%3D%5Csum_%7Bn%5Cin+P%5Ccap+Q%7Dx_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{m&#92;in P,n&#92;in Q}f(m,n)=&#92;sum_{n&#92;in P&#92;cap Q}x_n' title='&#92;sum_{m&#92;in P,n&#92;in Q}f(m,n)=&#92;sum_{n&#92;in P&#92;cap Q}x_n' class='latex' />, which avoids some value, since <img src='http://s0.wp.com/latex.php?latex=P%5Ccap+Q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P&#92;cap Q' title='P&#92;cap Q' class='latex' /> is a HAP. But with matrices there are many more ways of trying to avoid particular values, so the strong modular matrix conjecture looks like a much stronger statement. Again there are two cases &#8212; avoiding 0 and avoiding a non-zero value &#8212; of which only the second obviously implies EDP. </p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4512/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4512/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4512&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2012/09/06/edp26-three-generalizations/feed/</wfw:commentRss>
		<slash:comments>24</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>EDP25 &#8212; third guest post by Gil Kalai</title>
		<link>http://gowers.wordpress.com/2012/09/04/edp25-third-guest-post-by-gil-kalai/</link>
		<comments>http://gowers.wordpress.com/2012/09/04/edp25-third-guest-post-by-gil-kalai/#comments</comments>
		<pubDate>Tue, 04 Sep 2012 21:33:44 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[polymath5]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4508</guid>
		<description><![CDATA[The Polynomial Method The polynomial method is another basic combinatorial technique that occasionally works. One way to describe the method is as a way to translate a combinatorial statement into the vanishing of a certain polynomial modulo . A demonstration of the method Theorem: Every graph (or hypergraph) G with n vertices and 2n+1 edges [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4508&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<h2>The Polynomial Method</h2>
<p>The polynomial method is another basic combinatorial technique that occasionally works. One way to describe the method is as a way to translate a combinatorial statement into the vanishing of a certain polynomial modulo <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' />.</p>
<h3>A demonstration of the method</h3>
<p><strong>Theorem: </strong>Every graph (or hypergraph) <em>G</em> with <em>n</em> vertices and <em>2n+1</em> edges contains a nontrivial subgraph <em>H</em> with all vertex-degrees divisible by 3.</p>
<p>(This is a theorem of Noga Alon,  Shmuel Friedland, and me from 1984.)</p>
<p><strong>Before the proof</strong>: If we want to get a subgraph with all vertex degrees even then we need<em> n</em> edges (or <em>n+1</em> edges for hypergraphs). This has a simple linear algebra proof which also gives an efficient algorithm.<br />
<span id="more-4508"></span></p>
<p><strong>From-scratch proof sketch: </strong>Associate with every edge <em>e</em> of the graph a variable <img src='http://s0.wp.com/latex.php?latex=x_e&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_e' title='x_e' class='latex' />. Consider the two polynomials</p>
<p><em>P=</em> <img src='http://s0.wp.com/latex.php?latex=%5Cprod_%7Bv+%5Cin+V%28G%29%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;prod_{v &#92;in V(G)}' title='&#92;prod_{v &#92;in V(G)}' class='latex' /> <img src='http://s0.wp.com/latex.php?latex=%28%28%5Csum_%7Be%3A+v+%5Cin+e%7Dx%5E2_e%29%5E2-1%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='((&#92;sum_{e: v &#92;in e}x^2_e)^2-1)' title='((&#92;sum_{e: v &#92;in e}x^2_e)^2-1)' class='latex' />, and</p>
<p>Q=<img src='http://s0.wp.com/latex.php?latex=%5Cprod_%7Be%5Cin+E%28G%29%7D%28x_e%5E2-1%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;prod_{e&#92;in E(G)}(x_e^2-1)' title='&#92;prod_{e&#92;in E(G)}(x_e^2-1)' class='latex' /></p>
<p>If the theorem is false then <em>P-Q=0, </em>as polynomials over the field with three elements. This is impossible since P is a polynomial of degree 4n while Q is a polynomial which has a monomial of degree 4n+2 with nonzero coefficient.</p>
<p>The theorem follows more directly from a theorem of Chevalley-Warning and even more directly from a theorem of Olson, but the above proof serves best our purpose.</p>
<h3>Remarks about the polynomial method:</h3>
<p>1) The polynomial method has many applications but only in specific cases. It is not nearly as widely applicable as, say, the probabilistic method.</p>
<p>2) Good basic references:   A. Blokhuis, Polynomials in finite geometries and combinatorics.  In Keith Walker, editor, <em>Surveys in Combinatorics, 1993</em>, pages 35-52. Cambridge University Press, 1993.</p>
<p align="left">Noga Alon, <a href="http://www.tau.ac.il/~nogaa/PDFS/null2.pdf">Combinatorial Nullstellensatz, Combinatorics, Probability and Computing 8 (1999), 7-29.</a></p>
<p>3) The polynomial method  is related to the &#8220;linear algebra method&#8221; in combinatorics. Often, however, direct linear algebraic proofs lead to efficient algorithms while this is not known for applications of the polynomial method. For example, no polynomial algorithm to find the graph <em>H</em> in the above theorem is known, and there is a related complexity class introduced by Christos Papadimitrou . The polynomial method is closely related to arguments coming from the theory of error-correcting codes, and to arguments in TCS related to interactive proofs and PCP.</p>
<h2>The modular EDP.</h2>
<p>The following is an equivalent way to formulate the Erdős 1932 conjecture that the discrepancy for EDP is unbounded.</p>
<p>1) Consider the sequence <img src='http://s0.wp.com/latex.php?latex=x_1%2Cx_2%2C%5Cdots+&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_1,x_2,&#92;dots ' title='x_1,x_2,&#92;dots ' class='latex' /> as a sequence with <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' /> modulo <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> is a prime that we can choose as large as we want.</p>
<p>2) Then every number modulo <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> can be expressed as a sum of the sequence along a HAP modulo <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' />.</p>
<p>Translating EDP (in this form) into a statement about polynomials modulo <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> is cumbersome. But one thing we may have going for us is that it suggests a natural extension of EDP where the supposed-to-vanish polynomial is simpler.</p>
<p><strong>Modular EDP Conjecture:</strong> Consider a sequence <img src='http://s0.wp.com/latex.php?latex=x_1%2Cx_2%5Cdots+%2Cx_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_1,x_2&#92;dots ,x_n' title='x_1,x_2&#92;dots ,x_n' class='latex' /> of non-zero numbers modulo <em>p</em>. Then if <em>n</em> is sufficiently large w.r.t. <em>p</em>, then every number can be expressed as a sum of the sequence along a HAP modulo <em>p</em>.</p>
<p>As in the original EDP we can consider general sequences or just multiplicative sequences.</p>
<h2>The Polynomial identity required for the modular EDP</h2>
<p>Here is the polynomial identity in <em>n</em> variables <img src='http://s0.wp.com/latex.php?latex=x_1%2Cx_2%2C%5Cdots%2Cx_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_1,x_2,&#92;dots,x_n' title='x_1,x_2,&#92;dots,x_n' class='latex' /> we need to prove over <img src='http://s0.wp.com/latex.php?latex=Z%2FpZ&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Z/pZ' title='Z/pZ' class='latex' /> when <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> grows to infinity with <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> as slow as we wish.  For every <img src='http://s0.wp.com/latex.php?latex=k&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='k' title='k' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=0+%5Cle+k%5Cle+p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='0 &#92;le k&#92;le p' title='0 &#92;le k&#92;le p' class='latex' />,</p>
<blockquote><p>(*) <img src='http://s0.wp.com/latex.php?latex=%5Cprod_%7Bi%3D1%7D%5Enx_i%5Cprod+%5C%7B%28x_d%2Bx_%7B2d%7D%2B%5Ccdots%2Bx_%7Bid%7D%29-k%7E%3A%7Ed%2Ci%3A%7Edi%5Cle+n%5C%7D%7E%3D%7E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;prod_{i=1}^nx_i&#92;prod &#92;{(x_d+x_{2d}+&#92;cdots+x_{id})-k~:~d,i:~di&#92;le n&#92;}~=~0' title='&#92;prod_{i=1}^nx_i&#92;prod &#92;{(x_d+x_{2d}+&#92;cdots+x_{id})-k~:~d,i:~di&#92;le n&#92;}~=~0' class='latex' /></p></blockquote>
<p>These polynomials are not familiar but they are related to generating functions which arise in permutation statistics. In particular, when we look at the product</p>
<blockquote><p><img src='http://s0.wp.com/latex.php?latex=%5Cprod_%7Bi%3D1%7D%5En%28x_1%2Bx_2%2B%5Cdots%2Bx_i%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;prod_{i=1}^n(x_1+x_2+&#92;dots+x_i)' title='&#92;prod_{i=1}^n(x_1+x_2+&#92;dots+x_i)' class='latex' /></p></blockquote>
<p>and expand it to monomials, the coefficients have a combinatorial meaning in terms of permutations and inversions.</p>
<p>Given a permutation <img src='http://s0.wp.com/latex.php?latex=%5Cpi+%281%29%2C%5Cpi+%282%29%2C%5Cdots%2C%5Cpi+%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pi (1),&#92;pi (2),&#92;dots,&#92;pi (n)' title='&#92;pi (1),&#92;pi (2),&#92;dots,&#92;pi (n)' class='latex' />, and an integer <img src='http://s0.wp.com/latex.php?latex=i%2C%7E1%5Cle+i%5Cle+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='i,~1&#92;le i&#92;le n' title='i,~1&#92;le i&#92;le n' class='latex' />  we can ask how many inversions are there between <img src='http://s0.wp.com/latex.php?latex=i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='i' title='i' class='latex' /> and a smaller integer.  This is a number between 1 and <img src='http://s0.wp.com/latex.php?latex=i-1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='i-1' title='i-1' class='latex' />.</p>
<p>The coefficient of   <img src='http://s0.wp.com/latex.php?latex=x_1%5E%7Bi_1%7Dx_2%5E%7Bi_2%7D%5Cdots+x_n%5E%7Bi_n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_1^{i_1}x_2^{i_2}&#92;dots x_n^{i_n}' title='x_1^{i_1}x_2^{i_2}&#92;dots x_n^{i_n}' class='latex' /> in the above product is the number of permutations where there are <img title="i_j" src="http://s0.wp.com/latex.php?latex=i_j&amp;bg=ffffff&amp;fg=333333&amp;s=0" alt="i_j" /> integers contributing j inversions. The proposed identity (*) may be expressed in terms of modular properties of such permutation statistics.</p>
<blockquote><p><strong>Challenge: <em>Prove the modular EDP using the polynomial method.</em></strong></p></blockquote>
<h2>What does the LDH tell us about the modular EDP?</h2>
<p>It is especially easy to apply the large deviation heuristic to the modular version of EDP. Suppose we want to compute the probability that all HAP-sums miss the outcome <img src='http://s0.wp.com/latex.php?latex=y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y' title='y' class='latex' />.</p>
<p>Given <img src='http://s0.wp.com/latex.php?latex=x_1%2Bx_2%2B%5Cdots+%2Bx_r+%5Cne+y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_1+x_2+&#92;dots +x_r &#92;ne y' title='x_1+x_2+&#92;dots +x_r &#92;ne y' class='latex' />, the probability that <img src='http://s0.wp.com/latex.php?latex=x_1%2B%5Cdots%2Bx_r%2Bx_%7Br-1%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_1+&#92;dots+x_r+x_{r-1}' title='x_1+&#92;dots+x_r+x_{r-1}' class='latex' /> is not <img src='http://s0.wp.com/latex.php?latex=y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y' title='y' class='latex' /> is <img src='http://s0.wp.com/latex.php?latex=1-1%2F%28p-1%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1-1/(p-1)' title='1-1/(p-1)' class='latex' />.   So we are interested in the value of <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=%281-1%2F%28p-1%29%29%5E%7Bnlogn%7D+%3D+%28p-1%29%5E%7B-n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(1-1/(p-1))^{nlogn} = (p-1)^{-n}' title='(1-1/(p-1))^{nlogn} = (p-1)^{-n}' class='latex' />. (Restricting our attention to multiplicative sequences will divide the exponents on both sides by <img src='http://s0.wp.com/latex.php?latex=%5Clog+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;log n' title='&#92;log n' class='latex' />.) Solving this equation gives us <img src='http://s0.wp.com/latex.php?latex=p%3D%5Clog+n+%5Clog%5Clog+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p=&#92;log n &#92;log&#92;log n' title='p=&#92;log n &#92;log&#92;log n' class='latex' />. The LDH heuristic comes with a firm prediction and a weak prediction. In this case the LDH gives</p>
<p>a) (Firm prediction) There are sequences violating the modular EDP when <img src='http://s0.wp.com/latex.php?latex=p%3E+%5Clog+n%5Clog%5Clog+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p&gt; &#92;log n&#92;log&#92;log n' title='p&gt; &#92;log n&#92;log&#92;log n' class='latex' />.</p>
<p>b) (Weak prediction) There are no such sequences when <img src='http://s0.wp.com/latex.php?latex=p%3C%3C%5Clog+n%5Clog%5Clog+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p&lt;&lt;&#92;log n&#92;log&#92;log n' title='p&lt;&lt;&#92;log n&#92;log&#92;log n' class='latex' />.</p>
<p>The firm prediction is correct by the log<em>n</em> discrepancy constructions for EDP and as a matter of fact the LDH itself gives an even stronger prediction of <img src='http://s0.wp.com/latex.php?latex=%5Csqrt+%7B%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt {&#92;log n}' title='&#92;sqrt {&#92;log n}' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=%5Cpm1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm1' title='&#92;pm1' class='latex' />-sequences. By restricting our attention to <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' /> sequences we see that the weak prediction is incorrect and LDH for the modular EDP is blind to the special substructure of <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' /> sequences. Note that the firm conjecture is far from being known when we extend the modular EDP and replace all integers by a random subset of integers, or by square-free integers , or by SCJ-systems of integers etc.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4508/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4508/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4508&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2012/09/04/edp25-third-guest-post-by-gil-kalai/feed/</wfw:commentRss>
		<slash:comments>59</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>

		<media:content url="http://s0.wp.com/latex.php?latex=i_j&#38;bg=ffffff&#38;fg=333333&#38;s=0" medium="image">
			<media:title type="html">i_j</media:title>
		</media:content>
	</item>
		<item>
		<title>EDP24 &#8212; an attempt to get back into the diagonal-decomposition approach</title>
		<link>http://gowers.wordpress.com/2012/08/31/edp24-an-attempt-to-get-back-into-the-diagonal-decomposition-approach/</link>
		<comments>http://gowers.wordpress.com/2012/08/31/edp24-an-attempt-to-get-back-into-the-diagonal-decomposition-approach/#comments</comments>
		<pubDate>Fri, 31 Aug 2012 21:39:04 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[polymath5]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4440</guid>
		<description><![CDATA[Gil has a not quite finished third post that will appear soon on this blog. Meanwhile, here are a few thoughts I&#8217;ve had recently as I tried to get my brain into EDP mode again. The approach to the Erd&#337;s discrepancy problem that, rightly or wrongly, I found most promising when we were last working [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4440&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Gil has a not quite finished third post that will appear soon on this blog. Meanwhile, here are a few thoughts I&#8217;ve had recently as I tried to get my brain into EDP mode again.</p>
<p>The approach to the Erd&#337;s discrepancy problem that, rightly or wrongly, I found most promising when we were last working on it was to prove a certain statement about matrices that can be shown quite easily to imply a positive solution to the problem. In this post, I&#8217;m going to treat that matrix statement as <em>the</em> problem, and think about how one might go about trying to prove it. I&#8217;ll give the brief explanation of why it implies EDP, but not of what the possible advantages of the approach are (discussions of which can be found in some of the earlier material).<br />
<span id="more-4440"></span></p>
<p>First I&#8217;ll need to recall a couple of definitions and some notation. A <em>homogeneous arithmetic progression</em>, or HAP, is a set of the form <img src='http://s0.wp.com/latex.php?latex=%5C%7Bd%2C2d%2C%5Cdots%2Crd%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{d,2d,&#92;dots,rd&#92;}' title='&#92;{d,2d,&#92;dots,rd&#92;}' class='latex' /> for some pair of positive integers <img src='http://s0.wp.com/latex.php?latex=r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r' title='r' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=d&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d' title='d' class='latex' />. Let me say that <img src='http://s0.wp.com/latex.php?latex=u&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u' title='u' class='latex' /> is a <em>HAP-function</em> if it is the characteristic function of a HAP. Finally, if <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=g&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g' title='g' class='latex' /> are functions defined on <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BN%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{N}' title='&#92;mathbb{N}' class='latex' />, let <img src='http://s0.wp.com/latex.php?latex=f%5Cotimes+g&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f&#92;otimes g' title='f&#92;otimes g' class='latex' /> denote the function defined on <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BN%7D%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{N}^2' title='&#92;mathbb{N}^2' class='latex' /> that takes the value <img src='http://s0.wp.com/latex.php?latex=f%28x%29g%28y%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(x)g(y)' title='f(x)g(y)' class='latex' /> at <img src='http://s0.wp.com/latex.php?latex=%28x%2Cy%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(x,y)' title='(x,y)' class='latex' />, which can be thought of as an infinite matrix. </p>
<p>Here, then, is the statement that it would be great to prove. What is nice (but also quite daunting) about it is that it is a pure existence statement. </p>
<p><strong>Problem.</strong> Prove that for every <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' /> there exists a real matrix decomposition <img src='http://s0.wp.com/latex.php?latex=D%3D%5Csum_i%5Clambda_iu_i%5Cotimes+v_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='D=&#92;sum_i&#92;lambda_iu_i&#92;otimes v_i' title='D=&#92;sum_i&#92;lambda_iu_i&#92;otimes v_i' class='latex' /> with the following properties.</p>
<p>(i) <img src='http://s0.wp.com/latex.php?latex=D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='D' title='D' class='latex' /> is a diagonal matrix and <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bx%3D1%7D%5E%5Cinfty+D_%7Bxx%7D%5Cgeq+C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{x=1}^&#92;infty D_{xx}&#92;geq C' title='&#92;sum_{x=1}^&#92;infty D_{xx}&#92;geq C' class='latex' />.</p>
<p>(ii) Each <img src='http://s0.wp.com/latex.php?latex=u_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u_i' title='u_i' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=v_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v_i' title='v_i' class='latex' /> is a HAP-function.</p>
<p>(iii) <img src='http://s0.wp.com/latex.php?latex=%5Csum_i%7C%5Clambda_i%7C%5Cleq+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_i|&#92;lambda_i|&#92;leq 1' title='&#92;sum_i|&#92;lambda_i|&#92;leq 1' class='latex' />.</p>
<p>I have stated the problem in this form because (for a reason I will outline below) I am confident that such a decomposition exists. However, it seems to be quite hard to find.</p>
<p>Why would the existence of an efficient decomposition of a diagonal matrix into products of HAP-functions prove EDP? Well, let <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon%3D%28%5Cepsilon_1%2C%5Cepsilon_2%2C%5Cdots%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon=(&#92;epsilon_1,&#92;epsilon_2,&#92;dots)' title='&#92;epsilon=(&#92;epsilon_1,&#92;epsilon_2,&#92;dots)' class='latex' /> be a <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' />-sequence and consider the quantity <img src='http://s0.wp.com/latex.php?latex=%5Clangle+%5Cepsilon%2C+D%5Cepsilon%5Crangle&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;langle &#92;epsilon, D&#92;epsilon&#92;rangle' title='&#92;langle &#92;epsilon, D&#92;epsilon&#92;rangle' class='latex' />. On the one hand it equals </p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Csum_x+D_%7Bxx%7D%5Cepsilon_x%5E2%3D%5Csum_x+D_%7Bxx%7D%5Cgeq+C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;displaystyle &#92;sum_x D_{xx}&#92;epsilon_x^2=&#92;sum_x D_{xx}&#92;geq C' title='&#92;displaystyle &#92;sum_x D_{xx}&#92;epsilon_x^2=&#92;sum_x D_{xx}&#92;geq C' class='latex' />, </p>
<p>and on the other hand it equals </p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Csum_i%5Clambda_i+%5Clangle+%5Cepsilon%2Cu_i%5Crangle%5Clangle%5Cepsilon%2Cv_i%5Crangle&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;displaystyle &#92;sum_i&#92;lambda_i &#92;langle &#92;epsilon,u_i&#92;rangle&#92;langle&#92;epsilon,v_i&#92;rangle' title='&#92;displaystyle &#92;sum_i&#92;lambda_i &#92;langle &#92;epsilon,u_i&#92;rangle&#92;langle&#92;epsilon,v_i&#92;rangle' class='latex' />.</p>
<p>Since <img src='http://s0.wp.com/latex.php?latex=%5Csum_i%7C%5Clambda_i%7C%5Cleq+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_i|&#92;lambda_i|&#92;leq 1' title='&#92;sum_i|&#92;lambda_i|&#92;leq 1' class='latex' />, there must exist <img src='http://s0.wp.com/latex.php?latex=i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='i' title='i' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%7C%5Clangle+%5Cepsilon%2Cu_i%5Crangle%5Clangle%5Cepsilon%2Cv_i%5Crangle%7C%5Cgeq+C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|&#92;langle &#92;epsilon,u_i&#92;rangle&#92;langle&#92;epsilon,v_i&#92;rangle|&#92;geq C' title='|&#92;langle &#92;epsilon,u_i&#92;rangle&#92;langle&#92;epsilon,v_i&#92;rangle|&#92;geq C' class='latex' />, from which it follows that there exists a HAP <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%7C%5Csum_%7Bx%5Cin+P%7D%5Cepsilon_x%7C%5Cgeq+C%5E%7B1%2F2%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|&#92;sum_{x&#92;in P}&#92;epsilon_x|&#92;geq C^{1/2}' title='|&#92;sum_{x&#92;in P}&#92;epsilon_x|&#92;geq C^{1/2}' class='latex' />. This shows that <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon' title='&#92;epsilon' class='latex' /> must have HAP-discrepancy at least <img src='http://s0.wp.com/latex.php?latex=C%5E%7B1%2F2%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C^{1/2}' title='C^{1/2}' class='latex' /> for every <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />, and we are done.</p>
<p>Of course, one can deduce anything from a false premise, so one needs at least some reason to believe that the decomposition exists. Quite a good reason is that the existence of the decomposition can be shown (without too much difficulty) to be equivalent to the following generalization of EDP.</p>
<p><strong>Problem.</strong> Show that if <img src='http://s0.wp.com/latex.php?latex=%28a_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a_n)' title='(a_n)' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%28b_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(b_n)' title='(b_n)' class='latex' /> are any two sequences of vectors in a Hilbert space such that <img src='http://s0.wp.com/latex.php?latex=%5Clangle+a_n%2Cb_n%5Crangle%3D1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;langle a_n,b_n&#92;rangle=1' title='&#92;langle a_n,b_n&#92;rangle=1' class='latex' /> for every <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' />, then for every constant <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' /> there exist HAPs <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=Q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Q' title='Q' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%7C%5Clangle%5Csum_%7Bn%5Cin+P%7Da_n%2C%5Csum_%7Bm%5Cin+Q%7Db_m%5Crangle%7C%5Cgeq+C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|&#92;langle&#92;sum_{n&#92;in P}a_n,&#92;sum_{m&#92;in Q}b_m&#92;rangle|&#92;geq C' title='|&#92;langle&#92;sum_{n&#92;in P}a_n,&#92;sum_{m&#92;in Q}b_m&#92;rangle|&#92;geq C' class='latex' />.</p>
<p>This is a strengthening of EDP, since it clearly implies EDP when the Hilbert space is <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BR%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{R}' title='&#92;mathbb{R}' class='latex' />. However, it doesn&#8217;t seem like such a huge strengthening that it might be false even if EDP is true. At any rate, it seems just as hard to find a counterexample to this as it does to EDP itself. (You might ask why I believe that EDP is true. It&#8217;s partly blind faith, but the heuristics presented by Gil Kalai in the previous post offer some support, as does &#8212; very weakly &#8212; the experimental evidence.)</p>
<p><strong>How does one ever solve existence problems?</strong></p>
<p>Let&#8217;s begin by thinking very generally about the situation we&#8217;re in: we have to come up with a mathematical object <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> that has certain properties <img src='http://s0.wp.com/latex.php?latex=P_1%2C%5Cdots%2CP_k&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P_1,&#92;dots,P_k' title='P_1,&#92;dots,P_k' class='latex' />. (Of course, we could combine those into one property, but it is often more natural to think of the object as having several different properties simultaneously.)</p>
<p>I&#8217;m going to list strategies until I can&#8217;t (or at least can&#8217;t immediately) think of any more that seem to have any chance of success. (An example of a strategy that I think has no chance of success for this problem is to search for a just-do-it proof, the problem being that the constraints appear to be quite delicate, whereas a just-do-it proof works better when one has a number of independent constraints that are reasonably easy to satisfy simultaneously.)</p>
<p>1. Use an object you already know, possibly with some small modifications.</p>
<p>2. Start with examples that work for a certain constant <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />, and  use them as building blocks for larger and more complicated examples that work with a larger constant <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />.</p>
<p>3. Use some kind of duality to prove that if an example does not exist, then some other object <em>does</em> exist, and show that in fact the second object does not exist.</p>
<p>4. Use a computer to search for small examples, try to discern a pattern in those examples, use that pattern to guess how to construct larger examples, and check that the guess works.</p>
<p>5. Strengthen the conditions on the object until there is a unique (or almost unique) object that satisfies those conditions.</p>
<p>6. Weaken the conditions on the object (e.g. by forgetting about some of the properties <img src='http://s0.wp.com/latex.php?latex=P_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P_i' title='P_i' class='latex' />), try to describe all objects that satisfy the weaker conditions, and then reformulate the problem as that of finding an object of that description that satisfies the stronger conditions. [As a very simple example, if you were required to find a positive solution of some quadratic equation, you would solve the equation without the positivity condition and then pick out a positive root.]</p>
<p>7. Assume that you have an object with the required properties, deduce some interesting further properties, find a natural object with the further properties, and hope that it has the original properties as well. [This is a variant of 6.]</p>
<p><strong>How do the general strategies fare when it comes to EDP?</strong></p>
<p>I will discuss them briefly at first, before concentrating in more detail on a few of them.</p>
<p>1. This doesn&#8217;t appear to be the kind of problem where there is already some matrix decomposition (or related mathematical object) waiting in the wings to be modified so that it becomes the example we want. Or if there is, the only way we are likely to discover it is by pinning down more precisely the properties we are interested in so that someone somewhere says, &#8220;Hang on, that sounds like X.&#8221; But I think it is very unlikely that we will discover an example that way.</p>
<p>2. I am definitely interested in the idea of starting with small examples and using them to build bigger examples. However, an approach like this needs plenty of thought: non-trivial small examples seem hard to come by, and the only obvious method for building bigger ones is taking linear combinations of smaller ones. (A trivial small example is to take a linear combination of the form <img src='http://s0.wp.com/latex.php?latex=%5Csum_i%5Clambda_iu_i%5Cotimes+u_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_i&#92;lambda_iu_i&#92;otimes u_i' title='&#92;sum_i&#92;lambda_iu_i&#92;otimes u_i' class='latex' /> where each <img src='http://s0.wp.com/latex.php?latex=u_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u_i' title='u_i' class='latex' /> is the characteristic function of a singleton. This gives us <img src='http://s0.wp.com/latex.php?latex=C%3D1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C=1' title='C=1' class='latex' />. To get anything interesting, we need the <img src='http://s0.wp.com/latex.php?latex=u_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u_i' title='u_i' class='latex' /> to be HAP-functions for longer HAPs, but then it is very difficult to get the off-diagonal terms to cancel.)</p>
<p>3. We are trying to write <img src='http://s0.wp.com/latex.php?latex=D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='D' title='D' class='latex' /> as an efficient linear combination of objects of a certain form. That is a classic situation for applying the Hahn-Banach theorem: if there is no such linear combination, then there must be a separating functional. What can we say about that functional? </p>
<p>This approach seems promising at first, but is in fact of no use at all, since if you pursue it, the conclusion you come to is that if there is no such linear combination, then there are sequences of unit vectors <img src='http://s0.wp.com/latex.php?latex=%28a_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a_n)' title='(a_n)' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%28b_m%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(b_m)' title='(b_m)' class='latex' /> in a Hilbert space and a constant <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%7C%5Clangle%5Csum_%7Bn%5Cin+P%7Da_n%2C%5Csum_%7Bm%5Cin+Q%7Db_m%5Crangle%7C%5Cleq+C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|&#92;langle&#92;sum_{n&#92;in P}a_n,&#92;sum_{m&#92;in Q}b_m&#92;rangle|&#92;leq C' title='|&#92;langle&#92;sum_{n&#92;in P}a_n,&#92;sum_{m&#92;in Q}b_m&#92;rangle|&#92;leq C' class='latex' /> for every pair of HAPs <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=Q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Q' title='Q' class='latex' />. The reason duality doesn&#8217;t help is that we used duality to reformulate EDP as a problem about decomposing diagonal matrices. If we use duality again, we get back to (a generalization of) EDP.</p>
<p>4. We have tried this. Moses Charikar and others have written programs to search for the most efficient decompositions (for this and for a related problem). The results are useful in that they give one some feel for what the diagonal matrix <img src='http://s0.wp.com/latex.php?latex=D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='D' title='D' class='latex' /> needs to be like, but it does not seem to be possible to obtain a large enough matrix to use it to go as far as guessing a formula. For that, more theoretical methods will be necessary (but knowing roughly what the answer needs to look like should make finding those methods quite a bit quicker). </p>
<p>5. One difficulty is that we are not looking for a unique object: it would be nice if the properties we were trying to obtain determined the matrix and its decomposition uniquely, since then we could hope to <em>calculate</em> them. An obvious property to add is that the object is in some sense extremal (though what that sense is is not completely obvious). Another thing we can do is insist on various symmetries &#8212; something that has already been tried. Yet another is to restrict the class of HAPs that may be used: as far as we can tell at the moment, we can get away with HAPs with common differences that are all either prime or a power of 2. (I say that simply because it seems to be hard to find a sequence that has bounded discrepancy on all such HAPs. Gil&#8217;s heuristics suggest that the discrepancy should grow like <img src='http://s0.wp.com/latex.php?latex=%5Clog%5Clog+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;log&#92;log n' title='&#92;log&#92;log n' class='latex' /> for sequences of length <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' />.) </p>
<p>6. What weaker properties might we go for? One idea is simply to try to find a linear combination of products of HAP-functions that leads to a considerable amount of cancellation off the diagonal, even if there are still off-diagonal terms left. This would be interesting because if one naively tries to find a decomposition that works, it seems to be very hard to get a substantial amount of cancellation (except if very small HAPs are used, in which case the resulting decomposition is not very informative). Another (somewhat related) idea is to look for interesting decompositions but with not particularly large values of <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />. Yet another might be to allow a wider class of functions that HAP-functions. </p>
<p>7. I don&#8217;t have much idea of how to apply this strategy to the entire decomposition, but there is plenty one can say about the diagonal matrix that ones wishes to decompose, and that information, if it could be expressed neatly, would surely be very useful: it seems much easier to try to decompose a specific diagonal matrix than to try to decompose a diagonal matrix that you have to choose first, when the vast majority of diagonal matrices <em>cannot</em> be efficiently decomposed. </p>
<p>The proof that the diagonal decomposition implies EDP actually proves a much stronger result. It gives us a function <img src='http://s0.wp.com/latex.php?latex=d%28n%29%3DD_%7Bnn%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(n)=D_{nn}' title='d(n)=D_{nn}' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%5Csum_nd%28n%29%3D%5Cinfty&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_nd(n)=&#92;infty' title='&#92;sum_nd(n)=&#92;infty' class='latex' /> such that if <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon%3D%28%5Cepsilon_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon=(&#92;epsilon_n)' title='&#92;epsilon=(&#92;epsilon_n)' class='latex' /> is any sequence of real numbers (not necessarily <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' />-valued) with <img src='http://s0.wp.com/latex.php?latex=%5Csum_nd%28n%29%5Cepsilon_n%5E2%5Cgeq+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_nd(n)&#92;epsilon_n^2&#92;geq 1' title='&#92;sum_nd(n)&#92;epsilon_n^2&#92;geq 1' class='latex' />, then <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon' title='&#92;epsilon' class='latex' /> has unbounded HAP-discrepancy. To put that loosely, the sequence <img src='http://s0.wp.com/latex.php?latex=%28d%28n%29%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(d(n))' title='(d(n))' class='latex' /> cannot correlate with the square of any sequence of bounded discrepancy. That puts very strong conditions on <img src='http://s0.wp.com/latex.php?latex=%28d%28n%29%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(d(n))' title='(d(n))' class='latex' />, but so far we have not fully understood what those conditions are. It seems that we should be able to make progress on this, perhaps even adding reasonable conditions that would allow us determine the function uniquely.</p>
<p><strong>Creating new examples out of old ones.</strong></p>
<p>Suppose that <img src='http://s0.wp.com/latex.php?latex=D%3D%5Csum_i%5Clambda_iu_i%5Cotimes+v_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='D=&#92;sum_i&#92;lambda_iu_i&#92;otimes v_i' title='D=&#92;sum_i&#92;lambda_iu_i&#92;otimes v_i' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=D%27%3D%5Csum_j%5Cmu_jw_j%5Cotimes+z_j&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='D&#039;=&#92;sum_j&#92;mu_jw_j&#92;otimes z_j' title='D&#039;=&#92;sum_j&#92;mu_jw_j&#92;otimes z_j' class='latex' />. Clearly if we take a linear combination of <img src='http://s0.wp.com/latex.php?latex=D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='D' title='D' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=D%27&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='D&#039;' title='D&#039;' class='latex' />, we get another decomposition of a diagonal matrix, and if the original decompositions were into products of HAPs then so is the new one.</p>
<p>Can we do anything more interesting? Another possibility is to look for some kind of product. Now Dirichlet convolutions are natural objects to look at in the context of EDP, since the expression <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bd%7Cn%7Df%28d%29g%28n%2Fd%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{d|n}f(d)g(n/d)' title='&#92;sum_{d|n}f(d)g(n/d)' class='latex' /> can be thought of as follows. You start with a function <img src='http://s0.wp.com/latex.php?latex=g&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g' title='g' class='latex' /> defined on <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BN%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{N}' title='&#92;mathbb{N}' class='latex' />, then you define the dilate <img src='http://s0.wp.com/latex.php?latex=g_d&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g_d' title='g_d' class='latex' /> to be the function that takes the value <img src='http://s0.wp.com/latex.php?latex=g%28r%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g(r)' title='g(r)' class='latex' /> at <img src='http://s0.wp.com/latex.php?latex=rd&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='rd' title='rd' class='latex' /> and 0 at non-multiples of <img src='http://s0.wp.com/latex.php?latex=d&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d' title='d' class='latex' />, and finally you take a linear combination <img src='http://s0.wp.com/latex.php?latex=%5Csum_df%28d%29g_d&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_df(d)g_d' title='&#92;sum_df(d)g_d' class='latex' /> of those dilates. If <img src='http://s0.wp.com/latex.php?latex=g&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g' title='g' class='latex' /> is identically 1, then this is taking a linear combination of HAPs.</p>
<p>We can&#8217;t take a Dirichlet product of two matrices, but we can do something similar: given functions <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=g&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g' title='g' class='latex' /> of two integer variables we can take the function <img src='http://s0.wp.com/latex.php?latex=h%28x%2Cy%29%3D%5Csum_%7Bd%7Cx%2Cd%27%7Cy%7Df%28d%2Cd%27%29g%28x%2Fd%2Cy%2Fd%27%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='h(x,y)=&#92;sum_{d|x,d&#039;|y}f(d,d&#039;)g(x/d,y/d&#039;)' title='h(x,y)=&#92;sum_{d|x,d&#039;|y}f(d,d&#039;)g(x/d,y/d&#039;)' class='latex' />. If <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=g&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g' title='g' class='latex' /> are diagonal (meaning that they are zero if <img src='http://s0.wp.com/latex.php?latex=x%5Cne+y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;ne y' title='x&#92;ne y' class='latex' />), then the terms of the sum are zero unless <img src='http://s0.wp.com/latex.php?latex=d%3Dd%27&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d=d&#039;' title='d=d&#039;' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=x%3Dy&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x=y' title='x=y' class='latex' />, so in this case <img src='http://s0.wp.com/latex.php?latex=h%28x%2Cx%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='h(x,x)' title='h(x,x)' class='latex' />, considered as a function of <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> only, is the Dirichlet convolution of <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=g&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g' title='g' class='latex' />, also considered as functions of one variable.</p>
<p>This operation also respects products in the following sense. If <img src='http://s0.wp.com/latex.php?latex=f%3Du%5Cotimes+v&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f=u&#92;otimes v' title='f=u&#92;otimes v' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=g%3Dw%5Cotimes+z&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='g=w&#92;otimes z' title='g=w&#92;otimes z' class='latex' />, then </p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+h%28x%2Cy%29%3D%5Csum_%7Bd%7Cx%2Cd%27%7Cy%7Du%28d%29v%28d%27%29w%28x%2Fd%29z%28y%2Fd%27%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;displaystyle h(x,y)=&#92;sum_{d|x,d&#039;|y}u(d)v(d&#039;)w(x/d)z(y/d&#039;)' title='&#92;displaystyle h(x,y)=&#92;sum_{d|x,d&#039;|y}u(d)v(d&#039;)w(x/d)z(y/d&#039;)' class='latex' />,</p>
<p>which equals <img src='http://s0.wp.com/latex.php?latex=p%28x%29q%28y%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p(x)q(y)' title='p(x)q(y)' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> is the Dirichlet convolution of <img src='http://s0.wp.com/latex.php?latex=u&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u' title='u' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=w&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='w' title='w' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='q' title='q' class='latex' /> is the Dirichlet convolution of <img src='http://s0.wp.com/latex.php?latex=v&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v' title='v' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=z&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='z' title='z' class='latex' />. </p>
<p>One further obvious fact is that the operation is bilinear in the two matrices that it is applied to.</p>
<p>So far everything is working swimmingly: out of two decompositions of diagonal matrices into linear combinations of products we build another one that has some number-theoretic significance. However, there is a problem: a Dirichlet convolution of two HAP-functions is not an HAP-function except in trivial cases.</p>
<p>How much does that matter? Without doing some detailed calculations I don&#8217;t know. Here are a couple of simple observations. Let <img src='http://s0.wp.com/latex.php?latex=u&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u' title='u' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=v&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v' title='v' class='latex' /> be two HAP-functions. Then their Dirichlet convolution is a sum of dilates of <img src='http://s0.wp.com/latex.php?latex=v&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v' title='v' class='latex' />, one for each point in the HAP of which <img src='http://s0.wp.com/latex.php?latex=u&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u' title='u' class='latex' /> is the characteristic function. So we can at least decompose the Dirichlet convolution of two HAP-functions as a sum of HAP-functions. The sum of the coefficients is the number of points in the HAP corresponding to <img src='http://s0.wp.com/latex.php?latex=u&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u' title='u' class='latex' /> &#8212; though since the situation is symmetrical, we can add dilates of <img src='http://s0.wp.com/latex.php?latex=u&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u' title='u' class='latex' /> instead, so it is better to say that the sum of the coefficients can be taken to be the length of the smaller of the two HAPs. </p>
<p>That doesn&#8217;t sound very efficient, but we must also remember that when we take the Dirichlet convolution of two diagonal matrices, we obtain a diagonal matrix that sums to the product of the sums of the original two matrices. So there may be some gain there to compensate for the loss of efficiency. That is why I say that more detailed calculations are necessary, a point I will return to at some stage.</p>
<p><strong>What can we say about the diagonal matrix <img src='http://s0.wp.com/latex.php?latex=D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='D' title='D' class='latex' />?</strong></p>
<p>As I have already mentioned, if a diagonal decomposition of the required kind exists, then we can place strong constraints on the diagonal matrix <img src='http://s0.wp.com/latex.php?latex=D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='D' title='D' class='latex' /> itself. Writing <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta(n)' title='&#92;delta(n)' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=D_%7Bnn%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='D_{nn}' title='D_{nn}' class='latex' />, it must have the following property: if <img src='http://s0.wp.com/latex.php?latex=s&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s' title='s' class='latex' /> is any real-valued function defined on the integers such that <img src='http://s0.wp.com/latex.php?latex=%5Csum_n%5Cdelta%28n%29s%28n%29%5E2%5Cgeq+C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_n&#92;delta(n)s(n)^2&#92;geq C' title='&#92;sum_n&#92;delta(n)s(n)^2&#92;geq C' class='latex' />, then the HAP-discrepancy of <img src='http://s0.wp.com/latex.php?latex=s&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s' title='s' class='latex' /> must be at least <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7BC%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{C}' title='&#92;sqrt{C}' class='latex' />. (This follows from the proof that the existence of the decomposition for a diagonal matrix with <img src='http://s0.wp.com/latex.php?latex=%5Csum_n%5Cdelta%28n%29%5Cgeq+C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_n&#92;delta(n)&#92;geq C' title='&#92;sum_n&#92;delta(n)&#92;geq C' class='latex' /> implies that every <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' />-sequence has HAP-discrepancy at least <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7BC%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{C}' title='&#92;sqrt{C}' class='latex' />.)</p>
<p>Turning that round, if we have any real-valued sequence <img src='http://s0.wp.com/latex.php?latex=s%281%29%2Cs%282%29%2C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s(1),s(2),&#92;dots' title='s(1),s(2),&#92;dots' class='latex' /> of discrepancy at most <img src='http://s0.wp.com/latex.php?latex=T&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='T' title='T' class='latex' />, then <img src='http://s0.wp.com/latex.php?latex=%5Csum_n%5Cdelta%28n%29s%28n%29%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_n&#92;delta(n)s(n)^2' title='&#92;sum_n&#92;delta(n)s(n)^2' class='latex' /> cannot be more than <img src='http://s0.wp.com/latex.php?latex=T%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='T^2' title='T^2' class='latex' />.</p>
<p>A simple example of the kind of conclusion one can draw from that is that the sum of <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta(n)' title='&#92;delta(n)' class='latex' /> over all <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' />s that are not multiples of 3 is at most 1. This follows from the fact that the sequence <img src='http://s0.wp.com/latex.php?latex=1%2C-1%2C0%2C1%2C-1%2C0%2C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1,-1,0,1,-1,0,&#92;dots' title='1,-1,0,1,-1,0,&#92;dots' class='latex' /> has HAP-discrepancy 1. More generally, if <img src='http://s0.wp.com/latex.php?latex=%5Csum_n%5Cdelta%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_n&#92;delta(n)' title='&#92;sum_n&#92;delta(n)' class='latex' /> is large, then for all small <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> most of the largeness must occur on multiples of <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' />.</p>
<p>In qualitative terms, this suggests that <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> ought to be concentrated at numbers with many factors, and the experimental evidence backs this up, though not quite as cleanly as one might ideally like. (However, &#8220;edge effects&#8221; could well account for some of the peculiar features that are observed experimentally, so I still think it is worth trying to find a very aesthetically satisfying diagonal matrix <img src='http://s0.wp.com/latex.php?latex=D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='D' title='D' class='latex' />.)</p>
<p>Here is another reason to expect that <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta(n)' title='&#92;delta(n)' class='latex' /> should be larger for numbers with many factors: they are contained in lots of HAPs. We are trying to find a measure on <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BN%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{N}' title='&#92;mathbb{N}' class='latex' /> (at least if we insist that <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> takes non-negative values, which seems a reasonable extra condition to try to impose) such that any function with a large <img src='http://s0.wp.com/latex.php?latex=L_2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='L_2' title='L_2' class='latex' />-norm with respect to that measure must have a large HAP-discrepancy. What might make it difficult to find a function with large <img src='http://s0.wp.com/latex.php?latex=L_2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='L_2' title='L_2' class='latex' />-norm and small HAP-discrepancy? It would be that we had lots of constraints on the values taken by the functions. And the natural way that we can place lots of constraints on one value <img src='http://s0.wp.com/latex.php?latex=s%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s(n)' title='s(n)' class='latex' /> is if <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> has many factors and is therefore contained in many HAPs.</p>
<p>As an example of the opposite phenomenon, suppose we take an arbitrary <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' /> sequence and alter its values however we like at all primes. What will happen to its HAP-discrepancy? Well, each HAP contains at most one prime, so we cannot have changed the HAP-discrepancy by more than 2. This illustrates fairly dramatically how little it matters what a sequence does at numbers with few factors.</p>
<p><strong>A first wild guess.</strong></p>
<p>It is tempting to jump straight from this kind of observation to a guess of a function <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta(n)' title='&#92;delta(n)' class='latex' /> that might work. For example, what if we took some primes <img src='http://s0.wp.com/latex.php?latex=p_1%2C%5Cdots%2Cp_m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_1,&#92;dots,p_m' title='p_1,&#92;dots,p_m' class='latex' />, a collection <img src='http://s0.wp.com/latex.php?latex=%5Cmathcal%7BA%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathcal{A}' title='&#92;mathcal{A}' class='latex' /> of subsets of <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C2%2C%5Cdots%2Cm%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{1,2,&#92;dots,m&#92;}' title='&#92;{1,2,&#92;dots,m&#92;}' class='latex' /> and let <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> be the characteristic measure of the set of all numbers that can be written as <img src='http://s0.wp.com/latex.php?latex=%5Cprod_%7Bi%5Cin+A%7Dp_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;prod_{i&#92;in A}p_i' title='&#92;prod_{i&#92;in A}p_i' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=A%5Cin%5Cmathcal%7BA%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A&#92;in&#92;mathcal{A}' title='A&#92;in&#92;mathcal{A}' class='latex' />. That is, <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> takes the value <img src='http://s0.wp.com/latex.php?latex=%7C%5Cmathcal%7BA%7D%7C%5E%7B-1%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|&#92;mathcal{A}|^{-1}' title='|&#92;mathcal{A}|^{-1}' class='latex' /> on all such numbers and zero on all other numbers.</p>
<p>Does a function like that have any chance of working? Because we have ensured that it sums to 1, we would now be looking for a decomposition with absolute values of coefficients summing to at most <img src='http://s0.wp.com/latex.php?latex=c&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='c' title='c' class='latex' /> for a small constant <img src='http://s0.wp.com/latex.php?latex=c&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='c' title='c' class='latex' />. But before we even think about doing that, we should try to find a sequence that is large on many of the products of primes but that has bounded discrepancy. Only if we find it hard to do that is it worth looking for a decomposition.</p>
<p>Let me be more precise about what we are looking for. If we can find a decomposition of <img src='http://s0.wp.com/latex.php?latex=D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='D' title='D' class='latex' /> (the diagonal matrix with entries given by <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' />) as <img src='http://s0.wp.com/latex.php?latex=%5Csum_i%5Clambda_iu_i%5Cotimes+v_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_i&#92;lambda_iu_i&#92;otimes v_i' title='&#92;sum_i&#92;lambda_iu_i&#92;otimes v_i' class='latex' /> where the <img src='http://s0.wp.com/latex.php?latex=u_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u_i' title='u_i' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=v_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v_i' title='v_i' class='latex' /> are HAP-functions and <img src='http://s0.wp.com/latex.php?latex=%5Csum_i%7C%5Clambda_i%7C%5Cleq+c&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_i|&#92;lambda_i|&#92;leq c' title='&#92;sum_i|&#92;lambda_i|&#92;leq c' class='latex' />, then for every sequence <img src='http://s0.wp.com/latex.php?latex=s%281%29%2Cs%282%29%2C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s(1),s(2),&#92;dots' title='s(1),s(2),&#92;dots' class='latex' />, we have two ways of calculating <img src='http://s0.wp.com/latex.php?latex=%5Clangle+s%2CDs%5Crangle&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;langle s,Ds&#92;rangle' title='&#92;langle s,Ds&#92;rangle' class='latex' />. One of them gives us <img src='http://s0.wp.com/latex.php?latex=%5Csum_n%5Cdelta%28n%29s%28n%29%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_n&#92;delta(n)s(n)^2' title='&#92;sum_n&#92;delta(n)s(n)^2' class='latex' />, while the other gives us <img src='http://s0.wp.com/latex.php?latex=%5Csum_i%5Clambda_i%5Clangle+s%2Cu_i%5Crangle%5Clangle+s%2Cv_i%5Crangle&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_i&#92;lambda_i&#92;langle s,u_i&#92;rangle&#92;langle s,v_i&#92;rangle' title='&#92;sum_i&#92;lambda_i&#92;langle s,u_i&#92;rangle&#92;langle s,v_i&#92;rangle' class='latex' />. So if <img src='http://s0.wp.com/latex.php?latex=%5Csum_n%5Cdelta%28n%29s%28n%29%5E2%5Cgeq+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_n&#92;delta(n)s(n)^2&#92;geq 1' title='&#92;sum_n&#92;delta(n)s(n)^2&#92;geq 1' class='latex' />, then there must be some HAP on which <img src='http://s0.wp.com/latex.php?latex=s&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s' title='s' class='latex' /> has discrepancy at least <img src='http://s0.wp.com/latex.php?latex=c%5E%7B-1%2F2%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='c^{-1/2}' title='c^{-1/2}' class='latex' />.</p>
<p>Therefore, in order to show that this particular <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> will not do, we need to find a sequence <img src='http://s0.wp.com/latex.php?latex=s&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s' title='s' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=s%28n%29%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s(n)^2' title='s(n)^2' class='latex' /> averages at least 1 on the numbers <img src='http://s0.wp.com/latex.php?latex=%5Cprod_%7Bi%5Cin+A%7Dp_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;prod_{i&#92;in A}p_i' title='&#92;prod_{i&#92;in A}p_i' class='latex' /> but has bounded discrepancy.</p>
<p>Let us make our task easier to think about by looking for a sequence that takes <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' /> values on the numbers <img src='http://s0.wp.com/latex.php?latex=%5Cprod_%7Bi%5Cin+A%7Dp_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;prod_{i&#92;in A}p_i' title='&#92;prod_{i&#92;in A}p_i' class='latex' />. Can we choose the values elsewhere in such a way that we cancel out any discrepancy that we might accidentally pick up with those values?</p>
<p>To answer this question, let us think about how HAPs intersect the set, which, since I&#8217;m mentioning it quite a bit, I&#8217;d better give a name to: I&#8217;ll write <img src='http://s0.wp.com/latex.php?latex=S%3DS%28%5Cmathcal%7BA%7D%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S=S(&#92;mathcal{A})' title='S=S(&#92;mathcal{A})' class='latex' /> for the set of numbers <img src='http://s0.wp.com/latex.php?latex=%5Cprod_%7Bi%5Cin+A%7Dp_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;prod_{i&#92;in A}p_i' title='&#92;prod_{i&#92;in A}p_i' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=A%5Cin%5Cmathcal%7BA%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A&#92;in&#92;mathcal{A}' title='A&#92;in&#92;mathcal{A}' class='latex' />. Let me also write <img src='http://s0.wp.com/latex.php?latex=p_A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_A' title='p_A' class='latex' /> as shorthand for <img src='http://s0.wp.com/latex.php?latex=%5Cprod_%7Bi%5Cin+A%7Dp_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;prod_{i&#92;in A}p_i' title='&#92;prod_{i&#92;in A}p_i' class='latex' />. Any HAP that intersects <img src='http://s0.wp.com/latex.php?latex=S&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S' title='S' class='latex' /> must have a common difference of the form <img src='http://s0.wp.com/latex.php?latex=p_B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_B' title='p_B' class='latex' /> for some set <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' />. The resulting HAP will contain every <img src='http://s0.wp.com/latex.php?latex=p_A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_A' title='p_A' class='latex' /> for which <img src='http://s0.wp.com/latex.php?latex=B%5Csubset+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B&#92;subset A' title='B&#92;subset A' class='latex' />, which is rather pleasantly combinatorial.</p>
<p>Let&#8217;s suppose that we&#8217;ve chosen the <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' /> values everywhere on <img src='http://s0.wp.com/latex.php?latex=S&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S' title='S' class='latex' />. What we are trying to do now is choose values off <img src='http://s0.wp.com/latex.php?latex=S&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S' title='S' class='latex' /> (not necessarily <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' />) in such a way that any HAP-discrepancy contributed by values in <img src='http://s0.wp.com/latex.php?latex=S&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S' title='S' class='latex' /> is cancelled out by a roughly opposite discrepancy in the values outside <img src='http://s0.wp.com/latex.php?latex=S&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S' title='S' class='latex' />.</p>
<p>An obvious way to do that is to be fairly greedy about it. That is, we look at HAPs that have a substantial intersection with <img src='http://s0.wp.com/latex.php?latex=S&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S' title='S' class='latex' /> and simply choose a few further values on those HAPs, hoping that our choices won&#8217;t mess up too many discrepancies elsewhere. Let&#8217;s see if we can get something like that to work.</p>
<p>Consider, then, the (infinite) HAP with common difference <img src='http://s0.wp.com/latex.php?latex=p_B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_B' title='p_B' class='latex' />. As already mentioned, this intersects <img src='http://s0.wp.com/latex.php?latex=S&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S' title='S' class='latex' /> in every <img src='http://s0.wp.com/latex.php?latex=p_A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_A' title='p_A' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=B%5Csubset+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B&#92;subset A' title='B&#92;subset A' class='latex' />. So as we trundle along the multiples <img src='http://s0.wp.com/latex.php?latex=p_B%2C+2p_B%2C+3p_B%2C+%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_B, 2p_B, 3p_B, &#92;dots' title='p_B, 2p_B, 3p_B, &#92;dots' class='latex' />, we find that the partial sums of the <img src='http://s0.wp.com/latex.php?latex=s%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s(n)' title='s(n)' class='latex' />s that we encounter and have already chosen go up and down, and we would like to choose some further values to ensure that they remain bounded.</p>
<p>At which values of <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> are we still at liberty to choose the value <img src='http://s0.wp.com/latex.php?latex=s%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s(n)' title='s(n)' class='latex' />? It must be a multiple <img src='http://s0.wp.com/latex.php?latex=rp_B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='rp_B' title='rp_B' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=p_B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_B' title='p_B' class='latex' />, and two ways of ensuring that that multiple does not lie in <img src='http://s0.wp.com/latex.php?latex=S&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S' title='S' class='latex' /> are (i) to make sure that <img src='http://s0.wp.com/latex.php?latex=r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r' title='r' class='latex' /> is also a multiple of some <img src='http://s0.wp.com/latex.php?latex=p_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_i' title='p_i' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=i%5Cin+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='i&#92;in B' title='i&#92;in B' class='latex' /> and (ii) to make sure that <img src='http://s0.wp.com/latex.php?latex=r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r' title='r' class='latex' /> is a multiple of <img src='http://s0.wp.com/latex.php?latex=p_j%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_j^2' title='p_j^2' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=j%5Cnotin+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='j&#92;notin B' title='j&#92;notin B' class='latex' />.</p>
<p>The question now is whether if we do that we will find that the value <img src='http://s0.wp.com/latex.php?latex=s%28rp_B%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s(rp_B)' title='s(rp_B)' class='latex' /> we choose has an effect on lots of other HAPs at the same time. And it looks rather as though it will. Suppose <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' /> is quite a large set. Then any multiple <img src='http://s0.wp.com/latex.php?latex=rp_B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='rp_B' title='rp_B' class='latex' /> that does not belong to <img src='http://s0.wp.com/latex.php?latex=S&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S' title='S' class='latex' /> is also a multiple of <img src='http://s0.wp.com/latex.php?latex=p_C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_C' title='p_C' class='latex' /> for every <img src='http://s0.wp.com/latex.php?latex=C%5Csubset+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C&#92;subset B' title='C&#92;subset B' class='latex' />, so there is indeed a danger that by choosing a value of <img src='http://s0.wp.com/latex.php?latex=s%28rp_B%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s(rp_B)' title='s(rp_B)' class='latex' /> we are messing up quite a lot of HAPs and not just the one we were interested in. We could of course try to sort out the HAPs with large <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' /> first and later correct any damage we had done to smaller sets &#8212; but a large set <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' /> has a lot of subsets.</p>
<p>This is another situation where the devil is in the detail. Maybe there is enough flexibility that some kind of careful greedy approach would work &#8212; perhaps we could even choose <img src='http://s0.wp.com/latex.php?latex=s&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s' title='s' class='latex' /> to be 1 everywhere on <img src='http://s0.wp.com/latex.php?latex=S&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S' title='S' class='latex' /> &#8212; but so far it seems at least possible that there exists an efficient decomposition of a diagonal matrix with entries that are concentrated on square-free numbers with many prime factors. </p>
<p>It is sometimes convenient to describe arithmetic progressions of the form <img src='http://s0.wp.com/latex.php?latex=%5C%7Brd%2C%28r%2B1%29d%2C%5Cdots%2Csd%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{rd,(r+1)d,&#92;dots,sd&#92;}' title='&#92;{rd,(r+1)d,&#92;dots,sd&#92;}' class='latex' /> as HAPs. Let me do that now, and let me define two HAPs to be <em>adjacent</em> if they are disjoint and their union is also a HAP (in this generalized sense). In other words, two HAPs are adjacent if one is simply a continuation of the other. </p>
<p>If <img src='http://s0.wp.com/latex.php?latex=u&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u' title='u' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=v&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v' title='v' class='latex' /> are HAP-functions coming from adjacent HAPs <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=Q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Q' title='Q' class='latex' /> (in which case I&#8217;ll say that they are adjacent HAP-functions), then <img src='http://s0.wp.com/latex.php?latex=%28u-v%29%5Cotimes%28u-v%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(u-v)&#92;otimes(u-v)' title='(u-v)&#92;otimes(u-v)' class='latex' /> is 1 on the diagonal at all points in the HAP <img src='http://s0.wp.com/latex.php?latex=P%5Ccup+Q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P&#92;cup Q' title='P&#92;cup Q' class='latex' />, while off the diagonal it is 1 on <img src='http://s0.wp.com/latex.php?latex=P%5Ctimes+P%5Ccup+Q%5Ctimes+Q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P&#92;times P&#92;cup Q&#92;times Q' title='P&#92;times P&#92;cup Q&#92;times Q' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=-1+on+&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='-1 on ' title='-1 on ' class='latex' />latex P\times Q\cup Q\times P$. If we take lots of products of this form, we can hope to get lots of cancellation off the diagonal, while getting none at all on the diagonal.</p>
<p>Let&#8217;s try to pursue this idea in more detail. Suppose we have some kind of probability distribution on functions of this form. That is, we pick, according to some distribution, a random set <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' /> and then a random pair of adjacent HAPs <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=Q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Q' title='Q' class='latex' />. Now let <img src='http://s0.wp.com/latex.php?latex=u&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u' title='u' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=v&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v' title='v' class='latex' /> be their characteristic functions and let <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y' title='y' class='latex' /> be two positive integers. What is the expected value of <img src='http://s0.wp.com/latex.php?latex=%28u%28x%29-v%28x%29%29%28u%28y%29-v%28y%29%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(u(x)-v(x))(u(y)-v(y))' title='(u(x)-v(x))(u(y)-v(y))' class='latex' />? </p>
<p>A necessary condition for this quantity to be non-zero is that <img src='http://s0.wp.com/latex.php?latex=p_B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_B' title='p_B' class='latex' /> should divide both <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y' title='y' class='latex' />. So let us condition on this event. If we are clever about the way we choose our probability distribution, we should be able to organize it so that if you choose a random <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' />, then typically plenty of its subsets have a similar probability of being chosen. The reason that is potentially important is that if we fix a common difference and pick random pairs of adjacent HAPs <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=Q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Q' title='Q' class='latex' /> of some given length, then pairs <img src='http://s0.wp.com/latex.php?latex=%28x%2Cy%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(x,y)' title='(x,y)' class='latex' /> that are close to the diagonal will tend to get positive values (because normally if one of them is in <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> then so is the other, and similarly for <img src='http://s0.wp.com/latex.php?latex=Q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Q' title='Q' class='latex' />) while pairs that are a little further away tend to get negative values. But if we have many different common differences in operation then we can hope that some of these biases will cancel out: a difference <img src='http://s0.wp.com/latex.php?latex=%7Cx-y%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|x-y|' title='|x-y|' class='latex' /> that tends to result in a positive value at one scale might tend to result in a negative value at another scale.</p>
<p>While writing this I have just remembered a useful trick that we were well aware of during the previous attempt at EDP. It is straightforward to show that EDP is equivalent to the same question for the rationals. That is, one wishes to show that for every <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' /> function defined on the positive rationals and every constant <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' /> there is a HAP (the definition is obvious) on which the sum of the function has absolute value at least <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />. The nice thing about this formulation is a much greater symmetry: for every rational <img src='http://s0.wp.com/latex.php?latex=q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='q' title='q' class='latex' />, the map <img src='http://s0.wp.com/latex.php?latex=x%5Cmapsto+qx&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;mapsto qx' title='x&#92;mapsto qx' class='latex' /> is an isomorphism from <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BQ%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{Q}' title='&#92;mathbb{Q}' class='latex' /> (considered as a group under addition) to itself. This makes it unnecessary to think about numbers with lots of factors. (It is not hard to get into this nice situation using just integers &#8212; for example, you can just multiply everything by <img src='http://s0.wp.com/latex.php?latex=n%21&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n!' title='n!' class='latex' /> for a very large <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> and you can divide your numbers by whatever you like, within reason &#8212; but it is more natural to use rationals.)</p>
<p>The hope was that if we thought about functions defined on the rationals, then the rather peculiar properties of the diagonal matrix one wishes to decompose might become less peculiar. However, one would still need to think carefully about <em>ratios</em> of numbers.</p>
<p>I&#8217;m tempted to continue thinking aloud about these possibilities, but I&#8217;ll save that up for the comments.</p>
<p><strong>A second wild guess.</strong></p>
<p>I&#8217;m now going to ignore what I&#8217;ve just written about working with the rationals and go back to the question of trying to find a natural function that is biased towards positive integers with many factors. What I&#8217;d like to do is build a sequence of non-negative functions <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' />, each better than the last, in the hope of eventually finding one that has (or at least doesn&#8217;t obviously not have) the property that every sequence <img src='http://s0.wp.com/latex.php?latex=s&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s' title='s' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%5Csum_n%5Cdelta%28n%29s%28n%29%5E2%5Cgeq+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_n&#92;delta(n)s(n)^2&#92;geq 1' title='&#92;sum_n&#92;delta(n)s(n)^2&#92;geq 1' class='latex' /> has large HAP-discrepancy. (To make this non-trivial, I also need <img src='http://s0.wp.com/latex.php?latex=%5Csum_n%5Cdelta%28n%29%5Cgeq+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_n&#92;delta(n)&#92;geq 1' title='&#92;sum_n&#92;delta(n)&#92;geq 1' class='latex' />.)</p>
<p>I&#8217;ll start with a function I know doesn&#8217;t work: let <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%28n%29%3D1%2FN&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta(n)=1/N' title='&#92;delta(n)=1/N' class='latex' /> for every <img src='http://s0.wp.com/latex.php?latex=n%5Cleq+N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n&#92;leq N' title='n&#92;leq N' class='latex' /> and 0 for every <img src='http://s0.wp.com/latex.php?latex=n%3EN&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n&gt;N' title='n&gt;N' class='latex' />. That fails because the sequence <img src='http://s0.wp.com/latex.php?latex=1%2C-1%2C0%2C1%2C-1%2C0%2C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1,-1,0,1,-1,0,&#92;dots' title='1,-1,0,1,-1,0,&#92;dots' class='latex' /> has HAP-discrepancy 1 but <img src='http://s0.wp.com/latex.php?latex=%5Csum_n%5Cdelta%28n%29s%28n%29%5E2%3D2%2F3&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_n&#92;delta(n)s(n)^2=2/3' title='&#92;sum_n&#92;delta(n)s(n)^2=2/3' class='latex' /> (at least if <img src='http://s0.wp.com/latex.php?latex=N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N' title='N' class='latex' /> is a multiple of 3). That doesn&#8217;t satisfy the condition I said, but it does if we multiply it by <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7B3%2F2%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{3/2}' title='&#92;sqrt{3/2}' class='latex' />. </p>
<p>The problem there was that <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> has a large sum on non-multiples of 3. To correct that and similar problems, it feels natural to give greater weight to numbers with more factors. But how? Well, let&#8217;s do it in a rather naive way and simply weight each number by how many factors it has (and therefore how many HAPs it belongs to). This at least has the advantage of being a natural number-theoretic function: it is (up to a constant) the Dirichlet convolution of the constant function 1 with itself, at least until you get beyond <img src='http://s0.wp.com/latex.php?latex=N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N' title='N' class='latex' />. That is, <img src='http://s0.wp.com/latex.php?latex=d%28n%29%3D%5Csum_%7Bd%7Cn%7D1%5Ctimes+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(n)=&#92;sum_{d|n}1&#92;times 1' title='d(n)=&#92;sum_{d|n}1&#92;times 1' class='latex' />. </p>
<p>Roughly how big is <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bn%5Cleq+N%7Dd%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{n&#92;leq N}d(n)' title='&#92;sum_{n&#92;leq N}d(n)' class='latex' />? This sum is the number of pairs <img src='http://s0.wp.com/latex.php?latex=%28a%2Cb%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a,b)' title='(a,b)' class='latex' /> of positive integers such that <img src='http://s0.wp.com/latex.php?latex=ab%5Cleq+N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='ab&#92;leq N' title='ab&#92;leq N' class='latex' /> (since <img src='http://s0.wp.com/latex.php?latex=d%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(n)' title='d(n)' class='latex' /> is the number of pairs <img src='http://s0.wp.com/latex.php?latex=%28a%2Cb%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a,b)' title='(a,b)' class='latex' /> of positive integers such that <img src='http://s0.wp.com/latex.php?latex=ab%3Dn&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='ab=n' title='ab=n' class='latex' />). Counting the number of pairs with <img src='http://s0.wp.com/latex.php?latex=a%3D1%2C2%2C%5Cdots%2CN&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a=1,2,&#92;dots,N' title='a=1,2,&#92;dots,N' class='latex' /> we can crudely estimate this as <img src='http://s0.wp.com/latex.php?latex=N%2BN%2F2%2BN%2F3%2B%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N+N/2+N/3+&#92;dots' title='N+N/2+N/3+&#92;dots' class='latex' />, which is roughly <img src='http://s0.wp.com/latex.php?latex=N%5Clog+N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N&#92;log N' title='N&#92;log N' class='latex' />. On the other hand, the Dirichlet convolution with itself of the function that is <img src='http://s0.wp.com/latex.php?latex=1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1' title='1' class='latex' /> up to <img src='http://s0.wp.com/latex.php?latex=N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N' title='N' class='latex' /> and 0 thereafter sums to <img src='http://s0.wp.com/latex.php?latex=N%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N^2' title='N^2' class='latex' /> (since each pair of positive integers <img src='http://s0.wp.com/latex.php?latex=%28a%2Cb%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a,b)' title='(a,b)' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=a%2Cb%5Cleq+N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a,b&#92;leq N' title='a,b&#92;leq N' class='latex' /> contributes 1 to the sum). Since the two functions are equal up to <img src='http://s0.wp.com/latex.php?latex=N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N' title='N' class='latex' />, we find that the vast majority of the second function lies beyond the point where it equals <img src='http://s0.wp.com/latex.php?latex=d%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(n)' title='d(n)' class='latex' />. </p>
<p>But let&#8217;s not worry about that. On average, <img src='http://s0.wp.com/latex.php?latex=d%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(n)' title='d(n)' class='latex' /> grows like <img src='http://s0.wp.com/latex.php?latex=%5Clog+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;log n' title='&#92;log n' class='latex' />, which is not too fast a rate of growth, so let&#8217;s simply take the function <img src='http://s0.wp.com/latex.php?latex=d%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(n)' title='d(n)' class='latex' /> up to <img src='http://s0.wp.com/latex.php?latex=N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N' title='N' class='latex' />. We now want to prove that every sequence <img src='http://s0.wp.com/latex.php?latex=s%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s(n)' title='s(n)' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bn%3D1%7D%5ENd%28n%29s%28n%29%5E2%5Cgeq+N%5Clog+N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{n=1}^Nd(n)s(n)^2&#92;geq N&#92;log N' title='&#92;sum_{n=1}^Nd(n)s(n)^2&#92;geq N&#92;log N' class='latex' /> has HAP-discrepancy at least <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' /> (for some <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' /> that tends to infinity with <img src='http://s0.wp.com/latex.php?latex=N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N' title='N' class='latex' />). Alternatively, we want to find a counterexample, which will induce us to try to improve the function.</p>
<p>An obvious example to try is the usual one, namely the sequence <img src='http://s0.wp.com/latex.php?latex=1%2C-1%2C0%2C1%2C-1%2C0%2C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1,-1,0,1,-1,0,&#92;dots' title='1,-1,0,1,-1,0,&#92;dots' class='latex' />. To see whether this works, we need to estimate the sum of <img src='http://s0.wp.com/latex.php?latex=d%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(n)' title='d(n)' class='latex' /> over all non-multiples of <img src='http://s0.wp.com/latex.php?latex=3&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='3' title='3' class='latex' /> up to <img src='http://s0.wp.com/latex.php?latex=N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N' title='N' class='latex' />. That equals the number of pairs <img src='http://s0.wp.com/latex.php?latex=%28a%2Cb%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a,b)' title='(a,b)' class='latex' /> such that neither <img src='http://s0.wp.com/latex.php?latex=a&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a' title='a' class='latex' /> nor <img src='http://s0.wp.com/latex.php?latex=b&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='b' title='b' class='latex' /> is a multiple of 3 and <img src='http://s0.wp.com/latex.php?latex=ab%5Cleq+N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='ab&#92;leq N' title='ab&#92;leq N' class='latex' />. And that is roughly <img src='http://s0.wp.com/latex.php?latex=4%2F9&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='4/9' title='4/9' class='latex' /> times what you get without the divisibility condition. In other words, it is roughly <img src='http://s0.wp.com/latex.php?latex=%284%2F9%29N%5Clog+N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(4/9)N&#92;log N' title='(4/9)N&#92;log N' class='latex' />. So to get <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bn%5Cleq+N%7Dd%28n%29s%28n%29%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{n&#92;leq N}d(n)s(n)^2' title='&#92;sum_{n&#92;leq N}d(n)s(n)^2' class='latex' /> to equal <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bn%5Cleq+N%7Dd%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{n&#92;leq N}d(n)' title='&#92;sum_{n&#92;leq N}d(n)' class='latex' /> we need to multiply the sequence by <img src='http://s0.wp.com/latex.php?latex=3%2F2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='3/2' title='3/2' class='latex' />, which gives it a HAP-discrepancy of <img src='http://s0.wp.com/latex.php?latex=3%2F2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='3/2' title='3/2' class='latex' />, a slight improvement on the <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7B3%2F2%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{3/2}' title='&#92;sqrt{3/2}' class='latex' /> that we obtained with a constant function. </p>
<p>This doesn&#8217;t prove much &#8212; maybe there are better examples &#8212; but it suggests in a weak way that taking Dirichlet convolutions results in improved functions. To be clear what I mean by &#8220;improved&#8221;, I am now looking for a non-negative function <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> defined on <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BN%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{N}' title='&#92;mathbb{N}' class='latex' /> such that for large <img src='http://s0.wp.com/latex.php?latex=N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N' title='N' class='latex' />, or perhaps even all <img src='http://s0.wp.com/latex.php?latex=N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N' title='N' class='latex' />, if <img src='http://s0.wp.com/latex.php?latex=s&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s' title='s' class='latex' /> is any sequence such that <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bn%5Cleq+N%7D%5Cdelta%28n%29s%28n%29%5E2%3D%5Csum_%7Bn%5Cleq+N%7D%5Cdelta%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{n&#92;leq N}&#92;delta(n)s(n)^2=&#92;sum_{n&#92;leq N}&#92;delta(n)' title='&#92;sum_{n&#92;leq N}&#92;delta(n)s(n)^2=&#92;sum_{n&#92;leq N}&#92;delta(n)' class='latex' />, then <img src='http://s0.wp.com/latex.php?latex=s&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s' title='s' class='latex' /> has HAP-discrepancy at least <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />. The bigger I can get <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' /> to be, the better I consider the function <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> to be.</p>
<p>If you take repeated Dirichlet convolutions of the constant function 1, then you get functions <img src='http://s0.wp.com/latex.php?latex=d_k&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d_k' title='d_k' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=d_k%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d_k(n)' title='d_k(n)' class='latex' /> is the number of <img src='http://s0.wp.com/latex.php?latex=k&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='k' title='k' class='latex' />-tuples <img src='http://s0.wp.com/latex.php?latex=%28a_1%2C%5Cdots%2Ca_k%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a_1,&#92;dots,a_k)' title='(a_1,&#92;dots,a_k)' class='latex' /> of positive integers such that <img src='http://s0.wp.com/latex.php?latex=a_1a_2%5Cdots+a_k%3Dn&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a_1a_2&#92;dots a_k=n' title='a_1a_2&#92;dots a_k=n' class='latex' />. (The function <img src='http://s0.wp.com/latex.php?latex=d%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d(n)' title='d(n)' class='latex' /> is equal to <img src='http://s0.wp.com/latex.php?latex=d_2%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d_2(n)' title='d_2(n)' class='latex' />.) Suppose <img src='http://s0.wp.com/latex.php?latex=k&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='k' title='k' class='latex' /> is large, <img src='http://s0.wp.com/latex.php?latex=N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N' title='N' class='latex' /> is very large, and <img src='http://s0.wp.com/latex.php?latex=s&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s' title='s' class='latex' /> is a sequence such that <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bn%5Cleq+N%7Dd_k%28n%29s%28n%29%5E2%3D%5Csum_%7Bn%5Cleq+N%7Dd_k%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{n&#92;leq N}d_k(n)s(n)^2=&#92;sum_{n&#92;leq N}d_k(n)' title='&#92;sum_{n&#92;leq N}d_k(n)s(n)^2=&#92;sum_{n&#92;leq N}d_k(n)' class='latex' />. Does it follow that <img src='http://s0.wp.com/latex.php?latex=s&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s' title='s' class='latex' /> has large HAP-discrepancy? More ambitiously, does it have HAP-discrepancy that grows exponentially with <img src='http://s0.wp.com/latex.php?latex=k&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='k' title='k' class='latex' />, provided that <img src='http://s0.wp.com/latex.php?latex=N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N' title='N' class='latex' /> is large enough? </p>
<p><strong>Back to the rationals.</strong></p>
<p>The following idea is one that feels familiar &#8212; I think something similar has come up already. But there&#8217;s nothing like a break from a problem to make an old stale idea seem new and fresh, and sometimes the apparent staleness was an illusion. So here it is again.</p>
<p>I&#8217;ll begin by creating a function on the diagonal that seems to me to have a sporting chance of being efficiently decomposable into HAPs. Let <img src='http://s0.wp.com/latex.php?latex=G&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='G' title='G' class='latex' /> (for generating set) be the set <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C2%2C%5Cdots%2Cm%5C%7D%5Ccup%5C%7B1%2C1%2F2%2C%5Cdots%2C1%2Fm%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{1,2,&#92;dots,m&#92;}&#92;cup&#92;{1,1/2,&#92;dots,1/m&#92;}' title='&#92;{1,2,&#92;dots,m&#92;}&#92;cup&#92;{1,1/2,&#92;dots,1/m&#92;}' class='latex' /> and let <img src='http://s0.wp.com/latex.php?latex=%5Cphi&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;phi' title='&#92;phi' class='latex' /> be the <img src='http://s0.wp.com/latex.php?latex=k&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='k' title='k' class='latex' />-fold multiplicative convolution of the characteristic function of <img src='http://s0.wp.com/latex.php?latex=G&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='G' title='G' class='latex' /> with itself. That is, <img src='http://s0.wp.com/latex.php?latex=%5Cphi%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;phi(x)' title='&#92;phi(x)' class='latex' /> is the number of ways of writing <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> as <img src='http://s0.wp.com/latex.php?latex=a_1a_2%5Cdots+a_r%2Fb_1b_2%5Cdots+b_s&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a_1a_2&#92;dots a_r/b_1b_2&#92;dots b_s' title='a_1a_2&#92;dots a_r/b_1b_2&#92;dots b_s' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=r%2Bs%3Dk&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r+s=k' title='r+s=k' class='latex' /> and all the <img src='http://s0.wp.com/latex.php?latex=a_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a_i' title='a_i' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=b_j&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='b_j' title='b_j' class='latex' /> being integers between 1 and <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' />. </p>
<p>Before I think about how one might go about decomposing the diagonal &#8220;matrix&#8221; (that is, a function defined on <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BQ%7D_%2B%5Ctimes%5Cmathbb%7BQ%7D_%2B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{Q}_+&#92;times&#92;mathbb{Q}_+' title='&#92;mathbb{Q}_+&#92;times&#92;mathbb{Q}_+' class='latex' /> that is zero at <img src='http://s0.wp.com/latex.php?latex=%28x%2Cy%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(x,y)' title='(x,y)' class='latex' /> unless <img src='http://s0.wp.com/latex.php?latex=x%3Dy&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x=y' title='x=y' class='latex' />) into products of HAP-functions, I want to see whether it looks hard to find a function <img src='http://s0.wp.com/latex.php?latex=s&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s' title='s' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%5Csum_x%5Cphi%28x%29s%28x%29%5E2%5Cgeq%5Csum_x%5Cphi_x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_x&#92;phi(x)s(x)^2&#92;geq&#92;sum_x&#92;phi_x' title='&#92;sum_x&#92;phi(x)s(x)^2&#92;geq&#92;sum_x&#92;phi_x' class='latex' /> but <img src='http://s0.wp.com/latex.php?latex=s&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s' title='s' class='latex' /> is of bounded HAP-discrepancy.</p>
<p>To have any chance of that, we need to understand a bit about <img src='http://s0.wp.com/latex.php?latex=%5Cphi&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;phi' title='&#92;phi' class='latex' />. We can get that understanding by representing every rational as a product of (possibly negative) powers of distinct primes. For the rationals where <img src='http://s0.wp.com/latex.php?latex=%5Cphi&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;phi' title='&#92;phi' class='latex' /> is non-zero, the primes are all at most <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' />, so we can think of <img src='http://s0.wp.com/latex.php?latex=%5Cphi&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;phi' title='&#92;phi' class='latex' /> as a function defined on a lattice of dimension <img src='http://s0.wp.com/latex.php?latex=%5Cpi%28m%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pi(m)' title='&#92;pi(m)' class='latex' /> (the number of primes less than or equal to <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' />). Then <img src='http://s0.wp.com/latex.php?latex=%5Cphi&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;phi' title='&#92;phi' class='latex' /> is the <img src='http://s0.wp.com/latex.php?latex=k&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='k' title='k' class='latex' />-fold convolution of the characteristic function of <img src='http://s0.wp.com/latex.php?latex=G&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='G' title='G' class='latex' /> regarded as a subset of this lattice. If <img src='http://s0.wp.com/latex.php?latex=k&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='k' title='k' class='latex' /> is large enough, then this convolution will look like a large and spread-out Gaussian, which will have the potentially useful property that it is approximately invariant if you multiply (or divide) by an integer between 1 and <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' />. </p>
<p>If <img src='http://s0.wp.com/latex.php?latex=%5Cphi&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;phi' title='&#92;phi' class='latex' /> has that property, does it rule out some kind of variant of the <img src='http://s0.wp.com/latex.php?latex=1%2C-1%2C0%2C1%2C-1%2C0%2C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1,-1,0,1,-1,0,&#92;dots' title='1,-1,0,1,-1,0,&#92;dots' class='latex' /> example? A key feature of that example is that the function in question is zero on multiples of 3, but in <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BQ%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{Q}' title='&#92;mathbb{Q}' class='latex' /> there is no such thing as a multiple of 3 (or rather, everything is a multiple of 3 so the concept ceases to make sense). </p>
<p>But that argument is far from conclusive. How about defining a function as follows: given a rational <img src='http://s0.wp.com/latex.php?latex=p%2Fq&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p/q' title='p/q' class='latex' /> in its lowest terms, let <img src='http://s0.wp.com/latex.php?latex=s%28p%2Fq%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s(p/q)' title='s(p/q)' class='latex' /> be 0 if either <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='q' title='q' class='latex' /> is divisible by 3, and otherwise 1 if <img src='http://s0.wp.com/latex.php?latex=p%5Cequiv+q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p&#92;equiv q' title='p&#92;equiv q' class='latex' /> mod 3 and <img src='http://s0.wp.com/latex.php?latex=-1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='-1' title='-1' class='latex' /> if <img src='http://s0.wp.com/latex.php?latex=p%5Cnot%5Cequiv+q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p&#92;not&#92;equiv q' title='p&#92;not&#92;equiv q' class='latex' /> mod 3?</p>
<p>Let us think what the values of <img src='http://s0.wp.com/latex.php?latex=s&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s' title='s' class='latex' /> are at <img src='http://s0.wp.com/latex.php?latex=p%2Fq&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p/q' title='p/q' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=2p%2Fq&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='2p/q' title='2p/q' class='latex' />, etc. If <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> is a multiple of 3, then all those values are 0 (since <img src='http://s0.wp.com/latex.php?latex=q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='q' title='q' class='latex' /> is not a multiple of 3). If <img src='http://s0.wp.com/latex.php?latex=q&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='q' title='q' class='latex' /> is a multiple of <img src='http://s0.wp.com/latex.php?latex=3%5Er&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='3^r' title='3^r' class='latex' /> but not <img src='http://s0.wp.com/latex.php?latex=3%5E%7Br%2B1%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='3^{r+1}' title='3^{r+1}' class='latex' />, then <img src='http://s0.wp.com/latex.php?latex=s%28ap%2Fq%29%3D0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s(ap/q)=0' title='s(ap/q)=0' class='latex' /> except if <img src='http://s0.wp.com/latex.php?latex=a&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a' title='a' class='latex' /> is a multiple of <img src='http://s0.wp.com/latex.php?latex=3%5Er&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='3^r' title='3^r' class='latex' />. If <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=q%2F3%5Er&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='q/3^r' title='q/3^r' class='latex' /> are congruent mod 3, then the values of <img src='http://s0.wp.com/latex.php?latex=s%28ap%2Fq%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s(ap/q)' title='s(ap/q)' class='latex' /> go <img src='http://s0.wp.com/latex.php?latex=1%2C-1%2C0%2C1%2C-1%2C0%2C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1,-1,0,1,-1,0,&#92;dots' title='1,-1,0,1,-1,0,&#92;dots' class='latex' />. If they are not congruent mod 3, then they go <img src='http://s0.wp.com/latex.php?latex=-1%2C1%2C0%2C-1%2C1%2C0%2C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='-1,1,0,-1,1,0,&#92;dots' title='-1,1,0,-1,1,0,&#92;dots' class='latex' />. That last argument worked even if <img src='http://s0.wp.com/latex.php?latex=r%3D0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r=0' title='r=0' class='latex' />, so we have covered all cases. </p>
<p>What is the density of rationals such that neither numerator nor denominator is a multiple of 3? I haven&#8217;t made the question precise, and that allows me to give two different answers. The first answer is that if we pick the numerator and denominator randomly, then the probability that neither is a multiple of 3 is <img src='http://s0.wp.com/latex.php?latex=4%2F9&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='4/9' title='4/9' class='latex' />. Also, the probability that they are coprime is <img src='http://s0.wp.com/latex.php?latex=6%2F%5Cpi%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='6/&#92;pi^2' title='6/&#92;pi^2' class='latex' />, and the probability that they are both coprime and not multiples of 3 can easily be shown by similar methods to be at least an absolute constant. So it seems that our function is supported on a set of rationals of positive density.</p>
<p>But another way of thinking about it suggests that the support of the function has zero density: every rational can be written as a product of powers of primes, and the probability that the power of 3 that we choose is 0 tends to 0 as we choose more and more rationals.</p>
<p>The second answer is, I think, the correct one for us, though the question is a bit misleading. The <img src='http://s0.wp.com/latex.php?latex=%5Cphi&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;phi' title='&#92;phi' class='latex' />-density of rationals for which the &#8220;3-coordinate&#8221; is zero does indeed tend to zero, so if we regard <img src='http://s0.wp.com/latex.php?latex=%5Cphi&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;phi' title='&#92;phi' class='latex' /> as our measure, rather than something more additive, then we do appear to have ruled out examples that are similar to the <img src='http://s0.wp.com/latex.php?latex=1%2C-1%2C0%2C1%2C-1%2C0%2C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1,-1,0,1,-1,0,&#92;dots' title='1,-1,0,1,-1,0,&#92;dots' class='latex' /> example.</p>
<p><strong>Can we use information about the original problem to help us with the dual problem?</strong></p>
<p>We know that for the integers there is not an efficient HAP-decomposition of a diagonal matrix that is 1 for the first <img src='http://s0.wp.com/latex.php?latex=N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N' title='N' class='latex' /> terms and zero afterwards. The proof we have is that if such a decomposition existed, then the sequence <img src='http://s0.wp.com/latex.php?latex=1%2C-1%2C0%2C1%2C-1%2C0%2C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1,-1,0,1,-1,0,&#92;dots' title='1,-1,0,1,-1,0,&#92;dots' class='latex' /> would have to have large HAP-discrepancy, which it doesn&#8217;t. But can we give a &#8220;direct&#8221; proof? I&#8217;m not quite sure what I mean by that, except that it should not involve the use of a separating functional. Ideally what I&#8217;d like to do is use that example and others like it to work out some rather general constraint that would end up saying a precise version of, &#8220;If an efficiently decomposable matrix equals 1 on the diagonal, then it must have quite a lot of stuff off the diagonal,&#8221; and ideally say a fair amount about that &#8220;stuff&#8221;. </p>
<p>The hope would be to find a clear obstacle <em>on the dual side</em> to a direct approach to decomposing the <img src='http://s0.wp.com/latex.php?latex=N%5Ctimes+N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N&#92;times N' title='N&#92;times N' class='latex' /> identity matrix, and then to show that when we move to the measure on the rationals defined in the previous section, that obstacle goes away and allows us to do what we wished we could do over the integers. The thing that would make life easier would be the ability to divide anything by small integers.</p>
<p>Actually, maybe it isn&#8217;t so important to go for the dual side. Let&#8217;s just understand what the existing proof is telling us goes wrong when we try to find an efficient HAP-decomposition. For the decomposition to be efficient, the HAPs we use have to be quite long. And for that to be the case, they must contain roughly as many numbers congruent to 1 mod 3 as they do numbers congruent to 2 mod 3. (They might contain none of either.) Therefore, any HAP product has roughly as many points congruent to <img src='http://s0.wp.com/latex.php?latex=%281%2C1%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(1,1)' title='(1,1)' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=%282%2C2%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(2,2)' title='(2,2)' class='latex' /> mod 3 as points congruent to <img src='http://s0.wp.com/latex.php?latex=%281%2C2%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(1,2)' title='(1,2)' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=%282%2C1%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(2,1)' title='(2,1)' class='latex' /> mod 3. Let us call these <em>white</em> points and <em>black</em> points, respectively. By roughly I mean that these two numbers differ by at most a constant. In fact, I think they differ by at most 1. But no points on the diagonal are congruent to <img src='http://s0.wp.com/latex.php?latex=%281%2C2%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(1,2)' title='(1,2)' class='latex' /> or to <img src='http://s0.wp.com/latex.php?latex=%282%2C1%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(2,1)' title='(2,1)' class='latex' />, so if a product of HAPs includes <img src='http://s0.wp.com/latex.php?latex=r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r' title='r' class='latex' /> points on the diagonal that are congruent to <img src='http://s0.wp.com/latex.php?latex=%281%2C1%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(1,1)' title='(1,1)' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=%282%2C2%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(2,2)' title='(2,2)' class='latex' />, then it must include at least <img src='http://s0.wp.com/latex.php?latex=r-1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r-1' title='r-1' class='latex' /> more black points than white points off the diagonal. </p>
<p>So, roughly speaking, we can&#8217;t build up the non-multiples of 3 on the diagonal without building up a bias towards black points off the diagonal. </p>
<p>Does that matter? Can&#8217;t we <em>correct</em> that unwanted bias by adding or subtracting some other HAP products? No we can&#8217;t, unless we also cancel out much of the sum on the diagonal.</p>
<p>What rescues us if we deal with rationals instead (or more precisely a portion of the rationals weighted by the function <img src='http://s0.wp.com/latex.php?latex=%5Cphi&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;phi' title='&#92;phi' class='latex' />)? Also, how easy is it to find an efficient decomposition so that the weight off the diagonal is comparable to the weight on it? I suspect the answer to the second question is that it is not easy, since we have used just the fact that the sequence <img src='http://s0.wp.com/latex.php?latex=1%2C-1%2C0%2C1%2C-1%2C0%2C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1,-1,0,1,-1,0,&#92;dots' title='1,-1,0,1,-1,0,&#92;dots' class='latex' /> has bounded discrepancy, but there are many other examples.</p>
<p><strong>A variant of the HAP problem.</strong></p>
<p>I stopped the previous section prematurely because I was feeling a bit stuck. Instead I would like to think about a modified problem that may well be easier. EDP asks for a high discrepancy inside some HAP. The set of HAPs has the nice symmetry property that if you dilate a HAP then you get another one. This is particularly good over the rationals, when you can also contract. </p>
<p>Now let me ask a slightly vague question and then attempt to make it precise. Can we prove EDP for a different &#8220;base collection&#8221; of sets? For EDP I&#8217;m regarding the base collection as the collection of all sets <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C2%2C%5Cdots%2Cm%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{1,2,&#92;dots,m&#92;}' title='&#92;{1,2,&#92;dots,m&#92;}' class='latex' />. If you look at all dilations of those, you get all HAPs. Since I&#8217;m having trouble proving EDP for HAPs, I&#8217;d like to try to prove it for a different base collection. Of course, the smaller the collection, and the more closely related it is to the set of HAPs, the better, but for now my priority is to be able to prove <em>something</em>.</p>
<p>I actually have a collection of sets in mind. Let us think of the rationals multiplicatively as an infinite-dimensional lattice, or rather the portion of that lattice that consists of points with only finitely many non-zero coordinates. That is, the point <img src='http://s0.wp.com/latex.php?latex=%28a_1%2Ca_2%2C%5Cdots%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a_1,a_2,&#92;dots)' title='(a_1,a_2,&#92;dots)' class='latex' /> corresponds to the rational <img src='http://s0.wp.com/latex.php?latex=2%5E%7Ba_1%7D3%5E%7Ba_2%7D5%5E%7Ba_3%7D%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='2^{a_1}3^{a_2}5^{a_3}&#92;dots' title='2^{a_1}3^{a_2}5^{a_3}&#92;dots' class='latex' />, the <img src='http://s0.wp.com/latex.php?latex=a_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a_i' title='a_i' class='latex' /> being integers. I&#8217;m not sure exactly what I want to do next, so let me be slightly less precise. If we look at the rationals multiplicatively, as we are doing here, then dilation becomes simply translation. What I&#8217;d really like is something like an orthonormal basis, but I&#8217;d like it to consist largely of functions that are translates of one another. So far that won&#8217;t work, but it looks a lot more hopeful if we allow some kind of &#8220;spreading out&#8221; (which would be the analogue of taking longer and longer intervals). The most natural type of spreading out is, I would suggest, to take repeated convolutions. So a possible question &#8212; not necessarily the exact one I&#8217;ll end up asking &#8212; is this. Suppose we start with the set <img src='http://s0.wp.com/latex.php?latex=G%3D%5C%7B1%2C2%2C%5Cdots%2Cm%5C%7D%5Ccup%5C%7B1%2C1%2F2%2C1%2F3%2C%5Cdots%2C1%2Fm%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='G=&#92;{1,2,&#92;dots,m&#92;}&#92;cup&#92;{1,1/2,1/3,&#92;dots,1/m&#92;}' title='G=&#92;{1,2,&#92;dots,m&#92;}&#92;cup&#92;{1,1/2,1/3,&#92;dots,1/m&#92;}' class='latex' /> that we considered earlier. Is it true that every function that has large <img src='http://s0.wp.com/latex.php?latex=L_2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='L_2' title='L_2' class='latex' /> norm relative to the measure defined by (multiplicatively) convolving <img src='http://s0.wp.com/latex.php?latex=G&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='G' title='G' class='latex' /> <img src='http://s0.wp.com/latex.php?latex=k&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='k' title='k' class='latex' /> times has a large inner product with some (multiplicative) translate of some iterated convolution of <img src='http://s0.wp.com/latex.php?latex=G&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='G' title='G' class='latex' />?</p>
<p>This problem should be not that hard since it is purely multiplicative, so we can take logs and make it purely additive. If the answer turned out to be yes, then for any <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' />-function we would get a large inner product with some iterated convolution of <img src='http://s0.wp.com/latex.php?latex=G&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='G' title='G' class='latex' />. If we could nicely decompose that iterated convolution into HAPs, we would then be done, though that seems like quite a lot to ask.</p>
<p>Hmm &#8230; I&#8217;ve almost instantly run into trouble. It seems that my &#8220;multiplicative version&#8221; of EDP is just plain trivially false. Consider the function that is 1 for products of an even number of primes and -1 for products of odd numbers of primes. (This is the completely multiplicative function <img src='http://s0.wp.com/latex.php?latex=%5Clambda&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;lambda' title='&#92;lambda' class='latex' />, with which we are already very familiar.) From a multiplicative point of view, where we think of <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BQ%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{Q}' title='&#92;mathbb{Q}' class='latex' /> as an infinite-dimensional lattice, this is putting 1 at &#8220;white points&#8221; and -1 at &#8220;black points&#8221;. If we now take any &#8220;cuboid&#8221;, by which I mean a set of the form <img src='http://s0.wp.com/latex.php?latex=%5C%7Bp_1%5E%7Ba_1%7D%5Cdots+p_k%5E%7Ba_k%7D%3Ar_i%5Cleq+a_i%3Cs_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{p_1^{a_1}&#92;dots p_k^{a_k}:r_i&#92;leq a_i&lt;s_i' title='&#92;{p_1^{a_1}&#92;dots p_k^{a_k}:r_i&#92;leq a_i&lt;s_i' class='latex' /> for every <img src='http://s0.wp.com/latex.php?latex=i%5Cleq+k%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='i&#92;leq k&#92;}' title='i&#92;leq k&#92;}' class='latex' />, then the sum of <img src='http://s0.wp.com/latex.php?latex=%5Clambda&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;lambda' title='&#92;lambda' class='latex' /> on that cuboid is either 0, 1 or -1. I was about to say that I couldn&#8217;t see an elegant proof of this, but here&#8217;s a fairly quick one. If we remove two adjacent slices from the cuboid, we remove an equal number of white and black squares. &#8220;Removing two adjacent slices&#8221; means reducing <img src='http://s0.wp.com/latex.php?latex=s_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s_i' title='s_i' class='latex' /> by 2, provided <img src='http://s0.wp.com/latex.php?latex=s_i%5Cgeq+r_i%2B2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s_i&#92;geq r_i+2' title='s_i&#92;geq r_i+2' class='latex' />. If some <img src='http://s0.wp.com/latex.php?latex=s_i-r_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s_i-r_i' title='s_i-r_i' class='latex' /> is even, then by repeating this operation for that <img src='http://s0.wp.com/latex.php?latex=i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='i' title='i' class='latex' /> we can remove all points, which shows that the numbers of white and black points were originally the same. If all <img src='http://s0.wp.com/latex.php?latex=s_i-r_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s_i-r_i' title='s_i-r_i' class='latex' /> are odd, then we end up reducing the cuboid to a set with just one point, in which case we get <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' />. </p>
<p>If we multiplicatively convolve any two cuboids, we obtain a function that can be expressed as a linear combination of translates of a cuboid, so its inner product with <img src='http://s0.wp.com/latex.php?latex=%5Clambda&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;lambda' title='&#92;lambda' class='latex' /> will also be small, as will those of all <em>its</em> translates. (What&#8217;s more, those small inner products will themselves alternate in sign as you multiply or divide by primes.) So <img src='http://s0.wp.com/latex.php?latex=%5Clambda&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;lambda' title='&#92;lambda' class='latex' /> will have small inner product with everything you build out of products of GPs using multiplicative convolutions.</p>
<p>It could be fruitful to think about why HAPs have a better chance of giving rise to discrepancy than the kinds of multiplicative sets I&#8217;ve just considered. In preparation for that, here is a question to which I know the answer. Let <img src='http://s0.wp.com/latex.php?latex=A_1%2CA_2%2C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A_1,A_2,&#92;dots' title='A_1,A_2,&#92;dots' class='latex' /> be a nested collection of subsets of <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BN%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{N}' title='&#92;mathbb{N}' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%7CA_n%7C%3Dn&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|A_n|=n' title='|A_n|=n' class='latex' /> for every <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5Cbigcup_nA_n%3D%5Cmathbb%7BN%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;bigcup_nA_n=&#92;mathbb{N}' title='&#92;bigcup_nA_n=&#92;mathbb{N}' class='latex' />. Is it possible for there to be a <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' /> function <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;epsilon' title='&#92;epsilon' class='latex' /> and a constant <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%7C%5Csum_%7Bx%5Cin+A_n%7D%5Cepsilon%7Bmx%7D%7C%5Cleq+C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|&#92;sum_{x&#92;in A_n}&#92;epsilon{mx}|&#92;leq C' title='|&#92;sum_{x&#92;in A_n}&#92;epsilon{mx}|&#92;leq C' class='latex' /> for every <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' /> and every <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' />? In the case that <img src='http://s0.wp.com/latex.php?latex=A_n%3D%5C%7B1%2C2%2C%5Cdots%2Cn%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A_n=&#92;{1,2,&#92;dots,n&#92;}' title='A_n=&#92;{1,2,&#92;dots,n&#92;}' class='latex' /> for every <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> this is EDP. But for a general nested collection, it is easy to see that bounded discrepancy is possible for that collection and all its dilates. You just take a completely multiplicative function such as <img src='http://s0.wp.com/latex.php?latex=%5Clambda&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;lambda' title='&#92;lambda' class='latex' /> and choose <img src='http://s0.wp.com/latex.php?latex=A_n%3D%5C%7Ba_1%2Ca_2%2C%5Cdots%2Ca_n%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A_n=&#92;{a_1,a_2,&#92;dots,a_n&#92;}' title='A_n=&#92;{a_1,a_2,&#92;dots,a_n&#92;}' class='latex' /> in such a way that <img src='http://s0.wp.com/latex.php?latex=%5Clambda%28a_k%29%3D%28-1%29%5Ek&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;lambda(a_k)=(-1)^k' title='&#92;lambda(a_k)=(-1)^k' class='latex' /> for every <img src='http://s0.wp.com/latex.php?latex=k&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='k' title='k' class='latex' /> (and such that the <img src='http://s0.wp.com/latex.php?latex=a_k&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a_k' title='a_k' class='latex' /> are distinct and every integer is equal to <img src='http://s0.wp.com/latex.php?latex=a_k&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a_k' title='a_k' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=k&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='k' title='k' class='latex' />). Then the sum <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bx%5Cin+A_n%7D%5Clambda%28x%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{x&#92;in A_n}&#92;lambda(x)' title='&#92;sum_{x&#92;in A_n}&#92;lambda(x)' class='latex' /> is always either -1 or 0, and multiplicativity ensures that the sum on dilates of the <img src='http://s0.wp.com/latex.php?latex=A_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A_n' title='A_n' class='latex' /> is always either 1, -1 or 0.</p>
<p>The partial sums of <img src='http://s0.wp.com/latex.php?latex=%5Clambda&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;lambda' title='&#92;lambda' class='latex' /> grow (it is believed) like <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7Bn%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{n}' title='&#92;sqrt{n}' class='latex' />, so if we choose our sets <img src='http://s0.wp.com/latex.php?latex=A_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A_n' title='A_n' class='latex' /> greedily &#8212; that is, by taking <img src='http://s0.wp.com/latex.php?latex=a_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a_n' title='a_n' class='latex' /> to be the smallest positive integer we have not yet chosen such that <img src='http://s0.wp.com/latex.php?latex=%5Clambda&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;lambda' title='&#92;lambda' class='latex' /> takes the value <img src='http://s0.wp.com/latex.php?latex=%28-1%29%5En&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(-1)^n' title='(-1)^n' class='latex' /> &#8212; then <img src='http://s0.wp.com/latex.php?latex=A_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A_n' title='A_n' class='latex' /> will contain the first <img src='http://s0.wp.com/latex.php?latex=n-m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n-m' title='n-m' class='latex' /> integers, where <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' /> is around <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7Bn%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{n}' title='&#92;sqrt{n}' class='latex' /> (at most), and look a bit like a random selection of integers within around <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7Bn%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{n}' title='&#92;sqrt{n}' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> for the rest of the set. In other words, <img src='http://s0.wp.com/latex.php?latex=A_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A_n' title='A_n' class='latex' /> will be pretty close to the set <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C2%2C%5Cdots%2Cn%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{1,2,&#92;dots,n&#92;}' title='&#92;{1,2,&#92;dots,n&#92;}' class='latex' />, so the system of sets <img src='http://s0.wp.com/latex.php?latex=mA_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='mA_n' title='mA_n' class='latex' /> will be pretty close to the set of all HAPs. This suggests that proving EDP is going to be really quite delicate.</p>
<p>In fact, it will be more delicate still, since we can take a better multiplicative function than <img src='http://s0.wp.com/latex.php?latex=%5Clambda&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;lambda' title='&#92;lambda' class='latex' />. If we take the unique completely multiplicative function that takes <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> to 1 if <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> is congruent to 1 mod 3, to <img src='http://s0.wp.com/latex.php?latex=-1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='-1' title='-1' class='latex' /> if <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> is congruent to 2 mod 3, and 3 to -1, its partial sums grow logarithmically, so we can say something similar to the above but with <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' /> around <img src='http://s0.wp.com/latex.php?latex=%5Clog+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;log n' title='&#92;log n' class='latex' /> rather than around <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7Bn%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{n}' title='&#92;sqrt{n}' class='latex' />. </p>
<p>We can go slightly further with this function. Because it is 1 on the infinite arithmetic progression <img src='http://s0.wp.com/latex.php?latex=1%2C4%2C7%2C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1,4,7,&#92;dots' title='1,4,7,&#92;dots' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=-1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='-1' title='-1' class='latex' /> on the infinite arithmetic progression <img src='http://s0.wp.com/latex.php?latex=2%2C5%2C8%2C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='2,5,8,&#92;dots' title='2,5,8,&#92;dots' class='latex' />, we can choose our sets <img src='http://s0.wp.com/latex.php?latex=A_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A_n' title='A_n' class='latex' /> to be intervals up to roughly <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> plus logarithmically short (non-homogeneous) arithmetic progressions of common difference 3.</p>
<p>Since these sets are cooked up so as to ensure that the function has bounded discrepancy, they might seem not that interesting. But I think they are important, because if we want to find an efficient decomposition of a diagonal matrix into HAP products, we probably need to understand what it is that makes HAPs so much better than HAPs with highly structured tiny logarithmic perturbations at the end. Otherwise we are in danger of taking seriously arguments that would work just as well for statements that we know to be false.</p>
<p>As I write, it occurs to me that we might be able to prove a result that would be quite a bit easier than EDP but nevertheless interesting (and in particular interesting enough to be publishable, unless the answer turns out to be disappointingly easy): that there <em>exists</em> a permutation <img src='http://s0.wp.com/latex.php?latex=%5Cpi&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pi' title='&#92;pi' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BN%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{N}' title='&#92;mathbb{N}' class='latex' /> such that the discrepancy of every <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' /> sequence on the sets of the form <img src='http://s0.wp.com/latex.php?latex=%5C%7Bm%5Cpi%281%29%2C%5Cdots%2Cm%5Cpi%28n%29%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{m&#92;pi(1),&#92;dots,m&#92;pi(n)&#92;}' title='&#92;{m&#92;pi(1),&#92;dots,m&#92;pi(n)&#92;}' class='latex' /> is unbounded. We know it is not true for all permutations &#8212; even ones that in some sense don&#8217;t permute by very much &#8212; and we suspect that it is true for the identity permutation but find that very hard to prove. What about if <img src='http://s0.wp.com/latex.php?latex=%5Cpi&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pi' title='&#92;pi' class='latex' /> is in some sense &#8220;fairly random&#8221;? For example, what if we chose a large <img src='http://s0.wp.com/latex.php?latex=N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N' title='N' class='latex' /> and randomly permuted the first <img src='http://s0.wp.com/latex.php?latex=N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N' title='N' class='latex' /> integers. That would give us <img src='http://s0.wp.com/latex.php?latex=N&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='N' title='N' class='latex' /> sets and all their dilates. Could we prove that the discrepancy on that set system is at least <img src='http://s0.wp.com/latex.php?latex=%5Comega%28N%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;omega(N)' title='&#92;omega(N)' class='latex' />? What if we drop the condition that the union of the  <img src='http://s0.wp.com/latex.php?latex=A_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A_n' title='A_n' class='latex' /> should be all of <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BN%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{N}' title='&#92;mathbb{N}' class='latex' />? What if instead we ask that <img src='http://s0.wp.com/latex.php?latex=A_n%3D%5C%7Ba_1%2C%5Cdots%2Ca_n%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A_n=&#92;{a_1,&#92;dots,a_n&#92;}' title='A_n=&#92;{a_1,&#92;dots,a_n&#92;}' class='latex' /> for some strictly increasing sequence <img src='http://s0.wp.com/latex.php?latex=%28a_i%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(a_i)' title='(a_i)' class='latex' />?</p>
<p>I rather like those questions, which makes this a good place to stop the post &#8212; for the length of which I apologize.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4440/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4440/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4440&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2012/08/31/edp24-an-attempt-to-get-back-into-the-diagonal-decomposition-approach/feed/</wfw:commentRss>
		<slash:comments>51</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>EDP23 &#8212; second guest post by Gil Kalai</title>
		<link>http://gowers.wordpress.com/2012/08/27/edp23-second-guest-post-by-gil-kalai/</link>
		<comments>http://gowers.wordpress.com/2012/08/27/edp23-second-guest-post-by-gil-kalai/#comments</comments>
		<pubDate>Mon, 27 Aug 2012 09:34:09 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[polymath5]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4434</guid>
		<description><![CDATA[The Large Deviation Heuristic: an example &#8211; triangle-free graphs Here is a very general probabilistic-based heuristic that seems to give good predictions for questions related to EDP. I will refer to this heuristic as &#8220;LDH&#8221;. (In my polymath5 comments I referred to it as PH &#8211; probabilistic heuristic)). I am thankful to Noga Alon and [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4434&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<h2>The Large Deviation Heuristic: an example &#8211;  triangle-free graphs</h2>
<p>Here is a very general probabilistic-based heuristic that seems to give good predictions for questions related to EDP. I will refer to this heuristic as &#8220;LDH&#8221;. (In my polymath5 comments I referred to it as PH &#8211; probabilistic heuristic)).  I am thankful to Noga Alon and to Yuval Peres for some helpful  help.<br />
Here is an example: Suppose we want to study the following extremal problem.</p>
<blockquote><p>What is the largest number of edges in a graph on n vertices with no triangle.</p></blockquote>
<p>If we use the probabilistic method we can ask what is the probability that a random graph in <img src='http://s0.wp.com/latex.php?latex=G%28n%2Cm%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='G(n,m)' title='G(n,m)' class='latex' /> contains no triangle. As long as this probability is positive we know that a triangle-free graph with n vertices and m edges exists. (Being a little careful we can consider <img src='http://s0.wp.com/latex.php?latex=G%28n%2Cp%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='G(n,p)' title='G(n,p)' class='latex' /> instead of <img src='http://s0.wp.com/latex.php?latex=G%28n%2Cm%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='G(n,m)' title='G(n,m)' class='latex' /> where <img src='http://s0.wp.com/latex.php?latex=m%3Dp%7B%7Bn%7D%5Cchoose+%7B2%7D%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m=p{{n}&#92;choose {2}}' title='m=p{{n}&#92;choose {2}}' class='latex' />.  Looking at random graphs gives us a perfectly correct proof of the assertion that there are triangle-free graphs with <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> vertices and <img src='http://s0.wp.com/latex.php?latex=Cn&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Cn' title='Cn' class='latex' /> edges for every <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />.</p>
<p><strong>LDH</strong>:</p>
<p>1) Estimate naively the probability that a random graph in G(n,m) contains no triangle.</p>
<p>2) Choose m so that this estimated probability behaves like 1 over the number of graphs with n vertices and m edges.<br />
<span id="more-4434"></span></p>
<p>So let&#8217;s implement this plan. The probability that a random graph in <img src='http://s0.wp.com/latex.php?latex=G%28n%2Cp%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='G(n,p)' title='G(n,p)' class='latex' /> does not contain a specific triangle is <img src='http://s0.wp.com/latex.php?latex=1-p%5E3&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1-p^3' title='1-p^3' class='latex' />. Naively assuming that these probabilities are independent we estimate the probability of not having any triangle as <img src='http://s0.wp.com/latex.php?latex=%281-p%5E3%29%5E%7B%7Bn%7D%5Cchoose+%7B3%7D%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(1-p^3)^{{n}&#92;choose {3}}' title='(1-p^3)^{{n}&#92;choose {3}}' class='latex' />. We want to find <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> so that this probability is roughly <img src='http://s0.wp.com/latex.php?latex=2%5E%7B-H%28p%29%7B%7Bn%7D%5Cchoose%7B2%7D%7D%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='2^{-H(p){{n}&#92;choose{2}}}' title='2^{-H(p){{n}&#92;choose{2}}}' class='latex' />. This is the case when roughly <img src='http://s0.wp.com/latex.php?latex=p%3Dn%5E%7B-1%2F2%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p=n^{-1/2}' title='p=n^{-1/2}' class='latex' />.</p>
<p><strong>LDH  prediction for Mantel-Turán problem:</strong> The maximum number of edges in a triangle-free graph behaves like <img src='http://s0.wp.com/latex.php?latex=n%5E%7B3%2F2%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n^{3/2}' title='n^{3/2}' class='latex' />.</p>
<p>In other words, the LDH gives two predictions that we will refer to as the &#8220;firm&#8221; prediction and the &#8220;weak&#8221; prediction.</p>
<p>A) (The firm prediction.) There exist triangle-free graphs with <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> vertices and <img src='http://s0.wp.com/latex.php?latex=n%5E%7B3%2F2%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n^{3/2}' title='n^{3/2}' class='latex' /> edges</p>
<p>and</p>
<p>B) (The weak prediction.) There are no (substantially) larger triangle-free graphs.</p>
<p>Prediction A is correct. There are indeed triangle-free graphs with <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> vertices and <img src='http://s0.wp.com/latex.php?latex=n%5E%7B3%2F2%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n^{3/2}' title='n^{3/2}' class='latex' /> edges. (But the LDH does not prove their existence.) Prediction B is miserably false: Actually there are graphs with <img src='http://s0.wp.com/latex.php?latex=n%5E2%2F4&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n^2/4' title='n^2/4' class='latex' /> edges without a triangle. The LDH heuristic ignores the fact that not containing one triangle is not independent of not containing another, and is completely blind to large bipartite graphs.</p>
<p><strong>Excercise: </strong>What is the LDH prediction for the question: How large can a subset of the integers {1,2,&#8230;,n} be if it contains no 3-term arithmetic progression?</p>
<p>We would like to propose the following points regarding the large deviation heuristic:</p>
<ul>
<li>LDH predictions about the existence of some combinatorial objects are quite often true. (We refer to such predictions as the <em>firm predictions</em>.) </li>
<li>LDH weak predictions are blind to various structured examples. Sometimes, if we understand the relevant structures we can update the large deviation predictions. (I will come back to this at the end of the post.)</li>
<li>LDH predictions appear to be quite good for the Erdős Discrepancy Problem (EDP) and for variations of EDP. This is what we are going to discuss now.</li>
</ul>
<p>Since LDH is based on a heuristic method to compute probabilities it is quite possible that different heuristics will give different answers but, overall, we did not encounter this.</p>
<p><strong>Problem 1</strong>: Find natural examples where the LDH firm prediction, namely the prediction for the existence of certain combinatorial objects, fails.</p>
<p><strong>Problem 2:</strong> Find ways to improve the LDH when it fails. (Especially when the answer is known by other methods).</p>
<h2>What does LDH predict for EDP?</h2>
<h3>Direct Computation</h3>
<p>We consider the multiplicative version of Erdős Discrepancy Problem . The number of multiplicative <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' />-sequences of length <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> is close to <img src='http://s0.wp.com/latex.php?latex=2%5E%7B-n%2F%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='2^{-n/&#92;log n}' title='2^{-n/&#92;log n}' class='latex' />. (These sequences are determined by their values on prime indices and the Prime Number Theorem tells us that the number of primes smaller than <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> behaves like <img src='http://s0.wp.com/latex.php?latex=n%2F%5Clog+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n/&#92;log n' title='n/&#92;log n' class='latex' />.) We expect that the LDH computations for the general question will give the same answer.</p>
<p> What we need to compute is:</p>
<blockquote><p>What is the value of <img src='http://s0.wp.com/latex.php?latex=K%3DK%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='K=K(n)' title='K=K(n)' class='latex' /> so that the probability <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' /> that all partial sums <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bi%3D1%7D%5Er%5Cepsilon_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{i=1}^r&#92;epsilon_i' title='&#92;sum_{i=1}^r&#92;epsilon_i' class='latex' /> of a random <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' /> sequence of length <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> belong to the interval <img src='http://s0.wp.com/latex.php?latex=%5B-K%2CK%5D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='[-K,K]' title='[-K,K]' class='latex' /> satisfies <img src='http://s0.wp.com/latex.php?latex=p%3D2%5E%7B-n%2F%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p=2^{-n/&#92;log n}' title='p=2^{-n/&#92;log n}' class='latex' />?</p></blockquote>
<p>This question can be formulated in terms of a simple random walk of length <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> on the line. We start at the origin and at each step we go a unit length to the left or to the right with probability 1/2 for each direction. We want to know the probability that the random walk will be confined to the interval <img src='http://s0.wp.com/latex.php?latex=%5B-K%2CK%5D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='[-K,K]' title='[-K,K]' class='latex' /> and the value of <img src='http://s0.wp.com/latex.php?latex=K&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='K' title='K' class='latex' /> for which this probability is <img src='http://s0.wp.com/latex.php?latex=2%5E%7B-n%2F%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='2^{-n/&#92;log n}' title='2^{-n/&#92;log n}' class='latex' />.</p>
<p><strong>Problem 3 (the answer is known):</strong> What is this value <img src='http://s0.wp.com/latex.php?latex=K%3DK%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='K=K(n)' title='K=K(n)' class='latex' />?</p>
<p>Towards answering problem 2, I  asked over Mathoverflow &#8220;<a href="http://mathoverflow.net/questions/16892/the-probability-for-a-sequence-to-have-small-partial-sums" target="_blank">What is the probability that a random walk of length n will be confined to the interval <img src='http://s0.wp.com/latex.php?latex=%5B-K%2CK%5D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='[-K,K]' title='[-K,K]' class='latex' /></a>&#8221; and Douglas Zare provided a <a href="http://mathoverflow.net/questions/16892/the-probability-for-a-sequence-to-have-small-partial-sums/16940#16940" target="_blank">very detailed answer</a>. Yet, I did not complete the work needed to answer Problem 2. So a few weeks ago I asked Yuval Peres</p>
<blockquote><p>What is the probability that the simple random walk of n steps will be confined to the interval <img src='http://s0.wp.com/latex.php?latex=%5B-K%2CK%5D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='[-K,K]' title='[-K,K]' class='latex' />, and what is the value of <img src='http://s0.wp.com/latex.php?latex=K%3DK%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='K=K(n)' title='K=K(n)' class='latex' /> for which this probability is <img src='http://s0.wp.com/latex.php?latex=2%5E%7B-n%2F%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='2^{-n/&#92;log n}' title='2^{-n/&#92;log n}' class='latex' />?</p></blockquote>
<p style="text-align:left;">And a few hours later I received the following reply from Yuval:</p>
<blockquote>
<p style="text-align:left;"> Gil, the confinement probability in <img src='http://s0.wp.com/latex.php?latex=%5B-K%2CK%5D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='[-K,K]' title='[-K,K]' class='latex' /> decays up to a constant like <img src='http://s0.wp.com/latex.php?latex=%5Cexp%28-cn%2FK%5E2%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;exp(-cn/K^2)' title='&#92;exp(-cn/K^2)' class='latex' /> where <img src='http://s0.wp.com/latex.php?latex=c&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='c' title='c' class='latex' /> is known: it is <img src='http://s0.wp.com/latex.php?latex=%5Cpi%2F2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pi/2' title='&#92;pi/2' class='latex' />. This is classical and you can find it e.g. in Feller volume 2 or in Spitzer’s book. This holds for all <img src='http://s0.wp.com/latex.php?latex=K+%3Do%28%5Csqrt%7Bn%7D%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='K =o(&#92;sqrt{n})' title='K =o(&#92;sqrt{n})' class='latex' />. So the answer to your query is that <img src='http://s0.wp.com/latex.php?latex=K%3DC%5Csqrt%7B%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='K=C&#92;sqrt{&#92;log n}' title='K=C&#92;sqrt{&#92;log n}' class='latex' /> for a suitable <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' />.</p>
</blockquote>
<p style="text-align:left;">So, we get:</p>
<blockquote>
<p style="text-align:left;"><strong>The LDH prediction for the EDP is that the maximum discrepancy of a multiplicative sequence of <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' /> behaves like <img src='http://s0.wp.com/latex.php?latex=%5Csqrt+%7B%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt {&#92;log n}' title='&#92;sqrt {&#92;log n}' class='latex' />. (The same prediction applies to general sequences.)</strong></p>
</blockquote>
<h3>General sequences and positive correlations.</h3>
<p>If we consider general <img src='http://s0.wp.com/latex.php?latex=%5Cpm&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm' title='&#92;pm' class='latex' /> sequences we get the same answer. We need to compute the probability that for a random sequence of length <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' />, all HAP are confined to <img src='http://s0.wp.com/latex.php?latex=K&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='K' title='K' class='latex' />. These HAP are random sequences of lengths <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=n%2F2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n/2' title='n/2' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=n%2F3%2C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n/3,&#92;dots' title='n/3,&#92;dots' class='latex' /> . The probability that the initial sums of a random sequence of length <img src='http://s0.wp.com/latex.php?latex=n%2Fr&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n/r' title='n/r' class='latex' /> is confined to <img src='http://s0.wp.com/latex.php?latex=%5B-K%2CK%5D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='[-K,K]' title='[-K,K]' class='latex' /> is <img src='http://s0.wp.com/latex.php?latex=%5Cexp%28-cn%2FrK%5E2%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;exp(-cn/rK^2)' title='&#92;exp(-cn/rK^2)' class='latex' />. If we assume that these probabilities are independent we get an estimate of <img src='http://s0.wp.com/latex.php?latex=%5Cexp%28-c+n%5Clog+n%2FK%5E2%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;exp(-c n&#92;log n/K^2)' title='&#92;exp(-c n&#92;log n/K^2)' class='latex' /> which behaves like <img src='http://s0.wp.com/latex.php?latex=2%5E%7B-n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='2^{-n}' title='2^{-n}' class='latex' /> when <img src='http://s0.wp.com/latex.php?latex=K%3D%5Csqrt+%7B%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='K=&#92;sqrt {&#92;log n}' title='K=&#92;sqrt {&#92;log n}' class='latex' />. We get the same outcome. Of course the confinement of different HAPs to <img src='http://s0.wp.com/latex.php?latex=%5B-K%2CK%5D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='[-K,K]' title='[-K,K]' class='latex' /> are not independent. In fact they seem positively correlated and this strengthens the case for the firm prediction. But I don&#8217;t know how such positive correlation can be used to prove that the firm prediction is correct. (Of course, the feeling that small discrepancy on different HAP are positively correlated is rather tentative.  We know that for every individual HAP we have with positive probability discrepancy bounded by 1. Yet the probability that this happens for all HAPs is zero.)</p>
<p><strong>Problem  4:</strong>  Fix two positive integers <img src='http://s0.wp.com/latex.php?latex=r%2Cs&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r,s' title='r,s' class='latex' />. For a function <img src='http://s0.wp.com/latex.php?latex=f%3A%5C%7B1%2C2%2C%5Cdots+n%5C%7D%5Cto%5C%7B-1%2C1%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f:&#92;{1,2,&#92;dots n&#92;}&#92;to&#92;{-1,1&#92;}' title='f:&#92;{1,2,&#92;dots n&#92;}&#92;to&#92;{-1,1&#92;}' class='latex' /> consider the two events</p>
<p>1)  For a maximal HAP of gap <img src='http://s0.wp.com/latex.php?latex=r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r' title='r' class='latex' /> the initial sums of the function are confined to <img src='http://s0.wp.com/latex.php?latex=%5B-K%2CK%5D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='[-K,K]' title='[-K,K]' class='latex' />.</p>
<p> 2)  For a maximal HAP of gap <img src='http://s0.wp.com/latex.php?latex=s&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='s' title='s' class='latex' /> the initial sums of the function are confined to <img src='http://s0.wp.com/latex.php?latex=%5B-K%2CK%5D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='[-K,K]' title='[-K,K]' class='latex' />.</p>
<p>Are these two events positively correlated when <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> is large enough?</p>
<h3>Indirect computations</h3>
<p>At an earlier time, I had a version of the LDH based on gaps between vanishing partial sums. Let me discuss it here. We start with the following question: What is a probability that a sequence of <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' /> of length <img src='http://s0.wp.com/latex.php?latex=t&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='t' title='t' class='latex' /> will not have a vanishing partial sum <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bi%3D1%7D%5Er%5Cepsilon_i%3D0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{i=1}^r&#92;epsilon_i=0' title='&#92;sum_{i=1}^r&#92;epsilon_i=0' class='latex' /> where <img src='http://s0.wp.com/latex.php?latex=t%2F2%5Cle+r%5Cle+t&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='t/2&#92;le r&#92;le t' title='t/2&#92;le r&#92;le t' class='latex' />? Another way to ask this question is:</p>
<blockquote><p>What is the probability that a simple random walk of length t will not reach zero in the interval <img src='http://s0.wp.com/latex.php?latex=%5Bt%2F2%2Ct%5D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='[t/2,t]' title='[t/2,t]' class='latex' />? </p></blockquote>
<p>For our purposes all we need to know is that this probability tends to some constant strictly between 0 and 1. The precise value is related to a classic question and let me cite another email by Yuval Peres about it:</p>
<blockquote><p>The probability that a simple random walk <del>will</del> not meet 0 in the time interval [s,t], where   s=xt, tends as <img src='http://s0.wp.com/latex.php?latex=t+%5Cto+%5Cinfty&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='t &#92;to &#92;infty' title='t &#92;to &#92;infty' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%282%2F%5Cpi%29%5Carcsin%28%5Csqrt%7Bx%7D%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(2/&#92;pi)&#92;arcsin(&#92;sqrt{x})' title='(2/&#92;pi)&#92;arcsin(&#92;sqrt{x})' class='latex' />. This is one of the two classical arcsine laws for random walks that you can find in many sources, including e.g. Durrett’s book or proposition 5.7 page 137 in  <a href="http://research.microsoft.com/en-us/um/people/peres/brbook.pdf" target="_blank">My Brownian book</a> . There you will see this law applies to all random walks with increments of mean zero and finite variance. More combinatorial arguments for the special case of SRW can be found in Feller vol I, as well as in <a href="http://www.scribd.com/doc/34335170/The-Arcsine-Law-of-a-Simple-Random-Walk" target="_blank">these slides</a>.  </p></blockquote>
<p>Given a random walk on the line we will try to estimate the probability that we can find sequence of indices <img src='http://s0.wp.com/latex.php?latex=i_1%2Ci_2%2C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='i_1,i_2,&#92;dots' title='i_1,i_2,&#92;dots' class='latex' /> for which the random walk reaches the origin so that the differences between consecutive indices is between <img src='http://s0.wp.com/latex.php?latex=t%2F2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='t/2' title='t/2' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=t&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='t' title='t' class='latex' /> and compute the value of <img src='http://s0.wp.com/latex.php?latex=t&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='t' title='t' class='latex' /> for which this probability is <img src='http://s0.wp.com/latex.php?latex=2%5E%7B-c_1n%2F%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='2^{-c_1n/&#92;log n}' title='2^{-c_1n/&#92;log n}' class='latex' />. Since the probability for a random walk of <img src='http://s0.wp.com/latex.php?latex=t&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='t' title='t' class='latex' /> steps to reach the origin between the <img src='http://s0.wp.com/latex.php?latex=t%2F2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='t/2' title='t/2' class='latex' />th and <img src='http://s0.wp.com/latex.php?latex=t&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='t' title='t' class='latex' />th steps is a certain constant we have the following LDH predictions:</p>
<p>(Firm prediction) There exists a multiplicative sequence of length <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> such that the gap between two consecutive vanishing partial sums is (up to a multiplicative constant) at most <img src='http://s0.wp.com/latex.php?latex=%5Clog+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;log n' title='&#92;log n' class='latex' />.</p>
<p>(Weak prediction) This is best possible.</p>
<p>Some additional gymnastics allow us to move from the prediction regarding multiplicative sequences of length <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> where all the gaps between consecutive zeros behave like <img src='http://s0.wp.com/latex.php?latex=%5Clog+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;log n' title='&#92;log n' class='latex' /> to sequences where, in addition,  the maximum discrepancy behaves like <img src='http://s0.wp.com/latex.php?latex=%5Csqrt+%7B%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt {&#92;log n}' title='&#92;sqrt {&#92;log n}' class='latex' />. The probability that between every two consecutive zeros the maximum discrepancy will be smaller than <img src='http://s0.wp.com/latex.php?latex=K%5Csqrt%7B%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='K&#92;sqrt{&#92;log n}' title='K&#92;sqrt{&#92;log n}' class='latex' /> is indeed small but if <img src='http://s0.wp.com/latex.php?latex=K&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='K' title='K' class='latex' /> is large this too behaves like <img src='http://s0.wp.com/latex.php?latex=2%5E%7B-c_2+n%2F%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='2^{-c_2 n/&#92;log n}' title='2^{-c_2 n/&#92;log n}' class='latex' /> so the indirect applications of the LDH based on gaps between consecutive zeroes gives the same prediction as the direct prediction above.</p>
<p>Here too the computation for general sequences gives the same outcome and indicates positive correlation between events which in the heuristic are pretended to be independent.  </p>
<p>(Let me remark that over the polymath5 threads there were several remarks in the direction of trying to show that there are no <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' /> sequences of length <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> where the differences between consecutive vanishing partial sums are bounded for all HAP. This is weaker than what is required to show that the discrepancy is unbounded.)</p>
<h2>Variations of EDP and related discrepancy problems:</h2>
<h3>Variations</h3>
<p>Let me mention briefly several variations of EDP and what LDH says about them. The LDH is responsible for the guesses we propose in the previous post for variations <strong>E1-E8</strong> of EDP.</p>
<p>The weak LDH predictions will not change if we consider <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1%2C0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1,0' title='&#92;pm 1,0' class='latex' /> sequences where the zero entries occupy the indices divisible by 3. But the prediction fails in this case (the discrepancy can be bounded).</p>
<p>If we consider only HAPs with prime power differences the LDH prediction is the square root of <img src='http://s0.wp.com/latex.php?latex=%5Clog+%5Clog+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;log &#92;log n' title='&#92;log &#92;log n' class='latex' />. I would guess that this is the true behavior. Note that if we consider HAPs with prime differences, we have the same LDH prediction, but since we can have bounded discrepancy the weak LDH prediction is false.</p>
<p>The firm prediction of the LDH predicts a polylog(n) discrepancy (in fact even <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7B%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{&#92;log n}' title='&#92;sqrt{&#92;log n}' class='latex' /> discrepancy) when we restrict our attention to square free integers and to  random subsets of integers.</p>
<h3>Probabilistic analogs for EDP</h3>
<h3><strong>Problem 5:</strong> Does the LDH give good predictions for discrepancy problems for random hypergraphs <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=G&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='G' title='G' class='latex' /> $on {1,2,&#8230;,n}?</h3>
<p>Let <img src='http://s0.wp.com/latex.php?latex=%5Ccal+H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;cal H' title='&#92;cal H' class='latex' /> be the hypergraph in which we consider precisely one edge <img src='http://s0.wp.com/latex.php?latex=E_r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='E_r' title='E_r' class='latex' /> which contains every element with probability <img src='http://s0.wp.com/latex.php?latex=1%2Fr&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1/r' title='1/r' class='latex' />. Let <img src='http://s0.wp.com/latex.php?latex=%5Ccal+G&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;cal G' title='&#92;cal G' class='latex' /> be the hypergraph obtianed fron <img src='http://s0.wp.com/latex.php?latex=%5Ccal+H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;cal H' title='&#92;cal H' class='latex' /> by adding as edges to all initial segments of edges in <img src='http://s0.wp.com/latex.php?latex=%5Ccal+H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;cal H' title='&#92;cal H' class='latex' />. The LDH predicts that the discrapancy of $\cal H$ is bounded. When we move from <img src='http://s0.wp.com/latex.php?latex=%7B%5Ccal+H%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='{&#92;cal H}' title='{&#92;cal H}' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%7B%5Ccal+G%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='{&#92;cal G}' title='{&#92;cal G}' class='latex' /> then we come back to the <img src='http://s0.wp.com/latex.php?latex=%5Csqrt+%28%5Clog+n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt (&#92;log n)' title='&#92;sqrt (&#92;log n)' class='latex' /> prediction. Such probabilistic versions take away the number-theoretic aspect from EDP. Still the probabilities of low discrepancy for different edges are not independent and the problem still looks hard. </p>
<h3>LDH and the six standard deviation theorem</h3>
<p>Of course, it is of interest to understand the LDH predictions (both for <img src='http://s0.wp.com/latex.php?latex=%5Ccal+H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;cal H' title='&#92;cal H' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5Ccal+G&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;cal G' title='&#92;cal G' class='latex' />) for the general case when we have <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' /> probabilities <img src='http://s0.wp.com/latex.php?latex=p_1%2Cp_2%2C%5Cdots+%2C+p_m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_1,p_2,&#92;dots , p_m' title='p_1,p_2,&#92;dots , p_m' class='latex' /> and the edges are <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' /> random subsets based on these probabilities. The most famous case is when we consider all <img src='http://s0.wp.com/latex.php?latex=p_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_i' title='p_i' class='latex' />s to be 1/2 and <img src='http://s0.wp.com/latex.php?latex=m%3Dn&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m=n' title='m=n' class='latex' />. Joel Spencer&#8217;s Six Standard Deviation Theorem asserts that the discrepancy for <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> random subsets of <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C2%2C%5Cdots%2Cn%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{1,2,&#92;dots,n&#92;}' title='&#92;{1,2,&#92;dots,n&#92;}' class='latex' /> is at most <img src='http://s0.wp.com/latex.php?latex=6+%5Csqrt+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='6 &#92;sqrt n' title='6 &#92;sqrt n' class='latex' />. Nore generally it was proved that the discrepancy of a random hypergraph with <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' /> edges behaves like <img src='http://s0.wp.com/latex.php?latex=%5Csqrt+m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt m' title='&#92;sqrt m' class='latex' /> when <img src='http://s0.wp.com/latex.php?latex=m+%5Cle+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m &#92;le n' title='m &#92;le n' class='latex' /> and like <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7Bn+%5Clog%28m%2Fn%29%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{n &#92;log(m/n)}' title='&#92;sqrt{n &#92;log(m/n)}' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=m+%5Cge+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m &#92;ge n' title='m &#92;ge n' class='latex' />.</p>
<h3>Roth&#8217;s theorem and how the LDH predicts the answer</h3>
<p>Recall that Roth&#8217;s theorem is about the discrepancy of the hypergraph whose edges correspond to all arithmetic progressions in {1,2,&#8230;,n}.</p>
<p>The LDH predicts the correct answer, at least roughly. The probability that a maximal AP of gap r will be confined to [-K,K] is <img src='http://s0.wp.com/latex.php?latex=%5Cexp%28-cn%2FK%5E2%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;exp(-cn/K^2)' title='&#92;exp(-cn/K^2)' class='latex' />.  When <img src='http://s0.wp.com/latex.php?latex=K%3Dc%5Ccdot+n%5E%7B1%2F4%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='K=c&#92;cdot n^{1/4}' title='K=c&#92;cdot n^{1/4}' class='latex' /> we have to multiply the <img src='http://s0.wp.com/latex.php?latex=r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r' title='r' class='latex' />th power of <img src='http://s0.wp.com/latex.php?latex=%5Cexp%28-cn%5E%7B1%2F2%7D%2Fr%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;exp(-cn^{1/2}/r)' title='&#92;exp(-cn^{1/2}/r)' class='latex' />, for <img src='http://s0.wp.com/latex.php?latex=r%3D1%2C2%2C%5Cdots%2C+%5Calpha%5Csqrt+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r=1,2,&#92;dots, &#92;alpha&#92;sqrt n' title='r=1,2,&#92;dots, &#92;alpha&#92;sqrt n' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%5Calpha&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;alpha' title='&#92;alpha' class='latex' /> is small and this will give us <img src='http://s0.wp.com/latex.php?latex=c%5E%7B-n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='c^{-n}' title='c^{-n}' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=c%2C+1%3Cc%3C2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='c, 1&lt;c&lt;2' title='c, 1&lt;c&lt;2' class='latex' />. The contribution coming from APs of larger gaps will be of a smaller order of magnitide.</p>
<p><strong>Problem 6:</strong> Are the methods of proving upper bounds for the disrepancy problem for APs relevant for proving better upper bounds for EDP, its extensions, and its variations?</p>
<h2>Large deviations for extremal graph properties revisited</h2>
<h3>Graph limits and the work of Chatterjee and Varadhan</h3>
<p>We started this post by trying to answer a classical problem in extremal graph theory using a probabilistic heuristic which is based on large deviation estimates. It would not be irresponsible to say that the heuristic estimates proposed  a very poor prediction to the extremal problem we considered.</p>
<p>Let <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> be a fixed graph and let <img src='http://s0.wp.com/latex.php?latex=T%28n%2CH%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='T(n,H)' title='T(n,H)' class='latex' /> be the maximum number of edges for a graph <img src='http://s0.wp.com/latex.php?latex=G&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='G' title='G' class='latex' /> on <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> vertices that does not contain <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> as a subgraph. <a href="http://en.wikipedia.org/wiki/Tur%C3%A1n's_theorem" target="_blank">Turán&#8217;s 1941 theorem</a> determined the value of <img src='http://s0.wp.com/latex.php?latex=T%28n%2CH%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='T(n,H)' title='T(n,H)' class='latex' /> when <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> is a complete graph with <img src='http://s0.wp.com/latex.php?latex=r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r' title='r' class='latex' /> vertices. Turán&#8217;s theorem is the starting point of the wide and deep area of extremal graph theory.  The case of triangle-free graphs goes back to Mantel in 1907.</p>
<p>One of the important discoveries in graph theory which is related both to additive number theory and to probability theory is the notion of limits of graphs. This notion is connected to the famous Szemeredi lemma. The theory of limits of graphs sheds new light on extremal graph theory; in a sense it tells us what the relevant structures for <img src='http://s0.wp.com/latex.php?latex=T%28n%2CH%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='T(n,H)' title='T(n,H)' class='latex' /> are when <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> is not bipartite.</p>
<p>A recent paper entitled <a href="http://front.math.ucdavis.edu/1008.1946" target="_blank"><em>The large deviation principle for the Erdős-Rényi random graph</em> </a>by Sourav <a href="http://front.math.ucdavis.edu/author/S.Chatterjee">Chatterjee</a> and S. R. S. <a href="http://front.math.ucdavis.edu/author/S.Varadhan">Varadhan</a> revealed a connection between large deviation for properties of Erdős-Renyi graphs and graphons &#8211; limits of graphs. (It complements a large body of related results obtained by various other methods.) Here is the abstract.</p>
<blockquote><p><strong>Abstract:</strong> What does an Erdős-Renyi graph look like when a rare event happens? This paper answers this question when p is fixed and n tends to infinity by establishing a large deviation principle under an appropriate topology. The formulation and proof of the main result uses the recent development of the theory of graph limits by Lovasz and coauthors and Szemeredi&#8217;s regularity lemma from graph theory. As a basic application of the general principle, we work out large deviations for the number of triangles in G(n,p). Surprisingly, even this simple example yields an interesting double phase transition.</p></blockquote>
<p>So we can &#8220;morally&#8221; understand why the weak prediction of LDH fails for the property of &#8220;including a triangle&#8221;. To obtain a better prediction we also need to condition on various relevant limit structures of the random graph.</p>
<h3>LDH for the triangle-free process</h3>
<p>Consider the following graph process. We start with the empty graph on <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> vertices and add random edges one after the other conditioned on not forming a triangle. <a href="http://www.math.cmu.edu/~tbohman/papers/r3t.ps" target="_blank">Bohman proved</a> that such a process for <img src='http://s0.wp.com/latex.php?latex=n%3Dct%5E2%2F%5Clog+t&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n=ct^2/&#92;log t' title='n=ct^2/&#92;log t' class='latex' /> will lead with substantial probability to a triangle-free graph not containing an independent set of size <img src='http://s0.wp.com/latex.php?latex=t&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='t' title='t' class='latex' />. This proved a conjecture of Joel Spencer and gave a new proof to a famous result by Jeong Han Kim (with a remarkable history that I won&#8217;t describe here). Can we apply the LDH to the triangle-free processes to give a heuristic argument why triangle-free graphs on <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> vertices with <img src='http://s0.wp.com/latex.php?latex=%5COmega+%28n%5E2%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;Omega (n^2)' title='&#92;Omega (n^2)' class='latex' /> edges exist?</p>
<h2>Extremal problems for bipartite graphs</h2>
<p><strong>Problem  7:</strong> Let <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> be a fixed <strong>bipartite</strong> graph. Does the LDH give good predictions for <img src='http://s0.wp.com/latex.php?latex=T%28n%2CH%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='T(n,H)' title='T(n,H)' class='latex' />?</p>
<p>It turns out that for extremal problems on graphs the LDH gives quite similar predictions to those obtained by the rigorous well known edge-deletion method based on the following proposition:</p>
<p><strong>Proposition:</strong> If, for a random graph with n vertices and 2m edges, the expected number of copies of <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> is smaller than half the number of edges, then <img src='http://s0.wp.com/latex.php?latex=T%28n%2CH%29%3Em&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='T(n,H)&gt;m' title='T(n,H)&gt;m' class='latex' />.</p>
<p>For Turán-type problems, the LDH&#8217;s predictions are quite similar to those obtained by the edge-deletion method. (I am not aware of a similar trick for discrepancy problems.)The LDH predicts the existence of <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' />-free graphs with roughly <img src='http://s0.wp.com/latex.php?latex=%5Clog+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;log n' title='&#92;log n' class='latex' /> times more edges than what the edge-deletion method gives. Achieving an improvement of similar kind is a difficult and well-known problem in extremal combinatorics. See the paper: T. Bohman and P. Keevash. <a href="http://www.math.cmu.edu/~tbohman/papers/inv10.pdf">The early evolution of the H-free process.</a> Inventiones Mathematicae, 181, 291&#8211;336, 2010.</p>
<p>The LDH prediction are still far from the correct answer. Erdős conjectured that for any bipartite <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> with degeneracy <img src='http://s0.wp.com/latex.php?latex=r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r' title='r' class='latex' /> the Turán number is at most <img src='http://s0.wp.com/latex.php?latex=O%28n%5E%7B2-1%2Fr%7D%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='O(n^{2-1/r})' title='O(n^{2-1/r})' class='latex' />, and that the Turan number is <img src='http://s0.wp.com/latex.php?latex=O%28n%5E%7B3%2F2%7D%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='O(n^{3/2})' title='O(n^{3/2})' class='latex' /> iff the graph is bipartite and 2-degenerate. For more on that see, e.g.,  N. Alon, M. Krivelevich and B. Sudakov, <a href="http://www.math.tau.ac.il/~nogaa/PDFS/aksinduced4.pdf" target="_blank">Turan numbers of bipartite graphs and related Ramsey-type questions</a>, Combinatorics, Probability and Computing 12 (2003), 477-494.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4434/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4434/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4434&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2012/08/27/edp23-second-guest-post-by-gil-kalai/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>EDP22 &#8212; first guest post by Gil Kalai</title>
		<link>http://gowers.wordpress.com/2012/08/22/edp22-first-guest-post-from-gil-kalai/</link>
		<comments>http://gowers.wordpress.com/2012/08/22/edp22-first-guest-post-from-gil-kalai/#comments</comments>
		<pubDate>Wed, 22 Aug 2012 15:59:08 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[polymath5]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4417</guid>
		<description><![CDATA[The purpose of this post is to ignite some new activity related to polymath5 about Erdős&#8217; Discrepancy Problem. This post is thus a polymath5 research thread and comments (also unrelated to the suggestions in this post) are welcome. The general form of a discrepancy problem Let be a hypergraph, i.e., a collection of subsets of [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4417&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<h3>The purpose of this post is to ignite some new activity related to polymath5 about Erdős&#8217; Discrepancy Problem. This post is thus a polymath5 research thread and comments (also unrelated to the suggestions in this post) are welcome.</h3>
<h3>The general form of a discrepancy problem</h3>
<p>Let <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> be a hypergraph, i.e., a collection of subsets of a ground set <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' />. The<strong> discrepancy</strong> of <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' />, denoted by <img src='http://s0.wp.com/latex.php?latex=disc%28H%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='disc(H)' title='disc(H)' class='latex' />  is the minimum over all functions <img src='http://s0.wp.com/latex.php?latex=f%3AA+%5Cto+%5C%7B-1%2C1%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f:A &#92;to &#92;{-1,1&#92;}' title='f:A &#92;to &#92;{-1,1&#92;}' class='latex' /> of the maximum over all  <img src='http://s0.wp.com/latex.php?latex=S+%5Cin+H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S &#92;in H' title='S &#92;in H' class='latex' /> of</p>
<blockquote><p><img src='http://s0.wp.com/latex.php?latex=%7C%5Csum+%5C%7Bf%28s%29%3As%5Cin+S%5C%7D%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|&#92;sum &#92;{f(s):s&#92;in S&#92;}|' title='|&#92;sum &#92;{f(s):s&#92;in S&#92;}|' class='latex' />.  </p></blockquote>
<p>We will mention one additional definition,  that of <strong>hereditary discrepancy</strong>. When <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> is a hypergraph and <img src='http://s0.wp.com/latex.php?latex=B%5Csubset+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B&#92;subset A' title='B&#92;subset A' class='latex' />, the restriction of <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' /> is the hypergraph with vertex set <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' /> whose edges are all sets of the form <img src='http://s0.wp.com/latex.php?latex=S+%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S &#92;cap B' title='S &#92;cap B' class='latex' /> for edges <img src='http://s0.wp.com/latex.php?latex=S&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S' title='S' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' />. The hereditary discrepancy <img src='http://s0.wp.com/latex.php?latex=herddisc%28H%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='herddisc(H)' title='herddisc(H)' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> is the maximum over all <img src='http://s0.wp.com/latex.php?latex=B+%5Csubset+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B &#92;subset A' title='B &#92;subset A' class='latex' /> of the discrepancy of <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> restricted to <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' />.<br />
<a href="http://gilkalai.wordpress.com/2011/08/28/discrepancy-the-beck-fiala-theorem-and-the-answer-to-test-your-intuition-14/">Here is a link</a> for a recent post discussing discrepancy and the famous Beck-Fiala theorem. The Beck-Fiala theorem assert that if every element in <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> is included in at most <img src='http://s0.wp.com/latex.php?latex=t&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='t' title='t' class='latex' /> sets in <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> then <img src='http://s0.wp.com/latex.php?latex=disc+%28H%29+%5Cle+2t&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='disc (H) &#92;le 2t' title='disc (H) &#92;le 2t' class='latex' />. (Of course, the theorem applies also to the hereditary discrepancy.)<br />
<span id="more-4417"></span></p>
<h2>The Erdős&#8217; Discrepancy Problem reviewed</h2>
<h3>The problem and links to polymath5 material</h3>
<p><strong>Erdős Discrepancy Problem (EDP).</strong> <em>Is it possible to find a <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' />-valued sequence <img src='http://s0.wp.com/latex.php?latex=x_1%2Cx_2%2Cx_3%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_1,x_2,x_3&#92;dots' title='x_1,x_2,x_3&#92;dots' class='latex' /> and a constant <img title="C" src="http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=333333&amp;s=0" alt="C" /> such that  <img src='http://s0.wp.com/latex.php?latex=%7C%5Csum_%7Bi%3D1%7D%5Enx_%7Bid%7D%7C+%5Cle+C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|&#92;sum_{i=1}^nx_{id}| &#92;le C' title='|&#92;sum_{i=1}^nx_{id}| &#92;le C' class='latex' /> for every <img title="n" src="http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0" alt="n" /> and every <img title="d" src="http://s0.wp.com/latex.php?latex=d&amp;bg=ffffff&amp;fg=333333&amp;s=0" alt="d" />?</em><br />
A HAP (Homogeneous Arithmetic Progression) is an arithmetic prograssion of the form {k,2k,&#8230;,rk}.  EDP asks about the sum of a <img src='http://s0.wp.com/latex.php?latex=%5Cpm1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm1' title='&#92;pm1' class='latex' /> sequence on HAPs.<br />
Given a <img src='http://s0.wp.com/latex.php?latex=%5Cpm1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm1' title='&#92;pm1' class='latex' /> sequence <img src='http://s0.wp.com/latex.php?latex=x_1%2Cx_2%2C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_1,x_2,&#92;dots' title='x_1,x_2,&#92;dots' class='latex' /> define <img src='http://s0.wp.com/latex.php?latex=E%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='E(n)' title='E(n)' class='latex' /> to be the maximum of sums of subsequences over HAPs which are subsets of {1,2,3,&#8230;,n}. EDP asks if we can find a sequence for which <img src='http://s0.wp.com/latex.php?latex=E%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='E(n)' title='E(n)' class='latex' /> is uniformly bounded and we will be interested in finding sequences where <img src='http://s0.wp.com/latex.php?latex=E%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='E(n)' title='E(n)' class='latex' /> grows slowly.<br />
EDP was extensively discussed and studied in polymath5. Here is <a href="http://gowers.wordpress.com/2009/12/17/erdoss-discrepancy-problem/" target="_blank">the link of the first post</a>. Here are links to all <a href="http://gowers.wordpress.com/category/polymath5/" target="_blank">polymath5 posts</a>.  Here is the link to <a href="http://michaelnielsen.org/polymath1/index.php?title=The_Erdős_discrepancy_problem">polymath5 wiki</a>.<br />
A sequence is called <em>completely multiplicative</em> if <img src='http://s0.wp.com/latex.php?latex=x_%7Bnm%7D%3Dx_nx_m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_{nm}=x_nx_m' title='x_{nm}=x_nx_m' class='latex' /> for every <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' />. EDP is of great interest even if we restrict our attention to completely multiplicative sequences. For those we have only to consider partial sums <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bi%3D1%7D%5Er+x_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sum_{i=1}^r x_i' title='&#92;sum_{i=1}^r x_i' class='latex' />.</p>
<h3>An example where the discrepancy grows like log n.</h3>
<p>The function that takes n to 1 if the last non-zero digit of n in its ternary representation is 1 and -1 if the last non-zero digit is 2 is completely multiplicative and the partial sum up to n is easily shown to be at most <img src='http://s0.wp.com/latex.php?latex=%5Clog_3+n%2B1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;log_3 n+1' title='&#92;log_3 n+1' class='latex' />. Therefore, the rate at which the worst discrepancy grows, as a function of the length of the homogeneous progression, can be as slow as logarithmic.</p>
<h3>What a random assignment gives us.</h3>
<p>A random sequence (or a random completely multiplicative sequence) gives us discrepancy close to <img src='http://s0.wp.com/latex.php?latex=%5Csqrt+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt n' title='&#92;sqrt n' class='latex' />. (There is  apparently an additional <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7B%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{&#92;log n}' title='&#92;sqrt{&#92;log n}' class='latex' /> factor but I am not sure of the precise asymptotic behavior.)</p>
<h2>Two greedy algorithms</h2>
<p>From now on we write <img src='http://s0.wp.com/latex.php?latex=f%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(n)' title='f(n)' class='latex' /> instead of <img src='http://s0.wp.com/latex.php?latex=x_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_n' title='x_n' class='latex' />.<br />
<strong>Greedy algorithm 1</strong> (for multiplicative functions): Assign the value <img src='http://s0.wp.com/latex.php?latex=f%28p_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(p_n)' title='f(p_n)' class='latex' /> as to minimize the maximum discrepancy in every partial sum whose terms are now determined.<br />
<strong>Greedy algorithm 1</strong> (for general functions): Assign the value <img src='http://s0.wp.com/latex.php?latex=f%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(n)' title='f(n)' class='latex' /> so as to minimize the maximum discrepancy in every HAP whose terms are now determined.<br />
<strong>Problem 1:</strong> How does Greedy algorithm 1 perform?<br />
<strong>Empirical observation:</strong> (This was claimed in some polymath 5 remarks but I am not sure if there was definite evidence. I would appreciate clarifications.) The discrepancy for sequences based on the greedy algorithm 1 (for multiplicative functions and for general functions) is <img src='http://s0.wp.com/latex.php?latex=%5COmega+%28%5Csqrt+n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;Omega (&#92;sqrt n)' title='&#92;Omega (&#92;sqrt n)' class='latex' />.<br />
<strong>Interpretation:</strong> Greedy algorithm 1 optimizes an &#8220;irrelevant task&#8221;.<br />
We would like to suggest here<br />
<strong>Greedy algorithm 2</strong> (for multiplicative sequences): Assign the value <img src='http://s0.wp.com/latex.php?latex=f%28p_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f(p_n)' title='f(p_n)' class='latex' /> so as to minimize the maximum discrepancy in every partial sum where unassigned entries get the value zero.<br />
<strong>Greedy algorithm 2</strong> (for general sequences ): Assign the value f(n) as to minimize the maximum discrepancy in every partial sum in every HAP where unassigned entries get the value zero.<br />
<strong>Problem 2:</strong> How does Greedy algorithm 2 perform?<br />
Omri Floman ran Greedy 2 on inputs such as N=10000 and got a discrepancy of around 20 (there is a bit of randomness involved in cases of ties). For N=100000 he got about 45. It is unclear what the behavior is.<br />
<strong>Problem 3:</strong> Can we find an optimal compromise between Greedy 1 and Greedy 2?<br />
<strong>Problem 4:</strong> Can randomization help?<br />
Of course, since we know that a sequence of length n and discrepancy log n exists if we draw a random <img src='http://s0.wp.com/latex.php?latex=%5Cpm+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;pm 1' title='&#92;pm 1' class='latex' /> sequence there is a probability larger than <img src='http://s0.wp.com/latex.php?latex=2%5E%7B-n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='2^{-n}' title='2^{-n}' class='latex' /> of reaching such a low discrepancy sequence. What we want to ask is if randomization can lead to a method of getting a low-discrepancy sequence with larger probability, or even better with provable larger probability. (Or even better yet, to a sequence of provable low discrepancy via an effective algorithm.)<br />
<strong>Problem 5:</strong> How do Greedy algorithms 1 and 2 perform if we apply them for a random ordering of {1,2,&#8230;,n}. </p>
<h2>Extensions of Erdős&#8217; Discrepancy problems</h2>
<p>Here are some variations of the Erdős&#8217; Discrepancy Problem along with some guesses for the answers. I will explain where these guesses came from next time.<br />
<strong>E0:</strong> Erdős&#8217; Discrepancy Problem </p>
<blockquote><p><strong>Guess</strong>: <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7B%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{&#92;log n}' title='&#92;sqrt{&#92;log n}' class='latex' /></p></blockquote>
<p><strong>E1</strong>: Allow f(n) to attain values which are complex numbers of norm 1.</p>
<blockquote><p><strong>Guess</strong>, same answer <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7B%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{&#92;log n}' title='&#92;sqrt{&#92;log n}' class='latex' /></p></blockquote>
<p>(Perhaps we can even consider norm-1 vectors in some Euclidean space or Banach space. If this is too premissive (<img src='http://s0.wp.com/latex.php?latex=%5Cell_1%5E%7B%5Cinfty%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;ell_1^{&#92;infty}' title='&#92;ell_1^{&#92;infty}' class='latex' />) we may go down to a constant.)</p>
<h3>Square-free integers</h3>
<p><strong>E2</strong>: The EDP  for square-free integers<br />
Here we simply consider sequences where <img src='http://s0.wp.com/latex.php?latex=x_m%3D0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_m=0' title='x_m=0' class='latex' /> if <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' /> is not square-free. For this variation the multiplicative version of the problem is not a special case of the general question. (Here multiplicative means that <img src='http://s0.wp.com/latex.php?latex=x_%7Bnm%7D%3Dx_nx_m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_{nm}=x_nx_m' title='x_{nm}=x_nx_m' class='latex' /> if <img src='http://s0.wp.com/latex.php?latex=n%2Cm&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n,m' title='n,m' class='latex' /> are coprime.) <strong></strong></p>
<blockquote><p><strong>Guess:</strong> for both versions, same answer <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7B%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{&#92;log n}' title='&#92;sqrt{&#92;log n}' class='latex' />.</p></blockquote>
<h3>Random subsets of integers</h3>
<p><strong>E3:</strong> Instead of square-free integers consider a random dense subset of integers and assume that the sequence vanishes for indices not in the subset.<br />
<strong>Guess:</strong> same answer <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7B%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{&#92;log n}' title='&#92;sqrt{&#92;log n}' class='latex' />.</p>
<h3>Random sets of primes</h3>
<p>Here we consider multiplicative functions which are non zero only on a random dense sert of primes.<br />
<strong>E4</strong>: The EDP for a random dense subset of primes.</p>
<blockquote><p><strong>Guess:</strong> same answer</p></blockquote>
<p><strong>Problem 6:</strong> Find a sequence for problems E2, E3, and E4 with discrepancy <img src='http://s0.wp.com/latex.php?latex=o%28%5Csqrt+n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='o(&#92;sqrt n)' title='o(&#92;sqrt n)' class='latex' />, or better <img src='http://s0.wp.com/latex.php?latex=n%5E%7B1%2F2-%5Cepsilon%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n^{1/2-&#92;epsilon}' title='n^{1/2-&#92;epsilon}' class='latex' /> or better <img src='http://s0.wp.com/latex.php?latex=polylog+%28n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='polylog (n)' title='polylog (n)' class='latex' />. <strong>Update</strong>: see below.  It can be shown that sequences with discrepancy <img src='http://s0.wp.com/latex.php?latex=n%5E%7B1%2F%5Clog+%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n^{1/&#92;log &#92;log n}' title='n^{1/&#92;log &#92;log n}' class='latex' /> exist for all these variations.</p>
<h3>Prime power differences</h3>
<p><strong>E5</strong>: The EDP for HAP with prime power differences.</p>
<blockquote><p><strong>Guess:</strong>  <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7B%5Clog+%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{&#92;log &#92;log n}' title='&#92;sqrt{&#92;log &#92;log n}' class='latex' />. </p></blockquote>
<h3>Beurling primes and integers</h3>
<p>Beurling primes were defined by Arne Beurling in 1937 and he also proved a prime number theorem for them. The most general definition is very simple: Consider a sequence of real numbers <img src='http://s0.wp.com/latex.php?latex=1%3Cx_2%3Cx_3%3C%5Cdots+%3Cx_n%3C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1&lt;x_2&lt;x_3&lt;&#92;dots &lt;x_n&lt;&#92;dots' title='1&lt;x_2&lt;x_3&lt;&#92;dots &lt;x_n&lt;&#92;dots' class='latex' /> regarded as &#8220;primes&#8221; and consider the (ordered) sequence of their products (multiplicities allowed) <img src='http://s0.wp.com/latex.php?latex=1%3Cy_2%3Cy_3%3C%5Cdots%3Cy_n%3C%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1&lt;y_2&lt;y_3&lt;&#92;dots&lt;y_n&lt;&#92;dots' title='1&lt;y_2&lt;y_3&lt;&#92;dots&lt;y_n&lt;&#92;dots' class='latex' /> as the &#8220;integers&#8221;. (We will assume that all products are distinct although for the original purpose of defining a zeta function multiplicities may play a role.) Beurling primes played a role in the <a href="http://en.wordpress.com/tag/polymath4/" target="_blank">polymath4 discussions</a>.<br />
<strong>E6</strong>: The EDP for Beurling primes and integers</p>
<blockquote><p><strong>Careless</strong> <strong>Guess:</strong> at most  <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7B%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{&#92;log n}' title='&#92;sqrt{&#92;log n}' class='latex' />.</p></blockquote>
<h3>Sune Kristian Jakobsen&#8217;s systems of pseudointegers</h3>
<p>One way to think about Beurling primes is to identify <img src='http://s0.wp.com/latex.php?latex=y_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y_n' title='y_n' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> and to reorder the integers according to the ordering of the <img src='http://s0.wp.com/latex.php?latex=y_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='y_n' title='y_n' class='latex' />s. Actually, given the ordering we can recover uniquely the Beurling primes. A much more general notion of &#8220;pseudointegers&#8221;  <a href="http://gowers.wordpress.com/2010/02/08/edp7-emergency-post/#comment-6116" target="_blank">was suggested</a> over polymath5 by Sune Kristian Jakobsen. See also the <a href="http://michaelnielsen.org/polymath1/index.php?title=Pseudointegers" target="_blank">overview over Polymath5&#8242;s wiki</a>.<br />
An ordering of the natural numbers is a SKJ-ordering if it fulfills the following two conditions:<br />
1) If <img src='http://s0.wp.com/latex.php?latex=a+%5Cge+b&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a &#92;ge b' title='a &#92;ge b' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=c+%5Cge+d&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='c &#92;ge d' title='c &#92;ge d' class='latex' /> then <img src='http://s0.wp.com/latex.php?latex=ac%5Cge+bd&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='ac&#92;ge bd' title='ac&#92;ge bd' class='latex' />.<br />
and<br />
2) for any <img src='http://s0.wp.com/latex.php?latex=a&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a' title='a' class='latex' /> the set <img src='http://s0.wp.com/latex.php?latex=%5C%7Bb%3A+b%3Ca%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{b: b&lt;a&#92;}' title='&#92;{b: b&lt;a&#92;}' class='latex' /> is finite.<br />
<strong>Remark: </strong>1) Sune Kristian considered orderings on sequences of integers (the exponents in the prime factorization). This is equivalent to (but perhaps less provocative than) the formulation here. 2) We can expect that Beurling-orderings are a tiny tiny subset of SKJ-orderings.<br />
Given an SKJ-ordering of the natural numbers we can ask the EDP for that ordering.<br />
<strong>E7</strong>: The EDP for  <cite>Sune Kristian Jakobsen</cite> systems of integers.</p>
<blockquote><p><strong>Careless</strong> <strong>Guess:</strong> at most  <strong>polylog n</strong></p></blockquote>
<p><strong>Problem 7</strong>: How does Greedy algorithm 2 perform for variations E2 -E7.<br />
The answers for E2, E3 and E4 is especially interesting because these are examples where the best upper bounds I know behave like  <img src='http://s0.wp.com/latex.php?latex=%5Csqrt+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt n' title='&#92;sqrt n' class='latex' /> and are obtained from a random assignment. (See problem 5.)<br />
<strong>Problem 8:</strong> Prove the assertion of EDP (namely that the discrepancy is unbounded) for an SKJ-pseudointegers of <strong><em>your</em></strong> choice.</p>
<h2>Related Discrepancy Problems</h2>
<h3>Discrepancy and hereditary discrepancy</h3>
<p>Recall that for a hypergraph <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> defined on a ser <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> the discrepancy of <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' />, denoted by <img src='http://s0.wp.com/latex.php?latex=disc%28H%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='disc(H)' title='disc(H)' class='latex' />  is the minimum over all functions <img src='http://s0.wp.com/latex.php?latex=f%3AA+%5Cto+%5C%7B-1%2C1%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f:A &#92;to &#92;{-1,1&#92;}' title='f:A &#92;to &#92;{-1,1&#92;}' class='latex' /> of the maximum over all  <img src='http://s0.wp.com/latex.php?latex=S+%5Cin+H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S &#92;in H' title='S &#92;in H' class='latex' /> of</p>
<blockquote><p><img src='http://s0.wp.com/latex.php?latex=%7C%5Csum+%5C%7Bf%28s%29%3As%5Cin+S%5C%7D%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|&#92;sum &#92;{f(s):s&#92;in S&#92;}|' title='|&#92;sum &#92;{f(s):s&#92;in S&#92;}|' class='latex' />.  </p></blockquote>
<p>When <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> is a hypergraph and <img src='http://s0.wp.com/latex.php?latex=B%5Csubset+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B&#92;subset A' title='B&#92;subset A' class='latex' />, the restriction of <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' /> is the hypergraph with vertex set <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' /> whose edges are all sets of the form <img src='http://s0.wp.com/latex.php?latex=S+%5Ccap+B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S &#92;cap B' title='S &#92;cap B' class='latex' /> for edges <img src='http://s0.wp.com/latex.php?latex=S&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S' title='S' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' />. The hereditary discrepancy <img src='http://s0.wp.com/latex.php?latex=herddisc%28H%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='herddisc(H)' title='herddisc(H)' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> is the maximum over all <img src='http://s0.wp.com/latex.php?latex=B+%5Csubset+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B &#92;subset A' title='B &#92;subset A' class='latex' /> of the discrepancy of <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> restricted to <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='B' title='B' class='latex' />. </p>
<h3>An operation of hypergraphs</h3>
<p>Let <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> be a hypergraph on a vertex set <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> and assume that <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> is ordered, e.g., <img src='http://s0.wp.com/latex.php?latex=A%3D%5C%7B1%2C2%2C%5Cdots%2Cn%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A=&#92;{1,2,&#92;dots,n&#92;}' title='A=&#92;{1,2,&#92;dots,n&#92;}' class='latex' />. Consider the hypergraph <img src='http://s0.wp.com/latex.php?latex=G&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='G' title='G' class='latex' /> obtained from <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> by adding for every set <img src='http://s0.wp.com/latex.php?latex=S+%5Cin+H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S &#92;in H' title='S &#92;in H' class='latex' /> all its initial subsets w.r.t. the ordering. We will consider the operation of moving from <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=G&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='G' title='G' class='latex' />. Note that for EDP all our variations E1-E8 the underlying hypergraph is obtained by this construction <img src='http://s0.wp.com/latex.php?latex=H+%5Cto+G&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H &#92;to G' title='H &#92;to G' class='latex' />. (From a certain natural hypergraphs <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' />). I further guess that for all the variations E1-E8 if we consider the hypergraph <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> (before taking initial segments) then the discrepancy is bounded. </p>
<h3>Probabilistic versions</h3>
<p>Let <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> be a ground set and let <img src='http://s0.wp.com/latex.php?latex=1%5Cge+p_1%5Cge+p_2%5Cge%5Cdots%5Cge+p_m%3E0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1&#92;ge p_1&#92;ge p_2&#92;ge&#92;dots&#92;ge p_m&gt;0' title='1&#92;ge p_1&#92;ge p_2&#92;ge&#92;dots&#92;ge p_m&gt;0' class='latex' /> be a set of reals. First consider the discrepancy problem for a random hypergraph <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' /> edges, whose ith edge is a random set <img src='http://s0.wp.com/latex.php?latex=S_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S_i' title='S_i' class='latex' /> so that every element of <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> belongs to <img src='http://s0.wp.com/latex.php?latex=S_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S_i' title='S_i' class='latex' /> with probability <img src='http://s0.wp.com/latex.php?latex=p_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_i' title='p_i' class='latex' /> (and all these events are statistically independent). Next we can move from <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=G&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='G' title='G' class='latex' /> as described above.<br />
The case of taking <img src='http://s0.wp.com/latex.php?latex=G&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='G' title='G' class='latex' /> for the probabilities <img src='http://s0.wp.com/latex.php?latex=p_i%3D1%2Fi&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_i=1/i' title='p_i=1/i' class='latex' /> can be seen as a probabilistic analog of EDP. My guess for the discrepancy of this case is also <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7B%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{&#92;log n}' title='&#92;sqrt{&#92;log n}' class='latex' />. I also guess that the discrepancy of <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> itself is bounded.<br />
<strong>Problem 9:</strong> How do Greedy algorithms 1 and 2 perform for the random hypergraph <img src='http://s0.wp.com/latex.php?latex=G&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='G' title='G' class='latex' />. </p>
<h3>Roth&#8217;s theorem about the discrepancy of arithmetic progressions</h3>
<p>For EDP the ground set is all natural numbers, or just the set {1,2,&#8230;,n} and the hypergraph is the collection of all HAPs(homogeneous arithmetic progressions). Roth considered the hypergraphs of <strong>all</strong> arithmetic progressions. Roth proved that the discrepancy in this case is at <img src='http://s0.wp.com/latex.php?latex=%5COmega+%28n%5E%7B1%2F4%7D%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;Omega (n^{1/4})' title='&#92;Omega (n^{1/4})' class='latex' />. The existence of <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' />  with discrepancy of <img src='http://s0.wp.com/latex.php?latex=O%28n%5E%7B1%2F4%7D+%5Clog%5E%7B5%2F2%7D+n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='O(n^{1/4} &#92;log^{5/2} n)' title='O(n^{1/4} &#92;log^{5/2} n)' class='latex' /> was proved by Beck and the <img src='http://s0.wp.com/latex.php?latex=%5Clog%5E%7B5%2F2%7D+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;log^{5/2} n' title='&#92;log^{5/2} n' class='latex' /> factor was removed by Matousek and Spencer. These works by Beck, Matousek and Spencer may be very relevant to prove the existence of low-discrepancy sequences for EDP and some of its extensions.</p>
<h2>Some results with Noga Alon</h2>
<p>Here are some results proved in collaboration with Noga Alon which are based on the Beck-Fiala theorem and a general argument on moving from <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=G&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='G' title='G' class='latex' />. (It is likely that some of them are known and we will be happy to know.)<br />
<strong>Proposition 1:</strong> For <img src='http://s0.wp.com/latex.php?latex=p_i%3D1%2Fi&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_i=1/i' title='p_i=1/i' class='latex' /> the discrepancy of <img src='http://s0.wp.com/latex.php?latex=G&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='G' title='G' class='latex' /> described above is at most <img src='http://s0.wp.com/latex.php?latex=%28%5Clog%5E3n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(&#92;log^3n)' title='(&#92;log^3n)' class='latex' />.<br />
This follows from the following general proposaition:<br />
<strong>Proposition 2:</strong> Let <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> be a hypergraph on an <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n' title='n' class='latex' /> element ordered set <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> and let <img src='http://s0.wp.com/latex.php?latex=d&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d' title='d' class='latex' /> be the maximum degree of a point of <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' />, then <img src='http://s0.wp.com/latex.php?latex=disc%28G%29%5Cle+2d+log%5E2n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='disc(G)&#92;le 2d log^2n' title='disc(G)&#92;le 2d log^2n' class='latex' />.<br />
<strong>Proof</strong>: Let <img src='http://s0.wp.com/latex.php?latex=X%3DX_1+%5Ccup+X_2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X=X_1 &#92;cup X_2' title='X=X_1 &#92;cup X_2' class='latex' /> be a partition of <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> into two nearly equal intervals, <img src='http://s0.wp.com/latex.php?latex=X_1%3DX_%7B11%7D%5Ccup+X_%7B12%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X_1=X_{11}&#92;cup X_{12}' title='X_1=X_{11}&#92;cup X_{12}' class='latex' />  a partition of <img src='http://s0.wp.com/latex.php?latex=X_1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X_1' title='X_1' class='latex' /> into 2 nearly equal intervals and similarly <img src='http://s0.wp.com/latex.php?latex=X_2%3D+X_%7B21%7D%5Ccup+X_%7B22%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X_2= X_{21}&#92;cup X_{22}' title='X_2= X_{21}&#92;cup X_{22}' class='latex' />, etc (<img src='http://s0.wp.com/latex.php?latex=%5Clog+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;log n' title='&#92;log n' class='latex' /> levels). Now define a new hypergraph <img src='http://s0.wp.com/latex.php?latex=H%27&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H&#039;' title='H&#039;' class='latex' /> obtained from <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> by replacing each set <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> by the following <img src='http://s0.wp.com/latex.php?latex=1%2B2%2B4%2B...%2Bn&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1+2+4+...+n' title='1+2+4+...+n' class='latex' /> sets (possibly some empty):</p>
<blockquote><p><img src='http://s0.wp.com/latex.php?latex=A%5Ccap+X%28%3DA%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A&#92;cap X(=A)' title='A&#92;cap X(=A)' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=A%5Ccap+X_1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A&#92;cap X_1' title='A&#92;cap X_1' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=A%5Ccap+X_2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A&#92;cap X_2' title='A&#92;cap X_2' class='latex' />,<img src='http://s0.wp.com/latex.php?latex=%5Cdots%2C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;dots,' title='&#92;dots,' class='latex' /> <img src='http://s0.wp.com/latex.php?latex=A+%5Ccap+X_%7Bij%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A &#92;cap X_{ij}' title='A &#92;cap X_{ij}' class='latex' /> (for all <img src='http://s0.wp.com/latex.php?latex=i%2Cj&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='i,j' title='i,j' class='latex' />), <img src='http://s0.wp.com/latex.php?latex=%5Cdots&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;dots' title='&#92;dots' class='latex' />.</p></blockquote>
<p>Note that the maximum degree of a point in <img src='http://s0.wp.com/latex.php?latex=H%27&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H&#039;' title='H&#039;' class='latex' /> is <img src='http://s0.wp.com/latex.php?latex=d+log_2+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d log_2 n' title='d log_2 n' class='latex' />, hence the discrepancy of <img src='http://s0.wp.com/latex.php?latex=H%27&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H&#039;' title='H&#039;' class='latex' /> is at most <img src='http://s0.wp.com/latex.php?latex=2d%5Clog+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='2d&#92;log n' title='2d&#92;log n' class='latex' /> by Beck-Fiala&#8217;s theorem. Also, each initial segment of <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> is a union of at most <img src='http://s0.wp.com/latex.php?latex=%5Clog+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;log n' title='&#92;log n' class='latex' /> pairwise disjoint members of <img src='http://s0.wp.com/latex.php?latex=H%27&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H&#039;' title='H&#039;' class='latex' />. This gives Proposition 1, and since for the case described in Proposition 2 the degree of vertices for <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> behaves like <img src='http://s0.wp.com/latex.php?latex=%5Clog+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;log n' title='&#92;log n' class='latex' /> we obtain Proposition 2 as well.<br />
<strong>Proposition 3:</strong> for versions <strong>E1-E8 </strong>there are examples where the discrepancy is <img src='http://s0.wp.com/latex.php?latex=n%5E%7BO%281%2F%5Clog%5Clog+n%29%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n^{O(1/&#92;log&#92;log n)}' title='n^{O(1/&#92;log&#92;log n)}' class='latex' />.<br />
<strong>Proof: </strong>In all these examples we consider a hypergraph <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> on the ground set A={1,2,&#8230;,n}, or on a subest of A with positive density (or for E7 and E8 on another non conventionally ordered set of integers). Then move to the hypergraph <img src='http://s0.wp.com/latex.php?latex=G&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='G' title='G' class='latex' /> of initial subsets of edges of <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' />.  To apply Proposition 2 and Beck-Fiala&#8217;s theorem we need to find an upper bound on the degree of a vertex in the hypergraph $\cal H$. Consider the case of EDP. (The other cases are similar.) The maximum degree is obtained by an integer that is the product of the first k distinct primes.  In this case the degree <img src='http://s0.wp.com/latex.php?latex=2%5Ek&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='2^k' title='2^k' class='latex' /> is smaller than <img src='http://s0.wp.com/latex.php?latex=n%5E%7B1%2F%5Clog+%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n^{1/&#92;log &#92;log n}' title='n^{1/&#92;log &#92;log n}' class='latex' />. Note that the proof applies even to hereditary discrepancy.<br />
<strong>Problem 10:</strong> Use similar ideas to prove better (even polylog (n)) constructions for EDP and its variations <strong>E1</strong>-<strong>E8</strong>.<br />
I extend all my earlier guesses when we move from discrepancy to hereditary discrepancy.<br />
<strong>Proposition 4: </strong>The hereditary discrepancy for the hypergraph of HAP on {1,2,&#8230;,n} is <img src='http://s0.wp.com/latex.php?latex=%5COmega+%28%5Csqrt%7B%5Clog+n%2F%5Clog+%5Clog+n%7D%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;Omega (&#92;sqrt{&#92;log n/&#92;log &#92;log n})' title='&#92;Omega (&#92;sqrt{&#92;log n/&#92;log &#92;log n})' class='latex' />.<br />
The proof is obtained as follows: Consider the first <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' /> primes, and a hypergraph <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='H' title='H' class='latex' /> on {1,2,&#8230;,m} of discrepancy <img src='http://s0.wp.com/latex.php?latex=%5Csqrt+m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt m' title='&#92;sqrt m' class='latex' />.  (For example, a Hadamard matrix of order <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' /> describes such an hypergraph. ) Next, for every edge <img src='http://s0.wp.com/latex.php?latex=S+%5Cin+H&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S &#92;in H' title='S &#92;in H' class='latex' /> consider the integer that correspond to  products of primes whose indices are in <img src='http://s0.wp.com/latex.php?latex=S&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S' title='S' class='latex' />. Then restrict your attention only to these integers.<br />
Propositions 3 and 4 show that the hereditary discrepancy of the hypergraphs of HAPs in {1,2,&#8230;,n} is between <img src='http://s0.wp.com/latex.php?latex=%5Csqrt+%7B%5Clog+n%2F%5Clog%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt {&#92;log n/&#92;log&#92;log n}' title='&#92;sqrt {&#92;log n/&#92;log&#92;log n}' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=n%5E%7B1%2F%5Clog+%5Clog+n%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n^{1/&#92;log &#92;log n}' title='n^{1/&#92;log &#92;log n}' class='latex' />.</p>
<h2>A Quick Outlook at Polymath 5: EDP 1-21</h2>
<p>Before our quick review of polymath5, let me mention a major difficulty which seems relevant to all approaches:</p>
<blockquote><p>If we allow zero entries in our sequence, even a small density of zero entries, then the discrepancy can be bounded.</p></blockquote>
<p>For example consider the sequence 1, -1, 0, 1, -1, 0, 1, -1, 0, &#8230;<br />
<strong>Experimentations:</strong> Computer experiments played a large role in polymath5. One of the most striking discoveries was a sequence of length 1124 of discrepancy 2. Later a second sequence of length 1124 and discrepancy 2 was found but not a larger sequence. Several people made great contributions with computer experiments.<br />
<strong>Additive Fourier analysis ideas:</strong> Using Fourier analysis on our sequence and somehow reaching a contradiction when we assume that the discrepancy is bounded was a suggestion that was dominant in the first few threads and we came back to from time to time.<br />
<strong>Terry Tao&#8217;s reduction:</strong> Terry Tao found a reduction from the general question to the variation of multiplicative functions with complex norm-1 values (<strong>E1</strong>). The beautiful proof relies on &#8220;multiplicative&#8221; Fourier analysis and it is striking how &#8220;little&#8221; the proof uses.<br />
<strong>Problem 11</strong> (Sune Kristian Jakobsen):  Does Tao&#8217;s reduction apply to arbitrary SKJ-pseudointegers.<br />
<strong>Semi-definite programming:</strong> A major turning point was a suggestion by Moses Charikar to use a natural semi-definite relaxation for the problem. This promising avenue was explored over several threads and here too computer experimentation was done.  One reason to regard Charikar&#8217;s approach (and the related linear programming approach described next)  as hopeful is precisely because it offers a convincing way to get around the difficulty demonstrated by the sequence 1, -1, 0, 1, -1, 0, &#8230;<br />
<strong>A generalization and linear programming:</strong> The last few threads were centered around a different related relaxation proposed by Tim Gowers. The problem was generalized from a single-sequence problem to a pair-of-sequences problem. (This is a common motif in extremal combinatorics although here the motivation came from functional analysis.) Then relaxation of the problem led to a very appealing linear programming question.<br />
<strong>Mathoverflow questions:</strong> Mathoverflow was used to ask <a href="http://mathoverflow.net/questions/tagged/polymath5" target="_blank">several questions</a> related to the project.<br />
<strong>Participation:</strong> Polymath 5 attracted much interest and wide participation. Overall, it  did not attract researchers with major prior interest in discrepancy theory.</p>
<h2>Discrepancy</h2>
<p>Discrepancy theory is a huge an exciting area. Let me just give references to some books.<br />
Jozsef Beck, William W. L. Chen: <em>Irregularities of Distribution</em>, Cambridge University Press, 1987. And, Jozsef Beck: <em>Irregularities of Distribution (Cambridge Tracts in Mathematics Cambridge Tracts in Mathemat Volume 89)(Paperback)</em><br />
Bernard Chazele: <em>The Discrepancy Method: Randomness and Complexity</em><br />
Jiri Matousek: <em>Geometric Discrepancy: An Illustrated Guide (Algorithms and Combinatorics)</em><br />
J. Beck: <em>Combinatorial Games: Tic-Tac-Toe Theory</em>, Cambridge University Press, 2008.<br />
J. Beck: <em>A forthcoming book.</em><br />
Reading these books will prepare you better to deal with EDP and will enrich your life tremendously.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4417/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4417/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4417&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2012/08/22/edp22-first-guest-post-from-gil-kalai/feed/</wfw:commentRss>
		<slash:comments>28</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>

		<media:content url="http://s0.wp.com/latex.php?latex=C&#38;bg=ffffff&#38;fg=333333&#38;s=0" medium="image">
			<media:title type="html">C</media:title>
		</media:content>

		<media:content url="http://s0.wp.com/latex.php?latex=n&#38;bg=ffffff&#38;fg=333333&#38;s=0" medium="image">
			<media:title type="html">n</media:title>
		</media:content>

		<media:content url="http://s0.wp.com/latex.php?latex=d&#38;bg=ffffff&#38;fg=333333&#38;s=0" medium="image">
			<media:title type="html">d</media:title>
		</media:content>
	</item>
		<item>
		<title>EDP: a possible revival</title>
		<link>http://gowers.wordpress.com/2012/08/22/edp-a-possible-revival/</link>
		<comments>http://gowers.wordpress.com/2012/08/22/edp-a-possible-revival/#comments</comments>
		<pubDate>Wed, 22 Aug 2012 15:54:52 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[polymath5]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4423</guid>
		<description><![CDATA[A few months ago, Gil Kalai got in touch to suggest that it was time to revisit Polymath5, the attempt to prove the Erd&#337;s discrepancy problem. I agreed, but we have both been rather busy in the interim, so it is only now that we are ready. One of the things that Gil did was [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4423&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>A few months ago, <a href="http://gilkalai.wordpress.com/">Gil Kalai</a> got in touch to suggest that it was time to revisit Polymath5, the attempt to prove the Erd&#337;s discrepancy problem. I agreed, but we have both been rather busy in the interim, so it is only now that we are ready. One of the things that Gil did was write three posts, which I shall be putting up as guest posts, the first one today and the others over the next few days. I hope that people who contributed to the project the first time round will consider having another go, and that people who didn&#8217;t, but would be interested, will find that they can use <a href="http://michaelnielsen.org/polymath1/index.php?title=The_Erdős_discrepancy_problem">the wiki that we created to record our progress</a> to get up to speed with what is already known. Once Gil&#8217;s posts are up, I&#8217;ll probably write a post myself, and I hope that some serious work might start in early September. I always felt that when the original attempt petered out, it was just a kind of loss of energy that individual mathematicians often feel, rather than the result of hitting a brick wall. And just as having a break from a problem is often useful for individuals, I hope it will turn out to be for this polymath project. If it doesn&#8217;t, then maybe I&#8217;ll do something I meant to do much earlier, which is write up in paper form the progress that has already been made. (Of course, if I do that, then anybody is free to contribute to the writing.)</p>
<p>If you want to look at some of the earlier posts, they are collected together in <a href="http://gowers.wordpress.com/category/polymath5/">the polymath5 category</a> on this blog.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4423/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4423/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4423&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2012/08/22/edp-a-possible-revival/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>What are dense Sidon subsets of {1,2,&#8230;,n} like?</title>
		<link>http://gowers.wordpress.com/2012/07/13/what-are-dense-sidon-subsets-of-12-n-like/</link>
		<comments>http://gowers.wordpress.com/2012/07/13/what-are-dense-sidon-subsets-of-12-n-like/#comments</comments>
		<pubDate>Fri, 13 Jul 2012 10:50:13 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[Straight maths]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4398</guid>
		<description><![CDATA[The short answer if you don&#8217;t feel like reading a post with some actual mathematics in it is that I don&#8217;t know. Now for the longer answer. A subset of is called a Sidon set if the only solutions of the equation with are the trivial ones with and or and . Since the number [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4398&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>The short answer if you don&#8217;t feel like reading a post with some actual mathematics in it is that I don&#8217;t know. </p>
<p>Now for the longer answer. A subset <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C2%2C%5Cdots%2Cn%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{1,2,&#92;dots,n&#92;}' title='&#92;{1,2,&#92;dots,n&#92;}' class='latex' /> is called a <em>Sidon set</em> if the only solutions of the equation <img src='http://s0.wp.com/latex.php?latex=a%2Bb%3Dc%2Bd&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a+b=c+d' title='a+b=c+d' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=a%2Cb%2Cc%2Cd%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a,b,c,d&#92;in A' title='a,b,c,d&#92;in A' class='latex' /> are the trivial ones with <img src='http://s0.wp.com/latex.php?latex=a%3Dc&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a=c' title='a=c' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=b%3Dd&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='b=d' title='b=d' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=a%3Dd&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a=d' title='a=d' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=b%3Dc&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='b=c' title='b=c' class='latex' />. Since the number of pairs <img src='http://s0.wp.com/latex.php?latex=a%5Cleq+b&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a&#92;leq b' title='a&#92;leq b' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=a%2Cb%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a,b&#92;in A' title='a,b&#92;in A' class='latex' /> is <img src='http://s0.wp.com/latex.php?latex=%7CA%7C%28%7CA%7C%2B1%29%2F2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|A|(|A|+1)/2' title='|A|(|A|+1)/2' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=2%5Cleq+a%2Bb%5Cleq+2n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='2&#92;leq a+b&#92;leq 2n' title='2&#92;leq a+b&#92;leq 2n' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=a%2Cb%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='a,b&#92;in A' title='a,b&#92;in A' class='latex' />, it is trivial that if <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> is a Sidon set, then <img src='http://s0.wp.com/latex.php?latex=%7CA%7C%28%7CA%7C%2B1%29%2F2%5Cleq+2n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|A|(|A|+1)/2&#92;leq 2n' title='|A|(|A|+1)/2&#92;leq 2n' class='latex' />, which gives an upper bound for <img src='http://s0.wp.com/latex.php?latex=%7CA%7C&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|A|' title='|A|' class='latex' /> of around <img src='http://s0.wp.com/latex.php?latex=2%5Csqrt%7Bn%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='2&#92;sqrt{n}' title='2&#92;sqrt{n}' class='latex' />.<br />
<span id="more-4398"></span></p>
<p>There is also a matching lower bound, up to a constant. I think it is due to Erd&#334;s. One notes first that the subset <img src='http://s0.wp.com/latex.php?latex=%5CGamma&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;Gamma' title='&#92;Gamma' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BZ%7D_p%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{Z}_p^2' title='&#92;mathbb{Z}_p^2' class='latex' /> defined by <img src='http://s0.wp.com/latex.php?latex=%5CGamma%3D%5C%7B%28x%2Cx%5E2%29%3Ax%5Cin%5Cmathbb%7BZ%7D_p%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;Gamma=&#92;{(x,x^2):x&#92;in&#92;mathbb{Z}_p&#92;}' title='&#92;Gamma=&#92;{(x,x^2):x&#92;in&#92;mathbb{Z}_p&#92;}' class='latex' /> is a Sidon set inside <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BZ%7D_p%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{Z}_p^2' title='&#92;mathbb{Z}_p^2' class='latex' />. Indeed, if <img src='http://s0.wp.com/latex.php?latex=%28x%2Cx%5E2%29%2B%28y%2Cy%5E2%29%3D%28z%2Cz%5E2%29%2B%28w%2Cw%5E2%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(x,x^2)+(y,y^2)=(z,z^2)+(w,w^2)' title='(x,x^2)+(y,y^2)=(z,z^2)+(w,w^2)' class='latex' />, then <img src='http://s0.wp.com/latex.php?latex=x%5E2-z%5E2%3Dw%5E2-y%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x^2-z^2=w^2-y^2' title='x^2-z^2=w^2-y^2' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=x-z%3Dw-y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x-z=w-y' title='x-z=w-y' class='latex' />. Assuming the quadruple is not degenerate, <img src='http://s0.wp.com/latex.php?latex=x%5Cne+z&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x&#92;ne z' title='x&#92;ne z' class='latex' />, so we can divide the first equation by the second to deduce that <img src='http://s0.wp.com/latex.php?latex=x%2Bz%3Dw%2By&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x+z=w+y' title='x+z=w+y' class='latex' />, and that implies that <img src='http://s0.wp.com/latex.php?latex=x%3Dw&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x=w' title='x=w' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=z%3Dy&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='z=y' title='z=y' class='latex' />, so the quadruple is degenerate for a different reason.</p>
<p>Thus, we can find a Sidon subset of <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BZ%7D_p%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{Z}_p^2' title='&#92;mathbb{Z}_p^2' class='latex' /> of size <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' />, which is the square root of the size of <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BZ%7D_p%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{Z}_p^2' title='&#92;mathbb{Z}_p^2' class='latex' />. We can now regard this set as a subset of <img src='http://s0.wp.com/latex.php?latex=%5C%7B0%2C1%2C%5Cdots%2Cp-1%5C%7D%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{0,1,&#92;dots,p-1&#92;}^2' title='&#92;{0,1,&#92;dots,p-1&#92;}^2' class='latex' />, and then map each point <img src='http://s0.wp.com/latex.php?latex=%28x%2Cy%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='(x,y)' title='(x,y)' class='latex' /> to the number <img src='http://s0.wp.com/latex.php?latex=1%2Bx%2B2py&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1+x+2py' title='1+x+2py' class='latex' />. It is easy to check that the resulting set is a Sidon subset of <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C2%2C%5Cdots%2C2p%5E2%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{1,2,&#92;dots,2p^2&#92;}' title='&#92;{1,2,&#92;dots,2p^2&#92;}' class='latex' />. </p>
<p>Much more is known about this: the correct bound is now known up to a constant. However, that is not the theme of this post. What I want to discuss is whether any kind of converse of this result is true. That is, can one say that a Sidon set of size close to <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7Bn%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{n}' title='&#92;sqrt{n}' class='latex' /> must be constructed in some kind of quadratic way?</p>
<p>Some very weak evidence in favour of a conclusion like that is that random methods fail miserably to produce Sidon sets of anything like size <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7Bn%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{n}' title='&#92;sqrt{n}' class='latex' />. Indeed, let&#8217;s pick a subset <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C2%2C%5Cdots%2Cn%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{1,2,&#92;dots,n&#92;}' title='&#92;{1,2,&#92;dots,n&#92;}' class='latex' /> randomly by choosing each element with probability <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p' title='p' class='latex' />. The total number of quadruples <img src='http://s0.wp.com/latex.php?latex=x%2By%3Dz%2Bw&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x+y=z+w' title='x+y=z+w' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=x%2Cy%2Cz%2Cw&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x,y,z,w' title='x,y,z,w' class='latex' /> taken from <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C2%2C%5Cdots%2Cn%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{1,2,&#92;dots,n&#92;}' title='&#92;{1,2,&#92;dots,n&#92;}' class='latex' /> is roughly <img src='http://s0.wp.com/latex.php?latex=n%5E3&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n^3' title='n^3' class='latex' /> (since apart from a few edge effects <img src='http://s0.wp.com/latex.php?latex=x%2Cy&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x,y' title='x,y' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=z&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='z' title='z' class='latex' /> can be chosen more or less freely and they determine <img src='http://s0.wp.com/latex.php?latex=w&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='w' title='w' class='latex' />), and each nondegenerate one has a probability <img src='http://s0.wp.com/latex.php?latex=p%5E4&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p^4' title='p^4' class='latex' />. So the expected number of nondegenerate quadruples <img src='http://s0.wp.com/latex.php?latex=x%2By%3Dz%2Bw&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x+y=z+w' title='x+y=z+w' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=x%2Cy%2Cz%2Cw%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x,y,z,w&#92;in A' title='x,y,z,w&#92;in A' class='latex' /> is roughly <img src='http://s0.wp.com/latex.php?latex=p%5E4n%5E3&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p^4n^3' title='p^4n^3' class='latex' />. For this to be smaller than <img src='http://s0.wp.com/latex.php?latex=%7CA%7C%2F2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|A|/2' title='|A|/2' class='latex' /> (so we can remove a point from each one and still have plenty of <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> left), we need <img src='http://s0.wp.com/latex.php?latex=p%5E4n%5E3&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p^4n^3' title='p^4n^3' class='latex' /> to be significantly smaller than <img src='http://s0.wp.com/latex.php?latex=pn&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='pn' title='pn' class='latex' />, which gives a bound of <img src='http://s0.wp.com/latex.php?latex=p%3Dcn%5E%7B-2%2F3%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p=cn^{-2/3}' title='p=cn^{-2/3}' class='latex' />, which gives us <img src='http://s0.wp.com/latex.php?latex=%7CA%7C%3Dcn%5E%7B1%2F3%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|A|=cn^{1/3}' title='|A|=cn^{1/3}' class='latex' />.</p>
<p>There are several other ways to see that random sets of size <img src='http://s0.wp.com/latex.php?latex=n%5E%7B1%2F2%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n^{1/2}' title='n^{1/2}' class='latex' /> are very different indeed from Sidon sets, which suggests that Sidon sets of size close to <img src='http://s0.wp.com/latex.php?latex=n%5E%7B1%2F2%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n^{1/2}' title='n^{1/2}' class='latex' /> must have interesting structure. But what is that structure?</p>
<p>It isn&#8217;t easy even to state a conjecture here, since before we map the subset of <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BZ%7D_p%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{Z}_p^2' title='&#92;mathbb{Z}_p^2' class='latex' /> to the integers, we can apply an invertible linear transformation to it. It is hard to think of a way of characterizing the kinds of sets that can arise like this, let alone thinking about how to make the structure looser when the set has size only <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7Bn%7D%2F1000&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;sqrt{n}/1000' title='&#92;sqrt{n}/1000' class='latex' />. (Additive combinatorialists will immediately recognise that I am inspired by Freiman&#8217;s theorem here.)</p>
<p>But today &#8230; make that yesterday as a night has passed since I wrote those two words &#8230; I was at a talk given by Javier Cilleruelo, thanks to which I learnt of an example that places a much more serious constraint on any conjecture one might hope to make. The example is one I sort of knew already, since it is based on a brilliant argument of Imre Ruzsa, who created an infinite Sidon set <img src='http://s0.wp.com/latex.php?latex=S&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S' title='S' class='latex' /> such that the asymptotic size of <img src='http://s0.wp.com/latex.php?latex=S%5Ccap%5C%7B1%2C2%2C%5Cdots%2Cn%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='S&#92;cap&#92;{1,2,&#92;dots,n&#92;}' title='S&#92;cap&#92;{1,2,&#92;dots,n&#92;}' class='latex' /> is around <img src='http://s0.wp.com/latex.php?latex=n%5E%7B%5Csqrt%7B2%7D-1%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n^{&#92;sqrt{2}-1}' title='n^{&#92;sqrt{2}-1}' class='latex' />. (As with the finite case, <img src='http://s0.wp.com/latex.php?latex=n%5E%7B1%2F3%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n^{1/3}' title='n^{1/3}' class='latex' /> is very easy; the true answer is conjectured to be <img src='http://s0.wp.com/latex.php?latex=n%5E%7B1%2F2-o%281%29%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n^{1/2-o(1)}' title='n^{1/2-o(1)}' class='latex' />; Ruzsa was the first person to beat <img src='http://s0.wp.com/latex.php?latex=n%5E%7B1%2F3%2Bo%281%29%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n^{1/3+o(1)}' title='n^{1/3+o(1)}' class='latex' /> and his exponent has not been improved.) What Cilleruelo mentioned in passing was that you can use some of Ruzsa&#8217;s ideas in an easy way to get a pretty decent example in the finite case. This is something I hadn&#8217;t realized. I presume Ruzsa himself was well aware of it, but I don&#8217;t remember it being mentioned in his paper (though I haven&#8217;t looked at the paper for many years, so he might have done). What I find particularly interesting is that the example is completely different from the graph-of-<img src='http://s0.wp.com/latex.php?latex=x%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x^2' title='x^2' class='latex' /> type examples.</p>
<p>Since the construction is simple (to understand, that is) and rather gorgeous, I thought it merited a blog post. And maybe it can also serve as an advertisement for Ruzsa&#8217;s amazing paper.</p>
<p>It begins with the observation that the logarithms of the primes form a Sidon set of reals: if <img src='http://s0.wp.com/latex.php?latex=p%2Cq%2Cr%2Cs&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p,q,r,s' title='p,q,r,s' class='latex' /> are primes with <img src='http://s0.wp.com/latex.php?latex=pq%3Drs&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='pq=rs' title='pq=rs' class='latex' />, then either <img src='http://s0.wp.com/latex.php?latex=p%3Dr&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p=r' title='p=r' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=q%3Ds&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='q=s' title='q=s' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=p%3Ds&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p=s' title='p=s' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=q%3Dr&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='q=r' title='q=r' class='latex' />; therefore, the same is true if <img src='http://s0.wp.com/latex.php?latex=%5Clog+p%2B%5Clog+q%3D%5Clog+r%2B%5Clog+s&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;log p+&#92;log q=&#92;log r+&#92;log s' title='&#92;log p+&#92;log q=&#92;log r+&#92;log s' class='latex' />. </p>
<p>&#8220;So what?&#8221; it is tempting to say. After all, the logs of the primes are not integers. Indeed, there are far more of them than there are integers.</p>
<p>However, at this point a principle comes in that I often feel doesn&#8217;t deserve to be as incredibly fundamentally useful as it in fact is: that two distinct integers must differ by at least 1. It&#8217;s not too much of an exaggeration to say, for example, that to prove that a concrete number is irrational or transcendental you have to find some way of reducing the problem to this principle. (That is, you say that if your number is rational/algebraic then there must be two distinct integers that have difference less than 1.) For this problem we make a much simpler use of the principle. If <img src='http://s0.wp.com/latex.php?latex=pq%5Cne+rs&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='pq&#92;ne rs' title='pq&#92;ne rs' class='latex' />, then <img src='http://s0.wp.com/latex.php?latex=%7Cpq-rs%7C%5Cgeq+1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|pq-rs|&#92;geq 1' title='|pq-rs|&#92;geq 1' class='latex' />, since <img src='http://s0.wp.com/latex.php?latex=p%2Cq%2Cr%2Cs&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p,q,r,s' title='p,q,r,s' class='latex' /> are integers. Since the derivative of <img src='http://s0.wp.com/latex.php?latex=%5Clog+x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;log x' title='&#92;log x' class='latex' /> is <img src='http://s0.wp.com/latex.php?latex=1%2Fx&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1/x' title='1/x' class='latex' />, that tells us that <img src='http://s0.wp.com/latex.php?latex=%5Clog%28pq%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;log(pq)' title='&#92;log(pq)' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5Clog%28rs%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;log(rs)' title='&#92;log(rs)' class='latex' /> differ by at least <img src='http://s0.wp.com/latex.php?latex=1%2F2pq&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1/2pq' title='1/2pq' class='latex' />. (One could be a bit more careful here: I think Cilleruelo stated a bound of <img src='http://s0.wp.com/latex.php?latex=1%2F%28pqrs%29%5E%7B1%2F2%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1/(pqrs)^{1/2}' title='1/(pqrs)^{1/2}' class='latex' />.) </p>
<p>Note what we now have that&#8217;s better: instead of merely knowing that <img src='http://s0.wp.com/latex.php?latex=%5Clog+p%2B%5Clog+q%5Cne%5Clog+r%2B%5Clog+s&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;log p+&#92;log q&#92;ne&#92;log r+&#92;log s' title='&#92;log p+&#92;log q&#92;ne&#92;log r+&#92;log s' class='latex' /> in nondegenerate cases, we have a lower bound on the difference. This allows us to approximate and discretize.</p>
<p>So let&#8217;s start with all the logs of the primes up to <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m' title='m' class='latex' />, of which there are about <img src='http://s0.wp.com/latex.php?latex=m%2F%5Clog+m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m/&#92;log m' title='m/&#92;log m' class='latex' /> by the prime number theorem. If we multiply those by say <img src='http://s0.wp.com/latex.php?latex=6m%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='6m^2' title='6m^2' class='latex' />, then all the differences between the distinct sums, which were previously at least <img src='http://s0.wp.com/latex.php?latex=1%2F2m%5E2&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='1/2m^2' title='1/2m^2' class='latex' />, are now at least <img src='http://s0.wp.com/latex.php?latex=3&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='3' title='3' class='latex' />. Therefore, if we move each number to the nearest integer, the differences will be altered by at most 2, so will be at least 1. So we have a Sidon set! It is a subset of the integers up to <img src='http://s0.wp.com/latex.php?latex=m%5E2%5Clog+m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m^2&#92;log m' title='m^2&#92;log m' class='latex' /> and it has size around <img src='http://s0.wp.com/latex.php?latex=m%2F%5Clog+m&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='m/&#92;log m' title='m/&#92;log m' class='latex' />, so this construction gives a Sidon subset of <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C2%2C%5Cdots%2Cn%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{1,2,&#92;dots,n&#92;}' title='&#92;{1,2,&#92;dots,n&#92;}' class='latex' /> of size around <img src='http://s0.wp.com/latex.php?latex=n%5E%7B1%2F2%7D%2F%28%5Clog+n%29%5E%7B3%2F2%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n^{1/2}/(&#92;log n)^{3/2}' title='n^{1/2}/(&#92;log n)^{3/2}' class='latex' />.</p>
<p>The moral of this is that to prove a structural result about Sidon sets of density close to <img src='http://s0.wp.com/latex.php?latex=n%5E%7B1%2F2%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n^{1/2}' title='n^{1/2}' class='latex' />, then one would probably have to insist on close meaning &#8220;within a constant of&#8221; (which one would then relax to some slow-growing function later). Or alternatively, one would look for a much more general notion of structure, but I don&#8217;t see an obvious generalization of the two examples I now know. Another moral is that it&#8217;s worth looking for more examples of dense Sidon sets before even trying to make a conjecture. </p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4398/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4398/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4398&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2012/07/13/what-are-dense-sidon-subsets-of-12-n-like/feed/</wfw:commentRss>
		<slash:comments>25</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>A trip to Watford Grammar School for Boys</title>
		<link>http://gowers.wordpress.com/2012/07/07/a-trip-to-watford-grammar-school-for-boys/</link>
		<comments>http://gowers.wordpress.com/2012/07/07/a-trip-to-watford-grammar-school-for-boys/#comments</comments>
		<pubDate>Fri, 06 Jul 2012 23:09:13 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[Mathematical pedagogy]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4383</guid>
		<description><![CDATA[As I said would happen in my post about a possible approach to teaching maths to non-mathematicians aged 16-18, I went last Wednesday to Watford Grammar School for Boys to try the approach out. The headmaster there, Martin Post, was remarkably helpful and assembled a usefully varied group of pupils, some from his school, some [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4383&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>As I said would happen in <a href="http://gowers.wordpress.com/2012/06/08/how-should-mathematics-be-taught-to-non-mathematicians/">my post about a possible approach to teaching maths to non-mathematicians</a> aged 16-18, I went last Wednesday to <a href="http://en.wikipedia.org/wiki/Watford_Grammar_School_for_Boys">Watford Grammar School for Boys</a> to try the approach out. The headmaster there, Martin Post, was remarkably helpful and assembled a usefully varied group of pupils, some from his school, some from <a href="http://en.wikipedia.org/wiki/Watford_Grammar_School_for_Girls">the equivalent school for girls</a>, and some from a nearby mixed comprehensive school (I wasn&#8217;t told which one) whose pupils receive some of their teaching in scientific subjects from Watford Grammar School. What&#8217;s more, some of the people there were doing maths and further maths, some were doing just maths, and some were not doing either. The one thing that was not representative about the group was that they were much brighter than average: for example, the non-mathematicians there had been chosen by their teachers as clever people who could have done maths but decided that they were more interested in other things. For most of the rest of this post, I&#8217;ll say what questions I discussed and how the discussions went. All but two of them were taken from the list in the earlier post.<br />
<span id="more-4383"></span></p>
<p>I began with a warm-up: I asked how many times a ball was kicked during the recent European Football Championship. Of course, I made it clear that I wasn&#8217;t after an exact answer. My first surprise of the day was that pretty well the moment I had asked the question the room went from almost silent to very far from silent: after a while I got used to this. The conversation was, it seemed, entirely about the question at hand. One person asked whether I was interested in the number of passes or the number of actual kicks. I said the latter.</p>
<p>After a while, and when I had managed to quieten everyone down, I drew a logarithmic scale on the whiteboard, going up the powers of 10 from 10 to 1,000,000. I then asked people for their answers and marked them roughly on the scale. There was a significant cluster around 100,000 (by which I mean in an interval from about 70,000 to about 120,000). I then asked people how they had arrived at their answers.</p>
<p>Everybody seemed to know that there had been 31 matches. (Proof: there were four groups of four, so 24 matches at the group stage, and then eight teams left in a knockout competition.) So I wrote that down on the whiteboard. From that point there were two approaches. Unfortunately, I can remember only one of them, which is the one I would have used: to estimate the average amount of time between kicks. The person who proposed this approach thought that was 1-2 seconds, so I asked what we should do with that information. That quickly led to the calculation <img src='http://s0.wp.com/latex.php?latex=30%5Ctimes+90%5Ctimes+60%2F1.5&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='30&#92;times 90&#92;times 60/1.5' title='30&#92;times 90&#92;times 60/1.5' class='latex' />, which is about <img src='http://s0.wp.com/latex.php?latex=20%5Ctimes+100%5Ctimes+50%3D100%2C000&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='20&#92;times 100&#92;times 50=100,000' title='20&#92;times 100&#92;times 50=100,000' class='latex' />. I don&#8217;t know whether they were used to doing things like saying &#8220;90 is basically 100&#8243; but they quickly got into the spirit of it.</p>
<p>Next up, I said that sometimes one reads of prophetic dreams, and that in order to get a feel for how seriously to take them I wanted to know how significant it would be if somebody dreamt of the death of someone close to them and that person died the next day. So I asked a rather vague question: what is the probability of that happening? I followed that up by asking what information it would be useful to know.</p>
<p>One suggestion that came in quite quickly was the probability that someone dies on any particular day. That led to a discussion of whether we were talking about expected deaths (e.g. if someone is very ill or very old) or unexpected deaths. There was general agreement that if someone is expected to die soon, then an apparently prophetic dream would not be too surprising, since both the dream and the death are more likely. So we decided to restrict attention to unexpected deaths. In the end we calculated the probability by simply estimating the number of days in 70 years &#8212; we went for 25,000 &#8212; and taking the reciprocal.</p>
<p>What else did we need to know? At this point an interesting discussion arose. One person suggested that a relevant parameter was the number of people that a typical person knows. That led to a subdiscussion about how close an acquaintance needed to be to be counted. I suggested that pretty well any identifiable acquaintance would be surprising enough to be worth counting. One other clarification asked for was whether we were talking about reports of prophetic death dreams or <em>true</em> reports of prophetic death dreams. I said the latter. </p>
<p>I haven&#8217;t yet said what the interesting discussion was. It was that some people thought it was relevant how many people people typically know, while other people thought it was not relevant at all. The arguments were roughly as follows. Those who thought it was relevant said that if somebody died, then you needed to know how many people they knew if you wanted to assess how likely it was that one of those people had dreamt of their death the night before. Those who thought it was not relevant argued that if someone dreams of a death, then all you care about is the chances that the person dreamt about will die the next day.</p>
<p>Did we have a paradox? Somebody gave a reasonable explanation of why we didn&#8217;t, which I made a bit more precise by saying that we were conditioning on two different events (the occurrence of a dream and the occurrence of a death), so it wasn&#8217;t too surprising that we had to take into account different things.</p>
<p>But the problem still wasn&#8217;t very clear, which itself became clear when people wondered whether we needed to know how many people there were in the whole world. In the end I stated the problem more precisely: roughly how many times per year would we expect somebody in the UK to have a death dream that comes true unexpectedly the next day? We settled on the following method: estimate how many times a death dream occurs per person per year, multiply by the population of the UK, and divide that by 25,000. I asked people how often they had death dreams per year and there were very different answers: some people said they thought 10 was about average and seemed surprised when I said that I thought that for me it was more like one every two years. We settled on 3, which I still found a bit high, but not outrageously so.</p>
<p>So that led to the calculation <img src='http://s0.wp.com/latex.php?latex=3%5Ctimes+50%2C000%2C000%2F25%2C000&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='3&#92;times 50,000,000/25,000' title='3&#92;times 50,000,000/25,000' class='latex' />. I think we had already multiplied 3 by 50,000,000 to get the number of death dreams per year, so what I actually wrote down was 150,000,000/25,000. So I naughtily changed that to 250,000,000/25,000 which gave me 1,000 and then compensated for the naughtiness by dividing that by 2 to get 500. That confused one or two people, so I explained it again more slowly.</p>
<p>While I can remember which questions I discussed during the session, I can no longer remember the order in which I covered the next three. However, I think it was either at this point or after the next question I discussed that there was a short break &#8212; not that it is too important. Anyhow, at some point round now I described <a href="http://terrytao.wordpress.com/2008/12/09/an-airport-inspired-puzzle/">Terence Tao&#8217;s airport-inspired puzzle</a>, which is the following question. You want to get from one end of an airport to the other and your shoelace is undone. If you want to get to your destination as quickly as you can, is it better to tie your shoelace when you are on a moving walkway or when you are on stationary ground?</p>
<p>The moment I had finished asking the question, the noise switch was in its &#8220;on&#8221; position again, which was fine. After I had managed to quieten the room down again (which was hardish but not impossible), I took a vote. The options I gave them were &#8220;faster on walkway&#8221;, &#8220;faster off walkway&#8221;, and &#8220;it makes no difference&#8221;. &#8220;Faster on walkway&#8221; got a few votes, &#8220;faster off walkway&#8221; got one vote, and &#8220;it makes no difference&#8221; came top, though not by miles. There were also a number of abstentions. </p>
<p>The next thing I did was invite people to argue for the answers they had voted for. The one person who had voted for &#8220;faster off walkway&#8221; did not look at all keen to do this, but after a little while someone else said, &#8220;You would need to know the speeds of the walkways, your walking speed, and the time it takes to tie your shoelace,&#8221; which prompted me to add a &#8220;not enough information&#8221; option, which was popular, and to which the &#8220;faster off walkway&#8221; person transferred her vote. </p>
<p>Here are some arguments I collected. In favour of the &#8220;on walkway&#8221; answer was that when you tie your shoelace you are moving along rather than wasting time being still. In favour of the &#8220;makes no difference&#8221; option was that wherever you tie your shoelace it takes the same amount of time. And when I tried to elicit an argument for the &#8220;not enough information&#8221; option, I failed: I asked under what circumstances it would be better to stop on the walkway and under what circumstances it would be better stop off the walkway, but nobody had much to suggest. I also explained that while knowing all the various speeds and times is <em>sufficient</em> to work out the answer, it isn&#8217;t obviously necessary: for example, if you are deciding whether it is better to use the walkway or not use it, you don&#8217;t need to know how fast it is.</p>
<p>A notable aspect of the discussion that permeated the whole thing was that people were keen to introduce non-mathematical considerations. For example, they (rightly) pointed out that it would make a difference if the airport was crowded: if the people just ahead of you on the walkway are stationary and you can&#8217;t get past, then obviously that&#8217;s a good moment to tie your shoelace, and if the people just behind you are in a hurry, then you will annoy them by stopping. So I asked them to assume that the airport was deserted. Other assumptions that had to be spelt out were that your walking speed is unaffected by having a shoelace undone and that there isn&#8217;t a robot sensor on the walkway that detects that you are tying your shoelace and stops it. Later on I found out who the mathematicians were and who the non-mathematicians were. It turned out that many of these objections came from the non-mathematicians. They were very helpful to the discussion, since one of the points I wanted to get across was that mathematical modelling involves choosing appropriate simplifications. One of the non-mathematicians begged me to say what the answer was (well before I wanted to do so) so the question had clearly worked its magic.</p>
<p>If I had been discussing this question as part of a genuine course, I would have waited much longer for someone to come up with a good solution. But as I wanted to get through several questions, I hurried things along a bit. I started by offering the advice to consider extreme cases. To illustrate what I meant by an extreme case, I suggested that maybe the walkways could be very very fast. (Actually, I think that would have helped people guess the answer, but it wasn&#8217;t the extreme case I had in mind.) Pretty soon after that, someone suggested considering the case where someone starts tying their shoelace just after getting on a walkway. Since that was exactly the case I was interested in, I basically gave away the answer at that point by suggesting also considering the case where someone starts tying their shoelace just <em>before</em> getting on the walkway. Most people could see that that was a proof that doing it on the walkway was better. One or two had a bit of trouble still, so I drew some pictures of two people and what would happen if one stopped just before and one just after getting on. </p>
<p>If I&#8217;d had more time I would have tried to get everyone to understand why the &#8220;you spend the same amount of time wherever you do it&#8221; argument gives the wrong answer. </p>
<p>Another thing I did at around this stage was what I call the dividing-up-the-pot game. I first asked question 41 from my earlier post, which is this. You and a friend are out for a walk, when you are approached by a stranger, who offers the two of you £1000 on one condition: that you agree how to split it between you. After establishing to your satisfaction that you are not about to be kidnapped, you propose a 50-50 split to your friend. To your astonishment, your friend insists on receiving £900 with only £100 going to you, and appears to be prepared to lose all the money rather than accept anything less than this deal. What should you do?</p>
<p>The main person to make a suggestion suggested that you should call your friend&#8217;s bluff, since when the stranger starts to walk away, your friend would be bound to say, &#8220;OK OK stop! I&#8217;ll settle for £500!&#8221; I wasn&#8217;t sure what more to say at this point, so I told them that they would now have a chance to see for themselves. We would have five rounds, in each of which people would be randomly paired and would have the chance to share ten points, on condition that they could agree how to do so. At the end of the five rounds we would see who had the most points. To do the random pairings I had brought along (part of) a pack of playing cards, which I shuffled and dealt out once per round, and each person was paired with the person who had a card of the same colour and same denomination. </p>
<p>It took a while to get through the rounds, but once it was finished, I asked whether anyone had 30 or more points. Several did. 35 or more? Nobody. 34? Yes, one person.</p>
<p>I then asked people what their strategies had been. To my surprise, several people had used randomized strategies, the main one being to decide by the toss of a coin (or in one case stone, scissors and paper) who would get all the points. I asked one person why he had adopted that strategy. His response was that the main aim of the game was to come top, which one couldn&#8217;t do by sharing points out equally and couldn&#8217;t easily do by insisting on lots of points each round. So he judged that the all-or-nothing strategy gave him his best chance of coming top. He turned out to be one of the people doing maths and further maths. I think his strategy got him 30 points. The person who got 34 points got them by, as he put it, &#8220;threatening his opponents.&#8221;</p>
<p>I then started a discussion of what possible other strategies there might be. I had planned to think about a population of strategies and how they would perform against each other, making a link with things like the evolution of morality, but after a few seconds I aborted it because I thought it would be too long and complicated. But this was something else that I would definitely have tried to do if I had devoted more time to this one question. </p>
<p>Similarly, I had a brief discussion of what real-life situations might be reasonably well modelled by this game. There were some suggestions that I didn&#8217;t like and now can&#8217;t remember. (The problem with one of them was that there was no analogue of getting zero if you can&#8217;t agree.) There was one about haggling, which seemed fine. I gave the example of the Conservatives and Liberal Democrats drawing up their coalition agreement two years ago, and said that any negotiation where there is a range of outcomes that would benefit both parties could be thought of as an instance of this game. If I had had more time I would have held back for much longer before giving these examples.</p>
<p>Next (or possibly previously), I asked question 37(i), which is the following. In several parts of the UK the police gathered statistics on where road accidents took place, identified accident blackspots, put speed cameras there, and gathered more statistics. There was a definite tendency for the number of accidents at these blackspots to go down after the speed cameras had been installed. Does this show conclusively that speed cameras improve road safety?</p>
<p>The same person who argued for the randomized strategy in the negotiation game basically knew the answer to this question already. He said no, since if you pick out the extreme cases then you would expect them to be less extreme if you run the experiment again. I decided to move on quickly from this question since there wasn&#8217;t a lot more to say. But I told people about a plan I had had, which was to do a bogus telepathy experiment. I would get them to guess the outcomes of 20 coin tosses, which I would attempt to beam to them telepathically. I would then pick the three best performers and the three worst, and would toss the coins again, this time asking the best ones to help me beam the answers to the worst ones. People could see easily that the performances would be expected to improve and that it would have nothing to do with telepathy. One person said, &#8220;Oh I want to do that,&#8221; of the experiment.</p>
<p>Penultimately, I discussed the following table, slightly modified from an example of Joseph Malkevitch. There are five options, and the orders of preference amongst 55 people are as follows.</p>
<p>14532 &#8212; 18 people</p>
<p>25431 &#8212; 12 people</p>
<p>32541 &#8212; 10 people</p>
<p>43251 &#8212; 9 people</p>
<p>52431 &#8212; 4 people</p>
<p>53421 &#8212; 2 people</p>
<p>Which option should the group go for? (I gave examples of different film genres for the options in a key below, but we didn&#8217;t use those.) </p>
<p>I asked for relevant observations. The group seemed pretty clued up on voting methods, so I was soon writing up that option 1 would win under first past the post, and someone calculated that option 3 (if I remember correctly) would win under the alternative vote. I then suggested doing pairwise comparisons, and was very gratified when everyone thought that transitivity was obvious (though one person did say that we should consider not just the preferences but the strengths of the preferences), so I could surprise them by showing that it failed in this case. I then gave the simple 123/231/312 example. Some people were keen on the system where you award 5 points for a first preference, 4 for a second and so on. We could probably have discussed this for quite a bit longer. I briefly mentioned Arrow&#8217;s theorem, without stating it precisely, and moved on to my final question.</p>
<p>I told them that for the last couple of years I have been reading Proust. (I asked how many of them had heard of Proust. One had.) More precisely, I have been reading Remembrance of Things Past, Scott Moncrieff&#8217;s translation of Proust. The edition I have is in twelve volumes, and my practice has been to read one volume of Proust for every two or three other books I read (my reading taking place before I go to sleep at night). I&#8217;ve now read eight of the volumes, and on page 25 of the eighth, I came up against this sentence. (I didn&#8217;t tell them that the eighth volume was the second half of &#8220;Sodom and Gomorrah&#8221;, which Scott Moncrieff coyly translates as &#8220;Cities of the Plain&#8221;.)</p>
<blockquote><p>But if the course of life, by making Cottard assume, if not at the Verdurins&#8217;, where he had, because of the influence that past associations exert over us when we find ourselves in familiar surroundings, remained more or less the same, at least in his practice, in his hospital ward, at the Academy of Medicine, a shell of coldness, disdain, gravity, that became more accentuated while he rewarded his appreciative students with puns, had made a clean cut between the old Cottard and the new, the same defects had on the contrary become exaggerated in Saniette, the more he sought to correct them.</p></blockquote>
<p>I displayed it for them and asked how we might go about checking that it was syntactically correct, and, even better, actually understanding it. It was projected on to a whiteboard, so I was in a position to annotate it. Fairly quickly the suggestion came in to use brackets. Somebody also said that we should work from the inside outwards. So I asked where people would like the brackets to go. The first suggestion was to have a bracket opening before &#8220;because of the influence&#8221; and ending after &#8220;Academy of Medicine&#8221;. That looked right, but turned out to be wrong: the opening bracket needed to be before &#8220;if not at the Verdurins&#8217;&#8221;. However, eventually we sorted it all out, and I got them, with a bit of prodding, to say that the general principle we were using was to put brackets round bits of text if the sentence still makes sense with those bits removed. (It was because it initially looked as though one could jump from &#8220;where he had&#8221; to &#8220;a shell of coldness&#8221; that we made the initial mistake.) </p>
<p>If it had not been inconvenient to do so, I would have rewritten the sentence with indentations, as follows.</p>
<p>But if the course of life,<br />
&#8230;.by making Cottard assume,<br />
&#8230;&#8230;..if not at the Verdurins&#8217;,<br />
&#8230;&#8230;&#8230;&#8230;where he had,<br />
&#8230;&#8230;&#8230;&#8230;&#8230;.because of the influence that past associations exert over<br />
&#8230;&#8230;&#8230;&#8230;&#8230;.us when we find ourselves in familiar surroundings,<br />
&#8230;&#8230;&#8230;&#8230;remained more or less the same,<br />
&#8230;&#8230;..at least in his practice, in his hospital ward, at the Academy of<br />
&#8230;&#8230;..Medicine,<br />
&#8230;.a shell of coldness, disdain, gravity,<br />
&#8230;&#8230;..that became more accentuated while he rewarded his<br />
&#8230;&#8230;..appreciative students with puns,<br />
had made a clean cut between the old Cottard and the new,<br />
the same defects had on the contrary become exaggerated in Saniette,<br />
the more he sought to correct them.</p>
<p>That would have shown more clearly just how many levels of subordinate clauses there were in the sentence. I would also have talked about trees and perhaps broadened the discussion into one about parsing trees for other sentences (which would have required them to decide and try to justify what the natural tree structure of a sentence is).</p>
<p>I didn&#8217;t do this at the time, but it is quite interesting to try to rewrite the sentence to make it more comprehensible. Here is what happens if instead of chopping clauses in two, you try to put subordinate clauses at the end. For example, instead of, &#8220;He decided, because he was in a good mood, to smile at people he would usually have ignored,&#8221; you write, &#8220;He decided to smile at people he would usually have ignored, because he was in a good mood.&#8221; The sentence above can be reordered as follows.</p>
<blockquote><p>But if the course of life had made a clean cut between the old Cottard and the new, by making him assume a shell of coldness, disdain, gravity that became more accentuated while he rewarded his appreciative students with puns, if not at the Verdurins&#8217;, where he had remained more or less the same because of the influence that past associations exert over us when we find ourselves in familiar surroundings, at least in his practice, in his hospital ward, at the Academy of Medicine, the same defects had on the contrary become exaggerated in Saniette, the more he sought to correct them.</p></blockquote>
<p>It still isn&#8217;t easy, but it&#8217;s easier. Does it lose something important? I think it loses some of its distinctively Proustian character &#8212; decoding this kind of sentence is part of the peculiar pleasure of reading Proust &#8212; but why it should be pleasurable rather than just a nuisance is hard to say.</p>
<p>I might add that the sentence is fairly typical of the whole book, though not many sentences go quite that far. And there are similar phenomena at other scales: paragraphs sometimes go on for several pages, and chapters can go on for hundreds of pages. So not only does one have to read twelve volumes of 350-400 pages each, but the reading is slow going. Out of curiosity, I looked up the sentence in the original French. Interestingly, it uses one pair of brackets, which makes it a lot easier to understand. (What I mean by &#8220;interestingly&#8221; is that I wonder why Scott Moncrieff decided to do without the brackets.) Here it is. I got it from <a href="http://www.gutenberg.org/files/15075/15075-h/15075-h.htm">an online version</a>.</p>
<blockquote><p>Mais si la vie, en faisant revêtir à Cottard (sinon chez les Verdurin, où il était, par la suggestion que les minutes anciennes exercent sur nous quand nous nous retrouvons dans un milieu accoutumé, resté quelque peu le même, du moins dans sa clientèle, dans son service d&#8217;hôpital, à l&#8217;Académie de Médecine) des dehors de froideur, de dédain, de gravité qui s&#8217;accentuaient pendant qu&#8217;il débitait devant ses élèves complaisants ses calembours, avait creusé une véritable coupure entre le Cottard actuel et l&#8217;ancien, les mêmes défauts s&#8217;étaient au contraire exagérés chez Saniette, au fur et à mesure qu&#8217;il cherchait à s&#8217;en corriger.</p></blockquote>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4383/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4383/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4383&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2012/07/07/a-trip-to-watford-grammar-school-for-boys/feed/</wfw:commentRss>
		<slash:comments>46</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>A new open-access venture from Cambridge University Press</title>
		<link>http://gowers.wordpress.com/2012/07/02/a-new-open-access-venture-from-cambridge-university-press/</link>
		<comments>http://gowers.wordpress.com/2012/07/02/a-new-open-access-venture-from-cambridge-university-press/#comments</comments>
		<pubDate>Mon, 02 Jul 2012 10:20:41 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[Elsevier]]></category>
		<category><![CDATA[Mathematics on the internet]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=4356</guid>
		<description><![CDATA[The formal launch has just taken place at the European Congress of Mathematicians in Krakow of the Forum of Mathematics, which to a first approximation is a new open-access electronic journal. However, the singular &#8220;journal&#8221; is misleading, because in some ways it is more like a whole set of journals. But there will be considerable [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4356&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>The formal launch has just taken place at the European Congress of Mathematicians in Krakow of the <a href="http://journals.cambridge.org/forumofmathematics">Forum of Mathematics</a>, which to a first approximation is a new open-access electronic journal. However, the singular &#8220;journal&#8221; is misleading, because in some ways it is more like a whole set of journals. But there will be considerable interdependence between the elements of the set, so &#8220;journals&#8221; is misleading too. We need an intermediate number between singular and plural. Also, although the journal(s) is/are primarily electronic, there will be a print-on-demand option if anyone wants it.</p>
<p><strong>What is the Forum of Mathematics?</strong></p>
<p>Terminological questions aside, how will this new journal-like object work? I think the easiest way of explaining it is to describe the process for submitting an article, which is similar to the process for submitting an article to a conventional maths journal, but with one or two unusual aspects.<br />
<span id="more-4356"></span></p>
<p>First of all, as with any journal you&#8217;ll need to decide whether your article is likely to be of the required standard. Except that here there are <em>two</em> standards to worry about, which brings me to the main unusual feature: the Forum of Mathematics, as the journal(s) is called, has two &#8220;levels&#8221;, called Pi and Sigma. A paper suitable for Sigma should be of the kind of standard you would expect in a leading journal in the area of that paper. For example, if your paper is in combinatorics, then it will be suitable for Forum of Mathematics: Sigma if it is of roughly the standard expected by Combinatorica or JCT A/B. As for Pi, that is for papers that are sufficiently interesting or important that their appeal goes beyond their immediate area of mathematics. Thus, Pi papers will be at the level of leading general mathematics journals and will be an open-access alternative to them. Discussion is still going on about what precisely this means, but it looks as though the aim will probably be for Pi to be a serious competitor for Annals, Inventiones, the Journal of the AMS and the like. Of course, we can only really know what the level and characters of the two journals will be when they have been going for a while. The CUP website says this: </p>
<blockquote><p>Pi is the open access alternative to the leading generalist mathematics journals. Papers published will be of a high quality and of real interest to a broad cross-section of all mathematicians. </p></blockquote>
<p>It will also be possible for papers submitted to Pi to be reconsidered for Sigma, if the author is willing for this to happen. However, to encourage authors to think hard before submitting to Pi, rather than merely trying their luck with it, there will be a requirement that papers submitted to Pi are accompanied by a justification of a side or two, which should explain why the paper is of more than merely specialist interest. I hope that these justifications will be very useful documents for the editors, and perhaps even for readers of the papers at a later stage, but that kind of detail has not yet been decided.</p>
<p>The editorial board of the Forum of Mathematics will be divided into &#8220;clusters&#8221; of people, with each cluster representing a different area of mathematics. When you submit a paper, you will decide which is the appropriate cluster to submit it to, and the editors in that cluster will handle the paper. However, not all papers can be easily classified, and that is where the not-quite-singular-but-not-quite-plural aspect of the Forum of Mathematics becomes apparent. It&#8217;s not just papers that are hard to classify, but also editors, and some editors will belong to more than one cluster. Also, there will be plenty of communication between editors of different clusters in an effort to achieve a reasonably uniform standard across the whole of the Forum of Mathematics. </p>
<p>Otherwise, the journal will be pretty conventional. Once you&#8217;ve submitted a paper, it will be processed in basically the same way as papers are processed for any other journal. A small but important difference is that the Forum will not have &#8220;issues&#8221;. As with many other electronic journals, once a paper is accepted, it goes straight up on the website. It will have a number for reference purposes, and if it is a Sigma paper then it will in some sense &#8220;belong&#8221; to the relevant cluster, but it won&#8217;t belong to an issue (though there will be &#8220;volumes&#8221; for those who want print copies). One of the advantages of this is that the Forum of Mathematics will be able to aim for an absolute standard rather than taking space into account when deciding whether or not to accept papers. </p>
<p>Speaking as one of the editors, I&#8217;d like to say that this is something I feel very strongly about. It is very unclear at this stage how popular the Forum of Mathematics will be. I would hate to be under pressure to lower standards in order to attract more papers, or to turn away good papers because there wasn&#8217;t room for them. An electronic journal makes it easy to avoid that kind of pressure, and for the Forum of Mathematics it will be avoided. </p>
<p><strong>How will the journal be paid for?</strong></p>
<p>So far, what I&#8217;ve been discussing is only rather minor modifications of the normal practice of journals. But I said at the beginning of this post that the Forum of Mathematics will be open access. What does that mean, and how will the finances work? Before I answer this, let me briefly introduce some terminology.</p>
<p>It is unfortunate that the phrase &#8220;open access&#8221; has come to mean different things to different people. The result is that if you say that you are in favour of open access, you have to clarify what you mean, since there may be business models that you are not in favour of but that are nevertheless called open access by their proponents. However, there are two types of open access, known as gold and green, that have well-established meanings. <em>Gold</em> open access is where a paper is published in a journal and made freely available online, and the publisher is paid by the author. So a more descriptive name would be author-pays open access. <em>Green</em> open access is where papers are published in the usual way, but also made available on repositories such as the arXiv, so that even if the formatted journal versions are behind a paywall, freely downloadable versions of the papers will still appear when you Google them.</p>
<p>The advocates of green open access argue that if everyone makes a habit of posting their papers on places like the arXiv, then in time the problems we have with expensive journals will melt away: there will simply no longer be any point in subscribing to them at ridiculous prices. Opponents of green open access, and especially of mandates from funding bodies that people should make their papers available online, use exactly the same argument but with one extra line: if everybody did it, then there would be no point in subscribing to a journal, and therefore the journals as we know them would no longer be financially viable. </p>
<p>I will resist the temptation to discuss these arguments in more detail, because gold open access is more relevant to the Forum of Mathematics (though papers submitted to the Forum of Mathematics can also be posted on the arXiv, so it is completely sympathetic to green open access). Here there is bad news and good news. The bad news is <em>that it will cost money to have a paper published in the Forum of Mathematics</em>. In other words, it is following a gold open-access model. The good news is hidden in the word &#8220;will&#8221;. For the first three years of the journal, <em>Cambridge University Press will waive the publication charges</em>. So for three years the journal will be what Marie Farge (who has worked very hard for a more rational publication system) likes to call <em>diamond</em> open access, a quasi-miraculous model where neither author nor reader pays anything. </p>
<p>A second piece of good news will seem like good news only if you know what typical author publication charges, or APCs as they are commonly called, are. (Actually, APC officially stands for &#8220;article processing charge&#8221;.) Here are a couple of examples. One of the most famous open-access journals is the Public Library of Science, or PLoS. PLoS has several high-quality journals and a big all-encompassing journal called PLoS-ONE that has a high acceptance rate and is basically somewhere where you put a paper if all you want to signal is &#8220;this was a publishable paper&#8221;. The APCs are between $2000 and $3000 for the high-quality journals and are $1350 for PLoS-ONE. The details are on <a href="http://www.plos.org/publish/pricing-policy/publication-fees/">this page from their webiste</a>. Another example is the London Mathematical Society&#8217;s open-access option. If you want your LMS-journal paper to be freely available after it is published, then you can have that for a fee of $3050 (or £1925) from outside the UK or £2310 from within the UK. The UK figure includes VAT of 20%. </p>
<p>The proposed charge for the Forum of Mathematics after the initial three years is £500 or $750. These figures do not include VAT, and are in absolute terms quite a lot of money, but they are nevertheless <em>much</em> cheaper than what appears to be the industry standard at the moment. They will also be waived for people from developing countries (CUP has a list of the countries for which fees will be waived) and for people who can demonstrate a genuine inability to pay. In addition, CUP promises to be <em>transparent about its costs</em>, so one will be able to understand the justification for the fees. I do not know the exact form that this transparency will take: what I would like to see is a web page somewhere that discusses in detail the cost of processing a typical paper, but maybe they won&#8217;t go that far.</p>
<p>A further point is that CUP would like to keep the fees as low as possible, even after the three years are over, and will seek funding to do so. The dream scenario would be a rich donor agreeing to underwrite APCs for several years after the initial three-year period. But there are also smaller things that can be done. For example, some people are at institutions that routinely agree to cover author charges. Such people will be encouraged to pay the author charges if they get a paper into the Forum of Mathematics, since it will not be a problem for them to pay, and the money received will be used to mitigate in one way or another the introduction of APCs in three years&#8217; time. I am being slightly vague here because there are different forms that this mitigation might take: for example, there might be a choice between waiving APCs completely for a further period and half waiving them for longer. </p>
<p><strong>Some pros and cons of gold open access.</strong></p>
<p>I have talked to a number of mathematicians about author publication charges, and it is clear that the idea is regarded with deep suspicion by many people. It is also clear that many of the arguments that people have against it are good ones. Before I go into some of those arguments, I would urge you to bear in mind that there are also some very strong arguments against the current subscription system. It seems to me that every system of publication has its drawbacks, so pointing out those drawbacks is not enough: one must argue that they outweigh the drawbacks of the existing system. </p>
<p>The main drawback of gold open access is that it makes life more difficult for people from less wealthy institutions. More generally, it introduces financial considerations into certain decisions that one would prefer to be made entirely on academic grounds. An example of the kind of consequence this might have if gold open access became the norm is a less well-off UK university telling its academics that it would pay for just the papers needed for the next Research Excellence Framework. In fact, I have even heard a rumour of this kind, and apparently the decision about which papers would be supported was to be taken by the university administration and not by the academics. I very much hope that won&#8217;t be typical.</p>
<p>One of the best ways to avoid unpleasant consequences of that kind is to make the charges small enough that they are relatively painless for universities and funding bodies to pay. Maybe the charges for the Forum of Mathematics could be lower still &#8212; I don&#8217;t know &#8212; but the fact that they are much lower than the ones that are prevalent at the moment is at the very least a significant piece of progress. (I would add here that <a href="http://math.berkeley.edu/~kirby/">Rob Kirby</a>, who will be the managing editor and who has been campaigning against expensive journals for many years, regards the £500 price as an absolute bargain.)</p>
<p>An important point to make is that while libraries continue with their subscriptions to existing journals, APCs are simply an additional cost to universities and funding bodies. However, if we were to switch tomorrow to an author-pays model (that is, stop all subscriptions and do all publishing through gold open access), then the total cost of the publication system would be much less than it is now. Or at least so I have read many times, and I find it plausible. For instance, I have been told that University College London pays over a million pounds a year to Elsevier for its subscriptions. Even if APCs were £2000, a million pounds would be enough for 500 papers. Elsevier has 2000 or so journals, so I would guess that the total number of articles is something like 30,000. If 1% of these articles come from University College London (surely an overestimate &#8212; many of those 2000 journals are not all that high quality and there are many good universities round the world), then that&#8217;s 300 papers, so already a significant saving. Improvements to this back-of-envelope calculation are welcome. (Of  course, APCs of £500 alter the calculation dramatically.) Thus, one needs to take a long-term view when thinking about APCs. I see the pain of paying APCs as to some extent a good thing: if academics themselves feel that pain (not quite directly but through their departments or other funders) then they will be less likely to accept outrageous prices than they are now, when the pain of journal prices is felt by librarians.</p>
<p>Since switching to an author-pays model would save universities a lot of money, one option that looks very attractive on the face of it would be for a large number of universities to club together and agree to fund APCs for journals that were sufficiently cheap and of sufficiently high quality. If that could be organized, then the consortium of universities would have much more bargaining power than libraries do at present: a journal would <em>much</em> rather be able to say, &#8220;Don&#8217;t worry about APCs &#8212; they are paid for by consortium X,&#8221; than have explicit APCs. (Why? Well, which kind of journal would you prefer to submit to?) So with luck there would be significant downward pressure on prices. The difficulty, of course, is of a prisoner&#8217;s-dilemma type: while it may be in the collective interest of universities to set up a fund to cover APCs, any individual university would do better by not contributing to the fund. However, if enough universities could be persuaded to start such a fund, then moral pressure could be put on other universities to join in &#8212; especially wealthy universities whose members frequently benefit from the fund. I hope that the Forum of Mathematics will quickly become such a valued part of the mathematical landscape that many of us will make efforts to bring a fund like this into existence (perhaps initially just for FoM, but ideally the fund would grow and support other open-access journals). </p>
<p>If you are interested in arguments for and against various kinds of open access, I would recommend <a href="http://www.earlham.edu/~peters/fos/oawritings.htm">the writings of Peter Suber</a>, who also has <a href="http://mitpress.mit.edu/catalog/item/default.asp?ttype=2&amp;tid=12862">a book coming out soon</a>.</p>
<p><strong>My plans as an editor.</strong></p>
<p>Here I am speaking for myself only. The idea of taking on a significant editorial job was not one that would ordinarily have appealed to me, but when David Tranah (whose brainchild all this was) summoned me for a chat in early February, shortly after I had published my first Elsevier post, I realized that it was basically impossible for me to say no. </p>
<p>Having got myself into that position, I now want to think of ways of doing the work efficiently. Here are a few ideas I have had so far.</p>
<p>1. I will follow the practice of many editors these days and ask for quick opinions first, unless I myself already have a quick opinion. I will proceed to a more detailed reference only if the initial opinion suggests that it is worth doing so.</p>
<p>2. One reason I have been rather inefficient at this kind of job in the past is that I always feel guilty asking potential referees for a favour. Maybe I shouldn&#8217;t, but I do. To counteract that, I would like to compile a list of names of people who volunteer <em>in advance</em> to do a certain amount of work for the discrete maths cluster of the journal (which is the cluster I&#8217;ll be part of). What I&#8217;d like is for people to tell me roughly how often they are prepared to handle a paper and roughly what topics they are ready to cover. That way, if I have a paper that matches a referee who has not yet reached his/her quota, I&#8217;ll be able to ask that person without the slightest embarrassment. If you are reading this and feel like showing your support for the journal by dropping me an email and making a commitment of that kind, I will be very grateful. If you don&#8217;t, and if you are a discrete mathematician, you may find that I email <em>you</em> at some point. I won&#8217;t hold anyone to the commitments that they make. The main purpose of this is to make it easier to ask for help, but if you&#8217;ve said you can handle three papers a year and a long and difficult paper comes along at a bad time, I will understand.</p>
<p>3. I plan at some point to write some guidelines for what I personally consider makes a combinatorics paper good enough to publish in Sigma and what gives it that extra Pi-like quality. This probably won&#8217;t be easy, but I hope that authors and referees will find it useful to have some explicit criteria (which they will be free to ignore) rather than relying on some vague instinct that &#8220;this paper is good enough for the Forum of Mathematics: Sigma&#8221;. </p>
<p><strong>How does all this relate to the Elsevier boycott?</strong></p>
<p>Here again I am speaking for myself rather than for CUP, though I hope my opinions will be shared by many others. As it happens, the idea of the Forum of Mathematics predates the Elsevier boycott, so in a sense the answer is that it has nothing to do with the boycott at all. However, I myself see it as potentially a very important development in the campaign for a better system of academic publishing. In particular, it greatly weakens what was previously quite a strong argument for some people against participating in the Elsevier boycott: that the best specialist journal in their area is an Elsevier journal so joining the boycott would harm their career. </p>
<p>Well, maybe that has been true up to now, but it is about to become less true. If you have a paper that is suitable for the best specialist journal in your area and that journal happens to be very expensive, then you now have the option of submitting your paper to Forum of Mathematics: Sigma instead. Similarly, if you were thinking of submitting a top-notch paper to a top-notch but expensive journal (Inventiones comes to mind here &#8212; a Springer journal, but this discussion is not just about Elsevier), then you can get just as much of a career boost from Forum of Mathematics: Pi but also enjoy the warm glow that comes from knowing that your paper is freely available to all mathematicians.</p>
<p>At some point I hope to compile an informal list of expensive journals of roughly the same standard as Forum of Mathematics: Sigma, though for that I&#8217;ll need help, as I have very little idea of the standards of specialist journals outside my areas of mathematics.</p>
<p>It is of course unlikely that the Forum of Mathematics will change the face of mathematical publishing in three years. To be a serious direct threat to Elsevier&#8217;s mathematics journals, for example, it would need to cause a reduction of their quality by enough to make libraries consider cancelling their subscriptions, which is very difficult when mathematics journals are bundled together with journals from other subjects. However, the Forum of Mathematics can still have a big influence. For one thing, it will demonstrate that a major publishing house can produce a high-quality journal with high-quality formatting and editing with APCs of around £500. I hope that people in charge of funding bodies who are considering open-access mandates will ask some tough questions of publishers who continue to charge four times that. Secondly, if the Forum of Mathematics is successful, it has the potential to reduce the quality of a number of other journals, which will at least strengthen the hand of librarians who are bargaining with publishers like Elsevier (because cancelling their subscriptions won&#8217;t cause quite the inconvenience that it would at the moment). Thirdly, the Forum of Mathematics may encourage other publishers to set up cheaper open-access journals &#8212; one huge gap in the market would still be a place where people could submit papers that were perfectly worthy but not good enough for the Forum of Mathematics. I myself think that for such papers, there is a strong case for not putting in too much effort into formatting and typesetting, since the papers will usually not be read all that much, and since many people can produce a decent typescript themselves. So it ought to be possible to produce such a journal (or journal-type object, if it is like the Forum of Mathematics) with smaller APCs. </p>
<p>And even if we forget all about price, we still have the huge bonus that the papers published in the Forum of Mathematics will be freely accessible. I hope that once people start to get used to a high-end journal being freely accessible, they will feel all the more keenly the inconvenience of paywalls.</p>
<p>So I urge you to support the Forum of Mathematics. If you&#8217;ve got a good paper, then why not hold on to it until 1st October and submit it to us? If the Forum of Mathematics takes off, then you will be able to look back with pride and say that your paper was one of the very first &#8212; who knows, perhaps even <em>the</em> first &#8212; to appear in it. If your department has not yet thought about author charges, then another thing you can do is initiate a discussion about those &#8212; something I plan to do in Cambridge &#8212; since, as I said above, any author fees that are paid voluntarily during the first three years will help keep the fees lower later on and increase the chances that the venture will be a success, which in turn will raise the prospect of saving a lot of money in the longer term.</p>
<p>As regular readers of this blog will know, I am keen on the idea of much more radical changes to the way we evaluate and disseminate our work. However, I am also in favour of evolutionary change rather than a sudden collapse of the existing system, and that is what the Forum of Mathematics offers. It is not a solution to all our problems, but neither is anything else. What it is is another way of doing things, and the more of those we have, the greater the chance that some of them will work and become successful and help us move to a cheaper and more open system of publishing. </p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/4356/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/4356/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&#038;blog=1659011&#038;post=4356&#038;subd=gowers&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2012/07/02/a-new-open-access-venture-from-cambridge-university-press/feed/</wfw:commentRss>
		<slash:comments>215</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
	</channel>
</rss>
