<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Gowers's Weblog</title>
	<atom:link href="http://gowers.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://gowers.wordpress.com</link>
	<description>Mathematics related discussions</description>
	<lastBuildDate>Sat, 14 Nov 2009 18:50:49 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<cloud domain='gowers.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://www.gravatar.com/blavatar/c78463f86c20a18efc9c799202c52977?s=96&#038;d=http://s.wordpress.com/i/buttonw-com.png</url>
		<title>Gowers's Weblog</title>
		<link>http://gowers.wordpress.com</link>
	</image>
			<item>
		<title>The first unknown case of polynomial DHJ</title>
		<link>http://gowers.wordpress.com/2009/11/14/the-first-unknown-case-of-polynomial-dhj/</link>
		<comments>http://gowers.wordpress.com/2009/11/14/the-first-unknown-case-of-polynomial-dhj/#comments</comments>
		<pubDate>Sat, 14 Nov 2009 18:23:06 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[polymath]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=1134</guid>
		<description><![CDATA[In this post I want to discuss a combinatorial problem that is very appealing in its own right, but also important as a potential first step towards solving a central open problem in Ramsey theory. It is the second in a series of three posts (I may add to this number later, but three is [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&blog=1659011&post=1134&subd=gowers&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>In this post I want to discuss a combinatorial problem that is very appealing in its own right, but also important as a potential first step towards solving a central open problem in Ramsey theory. It is the second in a series of three posts (I may add to this number later, but three is what I have written or semi-written so far) that give further details about possible Polymath projects. This one was number 2 in <a href="http://gowers.wordpress.com/2009/09/16/possible-future-polymath-projects/">the post in which I first discussed these projects</a>. In that post, I said nothing beyond the fact that the project had close connections with the <a href="http://michaelnielsen.org/polymath1/index.php?title=Density_Hales-Jewett_theorem">density Hales-Jewett theorem</a>. Unlike the origin-of-life suggestion, this project is a straightforward mathematical one.</p>
<p>Let me very briefly indicate what the central open problem in Ramsey theory is. The density Hales-Jewett theorem, which has been discussed at great length on this blog, is the assertion that every dense subset of <img src='http://l.wordpress.com/latex.php?latex=%5Bk%5D%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='[k]^n' title='[k]^n' class='latex' /> contains a combinatorial line (provided that <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> is large enough in terms of <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> and the density). This implies Szemer&eacute;di&#8217;s theorem. </p>
<p>Now there is an amazing generalization of Szemer&eacute;di&#8217;s theorem, due to Bergelson and Leibman, known as the polynomial Szemer&eacute;di&#8217;s theorem. This is the following assertion. For any <img src='http://l.wordpress.com/latex.php?latex=%5Cdelta%3E0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\delta&gt;0' title='\delta&gt;0' class='latex' /> and any choice of <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> polynomials <img src='http://l.wordpress.com/latex.php?latex=P_1%2C%5Cdots%2CP_k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P_1,\dots,P_k' title='P_1,\dots,P_k' class='latex' /> with integer coefficients and constant terms equal to zero, there exists <img src='http://l.wordpress.com/latex.php?latex=N%3DN%28%5Cdelta%2CP_1%2C%5Cdots%2CP_k%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N=N(\delta,P_1,\dots,P_k)' title='N=N(\delta,P_1,\dots,P_k)' class='latex' /> such that every subset <img src='http://l.wordpress.com/latex.php?latex=A%5Csubset%5C%7B1%2C2%2C%5Cdots%2CN%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A\subset\{1,2,\dots,N\}' title='A\subset\{1,2,\dots,N\}' class='latex' /> of size at least <img src='http://l.wordpress.com/latex.php?latex=%5Cdelta+N&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\delta N' title='\delta N' class='latex' /> contains a subset of the form <img src='http://l.wordpress.com/latex.php?latex=%5C%7Ba%2BP_1%28d%29%2Ca%2BP_2%28d%29%2C%5Cdots%2Ca%2BP_k%28d%29%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{a+P_1(d),a+P_2(d),\dots,a+P_k(d)\}' title='\{a+P_1(d),a+P_2(d),\dots,a+P_k(d)\}' class='latex' /> with <img src='http://l.wordpress.com/latex.php?latex=d%5Cne+0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d\ne 0' title='d\ne 0' class='latex' />. To see that this generalizes Szemer&eacute;di&#8217;s theorem, just let <img src='http://l.wordpress.com/latex.php?latex=P_i%28d%29%3Did&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P_i(d)=id' title='P_i(d)=id' class='latex' /> (unless you feel that <img src='http://l.wordpress.com/latex.php?latex=%28i-1%29d&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(i-1)d' title='(i-1)d' class='latex' /> is more natural). </p>
<p>Another special case of this theorem is when <img src='http://l.wordpress.com/latex.php?latex=P_1%28d%29%5Cequiv+0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P_1(d)\equiv 0' title='P_1(d)\equiv 0' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=P_2%28d%29%3Dd%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P_2(d)=d^2' title='P_2(d)=d^2' class='latex' />. Then one is trying to find a subset of the form <img src='http://l.wordpress.com/latex.php?latex=%5C%7Ba%2Ca%2Bd%5E2%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{a,a+d^2\}' title='\{a,a+d^2\}' class='latex' />, or in other words a pair of elements of <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> that differ by a perfect square. This was proved independently by Sarkozy and Furstenberg, though my favourite proof is a more recent argument due to Ben Green that follows more closely the general structure of Roth&#8217;s proof of Roth&#8217;s theorem.</p>
<p>An obvious question now arises: is there a generalization of DHJ that implies both DHJ and the polynomial Szemer&eacute;di&#8217;s theorem? So far, the answer is no, and that is the central problem I was referring to earlier. However, we do at least know the colouring version: that is, there is a colouring statement that simultaneously generalizes the Hales-Jewett theorem and van der Waerden&#8217;s theorem. This is a result of Bergelson and McCutcheon &#8212; alternative proofs have been given by Mark Walters (who has the shortest and simplest argument) and Saharon Shelah (who has produced a primitive recursive bound).<span id="more-1134"></span></p>
<p>A result of this work is that we know what the correct statement of polynomial DHJ ought to be. It is slightly complicated, so first let me give one special case that is already enough to convey the flavour of it.</p>
<p><strong>Conjecture 1.</strong> <em>For every <img src='http://l.wordpress.com/latex.php?latex=%5Cdelta%3E0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\delta&gt;0' title='\delta&gt;0' class='latex' /> and every positive integer <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> there exists <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> such that if <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> is any subset of <img src='http://l.wordpress.com/latex.php?latex=%5Bk%5D%5E%7B%5Bn%5D%5E2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='[k]^{[n]^2}' title='[k]^{[n]^2}' class='latex' /> of density at least <img src='http://l.wordpress.com/latex.php?latex=%5Cdelta&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\delta' title='\delta' class='latex' />, then <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> contains a <a href="http://michaelnielsen.org/polymath1/index.php?title=Combinatorial_line">combinatorial line</a> such that the wildcard set is of the form <img src='http://l.wordpress.com/latex.php?latex=X%5Ctimes+X&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='X\times X' title='X\times X' class='latex' /> for some subset <img src='http://l.wordpress.com/latex.php?latex=X%5Csubset%5Bn%5D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='X\subset[n]' title='X\subset[n]' class='latex' />.</em></p>
<p>By considering the map <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%3A%5Bk%5D%5E%7B%5Bn%5D%5E2%7D%5Crightarrow%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\phi:[k]^{[n]^2}\rightarrow\mathbb{N}' title='\phi:[k]^{[n]^2}\rightarrow\mathbb{N}' class='latex' /> that takes each element of <img src='http://l.wordpress.com/latex.php?latex=%5Bk%5D%5E%7B%5Bn%5D%5E2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='[k]^{[n]^2}' title='[k]^{[n]^2}' class='latex' /> to the sum of all its values, we find that the conjecture above implies the following special case of the polynomial Szemer&eacute;di theorem.</p>
<p><strong>Theorem 2.</strong> <em>For every <img src='http://l.wordpress.com/latex.php?latex=%5Cdelta%3E0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\delta&gt;0' title='\delta&gt;0' class='latex' /> and every <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> there exists <img src='http://l.wordpress.com/latex.php?latex=N&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N' title='N' class='latex' /> such that every subset of <img src='http://l.wordpress.com/latex.php?latex=%5C%7B1%2C2%2C%5Cdots%2CN%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{1,2,\dots,N\}' title='\{1,2,\dots,N\}' class='latex' /> of density at least <img src='http://l.wordpress.com/latex.php?latex=%5Cdelta&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\delta' title='\delta' class='latex' /> contains an arithmetic progression of length <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> whose common difference is a perfect square. </em></p>
<p>The deduction is not quite trivial, but it is not too hard either. The main reason it holds is that the cardinality of <img src='http://l.wordpress.com/latex.php?latex=X%5Ctimes+X&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='X\times X' title='X\times X' class='latex' /> is a perfect square.</p>
<p>Here is (one version of) the full polynomial density Hales-Jewett conjecture. It is a bit like a set-theoretic equivalent of trying to find <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=a%2Bc_1x%2Bc_2x%5E2%2B%5Cdots%2Bc_dx%5Ed&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a+c_1x+c_2x^2+\dots+c_dx^d' title='a+c_1x+c_2x^2+\dots+c_dx^d' class='latex' /> belongs to a given set whenever <img src='http://l.wordpress.com/latex.php?latex=0%5Cleq+c_i%5Cleq+k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='0\leq c_i\leq k' title='0\leq c_i\leq k' class='latex' /> for every <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' />, just as Conjecture 1 resembles Theorem 2.</p>
<p><strong>Conjecture 3.</strong> <em>For every <img src='http://l.wordpress.com/latex.php?latex=%5Cdelta%3E0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\delta&gt;0' title='\delta&gt;0' class='latex' /> and every pair of positive integers <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=d&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d' title='d' class='latex' /> there exists <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> such that if <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> is any subset of <img src='http://l.wordpress.com/latex.php?latex=%5Bk%5D%5E%7B%5Bn%5D%7D%5Ctimes%5Bk%5D%5E%7B%5Bn%5D%5E2%7D%5Ctimes%5Cdots%5Ctimes%5Bk%5D%5E%7B%5Bn%5D%5Ed%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='[k]^{[n]}\times[k]^{[n]^2}\times\dots\times[k]^{[n]^d}' title='[k]^{[n]}\times[k]^{[n]^2}\times\dots\times[k]^{[n]^d}' class='latex' /> of density at least <img src='http://l.wordpress.com/latex.php?latex=%5Cdelta&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\delta' title='\delta' class='latex' />, then <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> contains a <img src='http://l.wordpress.com/latex.php?latex=d&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d' title='d' class='latex' />-dimensional <a href="http://michaelnielsen.org/polymath1/index.php?title=Combinatorial_line">combinatorial subspace</a> such that the wildcard sets are of the form <img src='http://l.wordpress.com/latex.php?latex=X%2C+X%5Ctimes+X%2C%5Cdots%2CX%5Ed&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='X, X\times X,\dots,X^d' title='X, X\times X,\dots,X^d' class='latex' /> for some subset <img src='http://l.wordpress.com/latex.php?latex=X%5Csubset%5Bn%5D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='X\subset[n]' title='X\subset[n]' class='latex' /> (where <img src='http://l.wordpress.com/latex.php?latex=X%5Ej&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='X^j' title='X^j' class='latex' /> is understood to be a subset of the <img src='http://l.wordpress.com/latex.php?latex=%5Bk%5D%5E%7B%5Bn%5D%5Ej%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='[k]^{[n]^j}' title='[k]^{[n]^j}' class='latex' /> that appears in the product).</em></p>
<p>As far as I am aware, the polynomial version of DHJ is the only interesting statement of the form &#8220;Does every dense subset of such and such a structure contain a substructure of such and such a type?&#8221; where the answer is not known, even if one asks nothing at all about bounds. (I&#8217;m not saying that there are definitely no other interesting purely qualitative unsolved density conjectures, but I can&#8217;t think of any. I&#8217;d be interested to hear of others.)</p>
<p>One might expect Conjecture 1 to become hard when <img src='http://l.wordpress.com/latex.php?latex=k%3D3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k=3' title='k=3' class='latex' />, but it is in fact not known even when <img src='http://l.wordpress.com/latex.php?latex=k%3D2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k=2' title='k=2' class='latex' />. On reflection, this is not quite so surprising, since it implies the theorem of Sarkozy and Furstenberg, which is far from trivial. On the other hand, the theorem of Sarkozy and Furstenberg is significantly easier than the full polynomial Szemer&eacute;di theorem, so perhaps there is some hope of tackling the <img src='http://l.wordpress.com/latex.php?latex=k%3D2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k=2' title='k=2' class='latex' /> case of Conjecture 1.</p>
<p>A small variant of this case is particularly appealing combinatorially, and is what I would actually suggest as a good Polymath project.</p>
<p><strong>Conjecture 4.</strong> <em> For every <img src='http://l.wordpress.com/latex.php?latex=%5Cdelta%3E0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\delta&gt;0' title='\delta&gt;0' class='latex' /> there exists <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> such that if <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> is any collection of at least <img src='http://l.wordpress.com/latex.php?latex=%5Cdelta+2%5E%7B%5Cbinom+n2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\delta 2^{\binom n2}' title='\delta 2^{\binom n2}' class='latex' /> graphs with vertex set <img src='http://l.wordpress.com/latex.php?latex=%5C%7B1%2C2%2C%5Cdots%2Cn%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{1,2,\dots,n\}' title='\{1,2,\dots,n\}' class='latex' />, then <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> contains two distinct graphs <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H' title='H' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=G%5Csubset+H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G\subset H' title='G\subset H' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=H%5Csetminus+G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H\setminus G' title='H\setminus G' class='latex' /> is a clique.</em></p>
<p>From this conjecture one can deduce the special case of the polynomial Szemer&eacute;di theorem where one is looking for a pair of numbers of the form <img src='http://l.wordpress.com/latex.php?latex=%5C%7Ba%2Ca%2B%5Cbinom+d2%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{a,a+\binom d2\}' title='\{a,a+\binom d2\}' class='latex' />. (The polynomial <img src='http://l.wordpress.com/latex.php?latex=P%28d%29%3D%5Cbinom+d2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P(d)=\binom d2' title='P(d)=\binom d2' class='latex' /> does not have integer coefficients, but that is a minor technicality.) Bergelson and Leibman proved their theorem by first proving the colouring version (the polynomial van der Waerden theorem) and then using general ergodic machinery to deduce the density version. The colouring version followed by a sophisticated induction scheme (which they called PET induction), and in that scheme the polynomial <img src='http://l.wordpress.com/latex.php?latex=d%28d-1%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d(d-1)' title='d(d-1)' class='latex' /> was &#8220;easier&#8221; than the polynomial <img src='http://l.wordpress.com/latex.php?latex=d%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d^2' title='d^2' class='latex' />. Thus, there are two reasons for looking at this question about dense sets of graphs: it has a nice combinatorial formulation that does not require one to understand the terminology connected with the Hales-Jewett theorem, and it ought to be (trivially equivalent to) the easiest unsolved case of polynomial DHJ.</p>
<p>Here is a looser way of stating the conjecture: a dense set of graphs must contain two that differ by a clique. How might one go about proving this? </p>
<p>Here is an approach that <em>doesn&#8217;t</em> work, but it is still worth thinking about it in order to get a reasonable understanding of the difficulties of the problem. We can think of Conjecture 4 as a strengthening of <a href="http://michaelnielsen.org/polymath1/index.php?title=Sperner%27s_theorem">Sperner&#8217;s theorem</a>: we can reformulate Sperner&#8217;s theorem as the assertion that if you have a dense set of graphs, then you can find distinct graphs <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H' title='H' class='latex' /> in your set such that <img src='http://l.wordpress.com/latex.php?latex=G%5Csubset+H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G\subset H' title='G\subset H' class='latex' />, and Conjecture 3 adds the requirement that <img src='http://l.wordpress.com/latex.php?latex=H%5Csetminus+G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H\setminus G' title='H\setminus G' class='latex' /> should be a clique. (Of course, if we do not add some such requirement then the resulting statement is not really a statement about graphs at all.)</p>
<p>The &#8220;correct&#8221; proof of Sperner&#8217;s theorem, if we pretend that it is a statement about graphs, proceeds by considering a random ordering of all the edges of the complete graph on <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> vertices. If <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> is a dense set of graphs, and if we start with the empty graph and build up to the complete graph by adding random edges one at a time, then the expected number of times we produce a graph in <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> is equal to <img src='http://l.wordpress.com/latex.php?latex=n%2B1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n+1' title='n+1' class='latex' /> times the equal-slices density of <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> (which can be defined as the probability that a graph <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' /> belongs <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> if you first choose an integer <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> uniformly at random from the set <img src='http://l.wordpress.com/latex.php?latex=%5C%7B0%2C1%2C%5Cdots%2C%5Cbinom+n2%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{0,1,\dots,\binom n2\}' title='\{0,1,\dots,\binom n2\}' class='latex' /> and then choose <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' /> with <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> edges uniformly at random from all such graphs). It is easy to show that if <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> is dense, then it has equal-slices density greater than <img src='http://l.wordpress.com/latex.php?latex=1%2F%28n%2B1%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1/(n+1)' title='1/(n+1)' class='latex' />, and this completes the proof of Sperner&#8217;s theorem.</p>
<p>Can we adapt this idea to prove a properly graph-theoretic version of Sperner&#8217;s theorem? If we could find a way of randomly building up a sequence of graphs that started with the empty graph and ended with the complete graph and had the property that any two graphs in the sequence differed by a clique, then we would be in business. However, a moment&#8217;s thought shows that that is not even close to being possible: indeed, a union of two cliques cannot be a clique, so even a very short sequence of such graphs does not exist.</p>
<p>But it is not necessary to construct a sequence with such a ridiculously strong property. For example, suppose that one could build a sequence of graphs <img src='http://l.wordpress.com/latex.php?latex=G_0%5Csubset+G_1%5Csubset%5Cdots%5Csubset+G_m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_0\subset G_1\subset\dots\subset G_m' title='G_0\subset G_1\subset\dots\subset G_m' class='latex' /> (where <img src='http://l.wordpress.com/latex.php?latex=m%3D%5Cbinom+n2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m=\binom n2' title='m=\binom n2' class='latex' />) such that <img src='http://l.wordpress.com/latex.php?latex=G_r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_r' title='G_r' class='latex' /> has <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> edges, and <img src='http://l.wordpress.com/latex.php?latex=G_i%5Csetminus+G_j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_i\setminus G_j' title='G_i\setminus G_j' class='latex' /> is a clique whenever <img src='http://l.wordpress.com/latex.php?latex=i-j%3D%5Cbinom+d2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i-j=\binom d2' title='i-j=\binom d2' class='latex' /> for some <img src='http://l.wordpress.com/latex.php?latex=d&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d' title='d' class='latex' /> (which is clearly a necessary condition). That wouldn&#8217;t immediately lead to a proof, but it would be quite encouraging, because if <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> ever intersected such a sequence densely, then by the polynomial Szemer&eacute;di theorem (in fact, this case follows also from the work of Sarkozy and Furstenberg) it would have to contain two graphs that differed by a clique.</p>
<p>Even this property is far too strong, however. For instance, it would require <img src='http://l.wordpress.com/latex.php?latex=G_%7Bi%2B3%7D%5Csetminus+G_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_{i+3}\setminus G_i' title='G_{i+3}\setminus G_i' class='latex' /> to be a triangle for every <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' />, and it doesn&#8217;t take long to convince oneself that that is impossible. </p>
<p>Nevertheless, these discouraging observations do not completely rule out covering almost all graphs by sequences that are sufficiently rich in clique differences for it to be possible to reduce the problem to a problem about sets of integers. One possible line of enquiry would be to see whether all such proofs can be ruled out. For example, perhaps it is the case that for every chain of graphs <img src='http://l.wordpress.com/latex.php?latex=G_1%5Csubset%5Cdots%5Csubset+G_r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_1\subset\dots\subset G_r' title='G_1\subset\dots\subset G_r' class='latex' /> one can find a set of at least <img src='http://l.wordpress.com/latex.php?latex=r%2F100&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r/100' title='r/100' class='latex' /> graphs in the chain such that no two differ by a clique. That would completely kill off all hopes of a Sperner-type proof, and it is also quite a nice problem in itself (though it may turn out to be easy).</p>
<p>Let me state that formally as a problem. It strikes me as an almost compulsory question to try to answer if one wishes to prove Conjecture 4.</p>
<p><strong>Problem 5.</strong> <em>Is it true that for every <img src='http://l.wordpress.com/latex.php?latex=%5Cdelta%3E0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\delta&gt;0' title='\delta&gt;0' class='latex' /> there exists a sequence of distinct graphs <img src='http://l.wordpress.com/latex.php?latex=G_1%5Csubset+G_2%5Csubset%5Cdots%5Csubset+G_r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_1\subset G_2\subset\dots\subset G_r' title='G_1\subset G_2\subset\dots\subset G_r' class='latex' /> such that for every subset <img src='http://l.wordpress.com/latex.php?latex=A%5Csubset%5C%7B1%2C2%2C%5Cdots%2Cr%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A\subset\{1,2,\dots,r\}' title='A\subset\{1,2,\dots,r\}' class='latex' /> of size at least <img src='http://l.wordpress.com/latex.php?latex=%5Cdelta+r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\delta r' title='\delta r' class='latex' /> there exist <img src='http://l.wordpress.com/latex.php?latex=i%3Cj&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i&lt;j' title='i&lt;j' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=i%2Cj%5Cin+A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i,j\in A' title='i,j\in A' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=G_j%5Csetminus+G_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_j\setminus G_i' title='G_j\setminus G_i' class='latex' /> is a clique?</em></p>
<p>I suspect that this question is not too hard. In fact, I think Mark Walters probably already knows the answer. (No logical relationship is intended between those two sentences.) If the answer is yes, then one could set about trying to decompose the set of all graphs on <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> vertices into a union of such sequences (or do something more general like approximating the uniform measure on the set of all graphs by a weighted sum of such sequences). However, I think it will be more interesting if the answer is no. Then one would be faced with the following question: <em>what could a proof of Conjecture 4 conceivably look like?</em></p>
<p>One difficulty we would face with this problem if we adopted it as a Polymath project is that we do not have as wide a variety of model proofs to work with as we had with DHJ. The obvious theorem to try to generalize is the theorem of Sarkozy and Furstenberg, but the only two proof techniques that have so far given proofs of that theorem are Fourier analytic techniques (used by Sarkozy, and also in subsequent proofs such as that of Green) and ergodic techniques. It seems to be hard to develop an analogue of Fourier analysis in a Hales-Jewett setting&#8212;we had a try at that with DHJ(3), and although we did have some ideas we did not manage to use them to prove DHJ(3) itself. Having said that, we did find a Fourier-ish proof of Sperner&#8217;s theorem, so perhaps it would be worth revisiting those ideas.</p>
<p>An alternative approach might be to try to find a more combinatorial proof of the theorem of Sarkozy and Furstenberg. Given the way that the PET induction scheme works, it might be sensible to assume Szemer&eacute;di&#8217;s theorem. If we could find a density increment argument that made use of Szemer&eacute;di&#8217;s theorem, then we might be able to generalize it in a way that was similar to the way the modified version of Ajtai and Szemer&eacute;di&#8217;s proof of the corners theorem was generalized to yield DHJ(3). That is quite a long shot, but it still seems worth asking the following question.</p>
<p><strong>Problem 6.</strong> <em>Is there a combinatorial proof, using Szemer&eacute;di&#8217;s theorem as a black box, that shows that if <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> is a dense subset of <img src='http://l.wordpress.com/latex.php?latex=%5C%7B1%2C2%2C%5Cdots%2CN%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{1,2,\dots,N\}' title='\{1,2,\dots,N\}' class='latex' /> that contains no pair <img src='http://l.wordpress.com/latex.php?latex=%28a%2Ca%2B%5Cbinom+d2%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(a,a+\binom d2)' title='(a,a+\binom d2)' class='latex' />, then there is an arithmetic progression <img src='http://l.wordpress.com/latex.php?latex=P%5Csubset%5C%7B1%2C2%2C%5Cdots%2CN%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P\subset\{1,2,\dots,N\}' title='P\subset\{1,2,\dots,N\}' class='latex' /> with length tending to infinity such that the density of <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> inside <img src='http://l.wordpress.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P' title='P' class='latex' /> is at least <img src='http://l.wordpress.com/latex.php?latex=%5Cdelta%2Bc%28%5Cdelta%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\delta+c(\delta)' title='\delta+c(\delta)' class='latex' />? </em></p>
<p>How might Szemer&eacute;di&#8217;s theorem be useful, even in principle? I&#8217;m not sure, but the kind of thing I vaguely imagine is, say, a clever trick that associates a two-dimensional set with <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' />, finds a density increment in some set <img src='http://l.wordpress.com/latex.php?latex=P%5Ctimes+Q&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P\times Q' title='P\times Q' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P' title='P' class='latex' /> is an arithmetic progression and <img src='http://l.wordpress.com/latex.php?latex=Q&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Q' title='Q' class='latex' /> is a translate of <img src='http://l.wordpress.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P' title='P' class='latex' />, and does something clever with the diagonals. </p>
<p>That&#8217;s probably so vague as to be useless. One might get something better by examining the colour-focusing argument of Walters. It is admittedly a colouring argument, but it would at least give some ideas if we wanted a proof of the following kind (which is the basic form of the DHJ proof).</p>
<p>Step 1: Find a configuration of a certain kind in which <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> is dense.</p>
<p>Step 2: Argue that many numbers cannot be in <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> as a result, and that these numbers form a set with a certain structure to it. (This is where a close examination of Walters&#8217;s argument might give us some clues.)</p>
<p>Step 3: By averaging, show that <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> has a density increment on a structured set.</p>
<p>Step 4: Show that a structured set can be almost entirely partitioned into arithmetic progressions with square common difference. (It&#8217;s plausible that Szemer&eacute;di&#8217;s theorem might be useful for this step.)</p>
<p>Step 5: By averaging, obtain a density increment on an arithmetic progression of square common difference.</p>
<p>Step 6: Iterate.</p>
<p>***********************</p>
<p>I am writing this section of the post after having been in contact with Mark Walters. It seems that part of what I wrote above may be wrong (in as far as a matter of judgment can be wrong). It turns out that he does not know the answer to Problem 5 above, and despite what I said above that is good evidence that the problem is after all hard. His instinct is that such a sequence of graphs does exist, and that if it didn&#8217;t exist then the entire theorem would be false.</p>
<p>That suggests another question one could think about.</p>
<p><strong>Problem 7.</strong> <em>Suppose that for every <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> there is a dense set of graphs with vertex set <img src='http://l.wordpress.com/latex.php?latex=%5C%7B1%2C2%2C%5Cdots%2Cn%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{1,2,\dots,n\}' title='\{1,2,\dots,n\}' class='latex' /> such that no two differ by a clique. Does it follow that for every nested sequence <img src='http://l.wordpress.com/latex.php?latex=G_1%5Csubset%5Cdots%5Csubset+G_r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_1\subset\dots\subset G_r' title='G_1\subset\dots\subset G_r' class='latex' /> there is a dense subset <img src='http://l.wordpress.com/latex.php?latex=A%5Csubset%5C%7B1%2C2%2C%5Cdots%2Cr%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A\subset\{1,2,\dots,r\}' title='A\subset\{1,2,\dots,r\}' class='latex' /> such that if <img src='http://l.wordpress.com/latex.php?latex=i%2Cj%5Cin+A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i,j\in A' title='i,j\in A' class='latex' /> then <img src='http://l.wordpress.com/latex.php?latex=G_j%5Csetminus+G_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_j\setminus G_i' title='G_j\setminus G_i' class='latex' /> is not a clique?</em></p>
<p>In case that sounds a bit far fetched, let me indicate how one might go about proving it (not that I have any particular reason to suppose that this proof idea works). Given a nested sequence of graphs <img src='http://l.wordpress.com/latex.php?latex=G_1%5Csubset%5Cdots%5Csubset+G_r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_1\subset\dots\subset G_r' title='G_1\subset\dots\subset G_r' class='latex' /> one would choose some <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> that is much much bigger than <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> and then cover the set of all graphs with vertex set <img src='http://l.wordpress.com/latex.php?latex=%5C%7B1%2C2%2C%5Cdots%2Cn%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{1,2,\dots,n\}' title='\{1,2,\dots,n\}' class='latex' /> as evenly as possible with &#8220;copies&#8221; of this sequence. Then one would take a dense set of graphs with vertex set <img src='http://l.wordpress.com/latex.php?latex=%5C%7B1%2C2%2C%5Cdots%2Cn%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{1,2,\dots,n\}' title='\{1,2,\dots,n\}' class='latex' /> and no clique difference, and by averaging one would find a copy of <img src='http://l.wordpress.com/latex.php?latex=%28G_1%2C%5Cdots%2CG_r%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(G_1,\dots,G_r)' title='(G_1,\dots,G_r)' class='latex' /> that contained at least <img src='http://l.wordpress.com/latex.php?latex=%5Cdelta+r%2F2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\delta r/2' title='\delta r/2' class='latex' /> elements of this dense set. This would show that the copy of <img src='http://l.wordpress.com/latex.php?latex=%28G_1%2C%5Cdots%2CG_r%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(G_1,\dots,G_r)' title='(G_1,\dots,G_r)' class='latex' /> had a dense subset with no clique difference, and hence that <img src='http://l.wordpress.com/latex.php?latex=%28G_1%2C%5Cdots%2CG_r%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(G_1,\dots,G_r)' title='(G_1,\dots,G_r)' class='latex' /> itself had a dense subset with no clique difference. </p>
<p>Since almost all graphs contain almost all small graphs as induced subgraphs, and do so with roughly the same frequency, it could be that the above sketch is basically correct, but I&#8217;d need to check. If it is, then Problem 5 is, as Mark thought, equivalent to the whole problem. In that case, attention would focus on the following question: how does one produce a nested sequence of graphs that is &#8220;rich&#8221; in clique differences?</p>
<p>To make that question very slightly more precise, let us define a class of graphs as follows. Given a nested sequence of <img src='http://l.wordpress.com/latex.php?latex=G_1%5Csubset%5Cdots%5Csubset+G_r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_1\subset\dots\subset G_r' title='G_1\subset\dots\subset G_r' class='latex' />, let <img src='http://l.wordpress.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H' title='H' class='latex' /> be the graph with vertex set <img src='http://l.wordpress.com/latex.php?latex=%5C%7B1%2C2%2C%5Cdots%2Cr%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{1,2,\dots,r\}' title='\{1,2,\dots,r\}' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' /> is joined to <img src='http://l.wordpress.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='j' title='j' class='latex' /> if and only if <img src='http://l.wordpress.com/latex.php?latex=G_j%5Csetminus+G_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_j\setminus G_i' title='G_j\setminus G_i' class='latex' /> is a clique. Let us call a graph <img src='http://l.wordpress.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H' title='H' class='latex' /> constructed in this way a <em>clique-difference graph</em>. Problem 5 can be reformulated as follows.</p>
<p><strong>Problem 5A.</strong> <em>Is it true that for every <img src='http://l.wordpress.com/latex.php?latex=%5Cdelta%3E0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\delta&gt;0' title='\delta&gt;0' class='latex' /> there exists a clique-difference graph with <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> vertices and independence number at most <img src='http://l.wordpress.com/latex.php?latex=%5Cdelta+r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\delta r' title='\delta r' class='latex' />?</em></p>
<p>(The independence number of a graph is the largest possible number of vertices you can find such that no two are joined, so this is equivalent to saying that you cannot find <img src='http://l.wordpress.com/latex.php?latex=%5Cdelta+r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\delta r' title='\delta r' class='latex' /> graphs in the corresponding nested sequence without two of them differing by a clique.) </p>
<p>Having reformulated Problem 5 in this way, we are in a position to ask weaker questions, and therefore to give ourselves some hope of getting started. For instance, one could ask the following general question.</p>
<p><strong>Problem 8.</strong> <em>What properties do clique-difference graphs have?</em></p>
<p>To give some idea of the kind of result one might hope for here, let me make the simple observation that clique-difference graphs are all triangle free, since a disjoint union of two cliques is never a clique. So we are searching for a triangle-free graph with small independence number (and hence large chromatic number). This might sound a bit worrying, but the difficulty already arises in the number-theoretic case. For instance, if you join two integers when they differ by a cube, you get a triangle-free graph (by a special case of Fermat&#8217;s last theorem, proved by Euler), but all dense sets of integers contain a difference equal to a cube (so there&#8217;s yet another solution to the nice problem of finding triangle-free graphs with arbitrarily large chromatic number).</p>
<p>Of course, the fact that clique-difference graphs are triangle free is working <em>against</em> us, but it may be that if we could understand the restrictions that clique-difference graphs have to satisfy, we would then be in a better position to work out whether clique-difference graphs with small independence number exist.</p>
<p>If constructing good clique-difference graphs seems to be too difficult, then another line of attack might be a probabilistic one. The following question encapsulates what I mean.</p>
<p><strong>Problem 9.</strong> <em>Let <img src='http://l.wordpress.com/latex.php?latex=r%3D%5Cbinom+s2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r=\binom s2' title='r=\binom s2' class='latex' /> and define a nested sequence of graphs <img src='http://l.wordpress.com/latex.php?latex=G_0%5Csubset%5Cdots%5Csubset+G_r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_0\subset\dots\subset G_r' title='G_0\subset\dots\subset G_r' class='latex' /> with vertex set <img src='http://l.wordpress.com/latex.php?latex=%5C%7B1%2C2%2C%5Cdots%2Cs%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{1,2,\dots,s\}' title='\{1,2,\dots,s\}' class='latex' /> by taking <img src='http://l.wordpress.com/latex.php?latex=G_0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_0' title='G_0' class='latex' /> to have no edges and letting <img src='http://l.wordpress.com/latex.php?latex=G_%7Bi%2B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_{i+1}' title='G_{i+1}' class='latex' /> be the union of <img src='http://l.wordpress.com/latex.php?latex=G_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_i' title='G_i' class='latex' /> with a random edge not contained in <img src='http://l.wordpress.com/latex.php?latex=G_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_i' title='G_i' class='latex' />. What can we say about the resulting clique-difference graph?</em></p>
<p>Looking at that after having written it, I&#8217;m not sure it&#8217;s such a great question. It looks to me as though if you add edges completely randomly, then there will be almost no clique differences in the resulting sequence. But perhaps one can do something more ingenious, such as adding edges in a way that favours vertices of high degree. (Such processes have been studied in detail as a way of modelling how the internet grows.) So let us interpret &#8220;random edge&#8221; above to mean &#8220;edge chosen according to some clever procedure that involves at least some randomization&#8221;. Even then, it is by no means clear that randomization will work better than a careful deterministic choice of graph sequence that creates many clique differences. </p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/1134/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/1134/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/gowers.wordpress.com/1134/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/gowers.wordpress.com/1134/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/gowers.wordpress.com/1134/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/gowers.wordpress.com/1134/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/gowers.wordpress.com/1134/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/gowers.wordpress.com/1134/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/gowers.wordpress.com/1134/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/gowers.wordpress.com/1134/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&blog=1659011&post=1134&subd=gowers&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2009/11/14/the-first-unknown-case-of-polynomial-dhj/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>A conversation about complexity lower bounds, X</title>
		<link>http://gowers.wordpress.com/2009/11/10/a-conversation-about-complexity-lower-bounds-x-2/</link>
		<comments>http://gowers.wordpress.com/2009/11/10/a-conversation-about-complexity-lower-bounds-x-2/#comments</comments>
		<pubDate>Tue, 10 Nov 2009 11:57:16 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=1195</guid>
		<description><![CDATA[This is the final post in the series about complexity lower bounds. It ends not with any grand conclusion, but just with the three characters running out of steam. The main focus of this final instalment is the Gaussian-elimination problem mentioned in earlier instalments (find an explicit nonsingular matrix over  that needs a superlinear [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&blog=1659011&post=1195&subd=gowers&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>This is the final post in the series about complexity lower bounds. It ends not with any grand conclusion, but just with the three characters running out of steam. The main focus of this final instalment is the Gaussian-elimination problem mentioned in earlier instalments (find an explicit nonsingular matrix over <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BF%7D_2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{F}_2' title='\mathbb{F}_2' class='latex' /> that needs a superlinear number of row operations to turn it into the identity). The discussion follows a familiar pattern, starting out with some ideas for solving the question, understanding why they are hopelessly over-optimistic, and ending with some speculations about why even this problem might be extremely hard. <span id="more-1195"></span></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  You know, since we started thinking about the Walsh matrix, I&#8217;ve begun to have my doubts about the burst of enthusiasm I had for the natural-proofs barrier when you first told me about it.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> How do you mean?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Well, I started applying it to pretty well everything with no evidence to back up what I was doing. I gave a model for a random formula that turned out to be pretty hopeless, though it seems I was lucky and that pseudorandom generators with low formula complexity are known to exist (subject to the usual hypotheses). But I also said that it was plausible that no polynomial-time algorithm could distinguish between a random non-singular matrix over <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BF%7D_2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{F}_2' title='\mathbb{F}_2' class='latex' /> and a random matrix obtained from the identity by <img src='http://l.wordpress.com/latex.php?latex=1000n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1000n' title='1000n' class='latex' /> row operations. We&#8217;ve subsequently seen that that statement is false for an obvious reason: the second matrix will be sparse. It is possible to get round that problem, but I don&#8217;t see a strong argument for its being hard to give a superlinear lower bound, particularly given that the obvious way of producing a Walsh matrix takes time <img src='http://l.wordpress.com/latex.php?latex=n%5Clog+n.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n\log n.' title='n\log n.' class='latex' /> I think the point at which the problem becomes hard could just as easily be at around <img src='http://l.wordpress.com/latex.php?latex=n%5Clog+n%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n\log n,' title='n\log n,' class='latex' /> which is also the point at which a random sequence of row operations ought to produce a random-looking matrix.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> So do you have some idea of how you might go about proving a lower bound of <img src='http://l.wordpress.com/latex.php?latex=cn%5Clog+n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='cn\log n' title='cn\log n' class='latex' />?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I do actually. Not a properly thought through idea, so it&#8217;s very unlikely to work. But it&#8217;s just one of those ideas that you have to think about before dismissing it.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I think I know the kind you mean &#8230;</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Ha ha. Anyhow, my thought was that when you build the Walsh matrix in the obvious way, you start with the <img src='http://l.wordpress.com/latex.php?latex=2%5Ek%5Ctimes+2%5Ek&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^k\times 2^k' title='2^k\times 2^k' class='latex' /> identity matrix. Then you make that into a block diagonal matrix with <img src='http://l.wordpress.com/latex.php?latex=2%5E%7Bk-1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{k-1}' title='2^{k-1}' class='latex' /> copies of <img src='http://l.wordpress.com/latex.php?latex=W_1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='W_1' title='W_1' class='latex' /> going down the diagonal. Then you pair those up to make <img src='http://l.wordpress.com/latex.php?latex=2%5E%7Bk-2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{k-2}' title='2^{k-2}' class='latex' /> copies of <img src='http://l.wordpress.com/latex.php?latex=W_2%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='W_2,' title='W_2,' class='latex' /> and so on. You have to do <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> steps, and each step takes you <img src='http://l.wordpress.com/latex.php?latex=2%5Ek&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^k' title='2^k' class='latex' /> row operations. </p>
<p>Now perhaps there is some parameter <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa' title='\kappa' class='latex' /> that you can associate with matrices, with the following property. Let <img src='http://l.wordpress.com/latex.php?latex=V_%7Bj%2Ck%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='V_{j,k}' title='V_{j,k}' class='latex' /> be the block diagonal matrix that consists of <img src='http://l.wordpress.com/latex.php?latex=2%5E%7Bk-j%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{k-j}' title='2^{k-j}' class='latex' /> copies of <img src='http://l.wordpress.com/latex.php?latex=W_j.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='W_j.' title='W_j.' class='latex' /> We would like to show that the number of row operations you need to apply to a <img src='http://l.wordpress.com/latex.php?latex=2%5Ek%5Ctimes+2%5Ek&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^k\times 2^k' title='2^k\times 2^k' class='latex' /> matrix <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> with <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa%28A%29%5Cleq%5Ckappa%28V_%7Bj%2Ck%7D%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa(A)\leq\kappa(V_{j,k})' title='\kappa(A)\leq\kappa(V_{j,k})' class='latex' /> in order to obtain a matrix <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='B' title='B' class='latex' /> with <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa%28B%29%5Cgeq%5Ckappa%28V_%7Bj%2B1%2Ck%7D%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa(B)\geq\kappa(V_{j+1,k})' title='\kappa(B)\geq\kappa(V_{j+1,k})' class='latex' /> is at least <img src='http://l.wordpress.com/latex.php?latex=2%5Ek.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^k.' title='2^k.' class='latex' /></p>
<p>I also have in mind an obvious parameter to try.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Oh yes?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Yes. It&#8217;s quite similar to the <img src='http://l.wordpress.com/latex.php?latex=U%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^2' title='U^2' class='latex' /> norm, and comes from the theory of quasirandom graphs. You take your <img src='http://l.wordpress.com/latex.php?latex=01&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='01' title='01' class='latex' /> matrix, and convert it into a <img src='http://l.wordpress.com/latex.php?latex=%5Cpm+1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pm 1' title='\pm 1' class='latex' /> matrix in the usual way. If the converted matrix is <img src='http://l.wordpress.com/latex.php?latex=A%28x%2Cy%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A(x,y)' title='A(x,y)' class='latex' /> then the parameter is </p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle+%5Cmathbb%7BE%7D_%7Bx%2Cx%27%2Cy%2Cy%27%7DA%28x%2Cy%29A%28x%2Cy%27%29A%28x%27%2Cy%29A%28x%27%2Cy%27%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle \mathbb{E}_{x,x&#039;,y,y&#039;}A(x,y)A(x,y&#039;)A(x&#039;,y)A(x&#039;,y&#039;).' title='\displaystyle \mathbb{E}_{x,x&#039;,y,y&#039;}A(x,y)A(x,y&#039;)A(x&#039;,y)A(x&#039;,y&#039;).' class='latex' /></p>
<p>If <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> is 1 everywhere, then you obviously get 1 for this expression. Also, since you can rewrite the expression as</p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle+%5Cmathbb%7BE%7D_%7Bx%2Cx%27%7D%28%5Cmathbb%7BE%7D_yA%28x%2Cy%29A%28x%27%2Cy%29%29%5E2%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle \mathbb{E}_{x,x&#039;}(\mathbb{E}_yA(x,y)A(x&#039;,y))^2,' title='\displaystyle \mathbb{E}_{x,x&#039;}(\mathbb{E}_yA(x,y)A(x&#039;,y))^2,' class='latex' /></p>
<p>and since <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BE%7D_yA%28x%2Cy%29A%28x%27%2Cy%29%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{E}_yA(x,y)A(x&#039;,y)=1' title='\mathbb{E}_yA(x,y)A(x&#039;,y)=1' class='latex' /> when <img src='http://l.wordpress.com/latex.php?latex=x%3Dx%27%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x=x&#039;,' title='x=x&#039;,' class='latex' /> it follows that the value taken by the parameter is at least the probability that <img src='http://l.wordpress.com/latex.php?latex=x%3Dx%27%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x=x&#039;,' title='x=x&#039;,' class='latex' /> which is of course the reciprocal of the size of the set that <img src='http://l.wordpress.com/latex.php?latex=x%2Cx%27%2Cy&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x,x&#039;,y' title='x,x&#039;,y' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=y%27&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y&#039;' title='y&#039;' class='latex' /> come from. In our case this size is <img src='http://l.wordpress.com/latex.php?latex=n%3D2%5Ek.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n=2^k.' title='n=2^k.' class='latex' /></p>
<p>Now what we are basically doing in calculating this parameter is taking the inner products of all pairs of rows, squaring them, and taking the average. Since the rows of a Walsh matrix (in its <img src='http://l.wordpress.com/latex.php?latex=%5Cpm+1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pm 1' title='\pm 1' class='latex' /> version) are orthogonal, we see that it has exactly the minimal value of the parameter I&#8217;ve just defined. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  I think that means you want to take the reciprocal. Earlier you said you wanted it to take at least <img src='http://l.wordpress.com/latex.php?latex=2%5Ek&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^k' title='2^k' class='latex' /> row operations to get the parameter to <em>increase</em> by a certain amount. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Yes, you&#8217;re right. Or we might find that we wanted to go for some kind of dual definition to this one, as we did when we were looking at <img src='http://l.wordpress.com/latex.php?latex=U%5Ek&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^k' title='U^k' class='latex' /> norms, in which case we&#8217;d have an increasing parameter. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Do you mind saying exactly what you mean?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Not at all. If <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='B' title='B' class='latex' /> are two real matrices of the same size, define <img src='http://l.wordpress.com/latex.php?latex=%5Clangle+A%2CB%5Crangle&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\langle A,B\rangle' title='\langle A,B\rangle' class='latex' /> to be <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BE%7D_%7Bx%2Cy%7DA%28x%2Cy%29B%28x%2Cy%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{E}_{x,y}A(x,y)B(x,y).' title='\mathbb{E}_{x,y}A(x,y)B(x,y).' class='latex' /> And then, if <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa' title='\kappa' class='latex' /> is the parameter I defined above, define a dual parameter <img src='http://l.wordpress.com/latex.php?latex=%5Clambda&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda' title='\lambda' class='latex' /> by </p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Clambda%28A%29%3D%28%5Cmax%5C%7B%5Clangle+A%2CB%5Crangle%3A%5Ckappa%28B%29%5Cleq+1%5C%7D%29%5E4.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda(A)=(\max\{\langle A,B\rangle:\kappa(B)\leq 1\})^4.' title='\lambda(A)=(\max\{\langle A,B\rangle:\kappa(B)\leq 1\})^4.' class='latex' /></p>
<p>If <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> is a <img src='http://l.wordpress.com/latex.php?latex=%5Cpm+1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pm 1' title='\pm 1' class='latex' /> matrix, then taking <img src='http://l.wordpress.com/latex.php?latex=B%3DA%2F%5Ckappa%28A%29%5E%7B1%2F4%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='B=A/\kappa(A)^{1/4}' title='B=A/\kappa(A)^{1/4}' class='latex' /> gives us the inequality <img src='http://l.wordpress.com/latex.php?latex=%5Clambda%28A%29%5Cgeq%5Ckappa%28A%29%5E%7B-1%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda(A)\geq\kappa(A)^{-1}.' title='\lambda(A)\geq\kappa(A)^{-1}.' class='latex' /> So the Walsh matrix has parameter at least (and in fact equal to) <img src='http://l.wordpress.com/latex.php?latex=2%5Ek%3Dn.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^k=n.' title='2^k=n.' class='latex' /></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  So the fourth power that you put in there was just to make the maximum equal to <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> rather than <img src='http://l.wordpress.com/latex.php?latex=n%5E%7B1%2F4%7D%3F&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^{1/4}?' title='n^{1/4}?' class='latex' /></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Sort of. Without the fourth power we get a norm, but the parameter <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa' title='\kappa' class='latex' /> was the fourth power of a norm and somehow it feels nicer to reflect that by making <img src='http://l.wordpress.com/latex.php?latex=%5Clambda&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda' title='\lambda' class='latex' /> the fourth power of a norm as well.</p>
<p>Anyhow, what I&#8217;d very much like is for the value of the parameter at <img src='http://l.wordpress.com/latex.php?latex=V_%7Bj%2Ck%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='V_{j,k}' title='V_{j,k}' class='latex' /> to double (or possibly halve if we decide to decrease) when <img src='http://l.wordpress.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='j' title='j' class='latex' /> is replaced by <img src='http://l.wordpress.com/latex.php?latex=j%2B1.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='j+1.' title='j+1.' class='latex' /> It seems that <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa' title='\kappa' class='latex' /> is not going to do that for us: the problem is that the identity matrix becomes a diagonal of <img src='http://l.wordpress.com/latex.php?latex=-1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='-1' title='-1' class='latex' />s in a sea of <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' />s, which is very unbalanced. Indeed, all the <img src='http://l.wordpress.com/latex.php?latex=V_%7Bj%2Ck%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='V_{j,k}' title='V_{j,k}' class='latex' />s in their <img src='http://l.wordpress.com/latex.php?latex=%5Cpm+1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pm 1' title='\pm 1' class='latex' /> versions are unbalanced.</p>
<p>But being unbalanced is not so much of a defect when we take dual parameters, so perhaps we&#8217;ll have better luck there. What is <img src='http://l.wordpress.com/latex.php?latex=%5Clambda&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda' title='\lambda' class='latex' /> of the identity?</p>
<p>Hmm &#8230;</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  You&#8217;ve gone all quiet.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Sorry, I had to do some calculations. I&#8217;m coming to the conclusion that it is a nuisance to have <img src='http://l.wordpress.com/latex.php?latex=%5Cpm+1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pm 1' title='\pm 1' class='latex' /> matrices here. I want to define the parameter to be <img src='http://l.wordpress.com/latex.php?latex=%5Clambda%28A%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda(A)' title='\lambda(A)' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=%5Clambda&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda' title='\lambda' class='latex' /> is defined exactly as above but <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> is a 01 matrix rather than a <img src='http://l.wordpress.com/latex.php?latex=%5Cpm+1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pm 1' title='\pm 1' class='latex' /> matrix. And now, if some hasty and unchecked guesses and calculations are to be believed, <img src='http://l.wordpress.com/latex.php?latex=%5Clambda&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda' title='\lambda' class='latex' /> of the identity is something like <img src='http://l.wordpress.com/latex.php?latex=n%5E%7B-3%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^{-3}.' title='n^{-3}.' class='latex' /></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Just as a matter of interest, how did you arrive at that figure?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I guessed that the Walsh matrix would be a good matrix for norming the identity. I suppose I should have tried the identity itself. What do we get there? Well, <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa' title='\kappa' class='latex' /> of the identity is <img src='http://l.wordpress.com/latex.php?latex=n%5E%7B-3%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^{-3},' title='n^{-3},' class='latex' /> so <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa%28n%5E%7B3%2F4%7DI_n%29%3D1.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa(n^{3/4}I_n)=1.' title='\kappa(n^{3/4}I_n)=1.' class='latex' /> Since <img src='http://l.wordpress.com/latex.php?latex=%5Clangle+I_n%2Cn%5E%7B3%2F4%7DI_n%5Crangle%3Dn%5E%7B-1%2F4%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\langle I_n,n^{3/4}I_n\rangle=n^{-1/4},' title='\langle I_n,n^{3/4}I_n\rangle=n^{-1/4},' class='latex' /> it follows that <img src='http://l.wordpress.com/latex.php?latex=%5Clambda%28I_n%29%5Cgeq+n%5E%7B-1%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda(I_n)\geq n^{-1}.' title='\lambda(I_n)\geq n^{-1}.' class='latex' /></p>
<p>I must apologize. That is now my revised guess. Indeed, I would guess that <img src='http://l.wordpress.com/latex.php?latex=%5Clambda%28V_%7Bj%2Ck%7D%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda(V_{j,k})' title='\lambda(V_{j,k})' class='latex' /> is always worked out by hitting it with the appropriate multiple of <img src='http://l.wordpress.com/latex.php?latex=V_%7Bj%2Ck%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='V_{j,k}.' title='V_{j,k}.' class='latex' /> </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Well it looks easy to work out <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa%28V_%7Bj%2Ck%7D%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa(V_{j,k}).' title='\kappa(V_{j,k}).' class='latex' /> The probability that <img src='http://l.wordpress.com/latex.php?latex=x%2Cx%27%2Cy&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x,x&#039;,y' title='x,x&#039;,y' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=y%27&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y&#039;' title='y&#039;' class='latex' /> all come from the same block is <img src='http://l.wordpress.com/latex.php?latex=2%5E%7B-3%28k-j%29%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{-3(k-j)},' title='2^{-3(k-j)},' class='latex' /> and the expectation of the thing you average over given that they do is <img src='http://l.wordpress.com/latex.php?latex=2%5E%7B-j%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{-j}.' title='2^{-j}.' class='latex' /> So you end up with <img src='http://l.wordpress.com/latex.php?latex=2%5E%7B-3k%2B2j%7D%3D2%5E%7B2j%7Dn%5E%7B-3%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{-3k+2j}=2^{2j}n^{-3}.' title='2^{-3k+2j}=2^{2j}n^{-3}.' class='latex' /> So the right multiple of <img src='http://l.wordpress.com/latex.php?latex=V_%7Bj%2Ck%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='V_{j,k}' title='V_{j,k}' class='latex' /> should be <img src='http://l.wordpress.com/latex.php?latex=n%5E%7B3%2F4%7D2%5E%7B-j%2F2%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^{3/4}2^{-j/2}.' title='n^{3/4}2^{-j/2}.' class='latex' /> The inner product of <img src='http://l.wordpress.com/latex.php?latex=V_%7Bj%2Ck%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='V_{j,k}' title='V_{j,k}' class='latex' /> with itself is, up to a constant factor, <img src='http://l.wordpress.com/latex.php?latex=2%5E%7B2j%7D2%5E%7Bk-j%7Dn%5E%7B-2%7D%3D2%5Ejn%5E%7B-1%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{2j}2^{k-j}n^{-2}=2^jn^{-1}.' title='2^{2j}2^{k-j}n^{-2}=2^jn^{-1}.' class='latex' /> Putting in the normalizing factor we get <img src='http://l.wordpress.com/latex.php?latex=2%5E%7Bj%2F2%7Dn%5E%7B-1%2F4%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{j/2}n^{-1/4}.' title='2^{j/2}n^{-1/4}.' class='latex' /> Finally, raising this to the power 4 gives us <img src='http://l.wordpress.com/latex.php?latex=2%5E%7B2j%7Dn%5E%7B-1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{2j}n^{-1}' title='2^{2j}n^{-1}' class='latex' /> as the value, up to a constant, of <img src='http://l.wordpress.com/latex.php?latex=%5Clambda%28V_%7Bj%2Ck%7D%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda(V_{j,k}).' title='\lambda(V_{j,k}).' class='latex' /> Note that this isn&#8217;t a proof, because we haven&#8217;t proved that <img src='http://l.wordpress.com/latex.php?latex=V_%7Bj%2Ck%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='V_{j,k}' title='V_{j,k}' class='latex' /> is self-norming. But it seems to be at least potentially true.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Right, and <em>if</em> it is true, then we could hazard a guess of the following kind. Perhaps in order to double the value of <img src='http://l.wordpress.com/latex.php?latex=%5Clambda%28A%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda(A)' title='\lambda(A)' class='latex' /> you have to apply at least a linear number of row operations.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I&#8217;m afraid I&#8217;m starting to feel very suspicious again.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Why?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Because the matrices <img src='http://l.wordpress.com/latex.php?latex=V_%7Bj%2Ck%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='V_{j,k}' title='V_{j,k}' class='latex' /> are very sparse when <img src='http://l.wordpress.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='j' title='j' class='latex' /> is small, and could therefore be giving you a misleading impression. If we go back to the idea of creating an upper triangular matrix and then randomly applying row operations, we might well find that we could get in linear time to a matrix <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> with <img src='http://l.wordpress.com/latex.php?latex=%5Clambda%28A%29%5Cgeq+n%2F%5Clog+n.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda(A)\geq n/\log n.' title='\lambda(A)\geq n/\log n.' class='latex' /> In fact, I wouldn&#8217;t be surprised if you could go all the way.</p>
<p>I&#8217;m not saying that you can get the Walsh matrix this way: it seems quite plausible, as you say, that the obvious way of producing it is the best. However, I think the parameter you are looking at may be too close to natural to work. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Ah yes. If you think about eigenvalues, I think you find that <img src='http://l.wordpress.com/latex.php?latex=%5Clambda&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda' title='\lambda' class='latex' /> of a random matrix will be within a constant of the maximum, so if the proof worked it would certainly distinguish between random matrices and matrices that can be produced in <img src='http://l.wordpress.com/latex.php?latex=1000n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1000n' title='1000n' class='latex' /> operations. </p>
<p>Maybe to get a handle on this we should consider the following model for a random low-complexity matrix. Given any <img src='http://l.wordpress.com/latex.php?latex=n%5Ctimes+n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n\times n' title='n\times n' class='latex' /> matrix <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> and any permutation <img src='http://l.wordpress.com/latex.php?latex=%5Cpi&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi' title='\pi' class='latex' /> of the numbers from <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=n%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n,' title='n,' class='latex' /> let <img src='http://l.wordpress.com/latex.php?latex=%5Cpi%2AA&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi*A' title='\pi*A' class='latex' /> stand for the matrix you obtain from <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> by adding row <img src='http://l.wordpress.com/latex.php?latex=%5Cpi%281%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi(1)' title='\pi(1)' class='latex' /> to row <img src='http://l.wordpress.com/latex.php?latex=%5Cpi%282%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi(2)' title='\pi(2)' class='latex' /> then row <img src='http://l.wordpress.com/latex.php?latex=%5Cpi%282%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi(2)' title='\pi(2)' class='latex' /> to row <img src='http://l.wordpress.com/latex.php?latex=%5Cpi%283%29%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi(3),' title='\pi(3),' class='latex' /> and so on all the way up to <img src='http://l.wordpress.com/latex.php?latex=%5Cpi%28n%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi(n).' title='\pi(n).' class='latex' /> So row <img src='http://l.wordpress.com/latex.php?latex=%5Cpi%28j%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi(j)' title='\pi(j)' class='latex' /> of <img src='http://l.wordpress.com/latex.php?latex=%5Cpi%2AA&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi*A' title='\pi*A' class='latex' /> is the sum of rows <img src='http://l.wordpress.com/latex.php?latex=%5Cpi%281%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi(1)' title='\pi(1)' class='latex' /> up to <img src='http://l.wordpress.com/latex.php?latex=%5Cpi%28j-1%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi(j-1)' title='\pi(j-1)' class='latex' /> of <img src='http://l.wordpress.com/latex.php?latex=A.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A.' title='A.' class='latex' /> </p>
<p>And now, to produce a matrix in linear time, choose <img src='http://l.wordpress.com/latex.php?latex=1000&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1000' title='1000' class='latex' /> permutations <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_1%2C%5Cdots%2C%5Cpi_%7B1000%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_1,\dots,\pi_{1000},' title='\pi_1,\dots,\pi_{1000},' class='latex' /> and take the matrix </p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle+A%3D%5Cpi_%7B1000%7D%2A%5Cpi_%7B999%7D%2A%5Cdots%2A%5Cpi_2%2A%5Cpi_1%2A%28I_n%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle A=\pi_{1000}*\pi_{999}*\dots*\pi_2*\pi_1*(I_n).' title='\displaystyle A=\pi_{1000}*\pi_{999}*\dots*\pi_2*\pi_1*(I_n).' class='latex' /> </p>
<p>This can of course be thought of in another way: let <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='T' title='T' class='latex' /> be the matrix that is 0 below the diagonal and 1 on and above it; then take <img src='http://l.wordpress.com/latex.php?latex=1000&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1000' title='1000' class='latex' /> matrices obtained by randomly and independently permuting the rows of <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='T' title='T' class='latex' />; then multiply them all together.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> That&#8217;s more like it. Now we have a matrix that looks as though it could be hard to distinguish from a purely random (non-singular) matrix.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  It seems to me that if this is a good model, then even <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_2%2A%5Cpi_1%2AI_n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_2*\pi_1*I_n' title='\pi_2*\pi_1*I_n' class='latex' /> should be pretty scrambled. Indeed, without loss of generality, <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_1' title='\pi_1' class='latex' /> is the identity permutation. That means that <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_1%2A%28I_n%29%28x%2Cy%29%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_1*(I_n)(x,y)=1' title='\pi_1*(I_n)(x,y)=1' class='latex' /> if <img src='http://l.wordpress.com/latex.php?latex=x%5Cleq+y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x\leq y' title='x\leq y' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='0' title='0' class='latex' /> if <img src='http://l.wordpress.com/latex.php?latex=x%3Ey.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x&gt;y.' title='x&gt;y.' class='latex' /> Writing <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> for <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_2%2A%5Cpi_1%2AI_n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_2*\pi_1*I_n' title='\pi_2*\pi_1*I_n' class='latex' /> and writing <img src='http://l.wordpress.com/latex.php?latex=%5Cpi&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi' title='\pi' class='latex' /> for <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_2%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_2,' title='\pi_2,' class='latex' /> we then find that <img src='http://l.wordpress.com/latex.php?latex=A%28%5Cpi%28x%29%2Cy%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A(\pi(x),y)' title='A(\pi(x),y)' class='latex' /> equals the parity of the set of all <img src='http://l.wordpress.com/latex.php?latex=u%3Cx&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='u&lt;x' title='u&lt;x' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=%5Cpi%28u%29%5Cleq+y.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi(u)\leq y.' title='\pi(u)\leq y.' class='latex' /> </p>
<p>If we now take <img src='http://l.wordpress.com/latex.php?latex=x%3Cx%27&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x&lt;x&#039;' title='x&lt;x&#039;' class='latex' /> and add rows <img src='http://l.wordpress.com/latex.php?latex=%5Cpi%28x%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi(x)' title='\pi(x)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cpi%28x%27%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi(x&#039;)' title='\pi(x&#039;)' class='latex' /> (mod 2), then we get the parity of the set of all <img src='http://l.wordpress.com/latex.php?latex=u&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='u' title='u' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=x%5Cleq+u%3Cx%27&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x\leq u&lt;x&#039;' title='x\leq u&lt;x&#039;' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cpi%28u%29%5Cleq+y.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi(u)\leq y.' title='\pi(u)\leq y.' class='latex' /> This will be some random variable (depending on <img src='http://l.wordpress.com/latex.php?latex=%5Cpi&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi' title='\pi' class='latex' />), and as long as <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=y%27&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y&#039;' title='y&#039;' class='latex' /> are not too close to each other and <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=x%27&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x&#039;' title='x&#039;' class='latex' /> are also not too close to each other (a sufficient condition is that <img src='http://l.wordpress.com/latex.php?latex=%7Cx%27-x%7C%7Cy%27-y%7C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='|x&#039;-x||y&#039;-y|' title='|x&#039;-x||y&#039;-y|' class='latex' /> is significantly bigger than <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' />) there will be very little correlation between the value we get at <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' /> and the value we get at <img src='http://l.wordpress.com/latex.php?latex=y%27.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y&#039;.' title='y&#039;.' class='latex' /> It would seem to follow that <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa' title='\kappa' class='latex' /> of this matrix will be very small, which does I think disprove the lemma that I hoped might show that you can&#8217;t get the Walsh matrix in linearly many operations. I&#8217;d have to check the details, but I definitely don&#8217;t believe in that approach any more. In fact, I think I&#8217;d now want to suggest that this new model of random low-complexity matrices could be polynomially indistinguishable from purely random matrices and go back to thinking that there cannot be a &#8220;natural&#8221; solution to the Gaussian-elimination problem.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Of course, this suggestion is backed up by some serious hard thought, heuristic arguments, etc.?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  No, I&#8217;m just making it off the top of my head, and am ready to abandon it if you can think of a clever way of working out what <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_2' title='\pi_2' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_3' title='\pi_3' class='latex' /> are if you are given the matrix <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_3%2A%5Cpi_2%2A%5Cmathrm%7Bid%7D%2AI_n.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_3*\pi_2*\mathrm{id}*I_n.' title='\pi_3*\pi_2*\mathrm{id}*I_n.' class='latex' /> (Right at this moment I don&#8217;t even have a way of working out <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_2' title='\pi_2' class='latex' /> from <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_2%2A%5Cmathrm%7Bid%7D%2AI_n%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_2*\mathrm{id}*I_n,' title='\pi_2*\mathrm{id}*I_n,' class='latex' /> but I think there might be enough residual non-randomness to do that. But <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_3' title='\pi_3' class='latex' /> should clear that away I think.)</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Can I just share with you a thought that&#8217;s so naive-seeming that it really does seem as though it couldn&#8217;t possibly work?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Wow, that&#8217;s quite a plug. Let&#8217;s hear it!</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  The thought I had was that the number of row operations you need to get from the identity to the Walsh matrix is the same as the number of row operations you need to get from the Walsh matrix to the identity.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Trivially. What good does that do?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Well, as you pointed out, it is initially tempting to base arguments on the false assumption that the intermediate matrices have to be quite sparse. Unfortunately, <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  &#8217;s new model shows that no such argument can work. But what if we argued instead that when you work backwards from the Walsh matrix, it is very hard to get rid of all the 1s that you need to get rid of in order to end up with just <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> of them down the diagonal? If you do some random scramblings, you certainly won&#8217;t make them any sparser.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> There&#8217;s a pretty quick answer to that. If you&#8217;re trying to get from <img src='http://l.wordpress.com/latex.php?latex=W_k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='W_k' title='W_k' class='latex' /> to the identity, why should you want to get rid of 1s as quickly as you can? Perhaps that greedy algorithm, so to speak, is far from optimal. Perhaps the best way to get the identity is to do some very clever sequence of operations that gives you a sequence of matrices that look pretty random but are secretly getting simpler (according to some complexity measure that we cannot calculate in polynomial time), and suddenly, ta da, we end up with a matrix that is 1 on and below the diagonal and 0 above it, which we can easily convert into the identity in a further <img src='http://l.wordpress.com/latex.php?latex=n-1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n-1' title='n-1' class='latex' /> steps.</p>
<p>Basically, the property you were considering was &#8220;multiply by the Walsh matrix and see how many zeros you have,&#8221; which is far too natural a property to be likely to work.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  OK, I thought it was too much to hope for.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  By the way, I think it&#8217;s easy to work out <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_2' title='\pi_2' class='latex' /> from <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_2%2A%5Cmathrm%7Bid%7D%2AI_n.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_2*\mathrm{id}*I_n.' title='\pi_2*\mathrm{id}*I_n.' class='latex' /> Indeed, let <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='T' title='T' class='latex' /> be the lower-triangular matrix that&#8217;s 1 everywhere on and below the diagonal. Then each row of the matrix <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_2%2A%5Cmathrm%7Bid%7D%2AI_n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_2*\mathrm{id}*I_n' title='\pi_2*\mathrm{id}*I_n' class='latex' /> is a sum of rows of <img src='http://l.wordpress.com/latex.php?latex=T.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='T.' title='T.' class='latex' /> More precisely, the <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_2%28j%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_2(j)' title='\pi_2(j)' class='latex' />th row is a sum of the rows <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_2%281%29%2C%5Cdots%2C%5Cpi_2%28j-1%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_2(1),\dots,\pi_2(j-1)' title='\pi_2(1),\dots,\pi_2(j-1)' class='latex' /> of <img src='http://l.wordpress.com/latex.php?latex=T.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='T.' title='T.' class='latex' /> But that means that the difference between rows <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_2%28j-1%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_2(j-1)' title='\pi_2(j-1)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_2%28j%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_2(j)' title='\pi_2(j)' class='latex' /> of <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_2%2A%5Cmathrm%7Bid%7D%2AI_n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_2*\mathrm{id}*I_n' title='\pi_2*\mathrm{id}*I_n' class='latex' /> is the <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_2%28j%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_2(j)' title='\pi_2(j)' class='latex' />th row of <img src='http://l.wordpress.com/latex.php?latex=T.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='T.' title='T.' class='latex' /> I&#8217;m fairly sure all you have to do now is search for pairs of rows that have differences that look like this: <img src='http://l.wordpress.com/latex.php?latex=%281%2C1%2C%5Cdots%2C1%2C0%2C0%2C%5Cdots%2C0%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(1,1,\dots,1,0,0,\dots,0).' title='(1,1,\dots,1,0,0,\dots,0).' class='latex' /> That will give you a directed graph, inside which you need a directed path of length <img src='http://l.wordpress.com/latex.php?latex=n-1.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n-1.' title='n-1.' class='latex' /> I think the graph will already be such a path, but haven&#8217;t checked this. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  What happens if you no longer assume that <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_1' title='\pi_1' class='latex' /> is the identity permutation?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Ah, perhaps for this algorithmic problem you do lose generality when you take <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_2%3D%5Cmathrm%7Bid%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_2=\mathrm{id}.' title='\pi_2=\mathrm{id}.' class='latex' /> I see your point: it&#8217;s not that obvious what row-differences to look out for. But there are a few extremes. For example, you know that there will be a row difference that&#8217;s all 1s and another that has just one 1 in it. But I&#8217;m not sure that&#8217;s enough information to allow for an inductive proof or something like that. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Another question. Suppose it really is the case that no polynomial-time algorithm can distinguish between a random matrix of the form <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_3%2A%5Cpi_2%2A%5Cpi_1%2AI_n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_3*\pi_2*\pi_1*I_n' title='\pi_3*\pi_2*\pi_1*I_n' class='latex' /> and a purely random matrix. (I&#8217;ve gone for three steps to be on the safe side.) What does that tell us about the Gaussian-elimination problem? Is there any conceivable argument that could solve it?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Well, we&#8217;d be in a similar situation to the one we are in for circuit complexity. We&#8217;d be wanting to find a property that is not given by a polynomial-time algorithm (and if we were being pessimistic we might even guess that the best algorithm took time <img src='http://l.wordpress.com/latex.php?latex=%5Cexp%28n%5Ec%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\exp(n^c)' title='\exp(n^c)' class='latex' />), but that is somehow not &#8220;trivial&#8221;, so that it really does split the original hard problem into two easier parts.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Can you say more precisely what you mean by a &#8220;trivial&#8221; property?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  That is something I would very much like to be able to do. As things stand, I just feel as though I know one when I see it. For example, here the obvious &#8220;trivial&#8221; property is &#8220;Can be obtained using at most <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> row operations&#8221;. This distinguishes beautifully between matrices of the form <img src='http://l.wordpress.com/latex.php?latex=%5Cpi_3%2A%5Cpi_2%2A%5Cpi_1%2AI_n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi_3*\pi_2*\pi_1*I_n' title='\pi_3*\pi_2*\pi_1*I_n' class='latex' /> and purely random matrices, but to prove that any given matrix does not have the property is trivially equivalent to the original problem.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  So perhaps we should focus on what would in principle count as a non-trivial property.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I have a necessary condition, but it&#8217;s very very far from sufficient.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  What&#8217;s that?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Simply that the property should be something that is shared not just by low-complexity functions but also by other functions as well.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Great!</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  OK, let me try to say something a tiny bit more interesting. I&#8217;d like to focus on properties in NP, since the contrast between &#8220;trivial&#8221; and &#8220;potentially useful&#8221; properties shows up already here.</p>
<p>Suppose, then, that we have a property in NP that we want to use to distinguish between low-complexity Boolean functions and random Boolean functions. We can obtain this as follows. Let <img src='http://l.wordpress.com/latex.php?latex=Q&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Q' title='Q' class='latex' /> be a polynomially computable set of Boolean functions defined on <img src='http://l.wordpress.com/latex.php?latex=F%5Ctimes+G%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='F\times G,' title='F\times G,' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=F&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='F' title='F' class='latex' /> is the set of all Boolean functions defined on <img src='http://l.wordpress.com/latex.php?latex=%5C%7B0%2C1%5C%7D%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{0,1\}^n' title='\{0,1\}^n' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' /> is the set of all Boolean functions defined on <img src='http://l.wordpress.com/latex.php?latex=%5C%7B0%2C1%5C%7D%5Em.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{0,1\}^m.' title='\{0,1\}^m.' class='latex' /> Then let <img src='http://l.wordpress.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P' title='P' class='latex' /> be the set of all Boolean functions <img src='http://l.wordpress.com/latex.php?latex=f%5Cin+F&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f\in F' title='f\in F' class='latex' /> such that there exists <img src='http://l.wordpress.com/latex.php?latex=g%5Cin+G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g\in G' title='g\in G' class='latex' /> with <img src='http://l.wordpress.com/latex.php?latex=%28f%2Cg%29%5Cin+Q.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(f,g)\in Q.' title='(f,g)\in Q.' class='latex' /> I want to distinguish between properties <img src='http://l.wordpress.com/latex.php?latex=Q&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Q' title='Q' class='latex' /> that say that <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> in some way includes a computation of <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> and more &#8220;natural&#8221; properties <img src='http://l.wordpress.com/latex.php?latex=Q.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Q.' title='Q.' class='latex' /> </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> How do you propose to do that?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I don&#8217;t know, but I have a small idea about a possible way of getting started. It would be to use some logic. Roughly speaking, I&#8217;d want to introduce a language that allowed me various notions such as correlation between Boolean functions as primitives, and then I&#8217;d want to say that any formula that could be built in that language would give a property that could not give rise to an interesting lower bound.</p>
<p>For example, the property &#8220;correlates better with a quadratic than a typical random function does&#8221; would be something like &#8220;there exists <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> is quadratic and <img src='http://l.wordpress.com/latex.php?latex=%5Clangle+f%2Cg%5Crangle+%5Cgeq+t&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\langle f,g\rangle \geq t' title='\langle f,g\rangle \geq t' class='latex' />&#8220;.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> But how would you express &#8220;<img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> is quadratic&#8221; in your language?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I&#8217;m not sure. Possibly I would allow all polynomially computable properties of <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> and just place restrictions on how <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> can relate to <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' />. But perhaps I&#8217;d insist on saying something like </p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle+g%28x%29g%28x%2Ba%29g%28x%2Bb%29g%28x%2Bc%29g%28x%2Ba%2Bb%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle g(x)g(x+a)g(x+b)g(x+c)g(x+a+b)' title='\displaystyle g(x)g(x+a)g(x+b)g(x+c)g(x+a+b)' class='latex' /><br />
<img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle+%5Cqquad+g%28x%2Ba%2Bc%29g%28x%2Bb%2Bc%29g%28x%2Ba%2Bb%2Bc%29%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle \qquad g(x+a+c)g(x+b+c)g(x+a+b+c)=1' title='\displaystyle \qquad g(x+a+c)g(x+b+c)g(x+a+b+c)=1' class='latex' /></p>
<p>for every <img src='http://l.wordpress.com/latex.php?latex=x%2Ca%2Cb%2Cc%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x,a,b,c,' title='x,a,b,c,' class='latex' /> which is a nice first-order formula concerning the values of <img src='http://l.wordpress.com/latex.php?latex=g.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g.' title='g.' class='latex' /></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  But ultimately your property is second order because you say &#8220;there exists <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' />&#8221; and <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> is a function.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Yes. But I just do that once to make a property in NP. Perhaps then one could go further than NP and quantify more than once, obtaining a whole class of properties that cannot work. So in one sense they would be more general than NP, but in another sense they would form a smaller class, because we would be replacing &#8220;polynomially computable&#8221; by something much more restricted &#8212; but perhaps still general enough to be interesting and rule out all the attempts I have made so far. I would find this interesting even if I couldn&#8217;t actually prove it rigorously, as long as I could come up with a formulation that was not obviously false. I&#8217;d be even happier if I could show that the property <img src='http://l.wordpress.com/latex.php?latex=Q&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Q' title='Q' class='latex' /> in some sense <em>had</em> to involve <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> encoding a computation of <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' />, but I don&#8217;t know how to make that idea precise.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  But if you could do that, wouldn&#8217;t you have shown that there was no proof that P<img src='http://l.wordpress.com/latex.php?latex=%5Cne&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ne' title='\ne' class='latex' />NP?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Probably not. I agree that it sounds a bit like that &#8212; as though the only property that can distinguish between low-complexity functions and random functions is a property that essentially says &#8220;<img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> has low complexity&#8221; &#8212; but that still leaves open the possibility of a property that fails the largeness condition. In other words, there might still be a property that <em>holds</em> for random functions but is carefully designed to fail for some specific function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> in NP.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  So where do all these conversations leave us?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Sadder but wiser, just as <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> predicted.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/1195/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/1195/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/gowers.wordpress.com/1195/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/gowers.wordpress.com/1195/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/gowers.wordpress.com/1195/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/gowers.wordpress.com/1195/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/gowers.wordpress.com/1195/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/gowers.wordpress.com/1195/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/gowers.wordpress.com/1195/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/gowers.wordpress.com/1195/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&blog=1659011&post=1195&subd=gowers&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2009/11/10/a-conversation-about-complexity-lower-bounds-x-2/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>Polymath and the origin of life</title>
		<link>http://gowers.wordpress.com/2009/11/07/polymath-and-the-origin-of-life/</link>
		<comments>http://gowers.wordpress.com/2009/11/07/polymath-and-the-origin-of-life/#comments</comments>
		<pubDate>Fri, 06 Nov 2009 23:01:28 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[polymath]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=1158</guid>
		<description><![CDATA[This is the first of a few posts I plan (one other of which is written and another of which is in draft form but in need of a few changes) in which I discuss various Polymath proposals in more detail than I did in my earlier post on possible projects. 
One of my suggestions, [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&blog=1659011&post=1158&subd=gowers&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>This is the first of a few posts I plan (one other of which is written and another of which is in draft form but in need of a few changes) in which I discuss various Polymath proposals in more detail than I did in <a href="http://gowers.wordpress.com/2009/09/16/possible-future-polymath-projects/">my earlier post on possible projects</a>. </p>
<p>One of my suggestions, albeit a rather tentative one, was to try to come up with a model that would show convincingly how life could emerge from non-life by purely naturalistic processes. But before this could become a sensible project it would be essential to have a more clearly defined mathematical question. By that I don&#8217;t mean a conjecture that Polymath would be trying to prove rigorously, but rather a list of properties that a model would have to have for it to count as successful. Such a list need not be fully precise, but in my view it should be reasonably precise, so that the task is reasonably well defined. It would of course be possible to change the desiderata as one went along.</p>
<p>In this post I&#8217;d like to make a preliminary list. It will undoubtedly be unsatisfactory in many ways, but I hope that there will be a subsequent discussion and that from it a better list will emerge. The purpose of this is not to start a Polymath project, but simply to attempt to define a Polymath proposal that might at some future date be an actual project. For two reasons I wouldn&#8217;t want this to be a serious project just yet: it seems a good idea to think quite hard about how it would actually work in practice, and someone who I hope will be a key participant is very busy for the next few months and less busy thereafter. <span id="more-1158"></span> </p>
<p>As a starting point, let me mention two ideas that are already out there and have attracted a lot of attention. One is the idea of <a href="http://en.wikipedia.org/wiki/Cellular_automaton">cellular automata</a>. A fairly general type of cellular automaton can be defined as follows. You have a graph (usually something like an infinite two-dimensional lattice), and at some points you have 1s and at other points you have 0s. You then let the system evolve in rounds according to some simple rule that is usually the same for every vertex. It might be something like this: if at least two of my neighbours are 1s then I will become a 1, and otherwise I will become a 0. It turns out that very simple rules can lead to extremely complicated and interesting behaviour.</p>
<p>What counts as complicated and interesting? Well, perhaps it is better to say what counts as dull. One possible form of dullness is if a system evolves to some state such as the all-1s state, or perhaps a big rectangle full of 1s with 0s outside, or an oscillation between two configurations. Another form of dullness is a system that tends to disperse the 1s until they form some fairly random looking bunch of 1s that never stops looking fairly random. But in between, there are systems that tend to evolve towards some kind of criticality, where you get fractal structures with organization at many different distance scales. One thing that interests people about cellular automata is that there are very simple rules that seem to want to evolve towards these nice &#8220;edge of chaos&#8221; patterns.</p>
<p>The second idea is <a href="http://en.wikipedia.org/wiki/Self-organized_criticality">self-organized criticality</a>, which is a phenomenon exhibited by certain models in statistical physics &#8212; notably the so-called sandpile models. These are supposed to model what happens if you drop grains of sand one by one on to a pile. They will start to build up into a conical shape, but if the sides get too steep there are avalanches. The sizes of these avalanches vary, and if you plot the frequency of avalanches of various sizes, you find (experimentally at least) that they obey a power law. And power laws get people excited because they are what you find associated with critical phenomena. A typical sandpile model is something like this. You have a big square divided into a grid of small squares. You then set all squares equal to 0 except a few randomly chosen ones that you give small integers to. You then add 1 to the central square (let&#8217;s assume there is one), and after you have done so you have a rule that says that if any square has value at least 4 it must give 1 to each of its neighbours. This procedure you iterate until no square has value at least 4. (It can be shown that the order in which you do these operations doesn&#8217;t matter.) You then add 1 to the central square again, and keep going.</p>
<p>It turns out that the sizes of the &#8220;avalanches&#8221; that take place here (that is, how many iterations you have to do of the simple rule before all squares have value 0 to 3) also obey a power law, and also that systems such as these have a tendency to evolve towards interesting (that is, not too random and not too structured) configurations. That is, you can get critical phenomena without having to fine-tune some parameter. Again, this has got people excited as it seems to promise an explanation of how the complexity in nature could have started.</p>
<p>In the above description, I made the starting configuration random but after that the way the model evolved was deterministic. There are of course many different possible models, and in some of them the new &#8220;grains of sand&#8221; are dropped in random places. Again you get interesting critical behaviour. </p>
<p>Now as far as I know, with both cellular automata and sandpile models you get nice critical phenomena appearing, but while they give you pretty patterns they do not give you anything resembling an ecosystem. Yes, <a href="http://en.wikipedia.org/wiki/Conway's_Game_of_Life">Conway&#8217;s game of life</a> gives you glider guns and configurations that can reproduce themselves, but you have to set them up carefully in advance, and they don&#8217;t seem to do anything all that exciting. They also support universal computation, but again if you want to program the game of life to create an artificial-life simulator, you might as well use a much more powerful computer to do so. What a Polymath project would be looking for is a very simple system with the property that, regardless of the starting configuration, it would tend to develop and eventually produce something that looked like a complex ecosystem.</p>
<p>This brings me to a point that is worth making. The idea of this Polymath project would not be to produce yet another <a href="http://en.wikipedia.org/wiki/Artificial_life">artificial life</a> program, fascinating though those programs can be. One could think of it more like this: can one come up with a very simple model that almost always &#8220;self-organizes&#8221; and produces something that looks a bit like what you get with artificial life programs? In other words, we would be trying to model <a href="http://en.wikipedia.org/wiki/Abiogenesis">abiogenesis</a> rather than <a href="http://en.wikipedia.org/wiki/Evolution">evolution</a>. </p>
<p>After that discussion, I think I can have a stab at saying what the properties are that would make a truly interesting and new model. (I am much less sure about the &#8220;new&#8221; part, and would be interested to hear from people with more knowledge about this kind of topic what the state of the art is.) Some of the properties below seem to be more important than others, but for now I won&#8217;t bother to distinguish between those that I regard as essential and those that are merely desirable.</p>
<p>1. It should be a dynamic model that evolves according to simple rules.</p>
<p>2. It should have a tendency to evolve towards patterns with a &#8220;critical&#8221; character &#8212; not too random and not too simple, with interesting features at many distance scales.</p>
<p>3. Probably it should be a somewhat randomized model (to give it a certain robustness). Here I am referring to the rules by which the model develops rather than the initial conditions, but perhaps the initial conditions should be randomized as well.</p>
<p>4. It should have a tendency to produce identifiable macroscopic structures.</p>
<p>5. It should be possible to classify these macroscopic structures in interesting ways. (That is, we would like to be able to say that certain structures look more or less the same as certain others, and ideally this similarity would be a bit more flexible than one just being a translation of another.)</p>
<p>6. These structures should interact with one another, and the interaction should sometimes be destructive (thereby providing some selection pressure).</p>
<p>7. With high probability, self-reproducing structures should eventually emerge. (Before posting this I showed it to Michael Nielsen, who made some interesting points. One of them is that experience in the actual universe suggests that perhaps there should be some fine tuning of parameters before the probability becomes high: after all, life does not evolve on all planets.)</p>
<p>I could go on, but the idea is that once you&#8217;ve got 6 and 7, and perhaps a few other properties (for instance, one might decide to have major environmental changes from time to time just to stimulate the development of the system), then natural selection can begin to operate. </p>
<p>Of course, the major challenge is 7. The most plausible route I can see to 7 is a purely probabilistic one: almost all configurations are not self-reproducing, but if a self-reproducing one ever does arise, then it will reproduce itself and start appearing all over the place. But in that case 5 is also a huge challenge. The kinds of structures one would ideally like are not things like the bullets from Conway&#8217;s glider guns, but larger configurations that can move about and that are defined more topologically. Indeed, that could be a huge and general problem: the geometry of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BZ%7D%5E3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{Z}^3' title='\mathbb{Z}^3' class='latex' /> just isn&#8217;t the same as the geometry of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D%5E3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{R}^3' title='\mathbb{R}^3' class='latex' />, but a continuous model would be very difficult to design and simulate (or would it?). But perhaps there could be some cleverly chosen simple rule that would tend to protect &#8220;clumps&#8221; of 1s and allow them to move, and to do complicated things like rotating (whatever that can be made to mean in <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BZ%7D%5E3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{Z}^3' title='\mathbb{Z}^3' class='latex' />). Or perhaps a complicated ecosystem could develop that was more <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BZ%7D%5E3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{Z}^3' title='\mathbb{Z}^3' class='latex' />-like than <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D%5E3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{R}^3' title='\mathbb{R}^3' class='latex' />-like. </p>
<p>Here, incidentally, is a paragraph from the Wikipedia article on Conway&#8217;s game of life, which shows that it is not already an example of what I am talking about:</p>
<blockquote><p>From a random initial pattern of living cells on the grid, observers will find the population constantly changing as the generations tick by. The patterns that emerge from the simple rules may be considered a form of beauty. Small isolated subpatterns with no initial symmetry tend to become symmetrical. Once this happens the symmetry may increase in richness, but it cannot be lost unless a nearby subpattern comes close enough to disturb it. In a very few cases the society eventually dies out, with all living cells vanishing, though this may not happen for a great many generations. Most initial patterns eventually &#8220;burn out&#8221;, producing either stable figures or patterns that oscillate forever between two or more states; many also produce one or more gliders or spaceships that travel indefinitely away from the initial location.</p></blockquote>
<p>We would be looking for something a bit like the Game of Life, possibly randomized, with the important difference that it almost always got more and more complicated and more and more interesting.</p>
<p><strong> Physics </strong></p>
<p>There is one other property that I think would make a model more convincing as an argument for the probability of life arising out of non-life without any magic processes operating. I partly owe this thought to Michael Nielsen, who included the following two questions in a comment he made on the post where I originally mentioned this problem.</p>
<blockquote><p>
(1) How would you go about recognizing self-replicating beings?</p>
<p>(2) What sort of models are “reasonable”, in the sense of both reflecting what we know of physics, and being simple enough to be tractable? The Game of Life isn’t very physical, in that it disobeys many basic physical principles, like conservation of energy, conservation of mass, conservation of momentum, and so on.
</p></blockquote>
<p>One of the things that people often say about life, evolution, biological systems and the like is that they are ways of locally combatting the second law of thermodynamics. So perhaps one could add the following property as one that it would be very nice to have.</p>
<p>8. The general tendency for the model is to become more and more disordered, and eventually to end in heat death, but for there to be many local <em>increases</em> in order. </p>
<p>Of course, one would need to be clear what that meant. The other physical principles that Michael mentioned would also be good to have.</p>
<p>Here is a subproblem that occurs to me as I am writing this. It is connected with the thought that one would like macroscopic structures to have some tendency to survive. In the Game of Life, it seems that structures that survive do so almost by accident &#8212; they settle down into some sort of periodicity, say. But structures in the biological world are held together by physical forces, and they have identifiable boundaries and things like that. So one might try to develop a model that captures just this behaviour. As with the main problem, I&#8217;m not sure how to formulate this subproblem precisely, but let me have a go. Does there exist a model with the following properties?</p>
<p>(i) If you draw some large-scale shape (think of the 0s and 1s as black and white pixels, say, so the shape is on a much larger distance scale than the distance between two neighbouring points of the grid), it has a tendency to move &#8220;continuously&#8221;.</p>
<p>(ii) There is a tendency for mass and momentum to be conserved.</p>
<p>To give an idea of the kind of thing I mean here, let&#8217;s suppose that &#8220;mass&#8221; is represented by 1s, and you take a large annulus, place it over <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BZ%7D%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{Z}^2' title='\mathbb{Z}^2' class='latex' />, and put a 1 at every grid point that lies in the annulus. Then in the interior circle of the annulus put a random scattering of not too many 1s. And then slightly move the annulus part, and slightly move all the little particles inside. If the first position of the annulus represents where some very simple structure is at time 1 and the second where it is at time 2, then conservation of mass and momentum would tell us to expect it to continue moving in the same direction (so it would be more sophisticated than a cellular automaton of the kind described earlier because its behaviour would depend not just on how it behaved an instant earlier), and to stay the same size. We might also have &#8220;forces&#8221; between neighbouring 1s that encouraged them to stay together somewhat, and so on.</p>
<p>Of course, as with Conway&#8217;s Game of Life, the idea would be to devise the simplest possible set of rules that did what one wanted (in this case preserve macroscopic shapes at least to some extent and allow them to move about reasonably flexibly but without distorting themselves too much). It would not be to try to create the most realistic model one could of the actual world. </p>
<p>Since writing the above paragraphs I&#8217;ve found out the following relevant facts. First this from the Wikipedia article on <a href="http://en.wikipedia.org/wiki/Life-like_cellular_automaton">Life-like cellular automata</a>:</p>
<blockquote><p>
Larger than Life is a family of cellular automata studied by Kellie Michele Evans. They have very large radius neighbourhoods, but perform `birth/death&#8217; thresholding similar to Conway&#8217;s life. The LtL CA manifest eerily organic `glider&#8217; and `blinker&#8217; structures.</p>
<p>RealLife is the “continuum limit″ of Evan&#8217;s Larger Than Life CA, in the limit as the neighbourhood radius goes to infinity, while the lattice spacing goes to zero. Technically, they are not cellular automata at all, because the underlying “space” is the continuous Euclidean plane R2, not the discrete lattice Z2. They have been studied by Marcus Pivato.
</p></blockquote>
<p>Secondly, here is the <a href="http://arxiv.org/abs/math.DS/0503504">paper by Marcus Pivato</a> mentioned above.</p>
<p><strong>Chemistry and the problem of scale.</strong></p>
<p>By far the most famous contribution to our understanding of how life started is the <a href="http://en.wikipedia.org/wiki/Miller%E2%80%93Urey_experiment">Miller-Urey experiment</a>, in which Miller and Urey attempted to simulate the chemical conditions that might have prevailed early on in the life of the Earth. They used electrodes to create lightning-like sparks that passed through a vapour that was formed of water, methane, ammonia and hydrogen, and found that they produced complex amino acids, which are essential building blocks of life.</p>
<p>What relevance would this experiment have for a computer simulation? My view is that one should not necessarily try to produce a virtual Miller-Urey experiment (complete with virtual lightning, virtual ammonia, etc.) but that the experiment does raise a couple of questions that it is essential to address.</p>
<p>A fundamental fact about life as it exists in the physical world is that it is carbon based. The great virtue of carbon is that its particular bonding properties allow it to combine with other atoms to form molecules that are large and complicated enough to encode highly sophisticated information. So an obvious question is this.</p>
<p>Question 1: Should one design some kind of rudimentary virtual chemistry that would make complicated &#8220;molecules&#8221; possible in principle?</p>
<p>The alternative is to have some very simple physical rule and hope that the chemistry emerges from it (which would be more like the Game of Life approach). </p>
<p>This is just one example of a general tension. The more features you design into a model, the less &#8220;universal&#8221; it becomes and the less convincing it is as a demonstration of the inevitability of life. However, one can also argue for at least some designed features. After all, if we want to explain the origin of life, it is not necessary to start with a virtual Big Bang and get from there to the possibility of complex molecules. It may be that designing rules to make complex molecules possible (and then arguing that with probability 1 this possibility is actually realized) is attacking the problem at the correct level. </p>
<p>I do not have a strong view about what the right answer to this question is. Obviously I would prefer the chemistry to emerge as if by magic, but that may be an unrealistic hope.</p>
<p>The second question does not arise directly out of the Miller-Urey experiment, but it is related.</p>
<p>Question 2: How large and how complicated should we expect &#8220;organisms&#8221; to be?</p>
<p>A real-world organism, even a micro-organism, is made out of more atoms than one could hope to simulate on a computer. (I am not certain that that last sentence is correct, but I would be very surprised if it wasn&#8217;t. Added later: Michael Nielsen tells me that there are rudimentary organisms that are so small that they could perhaps be simulated in full.) Moreover, although it has many levels of complexity, there will also be distance scales at which it is relatively simple. For example, if I look at my hand from a distance of about a yard, my skin looks smooth. Similarly, if I were to look through a powerful microscope at one of the cells of my hand, then the boundary of that cell would be reasonably smooth, rather than fractal-like. In general, it seems that if you look at a typical organism, it is not equally complicated at all distance scales, but is more like  this: you take some small objects and put them together in a reasonably simple way to form bigger objects; you then use these bigger objects as building blocks for yet bigger objects; continuing this process for eight or nine (??) levels (perhaps if I knew more biology I would revise this number up considerably) you end up with a complex organism.</p>
<p>If that picture is roughly correct, then the number of &#8220;atoms&#8221; in a complex multicellular organism might be prohibitively large for a simulation. Is this a problem?</p>
<p>I think it shouldn&#8217;t be too problematic. Just as we are not trying to start with the Big Bang, neither are we trying to end with mammals. The main aim is to get to the point where evolution can take over. In particular, if a readily identifiable micro-organism appeared that could reproduce itself with small modifications, then the simulation would surely be declared a success.</p>
<p>Nevertheless, the question of scale remains. Would we want such a micro-organism to consist of a small handful of &#8220;pixels&#8221; that by some magic local rule gives rise to a copy of itself? Or would we want something much larger that had &#8220;smooth boundaries&#8221; at some distance scales and was composed of &#8220;complex molecules&#8221;? My inclination at the moment is to prefer the second for two reasons: it is less like the Game of Life (and therefore more likely to be novel and interesting) and it is closer to the life forms that we actually observe.</p>
<p>Added later: I haven&#8217;t quite made clear that one aim of such a project would be to come up with <em>theoretical</em> arguments. That is, it would be very nice if one could do more than have a discussion, based on intelligent guesswork, about how to design a simulation, followed (if we were lucky and found collaborators who were good at programming) by attempts to implement the designs, followed by refinements of the designs, etc. Even that could be pretty good, but some kind of theoretical (but probably not rigorous) argument  that gave one good reason to expect certain models to work well would be better still. Getting the right balance between theory and experiment could be challenging. The reason I am in favour of theory is that I feel that that is where mathematicians have more chance of making a genuinely new contribution to knowledge.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/1158/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/1158/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/gowers.wordpress.com/1158/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/gowers.wordpress.com/1158/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/gowers.wordpress.com/1158/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/gowers.wordpress.com/1158/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/gowers.wordpress.com/1158/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/gowers.wordpress.com/1158/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/gowers.wordpress.com/1158/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/gowers.wordpress.com/1158/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&blog=1659011&post=1158&subd=gowers&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2009/11/07/polymath-and-the-origin-of-life/feed/</wfw:commentRss>
		<slash:comments>42</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>A conversation about complexity lower bounds, IX</title>
		<link>http://gowers.wordpress.com/2009/11/03/a-conversation-about-complexity-lower-bounds-ix/</link>
		<comments>http://gowers.wordpress.com/2009/11/03/a-conversation-about-complexity-lower-bounds-ix/#comments</comments>
		<pubDate>Mon, 02 Nov 2009 23:04:47 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[complexity]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=990</guid>
		<description><![CDATA[This instalment has a brief discussion of another barrier to proving that PNP, known as algebrization. I don&#8217;t fully understand it, and therefore neither do my characters. (I&#8217;m hoping that maybe someone can help me with this.) But even a fuzzy understanding has some consequences, and the characters are led to formulate a simpler (and [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&blog=1659011&post=990&subd=gowers&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>This instalment has a brief discussion of another barrier to proving that P<img src='http://l.wordpress.com/latex.php?latex=%5Cne&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ne' title='\ne' class='latex' />NP, known as algebrization. I don&#8217;t fully understand it, and therefore neither do my characters. (I&#8217;m hoping that maybe someone can help me with this.) But even a fuzzy understanding has some consequences, and the characters are led to formulate a simpler (and almost certainly already considered by the experts) problem that has the merit that when trying to solve it one is not tempted by proof techniques that would run up against the algebrization barrier. However, for all the usual reasons, this &#8220;simpler&#8221; problem looks very hard as well.</p>
<p>**************************************</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I&#8217;m afraid I&#8217;m not yet ready to tell you what basic 3-bit operations do to quadratic phase functions.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> In that case, can I instead mention something I read that looks relevant?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Sure, go ahead.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Well, you may remember my mentioning that Scott Aaronson and Avi Wigderson have a paper in which they introduce <em>another</em> barrier to lower bound proofs, which they call &#8220;algebrization&#8221;. If a proof can be algebrized, then it can&#8217;t prove that P<img src='http://l.wordpress.com/latex.php?latex=%5Cne&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ne' title='\ne' class='latex' />NP.<span id="more-990"></span></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  What does &#8220;algebrized&#8221; mean?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Unfortunately, my understanding of it is rather hazy. But it refers to a technique where in order to prove complexity results, you try to approximate low-complexity functions by low-degree polynomials over <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BF%7D_2.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{F}_2.' title='\mathbb{F}_2.' class='latex' /> It turns out to be helpful to look at low-degree field extensions of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BF%7D_2%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{F}_2,' title='\mathbb{F}_2,' class='latex' /> so one can talk of results holding in various extensions.</p>
<p>A proof that P<img src='http://l.wordpress.com/latex.php?latex=%5Cne&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ne' title='\ne' class='latex' />NP would algebrize if you can show that there is a function in <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BP%7D%5E%7B%5Ctilde%7BA%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathrm{P}^{\tilde{A}}' title='\mathrm{P}^{\tilde{A}}' class='latex' /> that is not in <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BP%7D%5EA&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathrm{P}^A' title='\mathrm{P}^A' class='latex' />. Here, <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> is any oracle, and <img src='http://l.wordpress.com/latex.php?latex=%5Ctilde%7BA%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tilde{A}' title='\tilde{A}' class='latex' /> is a low-degree extension of <img src='http://l.wordpress.com/latex.php?latex=A.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A.' title='A.' class='latex' /></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  What&#8217;s an oracle?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I&#8217;m a bit hazy on oracles too. Basically, it means that you can take some fairly complicated function and assume that your computer takes only one step to calculate it. (The word &#8220;oracle&#8221; is used for obvious reasons: it&#8217;s as though when the computer wants to know the value of this very complicated function, it just goes and asks the oracle.) One then writes <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BP%7D%5EA&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathrm{P}^A' title='\mathrm{P}^A' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BNP%7D%5EA&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathrm{NP}^A' title='\mathrm{NP}^A' class='latex' /> for the sets of functions you can compute/verify in polynomial time if you have access to the oracle <img src='http://l.wordpress.com/latex.php?latex=A.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A.' title='A.' class='latex' /> It is already known that there are oracles <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BP%7D%5EA%3D%5Cmathrm%7BNP%7D%5EA&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathrm{P}^A=\mathrm{NP}^A' title='\mathrm{P}^A=\mathrm{NP}^A' class='latex' /> and other oracles <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='B' title='B' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BP%7D%5EB%5Cne%5Cmathrm%7BNP%7D%5EB.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathrm{P}^B\ne\mathrm{NP}^B.' title='\mathrm{P}^B\ne\mathrm{NP}^B.' class='latex' /> It follows that any proof that P<img src='http://l.wordpress.com/latex.php?latex=%5Cne&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ne' title='\ne' class='latex' />NP must not <em>relativize</em>: that is, must not remain valid even if you throw in an oracle. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Why would a proof remain valid for an arbitrary oracle?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> The kinds of proofs we&#8217;ve been thinking about wouldn&#8217;t, but there is a very different class of techniques that would. The observation about oracles shows that you can&#8217;t hope to prove that P<img src='http://l.wordpress.com/latex.php?latex=%5Cne&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ne' title='\ne' class='latex' />NP by means of some clever counting or diagonalization argument.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  What do you mean by &#8220;diagonalization argument&#8221; in this context?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Well, one might try to run through all functions in P and construct a clever function in NP making sure that it disagrees with each function in P in at least one place. That&#8217;s the kind of thing you do to prove the insolubility of the halting problem. But for proving that P<img src='http://l.wordpress.com/latex.php?latex=%5Cne&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ne' title='\ne' class='latex' />NP it is known not to work.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Why is that called &#8220;algebrization&#8221;?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> It isn&#8217;t. That&#8217;s relativization. A proof that P<img src='http://l.wordpress.com/latex.php?latex=%5Cne&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ne' title='\ne' class='latex' />NP must not relativize, meaning that it must not be valid &#8220;relative to an arbitrary oracle&#8221;. By the way, you might like to check out a <a href="http://terrytao.wordpress.com/2009/08/01/pnp-relativisation-and-multiple-choice-exams/">post by Terence Tao</a> about the relativization barrier.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  I definitely would!</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Anyhow, algebrization is a further twist, I think, where you have <em>two</em> oracles, and one is allowed to be a low-degree field extension of the other. Roughly, they show that you cannot prove that P<img src='http://l.wordpress.com/latex.php?latex=%5Cne&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ne' title='\ne' class='latex' />NP if your proof would also show that <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BP%7D%5E%7B%5Ctilde%7BA%7D%7D%5Cnot%5Csubset%5Cmathrm%7BNP%7D%5EA&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathrm{P}^{\tilde{A}}\not\subset\mathrm{NP}^A' title='\mathrm{P}^{\tilde{A}}\not\subset\mathrm{NP}^A' class='latex' /> for every oracle <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> and every low-degree field extension <img src='http://l.wordpress.com/latex.php?latex=%5Ctilde%7BA%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tilde{A}' title='\tilde{A}' class='latex' /> of <img src='http://l.wordpress.com/latex.php?latex=A.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A.' title='A.' class='latex' /></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  But surely that&#8217;s already included in the relativization result? Just take <img src='http://l.wordpress.com/latex.php?latex=A%3D%5Ctilde%7BA%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A=\tilde{A}.' title='A=\tilde{A}.' class='latex' /></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Hmm, that is quite confusing I agree. It may explain why I find this concept quite hard to grasp. </p>
<p>Aha, in the paper, which, incidentally, <a href="http://www.scottaaronson.com/papers/alg.pdf">may be found here</a> (or just Google &#8220;algebrization&#8221;), they say that they are referring to <em>algebraic oracles</em>, which they define as oracles that can work out not just the value of a function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> but also its value in any field extension. Since that doesn&#8217;t seem to make much sense for an arbitrary Boolean function, I think it must mean that the oracles are not arbitrary&#8212;since then you would surely be right&#8212;but only allow you to work out the kinds of functions, such as polynomials, that can be interpreted in a larger field. So their result is stronger in one sense&#8212;a proof is ruled out if it yields similar results for a rather small class of oracles&#8212;but weaker in another&#8212;those &#8220;similar results&#8221; have to show not just that <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BP%7D%5EA%5Cnot%5Csubset%5Cmathrm%7BNP%7D%5EA&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathrm{P}^A\not\subset\mathrm{NP}^A' title='\mathrm{P}^A\not\subset\mathrm{NP}^A' class='latex' /> for every algebraic oracle, but the stronger statement that <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BP%7D%5E%7B%5Ctilde%7BA%7D%7D%5Cnot%5Csubset%5Cmathrm%7BNP%7D%5EA&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathrm{P}^{\tilde{A}}\not\subset\mathrm{NP}^A' title='\mathrm{P}^{\tilde{A}}\not\subset\mathrm{NP}^A' class='latex' /> for every algebraic oracle <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> and every low-degree field extension <img src='http://l.wordpress.com/latex.php?latex=%5Ctilde%7BA%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tilde{A}' title='\tilde{A}' class='latex' /> of <img src='http://l.wordpress.com/latex.php?latex=A.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A.' title='A.' class='latex' /></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  But surely <em>every</em> Boolean function can be written as a polynomial.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Well, yes, but the polynomial will depend on <img src='http://l.wordpress.com/latex.php?latex=n%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n,' title='n,' class='latex' /> which doesn&#8217;t really count. Or does it? I&#8217;m afraid I&#8217;m not sure what the right response is to that question.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  What if you have two different polynomials that take equal values over one field but not over an extension?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I don&#8217;t think that&#8217;s a big problem. You&#8217;d just give some algebraic formula to the oracle and the oracle would be able to tell you the answer. It wouldn&#8217;t matter if some other algebraic formula happened to agree with it.</p>
<p>I&#8217;m sorry my understanding of this is less than perfect, but what I wanted to draw your attention to was the following few sentences from the Aaronson-Wigderson paper:</p>
<blockquote><p>Can we pinpoint what it is about arithmetization that makes it incapable of solving these problems? In our view, arithmetization simply fails to “open the black box wide enough.” In a typical arithmetization proof, one starts with a polynomial-size Boolean formula <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\phi' title='\phi' class='latex' />, and uses <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\phi' title='\phi' class='latex' /> to produce a low-degree polynomial <img src='http://l.wordpress.com/latex.php?latex=p.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p.' title='p.' class='latex' /> But having done so, one then treats <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> as an arbitrary black-box function, subject only to the constraint that <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7Bdeg%7D%28p%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathrm{deg}(p)' title='\mathrm{deg}(p)' class='latex' /> is small. Nowhere does one exploit the small size of <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\phi,' title='\phi,' class='latex' /> except insofar as it lets one evaluate <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> in the ﬁrst place. The message of this paper has been that, to make further progress, one will have to probe <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\phi' title='\phi' class='latex' /> in some “deeper” way. </p></blockquote>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Ah, I see why you&#8217;re saying that. Even if we don&#8217;t fully understand the algebrization barrier, we can still be a little sceptical of any proof attempt that makes use of polynomials without also making use of how those polynomials are put together. I think we&#8217;ve slightly run up against this thought in some of our earlier discussions actually. It seems to suggest that it is unlikely that we would be able to prove that applying 3-bit operations to arbitrary linear combinations of degree-<img src='http://l.wordpress.com/latex.php?latex=d&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d' title='d' class='latex' /> polynomial phase functions gave you linear combinations with coefficient sums that were not much bigger. That seems to tie in with my experiences with quadratics.</p>
<p>However, having said that I&#8217;d like to mention that the degree is by no means the only interesting parameter one can associate with a polynomial over <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BF%7D_2.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{F}_2.' title='\mathbb{F}_2.' class='latex' /></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Why, what else is there?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  There&#8217;s also its rank.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  What does the rank of a polynomial mean? I thought ranks were things that linear maps had.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Well, let me first say what the rank of a quadratic is. Suppose you have a quadratic polynomial <img src='http://l.wordpress.com/latex.php?latex=q%28x%29%3D%5Csum_%7B%5C%7Bi%2Cj%5C%7D%5Cin%5Cmathcal%7BA%7D%7Dx_ix_j.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='q(x)=\sum_{\{i,j\}\in\mathcal{A}}x_ix_j.' title='q(x)=\sum_{\{i,j\}\in\mathcal{A}}x_ix_j.' class='latex' /> We can associate <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BA%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{A}' title='\mathcal{A}' class='latex' /> with the symmetric matrix <img src='http://l.wordpress.com/latex.php?latex=A_%7Bij%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A_{ij}' title='A_{ij}' class='latex' /> that takes the value <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' /> if <img src='http://l.wordpress.com/latex.php?latex=%5C%7Bi%2Cj%5C%7D%5Cin%5Cmathcal%7BA%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{i,j\}\in\mathcal{A}' title='\{i,j\}\in\mathcal{A}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='0' title='0' class='latex' /> otherwise. What&#8217;s more, this is quite a natural thing to do, because you get precisely the matrix of the bilinear form <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta%28x%2Cy%29%3Dq%28x%2By%29-q%28x%29-q%28y%29%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\beta(x,y)=q(x+y)-q(x)-q(y),' title='\beta(x,y)=q(x+y)-q(x)-q(y),' class='latex' /> as you will quickly see if you check it. And there is a very close association between quadratic forms and bilinear forms&#8212;though not quite as close over <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BF%7D_2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{F}_2' title='\mathbb{F}_2' class='latex' /> as it is over fields of higher characteristic.</p>
<p>We can now define the rank of the original quadratic to be the rank of the associated bilinear form (or equivalently of the matrix). The rank has a direct impact on the <img src='http://l.wordpress.com/latex.php?latex=U%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^2' title='U^2' class='latex' /> norm of the quadratic phase function that you build out of the quadratic. Indeed, if we write <img src='http://l.wordpress.com/latex.php?latex=q%28x%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='q(x)' title='q(x)' class='latex' /> for the quadratic and <img src='http://l.wordpress.com/latex.php?latex=f%28x%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(x)' title='f(x)' class='latex' /> for the Boolean function <img src='http://l.wordpress.com/latex.php?latex=%28-1%29%5E%7Bq%28x%29%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(-1)^{q(x)},' title='(-1)^{q(x)},' class='latex' /> then we get</p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle+f%28x%29f%28x%2Ba%29f%28x%2Bb%29f%28x%2Ba%2Bb%29%3D%28-1%29%5E%7Bq%28x%29%2Bq%28x%2Ba%29%2Bq%28x%2Bb%29%2Bq%28x%2Ba%2Bb%29%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle f(x)f(x+a)f(x+b)f(x+a+b)=(-1)^{q(x)+q(x+a)+q(x+b)+q(x+a+b)}.' title='\displaystyle f(x)f(x+a)f(x+b)f(x+a+b)=(-1)^{q(x)+q(x+a)+q(x+b)+q(x+a+b)}.' class='latex' /></p>
<p>Using the fact that addition and subtraction are the same over <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BF%7D_2%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{F}_2,' title='\mathbb{F}_2,' class='latex' /> one then finds that this is <img src='http://l.wordpress.com/latex.php?latex=%28-1%29%5E%7B%5Cbeta%28a%2Cb%29%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(-1)^{\beta(a,b)}.' title='(-1)^{\beta(a,b)}.' class='latex' /> Therefore, the fourth power of the <img src='http://l.wordpress.com/latex.php?latex=U%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^2' title='U^2' class='latex' /> norm of <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BE%7D_%7Ba%2Cb%7D%28-1%29%5E%7B%5Cbeta%28a%2Cb%29%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{E}_{a,b}(-1)^{\beta(a,b)}.' title='\mathbb{E}_{a,b}(-1)^{\beta(a,b)}.' class='latex' /> Now for any fixed <img src='http://l.wordpress.com/latex.php?latex=a%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a,' title='a,' class='latex' /> the expectation over <img src='http://l.wordpress.com/latex.php?latex=b&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='b' title='b' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='0' title='0' class='latex' /> unless the function <img src='http://l.wordpress.com/latex.php?latex=b%5Cmapsto%5Cbeta%28a%2Cb%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='b\mapsto\beta(a,b)' title='b\mapsto\beta(a,b)' class='latex' /> is identically zero, in which case it is <img src='http://l.wordpress.com/latex.php?latex=1.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1.' title='1.' class='latex' /> In other words, you get <img src='http://l.wordpress.com/latex.php?latex=0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='0' title='0' class='latex' /> unless <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> belongs to the kernel, so to speak, of <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\beta.' title='\beta.' class='latex' /> This shows that the <img src='http://l.wordpress.com/latex.php?latex=U%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^2' title='U^2' class='latex' /> norm is equal to the density of the kernel, which is <img src='http://l.wordpress.com/latex.php?latex=2%5E%7B-r%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{-r},' title='2^{-r},' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> is the rank. In other words, the rank of <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\beta' title='\beta' class='latex' /> is just <img src='http://l.wordpress.com/latex.php?latex=-%5Clog_2%5C%7Cf%5C%7C_%7BU%5E2%7D%5E4.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='-\log_2\|f\|_{U^2}^4.' title='-\log_2\|f\|_{U^2}^4.' class='latex' /></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  So if you&#8217;re talking about quadratics, then the rank is <em>exactly</em> what you are interested in, rather than the degree. Sorry, that was slightly silly&#8212;if the degree is fixed then obviously you aren&#8217;t interested in it.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  But what you are saying is still basically right&#8212;it&#8217;s the rank that tells you all about the <img src='http://l.wordpress.com/latex.php?latex=U%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^2' title='U^2' class='latex' /> norm, and in fact the <img src='http://l.wordpress.com/latex.php?latex=U%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^2' title='U^2' class='latex' /> dual norm as well because for quadratic phase functions it turns out that <img src='http://l.wordpress.com/latex.php?latex=%5C%7Cf%5C%7C_%7BU%5E2%7D%5C%7Cf%5C%7C_%7BU%5E2%7D%5E%2A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|f\|_{U^2}\|f\|_{U^2}^*' title='\|f\|_{U^2}\|f\|_{U^2}^*' class='latex' /> is <em>equal</em> to 1 rather than just <em>at least</em> 1.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  So presumably for cubics you do the same thing, except that this time you get a trilinear map. Is that right?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Yes and no. Yes you do get a trilinear map, but now it isn&#8217;t obvious what you mean by the word &#8220;rank&#8221;.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Why not?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Because there are several natural candidates for a definition, and they do not agree. The result is that there is no consensus about what the &#8220;right&#8221; definition should be for a multilinear map.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  What are some of these natural candidates?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Well, let <img src='http://l.wordpress.com/latex.php?latex=%5Ctau&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tau' title='\tau' class='latex' /> be a trilinear form on an <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' />-dimensional vector space. One could define its rank to be <img src='http://l.wordpress.com/latex.php?latex=n-r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n-r' title='n-r' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> is the dimension of the space of all <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=%5Ctau%28a%2Cb%2Cc%29%3D0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tau(a,b,c)=0' title='\tau(a,b,c)=0' class='latex' /> for every <img src='http://l.wordpress.com/latex.php?latex=%28b%2Cc%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(b,c).' title='(b,c).' class='latex' /> Or one could take it to be the smallest <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> such that it is possible to write <img src='http://l.wordpress.com/latex.php?latex=%5Ctau%28a%2Cb%2Cc%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tau(a,b,c)' title='\tau(a,b,c)' class='latex' /> as <img src='http://l.wordpress.com/latex.php?latex=%5Csum_%7Bi%3D1%7D%5Eru_i%28a%29v_i%28b%29w_i%28c%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sum_{i=1}^ru_i(a)v_i(b)w_i(c).' title='\sum_{i=1}^ru_i(a)v_i(b)w_i(c).' class='latex' /> Or one could take it to be the smallest <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> such that we can write <img src='http://l.wordpress.com/latex.php?latex=%5Ctau&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tau' title='\tau' class='latex' /> as a sum <img src='http://l.wordpress.com/latex.php?latex=%5Csum_%7Bi%3D1%7D%5Erf_i%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sum_{i=1}^rf_i,' title='\sum_{i=1}^rf_i,' class='latex' /> where each <img src='http://l.wordpress.com/latex.php?latex=f_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_i' title='f_i' class='latex' /> is a product of a linear function in one of <img src='http://l.wordpress.com/latex.php?latex=a%2Cb%2Cc&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a,b,c' title='a,b,c' class='latex' /> and a bilinear function in the other two. But actually I like a definition that was proposed in a recent and not yet published (or even arXived, it seems) paper of Gowers and Wolf, and is also implicit in work of Green and Tao. It&#8217;s to forget about the algebra and simply define the rank of <img src='http://l.wordpress.com/latex.php?latex=%5Ctau&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tau' title='\tau' class='latex' /> to be <img src='http://l.wordpress.com/latex.php?latex=-%5Clog_2%5Csum_%7Ba%2Cb%2Cc%7D%28-1%29%5E%7B%5Ctau%28a%2Cb%2Cc%29%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='-\log_2\sum_{a,b,c}(-1)^{\tau(a,b,c)}.' title='-\log_2\sum_{a,b,c}(-1)^{\tau(a,b,c)}.' class='latex' /> (That&#8217;s the definition when the vector space is over <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BF%7D_2.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{F}_2.' title='\mathbb{F}_2.' class='latex' /> There is a similar definition for <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BF%7D_p%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{F}_p,' title='\mathbb{F}_p,' class='latex' /> and also in some other contexts.)</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  That sounds a little tautological.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I know, but Gowers and Wolf observed that one could prove that this &#8220;analytic&#8221; rank had a number of useful properties. For example, they showed that if <img src='http://l.wordpress.com/latex.php?latex=%5Cmu&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mu' title='\mu' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cnu&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\nu' title='\nu' class='latex' /> are <img src='http://l.wordpress.com/latex.php?latex=d&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d' title='d' class='latex' />-linear, then the rank of <img src='http://l.wordpress.com/latex.php?latex=%28%5Cmu%2B%5Cnu%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(\mu+\nu)' title='(\mu+\nu)' class='latex' /> is at most <img src='http://l.wordpress.com/latex.php?latex=2%5Ed&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^d' title='2^d' class='latex' /> times the rank of <img src='http://l.wordpress.com/latex.php?latex=%5Cmu&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mu' title='\mu' class='latex' /> plus the rank of <img src='http://l.wordpress.com/latex.php?latex=%5Cnu.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\nu.' title='\nu.' class='latex' /> I can&#8217;t quite remember whether that proof worked in low characteristic&#8212;I&#8217;ll have to check. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  So their result is weaker than what you&#8217;d get in the bilinear case from algebraic methods?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Yes, that&#8217;s odd. I expect it would be possible to improve the factor to <img src='http://l.wordpress.com/latex.php?latex=2%5E%7Bd-1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{d-1}' title='2^{d-1}' class='latex' /> by tweaking their argument a bit. But that&#8217;s not my main concern here. I just want to point out that this definition exists and is quite useful. Also, and I think I mentioned this in one of our earlier conversations, Green and Tao showed that a low-rank polynomial phase function could be decomposed into phase functions of lower degree. To give a simple example of this, an arbitrary Boolean function that depends on just <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> variables has at most <img src='http://l.wordpress.com/latex.php?latex=2%5Ek&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^k' title='2^k' class='latex' /> non-zero Fourier coefficients, so in particular a quadratic phase function that depends on at most <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> variables, and therefore has rank at most <img src='http://l.wordpress.com/latex.php?latex=k%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k,' title='k,' class='latex' /> can be decomposed into at most <img src='http://l.wordpress.com/latex.php?latex=2%5Ek&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^k' title='2^k' class='latex' /> linear phase functions.  </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Is there an algebraic interpretation of this notion of rank?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Sort of. In the trilinear case, for instance, <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BE%7D_c%28-1%29%5E%7B%5Ctau%28a%2Cb%2Cc%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{E}_c(-1)^{\tau(a,b,c)}' title='\mathbb{E}_c(-1)^{\tau(a,b,c)}' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='0' title='0' class='latex' /> if the linear map <img src='http://l.wordpress.com/latex.php?latex=c%5Cmapsto%5Ctau%28a%2Cb%2Cc%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='c\mapsto\tau(a,b,c)' title='c\mapsto\tau(a,b,c)' class='latex' /> is not identically zero and is <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' /> if it <em>is</em> identically zero. So you could say that the rank is related to the size of the &#8220;kernel&#8221; of <img src='http://l.wordpress.com/latex.php?latex=%5Ctau.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tau.' title='\tau.' class='latex' /> However, the set of all <img src='http://l.wordpress.com/latex.php?latex=%28a%2Cb%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(a,b)' title='(a,b)' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=%5Ctau%28a%2Cb%2Cc%29%3D0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tau(a,b,c)=0' title='\tau(a,b,c)=0' class='latex' /> for every <img src='http://l.wordpress.com/latex.php?latex=c&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='c' title='c' class='latex' /> is not a linear subspace but rather some bilinear structure. And its density does not have to be a power of 2, so the rank of <img src='http://l.wordpress.com/latex.php?latex=%5Ctau&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tau' title='\tau' class='latex' /> is not necessarily an integer. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I&#8217;m not sure why you think that <em>any</em> notion of rank is going to be of any use in proving lower bound results.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Why not?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Because the rank is never going to be more than <img src='http://l.wordpress.com/latex.php?latex=n%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n,' title='n,' class='latex' /> since if <img src='http://l.wordpress.com/latex.php?latex=a%3D0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a=0' title='a=0' class='latex' /> then <img src='http://l.wordpress.com/latex.php?latex=%28a%2Cb%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(a,b)' title='(a,b)' class='latex' /> belongs to the &#8220;kernel&#8221;. Also, it goes up in a sort of subadditive way. So it seems to me that after a linear number of scrambling operations you&#8217;ll have reached something with maximal rank&#8212;or at least, it won&#8217;t be possible to use just the rank to prove that you haven&#8217;t. I think you could perhaps extend what Aaronson and Wigderson said: you have to use more about your polynomials than just their degree and their rank.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  That does sound plausible, alas. I wonder if some principle like that follows from their results on algebrization. I feel like asking an expert about this.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Meanwhile, perhaps we should return to the question of what basic operations do to quadratic phase functions?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Oh yes, thanks for reminding me. But I&#8217;m afraid I still don&#8217;t have any progress to report. But I have had a little thought about it that I&#8217;d like to float past you.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> OK.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I was a bit worried by your point that it is unlikely that a basic operation applied to a polynomial phase function of high degree will have <img src='http://l.wordpress.com/latex.php?latex=1-o%281%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1-o(1)' title='1-o(1)' class='latex' /> correlation with another polynomial phase function of the same degree. And I&#8217;m also worried by your point about the combination of degree and rank probably not being a refined enough tool for proving complexity lower bounds. But it now occurs to me that it might just conceivably be possible to define a set-valued complexity measure using these sorts of ideas.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Ah, that sounds potentially interesting if you really can do it.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Well, the idea I had was this. If one is trying to prove that a function of low complexity is a linear combination, with coefficients that aren&#8217;t too big, of functions that are in some sense structured, such as low-degree polynomials and the like, then perhaps one can argue that the set of functions you use for <img src='http://l.wordpress.com/latex.php?latex=f%5Cvee+g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f\vee g' title='f\vee g' class='latex' /> is not much bigger than the union of the set of functions you use for <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> and the set of functions you use for <img src='http://l.wordpress.com/latex.php?latex=g.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g.' title='g.' class='latex' /> That&#8217;s not quite what I mean because it makes no mention of the coefficients, but you get the idea: if you&#8217;ve got a straight-line computation of <img src='http://l.wordpress.com/latex.php?latex=f%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f,' title='f,' class='latex' /> then perhaps the set of functions you need to express all the functions that occur in the computation as efficient linear combinations is not too vast.   </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I sort of get the idea, and it sounds interesting in a way, but I worry that much of its interest may derive from its being rather vague and may vanish as soon as you try to express the idea more precisely. Can you give me a precise  statement that says something like, &#8220;If there exists a set of functions <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BF%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{F}' title='\mathcal{F}' class='latex' /> with such and such a property, then there is a function in NP with a superlinear circuit lower bound&#8221;? Of course, in itself that&#8217;s not enough because some such statements are useless, such as &#8220;If there exists a set of functions <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BF%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{F}' title='\mathcal{F}' class='latex' /> that contains all functions of circuit complexity at most <img src='http://l.wordpress.com/latex.php?latex=n%5Clog+n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n\log n' title='n\log n' class='latex' /> and does not contain the clique function, then there is a function in NP with a superlinear circuit lower bound.&#8221;</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Let me tell you the kind of thing I had in mind. If you don&#8217;t mind I&#8217;ll call the set of functions <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BU%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{U}' title='\mathcal{U}' class='latex' /> instead of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BF%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{F}.' title='\mathcal{F}.' class='latex' /> Now suppose that we have two Boolean functions <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> and that we can write them as <img src='http://l.wordpress.com/latex.php?latex=f%3D%5Csum%5Clambda_iu_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f=\sum\lambda_iu_i' title='f=\sum\lambda_iu_i' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=g%3D%5Csum%5Cmu_jv_j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g=\sum\mu_jv_j' title='g=\sum\mu_jv_j' class='latex' /> with the functions <img src='http://l.wordpress.com/latex.php?latex=u_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='u_i' title='u_i' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=v_j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='v_j' title='v_j' class='latex' /> belonging to <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BU%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{U}.' title='\mathcal{U}.' class='latex' /> </p>
<p>What can we then say about <img src='http://l.wordpress.com/latex.php?latex=f%5Cvee+g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f\vee g' title='f\vee g' class='latex' />? I was hoping that we would be able to write <img src='http://l.wordpress.com/latex.php?latex=f%5Cvee+g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f\vee g' title='f\vee g' class='latex' /> as <img src='http://l.wordpress.com/latex.php?latex=%5Csum%5Cnu_kw_k%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sum\nu_kw_k,' title='\sum\nu_kw_k,' class='latex' /> where each <img src='http://l.wordpress.com/latex.php?latex=w_k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='w_k' title='w_k' class='latex' /> was either a <img src='http://l.wordpress.com/latex.php?latex=u_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='u_i' title='u_i' class='latex' /> or a <img src='http://l.wordpress.com/latex.php?latex=v_j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='v_j' title='v_j' class='latex' /> or one of a small number of &#8220;error&#8221; functions that came in. And also there would have to be some condition about the sum of the <img src='http://l.wordpress.com/latex.php?latex=%5Cnu_k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\nu_k' title='\nu_k' class='latex' /> not being too big in terms of the sums of the <img src='http://l.wordpress.com/latex.php?latex=%5Clambda_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda_i' title='\lambda_i' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cmu_j.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mu_j.' title='\mu_j.' class='latex' /></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I can see a number of problems with this. To start with, if the coefficient sums go up by even a small constant factor each time you do a Boolean operation, then after a linear number of steps they will be exponentially large. But if your set <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BU%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{U}' title='\mathcal{U}' class='latex' /> is reasonably balanced, it will be possible to write <em>every</em> Boolean function as a linear combination with coefficients adding up to at most <img src='http://l.wordpress.com/latex.php?latex=C%5En.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='C^n.' title='C^n.' class='latex' /> So you won&#8217;t have distinguished between functions of linear complexity and random functions.</p>
<p>But even if you can solve this problem (which is conceivable&#8212;for instance, there might be parameters other than the sum of the coefficients that could give you a measure of the &#8220;size&#8221; of the linear combination), there is another that seems to me to be more fundamental. Recall from an earlier conversation of ours that Boolean operations felt more like products than sums. Now what happens if you take the product of <img src='http://l.wordpress.com/latex.php?latex=%5Csum_i%5Clambda_iu_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sum_i\lambda_iu_i' title='\sum_i\lambda_iu_i' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Csum_j%5Cmu_jv_j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sum_j\mu_jv_j' title='\sum_j\mu_jv_j' class='latex' />? You get <img src='http://l.wordpress.com/latex.php?latex=%5Csum_%7Bi%2Cj%7D%5Clambda_i%5Cmu_ju_iv_j.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sum_{i,j}\lambda_i\mu_ju_iv_j.' title='\sum_{i,j}\lambda_i\mu_ju_iv_j.' class='latex' /> Now there&#8217;s good news and bad news here. The good news is that if <img src='http://l.wordpress.com/latex.php?latex=u_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='u_i' title='u_i' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=v_j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='v_j' title='v_j' class='latex' /> are degree-<img src='http://l.wordpress.com/latex.php?latex=d&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d' title='d' class='latex' /> phase functions then so is <img src='http://l.wordpress.com/latex.php?latex=u_iv_j%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='u_iv_j,' title='u_iv_j,' class='latex' /> so we&#8217;ve got another linear combination of functions in <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BU%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{U}.' title='\mathcal{U}.' class='latex' /> But the bad news is that we have used all the functions <img src='http://l.wordpress.com/latex.php?latex=u_iv_j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='u_iv_j' title='u_iv_j' class='latex' /> (and we have multiplied together the coefficient sums, but we&#8217;re ignoring that aspect now). So it seems that the set of functions we use for <img src='http://l.wordpress.com/latex.php?latex=f%5Cvee+g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f\vee g' title='f\vee g' class='latex' /> is going to be something like the product set of the sets we use for <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> and for <img src='http://l.wordpress.com/latex.php?latex=g.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g.' title='g.' class='latex' /> Therefore, after linearly many steps of the process it seems that we will end up with exponentially large sets. But <em>no</em> function needs more than exponentially many functions from <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BU%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{U}' title='\mathcal{U}' class='latex' />, since the dimension of the space we are talking about is <img src='http://l.wordpress.com/latex.php?latex=2%5En.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^n.' title='2^n.' class='latex' /></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  As ever, the water you pour over my ideas is well and truly cold. But there&#8217;s one point you made that I think needs to be thought about slightly harder.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> As ever, you carry a little immersion heater around with you.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Something like that. What I wanted to say was that at a certain point you made the assumption that the cardinality of the set of functions <img src='http://l.wordpress.com/latex.php?latex=u_iv_j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='u_iv_j' title='u_iv_j' class='latex' /> was equal to the product of the number of <img src='http://l.wordpress.com/latex.php?latex=u_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='u_i' title='u_i' class='latex' /> and the number of <img src='http://l.wordpress.com/latex.php?latex=v_j.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='v_j.' title='v_j.' class='latex' /> Obviously this is an upper bound, but perhaps there are coincidences. For example, if <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BU%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{U}' title='\mathcal{U}' class='latex' /> is the set of all quadratic phase functions, and <img src='http://l.wordpress.com/latex.php?latex=u_i%3D%28-1%29%5E%7Bq_i%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='u_i=(-1)^{q_i}' title='u_i=(-1)^{q_i}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=v_j%3D%28-1%29%5E%7Br_j%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='v_j=(-1)^{r_j}' title='v_j=(-1)^{r_j}' class='latex' />, then <img src='http://l.wordpress.com/latex.php?latex=u_iv_j%3D%28-1%29%5E%7Bq_ir_j%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='u_iv_j=(-1)^{q_ir_j},' title='u_iv_j=(-1)^{q_ir_j},' class='latex' /> so if <img src='http://l.wordpress.com/latex.php?latex=Q&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Q' title='Q' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=R&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='R' title='R' class='latex' /> are the sets of quadratics used by <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=g%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g,' title='g,' class='latex' /> then the set of quadratics used by <img src='http://l.wordpress.com/latex.php?latex=fg&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='fg' title='fg' class='latex' /> is the sumset <img src='http://l.wordpress.com/latex.php?latex=Q%2BR.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Q+R.' title='Q+R.' class='latex' /> </p>
<p>Now in general <img src='http://l.wordpress.com/latex.php?latex=Q%2BR&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Q+R' title='Q+R' class='latex' /> can have size <img src='http://l.wordpress.com/latex.php?latex=%7CQ%7C%7CR%7C%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='|Q||R|,' title='|Q||R|,' class='latex' /> but if the sets <img src='http://l.wordpress.com/latex.php?latex=Q&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Q' title='Q' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=R&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='R' title='R' class='latex' /> have additive structure (an extreme example would be when they are parallel affine subspaces of the space of quadratics over <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BF%7D_2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{F}_2' title='\mathbb{F}_2' class='latex' />) then <img src='http://l.wordpress.com/latex.php?latex=%7CQ%2BR%7C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='|Q+R|' title='|Q+R|' class='latex' /> can be much smaller. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> OK, but now you&#8217;ve introduced another idea into the picture. Your hypothesis would be not so much that you can write low-complexity functions as a linear combination of <em>few</em> structured functions, and more that you can write them as a linear combination of <em>structured sets</em> of structured functions. But where is your set-theoretic complexity measure now?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  I&#8217;ve thought of a question that might be worth considering. It just feels as though it could clarify the discussion a bit.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  OK, what is it?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Well, it seems to me that, motivated by the remarks of Aaronson and Wigderson, you are trying to distinguish between low-complexity and high-complexity polynomials, even when they have the same degree. This seems worth thinking about even for quadratics: a simple counting argument tells us that most quadratics have superlinear (in fact, almost quadratic) circuit complexity. So what is the difference between a high-complexity quadratic and a low-complexity quadratic? If you force yourself to think about just this question, then maybe you won&#8217;t keep coming up with properties that fail for known reasons.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  That sounds like an excellent idea, but potentially also a very difficult one, because it might be possible to define &#8220;weird&#8221; quadratics by devising some clever low-complexity calculation that involves functions that are nothing like quadratics but that magically produce a quadratic at the end. But perhaps one could have a restricted model of computation that allowed more gates but insisted that all functions at every stage were quadratics.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> That is sounding very like arithmetic complexity to me.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  What is arithmetic complexity?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> It&#8217;s very like circuit complexity, but you replace the Boolean operations <img src='http://l.wordpress.com/latex.php?latex=%5Cvee&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\vee' title='\vee' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cwedge&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\wedge' title='\wedge' class='latex' /> by arithmetic operations such as <img src='http://l.wordpress.com/latex.php?latex=%2B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='+' title='+' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Ctimes.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\times.' title='\times.' class='latex' /> I think a good illustration of the basic idea is the function <img src='http://l.wordpress.com/latex.php?latex=x%5E%7B2%5En%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x^{2^n}.' title='x^{2^n}.' class='latex' /> This has arithmetic complexity <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> because you can get it in <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> steps by starting with <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> and repeatedly squaring. (I&#8217;m now talking about arithmetic complexity of polynomials over the reals, by the way.) This shows that the arithmetic complexity of a polynomial can be far smaller than the degree. And in general it seems to be hard to prove lower bounds for arithmetic complexity.</p>
<p>A very important fact in cryptography is that the arithmetic complexity of raising a number to a power modulo a large index <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> is small. If you want to work out <img src='http://l.wordpress.com/latex.php?latex=x%5Em&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x^m' title='x^m' class='latex' /> mod <img src='http://l.wordpress.com/latex.php?latex=n%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n,' title='n,' class='latex' /> then you can do the repeated-squaring trick to work out <img src='http://l.wordpress.com/latex.php?latex=x%5E%7B2%5Ek%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x^{2^k}' title='x^{2^k}' class='latex' /> for all <img src='http://l.wordpress.com/latex.php?latex=k%5Cleq%5Clog_2m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k\leq\log_2m' title='k\leq\log_2m' class='latex' /> and then multiply together the ones you need to make <img src='http://l.wordpress.com/latex.php?latex=x%5Em.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x^m.' title='x^m.' class='latex' /> This shows that calculating <img src='http://l.wordpress.com/latex.php?latex=x%5Em&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x^m' title='x^m' class='latex' /> has arithmetic complexity that is polylogarithmic in <img src='http://l.wordpress.com/latex.php?latex=m.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m.' title='m.' class='latex' /> Equally important is the fact that it seems to be much harder to calculate factorials: if there were a quick way of calculating <img src='http://l.wordpress.com/latex.php?latex=%28n-1%29%21&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(n-1)!' title='(n-1)!' class='latex' />, for example, then there would be a quick primality test. (Of course, one could say that there <em>is</em> a quick way: use AKS to test whether <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> is prime and then the answer is <img src='http://l.wordpress.com/latex.php?latex=-1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='-1' title='-1' class='latex' /> if it is prime and <img src='http://l.wordpress.com/latex.php?latex=0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='0' title='0' class='latex' /> otherwise. But being able to calculate factorials and similar polynomials quickly would have major consequences besides primality testing.)</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  But I don&#8217;t see how this applies to quadratics, because they are not closed under taking products.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I agree that that&#8217;s a problem. But here&#8217;s the type of thing one could do. You&#8217;ll agree that the map <img src='http://l.wordpress.com/latex.php?latex=x%5Cmapsto+x_1x_2%2B%5Cdots%2Bx_%7Bn-1%7Dx_n%2Bx_nx_1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x\mapsto x_1x_2+\dots+x_{n-1}x_n+x_nx_1' title='x\mapsto x_1x_2+\dots+x_{n-1}x_n+x_nx_1' class='latex' /> is a high-rank quadratic.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Yes. I think it has rank either <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> or <img src='http://l.wordpress.com/latex.php?latex=n-1.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n-1.' title='n-1.' class='latex' /> </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> And it&#8217;s also computable in linear time, has linear arithmetic complexity, etc.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Yes, but its associated matrix is very sparse, since it has only <img src='http://l.wordpress.com/latex.php?latex=2n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2n' title='2n' class='latex' /> entries.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Sorry, but why <img src='http://l.wordpress.com/latex.php?latex=2n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2n' title='2n' class='latex' /> rather than <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' />?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Because the bilinear form you get is symmetric: corresponding to each term <img src='http://l.wordpress.com/latex.php?latex=x_ix_j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x_ix_j' title='x_ix_j' class='latex' /> you get a term <img src='http://l.wordpress.com/latex.php?latex=x_iy_j%2Bx_jy_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x_iy_j+x_jy_i' title='x_iy_j+x_jy_i' class='latex' /> in the bilinear form.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Ah, I see.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> The point I wanted to make is that you can produce less sparse matrices by producing some more interesting ensembles of linear forms than <img src='http://l.wordpress.com/latex.php?latex=x_1%2C%5Cdots%2Cx_n.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x_1,\dots,x_n.' title='x_1,\dots,x_n.' class='latex' /> What you do is start with a straight-line computation that allows only addition mod 2. In other words, you write a sequence <img src='http://l.wordpress.com/latex.php?latex=x_1%2Cx_2%2C%5Cdots%2Cx_n%2Cy_1%2C%5Cdots%2Cy_m%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x_1,x_2,\dots,x_n,y_1,\dots,y_m,' title='x_1,x_2,\dots,x_n,y_1,\dots,y_m,' class='latex' /> where each <img src='http://l.wordpress.com/latex.php?latex=y_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y_i' title='y_i' class='latex' /> is a sum of two earlier terms in the sequence (either <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' />s or <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' />s). Then at the end of that process you create a quadratic form <img src='http://l.wordpress.com/latex.php?latex=%5Csum_j+u_jv_j%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sum_j u_jv_j,' title='\sum_j u_jv_j,' class='latex' /> where each <img src='http://l.wordpress.com/latex.php?latex=u_j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='u_j' title='u_j' class='latex' /> and each <img src='http://l.wordpress.com/latex.php?latex=v_j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='v_j' title='v_j' class='latex' /> is one of the linear forms from the nice little supply that you have built up.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Ah, I like that. First of all, it really does seem a natural model for the computational complexity of quadratic forms over <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BF%7D_2.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{F}_2.' title='\mathbb{F}_2.' class='latex' /> OK, it doesn&#8217;t allow for computations that produce quadratic forms by magic, but it does seem to allow all &#8220;natural&#8221; ways that one might want to produce them. The second thing I like about it is that it relates <em>very</em> closely to the Gaussian-elimination problem. In fact, the only difference is that in that problem when you work out the mod-2 sum of two of your functions, you have to choose one of them and vow never to use it again, whereas here we drop that restriction. It shows that what makes a quadratic computationally simple is the possibility of writing it as a sum of rank-one quadratics (by which I mean functions of the form <img src='http://l.wordpress.com/latex.php?latex=x%5Cmapsto+%5Clambda%28x%29%5Cmu%28x%29%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x\mapsto \lambda(x)\mu(x),' title='x\mapsto \lambda(x)\mu(x),' class='latex' /> where both <img src='http://l.wordpress.com/latex.php?latex=%5Clambda&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda' title='\lambda' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cmu&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mu' title='\mu' class='latex' /> are linear) that are somehow related to each other in a &#8220;simple&#8221; way. We see that the rank is far too crude a parameter because it takes no account of this relationship (or lack of it). </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Quick question: to what extent is the representation of a quadratic unique? In particular, might it have a &#8220;simple&#8221; representation <em>and</em> a &#8220;complex&#8221; one? Then in order to prove that a quadratic could not be computed in linear time (in this model) one would have to consider all possible representations rather than just one, which would be annoying.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I think uniqueness of any kind is too much to hope for. For example, if you work mod 3 instead and choose an arbitrary &#8220;orthonormal basis&#8221; (by which I mean a collection of binary sequences <img src='http://l.wordpress.com/latex.php?latex=v_1%2C%5Cdots%2Cv_n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='v_1,\dots,v_n' title='v_1,\dots,v_n' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=%5Csum_r+v_i%28r%29v_j%28r%29%3D%5Cdelta_%7Bij%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sum_r v_i(r)v_j(r)=\delta_{ij}' title='\sum_r v_i(r)v_j(r)=\delta_{ij}' class='latex' /> mod 3 for every <img src='http://l.wordpress.com/latex.php?latex=i%2Cj&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i,j' title='i,j' class='latex' />), then the matrix <img src='http://l.wordpress.com/latex.php?latex=a_%7Brs%7D%3D%5Csum_iv_i%28r%29v_i%28s%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a_{rs}=\sum_iv_i(r)v_i(s)' title='a_{rs}=\sum_iv_i(r)v_i(s)' class='latex' /> works out to be the identity (as can be seen if you multiply it by any of the vectors <img src='http://l.wordpress.com/latex.php?latex=v_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='v_i' title='v_i' class='latex' />). The resulting quadratic form is <img src='http://l.wordpress.com/latex.php?latex=x_1%5E2%2B%5Cdots%2Bx_n%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x_1^2+\dots+x_n^2' title='x_1^2+\dots+x_n^2' class='latex' />. There are annoying characteristic-2 problems if you want to do something similar over <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BF%7D_2%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{F}_2,' title='\mathbb{F}_2,' class='latex' /> since then <img src='http://l.wordpress.com/latex.php?latex=x_i%5E2%3Dx_i%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x_i^2=x_i,' title='x_i^2=x_i,' class='latex' /> but I suspect it would be easy to come up with an example that avoids the diagonal and makes the same general point.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Sorry, I got lost there&#8212;what is the general point?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> It&#8217;s that one can probably find &#8220;weird&#8221; ensembles of linear forms that give you quadratics that can also be produced in non-weird ways. But now it&#8217;s my turn to be lost. What was the point <em>you</em> were making?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  I haven&#8217;t quite got it clear myself. But I liked this connection between the complexity of quadratic forms and the Gaussian-elimination problem and was trying to see how closely they were related. It seems that what you do here is a slightly generalized Gaussian-elimination procedure to produce two matrices (one with rows <img src='http://l.wordpress.com/latex.php?latex=u_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='u_i' title='u_i' class='latex' /> and one with rows <img src='http://l.wordpress.com/latex.php?latex=v_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='v_i' title='v_i' class='latex' />) and you then multiply the first by the transpose of the second to get the matrix of the quadratic form. So we are not in fact asking whether the matrix of the quadratic form can be obtained in linear time using Gaussian row operations, or anything like that at all. And one can express the matrix as a product of two other matrices in many different ways, so it looks as though it will be pretty difficult to identify some quasirandomness property that forces all decompositions of the form <img src='http://l.wordpress.com/latex.php?latex=AB%5ET&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='AB^T' title='AB^T' class='latex' /> to be complex in the Gaussian-elimination sense.</p>
<p>I suppose what I&#8217;m saying is that even if we could solve the Gaussian-elimination problem, we&#8217;d still have a long way to go.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I think that you are right, which is unfortunate.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Thanks.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I don&#8217;t mean your correctness is unfortunate, but just that the correct fact you&#8217;ve identified (if it is correct) is a pity. But I still like your proposal that we should look at quadratics, because it still seems to be subject to the natural-proofs barrier, and we won&#8217;t be tempted to use crude parameters such as degree and rank because all our polynomials are quadratics and they can quite easily have maximal rank. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Yes, but now you haven&#8217;t the faintest idea how to prove anything.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  That&#8217;s better than having lots of faint ideas that are guaranteed not to work. And in any case, now, despite what <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  has pointed out, I want to think about the Gaussian-elimination problem. I have a simple question: how many Gaussian operations do you need in order to produce a quasirandom matrix?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  What do you mean by that?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I mean a matrix where the number of 1s is approximately <img src='http://l.wordpress.com/latex.php?latex=n%5E2%2F2%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^2/2,' title='n^2/2,' class='latex' /> and for almost any two rows they agree in approximately half the places. (Actually, the second property implies the first.)</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> That sounds like a very natural property to me.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Thank you.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I didn&#8217;t mean it as a compliment. It sounds easily computable, and hence very unlikely to be a good indication of complexity. I&#8217;m sure it will be possible to produce such matrices in a linear number of steps.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Oh yes. Well, perhaps we should just confirm that.</p>
<p>A random sequence of row operations doesn&#8217;t work I think, because each row ends up depending on only very few of the original rows. To put it another way, the matrix you end up with is far too sparse. But one can deal with that by first forming a triangular matrix with 0s on one side of the diagonal and 1s on the other side&#8212;it&#8217;s easy to see how to do that. Perhaps if one <em>then</em> applies a random sequence of Gaussian operations one reaches a quasirandom matrix in linear time.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Hmm, I&#8217;m not sure. For a start there will be a few rows that you will never touch again.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Yes, but all I need is for <em>almost</em> all rows to look random at the end of the process.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I think you need to be fussier than that. After all, the Paley matrix is <em>very</em> quasirandom according to your definition, so if you can&#8217;t produce extremely good quasirandomness in linear time then the Walsh matrix needs a superlinear number of row operations.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  That was indeed an example I had in mind actually.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  What&#8217;s the Walsh matrix?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  A quick definition is this: <img src='http://l.wordpress.com/latex.php?latex=W_n%3D%5Cleft%28%5Cbegin%7Bmatrix%7DW_%7Bn-1%7D%26W_%7Bn-1%7D%5C%5CW_%7Bn-1%7D%26-W_%7Bn-1%7D%5Cend%7Bmatrix%7D%5Cright%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='W_n=\left(\begin{matrix}W_{n-1}&amp;W_{n-1}\\W_{n-1}&amp;-W_{n-1}\end{matrix}\right).' title='W_n=\left(\begin{matrix}W_{n-1}&amp;W_{n-1}\\W_{n-1}&amp;-W_{n-1}\end{matrix}\right).' class='latex' /> If <img src='http://l.wordpress.com/latex.php?latex=W_0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='W_0' title='W_0' class='latex' /> is the <img src='http://l.wordpress.com/latex.php?latex=1%5Ctimes+1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1\times 1' title='1\times 1' class='latex' /> matrix with just a 1 inside, then you&#8217;ll end up with a <img src='http://l.wordpress.com/latex.php?latex=2%5En%5Ctimes+2%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^n\times 2^n' title='2^n\times 2^n' class='latex' /> orthogonal matrix of <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' />s and <img src='http://l.wordpress.com/latex.php?latex=-1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='-1' title='-1' class='latex' />s. If you change that to a 01-matrix in the obvious way then you get what one might call perfect quasirandomness, apart from the fact that one row is all 1s.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Right, how do you get <em>that</em> level of quasirandomness with linearly many row operations?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Hmm, we have an inductive construction, so presumably we can translate that into some inductively defined sequence of row operations. How do we produce <img src='http://l.wordpress.com/latex.php?latex=W_1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='W_1' title='W_1' class='latex' /> from the <img src='http://l.wordpress.com/latex.php?latex=2%5Ctimes+2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2\times 2' title='2\times 2' class='latex' /> identity matrix? That&#8217;s quite easy: add the second row to the first to get <img src='http://l.wordpress.com/latex.php?latex=%5Cleft%28%5Cbegin%7Bmatrix%7D1%261%5C%5C+0%261%5C%5C+%5Cend%7Bmatrix%7D%5Cright%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\left(\begin{matrix}1&amp;1\\ 0&amp;1\\ \end{matrix}\right)' title='\left(\begin{matrix}1&amp;1\\ 0&amp;1\\ \end{matrix}\right)' class='latex' />, then the first to the second to get <img src='http://l.wordpress.com/latex.php?latex=W_1.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='W_1.' title='W_1.' class='latex' /></p>
<p>In general, if we&#8217;ve got the matrix <img src='http://l.wordpress.com/latex.php?latex=%5Cleft%28%5Cbegin%7Bmatrix%7DW_%7Bk-1%7D%260%5C%5C+0%26W_%7Bk-1%7D%5C%5C+%5Cend%7Bmatrix%7D%5Cright%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\left(\begin{matrix}W_{k-1}&amp;0\\ 0&amp;W_{k-1}\\ \end{matrix}\right)' title='\left(\begin{matrix}W_{k-1}&amp;0\\ 0&amp;W_{k-1}\\ \end{matrix}\right)' class='latex' />, we can add the second &#8220;block row&#8221; to the first and then the first to the second. That takes <img src='http://l.wordpress.com/latex.php?latex=2%5Ek&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^k' title='2^k' class='latex' /> operations. So if it takes <img src='http://l.wordpress.com/latex.php?latex=f%28k%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(k)' title='f(k)' class='latex' /> operations to produce <img src='http://l.wordpress.com/latex.php?latex=W_k%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='W_k,' title='W_k,' class='latex' /> then we seem to have a recurrence like <img src='http://l.wordpress.com/latex.php?latex=f%28k%29%3D2f%28k-1%29%2B2%5Ek.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(k)=2f(k-1)+2^k.' title='f(k)=2f(k-1)+2^k.' class='latex' /> That recursion is satisfied by the function <img src='http://l.wordpress.com/latex.php?latex=f%28k%29%3Dk2%5Ek.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(k)=k2^k.' title='f(k)=k2^k.' class='latex' /> If we now set <img src='http://l.wordpress.com/latex.php?latex=n%3D2%5Ek&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n=2^k' title='n=2^k' class='latex' /> we get a bound of <img src='http://l.wordpress.com/latex.php?latex=n%5Clog+n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n\log n' title='n\log n' class='latex' /> rather than a linear bound. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  It would be very surprising if that was not the quickest way of producing <img src='http://l.wordpress.com/latex.php?latex=W_k.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='W_k.' title='W_k.' class='latex' /> It just seems so obvious and natural. Is it really true that the best known lower bounds for this problem are only linear?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Maybe we should check. There&#8217;s obviously lots to think about here, but I&#8217;m getting tired. Shall we stop for now?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' />  <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />   OK. </p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/990/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/990/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/gowers.wordpress.com/990/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/gowers.wordpress.com/990/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/gowers.wordpress.com/990/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/gowers.wordpress.com/990/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/gowers.wordpress.com/990/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/gowers.wordpress.com/990/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/gowers.wordpress.com/990/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/gowers.wordpress.com/990/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&blog=1659011&post=990&subd=gowers&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2009/11/03/a-conversation-about-complexity-lower-bounds-ix/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>A conversation about complexity lower bounds, VIII</title>
		<link>http://gowers.wordpress.com/2009/10/27/a-conversation-about-complexity-lower-bounds-viii/</link>
		<comments>http://gowers.wordpress.com/2009/10/27/a-conversation-about-complexity-lower-bounds-viii/#comments</comments>
		<pubDate>Tue, 27 Oct 2009 16:01:40 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[complexity]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=970</guid>
		<description><![CDATA[In this next instalment, our characters discuss the norm you get by looking for the best possible correlation with a quadratic phase function. They end up discussing a heuristic argument that might, just conceivably, show that this norm is one of a wide class of norms that cannot possibly give rise to superlinear lower bounds. [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&blog=1659011&post=970&subd=gowers&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>In this next instalment, our characters discuss the norm you get by looking for the best possible correlation with a quadratic phase function. They end up discussing a heuristic argument that might, just conceivably, show that this norm is one of a wide class of norms that cannot possibly give rise to superlinear lower bounds. Along the way they have several thoughts, some of which are quite interesting, some not interesting at all, and some plain wrong. (The more interesting ones are mostly later on in the instalment.)</p>
<p>*******************************</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Last time we met, I came to the understanding that if you build a norm by means of a formula of the form <img src='http://l.wordpress.com/latex.php?latex=%5C%7Cf%5C%7C%3D%5Cmax%5C%7B%7C%5Clangle+f%2Cg%5Crangle%7C%3Ag%5Cin%5CGamma%5C%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|f\|=\max\{|\langle f,g\rangle|:g\in\Gamma\},' title='\|f\|=\max\{|\langle f,g\rangle|:g\in\Gamma\},' class='latex' /> then there are two properties that <img src='http://l.wordpress.com/latex.php?latex=%5CGamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma' title='\Gamma' class='latex' /> might have that will give the norm an outside chance of being a useful quasirandomness norm for proving nontrivial lower bounds. The first is that the cardinality of <img src='http://l.wordpress.com/latex.php?latex=%5CGamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma' title='\Gamma' class='latex' /> should be superpolynomial in <img src='http://l.wordpress.com/latex.php?latex=2%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^n' title='2^n' class='latex' /> (or else there is a trivial polynomial-time algorithm for working out the norm). The second, which implies the first, but which I prefer to think of as a separate property, is there should not be some clever polynomial-time way of working out the norm&#8212;which, when <img src='http://l.wordpress.com/latex.php?latex=%5CGamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma' title='\Gamma' class='latex' /> has superexponential size would require one to exploit special properties of the particular set of functions.</p>
<p>As I see it, if you take <img src='http://l.wordpress.com/latex.php?latex=%5CGamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma' title='\Gamma' class='latex' /> to be the set of all quadratic phase functions, then you get a set that definitely has the first property and could well have the second. So I want to go back to thinking about this quadratic-correlation norm. Earlier I convinced myself that a random low-complexity function should not correlate with any quadratic phase function. But if for any fixed quadratic phase function I can get only an exponentially small probability of a huge correlation, and if there are superexponentially many quadratic phase functions, then perhaps we need to revisit this statement. Is it conceivable that every function of linear circuit complexity correlates quite heavily with a quadratic phase function?<span id="more-970"></span></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Why should that happen?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I have absolutely no idea. In fact, I don&#8217;t think it does happen. But I&#8217;d like a convincing (even if nonrigorous) argument to that effect. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Why do you think it doesn&#8217;t happen?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Basically because there seems to be no reason for it to happen. But here&#8217;s what happens when I try to argue that it doesn&#8217;t. A first thought is to take a cubic polynomial. But I now see that that gives us two problems: how do you find a suitable cubic that you can calculate in linear time, and how do you prove a bound that&#8217;s better than <img src='http://l.wordpress.com/latex.php?latex=N%5E%7B-1%2F4%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N^{-1/4}' title='N^{-1/4}' class='latex' />? The second question may have an answer but I&#8217;d be surprised if people can get down to <img src='http://l.wordpress.com/latex.php?latex=N%5E%7B-1%2F2%2B%5Cepsilon%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N^{-1/2+\epsilon}' title='N^{-1/2+\epsilon}' class='latex' />.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Yes, but you said you just wanted a heuristic argument: is there perhaps a plausible guess for what the correct bounds should be for how little cubics can correlate with quadratics (which is just the same as asking how small the average of a cubic phase function can be)? Or a similar question: if you take a random set <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BA%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{A}' title='\mathcal{A}' class='latex' /> of <img src='http://l.wordpress.com/latex.php?latex=1000n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1000n' title='1000n' class='latex' /> triples <img src='http://l.wordpress.com/latex.php?latex=%5C%7Bi%2Cj%2Ck%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{i,j,k\}' title='\{i,j,k\}' class='latex' /> and define a cubic polynomial to be <img src='http://l.wordpress.com/latex.php?latex=p%28x%29%3D%5Csum_%7B%5C%7Bi%2Cj%2Ck%5C%7D%5Cin%5Cmathcal%7BA%7D%7Dx_ix_jx_k%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p(x)=\sum_{\{i,j,k\}\in\mathcal{A}}x_ix_jx_k,' title='p(x)=\sum_{\{i,j,k\}\in\mathcal{A}}x_ix_jx_k,' class='latex' /> then what is the expected size of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BE%7D%28-1%29%5E%7Bp%28x%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{E}(-1)^{p(x)}' title='\mathbb{E}(-1)^{p(x)}' class='latex' />?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I&#8217;m afraid I don&#8217;t know the answers to these interesting questions. But let me point out that if you want to take a random cubic like that, then you&#8217;ll need to take the maximum over all quadratic correlations. That is, you&#8217;ll need to prove that the cubic phase function <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa%28x%29%3D%28-1%29%5E%7Bp%28x%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa(x)=(-1)^{p(x)}' title='\kappa(x)=(-1)^{p(x)}' class='latex' /> has some property that causes it not to correlate with quadratic phase functions. And I don&#8217;t know of a property that gives really good bounds for that. </p>
<p>So let me turn to the other suggestion, which is to take a random function that&#8217;s computable in linear time.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  You mean with that clever pre-processing step?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Ah&#8212;as it happens, I&#8217;ve thought of a different model that seems better. It doesn&#8217;t use the pre-processing step and it doesn&#8217;t involve the worryingly low depth that <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> was complaining about.</p>
<p>What you do is this. Let&#8217;s call the basic functions and their negatives level-0 functions. First create from the basic functions a collection of <img src='http://l.wordpress.com/latex.php?latex=2%5Calpha+n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2\alpha n' title='2\alpha n' class='latex' /> level-1 functions by picking random pairs of level-0 functions and randomly applying <img src='http://l.wordpress.com/latex.php?latex=%5Cwedge&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\wedge' title='\wedge' class='latex' /> or <img src='http://l.wordpress.com/latex.php?latex=%5Cvee&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\vee' title='\vee' class='latex' /> to them. Here, <img src='http://l.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\alpha' title='\alpha' class='latex' /> is a constant less than 1: I like to think of it as 0.99. From these, create <img src='http://l.wordpress.com/latex.php?latex=2%5Calpha%5E2+n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2\alpha^2 n' title='2\alpha^2 n' class='latex' /> level-2 functions, again by picking random pairs and randomly applying either <img src='http://l.wordpress.com/latex.php?latex=%5Cwedge&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\wedge' title='\wedge' class='latex' /> or <img src='http://l.wordpress.com/latex.php?latex=%5Cvee.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\vee.' title='\vee.' class='latex' /> Continue in this way, until you&#8217;re down to just one function. The total number of functions you create is linear in <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> (since it is the sum of a GP that starts with <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> and has common ratio <img src='http://l.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\alpha' title='\alpha' class='latex' />) but the depth of the circuit is <img src='http://l.wordpress.com/latex.php?latex=%5Clog+n%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\log n,' title='\log n,' class='latex' /> all the coordinates make a big difference, and so on.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Surely with high probability at least some of the coordinates will be missed out when you go from level 0 to level 1.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Oh yes. Well, one could deal with that by just making sure that all the functions are used at least once when you go from one layer to the next. Incidentally, one could let the layers increase in size for a short while to begin with before then decreasing geometrically.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> You&#8217;re right, I prefer this model to your previous one, but I have a semi-serious objection to it. By that I mean that it&#8217;s a serious objection but I think there will be a way of dealing with it.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  It&#8217;s good to know that in advance. What is the objection though?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> It&#8217;s that your final function is obviously not going to be pseudorandom.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Why not?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Because if your model creates pseudorandom functions, then presumably the two functions you get in the penultimate layer will be pseudorandom, and one would hope that they would also be &#8220;independent&#8221; in some sense (since if not then applying a Boolean operation to them could produce a highly non-random function). But if your final operation is <img src='http://l.wordpress.com/latex.php?latex=%5Cwedge%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\wedge,' title='\wedge,' class='latex' /> say, then applying it to two independent pseudorandom functions gives you a function that takes the value 1 about 1/4 of the time instead of about 1/2 the time. This problem also applies to your earlier model of a random formula. I have to admit that I didn&#8217;t notice it myself, but <a href="http://gowers.wordpress.com/2009/09/22/a-conversation-about-complexity-lower-bounds/#comment-4017">was told about it by Luca Trevisan</a>. He suggested using the basic operations from the Gowers model instead.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Well, that shouldn&#8217;t create too many problems, since we were using them in our previous model of random functions of linear circuit complexity. Let me have a go at defining something.</p>
<p>We start with a string of <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> bits. We then apply a random sequence of <img src='http://l.wordpress.com/latex.php?latex=%5Calpha+n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\alpha n' title='\alpha n' class='latex' /> basic operations (that is, permutations that act on just three bits at a time, as before). Then we randomly restrict to a set of bits of size <img src='http://l.wordpress.com/latex.php?latex=%5Calpha+n.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\alpha n.' title='\alpha n.' class='latex' /> And then we repeat. </p>
<p>In fact, I&#8217;d rather do <img src='http://l.wordpress.com/latex.php?latex=10n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='10n' title='10n' class='latex' /> basic operations before randomly restricting to a set of size <img src='http://l.wordpress.com/latex.php?latex=m%3D%5Calpha+n.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m=\alpha n.' title='m=\alpha n.' class='latex' /> Then we can do <img src='http://l.wordpress.com/latex.php?latex=10m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='10m' title='10m' class='latex' /> basic operations on just those <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> bits before randomly restricting further, to a set of size <img src='http://l.wordpress.com/latex.php?latex=%5Calpha%5E2n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\alpha^2n' title='\alpha^2n' class='latex' /> bits. And so on until we get down to a constant number of bits, at which point we could just choose one.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Does that really avoid the problem that Luca Trevisan pointed out?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Yes. The big difference between this model and the previous one can be summed up as follows: the basic operations take random functions to random functions. If <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> are random functions, then <img src='http://l.wordpress.com/latex.php?latex=f%5Cvee+g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f\vee g' title='f\vee g' class='latex' /> will not be random, but if <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> is a random even permutation of <img src='http://l.wordpress.com/latex.php?latex=%5C%7B0%2C1%5C%7D%5En%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{0,1\}^n,' title='\{0,1\}^n,' class='latex' /> then the composition of <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> with one of the basic 3-bit permutations will also be a random even permutation.</p>
<p>If you like that model, then you&#8217;ll agree that the following question is quite a good formulation of what I was asking just now: if you generate a function randomly in that way, will it necessarily correlate significantly with a quadratic phase function? </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  I can see that it might be hard to prove that the answer is no, but don&#8217;t you at least have a heuristic argument that the answer should be no?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Can I chip in with what is either rubbish or an amusing observation?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Of course.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Well, if you managed to prove that every random linear-complexity function correlated significantly with some quadratic, then, as you point out, you&#8217;d have a property that distinguishes between linear-complexity functions and purely random functions. Now Razborov and Rudich tell us that such a property cannot be computable in polynomial time. However, the property in question certainly belongs to NP (since once you know <em>which</em> quadratic phase function you&#8217;re supposed to correlate with, it is easy to check that you really do). So in order to get this approach to work, wouldn&#8217;t you have to prove that P<img src='http://l.wordpress.com/latex.php?latex=%5Cne&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ne' title='\ne' class='latex' />NP?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  If that argument is correct, then it&#8217;s weird. It shows that an attempt to obtain merely superlinear lower bounds would prove superpolynomial bounds for a different problem. I can&#8217;t get my head round this.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  But I thought you said that Razborov and Rudich&#8217;s result was conditional on the existence of a strong hypothesis in complexity.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Ah yes. The argument I gave would rely on the existence of a certain sort of strong pseudorandom generator, which exists if factoring is hard. So one would have a proof that either factoring is easy or P<img src='http://l.wordpress.com/latex.php?latex=%5Cne&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ne' title='\ne' class='latex' />NP. But that&#8217;s trivial: if P=NP then factoring is easy.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I&#8217;m wondering if there is a non-trivial statement one can make here. What your argument would show is that either factoring is easy or the quadratic correlation problem&#8212;let me call it QC&#8212;cannot be solved in polynomial time. But I don&#8217;t see any reason to suppose that QC is NP-complete, so although the intractability of QC would imply that P<img src='http://l.wordpress.com/latex.php?latex=%5Cne&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ne' title='\ne' class='latex' />NP, the converse is far from obvious. So we&#8217;d have a proof that the hardness of factoring implies the hardness of QC. Or to put it another way, if we can solve QC then we can use that to factor quickly. That doesn&#8217;t seem trivial to me.</p>
<p>Of course, all this is so utterly conditional on statements we don&#8217;t know how to prove and don&#8217;t even necessarily believe that it is not all that exciting.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Which statements?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Well, it depends on a proof that linear-complexity functions all correlate with at least one quadratic phase function, which I don&#8217;t have any reason to believe, and it then depends on finding a clever algorithm for QC, which I am at best neutral about. (An argument in favour of such an algorithm existing is that it exists for linear correlations&#8212;take the Fourier transform&#8212;and there are at least some quadratic analogues of linear Fourier analysis. But those analogues are far far weaker than what would be needed, and for correlations as small as the ones that would interest us it is not clear that anything can be said at all.)</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  You say that you don&#8217;t have any reason to believe that every linear-complexity function correlates substantially with at least one quadratic phase function. But so far you haven&#8217;t given us any reason to believe the opposite either.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I know. So far all I have to say in that direction is that a surprising event like a quadratic correlation ought to happen, if it happens, for a reason. And I don&#8217;t see a reason. It&#8217;s a bit like most people&#8217;s belief that <img src='http://l.wordpress.com/latex.php?latex=%5Cpi&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi' title='\pi' class='latex' /> is normal because there is no earthly reason to suppose that the digits of <img src='http://l.wordpress.com/latex.php?latex=%5Cpi&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi' title='\pi' class='latex' /> would do anything special.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  But for that analogy to work, you need to convince me that correlating a bit more than a random function would with just one out of the <img src='http://l.wordpress.com/latex.php?latex=2%5E%7Bcn%5E2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{cn^2}' title='2^{cn^2}' class='latex' /> quadratic phase functions counts as &#8220;doing something special&#8221;. Perhaps it&#8217;s just what one would expect.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I think I see what you&#8217;re getting at, and it&#8217;s similar to the discussion we were having earlier. There are <img src='http://l.wordpress.com/latex.php?latex=n%5E%7BCn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^{Cn}' title='n^{Cn}' class='latex' /> functions of linear circuit complexity, so no probabilistic statements we make can hope to be accurate to better than <img src='http://l.wordpress.com/latex.php?latex=n%5E%7B-Cn%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^{-Cn}.' title='n^{-Cn}.' class='latex' /> Furthermore, we actually expect them to be accurate to no better than <img src='http://l.wordpress.com/latex.php?latex=2%5E%7B-Cn%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{-Cn}.' title='2^{-Cn}.' class='latex' /> If we believe strongly enough that detecting small quadratic correlations cannot be done in polynomial time, then we might argue that there is basically nothing we can say about the probability of such correlations occurring: the probability would be much smaller than exponential for a purely random function, but for pseudorandom functions we cannot say anything has a subexponential probability, so it could be that every linear-complexity function correlates with some quadratic phase function.</p>
<p>But if that were the case, how on earth might it be proved? One approach might be to show that each time you do a basic 3-bit operation, in some sense you multiply by a constant the number of quadratic phase functions that you no longer correlate with. Since there are <img src='http://l.wordpress.com/latex.php?latex=2%5E%7Bcn%5E2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{cn^2}' title='2^{cn^2}' class='latex' /> quadratic phase functions, it might even take quadratically long to end up correlating with none of them.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> That sounds worryingly optimistic.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I agree, so let me try to think about what I even meant in that heuristic argument, which as it stands is nonsense of course.</p>
<p>Let <img src='http://l.wordpress.com/latex.php?latex=f_1%2Cf_2%2C%5Cdots&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_1,f_2,\dots' title='f_1,f_2,\dots' class='latex' /> be a random sequence of functions, each produced from the previous one by using a random 3-bit operation (with occasional random restrictions to a fraction <img src='http://l.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\alpha' title='\alpha' class='latex' /> of the bits, as above). Let us also fix a Boolean function <img src='http://l.wordpress.com/latex.php?latex=g.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g.' title='g.' class='latex' /> My guess is that the correlation of <img src='http://l.wordpress.com/latex.php?latex=f_r%28x%29_1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_r(x)_1' title='f_r(x)_1' class='latex' /> with <img src='http://l.wordpress.com/latex.php?latex=g%28x%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g(x)' title='g(x)' class='latex' /> will behave a bit like a random walk with drift: that is, it will multiply or divide by a constant amount at each step, but the expectation of the log of the factor is negative rather than zero.</p>
<p>Now <em>if</em> one could say that all these random walks (for different quadratic phase functions <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' />) behaved reasonably independently of one another, then one might guess that after <img src='http://l.wordpress.com/latex.php?latex=Cn&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Cn' title='Cn' class='latex' /> steps, an exponentially small fraction of them had done stupid things like never really getting any smaller. But that &#8220;reasonable independence&#8221; assumption looks decidedly shaky.  </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I think we ought to separate out two ideas here. The first is an attempt to prove superlinear circuit lower bounds by first showing that <em>every</em> function of linear complexity correlates heavily with some quadratic phase function. Of course, then one would have to find a function that did not correlate heavily with any quadratic phase function, which could be hard. Indeed, it might require one to improve on the result of Viola and Wigderson.</p>
<p>But for the discussion we have just been having about the hardness of QC, we didn&#8217;t need <em>every</em> function of linear complexity to correlate with a quadratic phase function. All we needed was for a substantial fraction of them to do so. After all, virtually no random functions do, so that would give us a way of distinguishing pseudorandom from random, which in turn would prove that either QC is hard or factoring is easy.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  I&#8217;m slightly troubled by that last step. Wouldn&#8217;t one need to use Razborov and Rudich&#8217;s model of random low-complexity functions if one wanted to say anything about factoring?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Oh dear, you&#8217;re right. I think the conclusion one would reach is not interesting after all. So perhaps one would indeed need to prove that <em>every</em> function of linear complexity had a large quadratic correlation. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I think it is nevertheless helpful to separate out the two problems. I&#8217;d be very interested in a proof that almost every function of linear circuit complexity correlated quite well with at least one quadratic phase function. It would at least show that there are <em>some</em> differences between random low-complexity functions and genuinely random functions. And perhaps it would give a clue about how to change &#8220;almost every&#8221; to &#8220;every&#8221;.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Unless it was true for &#8220;almost every&#8221; but false for &#8220;every&#8221;.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Fine, but we&#8217;d still have the random/pseudorandom distinction, which would be pretty nice I think.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I&#8217;d like to give a possible heuristic argument that goes in the opposite direction to yours.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  OK, that sounds interesting.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Well, I was convinced by your random-walk-with-drift idea when it came to individual correlations, but I think you are right to be suspicious of the independence. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Do you have some specific reason for that?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Yes. In fact, when I think about it a bit more I realize even the random-walk-with-drift idea could be wrong. I think what was in the back of your mind was that when you apply a basic 3-bit operation, it will usually chop things up, but it might put back together again something that you have already chopped up.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Something like that. In an extreme case it might even invert the previous basic 3-bit operation.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Right, but that happens with probability <img src='http://l.wordpress.com/latex.php?latex=Cn%5E%7B-3%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Cn^{-3},' title='Cn^{-3},' class='latex' /> and it seems to me that in fact the tendency to get more complicated is very strong, and the &#8220;stupid&#8221; examples will occur with probability <img src='http://l.wordpress.com/latex.php?latex=n%5E%7B-cm%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^{-cm}' title='n^{-cm}' class='latex' /> rather than <img src='http://l.wordpress.com/latex.php?latex=e%5E%7B-cm%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='e^{-cm}.' title='e^{-cm}.' class='latex' /> And they will be <em>very</em> stupid examples that do things like inverting previous operations a lot.</p>
<p>If that is correct, then perhaps one could prove that every composition of basic 3-bit operations that obeys a few simple rules, such as not permuting the same set of <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> bits too many times in a row (which would allow cancellation to take place), will end up failing to correlate with <img src='http://l.wordpress.com/latex.php?latex=g%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g,' title='g,' class='latex' /> independently of which quadratic phase function <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> one has chosen. Perhaps it just becomes &#8220;less quadratic&#8221;.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  You&#8217;re beginning to convince me, or at the very least to weaken my already weak belief that this quadratic correlation idea could work.</p>
<p>But if your heuristic argument is correct, can we turn it into a much more general barrier? After all, it seems to be defeating a proof that is not obviously naturalizable.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I think you&#8217;re jumping ahead a little fast. To get my counterargument to work one would have to understand what it was about quadratic phase functions that caused &#8220;sensible&#8221; products of basic 3-bit operations to correlate less and less with them. Only then could one think about whether other classes of functions might have a similar property. But I agree that it would be interesting if we could identify a class of NP properties that were unlikely to be usable in any proof that P<img src='http://l.wordpress.com/latex.php?latex=%5Cne&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ne' title='\ne' class='latex' />NP.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Before you dive into that, could you explain more clearly what you mean by a &#8220;sensible&#8221; product?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I can try. What I&#8217;m trying to formulate is a sort of non-triviality condition on products of basic 3-bit operations. For technical reasons I&#8217;m going to restrict my attention to just one 3-bit operation, which works like this. You choose an ordered triple <img src='http://l.wordpress.com/latex.php?latex=%28i%2Cj%2Ck%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(i,j,k)' title='(i,j,k)' class='latex' /> of distinct elements of <img src='http://l.wordpress.com/latex.php?latex=%5C%7B1%2C2%2C%5Cdots%2Cn%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{1,2,\dots,n\}' title='\{1,2,\dots,n\}' class='latex' /> and then given an <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' />-bit sequence <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> you look at the triple <img src='http://l.wordpress.com/latex.php?latex=%28x_i%2Cx_j%2Cx_k%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(x_i,x_j,x_k).' title='(x_i,x_j,x_k).' class='latex' /> If this equals <img src='http://l.wordpress.com/latex.php?latex=%280%2C0%2C1%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(0,0,1)' title='(0,0,1)' class='latex' /> then you change it to <img src='http://l.wordpress.com/latex.php?latex=%281%2C1%2C0%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(1,1,0)' title='(1,1,0)' class='latex' /> and if it equals <img src='http://l.wordpress.com/latex.php?latex=%281%2C1%2C0%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(1,1,0)' title='(1,1,0)' class='latex' /> you change it to <img src='http://l.wordpress.com/latex.php?latex=%280%2C0%2C1%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(0,0,1).' title='(0,0,1).' class='latex' /> Otherwise, you leave it alone. For example, if <img src='http://l.wordpress.com/latex.php?latex=n%3D5&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n=5' title='n=5' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%28i%2Cj%2Ck%29%3D%282%2C3%2C5%29%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(i,j,k)=(2,3,5),' title='(i,j,k)=(2,3,5),' class='latex' /> then the sequence <img src='http://l.wordpress.com/latex.php?latex=%281%2C0%2C0%2C0%2C1%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(1,0,0,0,1)' title='(1,0,0,0,1)' class='latex' /> maps to <img src='http://l.wordpress.com/latex.php?latex=%281%2C1%2C1%2C0%2C0%29%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(1,1,1,0,0),' title='(1,1,1,0,0),' class='latex' /> while the sequence <img src='http://l.wordpress.com/latex.php?latex=%281%2C1%2C0%2C1%2C0%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(1,1,0,1,0)' title='(1,1,0,1,0)' class='latex' /> maps to itself.</p>
<p>I haven&#8217;t checked, but I&#8217;m fairly sure that these operations are enough to generate all even permutations of <img src='http://l.wordpress.com/latex.php?latex=%5C%7B0%2C1%5C%7D%5En%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{0,1\}^n,' title='\{0,1\}^n,' class='latex' /> and if they do then they will do so in a rapidly mixing way.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  They clearly don&#8217;t generate all even permutations of <img src='http://l.wordpress.com/latex.php?latex=%5C%7B0%2C1%5C%7D%5En.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{0,1\}^n.' title='\{0,1\}^n.' class='latex' /></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Oh, really? Why not?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Because the all-1s and all-0s sequences are fixed points of all those transformations.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Oh yes, of course. Well, even so I&#8217;m pretty sure they generate a very large group&#8212;possibly even all even permutations that fix those two sequences. Let&#8217;s not worry about that too much for now.</p>
<p>Instead, let us ask ourselves how cancellation can occur when we take a product of these generators. One way is trivial: if cancellation would have occurred even in the free group, then it occurs here. In other words, this is the cancellation you get just by removing inverse pairs. But there is also a less trivial form of cancellation that follows from the fact that the group is finite: we know that if we take all products of <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> generators, then when <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> is large enough we must have a lot of equalities. So we can take two products of generators that give you the same permutation and multiply one by the inverse of the other. In that way you get very random looking products of generators that miraculously cancel. However, the miracle depends on the products being very long indeed. To be more precise about it, the number of generators is <img src='http://l.wordpress.com/latex.php?latex=Cn%5E3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Cn^3' title='Cn^3' class='latex' /> and the number of even permuations of <img src='http://l.wordpress.com/latex.php?latex=%5C%7B0%2C1%5C%7D%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{0,1\}^n' title='\{0,1\}^n' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=2%5E%7Bcn2%5En%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{cn2^n},' title='2^{cn2^n},' class='latex' /> so to start getting lots of coincidences you need to multiply at least <img src='http://l.wordpress.com/latex.php?latex=cn2%5En%2F%5Clog+n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='cn2^n/\log n' title='cn2^n/\log n' class='latex' /> generators together.</p>
<p>However, this observation leads to an intermediate way of obtaining cancellation, which is to take a smallish integer <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> and look at products of generators that act on some fixed set of <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> bits. Now if you want to get coincidences, you only need to multiply <img src='http://l.wordpress.com/latex.php?latex=ck2%5Ek%2F%5Clog+k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='ck2^k/\log k' title='ck2^k/\log k' class='latex' /> generators together.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Right, but what&#8217;s the point of these observations?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I&#8217;m coming to that. One might make the bold conjecture that the cancellations I have just talked about are in some sense the only ones that can occur. And in fact the first type (cancelling inverse pairs) is the special case of the second in which you take <img src='http://l.wordpress.com/latex.php?latex=k%3D3.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k=3.' title='k=3.' class='latex' /> </p>
<p>Loosely, one could state the conjecture as follows: the only way of getting the identity as a product of these 3-bit operations is by means of cancellations that arise when you stick around in some fixed set of bits for a long time (relative to the size of that set of bits).</p>
<p>Turning that round, the statement would be that as long as you spread the operations around enough then you are guaranteed to produce a non-trivial element of the group <img src='http://l.wordpress.com/latex.php?latex=A_%7B2%5En%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A_{2^n}.' title='A_{2^n}.' class='latex' /> And one could also propose a stronger heuristic principle that the function you get is maximally complex, in a sense that I haven&#8217;t worked out how to make precise. But a consequence would be that, for example, if <img src='http://l.wordpress.com/latex.php?latex=%5Cpi&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi' title='\pi' class='latex' /> is a product of <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> 3-bit operations that obeys the non-triviality condition, then the correlation of the first bit of <img src='http://l.wordpress.com/latex.php?latex=%5Cpi%28x%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi(x)' title='\pi(x)' class='latex' /> with any fixed quadratic phase function decays exponentially.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Don&#8217;t you have the problem that the first bit of <img src='http://l.wordpress.com/latex.php?latex=%5Cpi%28x%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi(x)' title='\pi(x)' class='latex' /> might not depend on all that many of the basic operations you use?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Yes. I think what I mean is something like that the decay is exponential in the number of basic operations that actually make some contribution to the answer. I&#8217;ll have to think how to formulate that properly. </p>
<p>Before I do that, I want to make an important remark, which is that two basic 3-bit operations that act on disjoint sets of bits commute with each other. So the &#8220;trivial&#8221; cancellations I&#8217;m talking about don&#8217;t necessarily happen between consecutive terms in the product. </p>
<p>Here&#8217;s a way I like to visualize it. Let&#8217;s imagine we have an array of <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> dots, one for each bit. We now start applying basic operations. Associated with each operation is a subset of <img src='http://l.wordpress.com/latex.php?latex=%5C%7B1%2C2%2C%5Cdots%2Cn%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{1,2,\dots,n\}' title='\{1,2,\dots,n\}' class='latex' /> of size 3. Let <img src='http://l.wordpress.com/latex.php?latex=A_1%2CA_2%2C%5Cdots%2CA_m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A_1,A_2,\dots,A_m' title='A_1,A_2,\dots,A_m' class='latex' /> be the sequence of all these subsets, and form a graph inductively as follows. The vertices of the graph are ordered pairs <img src='http://l.wordpress.com/latex.php?latex=%28i%2Cu%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(i,u)' title='(i,u)' class='latex' /> of integers. You start with the graph <img src='http://l.wordpress.com/latex.php?latex=G_0%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_0,' title='G_0,' class='latex' /> which consists of <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> isolated vertices <img src='http://l.wordpress.com/latex.php?latex=%281%2C0%29%2C%5Cdots%2C%28n%2C0%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(1,0),\dots,(n,0).' title='(1,0),\dots,(n,0).' class='latex' /> (These are the <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> dots I just mentioned.) Then if <img src='http://l.wordpress.com/latex.php?latex=A_1%3D%5C%7Bi%2Cj%2Ck%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A_1=\{i,j,k\}' title='A_1=\{i,j,k\}' class='latex' /> you add in vertices <img src='http://l.wordpress.com/latex.php?latex=%28i%2C1%29%2C%28j%2C1%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(i,1),(j,1)' title='(i,1),(j,1)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%28k%2C1%29%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(k,1),' title='(k,1),' class='latex' /> joining all three of them to all three of <img src='http://l.wordpress.com/latex.php?latex=%28i%2C0%29%2C%28j%2C0%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(i,0),(j,0)' title='(i,0),(j,0)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%28k%2C0%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(k,0)' title='(k,0)' class='latex' />. That gives you the graph <img src='http://l.wordpress.com/latex.php?latex=G_1.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_1.' title='G_1.' class='latex' /> And you then continue this process. If you&#8217;ve got <img src='http://l.wordpress.com/latex.php?latex=G_%7Br-1%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_{r-1},' title='G_{r-1},' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=A_r%3D%5C%7Bi%2Cj%2Ck%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A_r=\{i,j,k\}' title='A_r=\{i,j,k\}' class='latex' /> (not the same <img src='http://l.wordpress.com/latex.php?latex=%5C%7Bi%2Cj%2Ck%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{i,j,k\}' title='\{i,j,k\}' class='latex' />), then you add new vertices <img src='http://l.wordpress.com/latex.php?latex=%28i%2Cu%29%2C%28j%2Cv%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(i,u),(j,v)' title='(i,u),(j,v)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%28k%2Cw%29%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(k,w),' title='(k,w),' class='latex' /> choosing for <img src='http://l.wordpress.com/latex.php?latex=u%2Cv&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='u,v' title='u,v' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=w&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='w' title='w' class='latex' /> the smallest integers that have not yet been used (so <img src='http://l.wordpress.com/latex.php?latex=u&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='u' title='u' class='latex' /> will be the number of <img src='http://l.wordpress.com/latex.php?latex=s%5Cleq+r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='s\leq r' title='s\leq r' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=i%5Cin+A_s&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i\in A_s' title='i\in A_s' class='latex' />) and joining all of <img src='http://l.wordpress.com/latex.php?latex=%28i%2Cu%29%2C%28j%2Cv%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(i,u),(j,v)' title='(i,u),(j,v)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%28k%2Cw%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(k,w)' title='(k,w)' class='latex' /> to all of <img src='http://l.wordpress.com/latex.php?latex=%28i%2Cu-1%29%2C%28j%2Cv-1%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(i,u-1),(j,v-1)' title='(i,u-1),(j,v-1)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%28k%2Cw-1%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(k,w-1).' title='(k,w-1).' class='latex' /> Let me also imagine that these edges are directed downwards. And let me say that the <em>index</em> of a vertex <img src='http://l.wordpress.com/latex.php?latex=%28i%2Cu%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(i,u)' title='(i,u)' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=i.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i.' title='i.' class='latex' /></p>
<p>Given any set of vertices I can follow the paths down from that set. I call the graph non-trivial if I can&#8217;t find some small set of <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> vertices such that when I follow the paths downwards I go a very long way without involving any new indices. And if the graph is non-trivial, then I say that the product of generators was non-trivial.</p>
<p>One can look at this as a conjecture that there is a sort of pseudo group homomorphism. Whenever I <em>do</em> have a bunch of generators that sticks around in the same place for too long (possibly after commuting them with other generators to get them together) then I&#8217;m allowed to cancel them. So I&#8217;m imagining a non-existent group where the generators are sets of size 3 and the relations are that disjoint sets of size 3 commute and all (rather than just some) large bunches of sets of size 3 that are subsets of a fixed set of size <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> cancel.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Am I right in thinking that you&#8217;re introducing these thoughts in order to try to come up with some kind of heuristic principle that would suggest that norms like the quadratic correlation norm will not help us?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Yes, I was going to return to that, though I&#8217;m not sure I know how to do it.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Well while you&#8217;re thinking about it, let me float another idea, which is that if what you say is correct, then perhaps it could be used to prove lower bounds.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Eh? I was trying to use it to show that it was <em>difficult</em> to prove lower bounds.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Yes, but if what you say is true, then it seems to establish a very strong principle that says that you can&#8217;t compute the same function in two essentially different ways. So perhaps one could prove a lower bound by finding an NP function and a method of computing it that didn&#8217;t &#8220;collapse&#8221; when you &#8220;mapped it to your imaginary pseudogroup&#8221;.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Oh. I see what you mean. Of course, I was talking about reversible computations, but I don&#8217;t know whether I was relying on that. Perhaps I could have just added some extra &#8220;rubbish&#8221; bits and argued in more or less the same way.</p>
<p>I&#8217;m inclined to take what you say not as a promising avenue for proving complexity lower bounds, but rather as a strong indication that my &#8220;bold conjecture&#8221; is far too bold. After all, it would seem that it <em>is</em> often possible to compute the same function in radically different ways. For example, to work out <img src='http://l.wordpress.com/latex.php?latex=a%5E%7B6%5Ek%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a^{6^k}' title='a^{6^k}' class='latex' /> mod <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> I could start with <img src='http://l.wordpress.com/latex.php?latex=a%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a,' title='a,' class='latex' /> cube it <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> times and then square that <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> times, or I could do the squaring first and then the cubing. That looks like two genuinely different ways of working out the function. </p>
<p>It now looks to me as though the word problem in <img src='http://l.wordpress.com/latex.php?latex=A_%7B2%5En%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A_{2^n},' title='A_{2^n},' class='latex' /> with 3-bit operations as generators, is going to be fantastically difficult. Of course, nobody would expect it to be possible to determine in polynomial time whether a given word gave you the identity, but I was hoping for something much less: finding a tidy sufficient condition for a word not to be the identity. But it seems that even this is far too much to ask. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Are you sure? Wouldn&#8217;t it be the case that in order to do sensible computations like raising <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> to the power <img src='http://l.wordpress.com/latex.php?latex=6%5Ek&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='6^k' title='6^k' class='latex' /> mod <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> you have to do quite a lot of &#8220;sticking around in the same small set of bits&#8221;? Perhaps your conjecture is true, and our impression that  there are often genuinely different ways of computing the same function is an illusion, because we don&#8217;t know of <em>any</em> &#8220;non-collapsing&#8221; computation techniques. Maybe if you&#8217;re very strict about spreading the sets <img src='http://l.wordpress.com/latex.php?latex=A_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A_i' title='A_i' class='latex' /> about, then you&#8217;ll only ever get very random-looking functions.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I see your point. I now don&#8217;t know what to believe.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Neither do I but I think it&#8217;s a nice question. Is there some sense in which computation takes place in an &#8220;almost free&#8221; group? Roughly speaking, this should mean that two circuits cannot give the same function unless they operate on similar sets of bits in a similar order (in so far as the order matters). </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  I think I have an argument against that.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Oh really? What is it?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  It&#8217;s that if you take two unsatisfiable bunches of clauses you could create a function that&#8217;s identically zero in two different ways. And you could choose those sets of clauses randomly&#8212;if you have a reasonable number of them then they will give you unsatisfiable formulae.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Oh yes, I was a little careless there. But I think the question still makes sense if we look at basic 3-bit operations rather than AND, OR and NOT gates and consider whether two essentially different products can be equal.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  But I thought you said you could simulate AND, OR and NOT gates using basic 3-bit operations.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  You can, but in order to do so you have to introduce extra &#8220;rubbish&#8221; bits that start out as 0s and end up scrambled in some peculiar way that depends on the computation you carried out. So with your example of two different ways of calculating a function that&#8217;s identically zero, the difference would show up in the rubbish bits.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  I see.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  So I think the bold conjecture, if we can really call it a conjecture, is still alive, but one has to understand that it is very much a conjecture about reversible computations. Unfortunately, I don&#8217;t know enough about combinatorial/geometric group theory to know how to tackle such a problem: that is, a problem where one is trying to prove that a word is non-trivial, using certain features of how the word is put together. Does anyone know of any results of that kind?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I don&#8217;t.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Neither do I, I&#8217;m afraid.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  In that case, can I float past you two more ideas that have occurred to me?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Of course.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Well, you wanted to prove that &#8220;sensible&#8221; products of basic 3-bit operations would give you functions that became &#8220;less and less quadratic&#8221;. I interpret that as follows. If <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_m%5Cdots%5Cphi_1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\phi_m\dots\phi_1' title='\phi_m\dots\phi_1' class='latex' /> is a &#8220;sensible&#8221; product of <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> basic operations and we write <img src='http://l.wordpress.com/latex.php?latex=f_r%3D%5Cphi_r%5Cdots%5Cphi_1%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_r=\phi_r\dots\phi_1,' title='f_r=\phi_r\dots\phi_1,' class='latex' /> then the correlation of <img src='http://l.wordpress.com/latex.php?latex=f_r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_r' title='f_r' class='latex' /> with <em>any</em> quadratic decreases exponentially with <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> until it becomes almost minimal.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I don&#8217;t know that I actually believe that, but it&#8217;s roughly the hypothesis that I was entertaining in order to cast doubt on your argument that functions of linear circuit complexity might magically correlate with quadratic phase functions.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  The first thought is that it might follow from the &#8220;bold conjecture&#8221;. I&#8217;m going to oversimplify a few details here, but the bold conjecture has as a consequence that no &#8220;sensible&#8221; product of basic transformations can ever compute a function that can be computed by a &#8220;collapsing&#8221; product. Since the only computations we can think of tend to involve sticking around for a long time dealing with small sets of bits (though perhaps I am wrong about this &#8212; for example, does the PCP theorem give rise to &#8220;weird&#8221; computations that can be described by means of &#8220;sensible&#8221; products?), that would imply that no &#8220;sensible&#8221; product can ever give us any of the functions that we know and love.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> At least if they can be computed reversibly, which is a big restriction.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Yes, you&#8217;re right. But it suggests that if we could find a reversible way of creating a quadratic polynomial, then maybe it would be reasonable to conjecture that a &#8220;sensible&#8221; product of basic transformations would automatically not correlate with it.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Aren&#8217;t you confusing permutations of the cube with Boolean functions defined on the cube? </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Yes, but that was deliberate &#8212; it was the detail I was leaving out. I don&#8217;t know the best way of formulating a conjecture along these lines, but maybe if one could reversibly create a permutation that was quadratic in each coordinate, or something like that, then one could get the argument to work.</p>
<p>The big problem is that proving the general principle that the 3-bit operations create a group that is in some sense &#8220;free apart from the obvious kinds of relations&#8221; looks pretty hard. But my other thought was that we could at least have a look at what happens in the <em>linear</em> case, because I think it is reasonably straightforward to see what a basic 3-bit operation does to the Fourier transform of a function.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  How do you mean?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Well, a basic 3-bit operation is a permutation of the cube <img src='http://l.wordpress.com/latex.php?latex=%5C%7B0%2C1%5C%7D%5En%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{0,1\}^n,' title='\{0,1\}^n,' class='latex' /> but we can also think of it as a linear map that takes functions defined on the cube to functions defined on the cube. We can then change basis from the delta functions at each vertex to the Walsh functions. That is, we see what happens to the Fourier transform.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  I think I&#8217;d like to see this in a bit more detail.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  All right. Let <img src='http://l.wordpress.com/latex.php?latex=w_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='w_A' title='w_A' class='latex' /> stand for the function <img src='http://l.wordpress.com/latex.php?latex=x%5Cmapsto+%28-1%29%5E%7Bx.A%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x\mapsto (-1)^{x.A},' title='x\mapsto (-1)^{x.A},' class='latex' /> where I have written <img src='http://l.wordpress.com/latex.php?latex=x.A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x.A' title='x.A' class='latex' /> for the parity of <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> in <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' />. (In other words, it&#8217;s 1 if <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> has an odd number of 1s in <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='0' title='0' class='latex' /> if it has an even number of 1s.) Now let&#8217;s take as an example the basic transformation <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\phi' title='\phi' class='latex' /> that acts on the bits <img src='http://l.wordpress.com/latex.php?latex=x_i%2Cx_j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x_i,x_j' title='x_i,x_j' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=x_k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x_k' title='x_k' class='latex' /> and interchanges <img src='http://l.wordpress.com/latex.php?latex=000&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='000' title='000' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=111&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='111' title='111' class='latex' />, leaving <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> fixed if it is not the case that <img src='http://l.wordpress.com/latex.php?latex=x_i%3Dx_j%3Dx_k.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x_i=x_j=x_k.' title='x_i=x_j=x_k.' class='latex' /> The way that <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\phi' title='\phi' class='latex' /> acts as a linear map is via the formula <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%28f%29%28x%29%3Df%28%5Cphi%5E%7B-1%7D%28x%29%29%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\phi(f)(x)=f(\phi^{-1}(x)),' title='\phi(f)(x)=f(\phi^{-1}(x)),' class='latex' /> which equals <img src='http://l.wordpress.com/latex.php?latex=f%28%5Cphi%28x%29%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(\phi(x))' title='f(\phi(x))' class='latex' /> since <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\phi' title='\phi' class='latex' /> is its own inverse. </p>
<p>What is the expansion of <img src='http://l.wordpress.com/latex.php?latex=w_A%28%5Cphi%28x%29%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='w_A(\phi(x))' title='w_A(\phi(x))' class='latex' /> in terms of Walsh functions? By definition it is <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BE%7D_xw_A%28%5Cphi%28x%29%29w_B%28x%29%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{E}_xw_A(\phi(x))w_B(x),' title='\mathbb{E}_xw_A(\phi(x))w_B(x),' class='latex' /> which equals </p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle%5Cmathbb%7BE%7D_x%28-1%29%5E%7B%5Cphi%28x%29.A%2Bx.B%7D%3D%5Cmathbb%7BE%7D_%7Bx_1%2C%5Cdots%2Cx_n%7D%5Cprod_%7Bt%3D1%7D%5En%28-1%29%5E%7B%5Cphi%28x%29_ta_t%2Bx_tb_t%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle\mathbb{E}_x(-1)^{\phi(x).A+x.B}=\mathbb{E}_{x_1,\dots,x_n}\prod_{t=1}^n(-1)^{\phi(x)_ta_t+x_tb_t},' title='\displaystyle\mathbb{E}_x(-1)^{\phi(x).A+x.B}=\mathbb{E}_{x_1,\dots,x_n}\prod_{t=1}^n(-1)^{\phi(x)_ta_t+x_tb_t},' class='latex' /> </p>
<p>where we are setting <img src='http://l.wordpress.com/latex.php?latex=a_t%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a_t=1' title='a_t=1' class='latex' /> if <img src='http://l.wordpress.com/latex.php?latex=t%5Cin+A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='t\in A' title='t\in A' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='0' title='0' class='latex' /> otherwise, and similarly for <img src='http://l.wordpress.com/latex.php?latex=b_t&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='b_t' title='b_t' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=B.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='B.' title='B.' class='latex' /></p>
<p>We can split the product up according to whether or not <img src='http://l.wordpress.com/latex.php?latex=t%5Cin%5C%7Bi%2Cj%2Ck%5C%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='t\in\{i,j,k\}.' title='t\in\{i,j,k\}.' class='latex' /> To simplify the expression, let us take (without loss of generality) <img src='http://l.wordpress.com/latex.php?latex=%5C%7Bi%2Cj%2Ck%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{i,j,k\}' title='\{i,j,k\}' class='latex' /> to be <img src='http://l.wordpress.com/latex.php?latex=%5C%7B1%2C2%2C3%5C%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{1,2,3\}.' title='\{1,2,3\}.' class='latex' /> If we do all this, then we obtain the expression</p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle+%5Cmathbb%7BE%7D_%7Bx_1%2C%5Cdots%2Cx_n%7D%5Cprod_%7Bi%3D1%7D%5E3%28-1%29%5E%7B%5Cphi%28x%29_ia_i%2Bx_ib_i%7D%5Cprod_%7Bi%3D4%7D%5En%28-1%29%5E%7Bx_i%28a_i%2Bb_i%29%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle \mathbb{E}_{x_1,\dots,x_n}\prod_{i=1}^3(-1)^{\phi(x)_ia_i+x_ib_i}\prod_{i=4}^n(-1)^{x_i(a_i+b_i)}.' title='\displaystyle \mathbb{E}_{x_1,\dots,x_n}\prod_{i=1}^3(-1)^{\phi(x)_ia_i+x_ib_i}\prod_{i=4}^n(-1)^{x_i(a_i+b_i)}.' class='latex' /></p>
<p>Now we can think of <img src='http://l.wordpress.com/latex.php?latex=x_1%2C%5Cdots%2Cx_n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x_1,\dots,x_n' title='x_1,\dots,x_n' class='latex' /> as independent random variables, and if we do then the expectation of the product above can be rewritten as a product of expectations. But <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BE%7D_%7Bx_i%7D%28-1%29%5E%7Bx_i%28a_i%2Bb_i%29%7D%3D0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{E}_{x_i}(-1)^{x_i(a_i+b_i)}=0' title='\mathbb{E}_{x_i}(-1)^{x_i(a_i+b_i)}=0' class='latex' /> unless <img src='http://l.wordpress.com/latex.php?latex=a_i%3Db_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a_i=b_i' title='a_i=b_i' class='latex' />, in which case it is 1. Therefore, the entire expression is zero unless <img src='http://l.wordpress.com/latex.php?latex=a_i%3Db_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a_i=b_i' title='a_i=b_i' class='latex' /> for every <img src='http://l.wordpress.com/latex.php?latex=i%5Cgeq+4.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i\geq 4.' title='i\geq 4.' class='latex' /> This tells us that <img src='http://l.wordpress.com/latex.php?latex=%5Clangle+w_A%5Ccirc%5Cphi%2Cw_B%5Crangle%3D0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\langle w_A\circ\phi,w_B\rangle=0' title='\langle w_A\circ\phi,w_B\rangle=0' class='latex' /> for every <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='B' title='B' class='latex' /> that does not equal <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> outside the set <img src='http://l.wordpress.com/latex.php?latex=%5C%7B1%2C2%2C3%5C%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{1,2,3\}.' title='\{1,2,3\}.' class='latex' /> (More generally, this will be true for <img src='http://l.wordpress.com/latex.php?latex=%5C%7Bi%2Cj%2Ck%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{i,j,k\}' title='\{i,j,k\}' class='latex' /> for the same reason.) Another observation is that if <img src='http://l.wordpress.com/latex.php?latex=%7CA%5Ccap%5C%7Bi%2Cj%2Ck%5C%7D%7C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='|A\cap\{i,j,k\}|' title='|A\cap\{i,j,k\}|' class='latex' /> is even, then replacing <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> by <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%28x%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\phi(x)' title='\phi(x)' class='latex' /> does not change the parity of the restriction to <img src='http://l.wordpress.com/latex.php?latex=A%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A,' title='A,' class='latex' /> so in that case <img src='http://l.wordpress.com/latex.php?latex=w_A%28%5Cphi%28x%29%29%3Dw_A%28x%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='w_A(\phi(x))=w_A(x)' title='w_A(\phi(x))=w_A(x)' class='latex' /> for every <img src='http://l.wordpress.com/latex.php?latex=x%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x,' title='x,' class='latex' /> which tells us that <img src='http://l.wordpress.com/latex.php?latex=%5Clangle+w_A%5Ccirc%5Cphi%2Cw_B%5Crangle%3D%5Cdelta_%7BA%2CB%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\langle w_A\circ\phi,w_B\rangle=\delta_{A,B}.' title='\langle w_A\circ\phi,w_B\rangle=\delta_{A,B}.' class='latex' /> If, however, <img src='http://l.wordpress.com/latex.php?latex=%7CA%5Ccap%5C%7Bi%2Cj%2Ck%5C%7D%7C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='|A\cap\{i,j,k\}|' title='|A\cap\{i,j,k\}|' class='latex' /> is odd, then a small calculation shows that <img src='http://l.wordpress.com/latex.php?latex=%5Clangle+w_A%5Ccirc%5Cphi%2Cw_B%5Crangle&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\langle w_A\circ\phi,w_B\rangle' title='\langle w_A\circ\phi,w_B\rangle' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=%5Cpm+1%2F2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pm 1/2' title='\pm 1/2' class='latex' /> for four of the eight sets <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='B' title='B' class='latex' /> that agree with <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> outside <img src='http://l.wordpress.com/latex.php?latex=%5C%7Bi%2Cj%2Ck%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{i,j,k\}' title='\{i,j,k\}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='0' title='0' class='latex' /> for the other four.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Wow, that was more detail than I was expecting.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Well if you want to skip the calculations, just think about the conclusion. What we find is that the effect of <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\phi' title='\phi' class='latex' /> on a Walsh function <img src='http://l.wordpress.com/latex.php?latex=w_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='w_A' title='w_A' class='latex' /> is either to leave it unchanged or to replace it by a linear combination of four Walsh functions <img src='http://l.wordpress.com/latex.php?latex=w_B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='w_B' title='w_B' class='latex' />, where the sets <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='B' title='B' class='latex' /> are close to <img src='http://l.wordpress.com/latex.php?latex=A.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A.' title='A.' class='latex' /> </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I simply <em>have</em> to make a remark here. What you have is an orthogonal map that takes the space of all linear combinations of Walsh functions, which we can identify with functions defined on <img src='http://l.wordpress.com/latex.php?latex=%5C%7B0%2C1%5C%7D%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{0,1\}^n' title='\{0,1\}^n' class='latex' />, to itself. Moreover, it&#8217;s a particularly simple orthogonal map in that it takes each Walsh function to a linear combination of Walsh functions that correspond to sets that differ only in the places <img src='http://l.wordpress.com/latex.php?latex=i%2Cj&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i,j' title='i,j' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=k.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k.' title='k.' class='latex' /> If you identify <img src='http://l.wordpress.com/latex.php?latex=w_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='w_A' title='w_A' class='latex' /> with the sequence that is <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' /> in <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='0' title='0' class='latex' /> otherwise, then what this is saying is that each sequence is mapped to a linear combination of sequences that differ from it only in the coordinates <img src='http://l.wordpress.com/latex.php?latex=i%2Cj&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i,j' title='i,j' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=k.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k.' title='k.' class='latex' /> But this is precisely the sort of thing that one does in a quantum computation!</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I thought quantum computations involved complex numbers.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> They do, but it&#8217;s hardly surprising that we don&#8217;t get arbitrary unitary maps, since the permutations of the cube give rise to a very small subgroup of the unitary group. It&#8217;s just amusing that the Fourier transform of a basic transformation is one of the basic transformations of quantum computation.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Are you suggesting that quantum computation could be useful to us?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> No I&#8217;m not. It was just meant as an amusing observation. Sorry if I&#8217;ve got you both excited.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Don&#8217;t apologize &#8212; I like it too. But let me get back to what I was saying. Once we see that Walsh functions have a tendency to map to linear combinations of closely related Walsh functions with coefficients equal to <img src='http://l.wordpress.com/latex.php?latex=%5Cpm+1%2F2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pm 1/2' title='\pm 1/2' class='latex' /> or <img src='http://l.wordpress.com/latex.php?latex=0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='0' title='0' class='latex' />, it becomes rather obvious that if you apply a typical sequence of basic operations to a Walsh function <img src='http://l.wordpress.com/latex.php?latex=w_A%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='w_A,' title='w_A,' class='latex' /> then it will spread out and out, preserving its <img src='http://l.wordpress.com/latex.php?latex=L_2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='L_2' title='L_2' class='latex' /> norm but decreasing its <img src='http://l.wordpress.com/latex.php?latex=L_%5Cinfty&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='L_\infty' title='L_\infty' class='latex' /> norm. Eventually you&#8217;ll start getting superpositions and the rate of decrease will become slower, and harder to analyse, but at least to start with it will be exponentially fast.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  I think I see what you mean.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Well, I wasn&#8217;t all that precise. But it seems to me that there might be some hope of proving that the decay happens not just for random products of basic 3-bit operations but even for 8)&#8217; s &#8220;sensible&#8221; products.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> You know, I think you&#8217;ve influenced my way of thinking.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Delighted to hear it, but in what way?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Well, very early on in our conversations, you pointed out that even a heuristic argument can have an influence on what proof methods one regards as potentially fruitful. And here, even if we haven&#8217;t the faintest idea how to prove the &#8220;bold conjecture&#8221;, it does seem to rule out certain methods for proving circuit lower bounds. Suppose, for example, that some ludicrously strong version of the principle held, and let <img src='http://l.wordpress.com/latex.php?latex=%5CGamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma' title='\Gamma' class='latex' /> be some class of nice functions (such as quadratic phase functions). We might attempt to prove lower bounds by showing that every low-complexity function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> correlates with some <img src='http://l.wordpress.com/latex.php?latex=g%5Cin%5CGamma.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g\in\Gamma.' title='g\in\Gamma.' class='latex' /> And this could be a highly non-natural property if <img src='http://l.wordpress.com/latex.php?latex=%5CGamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma' title='\Gamma' class='latex' /> has superexponential size, as it does with quadratics. Nevertheless, it could also be the case that any &#8220;sensible&#8221; product of basic 3-bit operations would have tiny correlation with <em>every</em> function in <img src='http://l.wordpress.com/latex.php?latex=%5CGamma.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma.' title='\Gamma.' class='latex' /> The key point here is that we go for &#8220;every&#8221; rather than &#8220;almost every&#8221;. To get that, we are arguing that when we pick a random product <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> of 3-bit operations, the events &#8220;<img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> correlates with <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' />&#8221; for <img src='http://l.wordpress.com/latex.php?latex=g%5Cin%5CGamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g\in\Gamma' title='g\in\Gamma' class='latex' /> could be strongly dependent. This would be the case if they always held when the product had certain properties, such as being &#8220;sensible&#8221;.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  What exactly are you claiming here?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I&#8217;m pointing out something very weak, in the sense that it&#8217;s supposition piled on supposition. But I think it&#8217;s at least <em>possible</em> that there is a very wide class of NP functions that cannot be used to prove circuit lower bounds. In fact, I&#8217;m tempted to say that there should be two sorts of NP functions: useless ones, such as &#8220;has low circuit complexity&#8221;, that lead to trivial reformulations of the problem, and potentially useful ones, such as &#8220;correlates with a quadratic&#8221;, that don&#8217;t work. I&#8217;d like to understand this distinction a little better, and then maybe I could formulate a very depressing conjecture that would kill off all ideas you have had so far and all that you are ever likely to have. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Oh. Well I have a confession to make.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> What on earth could that be?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I did in fact have another idea for how to prove superlinear lower bounds. It&#8217;s still highly speculative.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> You mean you don&#8217;t have a fully worked out argument? Wonders never cease.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Ha ha. Just let me try it out on you. I&#8217;ve just argued that if you take a linear phase function, a.k.a. a Walsh function, and apply to it a basic 3-bit transformation, then the result lies in twice the convex hull of some other linear phase functions. That means that the decay that we observed in the <img src='http://l.wordpress.com/latex.php?latex=L_%5Cinfty&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='L_\infty' title='L_\infty' class='latex' /> norm could not be <em>too</em> fast. </p>
<p>Now suppose we were able to prove that if you apply a basic transformation to a <em>quadratic</em> phase function then the result could be written more efficiently, in some sense, as a combination of other quadratic phase functions. Then perhaps this would prove that the maximum correlation with a quadratic phase function went down more slowly than it does in the linear case. The decrease would probably still be exponential, but perhaps the rate would be smaller. </p>
<p>If we could establish that, then perhaps we would be able to go on to show that if we considered polynomials of unbounded degree, then we could prove that the maximum correlation went down more slowly than exponential. But there still wouldn&#8217;t be too many of these polynomial phase functions, so a random function would have very small correlation with them. This might give us a distinction between functions of linear complexity and random functions. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Well, I&#8217;ve listened politely to what you say, but it doesn&#8217;t sound to me as though it would work. If you want to show that the maximum correlation goes down slowly, don&#8217;t you have to prove that if <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> is a polynomial phase function and <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\phi' title='\phi' class='latex' /> is a basic transformation, then <img src='http://l.wordpress.com/latex.php?latex=g%5Ccirc%5Cphi%5E%7B-1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g\circ\phi^{-1}' title='g\circ\phi^{-1}' class='latex' /> has <img src='http://l.wordpress.com/latex.php?latex=1-o%281%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1-o(1)' title='1-o(1)' class='latex' /> correlation with another polynomial phase function of the same degree? That doesn&#8217;t sound remotely plausible to me.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Hmm, maybe you&#8217;re right. But I think there may be something to the basic idea. Perhaps instead of showing that a single polynomial phase function transforms to a function that can be written efficiently as a convex combination, one would prove that if you do a sequence of basic transformations to a polynomial phase function, then you can write the result in some nice way as a linear combination of polynomial phase functions. I&#8217;m not quite sure what I&#8217;m proposing exactly, but it feels as though there is a method that could in principle prove non-trivial lower bound results.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Why don&#8217;t you at least see what a basic 3-bit operation does to a quadratic phase function?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I will, but I think I&#8217;d better do it in private and report back at our next meeting.  </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I don&#8217;t see how this would get past my &#8220;depressing conjecture&#8221;.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  How about actually making the conjecture before you say that?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> OK, I&#8217;ve got things to think about too. Let me have a quick go before we call it a day.</p>
<p>Let&#8217;s suppose we have some NP property we are trying to use. If we want to, we can phrase this in the following way: we are going to call a function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> &#8220;simple&#8221; if there is a Boolean function <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> that depends on at most linearly many bits such that <img src='http://l.wordpress.com/latex.php?latex=P%28f%2Cg%29%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P(f,g),' title='P(f,g),' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P' title='P' class='latex' /> is some property that can be computed in polynomial time. We want to contrast properties of two kinds. A good example of the first kind is this: <img src='http://l.wordpress.com/latex.php?latex=P%28f%2Cg%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P(f,g)' title='P(f,g)' class='latex' /> if and only if <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> is a quadratic and <img src='http://l.wordpress.com/latex.php?latex=%5Clangle+f%2Cg%5Crangle&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\langle f,g\rangle' title='\langle f,g\rangle' class='latex' /> is large. And a good example of the second is this: <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> encodes a circuit of size at most <img src='http://l.wordpress.com/latex.php?latex=n%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^2' title='n^2' class='latex' /> that computes <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' />.</p>
<p>Both these properties could be thought of as correlation with a function in some class <img src='http://l.wordpress.com/latex.php?latex=%5CGamma.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma.' title='\Gamma.' class='latex' /> It&#8217;s just that in the second case, the correlation has to be so perfect that you actually <em>equal</em> a function in <img src='http://l.wordpress.com/latex.php?latex=%5CGamma.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma.' title='\Gamma.' class='latex' /> A difference between the two is then that membership of <img src='http://l.wordpress.com/latex.php?latex=%5CGamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma' title='\Gamma' class='latex' /> is easy to establish in the quadratic-functions case and hard to establish in the low-complexity case. </p>
<p>What I think I might be heading towards here is the possibility that any NP property of the form &#8220;correlates with something in <img src='http://l.wordpress.com/latex.php?latex=%5CGamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma' title='\Gamma' class='latex' />&#8221; for some polynomially-computable property <img src='http://l.wordpress.com/latex.php?latex=%5CGamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma' title='\Gamma' class='latex' /> cannot work. Or it might be that, but with a stronger restriction on <img src='http://l.wordpress.com/latex.php?latex=%5CGamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma' title='\Gamma' class='latex' /> that said that it could be computed in a &#8220;collapsing&#8221; way. But that might apply to all properties in P that we could actually understand. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  So we might still be able to prove lower bounds if we took a simple class <img src='http://l.wordpress.com/latex.php?latex=%5CGamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma' title='\Gamma' class='latex' /> but asked for a more complicated relation to our function than &#8220;correlates with&#8221;?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I think that would be worth thinking about, yes.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Just before we finish, can I suggest a way that might in principle lead to a more precise notion of &#8220;sensible product&#8221; or &#8220;collapsing product&#8221;?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> That would be great.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Well, I haven&#8217;t actually done it, but perhaps one could come up with a presentation of the alternating group <img src='http://l.wordpress.com/latex.php?latex=A_%7B2%5En%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A_{2^n}' title='A_{2^n}' class='latex' /> with the following properties. The generators are just the basic 3-bit operations. To each relation one associates a set <img src='http://l.wordpress.com/latex.php?latex=R%5Csubset%5C%7B1%2C2%2C%5Cdots%2Cn%5C%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='R\subset\{1,2,\dots,n\},' title='R\subset\{1,2,\dots,n\},' class='latex' /> such that the following properties hold. (They may not be a complete set of all the properties one would need.)</p>
<p>1. A relation associated with the set <img src='http://l.wordpress.com/latex.php?latex=R&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='R' title='R' class='latex' /> is a product of 3-bit operations that operate only on bits that belong to the set <img src='http://l.wordpress.com/latex.php?latex=R.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='R.' title='R.' class='latex' /></p>
<p>2. For each <img src='http://l.wordpress.com/latex.php?latex=R&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='R' title='R' class='latex' /> let <img src='http://l.wordpress.com/latex.php?latex=G_R&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_R' title='G_R' class='latex' /> be the group generated by all the relations for which the associated set is a proper subset of <img src='http://l.wordpress.com/latex.php?latex=R.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='R.' title='R.' class='latex' /> Then <img src='http://l.wordpress.com/latex.php?latex=G_R&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_R' title='G_R' class='latex' /> is an infinite group, and no relation with index <img src='http://l.wordpress.com/latex.php?latex=R&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='R' title='R' class='latex' /> can be written as a short word in the group <img src='http://l.wordpress.com/latex.php?latex=G_R.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G_R.' title='G_R.' class='latex' /></p>
<p>3. Basic operations that operate on disjoint sets of bits commute.</p>
<p>The rough idea then would be to prove that if you have a product of basic operations that doesn&#8217;t stick around for a long time in the same set <img src='http://l.wordpress.com/latex.php?latex=R%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='R,' title='R,' class='latex' /> then there is no way of shortening it. So one would have the &#8220;no unexpected relations&#8221; property that you are talking about.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> That sounds interesting, but it also sounds pretty hard to come up with a presentation of the kind you are talking about.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I wonder if we could find a combinatorial group theorist who would be able to comment on whether it is remotely feasible.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/970/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/970/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/gowers.wordpress.com/970/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/gowers.wordpress.com/970/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/gowers.wordpress.com/970/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/gowers.wordpress.com/970/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/gowers.wordpress.com/970/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/gowers.wordpress.com/970/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/gowers.wordpress.com/970/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/gowers.wordpress.com/970/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&blog=1659011&post=970&subd=gowers&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2009/10/27/a-conversation-about-complexity-lower-bounds-viii/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>Triple negatives and Conservapedia&#8217;s support for Hitler</title>
		<link>http://gowers.wordpress.com/2009/10/23/triple-negatives-and-conservapedias-support-for-hitler/</link>
		<comments>http://gowers.wordpress.com/2009/10/23/triple-negatives-and-conservapedias-support-for-hitler/#comments</comments>
		<pubDate>Fri, 23 Oct 2009 20:17:25 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[Use of language]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=1114</guid>
		<description><![CDATA[In an entry entitled &#8220;Negatives&#8221; in his Modern English Usage, Henry Fowler gave an amusing collection of examples of blunders that had been made with them. (If you follow this link, you have to scroll down a page to find the article I&#8217;m talking about.) Unaware of this, though not surprised to see it, I [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&blog=1659011&post=1114&subd=gowers&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>In an entry entitled &#8220;Negatives&#8221; in his Modern English Usage, Henry Fowler gave <a href="http://books.google.com/books?id=Z4HI0RQIDK0C&amp;pg=RA1-PA874&amp;lpg=RA1-PA874&amp;dq=fowler+double+negative&amp;source=bl&amp;ots=wbE3ZtUmMI&amp;sig=znO6by_a0kKtbpdtRYW2wERHRyo&amp;hl=en&amp;ei=p-7hSs2_Op6hjAeYq8G8AQ&amp;sa=X&amp;oi=book_result&amp;ct=result&amp;resnum=5&amp;ved=0CBgQ6AEwBA#v=onepage&amp;q=&amp;f=false">an amusing collection of examples of blunders that had been made with them</a>. (If you follow this link, you have to scroll down a page to find the article I&#8217;m talking about.) Unaware of this, though not surprised to see it, I have been making a little collection myself. Since this is supposed to be a maths blog, let me feebly justify posting it by saying that it is a reflection on the fact that <img src='http://l.wordpress.com/latex.php?latex=%28-1%29%5E3%3D-1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(-1)^3=-1' title='(-1)^3=-1' class='latex' /> (and at one point on the corollary that <img src='http://l.wordpress.com/latex.php?latex=%28-1%29%5E4%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(-1)^4=1' title='(-1)^4=1' class='latex' />).<span id="more-1114"></span></p>
<p>1. The first example is something I was once told about a landmark I had asked the way to. I was told that it was large, so that at a certain point on the route, &#8220;You can&#8217;t fail to miss it.&#8221; Fortunately, I did fail.</p>
<p>2. On February 25th 2009, David Thompson, a political correspondent from BBC News, wrote: &#8220;No-one would deny that David and Samantha Cameron come from anything other than extremely privileged backgrounds.&#8221; I find this a very hard example to understand directly. The easiest way to deal with it is to substitute &#8220;claim&#8221; for &#8220;deny&#8221; and see what you get. But I could just go ahead and be a counterexample to Thompson&#8217;s assertion: I hereby deny that David and Samantha Cameron came from anything other than extremely privileged backgrounds.</p>
<p>3. On May 2nd 2009, at the weigh-in before his fight with Manny Pacquiao, Ricky Hatton declared to his fans, &#8220;You will not go undisappointed.&#8221; This defeatist attitude is unusual in a boxer, but it was justified: he was knocked down twice in the first round and knocked out completely in the second.</p>
<p>4. In July 2009, Nathan Hauritz, part of the Australian cricket team, was talking about Australia&#8217;s prospects for the last two days of the Test match at Lord&#8217;s. He said: &#8220;None of the boys don&#8217;t think we can&#8217;t do the job.&#8221; As an Englishman, I am delighted to say that his team&#8217;s collective pessimism was again justified.</p>
<p>5. When it came to the one-day series a couple of months later, the tables were turned. An English supporter, reflecting on the situation, wrote in to the BBC website and included the sentence, &#8220;Doesn&#8217;t mean we shouldn&#8217;t overlook the weaknesses in this team though.&#8221; He gets a prize for a <em>quadruple</em> negative, but I cannot agree with him: moaning about the weaknesses of our national sports teams is one of life&#8217;s pleasures if you are English.</p>
<p>6. Finally, I come to a deeply sinister example. I recently learned that there is a website called Conservapedia, which describes itself as &#8220;An encyclopaedia with articles written from a conservative viewpoint.&#8221; I look forward to their mathematics section when they get round to it, which so far they haven&#8217;t: Grigory Perelman, a noted communist, claims to have solved the Poincar&eacute; conjecture, but he has REFUSED to publish his results in a conventional journal, in case they are scrutinized objectively and not just by a team of hand-picked liberal academics; the heat equation and the wave equation are examples of the now-discredited category of evolution equations; using a notion of creation and annihilation operators, quantum field theorists are starting to understand what the Bible could have told them all along; etc. etc. Unfortunately, it seems that Conservapedia is not a Wiki-style site &#8212; I can&#8217;t think why not &#8212; so we mathematicians cannot help them build up a decent mathematics section.</p>
<p>Just for fun I thought I&#8217;d have a look at what they have to say about Richard Dawkins. Some of their criticisms were only to be expected: he supports evolution, is a noted atheist, and so on. But I was taken aback by one of them: <em>he is insufficiently solid in his support for Hitler</em>. Here is the passage that shows this (at least as the article was on October 13th 2009 &#8212; I hope this blog&#8217;s portion of cyberspace is sufficiently disconnected from Conservapedia&#8217;s that it will remain that way, but if not, it will be interesting to see whether it can still be found in the history of the revisions of the article):</p>
<blockquote><p>Dawkins&#8217; Comment Regarding Adolf Hitler</p>
<p>When asked in an interview, &#8220;If we do not acknowledge some sort of external [standard], what is to prevent us from saying that the Muslim [extremists] aren’t right?&#8221;, Dawkins replied, &#8220;What’s to prevent us from saying Hitler wasn’t right? I mean, that is a genuinely difficult question. But whatever [defines morality], it’s not the Bible. If it was, we’d be stoning people for breaking the Sabbath.&#8221;<br />
The interviewer wrote, regarding the Hitler comment, &#8220;I was stupefied. He had readily conceded that his own philosophical position did not offer a rational basis for moral judgments. His intellectual honesty was refreshing, if somewhat disturbing on this point.&#8221;</p></blockquote>
<p>Well, in one sense it is a genuinely difficult question I suppose. HITLER WASN&#8217;T RIGHT. HITLER WASN&#8217;T RIGHT. HITLER WASN&#8217;T RIGHT!! Hmm, nothing seems to be preventing me from saying that, and I say it without apology. But why was Dawkins not offended by the suggestion that something should be preventing him from saying that Muslim extremists aren&#8217;t right**? Perhaps, being a somewhat literal-minded scientist, he was having thoughts such as this: &#8220;Well, it does at first seem as though there is nothing to prevent my saying that Muslim extremists aren&#8217;t right, but I can imagine circumstances under which I would be so prevented. For example, if I was kidnapped by al-Qaeda and held at knifepoint, then it would be reasonable to say that I was effectively prevented from drawing attention to the lack of rightness of their views. Or for a less artificial example, if I was in the middle of a dinner party with some of my super-bright liberal atheist friends, then I might be prevented by embarrassment from saying that Muslim extremists weren&#8217;t right: after all, the statement is normally held to be too obvious to be worth saying, so I might be thought to be protesting too much. Yes, on reflection, this is a genuinely difficult question. And similar difficulties apply if I substitute Hitler for Muslim extremists.&#8221;</p>
<p>So much for Dawkins. The motives of his interviewer, and of Conservapedia for gleefully reporting the interview, cannot be explained so innocently. The only reasonable explanation for the interviewer being &#8220;stupefied&#8221; is that he regarded it as manifestly and shockingly wrong to say that Hitler wasn&#8217;t right. In other words, the interviewer was not just a neo-Nazi, but so convinced of his/her neo-Nazism that a contrary view was stupefying. And Conservapedia wholeheartedly agrees with this. We live in worrying times. </p>
<p>As a postscript, let me deal with a small technical point concerning the last example. It might seem as though &#8220;What is to prevent us from saying that Hitler wasn&#8217;t right?&#8221; has just the two negatives &#8220;prevent&#8221; and &#8220;not&#8221;. So why do I call it a triple negative? The answer is that I am talking morality rather than semantics in this example. Hitler is the embodiment of evil, so we can build up as follows:</p>
<p>(i) Hitler   &#8212;   bad.</p>
<p>(ii) Saying that Hitler was right  &#8212;   bad.</p>
<p>(iii) Saying that Hitler wasn&#8217;t right &#8212; good.</p>
<p>(iv) Preventing someone from saying that Hitler wasn&#8217;t right &#8212; bad.</p>
<p>(v) Implying that one ought to be prevented from saying that Hitler wasn&#8217;t right &#8212; bad.</p>
<p>And therefore, </p>
<p>(v) Conservapedia &#8212; bad.</p>
<p>Needless to say, if anyone else has some good examples of triple negatives, I&#8217;d be delighted to hear them.</p>
<p>**Most embarrassingly, when I posted this, I wrote &#8220;are right&#8221; instead of &#8220;aren&#8217;t right&#8221; here. My only consolation is that I noticed before it was pointed out to me.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/1114/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/1114/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/gowers.wordpress.com/1114/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/gowers.wordpress.com/1114/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/gowers.wordpress.com/1114/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/gowers.wordpress.com/1114/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/gowers.wordpress.com/1114/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/gowers.wordpress.com/1114/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/gowers.wordpress.com/1114/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/gowers.wordpress.com/1114/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&blog=1659011&post=1114&subd=gowers&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2009/10/23/triple-negatives-and-conservapedias-support-for-hitler/feed/</wfw:commentRss>
		<slash:comments>44</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>A conversation about complexity lower bounds, VII</title>
		<link>http://gowers.wordpress.com/2009/10/21/a-conversation-about-complexity-lower-bounds-vii/</link>
		<comments>http://gowers.wordpress.com/2009/10/21/a-conversation-about-complexity-lower-bounds-vii/#comments</comments>
		<pubDate>Wed, 21 Oct 2009 17:22:54 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[complexity]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=909</guid>
		<description><![CDATA[Another somewhat inconclusive instalment, but it introduces another candidate for a non-natural property that might be of some use. (In a later instalment, a heuristic argument is proposed that would show, if correct, that in fact it is not of any use &#8230;)
*****************************************
  What&#8217;s on the agenda for today?
   Well, there are [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&blog=1659011&post=909&subd=gowers&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Another somewhat inconclusive instalment, but it introduces another candidate for a non-natural property that might be of some use. (In a later instalment, a heuristic argument is proposed that would show, if correct, that in fact it is not of any use &#8230;)</p>
<p>*****************************************</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> What&#8217;s on the agenda for today?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Well, there are two main things I&#8217;d like to discuss. One is our &#8220;proof&#8221; that a random low-complexity function was indistinguishable from a purely random function if you use anything <img src='http://l.wordpress.com/latex.php?latex=U%5Ek&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^k' title='U^k' class='latex' />-ish to do the distinguishing. I have a small problem with the argument, though I wouldn&#8217;t go so far as to say that it is wrong. </p>
<p>The second is that we haven&#8217;t got round to having a serious discussion of the lessons that can be drawn from the failure of the approach. If it&#8217;s true that it doesn&#8217;t work, then I think we can come up with a pretty general class of &#8220;simplicity properties&#8221; that fail for similar reasons. But I&#8217;d like to be as precise about that as possible.<span id="more-909"></span></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Sounds like a good plan to me. Shall we start with your doubts about my argument against your <img src='http://l.wordpress.com/latex.php?latex=U%5Ek&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^k' title='U^k' class='latex' />-dual approach?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Yes. I&#8217;m talking about the modified one where we ignore the degenerate cases. And for now I&#8217;m forgetting about the duality. So the question I want to address is this. Suppose you take a random function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> of superlinear complexity and look at the probability, for a random subspace <img src='http://l.wordpress.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='X' title='X' class='latex' /> of <img src='http://l.wordpress.com/latex.php?latex=%5C%7B0%2C1%5C%7D%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{0,1\}^n' title='\{0,1\}^n' class='latex' /> of dimension <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> (which is around <img src='http://l.wordpress.com/latex.php?latex=%5Clog+n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\log n' title='\log n' class='latex' />) that the restriction of <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='X' title='X' class='latex' /> has an even number of 1s. (Here I am fixing <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> and letting <img src='http://l.wordpress.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='X' title='X' class='latex' /> vary, so this probability is a parameter associated with the function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' />.)</p>
<p>Now let&#8217;s just suppose that the subspace <img src='http://l.wordpress.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='X' title='X' class='latex' /> in question is the subspace generated by the first <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> basis vectors (that is, the subspace consisting of all sequences <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=x_%7Bk%2B1%7D%3D%5Cdots%3Dx_n%3D0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x_{k+1}=\dots=x_n=0' title='x_{k+1}=\dots=x_n=0' class='latex' />). Then if you do random Boolean operations, most of them will not involve any of the first <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> coordinates. Indeed, if you do <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> operations, then the probability that you miss the first <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> coordinates completely is something like <img src='http://l.wordpress.com/latex.php?latex=%281-Ck%2Fn%29%5Em&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(1-Ck/n)^m' title='(1-Ck/n)^m' class='latex' />, or <img src='http://l.wordpress.com/latex.php?latex=%5Cexp%28-Ckm%2Fn%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\exp(-Ckm/n)' title='\exp(-Ckm/n)' class='latex' />. And for that to be exponentially small we need <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> to be <img src='http://l.wordpress.com/latex.php?latex=n%5E2%2FCk&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^2/Ck' title='n^2/Ck' class='latex' />.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Yes, but you&#8217;re now fixing <img src='http://l.wordpress.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='X' title='X' class='latex' /> and letting <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> vary.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I know, and for that reason I think this argument may be going nowhere. But let me pursue it a tiny bit further. I&#8217;m now trying to think how many Boolean operations are needed if you want to scramble every single subspace that is generated by <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> unit vectors. And it&#8217;s not obvious to me that linearly many operations will do the job. If we choose random Boolean operations, then we need around <img src='http://l.wordpress.com/latex.php?latex=n%5Clog+n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n\log n' title='n\log n' class='latex' /> of them before we can be fairly sure that we&#8217;ve involved every single coordinate. And if we haven&#8217;t involved <img src='http://l.wordpress.com/latex.php?latex=x_1%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x_1,' title='x_1,' class='latex' /> say, then flipping <img src='http://l.wordpress.com/latex.php?latex=x_1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x_1' title='x_1' class='latex' /> makes no difference to the value of the function, so <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> is guaranteed to have even parity in any subspace that involves <img src='http://l.wordpress.com/latex.php?latex=x_1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x_1' title='x_1' class='latex' />.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  But why is it a problem for <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> to have even parity in <em>some</em> subspaces?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  That&#8217;s something we&#8217;d have to think about. Possibly one could go further and argue for some kind of bias. But let me continue with what I was saying. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  OK.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  We could try starting with a couple of non-random layers in order to ensure in a crude way that we involve every coordinate. But involving every coordinate isn&#8217;t enough. For example, suppose we chose random operations &#8230;</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Can I stop you here?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I suppose so.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> It&#8217;s just that if all you&#8217;re planning to do is look at subspaces of this special type then you&#8217;re looking at only <img src='http://l.wordpress.com/latex.php?latex=%5Cbinom+nk&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\binom nk' title='\binom nk' class='latex' /> subspaces. Even if you allow the fixed coordinates to take arbitrary (fixed) values, then you&#8217;re still looking at only <img src='http://l.wordpress.com/latex.php?latex=2%5E%7Bn-k%7D%5Cbinom+nk&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{n-k}\binom nk' title='2^{n-k}\binom nk' class='latex' /> subspaces. So it&#8217;s hard to see how you will end up saying anything that can&#8217;t be checked in polynomial time in <img src='http://l.wordpress.com/latex.php?latex=2%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^n' title='2^n' class='latex' />. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  OK you&#8217;re right. I suppose it means that there is some clever pre-scrambling that can be done with linearly many operations. For instance, perhaps there&#8217;s a nice function from <img src='http://l.wordpress.com/latex.php?latex=%5C%7B0%2C1%5C%7D%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{0,1\}^n' title='\{0,1\}^n' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5C%7B0%2C1%5C%7D%5E%7B2n%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{0,1\}^{2n}' title='\{0,1\}^{2n}' class='latex' /> such that the image of every sequence has large support. If we applied that function and then did random Boolean operations, we&#8217;d probably get all our probabilities decaying exponentially.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> You&#8217;re asking there for a nice code that can be computed in linear time. I don&#8217;t know enough about coding theory to know whether such a thing exists.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  A quick Google search reveals that <a href='http://gowers.files.wordpress.com/2009/10/linearcode.pdf'>it does</a>. [<em>Apologies -- this file seems to be stored as an image and takes a long time to download.</em>]</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  OK, I admit it. Any argument that tries to make use of the behaviour of the restriction of <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> to subspaces generated by <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> unit vectors is doomed to failure, and there are two ways of seeing why. The first is to observe, as <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> did, that anything you can say about these restrictions will be checkable in polynomial time. The second is to note that in linear time you can map <img src='http://l.wordpress.com/latex.php?latex=%5C%7B0%2C1%5C%7D%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{0,1\}^n' title='\{0,1\}^n' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5C%7B0%2C1%5C%7D%5E%7B2n%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{0,1\}^{2n}' title='\{0,1\}^{2n}' class='latex' /> in such a way that every vector has an image with lots of 0s and lots of 1s. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  I&#8217;m sorry, I don&#8217;t understand the second of those arguments.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  No apologies needed &#8212; I was far too vague. But explaining myself will take us to the second topic I wanted to discuss, which is trying to come up with informal, but reasonably precise, descriptions of the kinds of arguments that run up against the natural-proofs barrier. In the light of what we&#8217;ve just said, I want to refine the heuristic picture I had earlier that was based on the Gowers model of random computable functions.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I wish you wouldn&#8217;t call it the Gowers model. It&#8217;s just the obvious model of a random reversible computation.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Why is it obvious? In particular, why do the basic operations depend on three bits rather than say two or four?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  That&#8217;s easy to answer. It&#8217;s because if you base them on two bits then the transformations you get are affine over <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BF%7D_2%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{F}_2,' title='\mathbb{F}_2,' class='latex' /> and therefore are by no means quasirandom. By contrast, if you use three bits then you can generate all even permutations of the cube. So three is chosen because it&#8217;s the smallest number that works. A related reason is that if you allow some extra &#8220;rubbish&#8221; bits then you can do any computation you like. For example, to simulate an AND gate, you take the two bits you want to do AND to, and a third bit that you set to 0. Then on those three bits you do the transposition that swaps <img src='http://l.wordpress.com/latex.php?latex=110&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='110' title='110' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=111.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='111.' title='111.' class='latex' /> At the end of that transposition, the only way that the third bit can end up as a 1, given that it started as a 0, is if the previous two bits are 11. </p>
<p>Gowers looked at random compositions of basic operations, but his probabilities were affected by the fact that if a pair of sequences differed in just one coordinate, then the probability of doing anything to that coordinate when you did a random basic operation was <img src='http://l.wordpress.com/latex.php?latex=1-3%2Fn%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1-3/n,' title='1-3/n,' class='latex' /> which meant that the probability of not touching it after <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> steps was around <img src='http://l.wordpress.com/latex.php?latex=%5Cexp%28-3m%2Fn%29%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\exp(-3m/n),' title='\exp(-3m/n),' class='latex' /> which did not become smaller than <img src='http://l.wordpress.com/latex.php?latex=n%5E%7B-1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^{-1}' title='n^{-1}' class='latex' /> until <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> was around <img src='http://l.wordpress.com/latex.php?latex=n%5Clog+n.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n\log n.' title='n\log n.' class='latex' /> </p>
<p>This is actually a genuine phenomenon: for example, if you do a random walk in the discrete cube, then the mixing time is about <img src='http://l.wordpress.com/latex.php?latex=n%5Clog+n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n\log n' title='n\log n' class='latex' /> because before then there will be directions that you have not moved in.</p>
<p>However, we should not be misled into thinking that we can somehow exploit this phenomenon in order to prove complexity lower bounds of <img src='http://l.wordpress.com/latex.php?latex=n%5Clog+n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n\log n' title='n\log n' class='latex' />, because if we do a bit of preprocessing, by applying a linear-time code first, so that any two sequences now differ in some constant fraction of their bits, then the expected time it takes to separate any two sequences is constant. This means that after only a linear number of operations we can expect to have chopped everything up.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Is it possible for you to say more precisely what you mean by &#8220;chopped everything up&#8221;?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I can try. Recall what a basic operation does. It takes the discrete cube and divides it into eight equal parts, according to the values taken by three selected coordinates. It then shuffles these parts by translating them. For example, if the basic transformation looks at the first three bits and exchanges 000 with 111, then to every sequence that begins 000 or 111 you add the vector <img src='http://l.wordpress.com/latex.php?latex=%28111000%5Cdots+0%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(111000\dots 0).' title='(111000\dots 0).' class='latex' /> So if <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' /> belong to the same part, then they haven&#8217;t really been shuffled relative to each other. </p>
<p>I suppose what I&#8217;m saying there is that <img src='http://l.wordpress.com/latex.php?latex=x-y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x-y' title='x-y' class='latex' /> remains the same if <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' /> are in the same part. But one can also look at more complicated but very important relationships. For example, for a composition of basic operations to look random, we need it to &#8220;break up linearity&#8221;. One way we could measure that is to look at the extent to which the equation <img src='http://l.wordpress.com/latex.php?latex=x%2By%3Dz%2Bw&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x+y=z+w' title='x+y=z+w' class='latex' /> implies that <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%28x%29%2B%5Cphi%28y%29%3D%5Cphi%28z%29%2B%5Cphi%28w%29%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\phi(x)+\phi(y)=\phi(z)+\phi(w),' title='\phi(x)+\phi(y)=\phi(z)+\phi(w),' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\phi' title='\phi' class='latex' /> is the composition of basic operations. </p>
<p>Now <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' /> differ in a constant fraction of places, as do <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=z&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='z' title='z' class='latex' />, and as do <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=z&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='z' title='z' class='latex' />. So with constant probability we choose three coordinates such that the restrictions of <img src='http://l.wordpress.com/latex.php?latex=x%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x,' title='x,' class='latex' /> <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=z&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='z' title='z' class='latex' /> are all different. And if <img src='http://l.wordpress.com/latex.php?latex=x%2By%3Dz%2Bw&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x+y=z+w' title='x+y=z+w' class='latex' /> then it follows that the restriction of <img src='http://l.wordpress.com/latex.php?latex=w&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='w' title='w' class='latex' /> is different again, and the four restrictions form a 2-dimensional subspace of <img src='http://l.wordpress.com/latex.php?latex=%5C%7B0%2C1%5C%7D%5E3.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{0,1\}^3.' title='\{0,1\}^3.' class='latex' /> With positive probability the basic transformation will turn that into a subset of size 4 that is not a subspace, and we will have &#8220;broken the linearity&#8221; of the quadruple.</p>
<p>At this point I want to jump straight to the vague thought that I hope to make precise. Let <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> be a random Boolean function defined on <img src='http://l.wordpress.com/latex.php?latex=%5C%7B0%2C1%5C%7D%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{0,1\}^n' title='\{0,1\}^n' class='latex' /> as follows. You first map <img src='http://l.wordpress.com/latex.php?latex=%5C%7B0%2C1%5C%7D%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{0,1\}^n' title='\{0,1\}^n' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5C%7B0%2C1%5C%7D%5E%7B2n%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{0,1\}^{2n}' title='\{0,1\}^{2n}' class='latex' /> as described, then do a random sequence of <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> basic operations, and finally you look at the first coordinate of the result (or take <img src='http://l.wordpress.com/latex.php?latex=-1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='-1' title='-1' class='latex' /> raised to that power if you want an answer in <img src='http://l.wordpress.com/latex.php?latex=%5C%7B-1%2C1%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{-1,1\}' title='\{-1,1\}' class='latex' />). I want to suggest that if <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S' title='S' class='latex' /> is any set of Boolean functions on <img src='http://l.wordpress.com/latex.php?latex=%5C%7B0%2C1%5C%7D%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{0,1\}^n' title='\{0,1\}^n' class='latex' /> such that membership of <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S' title='S' class='latex' /> can be computed in polynomial time (in <img src='http://l.wordpress.com/latex.php?latex=2%5En%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^n,' title='2^n,' class='latex' /> or equivalently in exponential time in <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' />), then the probability that <img src='http://l.wordpress.com/latex.php?latex=f%5Cin+S&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f\in S' title='f\in S' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=2%5E%7B-2%5En%7D%7CS%7C%2Bc%28m%29%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{-2^n}|S|+c(m),' title='2^{-2^n}|S|+c(m),' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=c%28m%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='c(m)' title='c(m)' class='latex' /> tends to zero exponentially quickly.</p>
<p>I now want to think about (i) whether that is actually true and (ii) what the consequences are for complexity proofs if it is true. I feel as though it was something like the above fact that caused us problems earlier.</p>
<p>I don&#8217;t really have anything sensible to say about (i). We sort of know from Razborov and Rudich that something like it is probably true, so I think I&#8217;ll just concentrate on (ii), because that is what really interests me.</p>
<p>So let&#8217;s suppose that S is a set of Boolean functions. I&#8217;m looking for properties of S that ensure that it cannot distinguish between random low-complexity functions and purely random functions. And the starting point will be what you told me right at the beginning of our conversation: that if S is a polynomially computable property then it has no chance of working. </p>
<p>Now it follows trivially from this that if S (which we think of as our &#8220;simplicity&#8221; property) implies some property T that is polynomially computable and does not hold for random functions, then again S cannot work. Why? Because then T would hold for random low-complexity functions and not for purely random functions, while also being polynomially computable. And that takes us back to our starting point.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I should make clear that Razborov and Rudich included this observation in their paper, and indeed gave examples of properties S that had been used in the literature that were not themselves natural but that were &#8220;naturalizable&#8221; in the sense of implying natural properties T.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Yes, I think you mentioned that earlier. Perhaps I should make clear that I&#8217;m not claiming originality for any of this. One way of describing what I&#8217;m doing now is to say that I&#8217;m trying to describe general classes of properties that turn out to be naturalizable.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I agree that that&#8217;s interesting, even if it doesn&#8217;t count as a theoretical advance.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  OK. Now one of the first &#8220;unnatural&#8221; (as far as I could tell) properties that I was interested in was the property of having a not too large <img src='http://l.wordpress.com/latex.php?latex=U%5E3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^3' title='U^3' class='latex' /> dual norm. But as <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  pointed out, that property is easily naturalizable, since it implies the property of having a not too small <img src='http://l.wordpress.com/latex.php?latex=U%5E3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^3' title='U^3' class='latex' /> norm. I had been hoping that duality was a way of building nice properties that were not polynomially computable out of ones that were, and therefore of generating a large number of good candidates for properties that might distinguish between pseudorandom functions and random functions. However, it seems that <em>no</em> construction of this kind can work. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Can you say that formally?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  It&#8217;s not at all hard to say it formally. The question is how generally one can say it. Here&#8217;s what I&#8217;ve got so far. Let <img src='http://l.wordpress.com/latex.php?latex=%5C%7C.%5C%7C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|.\|' title='\|.\|' class='latex' /> be a polynomially computable norm. Then the property <img src='http://l.wordpress.com/latex.php?latex=%5C%7Cf%5C%7C%5E%2A%5Cleq+C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|f\|^*\leq C' title='\|f\|^*\leq C' class='latex' /> does not work as a simplicity property. The reason is that if <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> is a Boolean function, then <img src='http://l.wordpress.com/latex.php?latex=%5Clangle+f%2Cf%5Crangle%3D1%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\langle f,f\rangle=1,' title='\langle f,f\rangle=1,' class='latex' /> so the property <img src='http://l.wordpress.com/latex.php?latex=%5C%7Cf%5C%7C%5E%2A%5Cleq+C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|f\|^*\leq C' title='\|f\|^*\leq C' class='latex' /> implies the property <img src='http://l.wordpress.com/latex.php?latex=%5C%7Cf%5C%7C%5Cgeq+C%5E%7B-1%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|f\|\geq C^{-1}.' title='\|f\|\geq C^{-1}.' class='latex' /> So if <img src='http://l.wordpress.com/latex.php?latex=%5C%7Cf%5C%7C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|f\|' title='\|f\|' class='latex' /> is small for random functions (as it is for <img src='http://l.wordpress.com/latex.php?latex=U%5Ek&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^k' title='U^k' class='latex' /> norms), then we have a polynomial-time way of distinguishing between random and pseudorandom, which is not allowed.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  You don&#8217;t seem to have ruled out the possibility that there might be a norm <img src='http://l.wordpress.com/latex.php?latex=%5C%7C.%5C%7C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|.\|' title='\|.\|' class='latex' /> that is <em>not</em> small for random functions, but such that the dual norm <img src='http://l.wordpress.com/latex.php?latex=%5C%7C.%5C%7C%5E%2A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|.\|^*' title='\|.\|^*' class='latex' /> <em>is</em> large for random functions.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  That&#8217;s true. So maybe I need to add that as an assumption. I&#8217;ll call <img src='http://l.wordpress.com/latex.php?latex=%5C%7C.%5C%7C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|.\|' title='\|.\|' class='latex' /> a <em>quasirandomness norm</em> if it is small for random functions. So then the statement above is that duals of polynomially computable quasirandomness norms cannot work.</p>
<p>We should perhaps bear in mind what you&#8217;ve just said, though it seems to me to be a very long shot. Perhaps one could devise a norm <img src='http://l.wordpress.com/latex.php?latex=%5C%7C.%5C%7C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|.\|' title='\|.\|' class='latex' /> that was large for random functions but small for functions that were &#8220;even better than random&#8221;. To give an example, if you define <img src='http://l.wordpress.com/latex.php?latex=%5C%7Cf%5C%7C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|f\|' title='\|f\|' class='latex' /> to be <img src='http://l.wordpress.com/latex.php?latex=%5C%7C%5Chat%7Bf%7D%5C%7C_%5Cinfty%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|\hat{f}\|_\infty,' title='\|\hat{f}\|_\infty,' class='latex' /> then the norm of a random function is <img src='http://l.wordpress.com/latex.php?latex=%28%5Clog+N%2FN%29%5E%7B1%2F2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(\log N/N)^{1/2}' title='(\log N/N)^{1/2}' class='latex' /> rather than <img src='http://l.wordpress.com/latex.php?latex=N%5E%7B-1%2F2%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N^{-1/2}.' title='N^{-1/2}.' class='latex' /> The reason is that each Fourier coefficient has a Gaussian concentration around 0 with standard deviation <img src='http://l.wordpress.com/latex.php?latex=N%5E%7B-1%2F2%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N^{-1/2},' title='N^{-1/2},' class='latex' /> so if you&#8217;ve got <img src='http://l.wordpress.com/latex.php?latex=N&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N' title='N' class='latex' /> Fourier coefficients to worry about, then you&#8217;ll expect one of them to have size equal to about <img src='http://l.wordpress.com/latex.php?latex=%5Csqrt%7B%5Clog+N%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sqrt{\log N}' title='\sqrt{\log N}' class='latex' /> standard deviations. However, if you construct a function using a high-rank quadratic form, then you can produce several functions <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> with <img src='http://l.wordpress.com/latex.php?latex=%5C%7C%5Chat%7Bf%7D%5C%7C_%5Cinfty%5Cleq+CN%5E%7B-1%2F2%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|\hat{f}\|_\infty\leq CN^{-1/2}.' title='\|\hat{f}\|_\infty\leq CN^{-1/2}.' class='latex' /></p>
<p>Now the dual norm of <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=%5C%7C%5Chat%7Bf%7D%5C%7C_1%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|\hat{f}\|_1,' title='\|\hat{f}\|_1,' class='latex' /> where this is the <img src='http://l.wordpress.com/latex.php?latex=%5Cell_1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ell_1' title='\ell_1' class='latex' /> norm rather than the <img src='http://l.wordpress.com/latex.php?latex=L_1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='L_1' title='L_1' class='latex' /> norm (meaning that you add up the moduli of the Fourier coefficients rather than taking their average). Since a typical Fourier coefficient of a random function has modulus about <img src='http://l.wordpress.com/latex.php?latex=N%5E%7B-1%2F2%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N^{-1/2},' title='N^{-1/2},' class='latex' /> it seems that the dual norm of a random function is bigger by a factor of <img src='http://l.wordpress.com/latex.php?latex=%5Csqrt%7B%5Clog+N%7D%3D%5Csqrt%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sqrt{\log N}=\sqrt{n}' title='\sqrt{\log N}=\sqrt{n}' class='latex' /> than the reciprocal of its (non-dual) norm.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Are you trying to say that you&#8217;ve got a new candidate for a property that could be used to prove nontrivial lower bounds?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I don&#8217;t know, since this idea has only just occurred to me and I haven&#8217;t had time to think about it. But let&#8217;s break off for a second and try to see whether we have some way of ruling out this property.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Of course we do. To calculate the dual norm you just work out the <img src='http://l.wordpress.com/latex.php?latex=%5Cell_1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ell_1' title='\ell_1' class='latex' /> norm of the Fourier transform!</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Oops. But that doesn&#8217;t quite kill off the underlying idea. For example, perhaps we could define a &#8220;quadratic&#8221; version of the above norm. We would let <img src='http://l.wordpress.com/latex.php?latex=%5C%7Cf%5C%7C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|f\|' title='\|f\|' class='latex' /> be the largest inner product of <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> with any quadratic phase function <img src='http://l.wordpress.com/latex.php?latex=%28-1%29%5E%7Bq%28x%29%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(-1)^{q(x)}.' title='(-1)^{q(x)}.' class='latex' /> There are <img src='http://l.wordpress.com/latex.php?latex=2%5E%7Bcn%5E2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{cn^2}' title='2^{cn^2}' class='latex' /> of these, so we might expect one of the inner products to have size about <img src='http://l.wordpress.com/latex.php?latex=n%3D%5Clog+N&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n=\log N' title='n=\log N' class='latex' /> times the typical size of <img src='http://l.wordpress.com/latex.php?latex=N%5E%7B-1%2F2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N^{-1/2}' title='N^{-1/2}' class='latex' />. But perhaps we can also show that a random function correlates very well with a function <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> that has only the expected correlation of <img src='http://l.wordpress.com/latex.php?latex=N%5E%7B-1%2F2%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N^{-1/2}.' title='N^{-1/2}.' class='latex' /> </p>
<p>If that worked, then we might conceivably be able to get superlinear bounds. There would be two stages to the proof, both of which look as though they are either true but very hard to prove, or false.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  What are they?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Well, unfortunately superlinear bounds for formula complexity are not interesting since they are already known (e.g. for the parity function). So we&#8217;d have to look at circuit complexity. We&#8217;d need to show that for every Boolean function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> of circuit complexity at most <img src='http://l.wordpress.com/latex.php?latex=n%5Comega%28n%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n\omega(n)' title='n\omega(n)' class='latex' /> (for some function <img src='http://l.wordpress.com/latex.php?latex=%5Comega&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\omega' title='\omega' class='latex' /> that tends to infinity) there is a quadratic phase function <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> that has very slightly more than the expected correlation with <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' />. That is, one would be looking for a bound like <img src='http://l.wordpress.com/latex.php?latex=%5Clangle+f%2Cg%5Crangle%5Cgeq+N%5E%7B-1%2F2%7D%5Clog+N%2F1000%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\langle f,g\rangle\geq N^{-1/2}\log N/1000,' title='\langle f,g\rangle\geq N^{-1/2}\log N/1000,' class='latex' /> or perhaps something smaller still. It seems highly unlikely that one could prove that by identifying a property of <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> and using the property to find a <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> with good correlation. But perhaps one could somehow prove that to get rid of all the correlations would take a superlinear number of steps. It might be a bit like the result that you need <img src='http://l.wordpress.com/latex.php?latex=%5Clog_2n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\log_2n' title='\log_2n' class='latex' /> partitions of an <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' />-element set into two sets if you want every pair of elements to be in different cells of at least one of the partitions.</p>
<p>But at this stage I don&#8217;t even have a feel for what happens if you take a polynomial of the form <img src='http://l.wordpress.com/latex.php?latex=p%28x%29%3D%5Csum_%7B%5C%7Bi%2Cj%2Ck%5C%7D%5Cin%5Cmathcal%7BA%7D%7Dx_ix_jx_k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p(x)=\sum_{\{i,j,k\}\in\mathcal{A}}x_ix_jx_k' title='p(x)=\sum_{\{i,j,k\}\in\mathcal{A}}x_ix_jx_k' class='latex' /> for some collection <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BA%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{A}' title='\mathcal{A}' class='latex' /> of <img src='http://l.wordpress.com/latex.php?latex=%7B%7DCn&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{}Cn' title='{}Cn' class='latex' /> sets of size 3. That will have linear circuit complexity, and perhaps <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BA%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{A}' title='\mathcal{A}' class='latex' /> can be chosen so that it has essentially minimal <img src='http://l.wordpress.com/latex.php?latex=U%5E4&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^4' title='U^4' class='latex' /> norm and hence minimal correlation with any quadratic phase function. At the moment my guess is that that is indeed what happens.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Mine too I have to say.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  But if we were very lucky and that didn&#8217;t happen, then the second stage of the argument would be to show that &#8230; oh wait. No, the second stage would in fact be easy, because I&#8217;m sure there is at least <em>some</em> cubic phase function with minimal <img src='http://l.wordpress.com/latex.php?latex=U%5E3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^3' title='U^3' class='latex' /> norm. So that would be a function in P that had superlinear circuit complexity. The issue, therefore, is whether we can find one that&#8217;s computable in linear time.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> There&#8217;s something that&#8217;s worrying me here. It&#8217;s that there&#8217;s a concept called <em>algebrization</em> that appears in a paper of Scott Aaronson and Avi Wigderson. It&#8217;s to do with low-degree polynomials, and says something like that if you have a proof that still works when you replace the field <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BF%7D_2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{F}_2' title='\mathbb{F}_2' class='latex' /> by some extension of it, then you&#8217;re in trouble.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Can you be more precise?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I&#8217;m afraid not. I don&#8217;t know enough about it to know whether it&#8217;s relevant here, but I think you should check. It&#8217;s another barrier to proving that P<img src='http://l.wordpress.com/latex.php?latex=%5Cne&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ne' title='\ne' class='latex' />NP and it is known to kill off some proof techniques that get round the natural-proofs barrier. [<em>Algebrization is discussed in more detail in a later instalment of this dialogue.</em>]</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Hmm, that is indeed slightly worrying.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Can I chip in here?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Of course.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Well, even if everything works, I don&#8217;t see how you are proposing to distinguish between random and pseudorandom functions. You&#8217;ve argued that it&#8217;s conceivable that a random function of complexity <img src='http://l.wordpress.com/latex.php?latex=1000n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1000n' title='1000n' class='latex' /> has a correlation with at least some quadratic phase function that is not quite minimal, but that&#8217;s true of random functions. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Yes, but then I was going to look at the dual of that, arguing that random functions <em>do</em> have maximal dual norm.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Right, but why shouldn&#8217;t random functions of complexity <img src='http://l.wordpress.com/latex.php?latex=1000n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1000n' title='1000n' class='latex' /> also have maximal dual norm?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Ah yes, I see your point. I&#8217;m not sure I have any reason to think that. </p>
<p>OK, let&#8217;s get back to the &#8220;negative&#8221; discussion and see whether we understand better the kinds of proofs that won&#8217;t work.</p>
<p>Oh, but hang on, I&#8217;ve just realized I&#8217;ve been making a calculation error.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Not for the first time &#8230;</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I&#8217;m sorry about that &#8212; it&#8217;s a bad habit of mine, thinking about ideas without properly checking their flimsy foundations.</p>
<p>But it&#8217;s just occurred to me that the dreaded polynomial example isn&#8217;t as frightening as I thought. This is also relevant to the discussions we were having earlier about the <img src='http://l.wordpress.com/latex.php?latex=U%5Ek&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^k' title='U^k' class='latex' /> norm for unbounded <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' />. Let me explain. </p>
<p>It can be shown that if <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> is a polynomial of degree <img src='http://l.wordpress.com/latex.php?latex=k-1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k-1' title='k-1' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> is the polynomial phase function <img src='http://l.wordpress.com/latex.php?latex=%28-1%29%5Ep&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(-1)^p' title='(-1)^p' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> is some other function, then <img src='http://l.wordpress.com/latex.php?latex=%5Clangle+f%2Cg%5Crangle%5Cleq+%5C%7Cf%5C%7C_%7BU%5Ek%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\langle f,g\rangle\leq \|f\|_{U^k}.' title='\langle f,g\rangle\leq \|f\|_{U^k}.' class='latex' /> Now the smallest that this can be is at least <img src='http://l.wordpress.com/latex.php?latex=N%5E%7B-1%2F2%5Ek%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N^{-1/2^k}' title='N^{-1/2^k}' class='latex' />, which is much bigger than the correlation between <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> and a random function.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  But I thought you said that in the case where <img src='http://l.wordpress.com/latex.php?latex=k%3D2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k=2' title='k=2' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> is a suitably chosen quadratic you got <img src='http://l.wordpress.com/latex.php?latex=N%5E%7B-1%2F2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N^{-1/2}' title='N^{-1/2}' class='latex' /> rather than <img src='http://l.wordpress.com/latex.php?latex=N%5E%7B-1%2F4%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N^{-1/4}.' title='N^{-1/4}.' class='latex' /></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  So I did. What&#8217;s going on? </p>
<p>OK, I&#8217;ve done a few calculations, and it seems that using the inequality above is inefficient. You&#8217;ll just have to take my word for this, but the best bound one can hope to prove by easy Cauchy-Schwarz methods is that a well-chosen polynomial of degree <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> has a correlation of at most <img src='http://l.wordpress.com/latex.php?latex=N%5E%7B-1%2F2%5E%7Bk-1%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N^{-1/2^{k-1}}' title='N^{-1/2^{k-1}}' class='latex' /> with any polynomial of degree less than <img src='http://l.wordpress.com/latex.php?latex=k.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k.' title='k.' class='latex' /> So when <img src='http://l.wordpress.com/latex.php?latex=k%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k=1' title='k=1' class='latex' /> we get <img src='http://l.wordpress.com/latex.php?latex=N%5E%7B-1%2F2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N^{-1/2}' title='N^{-1/2}' class='latex' /> and when <img src='http://l.wordpress.com/latex.php?latex=k%3D3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k=3' title='k=3' class='latex' /> we get <img src='http://l.wordpress.com/latex.php?latex=N%5E%7B-1%2F4%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N^{-1/4}.' title='N^{-1/4}.' class='latex' /></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  When you say &#8220;by easy Cauchy-Schwarz methods&#8221; are you trying to suggest that more advanced techniques could give better bounds? </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Possibly. This is very close to some difficult problems in analytic number theory concerning exponential sums of the form <img src='http://l.wordpress.com/latex.php?latex=%5Csum_%7Bx%3D1%7D%5En%5Cexp%28i%5Calpha+n%5Ek%29%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sum_{x=1}^n\exp(i\alpha n^k),' title='\sum_{x=1}^n\exp(i\alpha n^k),' class='latex' /> on the assumption that <img src='http://l.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\alpha' title='\alpha' class='latex' /> is not too close to a rational with small denominator. I&#8217;m not sure what the state of the art is when <img src='http://l.wordpress.com/latex.php?latex=k%3D3%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k=3,' title='k=3,' class='latex' /> but I&#8217;m pretty sure people don&#8217;t know how to get to bounds like <img src='http://l.wordpress.com/latex.php?latex=%5Csqrt%7Bn%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sqrt{n}.' title='\sqrt{n}.' class='latex' /> Also, I think the best bounds that <em>are</em> known use algebraic geometry.</p>
<p>This changes things, of course. It means that it is probably hopeless to try to kill off the approach using a polynomial. (And as for my earlier attempts to do that for polynomials of degree <img src='http://l.wordpress.com/latex.php?latex=%5Clog+n%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\log n,' title='\log n,' class='latex' /> they now look laughable.) Instead, I think our best bet is to try to guess what happens if you take a random function of linear circuit complexity, where by that I mean a function taken from the model we were discussing earlier: first embed nicely into <img src='http://l.wordpress.com/latex.php?latex=%5C%7B0%2C1%5C%7D%5E%7B2n%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{0,1\}^{2n},' title='\{0,1\}^{2n},' class='latex' /> then apply <img src='http://l.wordpress.com/latex.php?latex=1000n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1000n' title='1000n' class='latex' /> random basic operations, and then look at the first coordinate. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  I&#8217;m sorry to interrupt you yet again, but I think that model is still flawed.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Why?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Because most of the random basic operations you took will have no effect on the final answer. To see what I mean, let&#8217;s suppose that the sets of size 3 that are used for the basic operations are <img src='http://l.wordpress.com/latex.php?latex=A_1%2C%5Cdots%2CA_m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A_1,\dots,A_m' title='A_1,\dots,A_m' class='latex' />, in that order. Then we can certainly say that the <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' />th basic operation has no effect on the first coordinate unless <img src='http://l.wordpress.com/latex.php?latex=1%5Cin+A_m.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1\in A_m.' title='1\in A_m.' class='latex' /> So we&#8217;ll expect the last <img src='http://l.wordpress.com/latex.php?latex=cn&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='cn' title='cn' class='latex' /> of the basic operations to have no effect. But that&#8217;s not all. Let&#8217;s suppose that <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> is maximal such that <img src='http://l.wordpress.com/latex.php?latex=1%5Cin+A_r.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1\in A_r.' title='1\in A_r.' class='latex' /> We now know that no previous <img src='http://l.wordpress.com/latex.php?latex=A_s&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A_s' title='A_s' class='latex' /> has any effect unless either <img src='http://l.wordpress.com/latex.php?latex=A_s%5Ccap+A_r%5Cne%5Cemptyset.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A_s\cap A_r\ne\emptyset.' title='A_s\cap A_r\ne\emptyset.' class='latex' /> And that happens with probability <img src='http://l.wordpress.com/latex.php?latex=c%2Fn&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='c/n' title='c/n' class='latex' /> as well. Indeed, it seems to me that only a <em>constant</em> number of the basic operations will have any effect at all on the final answer.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  You&#8217;re quite right of course. But I think we can deal with that problem in a simple way. Instead of looking at the first coordinate, let&#8217;s do something more global like taking the parity. In other words, the revised model is this. First you map into <img src='http://l.wordpress.com/latex.php?latex=%5C%7B0%2C1%5C%7D%5E%7B2n%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{0,1\}^{2n}' title='\{0,1\}^{2n}' class='latex' /> in a clever way, then you do a random sequence of <img src='http://l.wordpress.com/latex.php?latex=1000n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1000n' title='1000n' class='latex' /> basic operations, and finally you count how many 1s you&#8217;ve got and set <img src='http://l.wordpress.com/latex.php?latex=f%28x%29%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(x)=1' title='f(x)=1' class='latex' /> if this number is even and <img src='http://l.wordpress.com/latex.php?latex=-1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='-1' title='-1' class='latex' /> otherwise.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> It&#8217;s still not obvious to me that this model is a good one, because the randomized part (that is, the product of a random collection of basic transformations) is very simple and very easy to distinguish from a truly random function. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I find that quite interesting. It&#8217;s not clear to me either way. The random part does indeed look very simple, but I don&#8217;t see why it should remain simple when composed with a carefully chosen function. For instance, the random part will have constant depth and bounded fanin, which computer scientists would regard as <em>very</em> simple indeed, but the explicit parts that come before and after cannot be computed by bounded-depth circuits (because the parity function can&#8217;t), so perhaps one part of the procedure gives you depth and the other part gives you randomness.</p>
<p>But in a way, all this is irrelevant, because I still haven&#8217;t addressed <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' /> &#8217;s earlier question. Do I have any proposal at all for a property that would distinguish between pseudorandom functions and random ones? Unfortunately, I don&#8217;t think I do, or at least not one that I believe in. Let me explain and then we can get back to trying to rule out more general kinds of approach.</p>
<p>The property I&#8217;ve just been considering is the property of having a very large dual norm, where the norm itself is given by the maximum possible correlation with a quadratic phase function.</p>
<p>Let me say that more clearly. I&#8217;ll let <img src='http://l.wordpress.com/latex.php?latex=Q&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Q' title='Q' class='latex' /> be the set of all quadratic phase functions, which are functions of the form <img src='http://l.wordpress.com/latex.php?latex=%28-1%29%5E%7Bp%28x%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(-1)^{p(x)}' title='(-1)^{p(x)}' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> is a quadratic in <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> variables over <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BF%7D_2.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{F}_2.' title='\mathbb{F}_2.' class='latex' /> I&#8217;ll then define <img src='http://l.wordpress.com/latex.php?latex=%5C%7Cf%5C%7C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|f\|' title='\|f\|' class='latex' /> to be <img src='http://l.wordpress.com/latex.php?latex=%5Cmax%5C%7B%5Clangle+f%2Cg%5Crangle%3Ag%5Cin+Q%5C%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\max\{\langle f,g\rangle:g\in Q\}.' title='\max\{\langle f,g\rangle:g\in Q\}.' class='latex' /> Finally, the property I&#8217;m interested in is the property of satisfying the inequality <img src='http://l.wordpress.com/latex.php?latex=%5C%7Cf%5C%7C%5E%2A%5Cgeq+N%5E%7B1%2F2%7D%2F1000.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|f\|^*\geq N^{1/2}/1000.' title='\|f\|^*\geq N^{1/2}/1000.' class='latex' /> </p>
<p>I don&#8217;t have a proof, but I&#8217;m guessing that random functions have this property (because they do if you replace &#8220;quadratic&#8221; by &#8220;linear&#8221;). Also, I&#8217;m guessing that this property is not in P.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Can you quickly justify that guess?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I can, but not all that quickly. A first step would be the claim that distinguishing between Boolean functions with <img src='http://l.wordpress.com/latex.php?latex=%5C%7Cf%5C%7C%5Capprox+CN%5E%7B-1%2F2%7D%5Clog+N&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|f\|\approx CN^{-1/2}\log N' title='\|f\|\approx CN^{-1/2}\log N' class='latex' /> (the random functions) and Boolean functions with <img src='http://l.wordpress.com/latex.php?latex=%5C%7Cf%5C%7C%5Capprox+CN%5E%7B-1%2F2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|f\|\approx CN^{-1/2}' title='\|f\|\approx CN^{-1/2}' class='latex' /> (the better-than-random functions) is impossible in polynomial time. My reasoning there would be that there are <img src='http://l.wordpress.com/latex.php?latex=2%5E%7Bn%5E2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{n^2}' title='2^{n^2}' class='latex' /> functions in <img src='http://l.wordpress.com/latex.php?latex=Q&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Q' title='Q' class='latex' />, and the sort of correlation one is looking for is so small that there are no clever methods to choose which of these functions will correlate with some random Boolean function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' />. So one has to do a brute-force search, and <img src='http://l.wordpress.com/latex.php?latex=2%5E%7Bn%5E2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{n^2}' title='2^{n^2}' class='latex' /> is a superpolynomial function of <img src='http://l.wordpress.com/latex.php?latex=2%5En.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^n.' title='2^n.' class='latex' /> </p>
<p>And then I&#8217;d claim that calculating the dual is even harder. One can in fact prove that the dual norm of <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> is the smallest possible sum <img src='http://l.wordpress.com/latex.php?latex=%5Csum_i%7C%5Clambda_i%7C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sum_i|\lambda_i|' title='\sum_i|\lambda_i|' class='latex' /> for which it is possible to write <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> as a linear combination <img src='http://l.wordpress.com/latex.php?latex=%5Csum%5Clambda_iq_i%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sum\lambda_iq_i,' title='\sum\lambda_iq_i,' class='latex' /> where each <img src='http://l.wordpress.com/latex.php?latex=q_i%5Cin+Q.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='q_i\in Q.' title='q_i\in Q.' class='latex' /> But there seems to be no earthly way of guessing which functions <img src='http://l.wordpress.com/latex.php?latex=q_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='q_i' title='q_i' class='latex' /> will be useful for the purposes of expressing <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> as a linear combination of this kind. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Thanks &#8212; that sounds reasonably convincing.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I think so too, but unfortunately it doesn&#8217;t help. Prompted again by what goes on in the linear case, one might well guess that the dual norm of a random function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> is maximal to within a constant &#8212; in other words, it is around <img src='http://l.wordpress.com/latex.php?latex=N%5E%7B1%2F2%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N^{1/2}.' title='N^{1/2}.' class='latex' /> The rough reason for this would be that <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> itself would correlate with some quadratic phase functions, but by modifying it a bit, one should be able to come up with a function <img src='http://l.wordpress.com/latex.php?latex=f%27&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f&#039;' title='f&#039;' class='latex' /> that correlates well with <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> but no longer correlates with those few quadratic phase functions that <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> correlated with.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Now you have stopped convincing me.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Why?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Because it seems to me that there&#8217;s an important difference between the linear case and the quadratic case. The expected number of linear phase functions that a random <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> correlates with is a small fraction of <img src='http://l.wordpress.com/latex.php?latex=2%5En.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^n.' title='2^n.' class='latex' /> So it seems reasonable to suppose that with a small amount of tinkering one can get rid of these correlations without radically changing <img src='http://l.wordpress.com/latex.php?latex=f.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f.' title='f.' class='latex' /> But in the quadratic case the number of troublesome correlations is a small fraction of <img src='http://l.wordpress.com/latex.php?latex=2%5E%7Bn%5E2%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{n^2},' title='2^{n^2},' class='latex' /> which will be far bigger than <img src='http://l.wordpress.com/latex.php?latex=2%5En%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^n,' title='2^n,' class='latex' /> the dimension of the space we are talking about. So it&#8217;s not at all obvious that we can get rid of all these correlations: the functions we are trying to avoid will probably span the whole space many times over.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  That&#8217;s actually quite an interesting point. It affects what I was saying, but it doesn&#8217;t affect the main point I was leading up to, which is that I still don&#8217;t see any distinction between random and pseudorandom functions. To see why I&#8217;m saying this, consider some fixed quadratic phase function <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> and let&#8217;s consider the behaviour of <img src='http://l.wordpress.com/latex.php?latex=%5Clangle+f%2Cg%5Crangle%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\langle f,g\rangle,' title='\langle f,g\rangle,' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> is a pseudorandom function of complexity <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> (from the model that <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> doesn&#8217;t entirely trust). My heuristic principle is that the expected correlation decays exponentially, so when <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> is something like <img src='http://l.wordpress.com/latex.php?latex=1000n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1000n' title='1000n' class='latex' /> then it is as small as it can be, on average at least. </p>
<p>Next, I get to a statement that I don&#8217;t really see how to justify, even heuristically, which is that the probability of deviating from the mean should be similar in the pseudorandom case to what it is in the random case. So if you now look at all quadratic phase functions, you will find that about <img src='http://l.wordpress.com/latex.php?latex=e%5E%7B-ct%5E2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='e^{-ct^2}' title='e^{-ct^2}' class='latex' /> of them will have correlations greater than <img src='http://l.wordpress.com/latex.php?latex=tN%5E%7B-1%2F2%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='tN^{-1/2}.' title='tN^{-1/2}.' class='latex' /> </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I think I might be able to justify that.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Oh really? How? </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Well, one could say that the property of having an inner product of at least <img src='http://l.wordpress.com/latex.php?latex=tN%5E%7B-1%2F2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='tN^{-1/2}' title='tN^{-1/2}' class='latex' /> with some fixed quadratic phase function is a polynomially computable property that holds for an appreciable fraction of all Boolean functions (that fraction being <img src='http://l.wordpress.com/latex.php?latex=e%5E%7B-ct%5E2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='e^{-ct^2}' title='e^{-ct^2}' class='latex' />,) so the probability that it happens for a pseudorandom function should be roughly the same as the probability that it happens for a random function.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Oh yes. I think I buy that. I think from that argument we can extract a fairly general principle. Let <img src='http://l.wordpress.com/latex.php?latex=%5CPhi&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Phi' title='\Phi' class='latex' /> be any collection of functions that take Boolean functions to the real numbers. (It may well be possible to genearalize this, but I&#8217;ll stick with real-valued functions for now.) Suppose that every function in <img src='http://l.wordpress.com/latex.php?latex=%5CPhi&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Phi' title='\Phi' class='latex' /> is polynomial-time computable, in the sense that we can decide in polynomial time (in <img src='http://l.wordpress.com/latex.php?latex=2%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^n' title='2^n' class='latex' />) the approximate value of <img src='http://l.wordpress.com/latex.php?latex=%5CPhi%28f%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Phi(f)' title='\Phi(f)' class='latex' /> (up to an error of <img src='http://l.wordpress.com/latex.php?latex=C%5E%7B-n%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='C^{-n},' title='C^{-n},' class='latex' /> say). Then the distribution of <img src='http://l.wordpress.com/latex.php?latex=%5CPhi%28f%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Phi(f)' title='\Phi(f)' class='latex' /> over random functions <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> should be almost exactly the same as the distribution over pseudorandom functions. </p>
<p>Oh &#8230; just a moment.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> What&#8217;s happened this time?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Well, it&#8217;s occurred to me that there might after all be a difference between the norm of a random function and the norm of a pseudorandom function, where the norm is that quadratic-correlation norm.</p>
<p>It goes back to your argument that purported to show that the two norms were the same. It seems to me that that argument is fine up to <img src='http://l.wordpress.com/latex.php?latex=t%3D%5Csqrt%7Bn%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='t=\sqrt{n},' title='t=\sqrt{n},' class='latex' /> but once you get bigger than that, the probability <img src='http://l.wordpress.com/latex.php?latex=e%5E%7B-ct%5E2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='e^{-ct^2}' title='e^{-ct^2}' class='latex' /> becomes subexponential, so exponentially approximating it is not good enough. </p>
<p>So at the moment it feels as though the norm of a random function should be around <img src='http://l.wordpress.com/latex.php?latex=cnN%5E%7B-1%2F2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='cnN^{-1/2}' title='cnN^{-1/2}' class='latex' /> (because there are <img src='http://l.wordpress.com/latex.php?latex=e%5E%7Bcn%5E2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='e^{cn^2}' title='e^{cn^2}' class='latex' /> chances for it to be big, with a probability of <img src='http://l.wordpress.com/latex.php?latex=e%5E%7B-cn%5E2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='e^{-cn^2}' title='e^{-cn^2}' class='latex' /> each), whereas it seems very unlikely that a random model based on just <img src='http://l.wordpress.com/latex.php?latex=%7B%7DCn&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{}Cn' title='{}Cn' class='latex' /> random steps would give rise to events with that low a probability.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Are you saying that pseudorandom functions are more random than random ones?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Oh dear, something odd is going on here isn&#8217;t it? Maybe I am paying the price for ignoring the fact that the values of <img src='http://l.wordpress.com/latex.php?latex=%5Clangle+f%2Cg%5Crangle&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\langle f,g\rangle' title='\langle f,g\rangle' class='latex' /> for the various quadratic phase functions <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> are not independent. I was hoping that they would be sufficiently independent, however. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Yes, but once you know <img src='http://l.wordpress.com/latex.php?latex=2%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^n' title='2^n' class='latex' /> of those correlations, then morally you know all the rest, so perhaps it was just completely unrealistic to hope for events to have subexponential probabilities.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  There&#8217;s also something else that I haven&#8217;t quite got to the bottom of, which is that if we define a norm by taking <img src='http://l.wordpress.com/latex.php?latex=%5CGamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma' title='\Gamma' class='latex' /> to be the set of all functions of circuit complexity at most <img src='http://l.wordpress.com/latex.php?latex=n%5E%7B%5Clog+n%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^{\log n}' title='n^{\log n}' class='latex' /> and set <img src='http://l.wordpress.com/latex.php?latex=%5C%7Cf%5C%7C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|f\|' title='\|f\|' class='latex' /> to be <img src='http://l.wordpress.com/latex.php?latex=%5Cmax%5C%7B%7C%5Clangle+f%2Cg%5Crangle%7C%3Ag%5Cin%5CGamma%5C%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\max\{|\langle f,g\rangle|:g\in\Gamma\},' title='\max\{|\langle f,g\rangle|:g\in\Gamma\},' class='latex' /> then trivially all functions of polynomial circuit complexity have norm 1, whereas random functions have very small norm. But why can&#8217;t we say that for <em>any</em> fixed function <img src='http://l.wordpress.com/latex.php?latex=g%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g,' title='g,' class='latex' /> the probability that <img src='http://l.wordpress.com/latex.php?latex=%7C%5Clangle+f%2Cg%5Crangle%7C%5Cgeq+tN%5E%7B-1%2F2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='|\langle f,g\rangle|\geq tN^{-1/2}' title='|\langle f,g\rangle|\geq tN^{-1/2}' class='latex' /> given that <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> has circuit complexity at most <img src='http://l.wordpress.com/latex.php?latex=1000n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1000n' title='1000n' class='latex' /> is roughly the same as the probability for a purely random <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' />? </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I think you can. It&#8217;s just that &#8220;roughly the same as&#8221; means that the two probabilities are exponentially close. So the fact that a random function is astronomically unlikely to have almost perfect correlation with a fixed function <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> of circuit complexity at most <img src='http://l.wordpress.com/latex.php?latex=n%5E%7B%5Clog+n%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^{\log n}' title='n^{\log n}' class='latex' /> allows us to deduce that a pseudorandom function has an exponentially small probability of this. The latter is compatible with every pseudorandom function correlating with at least one function of circuit complexity at most <img src='http://l.wordpress.com/latex.php?latex=n%5E%7B%5Clog+n%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^{\log n}' title='n^{\log n}' class='latex' /> (as there are more than exponentially many of these), while the former is compatible with almost every random function <em>not</em> correlating with any such functions.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  There&#8217;s a simple but quite useful general moral to draw from that, I think. It&#8217;s that if you want your simplicity property to imply correlation with a function from some collection <img src='http://l.wordpress.com/latex.php?latex=%5CGamma%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma,' title='\Gamma,' class='latex' /> then on the one hand you need <img src='http://l.wordpress.com/latex.php?latex=%5CGamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma' title='\Gamma' class='latex' /> to have superexponential size in <img src='http://l.wordpress.com/latex.php?latex=n%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n,' title='n,' class='latex' /> or equivalently superpolynomial size in <img src='http://l.wordpress.com/latex.php?latex=2%5En.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^n.' title='2^n.' class='latex' /> That&#8217;s because otherwise you can check in polynomial time (in <img src='http://l.wordpress.com/latex.php?latex=2%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^n' title='2^n' class='latex' />) just by brute force what the largest correlation is with a function in <img src='http://l.wordpress.com/latex.php?latex=%5CGamma.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma.' title='\Gamma.' class='latex' /> Conversely, if <img src='http://l.wordpress.com/latex.php?latex=%5CGamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma' title='\Gamma' class='latex' /> does have superexponential size, then there is at least a theoretical chance that it can be used as the basis for a simplicity property. All this makes me want to think some more about the quadratic correlation norm, where <img src='http://l.wordpress.com/latex.php?latex=%5CGamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma' title='\Gamma' class='latex' />, the set of quadratic phase functions, has size <img src='http://l.wordpress.com/latex.php?latex=2%5E%7Bcn%5E2%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{cn^2}.' title='2^{cn^2}.' class='latex' /></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  I&#8217;m getting a bit tired. Can we save that for another day?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Fine by me.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> And by me.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/909/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/909/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/gowers.wordpress.com/909/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/gowers.wordpress.com/909/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/gowers.wordpress.com/909/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/gowers.wordpress.com/909/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/gowers.wordpress.com/909/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/gowers.wordpress.com/909/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/gowers.wordpress.com/909/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/gowers.wordpress.com/909/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&blog=1659011&post=909&subd=gowers&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2009/10/21/a-conversation-about-complexity-lower-bounds-vii/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>Miscellaneous matters</title>
		<link>http://gowers.wordpress.com/2009/10/20/miscellaneous-matters/</link>
		<comments>http://gowers.wordpress.com/2009/10/20/miscellaneous-matters/#comments</comments>
		<pubDate>Tue, 20 Oct 2009 13:40:33 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[Mathematics on the internet]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Princeton Companion To Mathematics]]></category>
		<category><![CDATA[polymath]]></category>
		<category><![CDATA[polymath1]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=1096</guid>
		<description><![CDATA[Michael Nielsen and I have written an Opinion Piece for Nature about the Polymath project and related matters. Thanks almost entirely to Ryan O&#8217;Donnell, a preprint  at last exists that contains Polymath&#8217;s proof of the density Hales-Jewett theorem with all the details. It will be posted on the arXiv very soon and I will [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&blog=1659011&post=1096&subd=gowers&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Michael Nielsen and I have written an <a href="http://www.nature.com/nature/journal/v461/n7266/full/461879a.html">Opinion Piece for Nature</a> about the Polymath project and related matters. Thanks almost entirely to Ryan O&#8217;Donnell, a preprint  at last exists that contains Polymath&#8217;s proof of the density Hales-Jewett theorem with all the details. It will be posted on the arXiv very soon and I will update this post when it is. </p>
<p>Update: it can be found <a href="http://arxiv.org/abs/0910.3926">here</a>. Owing to a misunderstanding, it was posted before I had any input into it, but in any case, the full proof is here, even if the version that is submitted for publication will have some changes.</p>
<p>The Notices of the AMS have published <a href="http://www.ams.org/notices/200910/rtx091001276p.pdf">five back-to-back reviews</a> of the Princeton Companion to Mathematics. They are by Bryan Birch, Simon Donaldson, Gil Kalai, Richard Kenyon and Angus Macintyre. </p>
<p>From <a href="http://quomodocumque.wordpress.com/2009/10/17/why-math-overflow-works-and-why-it-might-not/">Quomodocumque</a> I learned of a new website, <a href="http://mathoverflow.net/"> Math Overflow</a>, where you can ask and answer mathematical questions. It seems to be very active, with a lot of users, rating systems for comments and commenters, and the like. So in principle it could be another mechanism for pooling the resources of mathematicians with the help of the internet. For example, if you need a certain statement to be true and do not know whether it is known, then my guess is that you could find out pretty quickly if you post a question there. For more discussion, see <a href="http://sbseminar.wordpress.com/2009/10/14/math-overflow/"> a post over at the Secret Blogging Seminar</a>.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/1096/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/1096/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/gowers.wordpress.com/1096/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/gowers.wordpress.com/1096/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/gowers.wordpress.com/1096/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/gowers.wordpress.com/1096/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/gowers.wordpress.com/1096/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/gowers.wordpress.com/1096/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/gowers.wordpress.com/1096/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/gowers.wordpress.com/1096/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&blog=1659011&post=1096&subd=gowers&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2009/10/20/miscellaneous-matters/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>A conversation about complexity lower bounds, VI</title>
		<link>http://gowers.wordpress.com/2009/10/16/a-conversation-about-complexity-lower-bounds-vi/</link>
		<comments>http://gowers.wordpress.com/2009/10/16/a-conversation-about-complexity-lower-bounds-vi/#comments</comments>
		<pubDate>Fri, 16 Oct 2009 09:06:36 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[complexity]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=781</guid>
		<description><![CDATA[The story so far: our three characters,  ,  and  , have discussed in detail an approach that   has to proving complexity lower bounds. At the beginning of the discussion, they established that the idea would be very unlikely to prove lower bounds for circuit complexity, which is what would be [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&blog=1659011&post=781&subd=gowers&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>The story so far: our three characters, <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> , <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> and <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' /> , have discussed in detail an approach that <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  has to proving complexity lower bounds. At the beginning of the discussion, they established that the idea would be very unlikely to prove lower bounds for circuit complexity, which is what would be needed to prove that P<img src='http://l.wordpress.com/latex.php?latex=%5Cne&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ne' title='\ne' class='latex' />NP. More recently, they have given arguments that strongly suggest that the idea cannot prove interesting new bounds even for weaker models. In this segment of the dialogue, they nevertheless consider what kinds of methods would be needed in order to prove lower bounds for circuit complexity (as opposed to formula complexity). They don&#8217;t really get anywhere, and this part of the discussion ends rather abruptly, but they mention one or two interesting facts along the way.</p>
<p>***************************************</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  After all that, I have to say that I&#8217;m still slightly disappointed that it seems that even if the ideas I&#8217;ve been having <em>could</em> be got to work, they are most unlikely to show that P<img src='http://l.wordpress.com/latex.php?latex=%5Cne&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ne' title='\ne' class='latex' />NP.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  I didn&#8217;t really understand why not. Can somebody remind me? <span id="more-781"></span></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I can. Recall that a <em>formal complexity measure</em> is a function <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa' title='\kappa' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa%28f%29%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa(f)=1' title='\kappa(f)=1' class='latex' /> for every basic function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa%28f%29%3D%5Ckappa%28-f%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa(f)=\kappa(-f)' title='\kappa(f)=\kappa(-f)' class='latex' /> for every <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa%28f%5Cvee+g%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa(f\vee g)' title='\kappa(f\vee g)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa%28f%5Cwedge+g%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa(f\wedge g)' title='\kappa(f\wedge g)' class='latex' /> are both at most <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa%28f%29%2B%5Ckappa%28g%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa(f)+\kappa(g)' title='\kappa(f)+\kappa(g)' class='latex' /> for every <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' />. Now if all you know about <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa' title='\kappa' class='latex' /> is that it is a formal complexity measure, then all you can deduce from the fact that <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa%28f%29%5Cleq+m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa(f)\leq m' title='\kappa(f)\leq m' class='latex' /> is that <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> has formula size at most <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' />. Therefore, if there exist functions with low circuit complexity but very high formula size, then one cannot use a formal complexity measure to prove that P<img src='http://l.wordpress.com/latex.php?latex=%5Cne&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ne' title='\ne' class='latex' />NP unless it has some further properties. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Hmm &#8230; it seems that you need a pretty strong statement to back up what you&#8217;ve just said. If I can find a formal complexity measure <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa' title='\kappa' class='latex' /> and also a function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> in NP with <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa%28f%29%3DM&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa(f)=M' title='\kappa(f)=M' class='latex' />, then that will prove that P<img src='http://l.wordpress.com/latex.php?latex=%5Cne&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ne' title='\ne' class='latex' />NP unless there exist functions of polynomial circuit complexity and formula size at least <img src='http://l.wordpress.com/latex.php?latex=M&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='M' title='M' class='latex' />. Is it believed that a function of polynomial circuit complexity could have exponentially large formula size, for instance?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I&#8217;d have to check. But if you take a fairly random circuit, then the obvious way of turning it into a formula certainly does require the size of that formula to be exponential, and it&#8217;s hard to see how to do any better.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  How do you mean, &#8220;the obvious way of turning it into a formula&#8221;? </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Well, if you&#8217;ve got a sequence of functions <img src='http://l.wordpress.com/latex.php?latex=f_1%2C%5Cdots%2Cf_m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_1,\dots,f_m' title='f_1,\dots,f_m' class='latex' /> with each one either a basic function or a Boolean operation of two earlier functions, you can just build up a formula in the same way. For instance, if we write <img src='http://l.wordpress.com/latex.php?latex=e_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='e_i' title='e_i' class='latex' /> for the <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' />th basic function, then we could define a function by taking <img src='http://l.wordpress.com/latex.php?latex=f_1%3De_1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_1=e_1' title='f_1=e_1' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=f_2%3De_2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_2=e_2' title='f_2=e_2' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=f_3%3De_3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_3=e_3' title='f_3=e_3' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=f_4%3De_4&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_4=e_4' title='f_4=e_4' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=f_5%3Df_1%5Cvee+f_2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_5=f_1\vee f_2' title='f_5=f_1\vee f_2' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=f_6%3Df_3%5Cvee+f_4&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_6=f_3\vee f_4' title='f_6=f_3\vee f_4' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=f_7%3Df_5%5Cwedge+f_6&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_7=f_5\wedge f_6' title='f_7=f_5\wedge f_6' class='latex' />. Unpacking this, we get</p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle+f_7%3D%28f_1%5Cvee+f_2%29%5Cwedge%28f_3%5Cvee+f_4%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle f_7=(f_1\vee f_2)\wedge(f_3\vee f_4).' title='\displaystyle f_7=(f_1\vee f_2)\wedge(f_3\vee f_4).' class='latex' /></p>
<p>If you don&#8217;t count the basic functions in the definition of circuit complexity, then the circuit complexity of <img src='http://l.wordpress.com/latex.php?latex=f_7&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_7' title='f_7' class='latex' /> is 3, while this formula has length 4. And that lack of economy goes up rapidly as more and more functions in the circuit depend on earlier ones. If you play around with it a bit, you&#8217;ll be convinced.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Great, thanks, I believe you. In fact, it&#8217;s sort of obvious really: as you increase the length of the sequence of functions, to get the formula complexity of <img src='http://l.wordpress.com/latex.php?latex=f_m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_m' title='f_m' class='latex' /> you add the formula complexities of earlier functions, whereas to get the circuit complexity you just add 1. So the formula complexity really does look as though it goes up much more rapidly. Of course, that&#8217;s not a rigorous proof, since there might be some less obvious way of producing a formula that was far shorter. But I agree that that looks very unlikely.</p>
<p>Actually, couldn&#8217;t one use this thought to tackle the formula-size question? Perhaps one way to produce a function in NP with large formula size is actually to produce a function in P with large formula size by picking a random sequence of Boolean operations.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Two comments on that. One is that a random sequence of Boolean operations will give you a function of low circuit complexity, but that&#8217;s not quite the same as a function that can be computed in polynomial time. The second is that you&#8217;ll still need a proof that your random function has high formula complexity.</p>
<p>[<em>See <a href="http://gowers.wordpress.com/2009/10/16/a-conversation-about-complexity-lower-bounds-vi/#comment-4177">this comment of Ryan Williams</a> for an explanation of why the first objection can be dealt with.</em>]</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  I was wondering whether maybe it would have <em>maximal</em> formula complexity in the following sense: the obvious formula really is the best you can do, as long as you obey some simple rules that don&#8217;t allow you to produce a shorter formula in a trivial way. For example, you can shorten <img src='http://l.wordpress.com/latex.php?latex=%28f%5Cvee+g%29%5Cwedge%28f%5Cvee+h%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(f\vee g)\wedge(f\vee h)' title='(f\vee g)\wedge(f\vee h)' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=f%5Cvee%28g%5Cwedge+h%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f\vee(g\wedge h)' title='f\vee(g\wedge h)' class='latex' />, but that depended on the fact that <img src='http://l.wordpress.com/latex.php?latex=f%5Cvee+g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f\vee g' title='f\vee g' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=f%5Cvee+h&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f\vee h' title='f\vee h' class='latex' /> have some overlap. Perhaps in the random case you just wouldn&#8217;t get that kind of overlap.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> If that&#8217;s the sort of thing you&#8217;d like to do, then let me tell you something depressing. It&#8217;s a well-known approach that morally ought to give a lower bound for formula complexity, but nobody has managed to get it to work.</p>
<p>Suppose that <img src='http://l.wordpress.com/latex.php?latex=f_1%2C%5Cdots%2Cf_k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_1,\dots,f_k' title='f_1,\dots,f_k' class='latex' /> are formulae, and that they depend on disjoint sets of variables. Suppose also that <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> is a formula that depends on <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> variables. Then you might expect that the shortest formula for computing the function <img src='http://l.wordpress.com/latex.php?latex=f%28f_1%28x%29%2C%5Cdots%2Cf_k%28x%29%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(f_1(x),\dots,f_k(x))' title='f(f_1(x),\dots,f_k(x))' class='latex' /> is the obvious one, where you compute <img src='http://l.wordpress.com/latex.php?latex=f_i%28x%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_i(x)' title='f_i(x)' class='latex' /> for each <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' /> and then apply <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' />. Therefore, if <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> has formula size <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> and all the <img src='http://l.wordpress.com/latex.php?latex=f_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_i' title='f_i' class='latex' /> have formula size <img src='http://l.wordpress.com/latex.php?latex=s&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='s' title='s' class='latex' />, you might expect the formula size of <img src='http://l.wordpress.com/latex.php?latex=f%28f_1%28x%29%2C%5Cdots%2Cf_k%28x%29%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(f_1(x),\dots,f_k(x))' title='f(f_1(x),\dots,f_k(x))' class='latex' /> to be <img src='http://l.wordpress.com/latex.php?latex=rs&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='rs' title='rs' class='latex' />. </p>
<p>Now choose a random function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> of <img src='http://l.wordpress.com/latex.php?latex=r%3D%5Clog+n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r=\log n' title='r=\log n' class='latex' /> variables. This will have formula size <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' />, which is somewhere close to <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' />, by a simple counting argument. We can then build a new function <img src='http://l.wordpress.com/latex.php?latex=f_1%3Df%28f%2Cf%2C%5Cdots%2Cf%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_1=f(f,f,\dots,f)' title='f_1=f(f,f,\dots,f)' class='latex' />, where it is to be understood that we are applying <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> disjoint sets of <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> variables. Then <img src='http://l.wordpress.com/latex.php?latex=f_1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_1' title='f_1' class='latex' /> has formula size <img src='http://l.wordpress.com/latex.php?latex=m%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m^2' title='m^2' class='latex' />, if we believe our previous argument. Then we can build another function <img src='http://l.wordpress.com/latex.php?latex=f_2%3Df%28f_1%2C%5Cdots%2Cf_1%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_2=f(f_1,\dots,f_1)' title='f_2=f(f_1,\dots,f_1)' class='latex' />, which now depends on <img src='http://l.wordpress.com/latex.php?latex=r%5E3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r^3' title='r^3' class='latex' /> variables and has formula size <img src='http://l.wordpress.com/latex.php?latex=m%5E3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m^3' title='m^3' class='latex' />, and so on. We keep going until we use all <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> variables, which happens when we reach the function <img src='http://l.wordpress.com/latex.php?latex=f_k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_k' title='f_k' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=r%5Ek%3Dn&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r^k=n' title='r^k=n' class='latex' />. This function has formula size <img src='http://l.wordpress.com/latex.php?latex=m%5Ek&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m^k' title='m^k' class='latex' />, which equals <img src='http://l.wordpress.com/latex.php?latex=n%5E%7B%5Clog+m%2F%5Clog+r%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^{\log m/\log r}' title='n^{\log m/\log r}' class='latex' />, which is about <img src='http://l.wordpress.com/latex.php?latex=n%5E%7B%5Clog+n%2F%5Clog%5Clog+n%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^{\log n/\log\log n}' title='n^{\log n/\log\log n}' class='latex' />, which is superpolynomial. But clearly the function <img src='http://l.wordpress.com/latex.php?latex=f_k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_k' title='f_k' class='latex' /> can be computed in polynomial time, since <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> can (as it depends on just <img src='http://l.wordpress.com/latex.php?latex=%5Clog+n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\log n' title='\log n' class='latex' /> variables).</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  That&#8217;s pretty cool. So what&#8217;s wrong with the argument?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Just the fact that I stated a major lemma and didn&#8217;t prove it. I didn&#8217;t justify the statement that the shortest formula for <img src='http://l.wordpress.com/latex.php?latex=f%28f_i%2Cf_i%2C%5Cdots%2Cf_i%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(f_i,f_i,\dots,f_i)' title='f(f_i,f_i,\dots,f_i)' class='latex' /> is the obvious one. And I think that&#8217;s an unsolved problem: certainly, it&#8217;s either an unsolved problem or someone&#8217;s found a counterexample, since I know that the formula-size problem is still open.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  I suppose another point is that even if this argument worked, it wouldn&#8217;t actually give a superpolynomial bound for formula size, because the function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> that you used to build everything up was chosen randomly rather than given by some algorithm.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> You&#8217;re right. In the absence of any further tricks, what this argument would show is that there was a function of polynomial circuit complexity and superpolynomial formula size. [<em>See again <a href="http://gowers.wordpress.com/2009/10/16/a-conversation-about-complexity-lower-bounds-vi/#comment-4177">Ryan Williams's comment.</a></em>] But even without any new ideas, one would in principle be able to use this approach to obtain arbitrarily large polynomial lower bounds for formula size.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  How would you do that?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Well, even if we can&#8217;t choose a random function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> to get things going, we could in theory take some fixed <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> and find a suitable function by brute force.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Sorry to be slow, but how would you do even that?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> You&#8217;d look at every single Boolean function of <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> variables and every single formula of size at most, say, <img src='http://l.wordpress.com/latex.php?latex=%283%2F2%29%5Er&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(3/2)^r' title='(3/2)^r' class='latex' />, and pick out a Boolean function that is not given by one of the formulae.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Wow. There are <img src='http://l.wordpress.com/latex.php?latex=2%5E%7B2%5Er%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{2^r}' title='2^{2^r}' class='latex' /> Boolean functions in <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> variables, so that&#8217;s going to take a pretty long time, even for very small <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' />.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Yes. That&#8217;s why I used the phrase &#8220;in principle&#8221; earlier. But even distinguishing between polynomial circuit complexity and polynomial formula size would be very interesting.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Er, excuse me.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Yes?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  We seem to have got sidetracked. I was just wondering whether it would be possible to return to the main topic.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> How do you mean, &#8220;the main topic&#8221;?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I mean the question of whether any of the ideas we&#8217;ve been discussing could be used to prove that P<img src='http://l.wordpress.com/latex.php?latex=%5Cne&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ne' title='\ne' class='latex' />NP rather than just the weaker statement that some explicit functions have large formula size. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Ah yes, your obsession with the million-dollar prize &#8230;</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  That&#8217;s not fair. I absolutely agree that the formula-size problem is a great problem, but that doesn&#8217;t mean we shouldn&#8217;t even <em>think</em> about circuit complexity.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Relax, I&#8217;m just teasing you. <em>Everyone</em> would prefer to solve the P versus NP problem if there was a chance of doing so, including me.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  OK, well perhaps you&#8217;d like to answer a question that&#8217;s been bugging me. Here is a proof that the notion of a formal complexity measure is very natural if one is thinking about formula size. We&#8217;ve already discussed it, but let me quickly go through it again just so I can be clear about the question I want to go on to ask.</p>
<p>One can arrive at the notion of a formal complexity measure by asking the following question: if we define <img src='http://l.wordpress.com/latex.php?latex=%5Clambda%28f%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda(f)' title='\lambda(f)' class='latex' /> to be the formula size of <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' />, then what properties are satisfied by <img src='http://l.wordpress.com/latex.php?latex=%5Clambda&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda' title='\lambda' class='latex' />? And one quickly observes that <img src='http://l.wordpress.com/latex.php?latex=%5Clambda%28f%29%3D%5Clambda%28-f%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda(f)=\lambda(-f)' title='\lambda(f)=\lambda(-f)' class='latex' />, and that <img src='http://l.wordpress.com/latex.php?latex=%5Clambda%28f%5Cvee+g%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda(f\vee g)' title='\lambda(f\vee g)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Clambda%28f%5Cwedge+g%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda(f\wedge g)' title='\lambda(f\wedge g)' class='latex' /> are both at most <img src='http://l.wordpress.com/latex.php?latex=%5Clambda%28f%29%2B%5Clambda%28g%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda(f)+\lambda(g)' title='\lambda(f)+\lambda(g)' class='latex' />. Having done that, one can define a formal complexity measure to be any function with these properties, and one then observes that <img src='http://l.wordpress.com/latex.php?latex=%5Clambda&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda' title='\lambda' class='latex' /> is the largest possible formal complexity measure.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Indeed.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  The idea behind our previous discussion was this. In principle, one can prove that a function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> has large formula size by finding a formal complexity measure <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa' title='\kappa' class='latex' /> and proving that <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa%28f%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa(f)' title='\kappa(f)' class='latex' /> is large. But we have two problems. Obviously, to increase our chances of success, we&#8217;d like <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa' title='\kappa' class='latex' /> to be as large a function as possible. But if we take <img src='http://l.wordpress.com/latex.php?latex=%5Clambda&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda' title='\lambda' class='latex' /> itself then we have done precisely nothing: we&#8217;ve shown that a function has large formula size if and only if it has large formula size. So there&#8217;s another requirement: we want <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa' title='\kappa' class='latex' /> to be a function that we can actually say something about, so that proving that <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa%28f%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa(f)' title='\kappa(f)' class='latex' /> is large is strictly easier than proving that <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> has large formula size. However, here we run into the problem that if <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa' title='\kappa' class='latex' /> is too simple, and in particular if it can be calculated using a polynomial-time algorithm (in <img src='http://l.wordpress.com/latex.php?latex=N%3D2%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N=2^n' title='N=2^n' class='latex' />), then it will almost certainly not work. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Let me quickly interject here that in their paper Razborov and Rudich showed that if a formal complexity measure is ever large, then it will be large for a positive fraction of all Boolean functions. So if you believe that natural proofs can&#8217;t solve the formula-size problem either, as you seem to do, then you definitely can&#8217;t use a polynomially computable formal complexity measure.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Right, that just confirms what I was saying. </p>
<p>Anyhow, on to my question. I&#8217;m just wondering whether we can go through the whole thought process again, but this time for circuit complexity. But I seem to have got stuck at the very first step.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> How do you mean?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Well, the first step of the previous argument was to look for properties satisfied by the function <img src='http://l.wordpress.com/latex.php?latex=%5Clambda.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda.' title='\lambda.' class='latex' /> So I thought I&#8217;d define <img src='http://l.wordpress.com/latex.php?latex=%5Csigma%28f%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sigma(f)' title='\sigma(f)' class='latex' /> to be the circuit complexity of <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> and try to find some properties of <img src='http://l.wordpress.com/latex.php?latex=%5Csigma.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sigma.' title='\sigma.' class='latex' /> But the properties I am coming up with don&#8217;t seem to be strong enough. For instance, if you know <img src='http://l.wordpress.com/latex.php?latex=%5Csigma%28f%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sigma(f)' title='\sigma(f)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Csigma%28g%29%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sigma(g),' title='\sigma(g),' class='latex' /> what can you say about <img src='http://l.wordpress.com/latex.php?latex=%5Csigma%28f%5Cvee+g%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sigma(f\vee g)' title='\sigma(f\vee g)' class='latex' />? The best I can come up with is that it is at most <img src='http://l.wordpress.com/latex.php?latex=%5Csigma%28f%29%2B%5Csigma%28g%29%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sigma(f)+\sigma(g),' title='\sigma(f)+\sigma(g),' class='latex' /> which is no stronger than we have for formula size.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Ah ha, that is a very good question, particularly as there isn&#8217;t any reason to think that that statement <em>can</em> be strengthened.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Why do you say that?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Well, suppose that <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> depended on entirely different sets of variables. Then it is hard to see how to calculate <img src='http://l.wordpress.com/latex.php?latex=f%5Cvee+g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f\vee g' title='f\vee g' class='latex' /> more efficiently than calculating <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' />, calculating <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' />, and taking the max of the two. That gives an upper bound <img src='http://l.wordpress.com/latex.php?latex=%5Csigma%28f%5Cvee+g%29%5Cleq%5Csigma%28f%29%2B%5Csigma%28g%29%2B1.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sigma(f\vee g)\leq\sigma(f)+\sigma(g)+1.' title='\sigma(f\vee g)\leq\sigma(f)+\sigma(g)+1.' class='latex' /> </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  What? You mean it&#8217;s even <em>worse</em> than it is for <img src='http://l.wordpress.com/latex.php?latex=%5Clambda&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lambda' title='\lambda' class='latex' />?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I think so. The trivial bound requires you to add 1. </p>
<p>I ought to say that the argument I&#8217;ve just given is another &#8220;obvious&#8221; argument that is not in fact known to be correct. I claimed that calculating the max of two functions that depend on disjoint sets of variables cannot be done more efficiently than the obvious way of doing it. But if that is really so, then one can obtain arbitrarily large linear lower bounds for circuit complexity as follows. Let <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> be a small constant, and by brute force find a function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> that has circuit complexity at least <img src='http://l.wordpress.com/latex.php?latex=%283%2F2%29%5Er.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(3/2)^r.' title='(3/2)^r.' class='latex' /> Then let <img src='http://l.wordpress.com/latex.php?latex=f_1%2C%5Cdots%2Cf_s&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_1,\dots,f_s' title='f_1,\dots,f_s' class='latex' /> be copies of <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' />, each depending on a disjoint set of <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> variables. Here, <img src='http://l.wordpress.com/latex.php?latex=s&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='s' title='s' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=%5Clfloor+n%2Fr%5Crfloor.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lfloor n/r\rfloor.' title='\lfloor n/r\rfloor.' class='latex' /> Then the obvious straight-line computation of <img src='http://l.wordpress.com/latex.php?latex=f_1%5Cvee%5Cdots%5Cvee+f_s&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_1\vee\dots\vee f_s' title='f_1\vee\dots\vee f_s' class='latex' /> has length around <img src='http://l.wordpress.com/latex.php?latex=%283%2F2%29%5Ers&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(3/2)^rs' title='(3/2)^rs' class='latex' />, which is around <img src='http://l.wordpress.com/latex.php?latex=%283%2F2%29%5Ern%2Fr.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(3/2)^rn/r.' title='(3/2)^rn/r.' class='latex' /> However, the best known lower bound for circuit complexity is something like <img src='http://l.wordpress.com/latex.php?latex=4n%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='4n,' title='4n,' class='latex' /> so I&#8217;m pretty sure that the result about functions depending on disjoint sets of variables is not known. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  What you&#8217;ve just said manages to be more irritating than one might have thought possible. I&#8217;m trying to say something about circuit complexity, and you have managed convince me not only that I can&#8217;t, but also that I won&#8217;t even be able to make rigorous the demonstration that I can&#8217;t.  </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Welcome to the wonderful world of unconditional results in theoretical computer science.</p>
<p>Hey, cheer up.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_sad.gif' alt=':(' class='wp-smiley' /> </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> OK, let me try to say something to make you feel better. I want to point out that you&#8217;re not forced to define <img src='http://l.wordpress.com/latex.php?latex=%5Csigma%28f%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sigma(f)' title='\sigma(f)' class='latex' /> to be a number, and if you don&#8217;t then there&#8217;s a slightly better chance of proving something interesting.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I feel better already. Tell me more.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I&#8217;d better warn you that what I&#8217;m about to say is less promising than it might at first seem. It&#8217;s still worth mentioning, but promise me you won&#8217;t get too excited about it. It&#8217;s a brilliant idea of Razborov, but he has also proved that it cannot be used to obtain interesting lower bounds for circuit complexity.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  What is it with this guy?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Well, I think like you he&#8217;d love to prove that P<img src='http://l.wordpress.com/latex.php?latex=%5Cne&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ne' title='\ne' class='latex' />NP. But unlike you he has made (at least) two extraordinary contributions to the area: he has proved major results in the direction of the problem, and he has checked very very carefully that they cannot be pushed any further &#8212; indeed, he has proved this rigorously.</p>
<p>Anyhow, let&#8217;s not worry about that for now. I just want to show you that there is a different notion of formal complexity measure, where you allow it to take values that are <em>sets</em> rather than numbers.</p>
<p>To explain what I mean, let&#8217;s suppose that we have a function <img src='http://l.wordpress.com/latex.php?latex=%5Crho&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho' title='\rho' class='latex' /> that associates with each Boolean function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> some set <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f).' title='\rho(f).' class='latex' /> We are thinking of <img src='http://l.wordpress.com/latex.php?latex=%5Crho&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho' title='\rho' class='latex' /> as being a measure of the complexity of <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' />, in some sense. What properties might <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f)' title='\rho(f)' class='latex' /> have that could be useful? One obvious one is to insist that <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f%5Cvee+g%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f\vee g)' title='\rho(f\vee g)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f%5Cwedge+g%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f\wedge g)' title='\rho(f\wedge g)' class='latex' /> are both contained in <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f%29%5Ccup%5Crho%28g%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f)\cup\rho(g).' title='\rho(f)\cup\rho(g).' class='latex' /> But that is not terribly helpful, because it implies that every <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f)' title='\rho(f)' class='latex' /> is contained in the union <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28e_1%29%5Ccup%5Cdots%5Ccup%5Crho%28e_n%29%5Ccup%5Crho%28-e_1%29%5Ccup%5Cdots%5Ccup%5Crho%28-e_n%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(e_1)\cup\dots\cup\rho(e_n)\cup\rho(-e_1)\cup\dots\cup\rho(-e_n).' title='\rho(e_1)\cup\dots\cup\rho(e_n)\cup\rho(-e_1)\cup\dots\cup\rho(-e_n).' class='latex' /> So it looks as though <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f)' title='\rho(f)' class='latex' /> can&#8217;t be bigger than <img src='http://l.wordpress.com/latex.php?latex=2n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2n' title='2n' class='latex' /> times what it is for a basic function.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Hang on. It seems to me that you are implicitly taking your sets to have some notion of size attached to them.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Yes I am. What I want is for <img src='http://l.wordpress.com/latex.php?latex=%5Crho&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho' title='\rho' class='latex' /> to be small for basic functions and very large for some function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> in NP. And I also want it to grow only slowly when you do Boolean operations.</p>
<p>Here is a set of axioms that would be very useful to us if we could obtain them. The idea is to allow a small error on the right-hand side. That is, instead of asking for <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f%5Cvee+g%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f\vee g)' title='\rho(f\vee g)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f%5Cwedge+g%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f\wedge g)' title='\rho(f\wedge g)' class='latex' /> to be contained in <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f%29%5Ccup%5Crho%28g%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f)\cup\rho(g)' title='\rho(f)\cup\rho(g)' class='latex' />, we ask for them to be contained in <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f%29%5Ccup%5Crho%28g%29%5Ccup%5CDelta&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f)\cup\rho(g)\cup\Delta' title='\rho(f)\cup\rho(g)\cup\Delta' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=%5CDelta&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Delta' title='\Delta' class='latex' /> is a set of small measure. And to get things started, we&#8217;ll assume that <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f%29%3D%5Cemptyset&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f)=\emptyset' title='\rho(f)=\emptyset' class='latex' /> whenever <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> is plus or minus a basic function.</p>
<p>If that is the case, then what can we say about <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> if it has small circuit complexity? Well, we have a sequence of functions <img src='http://l.wordpress.com/latex.php?latex=f_1%2C%5Cdots%2Cf_m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_1,\dots,f_m' title='f_1,\dots,f_m' class='latex' />, and for each <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> we can find <img src='http://l.wordpress.com/latex.php?latex=s%2Ct%3Cr&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='s,t&lt;r' title='s,t&lt;r' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f_r%29%5Csubset%5Crho%28f_s%29%5Ccup%5Crho%28f_t%29%5Ccup%5CDelta_r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f_r)\subset\rho(f_s)\cup\rho(f_t)\cup\Delta_r' title='\rho(f_r)\subset\rho(f_s)\cup\rho(f_t)\cup\Delta_r' class='latex' /> for some small set <img src='http://l.wordpress.com/latex.php?latex=%5CDelta_r.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Delta_r.' title='\Delta_r.' class='latex' /> From this and a trivial inductive argument it follows that <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f_m%29%5Csubset%5CDelta_1%5Ccup%5Cdots%5Ccup%5CDelta_m.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f_m)\subset\Delta_1\cup\dots\cup\Delta_m.' title='\rho(f_m)\subset\Delta_1\cup\dots\cup\Delta_m.' class='latex' /> So if we can now show for some function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> in NP that <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f)' title='\rho(f)' class='latex' /> is a set with measure that is larger than that of any set <img src='http://l.wordpress.com/latex.php?latex=%5CDelta_r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Delta_r' title='\Delta_r' class='latex' /> by a superpolynomial factor, then we have proved that <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> cannot have polynomial circuit complexity.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  OK, I&#8217;ve got lots of questions about this, but here are just two to get started. How might one go about defining a useful set-theoretic complexity measure like this? And is there some trivial argument, as there is the case of formula complexity, that gives a kind of &#8220;maximal&#8221; set-theoretic complexity measure such that the size of <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f)' title='\rho(f)' class='latex' /> is actually equal to the circuit complexity of <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' />?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Let me deal with the first of these questions by telling you about Razborov&#8217;s method of approximations. The idea of this is to construct a lattice <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BL%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{L}' title='\mathcal{L}' class='latex' /> of Boolean functions, where all that this means is that you have the basic functions (and their negatives), together with some operations, <img src='http://l.wordpress.com/latex.php?latex=%5Csqcap&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sqcap' title='\sqcap' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Csqcup&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sqcup' title='\sqcup' class='latex' />, under which <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BL%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{L}' title='\mathcal{L}' class='latex' /> is closed.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Aren&#8217;t you going to assume <em>anything</em> about those operations? For example, shouldn&#8217;t they be associative?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Oddly enough, we don&#8217;t need to assume that, but if <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BL%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{L}' title='\mathcal{L}' class='latex' /> is to be of any use, then the operations will have to be chosen very carefully, as you&#8217;ll see.</p>
<p>Indeed, what we want from <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BL%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{L}' title='\mathcal{L}' class='latex' /> is two properties. We would like the operations <img src='http://l.wordpress.com/latex.php?latex=%5Csqcup&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sqcup' title='\sqcup' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Csqcap&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sqcap' title='\sqcap' class='latex' /> to <em>approximate</em> the operations <img src='http://l.wordpress.com/latex.php?latex=%5Cwedge&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\wedge' title='\wedge' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cvee%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\vee,' title='\vee,' class='latex' /> and we would also like <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BL%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{L}' title='\mathcal{L}' class='latex' /> to be as small as possible.</p>
<p>Now suppose that we have a straight-line computation <img src='http://l.wordpress.com/latex.php?latex=f_1%2C%5Cdots%2Cf_m.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_1,\dots,f_m.' title='f_1,\dots,f_m.' class='latex' /> If we carry out exactly the same computation but replacing the operations <img src='http://l.wordpress.com/latex.php?latex=%5Cwedge&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\wedge' title='\wedge' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cvee&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\vee' title='\vee' class='latex' /> by <img src='http://l.wordpress.com/latex.php?latex=%5Csqcup&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sqcup' title='\sqcup' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Csqcap%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sqcap,' title='\sqcap,' class='latex' /> then we obtain a sequence <img src='http://l.wordpress.com/latex.php?latex=g_1%2C%5Cdots%2Cg_m.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g_1,\dots,g_m.' title='g_1,\dots,g_m.' class='latex' /> Now for each <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=s%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='s,' title='s,' class='latex' /> let&#8217;s suppose that <img src='http://l.wordpress.com/latex.php?latex=g_r%5Csqcup+g_s%3Dg_r%5Cvee+g_s&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g_r\sqcup g_s=g_r\vee g_s' title='g_r\sqcup g_s=g_r\vee g_s' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=g_r%5Csqcap+g_s%3Dg%5Cwedge+g_s%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g_r\sqcap g_s=g\wedge g_s,' title='g_r\sqcap g_s=g\wedge g_s,' class='latex' /> except on a small set <img src='http://l.wordpress.com/latex.php?latex=%5CDelta%28g_r%2Cg_s%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Delta(g_r,g_s).' title='\Delta(g_r,g_s).' class='latex' /> Then define <img src='http://l.wordpress.com/latex.php?latex=%5CDelta%28f_m%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Delta(f_m)' title='\Delta(f_m)' class='latex' /> to be the set <img src='http://l.wordpress.com/latex.php?latex=%5CDelta%28g_r%2Cg_s%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Delta(g_r,g_s)' title='\Delta(g_r,g_s)' class='latex' /> for which <img src='http://l.wordpress.com/latex.php?latex=f_m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_m' title='f_m' class='latex' /> equals <img src='http://l.wordpress.com/latex.php?latex=f_r%5Cvee+f_s&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_r\vee f_s' title='f_r\vee f_s' class='latex' /> or <img src='http://l.wordpress.com/latex.php?latex=f_r%5Cwedge+f_s.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_r\wedge f_s.' title='f_r\wedge f_s.' class='latex' /> An easy inductive argument shows that <img src='http://l.wordpress.com/latex.php?latex=f_m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_m' title='f_m' class='latex' /> agrees with <img src='http://l.wordpress.com/latex.php?latex=g_m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g_m' title='g_m' class='latex' /> except possibly on <img src='http://l.wordpress.com/latex.php?latex=%5CDelta_1%5Ccup%5Cdots%5Ccup%5CDelta_m.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Delta_1\cup\dots\cup\Delta_m.' title='\Delta_1\cup\dots\cup\Delta_m.' class='latex' /> So we can set <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f_m%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f_m)' title='\rho(f_m)' class='latex' /> to be <img src='http://l.wordpress.com/latex.php?latex=%5CDelta_1%5Ccup%5Cdots%5Ccup%5CDelta_m.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Delta_1\cup\dots\cup\Delta_m.' title='\Delta_1\cup\dots\cup\Delta_m.' class='latex' /></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Something&#8217;s bothering me here. It&#8217;s that your function <img src='http://l.wordpress.com/latex.php?latex=%5Crho&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho' title='\rho' class='latex' /> isn&#8217;t obviously well-defined. It seems to depend on the particular way that <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> is computed.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> You&#8217;re absolutely right of course. I must apologize. But it turns out that for some problems, one can prove highly non-trivial lower bounds using this kind of idea, though I have in fact oversimplified what Razborov did.</p>
<p>The bad news is that Razborov has proved that the method of approximations doesn&#8217;t yield even superlinear lower bounds for circuit complexity, unless you allow extra variables, in which case you can produce a suitable lattice in a trivial universal way.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Ah, that&#8217;s sounding close to an answer to my second question. But perhaps we can think about it for ourselves.</p>
<p>Wait, there is indeed a spectacularly trivial set-theoretic complexity measure if it&#8217;s allowed to depend on how the function is computed.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  What&#8217;s that?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  You just take the set of all functions that you used in the computation. Then if <img src='http://l.wordpress.com/latex.php?latex=f_m%3Df_r%5Cvee+f_s%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_m=f_r\vee f_s,' title='f_m=f_r\vee f_s,' class='latex' /> say, then you&#8217;d have <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f_m%29%3D%5Crho%28f_r%29%5Ccup%5Crho%28f_s%29%5Ccup%5C%7Bf_m%5C%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f_m)=\rho(f_r)\cup\rho(f_s)\cup\{f_m\}.' title='\rho(f_m)=\rho(f_r)\cup\rho(f_s)\cup\{f_m\}.' class='latex' /> So the size of <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f_m%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f_m)' title='\rho(f_m)' class='latex' /> would be at most <img src='http://l.wordpress.com/latex.php?latex=m%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m,' title='m,' class='latex' /> and it&#8217;s not hard to show that the size of <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f_m%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f_m)' title='\rho(f_m)' class='latex' /> is in fact the length of a computation closely related to the one given. (For uninteresting reasons it could be different, since <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=s&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='s' title='s' class='latex' /> might both be less than <img src='http://l.wordpress.com/latex.php?latex=f_%7Bm-1%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_{m-1},' title='f_{m-1},' class='latex' /> but in that case <img src='http://l.wordpress.com/latex.php?latex=f_%7Bm-1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_{m-1}' title='f_{m-1}' class='latex' /> would not have been used in the computation of <img src='http://l.wordpress.com/latex.php?latex=f_m.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f_m.' title='f_m.' class='latex' />)</p>
<p>I&#8217;m still interested to know whether there might exist a set-theoretic complexity measure that applies to functions rather than to computations of functions. Has Razborov shown that they cannot exist?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I&#8217;m not sure. What I can say is that his method of approximations doesn&#8217;t give you one. If you wanted it to do so then you&#8217;d have to find some way of associating with each <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> a function <img src='http://l.wordpress.com/latex.php?latex=g%3DL%28f%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g=L(f)' title='g=L(f)' class='latex' /> that belonged to the lattice <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BL%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{L},' title='\mathcal{L},' class='latex' /> in such a way that <img src='http://l.wordpress.com/latex.php?latex=L%28f%5Cvee+g%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='L(f\vee g)' title='L(f\vee g)' class='latex' /> was equal to <img src='http://l.wordpress.com/latex.php?latex=L%28f%29%5Csqcup+L%28g%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='L(f)\sqcup L(g)' title='L(f)\sqcup L(g)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=L%28f%5Cwedge+g%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='L(f\wedge g)' title='L(f\wedge g)' class='latex' /> was equal to <img src='http://l.wordpress.com/latex.php?latex=L%28f%29%5Csqcap+L%28g%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='L(f)\sqcap L(g).' title='L(f)\sqcap L(g).' class='latex' /> But that would be asking for a partition of <img src='http://l.wordpress.com/latex.php?latex=%5C%7B0%2C1%5C%7D%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{0,1\}^n' title='\{0,1\}^n' class='latex' /> that somehow respected Boolean operations. I&#8217;m basically certain that such a thing doesn&#8217;t exist. In fact, I think I&#8217;ve got a proof. If <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BL%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{L}' title='\mathcal{L}' class='latex' /> is small, then by the pigeonhole principle <img src='http://l.wordpress.com/latex.php?latex=L%28f%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='L(f)' title='L(f)' class='latex' /> would have to be the same for a large set of functions <img src='http://l.wordpress.com/latex.php?latex=f.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f.' title='f.' class='latex' /> But then you&#8217;d have two functions <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> that were almost orthogonal, from which it would follow that <img src='http://l.wordpress.com/latex.php?latex=f%5Cvee+g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f\vee g' title='f\vee g' class='latex' /> was nothing like either <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> or <img src='http://l.wordpress.com/latex.php?latex=g%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g,' title='g,' class='latex' /> but <img src='http://l.wordpress.com/latex.php?latex=L%28f%29%5Cvee+L%28g%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='L(f)\vee L(g)' title='L(f)\vee L(g)' class='latex' /> was equal to <img src='http://l.wordpress.com/latex.php?latex=L%28f%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='L(f)' title='L(f)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=L%28g%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='L(g).' title='L(g).' class='latex' /></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I don&#8217;t think an additive combinatorialist would give up just yet.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> What do you mean?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Well, I can see that there is a problem with random functions <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> if you want to define <img src='http://l.wordpress.com/latex.php?latex=L%28f%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='L(f)' title='L(f)' class='latex' /> to be a Boolean function. But couldn&#8217;t one generalize the notion of a lattice to allow functions to take values other than <img src='http://l.wordpress.com/latex.php?latex=-1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='-1' title='-1' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' />? In additive combinatorics, one tends to think of a random <img src='http://l.wordpress.com/latex.php?latex=%5Cpm+1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pm 1' title='\pm 1' class='latex' /> function as being &#8220;just a random perturbation of the zero function&#8221;. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> If you want to try something like that, then you&#8217;ve still got some serious thinking to do.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Why?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Well, if you take two independent random Boolean functions <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=g%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g,' title='g,' class='latex' /> then <img src='http://l.wordpress.com/latex.php?latex=f%5Cvee+g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f\vee g' title='f\vee g' class='latex' /> is a random function as well, but it takes the value <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' /> with probability <img src='http://l.wordpress.com/latex.php?latex=3%2F4&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='3/4' title='3/4' class='latex' /> instead of <img src='http://l.wordpress.com/latex.php?latex=1%2F2.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1/2.' title='1/2.' class='latex' /> This suggests that if the zero function <img src='http://l.wordpress.com/latex.php?latex=0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='0' title='0' class='latex' /> belongs to <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BL%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{L},' title='\mathcal{L},' class='latex' /> then <img src='http://l.wordpress.com/latex.php?latex=0%5Cvee+0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='0\vee 0' title='0\vee 0' class='latex' /> should be the constant function <img src='http://l.wordpress.com/latex.php?latex=1%2F2.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1/2.' title='1/2.' class='latex' /> But if <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> are <em>not</em> independent, then this is no longer appropriate. For instance, if <img src='http://l.wordpress.com/latex.php?latex=g%3D-f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g=-f' title='g=-f' class='latex' />, then <img src='http://l.wordpress.com/latex.php?latex=f%5Cvee+g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f\vee g' title='f\vee g' class='latex' /> is the constant function 1. So you can&#8217;t just send all random functions to the zero function and hope to preserve Boolean operations in a nice way.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I see what you mean. That looks pretty bad. But does it show that no set-theoretic complexity measure that depends just on <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> can exist? Or does it merely show that it cannot be derived from some lattice <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BL%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{L}' title='\mathcal{L}' class='latex' />?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I&#8217;m not sure. I&#8217;d be quite surprised if one existed. But perhaps it&#8217;s possible to find one in a trivial way.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  It&#8217;s difficult to see how one could define such a set without making any reference to circuits. So perhaps one would have to associate a set with each straight-line computation and then for each <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> one would take the intersection of all the sets associated with computations of <img src='http://l.wordpress.com/latex.php?latex=f.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f.' title='f.' class='latex' /></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  I&#8217;m not sure I like the sound of that. For instance, if you apply it to the &#8220;trivial&#8221; function that associates with each computation the set of Boolean functions involved in that computation, then you don&#8217;t get anything interesting when you intersect over all computations.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Yes, you&#8217;re right. One would probably have to do something fairly clever.</p>
<p>It&#8217;s a very long shot, but I wonder whether the <img src='http://l.wordpress.com/latex.php?latex=U%5Ek&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^k' title='U^k' class='latex' /> idea could somehow be converted into a set-theoretic complexity measure. For example &#8212; and I should quickly say that I do not for a moment expect this particular example to work &#8212; one could associate with each Boolean function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> the set <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f)' title='\rho(f)' class='latex' /> of all Boolean functions <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g' title='g' class='latex' /> is a witness to the fact that <img src='http://l.wordpress.com/latex.php?latex=%5C%7Cf%5C%7C_%7BU%5Ek%7D%5E%2A%5Cgeq+1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|f\|_{U^k}^*\geq 1' title='\|f\|_{U^k}^*\geq 1' class='latex' />. That is, <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f%29%3D%5C%7Bg%3A%5Clangle+f%2Cg%5Crangle%5Cgeq%5C%7Cg%5C%7C_%7BU%5Ek%7D%5C%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f)=\{g:\langle f,g\rangle\geq\|g\|_{U^k}\}.' title='\rho(f)=\{g:\langle f,g\rangle\geq\|g\|_{U^k}\}.' class='latex' /> I&#8217;ve chosen that just because the sets get bigger when <img src='http://l.wordpress.com/latex.php?latex=%5C%7Cf%5C%7C_%7BU%5Ek%7D%5E%2A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\|f\|_{U^k}^*' title='\|f\|_{U^k}^*' class='latex' /> gets bigger. But I have no reason to suppose that anything much can be said about <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f%5Cvee+g%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f\vee g)' title='\rho(f\vee g)' class='latex' /> in terms of <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28f%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(f)' title='\rho(f)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Crho%28g%29.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\rho(g).' title='\rho(g).' class='latex' /></p>
<p>In fact, I can&#8217;t face thinking about it for now. I need a rest.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' />  <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />    OK, see you again some other time.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  But just to end on not too depressed a note, I&#8217;d like to point out that a property such as &#8220;The set of functions that witness that the dual norm of <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> is at least 1 is not too large&#8221; is not completely obviously naturalizable. More generally, perhaps we can get somewhere by considering properties that are not in NP but in some class like #P (but probably not quite as extreme as that) where you have to recognise roughly how many ways that there are of checking that an NP property holds. We may manage to come up with some reason for this not working, but I think we&#8217;d have to think about it first.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/gowers.wordpress.com/781/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/gowers.wordpress.com/781/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/gowers.wordpress.com/781/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/gowers.wordpress.com/781/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/gowers.wordpress.com/781/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/gowers.wordpress.com/781/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/gowers.wordpress.com/781/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/gowers.wordpress.com/781/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/gowers.wordpress.com/781/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/gowers.wordpress.com/781/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&blog=1659011&post=781&subd=gowers&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://gowers.wordpress.com/2009/10/16/a-conversation-about-complexity-lower-bounds-vi/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/24ee673de88d3b72ddf2772a8e49008d?s=96&#38;d=identicon" medium="image">
			<media:title type="html">gowers</media:title>
		</media:content>
	</item>
		<item>
		<title>A conversation about complexity lower bounds, V</title>
		<link>http://gowers.wordpress.com/2009/10/11/a-conversation-about-complexity-lower-bounds-v/</link>
		<comments>http://gowers.wordpress.com/2009/10/11/a-conversation-about-complexity-lower-bounds-v/#comments</comments>
		<pubDate>Sun, 11 Oct 2009 13:40:30 +0000</pubDate>
		<dc:creator>gowers</dc:creator>
				<category><![CDATA[complexity]]></category>

		<guid isPermaLink="false">http://gowers.wordpress.com/?p=898</guid>
		<description><![CDATA[Here is the next instalment of the dialogue. Whereas the previous one ended on an optimistic note, this one is quite the reverse: plausible arguments are given that suggest that the  dual norm approach is unlikely to give superlinear lower bounds. A few other ideas are introduced in the process.
********************************************************
   Can I [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gowers.wordpress.com&blog=1659011&post=898&subd=gowers&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Here is the next instalment of the dialogue. Whereas the previous one ended on an optimistic note, this one is quite the reverse: plausible arguments are given that suggest that the <img src='http://l.wordpress.com/latex.php?latex=U%5Ek&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^k' title='U^k' class='latex' /> dual norm approach is unlikely to give superlinear lower bounds. A few other ideas are introduced in the process.</p>
<p>********************************************************</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Can I make a slightly cheeky comment?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  OK</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Well, over the years I&#8217;ve heard about a number of approaches that people have had to famous problems, including the P versus NP problem, that have the following characteristic. Somebody who works in a different area of mathematics comes in and claims that the right way of solving the famous problem is to reformulate it as a problem in their area. They then make grand claims for the reformulation, and there&#8217;s quite a bit of excitement, though also scepticism from those who have thought hardest about the problem, and ten years later nothing has actually happened. I&#8217;m usually a little suspicious of these reformulations, unless they feel so natural as to be almost unavoidable. So the fact that you like additive combinatorics and are trying to reformulate the P versus NP problem (or even just the formula-complexity problem) as a problem in additive combinatorics makes me very slightly wary.<span id="more-898"></span></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Yes, I was conscious of that myself, although the <img src='http://l.wordpress.com/latex.php?latex=U%5Ek&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^k' title='U^k' class='latex' /> norms have already been considered by theoretical computer scientists. It&#8217;s another reason I don&#8217;t want to make grand claims for this approach. I certainly wouldn&#8217;t say that the additive-combinatorial perspective is forced, but I do think I can argue that at least some of the ideas &#8212; particularly the duality idea &#8212; are quite natural. I think the next step is for real computer-science experts to have a look at them. <em>[This has to a large extent already happened in the comments on the previous instalments of this dialogue.]</em> I hope, 8), that you don&#8217;t mind my implication that you are not a true expert &#8212; I&#8217;m very grateful for the comments you&#8217;ve made.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> No no, absolutely not. I admit that there&#8217;s a lot I don&#8217;t know in this area, and I could easily have missed some serious objection to your proposals.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Also, I&#8217;ve been thinking a bit about your objection that the <img src='http://l.wordpress.com/latex.php?latex=U%5Ek&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^k' title='U^k' class='latex' /> dual norm is too much of a guess to be taken seriously. I think I have a heuristic argument in its favour.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Ah, that&#8217;s much more interesting. What is it?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I can summarize it as follows. I have the following picture of what a random function of circuit complexity (or perhaps formula complexity &#8212; let me be vague about this, since we can always think about it properly later if we want to develop this heuristic argument further) looks like. It springs from a result that appeared in a little known paper by Gowers about 15 years ago. In it, he defined a model of random computable functions, though in fact they were rather special functions because they were reversible, and proved that if you choose a function of complexity <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> from this model, then it will be approximately <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' />-wise independent, where <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> depends in a power-type way on <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=n.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n.' title='n.' class='latex' /></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  You&#8217;re losing me a bit here.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Let me not bother to say precisely what the model is. But to say that it is approximately <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' />-wise independent is to say that if you choose <em>any</em> <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> Boolean strings of length <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> and look at the value of a randomly chosen function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> of complexity <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' />, then the values that <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> takes at those strings is distributed almost uniformly amongst all the <img src='http://l.wordpress.com/latex.php?latex=2%5Ek&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^k' title='2^k' class='latex' /> possible assignments of values to those <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> strings. In other words, you cannot tell that <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> is not a purely random function by just looking at <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> strings of its output.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  So in a weak sense Gowers&#8217;s random model was a pseudorandom generator.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  In a very weak sense, yes, but still quite an interesting sense.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Did you know that if Gowers had waited a decade or so then he could have had a stronger result for free?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Eh? He told me he had wondered about stronger results, but got nowhere.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> That&#8217;s amusing, because it has subsequently been proved that almost <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' />-wise independence implies pseudorandomness for circuits of constant depth. This result is <a href="http://scottaaronson.com/blog/?p=381">discussed by Scott Aaronson</a> on his blog.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  So you&#8217;re saying that in fact he defined a generator that was pseudorandom for circuits of constant depth, and provably so. I must ask him whether he knows about that.</p>
<p>Anyhow, all I need for this discussion is the original result. Let us take &#8220;random computable function&#8221; to mean a function that comes from his model. I should add that Gowers was not in fact the first to define reversible computations, though he rather sweetly appeared to think he was. In particular, they are important in quantum computation. But the result about almost <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' />-wise independence was his. And <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' />-wise independence is enough to imply that the <img src='http://l.wordpress.com/latex.php?latex=U%5Ed&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^d' title='U^d' class='latex' /> norm is typically small, where <img src='http://l.wordpress.com/latex.php?latex=d%3D%5Clog_2k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d=\log_2k' title='d=\log_2k' class='latex' />. We&#8217;ve noted this already.</p>
<p>What follows is plausible guesswork. I&#8217;m going to guess that a random function of complexity <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> is almost <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' />-wise independent for some <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> that is comparable to <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' />. In fact, I&#8217;ll guess that the <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> you get is almost as big as the trivial upper bound you&#8217;d get by counting how many different functions of complexity at most <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> there are. And I&#8217;ll also guess that if you take <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> larger than the bound for which almost independence holds, then in some sense the distribution is &#8220;maximally dependent&#8221;. </p>
<p>In other words, what I&#8217;m saying is that if you choose a random function of complexity <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' />, then in a certain sense it should be completely random on sets of size up to around <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> (including on subspaces of dimension <img src='http://l.wordpress.com/latex.php?latex=d%3D%5Clog+m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d=\log m' title='d=\log m' class='latex' />, which is what we care about when evaluating the <img src='http://l.wordpress.com/latex.php?latex=U%5Ed&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^d' title='U^d' class='latex' /> norm), and as structured as it can be, given this &#8220;local&#8221; randomness, on larger sets.</p>
<p>If that is correct, then how might we detect the structure? Well, a fairly natural idea is to look at a <img src='http://l.wordpress.com/latex.php?latex=U%5Ed&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^d' title='U^d' class='latex' /> norm for some <img src='http://l.wordpress.com/latex.php?latex=d&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d' title='d' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=2%5Ed&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^d' title='2^d' class='latex' /> is bigger than <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' />. The hope is that this would pick up the large-scale correlations that are invisible to lower uniformity norms.</p>
<p>This is supposed to give an answer to the following question. If we are presented with lots of random computable functions, then it makes sense to talk about almost <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' />-wise independence. But what does it mean to look at an <em>individual</em> function and say that &#8220;it looks like a <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' />-wise independent function but no more&#8221;? At first that seems nonsensical. But perhaps if <img src='http://l.wordpress.com/latex.php?latex=2%5Ed%3Ek&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^d&gt;k' title='2^d&gt;k' class='latex' />, then it could say something like that the restrictions of the function to <img src='http://l.wordpress.com/latex.php?latex=d&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d' title='d' class='latex' />-dimensional subspaces are sufficiently related to each other for some correlation to be picked up by the <img src='http://l.wordpress.com/latex.php?latex=U%5Ed&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='U^d' title='U^d' class='latex' /> norm. In other words, our attention turns from correlation <em>between</em> functions to correlation between restrictions of <em>the same</em> function.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Hmm, that does make me feel somewhat happier with your idea. But it also suggests to me that there might be other ways of exploiting these large-scale correlations. You haven&#8217;t given any particular reason to go for subspaces rather than arbitrary sets of size <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' />.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  True, but you have to admit that subspaces are pretty natural objects.</p>
<p>Actually, wait a moment. Now that I&#8217;ve gone to the trouble of explaining my heuristic picture of what a random low-complexity function is like, I&#8217;m starting to wonder whether I might have stumbled on a two-line proof that P<img src='http://l.wordpress.com/latex.php?latex=%5Cne&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ne' title='\ne' class='latex' />NP.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> You haven&#8217;t, but go on, let&#8217;s hear it all the same.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Well, the idea is closely related to what I was saying just now. I was thinking of a random function of complexity <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> as being utterly random if you look at any <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> (or thereabouts) values, and much more structured if you look at more values than this. But I wanted to think about just a single function rather than a probability distribution based on such functions. However, I can convert a single function into a collection of functions in a very simple way: I just look at restrictions of it.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I hope there&#8217;s going to be more to your argument than looking at random restrictions, because those have been done to death.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  I&#8217;m slightly worried about that, I have to admit, because I don&#8217;t seem to be using much more. But just hear me out.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> OK.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Well, suppose you&#8217;ve got a function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> of circuit complexity <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' />. We can convert <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> into a function of <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> variables by fixing the values of <img src='http://l.wordpress.com/latex.php?latex=n-k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n-k' title='n-k' class='latex' /> variables and allowing the other <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> to vary. It is easy to check that if you do that, then the resulting function will have circuit complexity at most <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' />. Now there are at most <img src='http://l.wordpress.com/latex.php?latex=n%5E%7BCm%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^{Cm}' title='n^{Cm}' class='latex' /> functions of circuit complexity <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' />, whereas the number of Boolean functions of <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> variables is <img src='http://l.wordpress.com/latex.php?latex=2%5E%7B2%5Ek%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{2^k}' title='2^{2^k}' class='latex' />. Therefore, if <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> is big compared with <img src='http://l.wordpress.com/latex.php?latex=%5Clog+m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\log m' title='\log m' class='latex' />, the number of possible restrictions of <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> variables is tiny compared with the number of Boolean functions of <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> variables.</p>
<p>We have been searching for a difference between random low-complexity functions and purely random functions. Well, here it is: a random low-complexity function has just a limited number of possible restrictions, whereas a purely random function is completely arbitrary.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Hmm, I don&#8217;t yet see what&#8217;s wrong with your argument, but I do have a proof that it&#8217;s wrong.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  How can one prove that an argument is wrong without seeing what is wrong with it?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> That&#8217;s not mysterious at all. For example, one can show that it implies something false. But that&#8217;s not what I&#8217;m going to do here. Rather, I&#8217;d just like to point out that the property that <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  claims distinguishes between random low-complexity functions and purely random functions is a natural property.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  You mean, not only have I proved that P<img src='http://l.wordpress.com/latex.php?latex=%5Cne&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\ne' title='\ne' class='latex' />NP, but I&#8217;ve also shown how to factorize large numbers?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Er, I think not.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Cool it, I was joking.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> You had me there for a moment.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Can you explain why this property is natural? In fact, what property are we talking about?</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I assume that what <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  has in mind is to look at the restrictions of <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> variables and simply count how many of them are different. But the number of restrictions is <img src='http://l.wordpress.com/latex.php?latex=2%5E%7Bn-k%7D%5Cbinom+nk%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{n-k}\binom nk,' title='2^{n-k}\binom nk,' class='latex' /> which is polynomial in <img src='http://l.wordpress.com/latex.php?latex=2%5En%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^n,' title='2^n,' class='latex' /> and checking how many of them are different can clearly be done in polynomial time as well. </p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_neutral.gif' alt=':|' class='wp-smiley' />  Ah, thanks. But doesn&#8217;t that also show what&#8217;s wrong with the argument? After all, if there are only <img src='http://l.wordpress.com/latex.php?latex=2%5E%7Bn-k%7D%5Cbinom+nk&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{n-k}\binom nk' title='2^{n-k}\binom nk' class='latex' /> possible restrictions, then the number of distinct functions you can get is not <img src='http://l.wordpress.com/latex.php?latex=2%5E%7B2%5Ek%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{2^k}' title='2^{2^k}' class='latex' /> but <img src='http://l.wordpress.com/latex.php?latex=2%5E%7Bn-k%7D%5Cbinom+nk%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{n-k}\binom nk,' title='2^{n-k}\binom nk,' class='latex' /> which is far less. Indeed, it&#8217;s far less than <img src='http://l.wordpress.com/latex.php?latex=n%5Em%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^m,' title='n^m,' class='latex' /> our upper bound for the number of functions of circuit complexity <img src='http://l.wordpress.com/latex.php?latex=m.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m.' title='m.' class='latex' /></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Oops, you&#8217;re right. To see in an extreme way why the argument is wrong, one could take the case <img src='http://l.wordpress.com/latex.php?latex=k%3Dn.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k=n.' title='k=n.' class='latex' /> Then there is only one restriction of <img src='http://l.wordpress.com/latex.php?latex=f%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f,' title='f,' class='latex' /> and the fact that it is one of at most <img src='http://l.wordpress.com/latex.php?latex=n%5Em&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^m' title='n^m' class='latex' /> functions has nothing to do with anything. It just points out the familiar fact that the number of low-complexity functions is much smaller than the number of all Boolean functions. How ridiculous that this argument could have fooled me for even a millisecond. Though I did in fact share your feeling that it was too simple to be correct.</p>
<p>But wait a moment.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> Here we go again &#8230;</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Sorry, but there&#8217;s something else to check. What if, instead of looking at restrictions to subspaces generated by <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> of the basic variables, we were to look at <em>arbitrary</em> <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' />-dimensional subspaces? There are far more of these: indeed, if <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> is much smaller than <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' />, as it is in our case, then almost any <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> elements of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BF%7D_2%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{F}_2^n' title='\mathbb{F}_2^n' class='latex' /> generate a subspace, and those subspaces don&#8217;t coincide all that much. So, speaking crudely, there are something like <img src='http://l.wordpress.com/latex.php?latex=2%5E%7Bnk%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^{nk}' title='2^{nk}' class='latex' /> subspaces. And that &#8230; damn, that&#8217;s still smaller than <img src='http://l.wordpress.com/latex.php?latex=n%5Em&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^m' title='n^m' class='latex' /> if <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> is something like <img src='http://l.wordpress.com/latex.php?latex=n%5E2.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^2.' title='n^2.' class='latex' /> But perhaps we could still get a superlinear bound this way.</p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> I don&#8217;t think so, because now it&#8217;s no longer the case that the restriction of a function of complexity at most <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> has complexity at most <img src='http://l.wordpress.com/latex.php?latex=m.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m.' title='m.' class='latex' /></p>
<p> <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Oh yes, you&#8217;re right. But perhaps it must still be quite special. How many functions can you get by taking a function of <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> variables of complexity at most <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> and restricting it to a <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' />-dimensional subspace?</p>
<p>Hmm, one way of thinking about it is this. Each such function is obtained by taking a Boolean string of length <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' />, multiplying it by an <img src='http://l.wordpress.com/latex.php?latex=n%5Ctimes+k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n\times k' title='n\times k' class='latex' /> matrix over <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BF%7D_2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{F}_2' title='\mathbb{F}_2' class='latex' />, and applying a function of complexity at most <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;