<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: The exchange lemma and Gaussian elimination</title>
	<atom:link href="http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/feed/" rel="self" type="application/rss+xml" />
	<link>http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/</link>
	<description>Mathematics related discussions</description>
	<lastBuildDate>Wed, 15 May 2013 08:08:31 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: Mark Meckes</title>
		<link>http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-450</link>
		<dc:creator><![CDATA[Mark Meckes]]></dc:creator>
		<pubDate>Tue, 30 Oct 2007 21:03:58 +0000</pubDate>
		<guid isPermaLink="false">http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-450</guid>
		<description><![CDATA[Regarding names, I&#039;ve always found it curious that almost every mention of Burnside&#039;s lemma seems to be accompanied by the comment that it wasn&#039;t first proved by Burnside.  Given that the same is true of most results with a person&#039;s name attached, I don&#039;t understand why Burnside is singled out for such abuse.]]></description>
		<content:encoded><![CDATA[<p>Regarding names, I&#8217;ve always found it curious that almost every mention of Burnside&#8217;s lemma seems to be accompanied by the comment that it wasn&#8217;t first proved by Burnside.  Given that the same is true of most results with a person&#8217;s name attached, I don&#8217;t understand why Burnside is singled out for such abuse.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: gowers</title>
		<link>http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-446</link>
		<dc:creator><![CDATA[gowers]]></dc:creator>
		<pubDate>Tue, 30 Oct 2007 17:30:53 +0000</pubDate>
		<guid isPermaLink="false">http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-446</guid>
		<description><![CDATA[I&#039;m afraid I just tend to use the standard names for things and don&#039;t look into the history. But it&#039;s always interesting to hear from people who do know about it. To make up (partially at any rate) for my laziness in this respect, when I give lectures I tell people that a good rule of thumb is that if somebody&#039;s name is attached to a result (at least if the result is sufficiently simple) then they probably weren&#039;t the first person to prove it. I&#039;ve always wondered how Vandermonde&#039;s theorem came to be known as that: if only all theorems were as easy to obtain ...]]></description>
		<content:encoded><![CDATA[<p>I&#8217;m afraid I just tend to use the standard names for things and don&#8217;t look into the history. But it&#8217;s always interesting to hear from people who do know about it. To make up (partially at any rate) for my laziness in this respect, when I give lectures I tell people that a good rule of thumb is that if somebody&#8217;s name is attached to a result (at least if the result is sufficiently simple) then they probably weren&#8217;t the first person to prove it. I&#8217;ve always wondered how Vandermonde&#8217;s theorem came to be known as that: if only all theorems were as easy to obtain &#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Joerg Liesen</title>
		<link>http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-444</link>
		<dc:creator><![CDATA[Joerg Liesen]]></dc:creator>
		<pubDate>Tue, 30 Oct 2007 15:09:53 +0000</pubDate>
		<guid isPermaLink="false">http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-444</guid>
		<description><![CDATA[Dear Timothy,

I have a comment/question concerning the history of the subject. In your blog you speak of the &quot;Steinitz&quot; exchange theorem. Terence Tao uses the same name in his blog, so it seems that this is the common name for this result. Do you know how this name got attached to this result? Who used this name first?

The result of Ernst Steinitz (1871-1928) was published 1913 in Crelles Journal. There he writes that (rough translation follows) &quot;the basics of n-dimensional geometry could have been assumed as well known, but I prefer to rederive them here.&quot; Apparently, the result was already known to Steinitz. Indeed, the result -- in almost the exact form as in your blog -- already appears in Hermann Grassmann&#039;s Ausdehnungslehre of 1862, a book that was completed in the summer of 1861, i.e. 10 years before Steinitz was born. Steinitz does not cite Grassmann, which is fortunate for him, as now the result is known under his name.]]></description>
		<content:encoded><![CDATA[<p>Dear Timothy,</p>
<p>I have a comment/question concerning the history of the subject. In your blog you speak of the &#8220;Steinitz&#8221; exchange theorem. Terence Tao uses the same name in his blog, so it seems that this is the common name for this result. Do you know how this name got attached to this result? Who used this name first?</p>
<p>The result of Ernst Steinitz (1871-1928) was published 1913 in Crelles Journal. There he writes that (rough translation follows) &#8220;the basics of n-dimensional geometry could have been assumed as well known, but I prefer to rederive them here.&#8221; Apparently, the result was already known to Steinitz. Indeed, the result &#8212; in almost the exact form as in your blog &#8212; already appears in Hermann Grassmann&#8217;s Ausdehnungslehre of 1862, a book that was completed in the summer of 1861, i.e. 10 years before Steinitz was born. Steinitz does not cite Grassmann, which is fortunate for him, as now the result is known under his name.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: nugae</title>
		<link>http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-250</link>
		<dc:creator><![CDATA[nugae]]></dc:creator>
		<pubDate>Fri, 12 Oct 2007 11:22:12 +0000</pubDate>
		<guid isPermaLink="false">http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-250</guid>
		<description><![CDATA[This is what mathematics ought to be, not the sterile freeze-dried stuff you see in published papers. Mathematics is about intuition, surprise, and delight: why else do we do it?

I&#039;m having a go at conveying these things in writing about a relatively infantile problem in number theory. It&#039;s a challenge - I keep on slipping into mathematical-paper mode - but rewarding when it works.]]></description>
		<content:encoded><![CDATA[<p>This is what mathematics ought to be, not the sterile freeze-dried stuff you see in published papers. Mathematics is about intuition, surprise, and delight: why else do we do it?</p>
<p>I&#8217;m having a go at conveying these things in writing about a relatively infantile problem in number theory. It&#8217;s a challenge &#8211; I keep on slipping into mathematical-paper mode &#8211; but rewarding when it works.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kay</title>
		<link>http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-214</link>
		<dc:creator><![CDATA[Kay]]></dc:creator>
		<pubDate>Fri, 05 Oct 2007 19:08:27 +0000</pubDate>
		<guid isPermaLink="false">http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-214</guid>
		<description><![CDATA[1. This is all fascinating and quite above me (at least for the moment!)

2. Terry, my linear algebra is growing by leaps and bounds, and I am seeing why your comment &quot;As regards the identification between matrices and linear transformations ... it’s important to understand both perspectives, and how to swap back and forth.&quot; makes much more sense than my first naive impression.]]></description>
		<content:encoded><![CDATA[<p>1. This is all fascinating and quite above me (at least for the moment!)</p>
<p>2. Terry, my linear algebra is growing by leaps and bounds, and I am seeing why your comment &#8220;As regards the identification between matrices and linear transformations &#8230; it’s important to understand both perspectives, and how to swap back and forth.&#8221; makes much more sense than my first naive impression.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: derek hacon</title>
		<link>http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-206</link>
		<dc:creator><![CDATA[derek hacon]]></dc:creator>
		<pubDate>Thu, 04 Oct 2007 21:33:23 +0000</pubDate>
		<guid isPermaLink="false">http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-206</guid>
		<description><![CDATA[Easier than using Steinitz seems to be  to do things matricially:  use row and colummn operations (rather than just row operations) to show easily that any matrix is PAQ equivalent (P and Q being square invertible matrices) to canonical form (having least number of nonzero entries). To prove facts like  linear dependence of the columns of any matrix which has  less rows than columns, first check the statement is invariant by PAQ equivalence (P and Q being square invertible matrices) and then that it´s obviously true for canonical form.  From this the corresponding vectors property (i.e. the linear dependence of  n+1 vectors, each of which is a linear combination of n vectors) is easily deduced.
The interest of Steinitz seems to be that it pops up in  other contexts...for example, in the proof that Kruskal´s algorithm works (i.e. provides a minimal spanning tree for a weighted graph).]]></description>
		<content:encoded><![CDATA[<p>Easier than using Steinitz seems to be  to do things matricially:  use row and colummn operations (rather than just row operations) to show easily that any matrix is PAQ equivalent (P and Q being square invertible matrices) to canonical form (having least number of nonzero entries). To prove facts like  linear dependence of the columns of any matrix which has  less rows than columns, first check the statement is invariant by PAQ equivalence (P and Q being square invertible matrices) and then that it´s obviously true for canonical form.  From this the corresponding vectors property (i.e. the linear dependence of  n+1 vectors, each of which is a linear combination of n vectors) is easily deduced.<br />
The interest of Steinitz seems to be that it pops up in  other contexts&#8230;for example, in the proof that Kruskal´s algorithm works (i.e. provides a minimal spanning tree for a weighted graph).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Top Posts &#171; WordPress.com</title>
		<link>http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-195</link>
		<dc:creator><![CDATA[Top Posts &#171; WordPress.com]]></dc:creator>
		<pubDate>Thu, 04 Oct 2007 13:52:30 +0000</pubDate>
		<guid isPermaLink="false">http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-195</guid>
		<description><![CDATA[[...]  The exchange lemma and Gaussian elimination Thanks to this comment, I have finally decided to try to understand in what sense Gaussian elimination and the Steinitz [&#8230;] [...]]]></description>
		<content:encoded><![CDATA[<p>[...]  The exchange lemma and Gaussian elimination Thanks to this comment, I have finally decided to try to understand in what sense Gaussian elimination and the Steinitz [&#8230;] [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jason Dyer</title>
		<link>http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-194</link>
		<dc:creator><![CDATA[Jason Dyer]]></dc:creator>
		<pubDate>Thu, 04 Oct 2007 13:41:22 +0000</pubDate>
		<guid isPermaLink="false">http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-194</guid>
		<description><![CDATA[I&#039;m still trying to parse everything here, but since it was my comment I wanted to thank you for posting this!]]></description>
		<content:encoded><![CDATA[<p>I&#8217;m still trying to parse everything here, but since it was my comment I wanted to thank you for posting this!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: When are two proofs essentially the same? &#171; Gowers&#8217;s Weblog</title>
		<link>http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-193</link>
		<dc:creator><![CDATA[When are two proofs essentially the same? &#171; Gowers&#8217;s Weblog]]></dc:creator>
		<pubDate>Thu, 04 Oct 2007 09:12:04 +0000</pubDate>
		<guid isPermaLink="false">http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-193</guid>
		<description><![CDATA[[...] is, methods for transforming a proof into another that is not interestingly different. See this comment for some interesting links, though here I am not so much looking for a formal theory right down at [...]]]></description>
		<content:encoded><![CDATA[<p>[...] is, methods for transforming a proof into another that is not interestingly different. See this comment for some interesting links, though here I am not so much looking for a formal theory right down at [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kevin</title>
		<link>http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-192</link>
		<dc:creator><![CDATA[Kevin]]></dc:creator>
		<pubDate>Thu, 04 Oct 2007 03:44:16 +0000</pubDate>
		<guid isPermaLink="false">http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-192</guid>
		<description><![CDATA[Here are some blog posts related to &quot;homotopies between proofs&quot;. Maybe you&#039;ll find them interesting. The idea of somehow using ideas from topology to study proofs doesn&#039;t seem all that crazy ...

&lt;a href=&quot;http://math.ucr.edu/home/baez/week227.html&quot; rel=&quot;nofollow&quot;&gt;Week 227 of This Week&#039;s Finds&lt;/a&gt;
&lt;a href=&quot;http://sigfpe.blogspot.com/2006/03/homotopies-between-proofs-and-between.html&quot; rel=&quot;nofollow&quot;&gt;Neighborhood of Infinity&lt;/a&gt;
&lt;a href=&quot;http://theojf.blogspot.com/2006_01_01_theojf_archive.html&quot; rel=&quot;nofollow&quot;&gt;Orange Juice Files&lt;/a&gt;]]></description>
		<content:encoded><![CDATA[<p>Here are some blog posts related to &#8220;homotopies between proofs&#8221;. Maybe you&#8217;ll find them interesting. The idea of somehow using ideas from topology to study proofs doesn&#8217;t seem all that crazy &#8230;</p>
<p><a href="http://math.ucr.edu/home/baez/week227.html" rel="nofollow">Week 227 of This Week&#8217;s Finds</a><br />
<a href="http://sigfpe.blogspot.com/2006/03/homotopies-between-proofs-and-between.html" rel="nofollow">Neighborhood of Infinity</a><br />
<a href="http://theojf.blogspot.com/2006_01_01_theojf_archive.html" rel="nofollow">Orange Juice Files</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: gowers</title>
		<link>http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-190</link>
		<dc:creator><![CDATA[gowers]]></dc:creator>
		<pubDate>Wed, 03 Oct 2007 21:09:25 +0000</pubDate>
		<guid isPermaLink="false">http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-190</guid>
		<description><![CDATA[Dear Terry,

That&#039;s very interesting. I had vaguely wondered about similar ideas, but I was thinking more about inverses than adjoints. I still haven&#039;t got my head completely round what you&#039;ve written, but will at some stage try to convert it into a very explicit demonstration of how the exchange lemma can yield an algorithm that is more or less the same as Gaussian elimination (but probably applied to columns rather than rows). 

The last thing you mentioned reminds me of a fascinating fact that I learned as a result of editing an article on model theory for the Princeton Companion. It&#039;s the most gorgeous proof (which, from what you write, you probably know of) of the following fact: if $latex P$ is an injective polynomial map from $latex \mathbb{C}^n$ to $latex \mathbb{C}^n$ then it must be surjective. The rough idea of the proof is the same as what you say above. By model-theoretic considerations one can show that the statement is true in $latex \mathbb{C}$ if and only if it is true in a finite field $latex \mathbb{F}$ of sufficiently high characteristic. But there the statement is trivial. It&#039;s a great example of a non-obvious statement that doesn&#039;t seem to have anything to do with logic, but which is proved by logical means. (The proof was discovered by Ax, apparently. For more details, see the PCM when it comes out.)

Another thing that&#039;s interesting about your remark is that it is connected with a question that I like and that I intend to pursue more on this blog, namely what it means for two proofs to be &quot;essentially the same.&quot; It&#039;s an intriguing phenomenon, because it so often happens that two rather different-looking proofs eventually turn out to be less different than they at first appear, as you make very clear in your article, &quot;What is good mathematics?&quot; Given that, it&#039;s all the more interesting to find two proofs that really do seem to be genuinely different. And the model-theoretic argument that $latex n+1$ vectors in $latex \mathbb{R}^n$ cannot be independent appears to fall into that category (which is not to say that it wouldn&#039;t be worth trying to &quot;expand it out,&quot; just to check).]]></description>
		<content:encoded><![CDATA[<p>Dear Terry,</p>
<p>That&#8217;s very interesting. I had vaguely wondered about similar ideas, but I was thinking more about inverses than adjoints. I still haven&#8217;t got my head completely round what you&#8217;ve written, but will at some stage try to convert it into a very explicit demonstration of how the exchange lemma can yield an algorithm that is more or less the same as Gaussian elimination (but probably applied to columns rather than rows). </p>
<p>The last thing you mentioned reminds me of a fascinating fact that I learned as a result of editing an article on model theory for the Princeton Companion. It&#8217;s the most gorgeous proof (which, from what you write, you probably know of) of the following fact: if <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='P' title='P' class='latex' /> is an injective polynomial map from <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BC%7D%5En&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{C}^n' title='&#92;mathbb{C}^n' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BC%7D%5En&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{C}^n' title='&#92;mathbb{C}^n' class='latex' /> then it must be surjective. The rough idea of the proof is the same as what you say above. By model-theoretic considerations one can show that the statement is true in <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BC%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{C}' title='&#92;mathbb{C}' class='latex' /> if and only if it is true in a finite field <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BF%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{F}' title='&#92;mathbb{F}' class='latex' /> of sufficiently high characteristic. But there the statement is trivial. It&#8217;s a great example of a non-obvious statement that doesn&#8217;t seem to have anything to do with logic, but which is proved by logical means. (The proof was discovered by Ax, apparently. For more details, see the PCM when it comes out.)</p>
<p>Another thing that&#8217;s interesting about your remark is that it is connected with a question that I like and that I intend to pursue more on this blog, namely what it means for two proofs to be &#8220;essentially the same.&#8221; It&#8217;s an intriguing phenomenon, because it so often happens that two rather different-looking proofs eventually turn out to be less different than they at first appear, as you make very clear in your article, &#8220;What is good mathematics?&#8221; Given that, it&#8217;s all the more interesting to find two proofs that really do seem to be genuinely different. And the model-theoretic argument that <img src='http://s0.wp.com/latex.php?latex=n%2B1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n+1' title='n+1' class='latex' /> vectors in <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BR%7D%5En&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mathbb{R}^n' title='&#92;mathbb{R}^n' class='latex' /> cannot be independent appears to fall into that category (which is not to say that it wouldn&#8217;t be worth trying to &#8220;expand it out,&#8221; just to check).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Terence Tao</title>
		<link>http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-189</link>
		<dc:creator><![CDATA[Terence Tao]]></dc:creator>
		<pubDate>Wed, 03 Oct 2007 17:40:44 +0000</pubDate>
		<guid isPermaLink="false">http://gowers.wordpress.com/2007/10/03/the-exchange-lemma-and-gaussian-elimination/#comment-189</guid>
		<description><![CDATA[Dear Tim,

It seems to me that the key fact that underlies exchange lemma is in fact the &lt;I&gt;adjoint&lt;/I&gt; of the key fact that underlies Gaussian elimination; the difficulty you are having in equating the two is thus the same difficulty one encounters when trying to identify a vector space with its dual.

To explain, let me use $latex {\Bbb R}^A = \{ (x_i)_{i \in A}: x_i \in {\Bbb R} \}$ to denote the real vector space indexed by a finite set A.  The exchange lemma algorithm is based on iterating the following:

&lt;B&gt;Lemma 1.&lt;/B&gt;  If $latex T: {\Bbb R} \to {\Bbb R}^A$ is non-zero, then there exists $latex i \in A$ such that $latex {\Bbb R}^A$ is the direct sum of $latex T({\Bbb R})$ (which is isomorphic to $latex {\Bbb R}^{\{i\}}$) and $latex {\Bbb R}^{A \backslash \{i\}}$ (i.e. these spaces are complementary).

(Proof: expand $latex T(1) \in {\Bbb R}^A$ in coordinates.  There must exist i such that the $latex i^{th}$ coordinate of T(1) is non-zero.  The claim then easily follows for this choice of i.)

This lemma is what lets you exchange $latex w_1$ for one of the $latex v_1,\ldots,v_n$.  (To exchange $latex w_i$ for one of the $latex v_i,\ldots,v_n$, one can first quotient out by the span of $latex w_1,\ldots,w_{i-1}$ and then argue as before.)

In contrast, the ability to use Gaussian elimination to reduce any matrix to row-echelon form is based on iterating the following:

&lt;B&gt;Lemma 2.&lt;/B&gt;  If $latex T: {\Bbb R}^A \to {\Bbb R}$ is non-zero, then there exists $latex i \in A$ such that $latex {\Bbb R}^A$ is the direct sum of $latex \hbox{ker}(T)$ (which is isomorphic to $latex {\Bbb R}^{A \backslash \{i\}}$) and $latex {\Bbb R}^{\{i\}}$.

(Proof: there must exist a basis element $latex e_i$ such that $latex T(e_i)$ is non-zero.   One can then express all the other $latex e_j$ as a linear combination of $latex e_i$ and something in the kernel of T.)

To see how row-echelon form follows from this, consider n vectors $latex v_1,\ldots,v_n$ in a standard vector space $latex {\Bbb R}^d$, and let $latex T: {\Bbb R}^n \to {\Bbb R}$ be defined by setting $latex T(x_1,\ldots,x_n)$ to be the first coordinate of $latex x_1 v_1 + \ldots + x_n v_n$.  If T is non-zero, then by Lemma 2 we can use Gaussian elimination to extract one row $latex v_i$ with a non-vanishing first coordinate, and n-1 other vectors with vanishing first coordinate; if instead T is zero, then all rows already have vanishing first coordinate.  Throwing away $latex v_i$ and the first column and iterating, one eventually gets row echelon form.

It is not hard to see that Lemma 1 and Lemma 2 are essentially adjoints of each other.

This strongly suggests that the exchange lemma should in fact be related to the use of &lt;I&gt;column&lt;/I&gt; operations to place a matrix in &lt;I&gt;column-echelon&lt;/I&gt; form.  If we let $latex u_1,\ldots,u_d \in {\Bbb R}^n$ denote the rows of an $latex d \times n$ matrix, column operations correspond to &quot;passive&quot; transformations which change the basis for the vector space $latex {\Bbb R}^d$ without affecting the vectors $latex u_1,\ldots,u_d$ themselves (in contrast, the &quot;active&quot; row transformations modify those vectors but leave the coordinate basis unchanged).  To formulate the exchange lemma in this language, I think what one needs to do is express the r independent vectors $latex w_1,\ldots,w_r$, together with the n spanning vectors $latex v_1,\ldots,v_n$, as linear combinations of the n spanning vectors $latex v_1,\ldots,v_n$, leading to a $latex r+n \times n$ matrix which has an $latex n \times n$ identity matrix at the bottom.  If one applies adjoint Gaussian elimination to this matrix to place it in column-echelon form, I believe that one obtains all of the $latex w_1,\ldots,w_r$ rows and n-r of the $latex v_1,\ldots,v_n$ rows as pivot rows (because of the linear independence of the $latex w_1,\ldots,w_r$), and that these pivot rows span the whole space, thus yielding the exchange lemma.


Incidentally, over a finite field F, it is trivial to show by a counting argument that a vector space V spanned by n vectors cannot contain n+1 independent vectors, since the former hypothesis easily implies $latex &#124;V&#124; \leq &#124;F&#124;^n$ and the latter hypothesis would imply $latex &#124;V&#124; \geq &#124;F&#124;^{n+1}$.  One can adapt this counting argument to vector spaces over the reals by various discretisation tricks (e.g. using metric entropy, or Minkowski dimension, or taking a numerical perspective and rounding off to some finite degree of accuracy).  Alternatively, one can argue more algebraically (or model-theoretically) by invoking a &quot;Lefschetz principle&quot; or &quot;compactness principle&quot; to equate linear algebra assertions over the reals with linear algebra assertions over fields of sufficiently large characteristic.   

It seems difficult (though perhaps not entirely impossible) to recast these arguments in algorithmic form.]]></description>
		<content:encoded><![CDATA[<p>Dear Tim,</p>
<p>It seems to me that the key fact that underlies exchange lemma is in fact the <i>adjoint</i> of the key fact that underlies Gaussian elimination; the difficulty you are having in equating the two is thus the same difficulty one encounters when trying to identify a vector space with its dual.</p>
<p>To explain, let me use <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5EA+%3D+%5C%7B+%28x_i%29_%7Bi+%5Cin+A%7D%3A+x_i+%5Cin+%7B%5CBbb+R%7D+%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='{&#92;Bbb R}^A = &#92;{ (x_i)_{i &#92;in A}: x_i &#92;in {&#92;Bbb R} &#92;}' title='{&#92;Bbb R}^A = &#92;{ (x_i)_{i &#92;in A}: x_i &#92;in {&#92;Bbb R} &#92;}' class='latex' /> to denote the real vector space indexed by a finite set A.  The exchange lemma algorithm is based on iterating the following:</p>
<p><b>Lemma 1.</b>  If <img src='http://s0.wp.com/latex.php?latex=T%3A+%7B%5CBbb+R%7D+%5Cto+%7B%5CBbb+R%7D%5EA&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='T: {&#92;Bbb R} &#92;to {&#92;Bbb R}^A' title='T: {&#92;Bbb R} &#92;to {&#92;Bbb R}^A' class='latex' /> is non-zero, then there exists <img src='http://s0.wp.com/latex.php?latex=i+%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='i &#92;in A' title='i &#92;in A' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5EA&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='{&#92;Bbb R}^A' title='{&#92;Bbb R}^A' class='latex' /> is the direct sum of <img src='http://s0.wp.com/latex.php?latex=T%28%7B%5CBbb+R%7D%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='T({&#92;Bbb R})' title='T({&#92;Bbb R})' class='latex' /> (which is isomorphic to <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5E%7B%5C%7Bi%5C%7D%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='{&#92;Bbb R}^{&#92;{i&#92;}}' title='{&#92;Bbb R}^{&#92;{i&#92;}}' class='latex' />) and <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5E%7BA+%5Cbackslash+%5C%7Bi%5C%7D%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='{&#92;Bbb R}^{A &#92;backslash &#92;{i&#92;}}' title='{&#92;Bbb R}^{A &#92;backslash &#92;{i&#92;}}' class='latex' /> (i.e. these spaces are complementary).</p>
<p>(Proof: expand <img src='http://s0.wp.com/latex.php?latex=T%281%29+%5Cin+%7B%5CBbb+R%7D%5EA&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='T(1) &#92;in {&#92;Bbb R}^A' title='T(1) &#92;in {&#92;Bbb R}^A' class='latex' /> in coordinates.  There must exist i such that the <img src='http://s0.wp.com/latex.php?latex=i%5E%7Bth%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='i^{th}' title='i^{th}' class='latex' /> coordinate of T(1) is non-zero.  The claim then easily follows for this choice of i.)</p>
<p>This lemma is what lets you exchange <img src='http://s0.wp.com/latex.php?latex=w_1&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='w_1' title='w_1' class='latex' /> for one of the <img src='http://s0.wp.com/latex.php?latex=v_1%2C%5Cldots%2Cv_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v_1,&#92;ldots,v_n' title='v_1,&#92;ldots,v_n' class='latex' />.  (To exchange <img src='http://s0.wp.com/latex.php?latex=w_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='w_i' title='w_i' class='latex' /> for one of the <img src='http://s0.wp.com/latex.php?latex=v_i%2C%5Cldots%2Cv_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v_i,&#92;ldots,v_n' title='v_i,&#92;ldots,v_n' class='latex' />, one can first quotient out by the span of <img src='http://s0.wp.com/latex.php?latex=w_1%2C%5Cldots%2Cw_%7Bi-1%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='w_1,&#92;ldots,w_{i-1}' title='w_1,&#92;ldots,w_{i-1}' class='latex' /> and then argue as before.)</p>
<p>In contrast, the ability to use Gaussian elimination to reduce any matrix to row-echelon form is based on iterating the following:</p>
<p><b>Lemma 2.</b>  If <img src='http://s0.wp.com/latex.php?latex=T%3A+%7B%5CBbb+R%7D%5EA+%5Cto+%7B%5CBbb+R%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='T: {&#92;Bbb R}^A &#92;to {&#92;Bbb R}' title='T: {&#92;Bbb R}^A &#92;to {&#92;Bbb R}' class='latex' /> is non-zero, then there exists <img src='http://s0.wp.com/latex.php?latex=i+%5Cin+A&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='i &#92;in A' title='i &#92;in A' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5EA&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='{&#92;Bbb R}^A' title='{&#92;Bbb R}^A' class='latex' /> is the direct sum of <img src='http://s0.wp.com/latex.php?latex=%5Chbox%7Bker%7D%28T%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;hbox{ker}(T)' title='&#92;hbox{ker}(T)' class='latex' /> (which is isomorphic to <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5E%7BA+%5Cbackslash+%5C%7Bi%5C%7D%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='{&#92;Bbb R}^{A &#92;backslash &#92;{i&#92;}}' title='{&#92;Bbb R}^{A &#92;backslash &#92;{i&#92;}}' class='latex' />) and <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5E%7B%5C%7Bi%5C%7D%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='{&#92;Bbb R}^{&#92;{i&#92;}}' title='{&#92;Bbb R}^{&#92;{i&#92;}}' class='latex' />.</p>
<p>(Proof: there must exist a basis element <img src='http://s0.wp.com/latex.php?latex=e_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e_i' title='e_i' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=T%28e_i%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='T(e_i)' title='T(e_i)' class='latex' /> is non-zero.   One can then express all the other <img src='http://s0.wp.com/latex.php?latex=e_j&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e_j' title='e_j' class='latex' /> as a linear combination of <img src='http://s0.wp.com/latex.php?latex=e_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='e_i' title='e_i' class='latex' /> and something in the kernel of T.)</p>
<p>To see how row-echelon form follows from this, consider n vectors <img src='http://s0.wp.com/latex.php?latex=v_1%2C%5Cldots%2Cv_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v_1,&#92;ldots,v_n' title='v_1,&#92;ldots,v_n' class='latex' /> in a standard vector space <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5Ed&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='{&#92;Bbb R}^d' title='{&#92;Bbb R}^d' class='latex' />, and let <img src='http://s0.wp.com/latex.php?latex=T%3A+%7B%5CBbb+R%7D%5En+%5Cto+%7B%5CBbb+R%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='T: {&#92;Bbb R}^n &#92;to {&#92;Bbb R}' title='T: {&#92;Bbb R}^n &#92;to {&#92;Bbb R}' class='latex' /> be defined by setting <img src='http://s0.wp.com/latex.php?latex=T%28x_1%2C%5Cldots%2Cx_n%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='T(x_1,&#92;ldots,x_n)' title='T(x_1,&#92;ldots,x_n)' class='latex' /> to be the first coordinate of <img src='http://s0.wp.com/latex.php?latex=x_1+v_1+%2B+%5Cldots+%2B+x_n+v_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x_1 v_1 + &#92;ldots + x_n v_n' title='x_1 v_1 + &#92;ldots + x_n v_n' class='latex' />.  If T is non-zero, then by Lemma 2 we can use Gaussian elimination to extract one row <img src='http://s0.wp.com/latex.php?latex=v_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v_i' title='v_i' class='latex' /> with a non-vanishing first coordinate, and n-1 other vectors with vanishing first coordinate; if instead T is zero, then all rows already have vanishing first coordinate.  Throwing away <img src='http://s0.wp.com/latex.php?latex=v_i&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v_i' title='v_i' class='latex' /> and the first column and iterating, one eventually gets row echelon form.</p>
<p>It is not hard to see that Lemma 1 and Lemma 2 are essentially adjoints of each other.</p>
<p>This strongly suggests that the exchange lemma should in fact be related to the use of <i>column</i> operations to place a matrix in <i>column-echelon</i> form.  If we let <img src='http://s0.wp.com/latex.php?latex=u_1%2C%5Cldots%2Cu_d+%5Cin+%7B%5CBbb+R%7D%5En&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u_1,&#92;ldots,u_d &#92;in {&#92;Bbb R}^n' title='u_1,&#92;ldots,u_d &#92;in {&#92;Bbb R}^n' class='latex' /> denote the rows of an <img src='http://s0.wp.com/latex.php?latex=d+%5Ctimes+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d &#92;times n' title='d &#92;times n' class='latex' /> matrix, column operations correspond to &#8220;passive&#8221; transformations which change the basis for the vector space <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5Ed&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='{&#92;Bbb R}^d' title='{&#92;Bbb R}^d' class='latex' /> without affecting the vectors <img src='http://s0.wp.com/latex.php?latex=u_1%2C%5Cldots%2Cu_d&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='u_1,&#92;ldots,u_d' title='u_1,&#92;ldots,u_d' class='latex' /> themselves (in contrast, the &#8220;active&#8221; row transformations modify those vectors but leave the coordinate basis unchanged).  To formulate the exchange lemma in this language, I think what one needs to do is express the r independent vectors <img src='http://s0.wp.com/latex.php?latex=w_1%2C%5Cldots%2Cw_r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='w_1,&#92;ldots,w_r' title='w_1,&#92;ldots,w_r' class='latex' />, together with the n spanning vectors <img src='http://s0.wp.com/latex.php?latex=v_1%2C%5Cldots%2Cv_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v_1,&#92;ldots,v_n' title='v_1,&#92;ldots,v_n' class='latex' />, as linear combinations of the n spanning vectors <img src='http://s0.wp.com/latex.php?latex=v_1%2C%5Cldots%2Cv_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v_1,&#92;ldots,v_n' title='v_1,&#92;ldots,v_n' class='latex' />, leading to a <img src='http://s0.wp.com/latex.php?latex=r%2Bn+%5Ctimes+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='r+n &#92;times n' title='r+n &#92;times n' class='latex' /> matrix which has an <img src='http://s0.wp.com/latex.php?latex=n+%5Ctimes+n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='n &#92;times n' title='n &#92;times n' class='latex' /> identity matrix at the bottom.  If one applies adjoint Gaussian elimination to this matrix to place it in column-echelon form, I believe that one obtains all of the <img src='http://s0.wp.com/latex.php?latex=w_1%2C%5Cldots%2Cw_r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='w_1,&#92;ldots,w_r' title='w_1,&#92;ldots,w_r' class='latex' /> rows and n-r of the <img src='http://s0.wp.com/latex.php?latex=v_1%2C%5Cldots%2Cv_n&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='v_1,&#92;ldots,v_n' title='v_1,&#92;ldots,v_n' class='latex' /> rows as pivot rows (because of the linear independence of the <img src='http://s0.wp.com/latex.php?latex=w_1%2C%5Cldots%2Cw_r&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='w_1,&#92;ldots,w_r' title='w_1,&#92;ldots,w_r' class='latex' />), and that these pivot rows span the whole space, thus yielding the exchange lemma.</p>
<p>Incidentally, over a finite field F, it is trivial to show by a counting argument that a vector space V spanned by n vectors cannot contain n+1 independent vectors, since the former hypothesis easily implies <img src='http://s0.wp.com/latex.php?latex=%7CV%7C+%5Cleq+%7CF%7C%5En&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|V| &#92;leq |F|^n' title='|V| &#92;leq |F|^n' class='latex' /> and the latter hypothesis would imply <img src='http://s0.wp.com/latex.php?latex=%7CV%7C+%5Cgeq+%7CF%7C%5E%7Bn%2B1%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='|V| &#92;geq |F|^{n+1}' title='|V| &#92;geq |F|^{n+1}' class='latex' />.  One can adapt this counting argument to vector spaces over the reals by various discretisation tricks (e.g. using metric entropy, or Minkowski dimension, or taking a numerical perspective and rounding off to some finite degree of accuracy).  Alternatively, one can argue more algebraically (or model-theoretically) by invoking a &#8220;Lefschetz principle&#8221; or &#8220;compactness principle&#8221; to equate linear algebra assertions over the reals with linear algebra assertions over fields of sufficiently large characteristic.   </p>
<p>It seems difficult (though perhaps not entirely impossible) to recast these arguments in algorithmic form.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
