<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Bootstrapping and Subsampling: Part I</title>
	<atom:link href="http://normaldeviate.wordpress.com/2013/01/19/bootstrapping-and-subsampling-part-i/feed/" rel="self" type="application/rss+xml" />
	<link>http://normaldeviate.wordpress.com/2013/01/19/bootstrapping-and-subsampling-part-i/</link>
	<description>Thoughts on Statistics and Machine Learning</description>
	<lastBuildDate>Tue, 21 May 2013 17:33:31 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: Comunidades tribais são mais violentas? O quão próxima é a distribuição normal? O papel do BNDES. &#124; Análise Real</title>
		<link>http://normaldeviate.wordpress.com/2013/01/19/bootstrapping-and-subsampling-part-i/comment-page-1/#comment-7059</link>
		<dc:creator><![CDATA[Comunidades tribais são mais violentas? O quão próxima é a distribuição normal? O papel do BNDES. &#124; Análise Real]]></dc:creator>
		<pubDate>Tue, 05 Feb 2013 01:14:40 +0000</pubDate>
		<guid isPermaLink="false">http://normaldeviate.wordpress.com/?p=351#comment-7059</guid>
		<description><![CDATA[[...] Sobre bootstraping I e [...]]]></description>
		<content:encoded><![CDATA[<p>[...] Sobre bootstraping I e [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: normaldeviate</title>
		<link>http://normaldeviate.wordpress.com/2013/01/19/bootstrapping-and-subsampling-part-i/comment-page-1/#comment-6841</link>
		<dc:creator><![CDATA[normaldeviate]]></dc:creator>
		<pubDate>Tue, 29 Jan 2013 16:02:00 +0000</pubDate>
		<guid isPermaLink="false">http://normaldeviate.wordpress.com/?p=351#comment-6841</guid>
		<description><![CDATA[the original]]></description>
		<content:encoded><![CDATA[<p>the original</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Bootstrapping and Subsampling: Part II &#171; Normal Deviate</title>
		<link>http://normaldeviate.wordpress.com/2013/01/19/bootstrapping-and-subsampling-part-i/comment-page-1/#comment-6766</link>
		<dc:creator><![CDATA[Bootstrapping and Subsampling: Part II &#171; Normal Deviate]]></dc:creator>
		<pubDate>Sun, 27 Jan 2013 14:02:09 +0000</pubDate>
		<guid isPermaLink="false">http://normaldeviate.wordpress.com/?p=351#comment-6766</guid>
		<description><![CDATA[[...] &#171; Bootstrapping and Subsampling: Part&#160;I [...]]]></description>
		<content:encoded><![CDATA[<p>[...] &laquo; Bootstrapping and Subsampling: Part&nbsp;I [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Martin Azizyan</title>
		<link>http://normaldeviate.wordpress.com/2013/01/19/bootstrapping-and-subsampling-part-i/comment-page-1/#comment-6532</link>
		<dc:creator><![CDATA[Martin Azizyan]]></dc:creator>
		<pubDate>Wed, 23 Jan 2013 10:46:12 +0000</pubDate>
		<guid isPermaLink="false">http://normaldeviate.wordpress.com/?p=351#comment-6532</guid>
		<description><![CDATA[rj444, I believe the &quot;original data&quot; in the statement &quot;drawing a sample of size n from Pn is the same as drawing n points with replacement from the original data&quot; refers to the finite sample we were given, not the true distribution. This is not a statistical claim. It follows from the definitions of the empirical distribution and of sampling with replacement.

I think that&#039;s what you were asking, but I&#039;m sorry if I misunderstood the subject of your confusion.]]></description>
		<content:encoded><![CDATA[<p>rj444, I believe the &#8220;original data&#8221; in the statement &#8220;drawing a sample of size n from Pn is the same as drawing n points with replacement from the original data&#8221; refers to the finite sample we were given, not the true distribution. This is not a statistical claim. It follows from the definitions of the empirical distribution and of sampling with replacement.</p>
<p>I think that&#8217;s what you were asking, but I&#8217;m sorry if I misunderstood the subject of your confusion.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Christian Hennig</title>
		<link>http://normaldeviate.wordpress.com/2013/01/19/bootstrapping-and-subsampling-part-i/comment-page-1/#comment-6378</link>
		<dc:creator><![CDATA[Christian Hennig]]></dc:creator>
		<pubDate>Mon, 21 Jan 2013 23:11:57 +0000</pubDate>
		<guid isPermaLink="false">http://normaldeviate.wordpress.com/?p=351#comment-6378</guid>
		<description><![CDATA[n is not always large, though.]]></description>
		<content:encoded><![CDATA[<p>n is not always large, though.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: normaldeviate</title>
		<link>http://normaldeviate.wordpress.com/2013/01/19/bootstrapping-and-subsampling-part-i/comment-page-1/#comment-6353</link>
		<dc:creator><![CDATA[normaldeviate]]></dc:creator>
		<pubDate>Mon, 21 Jan 2013 19:32:58 +0000</pubDate>
		<guid isPermaLink="false">http://normaldeviate.wordpress.com/?p=351#comment-6353</guid>
		<description><![CDATA[P_n is a very accurate estimate of P
This follows from standard empirical process theory
For example, in  one-dimension, P( &#124;&#124;P_n - P&#124;&#124;_infty &gt; epsilon) &lt; 2e^{-2n epsilon^2}

The issue is the behavior of T(P).]]></description>
		<content:encoded><![CDATA[<p>P_n is a very accurate estimate of P<br />
This follows from standard empirical process theory<br />
For example, in  one-dimension, P( ||P_n &#8211; P||_infty &gt; epsilon) &lt; 2e^{-2n epsilon^2}</p>
<p>The issue is the behavior of T(P).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: rj444</title>
		<link>http://normaldeviate.wordpress.com/2013/01/19/bootstrapping-and-subsampling-part-i/comment-page-1/#comment-6350</link>
		<dc:creator><![CDATA[rj444]]></dc:creator>
		<pubDate>Mon, 21 Jan 2013 19:28:25 +0000</pubDate>
		<guid isPermaLink="false">http://normaldeviate.wordpress.com/?p=351#comment-6350</guid>
		<description><![CDATA[Yes, I was confused by the comment - &quot;A moment’s reflection should convince you that drawing a sample of size n from Pn is the same as drawing  n points with replacement from the original data&quot;. The adequacy of the Pn approximation is exactly what I have trouble convincing myself of even after a moment&#039;s reflection, especially when n is relatively small and T(P) is relatively complex.

I find it interesting that Larry says &quot;an example of a bootstrap failure is in the problem of estimating phylogenetic trees&quot;. In practice this is one of the areas where boostrap is used most often.]]></description>
		<content:encoded><![CDATA[<p>Yes, I was confused by the comment &#8211; &#8220;A moment’s reflection should convince you that drawing a sample of size n from Pn is the same as drawing  n points with replacement from the original data&#8221;. The adequacy of the Pn approximation is exactly what I have trouble convincing myself of even after a moment&#8217;s reflection, especially when n is relatively small and T(P) is relatively complex.</p>
<p>I find it interesting that Larry says &#8220;an example of a bootstrap failure is in the problem of estimating phylogenetic trees&#8221;. In practice this is one of the areas where boostrap is used most often.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Keith O'Rourke</title>
		<link>http://normaldeviate.wordpress.com/2013/01/19/bootstrapping-and-subsampling-part-i/comment-page-1/#comment-6333</link>
		<dc:creator><![CDATA[Keith O'Rourke]]></dc:creator>
		<pubDate>Mon, 21 Jan 2013 18:12:10 +0000</pubDate>
		<guid isPermaLink="false">http://normaldeviate.wordpress.com/?p=351#comment-6333</guid>
		<description><![CDATA[I prefer to think of the bootstrap as sampling paths from the product space Pn(A)^n WITH replacement which is less efficient than without replacement, but that very quickly decreases with increasing &quot;n&quot;. 

Makes it teachable as simple survey sampling, which worked well with undergrad class once. But the BCA correction stuff and why it fails for complex problems seemed way too hard to get accross. 

Remember Efron wrote a paper in the early 2000 complaining most statisticians actual do bootstrapping incorrectly. 

That included me, in that when the percentile intervals were _simmilar_ to the BCA intervals in applications, I (mis)thought they had no advantage (i.e. I forgot to think about the distribution of intervals over repeated applications).]]></description>
		<content:encoded><![CDATA[<p>I prefer to think of the bootstrap as sampling paths from the product space Pn(A)^n WITH replacement which is less efficient than without replacement, but that very quickly decreases with increasing &#8220;n&#8221;. </p>
<p>Makes it teachable as simple survey sampling, which worked well with undergrad class once. But the BCA correction stuff and why it fails for complex problems seemed way too hard to get accross. </p>
<p>Remember Efron wrote a paper in the early 2000 complaining most statisticians actual do bootstrapping incorrectly. </p>
<p>That included me, in that when the percentile intervals were _simmilar_ to the BCA intervals in applications, I (mis)thought they had no advantage (i.e. I forgot to think about the distribution of intervals over repeated applications).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Christian Hennig</title>
		<link>http://normaldeviate.wordpress.com/2013/01/19/bootstrapping-and-subsampling-part-i/comment-page-1/#comment-6317</link>
		<dc:creator><![CDATA[Christian Hennig]]></dc:creator>
		<pubDate>Mon, 21 Jan 2013 15:35:28 +0000</pubDate>
		<guid isPermaLink="false">http://normaldeviate.wordpress.com/?p=351#comment-6317</guid>
		<description><![CDATA[I like bootstrap and use it occasionally. However, I think that an important issue with (nonparametric) bootstrap is that some features of P_n are essentially different from P (or our typical idea of P). P_n is basically discrete. This has implications. For example, in clustering, samples from P_n tend to produce larger between-cluster separation than the original sample, because points that could spoil separation can be taken away, but never added. Producing multiple points at the same location can lead to artifacts in clustering, but also in covariance-matrix estimation or group-wise covariance matrices in classification (implosion of eigenvalues). Samples from P_n (with or without multiple points) can have a smaller but not larger convex hull then the original samples etc. So I&#039;d say that one needs to be very careful with bootstrapping statistics that can somehow be affected by discreteness, the convex hull and other problematic features somebody else comes up with.]]></description>
		<content:encoded><![CDATA[<p>I like bootstrap and use it occasionally. However, I think that an important issue with (nonparametric) bootstrap is that some features of P_n are essentially different from P (or our typical idea of P). P_n is basically discrete. This has implications. For example, in clustering, samples from P_n tend to produce larger between-cluster separation than the original sample, because points that could spoil separation can be taken away, but never added. Producing multiple points at the same location can lead to artifacts in clustering, but also in covariance-matrix estimation or group-wise covariance matrices in classification (implosion of eigenvalues). Samples from P_n (with or without multiple points) can have a smaller but not larger convex hull then the original samples etc. So I&#8217;d say that one needs to be very careful with bootstrapping statistics that can somehow be affected by discreteness, the convex hull and other problematic features somebody else comes up with.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Geoff Gordon</title>
		<link>http://normaldeviate.wordpress.com/2013/01/19/bootstrapping-and-subsampling-part-i/comment-page-1/#comment-6316</link>
		<dc:creator><![CDATA[Geoff Gordon]]></dc:creator>
		<pubDate>Mon, 21 Jan 2013 15:32:25 +0000</pubDate>
		<guid isPermaLink="false">http://normaldeviate.wordpress.com/?p=351#comment-6316</guid>
		<description><![CDATA[I think one reason the bootstrap isn&#039;t more popular is what Larry mentioned: &quot;it is most useful in complex situations, but these are often the situations where the theory breaks down&quot;.  (There&#039;s not much point in going to a lot of effort to get an error bound if the error bound is probably wrong.)  Another is that the bootstrap requires re-running your estimation procedure a very large number of times, and often in ML we&#039;re lucky if we can run the estimation procedure once.]]></description>
		<content:encoded><![CDATA[<p>I think one reason the bootstrap isn&#8217;t more popular is what Larry mentioned: &#8220;it is most useful in complex situations, but these are often the situations where the theory breaks down&#8221;.  (There&#8217;s not much point in going to a lot of effort to get an error bound if the error bound is probably wrong.)  Another is that the bootstrap requires re-running your estimation procedure a very large number of times, and often in ML we&#8217;re lucky if we can run the estimation procedure once.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
