<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Bootstrapping and Subsampling: Part II</title>
	<atom:link href="http://normaldeviate.wordpress.com/2013/01/27/bootstrapping-and-subsampling-part-ii/feed/" rel="self" type="application/rss+xml" />
	<link>http://normaldeviate.wordpress.com/2013/01/27/bootstrapping-and-subsampling-part-ii/</link>
	<description>Thoughts on Statistics and Machine Learning</description>
	<lastBuildDate>Sat, 25 May 2013 23:18:25 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: Comunidades tribais são mais violentas? O quão próxima é a distribuição normal? O papel do BNDES. &#124; Análise Real</title>
		<link>http://normaldeviate.wordpress.com/2013/01/27/bootstrapping-and-subsampling-part-ii/comment-page-1/#comment-7060</link>
		<dc:creator><![CDATA[Comunidades tribais são mais violentas? O quão próxima é a distribuição normal? O papel do BNDES. &#124; Análise Real]]></dc:creator>
		<pubDate>Tue, 05 Feb 2013 01:14:43 +0000</pubDate>
		<guid isPermaLink="false">http://normaldeviate.wordpress.com/?p=358#comment-7060</guid>
		<description><![CDATA[[...] - Sobre bootstraping I e II; [...]]]></description>
		<content:encoded><![CDATA[<p>[...] &#8211; Sobre bootstraping I e II; [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: normaldeviate</title>
		<link>http://normaldeviate.wordpress.com/2013/01/27/bootstrapping-and-subsampling-part-ii/comment-page-1/#comment-6975</link>
		<dc:creator><![CDATA[normaldeviate]]></dc:creator>
		<pubDate>Fri, 01 Feb 2013 13:40:18 +0000</pubDate>
		<guid isPermaLink="false">http://normaldeviate.wordpress.com/?p=358#comment-6975</guid>
		<description><![CDATA[I agree. I think with some work, something could be said about the finite sample properties although
it would not be easy.]]></description>
		<content:encoded><![CDATA[<p>I agree. I think with some work, something could be said about the finite sample properties although<br />
it would not be easy.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Christian Hennig</title>
		<link>http://normaldeviate.wordpress.com/2013/01/27/bootstrapping-and-subsampling-part-ii/comment-page-1/#comment-6968</link>
		<dc:creator><![CDATA[Christian Hennig]]></dc:creator>
		<pubDate>Fri, 01 Feb 2013 12:46:08 +0000</pubDate>
		<guid isPermaLink="false">http://normaldeviate.wordpress.com/?p=358#comment-6968</guid>
		<description><![CDATA[I see a problem with this kind of asymptotic result that is not a problem with all asymptotic results in statistics (but with some others, too). This is that in reality we have a fixed n and a fixed b, and the asymptotic relies on *how b changes with n*. Now the interpretation of increasing n is clear: We observe more and get more information. No statistician would *choose* to have a small n if large n is available (OK, let&#039;s say I don&#039;t bother for the moment with situations where she would). However, what happens to b if n goes to infinity is actually *decided* by the statistician. And it is only *hypothetically* decided by the statistician if in reality there is only one fixed n and one fixed b. So if my n is 500 and I choose b to be 250, whether the asymptotic result holds or not depends on what b I&#039;d choose for n=1,000, 10,000, 1 billion and so on, which I don&#039;t need to bother about because that&#039;s not the n that I have. 
So how useful is an asymptotic result in a given real situation if its validity depends on how the reader believes the statistician would choose b with much larger n, which is a choice that in fact the statistician never has to make?]]></description>
		<content:encoded><![CDATA[<p>I see a problem with this kind of asymptotic result that is not a problem with all asymptotic results in statistics (but with some others, too). This is that in reality we have a fixed n and a fixed b, and the asymptotic relies on *how b changes with n*. Now the interpretation of increasing n is clear: We observe more and get more information. No statistician would *choose* to have a small n if large n is available (OK, let&#8217;s say I don&#8217;t bother for the moment with situations where she would). However, what happens to b if n goes to infinity is actually *decided* by the statistician. And it is only *hypothetically* decided by the statistician if in reality there is only one fixed n and one fixed b. So if my n is 500 and I choose b to be 250, whether the asymptotic result holds or not depends on what b I&#8217;d choose for n=1,000, 10,000, 1 billion and so on, which I don&#8217;t need to bother about because that&#8217;s not the n that I have.<br />
So how useful is an asymptotic result in a given real situation if its validity depends on how the reader believes the statistician would choose b with much larger n, which is a choice that in fact the statistician never has to make?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mark Schaffer</title>
		<link>http://normaldeviate.wordpress.com/2013/01/27/bootstrapping-and-subsampling-part-ii/comment-page-1/#comment-6927</link>
		<dc:creator><![CDATA[Mark Schaffer]]></dc:creator>
		<pubDate>Thu, 31 Jan 2013 13:49:11 +0000</pubDate>
		<guid isPermaLink="false">http://normaldeviate.wordpress.com/?p=358#comment-6927</guid>
		<description><![CDATA[Great, thanks!]]></description>
		<content:encoded><![CDATA[<p>Great, thanks!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: normaldeviate</title>
		<link>http://normaldeviate.wordpress.com/2013/01/27/bootstrapping-and-subsampling-part-ii/comment-page-1/#comment-6925</link>
		<dc:creator><![CDATA[normaldeviate]]></dc:creator>
		<pubDate>Thu, 31 Jan 2013 13:32:08 +0000</pubDate>
		<guid isPermaLink="false">http://normaldeviate.wordpress.com/?p=358#comment-6925</guid>
		<description><![CDATA[Yes. That was one of the motivations for subsampling, it works on dependent data.
You need to divide the data into blocks and subsample blocks.
Most of the Romano book is actually about the dependent case.
If you google ``time series subsampling&#039;&#039; you&#039;ll
find lots of good stuff.]]></description>
		<content:encoded><![CDATA[<p>Yes. That was one of the motivations for subsampling, it works on dependent data.<br />
You need to divide the data into blocks and subsample blocks.<br />
Most of the Romano book is actually about the dependent case.<br />
If you google &#8220;time series subsampling&#8221; you&#8217;ll<br />
find lots of good stuff.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mark Schaffer</title>
		<link>http://normaldeviate.wordpress.com/2013/01/27/bootstrapping-and-subsampling-part-ii/comment-page-1/#comment-6924</link>
		<dc:creator><![CDATA[Mark Schaffer]]></dc:creator>
		<pubDate>Thu, 31 Jan 2013 13:28:01 +0000</pubDate>
		<guid isPermaLink="false">http://normaldeviate.wordpress.com/?p=358#comment-6924</guid>
		<description><![CDATA[First of all, thanks for writing this - really interesting.  And a question - what if your data aren&#039;t i.i.d.?  There are bootstrap flavours for that, and I wonder if there are subsampling flavours as well.]]></description>
		<content:encoded><![CDATA[<p>First of all, thanks for writing this &#8211; really interesting.  And a question &#8211; what if your data aren&#8217;t i.i.d.?  There are bootstrap flavours for that, and I wonder if there are subsampling flavours as well.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Andrew Beam</title>
		<link>http://normaldeviate.wordpress.com/2013/01/27/bootstrapping-and-subsampling-part-ii/comment-page-1/#comment-6911</link>
		<dc:creator><![CDATA[Andrew Beam]]></dc:creator>
		<pubDate>Thu, 31 Jan 2013 02:03:08 +0000</pubDate>
		<guid isPermaLink="false">http://normaldeviate.wordpress.com/?p=358#comment-6911</guid>
		<description><![CDATA[I would love to see this lead into a post about bagging. Bagging is obviously a technique widely used in machine learning (which as you mentioned the bootstrap is not) and I would love to hear this type of discussion on it.]]></description>
		<content:encoded><![CDATA[<p>I would love to see this lead into a post about bagging. Bagging is obviously a technique widely used in machine learning (which as you mentioned the bootstrap is not) and I would love to hear this type of discussion on it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: László Sándor</title>
		<link>http://normaldeviate.wordpress.com/2013/01/27/bootstrapping-and-subsampling-part-ii/comment-page-1/#comment-6837</link>
		<dc:creator><![CDATA[László Sándor]]></dc:creator>
		<pubDate>Tue, 29 Jan 2013 15:18:01 +0000</pubDate>
		<guid isPermaLink="false">http://normaldeviate.wordpress.com/?p=358#comment-6837</guid>
		<description><![CDATA[This is really helpful, thanks. Back to an older question of mine, would you say that subsampling is more promising for &quot;bunching&quot;  estimates that people try to do from big data, distributions these days? So far, bootstrapping was what all there was, see e.g. http://obs.rc.fas.harvard.edu/chetty/denmark_adjcost.pdf

I also wonder whether this is could be used from regression discontinuity estimates, the literature seems to be heading into a different direction, e.g. http://www-personal.umich.edu/~cattaneo/papers/RD-biascorrection.pdf

And by the way, if we are &quot;sticking to the data&quot; this much, are we getting standard errors (closer to those) conditional on covariates, or not?
I basically mean this paper: http://www.nber.org/papers/w17442.pdf]]></description>
		<content:encoded><![CDATA[<p>This is really helpful, thanks. Back to an older question of mine, would you say that subsampling is more promising for &#8220;bunching&#8221;  estimates that people try to do from big data, distributions these days? So far, bootstrapping was what all there was, see e.g. <a href="http://obs.rc.fas.harvard.edu/chetty/denmark_adjcost.pdf" rel="nofollow">http://obs.rc.fas.harvard.edu/chetty/denmark_adjcost.pdf</a></p>
<p>I also wonder whether this is could be used from regression discontinuity estimates, the literature seems to be heading into a different direction, e.g. <a href="http://www-personal.umich.edu/~cattaneo/papers/RD-biascorrection.pdf" rel="nofollow">http://www-personal.umich.edu/~cattaneo/papers/RD-biascorrection.pdf</a></p>
<p>And by the way, if we are &#8220;sticking to the data&#8221; this much, are we getting standard errors (closer to those) conditional on covariates, or not?<br />
I basically mean this paper: <a href="http://www.nber.org/papers/w17442.pdf" rel="nofollow">http://www.nber.org/papers/w17442.pdf</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: normaldeviate</title>
		<link>http://normaldeviate.wordpress.com/2013/01/27/bootstrapping-and-subsampling-part-ii/comment-page-1/#comment-6806</link>
		<dc:creator><![CDATA[normaldeviate]]></dc:creator>
		<pubDate>Mon, 28 Jan 2013 20:23:12 +0000</pubDate>
		<guid isPermaLink="false">http://normaldeviate.wordpress.com/?p=358#comment-6806</guid>
		<description><![CDATA[No. You can actually estimate that too.]]></description>
		<content:encoded><![CDATA[<p>No. You can actually estimate that too.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: anon</title>
		<link>http://normaldeviate.wordpress.com/2013/01/27/bootstrapping-and-subsampling-part-ii/comment-page-1/#comment-6804</link>
		<dc:creator><![CDATA[anon]]></dc:creator>
		<pubDate>Mon, 28 Jan 2013 20:22:05 +0000</pubDate>
		<guid isPermaLink="false">http://normaldeviate.wordpress.com/?p=358#comment-6804</guid>
		<description><![CDATA[Don&#039;t you need to know \beta when constructing the subsampling distribution L?]]></description>
		<content:encoded><![CDATA[<p>Don&#8217;t you need to know \beta when constructing the subsampling distribution L?</p>
]]></content:encoded>
	</item>
</channel>
</rss>
