Sort of, but it is hard to extrapolate over a size range of >10x, and that is the scale difference we are talking about.

On Fri, Dec 16, 2011 at 11:44 AM, Dmitriy Lyubimov <dlie...@gmail.com> wrote:
> and there's no way to estimate a difference for a bigger sample?
>
> On Fri, Dec 16, 2011 at 11:37 AM, Ted Dunning <ted.dunn...@gmail.com> wrote:
> > This doesn't work because the correct value for a sub-sampled batch will be
> > smaller than for a full data set.
> >
> > On Fri, Dec 16, 2011 at 10:05 AM, Dmitriy Lyubimov <dlie...@gmail.com> wrote:
> >
> >> if it
> >> makes sense to find a better guess for lambda by just doing an R
> >> simulation on a randomly subsampled data before putting it into
> >> pipeline? or there's a fundamental problem with this approach?