Start here: http://leon.bottou.org/projects/sgd

Then here: http://www.cs.jhu.edu/~mdredze/publications/icml_variance.pdf

On Fri, May 27, 2011 at 3:54 PM, Josh Patterson <[email protected]> wrote:

> I believe Ted mentioned last night that (sequential) SGD performance is
> pretty incredible in terms of overall training time, especially
> compared to some other MR-based techniques (if I have this wrong, Ted,
> please correct me).
>
> I was talking about this with a coworker this morning. Are there any
> published stats online that compare this in hard numbers?
>
