Start here: http://leon.bottou.org/projects/sgd
Then here: http://www.cs.jhu.edu/~mdredze/publications/icml_variance.pdf On Fri, May 27, 2011 at 3:54 PM, Josh Patterson <[email protected]> wrote: > Ted mentioned I believe (sequential) SGD performance last night as > being pretty incredible in terms of overall training time especially > compared to some other MR based techniques (If I have this wrong, Ted, > please correct me) -- > > I was talking about this with a coworker this morning; Are there any > online published stats comparing this in hard numbers? >
