Hey Sean, Yeah, thanks for this. I am about to start getting into some more of the details in the code and wanted a higher level overview. It seemed to me like some of the similarity calculations were more difficult to distribute than others. Anyway, Ill dig down a bit farther in the next few weeks.
Chris On Nov 14, 2011, at 1:18 PM, Sean Owen wrote: > ably differs slightly due to bits of logic in the Hadoop job that > would prune small or insignificant dat Chris Schilling Sr. Data Mining Engineer Clever Sense, Inc. "Curating the World Around You" -------------------------------------------------------------- Winner of the 2011 Fortune Brainstorm Start-up Idol Wanna join the Clever Team? We're hiring! --------------------------------------------------------------
