Hi Shashi, On Fri, Dec 18, 2009 at 1:36 AM, Shashikant Kore <[email protected]> wrote:
> (.. cluster assignment is already there. Wonder why you had to redo > it.) Ahh, yes. I didn't have to re-do it, but I did wanted to learn the internal structure of the data files and to point out that it was easy enough to achieve. The code is quite straightforward. > Drew, are you using the latest code? Overnight sounds too long. That's good to know. This was a couple month or two ago before the matrix/math stuff was rolled in. I'll collect exact times on the next run I do. Has anyone else run LDA outside of the canned Reuters example? I would be interested to hear about corpus characteristics and processing power required to successfully produce LDA clusters. I've had all sorts of issues, but mostly related to hadoop configuration nits related to my environment however Drew
