Sure. LSH is a fine candidate for parallelism and scaling.
I would recommend starting small and testing as you go rather than leaping into a full-fledged parallelized implementation. Look at other open-source implementations of LSH algorithms first. Be warned that parameter selection for LSH can be pretty tricky (so I hear, anyway), so pick a reasonable and realistic test problem that you can experiment with.

On Wed, Apr 13, 2011 at 12:19 AM, ke xie <[email protected]> wrote:

> Can we implement one and contribute into the mahout project? Any
> suggestions?
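
To give a feel for why the parameters are tricky, here is a minimal sketch. It assumes the MinHash-with-banding flavor of LSH, which this thread does not actually specify, and the function names are made up for illustration. With signatures split into `bands` bands of `rows` rows each, two items with Jaccard similarity s become a candidate pair with probability 1 - (1 - s^rows)^bands, so small changes to either number move the effective similarity cutoff quite a bit.

```python
# Sketch only: assumes MinHash signatures split into `bands` bands of `rows`
# rows each. Function names are hypothetical, not from Mahout or any library.

def candidate_probability(similarity, rows, bands):
    """Chance that two items with the given Jaccard similarity agree on
    every row of at least one band, and so end up as a candidate pair."""
    return 1.0 - (1.0 - similarity ** rows) ** bands


def approx_threshold(rows, bands):
    """Rough similarity at which the S-curve rises most steeply."""
    return (1.0 / bands) ** (1.0 / rows)


if __name__ == "__main__":
    # Three ways to split a 100-hash signature; compare how the cutoff moves.
    for rows, bands in [(2, 50), (5, 20), (10, 10)]:
        print(f"rows={rows} bands={bands} "
              f"threshold~{approx_threshold(rows, bands):.2f}")
        for s in (0.2, 0.5, 0.8):
            p = candidate_probability(s, rows, bands)
            print(f"  similarity={s:.1f} -> candidate probability {p:.3f}")
```

Running this against a small, realistic data set before writing any parallel code is a cheap way to settle on `rows` and `bands`.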
