Seeing that mapred is about to be folded into trunk, I have three questions:

1. Are there any benchmarks or estimates of the point at which the scalability of map-reduce outweighs its overhead and complexity? E.g. with more than 10 reduce workers?
2. Will there be an option to run a plain vanilla single-box Nutch crawler instead of the map-reduce version?
3. What are the options for users who don't want to jump on board with map-reduce? Will the pre-mapred code be actively maintained?

Thanks,
k
