Zaheed Haque wrote:
1. How do you see the 0.7 version evolving besides maintenance updates?
Will it have a life of its own? I mean, 0.7 is very good for intranet
use or a mid-size public site. Why would you want to use the mapred version
when you don't need it? (Maybe I don't know enough :-)

Using MapReduce should have little overall impact. The 0.8 release should be as easy and efficient to use for intranets as the 0.7 release.

2. What I also understand is that the mapred version requires an extra process,

That's not true. By default it is configured to run everything in-process. But it can easily be configured to run things instead on a network of machines, greatly increasing the scalability.
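To illustrate the idea, here is a minimal Java sketch, not Nutch code. The `Runner` interface, `LocalRunner`, and `runnerFor` are hypothetical names invented for this example, and the `"local"` setting value is an assumption about how such a switch might be spelled. The point is only that the job logic stays the same while a setting decides where it runs.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class RunnerSketch {

    /** Hypothetical runner abstraction: the job code does not know where it runs. */
    public interface Runner {
        Map<String, Integer> countWords(List<String> lines);
    }

    /** Runs the map and reduce steps inside the calling JVM. */
    public static final class LocalRunner implements Runner {
        @Override
        public Map<String, Integer> countWords(List<String> lines) {
            Map<String, Integer> counts = new TreeMap<>();
            for (String line : lines) {
                // "map" step: split each line into lower-cased words
                for (String word : line.toLowerCase().split("\\s+")) {
                    if (!word.isEmpty()) {
                        // "reduce" step: sum the occurrences of each word
                        counts.merge(word, 1, Integer::sum);
                    }
                }
            }
            return counts;
        }
    }

    /** Picks a runner from a setting; only "local" is implemented in this sketch. */
    public static Runner runnerFor(String jobTracker) {
        if ("local".equals(jobTracker)) {
            return new LocalRunner();
        }
        throw new UnsupportedOperationException(
                "remote execution at " + jobTracker + " is outside this sketch");
    }

    public static void main(String[] args) {
        Runner runner = runnerFor("local");
        System.out.println(runner.countWords(
                Arrays.asList("nutch crawls pages", "nutch indexes pages")));
    }
}
```

A distributed runner would implement the same interface, so callers would not need to change when moving from one machine to many.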

I have also read in various postings that it is possible to run a
downgraded mapred version. Will that be possible in the future?

Yes.

If so, what is the technical benefit of the mapred version
vs. the 0.7 version?

It scales to much larger configurations. The advantage is that folks who work on large search systems and folks who work on smaller systems can share nearly all of their code. The vast majority of improvements (e.g., to plugins) will be shared by all.

3. It would be great to get a long term vision or view about mapred version!

4. There has also been discussion about a Nutch API. Is there any work
going on on this front? I have also seen postings regarding the use of JMX;
any update?

We would like to have an administration GUI that tracks changes to the core and plugins, without a lot of GUI-specific maintenance. JMX looks like a promising approach for this. The plan is to first improve Nutch's configuration APIs to facilitate this. For example, Nutch does not currently support multiple configurations in the same JVM. Most of the command-line tools in the mapred branch are now implemented so that they can support multiple, simultaneous invocations within the same JVM. Next we must change the plugin APIs to support multiple configurations as well.
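As a rough sketch of what this could look like, the Java example below gives each tool its own configuration object instead of reading static, JVM-wide settings, and then exposes each tool through a standard JMX MBean. It uses only the JDK's `javax.management` API. `ToolConfig`, `FetchTool`, the `fetcher.threads` key, and the `sketch.nutch` JMX domain are all hypothetical names chosen for illustration; none of this is the actual Nutch API.

```java
import java.lang.management.ManagementFactory;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import javax.management.Attribute;
import javax.management.MBeanServer;
import javax.management.ObjectName;

public class MultiConfigSketch {

    /** Hypothetical per-instance configuration: no static state, so several can coexist in one JVM. */
    public static final class ToolConfig {
        private final Map<String, String> values = new ConcurrentHashMap<>();

        public ToolConfig set(String key, String value) {
            values.put(key, value);
            return this;
        }

        public String get(String key, String defaultValue) {
            return values.getOrDefault(key, defaultValue);
        }
    }

    /** Management interface; standard MBeans require the name <Class>MBean. */
    public interface FetchToolMBean {
        int getThreads();
        void setThreads(int threads);
    }

    /** Hypothetical tool that reads its settings from the configuration it was given. */
    public static final class FetchTool implements FetchToolMBean {
        public static final String THREADS_KEY = "fetcher.threads"; // illustrative key
        private final ToolConfig conf;

        public FetchTool(ToolConfig conf) {
            this.conf = conf;
        }

        @Override
        public int getThreads() {
            return Integer.parseInt(conf.get(THREADS_KEY, "10"));
        }

        @Override
        public void setThreads(int threads) {
            conf.set(THREADS_KEY, Integer.toString(threads));
        }
    }

    public static void main(String[] args) throws Exception {
        // Two tools with different configurations in the same JVM.
        FetchTool intranet = new FetchTool(new ToolConfig().set(FetchTool.THREADS_KEY, "4"));
        FetchTool web = new FetchTool(new ToolConfig().set(FetchTool.THREADS_KEY, "64"));

        MBeanServer server = ManagementFactory.getPlatformMBeanServer();
        ObjectName intranetName = new ObjectName("sketch.nutch:type=FetchTool,name=intranet");
        ObjectName webName = new ObjectName("sketch.nutch:type=FetchTool,name=web");
        server.registerMBean(intranet, intranetName);
        server.registerMBean(web, webName);

        // A management client changes one instance through JMX; the other is unaffected.
        server.setAttribute(webName, new Attribute("Threads", 32));
        System.out.println(intranet.getThreads() + " " + web.getThreads());
    }
}
```

Because the management interface is derived from the tool's own getters and setters, a generic JMX console can display and edit these settings without GUI code written specifically for each tool.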

Long term, Nutch is what we make it. Developers' needs drive the project, not a master plan.

Doug


_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers
