Re: ndfs stuff

Piotr Kosiorowski Fri, 08 Jul 2005 10:30:56 -0700

[EMAIL PROTECTED] wrote:

Dear Piotr,


Thanks for link, I readed your great documentation. ;)

It is not mine - it is written by Michael Cafarella - I di only someupdates.

I have only few questions after it:
1. The NDFS is not slower than 'bin/nutch server' alternatives on searchs?

2. I think if I migrate to NDFS stucture from 'server' structure (with 8million page), I need the followings:
- A web server with 1 GByte RAM.
- A fetcher with 1 GByte RAM.
- A namenode server with 4 GByte RAM
- 4 datenode servers with 4 GByte RAM, on each will be 2 million pages +replications.
This is true?

3. When I like to remove old segments (I would like refresh after 30days), how to do it? How to remove entirely segment directories fromNDFS (rm remove only one file)?

I am not using NDFS in production - I have played only a bit with it butI do not think NDFS can be treated as an alternative to "bin/nutchserver". I do not have enough experience with it but it was writtensome time ago on this list that it is not ready for production use yet- the work that is going in mapreduce branch is also connected with NDFSso we will probably see more advanced NDFS version in near future (justmy guess). So if you are going to use it now and in poductionenvironment I will stay with your current approach.

Regards
Piotr



Piotr Kosiorowski wrote:

Hello Ferenc,
Some documentation on running ndfs can be found on wiki:
http://wiki.apache.org/nutch/NutchDistributedFileSystem
Regards,
Piotr

[EMAIL PROTECTED] wrote:

Have any location the ndfs usage documentation?
Regards,
Ferenc

Re: ndfs stuff

Reply via email to