Jay Pound wrote:
> 1.) we need to split up chunks of data into sub-folders so as not to run
> the filesystem up against its physical limit on the number of files in a
> single directory, the way squid splits up its data into directories.
I agree. I am currently using ReiserFS with NDFS, so this is not a
priority, but long-term it should be fixed. Please file a bug report,
and, ideally, contribute a patch.
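Something along these lines would do it (a rough sketch only; the class
and names below are made up for illustration, not actual NDFS code).
The idea is to hash the low bits of the block id into two levels of
subdirectories, squid-style, so no single directory grows without bound:

  import java.io.File;

  public class BlockDirs {
    private final File root;

    public BlockDirs(File root) { this.root = root; }

    // Map a block id to root/xx/yy/blk_<id>: 256*256 buckets, so each
    // directory holds only a tiny fraction of the stored blocks.
    public File fileForBlock(long blockId) {
      int b1 = (int) (blockId & 0xFF);
      int b2 = (int) ((blockId >>> 8) & 0xFF);
      File dir = new File(new File(root, String.format("%02x", b1)),
                          String.format("%02x", b2));
      dir.mkdirs();                     // create bucket dirs lazily
      return new File(dir, "blk_" + blockId);
    }
  }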
> 2.) when a datanode is set to store data on an nfs share / samba share [...]
That is not a recommended configuration.
A datanode should reasonably handle disk failures. Developing and
debugging this may take time, however. I'm not yet sure how disk
failures appear to a JVM. Things are currently written so that if an
exception is thrown during disk i/o then the datanode should take itself
offline, initiating replication of its data. We'll see if that's
sufficient.
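To make that concrete, here is a minimal sketch of the intended policy
(hypothetical names, not the real datanode code): any IOException
during a block write marks the node offline, after which its blocks
would be re-replicated from the other copies.

  import java.io.IOException;

  public abstract class DataNodeSketch {
    private volatile boolean offline = false;

    protected abstract void writeBlockToDisk(long blockId, byte[] data)
        throws IOException;
    protected abstract void stopHeartbeats(); // hypothetical: deregister

    public void storeBlock(long blockId, byte[] data) throws IOException {
      if (offline) {
        throw new IOException("datanode is offline");
      }
      try {
        writeBlockToDisk(blockId, data);
      } catch (IOException e) {
        // Treat any disk i/o failure as a bad device: take the node
        // offline so replication of its data is initiated elsewhere.
        offline = true;
        stopHeartbeats();
        throw e;
      }
    }
  }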
> 3.) we need to set a limit on how much of the filesystem can be used by
> ndfs, or a max # of 32mb chunks to be stored; when a single machine runs
> out of space the same thing happens as in #2: ndfs hangs waiting to write
> data to that particular datanode instead of transmitting data to the
> other datanodes
The max storage per datanode used to be configurable, but we found that
difficult to manage, as it required separate configuration for each
datanode whenever datanodes had different devices. So now all space on
the device is assumed to be available to NDFS. Making this optionally
configurable again would probably be better. Please file a bug report,
and, ideally, contribute a patch.
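If someone picks this up, the optional cap might look like the sketch
below (all names invented for illustration): refuse new blocks once a
configured byte limit would be exceeded, and fall back to whole-device
space when no cap is set.

  import java.io.File;

  public class SpaceChecker {
    private final File dataDir;
    private final long maxBytes; // 0 = no cap: whole device is available

    public SpaceChecker(File dataDir, long maxBytes) {
      this.dataDir = dataDir;
      this.maxBytes = maxBytes;
    }

    // true if a block of the given size may still be stored here
    public boolean hasRoomFor(long blockSize) {
      if (maxBytes > 0 && usedBytes() + blockSize > maxBytes) {
        return false; // cap reached; another datanode should be chosen
      }
      return dataDir.getUsableSpace() > blockSize; // device must have room
    }

    // sum of block file sizes currently in the data directory
    private long usedBytes() {
      long total = 0;
      File[] files = dataDir.listFiles();
      if (files != null) {
        for (File f : files) {
          total += f.length();
        }
      }
      return total;
    }
  }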
Doug