Hello Friends,
The tutorial @ http://wiki.apache.org/nutch/NutchHadoopTutorial says don't use DFS on an NFS mount (this would be pretty stupid anyway).. I am setting up multi node nutch/hadoop environment with lots of storage available on a SAN device. I tried running nutch/hadoop with DFS file system pointed to a directory on the SAN drive which was mounted as NFS . It failed ( expected ?). How do I make it work ? In other words how do I make nutch/hadoop node stores its content on a SAN drive directly ? Is it possible ? If not what is the work around ? Do I have to make lots of local hard drive available for crawling and then copy it to the NFS drive when the crawl completes ? That would be very stupid solution ! Thanks !
