Marcin Okraszewski wrote:
Hi, What is the proper way to remove a segment when running on Hadoop? Recrawl scripts tend to use the normal "rm" command for this (e.g. http://wiki.apache.org/nutch/IntranetRecrawl). Does that work when Nutch is distributed across several computers? Shouldn't they use "bin/hadoop fs -rmr" instead?
They should use the Hadoop version; the normal /bin/rm doesn't work on HDFS.

-- 
Best regards,
Andrzej Bialecki
Information Retrieval, Semantic Web
Embedded Unix, System Integration
http://www.sigram.com
Contact: info at sigram dot com
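
For illustration, here is a minimal sketch of how a recrawl script might swap its plain "rm" for the Hadoop command mentioned above. The function name remove_segment, the HADOOP_BIN and DRY_RUN variables, and the example segment path are all hypothetical and not taken from the wiki script. The only real command assumed is "bin/hadoop fs -rmr", as quoted in the question.

```shell
# Hypothetical helper: remove a segment through the Hadoop filesystem shell
# instead of /bin/rm, so that it also works when the segment lives on HDFS.
# HADOOP_BIN and DRY_RUN are illustrative names, not Nutch or Hadoop settings.
remove_segment() {
  segment="$1"
  hadoop_bin="${HADOOP_BIN:-bin/hadoop}"

  # Refuse to run without a path rather than pass an empty argument along.
  if [ -z "$segment" ]; then
    echo "usage: remove_segment <segment-path>" >&2
    return 1
  fi

  if [ "${DRY_RUN:-0}" = "1" ]; then
    # Only print the command that would be executed.
    printf '%s fs -rmr %s\n' "$hadoop_bin" "$segment"
  else
    # Recursive delete via the Hadoop filesystem shell.
    "$hadoop_bin" fs -rmr "$segment"
  fi
}
```

A script could then call, for example, "remove_segment crawl/segments/20070101" wherever it previously ran "rm -rf" on that directory. Setting DRY_RUN=1 first prints the command without deleting anything, which may be a safer way to try it out.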
