Building trunk is tricky and runs with issues. Don't use it in production unless you know what you're doing. It's safer to checkout 1.3 stable although 1.4-dev runs fine as well and has some fixes for 1.3 that users mentioned on the list.
There seems to be no crawl-urlfilter file indeed. Don't know why it's gone since the crawl command is still there. You can find the file in the 1.2 release: http://svn.apache.org/viewvc/nutch/branches/branch-1.2/conf/ > Thanks for a quick reply. > > I searched in the nutch directory but still do not see that file :(. Here's > complete file list inside runtime/local/conf directory. > > us137390:conf parampreetsethi$ pwd > /Users/parampreetsethi/Documents/workspace/nutch/runtime/local/conf > us137390:conf parampreetsethi$ ls -t > automaton-urlfilter.txt domain-urlfilter.txt nutch-default.xml > prefix-urlfilter.txt solrindex-mapping.xml > configuration.xsl httpclient-auth.xml nutch-site.xml > regex-normalize.xml subcollections.xml > domain-suffixes.xml log4j.properties parse-plugins.dtd > regex-urlfilter.txt suffix-urlfilter.txt > domain-suffixes.xsd nutch-conf.xsl parse-plugins.xml > schema.xml tika-mimetypes.xml > > By the way, I tried deploying the code by checking out from svn repository, > but could not build it. I was getting following error: > > resolve-default: > [ivy:resolve] :: Ivy 2.2.0 - 20100923230623 :: http://ant.apache.org/ivy/ > :: [ivy:resolve] :: loading settings :: file = > /Users/parampreetsethi/Documents/workspace/nutch/ivy/ivysettings.xml > [ivy:resolve] > [ivy:resolve] :: problems summary :: > [ivy:resolve] :::: WARNINGS > [ivy:resolve] module not found: > org.apache.gora#gora-core;0.2-incubating > [ivy:resolve] ==== local: tried > [ivy:resolve] > /Users/parampreetsethi/.ivy2/local/org.apache.gora/gora-core/0.2-incubating > / ivys/ivy.xml > [ivy:resolve] -- artifact > org.apache.gora#gora-core;0.2-incubating!gora-core.jar: > [ivy:resolve] > /Users/parampreetsethi/.ivy2/local/org.apache.gora/gora-core/0.2-incubating > / jars/gora-core.jar > [ivy:resolve] module not found: > org.apache.gora#gora-sql;0.2-incubating > [ivy:resolve] ==== local: tried > [ivy:resolve] > /Users/parampreetsethi/.ivy2/local/org.apache.gora/gora-sql/0.2-incubating/ > i vys/ivy.xml > [ivy:resolve] -- artifact > org.apache.gora#gora-sql;0.2-incubating!gora-sql.jar: > [ivy:resolve] > /Users/parampreetsethi/.ivy2/local/org.apache.gora/gora-sql/0.2-incubating/ > j ars/gora-sql.jar > [ivy:resolve] :::::::::::::::::::::::::::::::::::::::::::::: > [ivy:resolve] :: UNRESOLVED DEPENDENCIES :: > [ivy:resolve] :::::::::::::::::::::::::::::::::::::::::::::: > [ivy:resolve] :: org.apache.gora#gora-core;0.2-incubating: not > found [ivy:resolve] :: org.apache.gora#gora-sql;0.2-incubating: > not found [ivy:resolve] > :::::::::::::::::::::::::::::::::::::::::::::: [ivy:resolve] > [ivy:resolve] :: USE VERBOSE OR DEBUG MESSAGE LEVEL FOR MORE DETAILS > > BUILD FAILED > /Users/parampreetsethi/Documents/workspace/nutch/build.xml:458: impossible > to resolve dependencies: > resolve failed - see output for details > > > -param > > On 7/11/11 5:56 PM, "Jerry E. Craig, Jr." <[email protected]> wrote: > > Look down a little further for the > > > > or > > runtime/local/bin/nutch (version >= 1.3) > > > > If you download the bin then it's in the runtime directory. > > > > Jerry E. Craig, Jr. > > > > -----Original Message----- > > From: Sethi, Parampreet [mailto:[email protected]] > > Sent: Monday, July 11, 2011 2:51 PM > > To: [email protected] > > Subject: Nutch Novice help > > > > Hi All, > > > > Sorry for such a naïve question, I downloaded nutch 1.3 binary today and > > trying to set it up as mentioned in Tutorial at > > http://wiki.apache.org/nutch/NutchTutorial > > > > How ever I am not able to find crawl-urlfilter.txt inside conf directory. > > Is there any other place where I should look for this file? > > > > Thanks > > Param

