Nutch cannot find the URL file you named on the command line. You passed urls, but your file is called urls.txt. Either rename the file to urls or pass urls.txt on the command line.
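If it helps, here is a rough sketch of a pre-flight check you could run before the crawl. The function name seed_file_for is hypothetical (it is not part of Nutch), and the sketch assumes you start the crawl from the directory that holds your URL file. It only reports which file name actually exists, so you know what to pass to bin/nutch.

```shell
# Hypothetical helper (not part of Nutch): print the seed file that exists.
seed_file_for() {
  # Try the name as given, then the same name with a .txt extension.
  for candidate in "$1" "$1.txt"; do
    if [ -f "$candidate" ]; then
      printf '%s\n' "$candidate"
      return 0
    fi
  done
  echo "no seed file named $1 or $1.txt in $(pwd)" >&2
  return 1
}

# Usage (left as comments so the snippet itself has no side effects):
#   seed=$(seed_file_for urls) && bin/nutch crawl "$seed" -dir ct -depth 3 >& crawl.log
# Or rename the file so your original command works unchanged:
#   mv urls.txt urls
```

Either route should get rid of the FileNotFoundException, since the name on the command line then matches a file on disk.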
- Ravi

On 3/3/06, Pine Cone <[EMAIL PROTECTED]> wrote:
> Hello,
>
> I am having some problem when I run the "bin/nutch crawl urls -dir ct -depth 3 >& crawl.log"
>
> I get this Error in my crawl.log file:
> Created webdb at LocalFS, /root/Desktop/nutch/nutch-0.7/ct/db
> Exception in thread "main" java.io.FileNotFoundException: urls (No such file or directory)
>
> My urls.txt file look like this
> http://localhost:8080/tomcat-docs/introduction.html
>
> My crawl-urlfilter.txt looks like this:
> +^http://([a-z0-9]*\.)*localhost:8080/
>
> I am running my tomcat webserver as a local host and I want to crawl the content of my webserver. my webserver is not connected to the internet.
>
> Thanks,
>
> P. Cone
