Thanks for the quick responses...it was something stupid that I was doing...(I guess it is still to early here :)).
I was saving the urls file with no extension and all of the views of it showed it with no extension but then when I went into the folder via the command line, I saw it was saved as .rtf so it now crawls. Thanks and now to get the search working so I am sure I will have many more questions. Thanks! On 7/24/05, Feng (Michael) Ji <[EMAIL PROTECTED]> wrote: > put your url file to the root of your nutch and run > "bin/nutch ....." from there to point to your url > file, > > Michael > > --- blackwater dev <[EMAIL PROTECTED]> wrote: > > > I am a nutch newbie and I have created a simple urls > > file with one > > domain. I have tried putting it in a few places but > > am getting > > errors. Where should it go? I am running the crawl > > command from the > > tutorial. > > > > Thanks! > > > > > > expr: syntax error > > 050724 081642 No NutchFileSystem indicated, so > > defaulting to local fs. > > 050724 081642 loading > > file:/Users/e/nutch-0.6/conf/nutch-default.xml > > 050724 081643 loading > > file:/Users/e/nutch-0.6/conf/crawl-tool.xml > > 050724 081643 loading > > file:/Users/e/nutch-0.6/conf/nutch-site.xml > > 050724 081643 crawl started in: crawl.test > > 050724 081643 rootUrlFile = urls > > 050724 081643 threads = 10 > > 050724 081643 depth = 3 > > 050724 081643 Created webdb at > > LocalFS,/Users/e/nutch-0.6/crawl.test/db > > Exception in thread "main" > > java.io.FileNotFoundException: urls (No > > such file or directory) > > at java.io.FileInputStream.open(Native Method) > > at > > > java.io.FileInputStream.<init>(FileInputStream.java:106) > > at java.io.FileReader.<init>(FileReader.java:55) > > at > > > net.nutch.db.WebDBInjector.injectURLFile(WebDBInjector.java:359) > > at > > > net.nutch.db.WebDBInjector.main(WebDBInjector.java:510) > > at > > net.nutch.tools.CrawlTool.main(CrawlTool.java:121) > > > > > __________________________________________________ > Do You Yahoo!? > Tired of spam? Yahoo! Mail has the best spam protection around > http://mail.yahoo.com > ------------------------------------------------------- SF.Net email is sponsored by: Discover Easy Linux Migration Strategies from IBM. Find simple to follow Roadmaps, straightforward articles, informative Webcasts and more! Get everything you need to get up to speed, fast. http://ads.osdn.com/?ad_idt77&alloc_id492&op=click _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
