Thanks for the quick responses...it was something stupid that I was
doing...(I guess it is still to early here :)).

I was saving the urls file with no extension and all of the views of
it showed it with no extension but then when I went into the folder
via the command line, I saw it was saved as .rtf so it now crawls.

Thanks and now to get the search working so I am sure I will have many
more questions.

Thanks!

On 7/24/05, Feng (Michael) Ji <[EMAIL PROTECTED]> wrote:
> put your url file to the root of your nutch and run
> "bin/nutch ....." from there to point to your url
> file,
> 
> Michael
> 
> --- blackwater dev <[EMAIL PROTECTED]> wrote:
> 
> > I am a nutch newbie and I have created a simple urls
> > file with one
> > domain.  I have tried putting it in a few places but
> > am getting
> > errors.  Where should it go?  I am running the crawl
> > command from the
> > tutorial.
> >
> > Thanks!
> >
> >
> > expr: syntax error
> > 050724 081642 No NutchFileSystem indicated, so
> > defaulting to local fs.
> > 050724 081642 loading
> > file:/Users/e/nutch-0.6/conf/nutch-default.xml
> > 050724 081643 loading
> > file:/Users/e/nutch-0.6/conf/crawl-tool.xml
> > 050724 081643 loading
> > file:/Users/e/nutch-0.6/conf/nutch-site.xml
> > 050724 081643 crawl started in: crawl.test
> > 050724 081643 rootUrlFile = urls
> > 050724 081643 threads = 10
> > 050724 081643 depth = 3
> > 050724 081643 Created webdb at
> > LocalFS,/Users/e/nutch-0.6/crawl.test/db
> > Exception in thread "main"
> > java.io.FileNotFoundException: urls (No
> > such file or directory)
> >       at java.io.FileInputStream.open(Native Method)
> >       at
> >
> java.io.FileInputStream.<init>(FileInputStream.java:106)
> >       at java.io.FileReader.<init>(FileReader.java:55)
> >       at
> >
> net.nutch.db.WebDBInjector.injectURLFile(WebDBInjector.java:359)
> >       at
> >
> net.nutch.db.WebDBInjector.main(WebDBInjector.java:510)
> >       at
> > net.nutch.tools.CrawlTool.main(CrawlTool.java:121)
> >
> 
> 
> __________________________________________________
> Do You Yahoo!?
> Tired of spam?  Yahoo! Mail has the best spam protection around
> http://mail.yahoo.com
>


-------------------------------------------------------
SF.Net email is sponsored by: Discover Easy Linux Migration Strategies
from IBM. Find simple to follow Roadmaps, straightforward articles,
informative Webcasts and more! Get everything you need to get up to
speed, fast. http://ads.osdn.com/?ad_idt77&alloc_id492&op=click
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to