I may have found my problem, but I'm not sure.

Here's my /usr/local/nutch directory:

drwxr-xr-x   2 root root     4096 Apr  2 13:06 bin
-rw-rw-r--   1 root root    15567 Mar 31 13:40 build.xml
-rw-rw-r--   1 root root    19814 Mar 31 13:40 CHANGES.txt
drwxr-xr-x   2 root root     4096 Apr 12 08:47 conf
drwxr-xr-x   2 root root     4096 Apr  8 17:22 crawl
drwxr-xr-x   3 root root     4096 Apr  8 17:29 db
-rw-rw-r--   1 root root     1845 Mar 31 13:40 default.properties
drwxr-xr-x  19 root root     4096 Mar 31 13:40 docs
drwxr-xr-x   2 root root     4096 Apr  2 11:15 lib
-rw-rw-r--   1 root root      615 Mar 31 13:40 LICENSE.txt
-rw-rw-r--   1 root root   755034 Mar 31 13:40 nutch-0.7.2.jar
-rw-rw-r--   1 root root 15806453 Mar 31 13:40 nutch-0.7.2.war
drwxr-xr-x  26 root root     4096 Mar 31 13:40 plugins
-rw-rw-r--   1 root root      403 Mar 31 13:40 README.txt
drwxr-xr-x   4 root root     4096 Apr  8 17:28 segments
drwxr-xr-x  11 root root     4096 Mar 31 13:40 src
-rw-r--r--   1 root root       65 Apr  8 17:28 urls

My crawl directory is empty.

My db directory is this:

[EMAIL PROTECTED] db]# ls -l
total 4
-rw-r--r--  1 root root    0 Apr  8 17:29 dbreadlock
-rw-r--r--  1 root root    0 Apr  8 17:29 dbwritelock
drwxr-xr-x  6 root root 4096 Apr  8 17:29 webdb

Inside webdb is this:

drwxr-xr-x  2 root root 4096 Apr  8 17:29 linksByMD5
drwxr-xr-x  2 root root 4096 Apr  8 17:29 linksByURL
drwxr-xr-x  2 root root 4096 Apr  8 17:29 pagesByMD5
drwxr-xr-x  2 root root 4096 Apr  8 17:29 pagesByURL
-rw-r--r--  1 root root   17 Apr  8 17:29 stats

Back in my /usr/local/nutch/segments directory I have:

drwxr-xr-x  8 root root 4096 Apr  8 17:30 20060408172630
drwxr-xr-x  8 root root 4096 Apr  8 17:30 20060408172823

So something must be wrong if I use /usr/local/nutch as my
searcher.dir, right? Even though I'm sure I followed the tutorial, I'm
obviously missing something.

Thanks again.

Paul
 

-----Original Message-----
From: sudhendra seshachala [mailto:[EMAIL PROTECTED] 
Sent: Tuesday, April 11, 2006 10:45 PM
To: nutch-user@lucene.apache.org
Subject: RE: Nutch 500 Error

Check nutch-default.xml.
There should be a property named searcher.dir.
Set it to the path of the folder that contains the index.
Better still, copy the property node, paste it into nutch-site.xml,
and set the path there.
For example, if the index folder is stored as
home/nutch/crawl
    - crawldb
    - segments
    - index
    - indexes

point searcher.dir to home/nutch/crawl.
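
As a rough sketch, the copied property node might look like this once
pasted into nutch-site.xml. The value is only the example path from
above, so substitute your own folder, and leave the file's root element
as your Nutch version ships it:

```xml
<!-- Sketch of an override for conf/nutch-site.xml.
     Paste it inside the file's existing root element. -->
<property>
  <name>searcher.dir</name>
  <!-- Assumed value: the example folder above, i.e. the directory
       that holds the index and segments subfolders. -->
  <value>home/nutch/crawl</value>
</property>
```

The web application will most likely need a restart before it picks up
the new value.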
Hope this helps.

Thanks
Sudhi

Paul Stewart <[EMAIL PROTECTED]> wrote:

Thanks. I was doing the java command wrong.

Back to my original problem: I re-ran through the entire tutorial to
ensure I was doing it right, and it seems correct. How do I tell Nutch
specifically where to look for the segments and indexes, in case it is
looking in the wrong place?

All the best,
Paul
 
