Hello,
I have been using Nutch 0.6 for quite some time and have several
machines serving up indexes. Each server has a segments directory, with
individual segments beneath it. The search server is pointed at the
root segment dir and then serves up all the segments beneath it. This
allows me to add additional segments to the searchable index by simply
copying a new, indexed segment into the root /segments dir.
For example:
/segments/seg1
/segments/seg2
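As a rough sketch of the 0.6 workflow described above, adding a segment is just a directory copy. The function name `add_segment` and the paths passed to it are illustrative only and are not part of Nutch; the sketch assumes the segment has already been indexed before it is copied.

```python
import shutil
from pathlib import Path


def add_segment(new_segment: str, segments_root: str) -> Path:
    """Copy an already-indexed segment directory under the root segments dir.

    Hypothetical helper for illustration; Nutch itself provides no such function.
    """
    src = Path(new_segment)
    dest = Path(segments_root) / src.name
    # Refuse to overwrite a segment that is already being served.
    if dest.exists():
        raise FileExistsError(f"segment already present: {dest}")
    shutil.copytree(src, dest)
    return dest
```

For instance, `add_segment("/tmp/seg3", "/segments")` would leave the new segment at /segments/seg3 next to seg1 and seg2.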
But now I am a bit confused by the new 0.8 layout. It appears
that the index directories are no longer a subdir of the segment dir
(the Wiki suggests putting the index at crawl/index). When a new segment
is crawled, I cannot write its index to crawl/index because Nutch tells
me an index already exists at that location.
I guess I just need a little bit of information to help clear things up.
Can I still use the same directory structure as before, or are indexes
required to be outside of segments? If so, is there a way to add more
segments without doing a merge?
Thanks for the clarification,
Greg
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general