Ben Halsted wrote:
I was wondering what the required file structure is for the web gui to work
properly.

Are all of these required?
/db/crawldb
/db/index
/db/indexes
/db/segments
/db/linkdb

The indexes directory is not used when a merged index is present.

The crawldb and segments/*/crawl_parse directories are not used by the web ui.
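
To make the two answers above concrete, here is a minimal sketch that inspects a db directory and reports what the web UI would read. The function name `web_ui_layout_report` and the dictionary keys are made up for illustration. The reply only states what is not used; treating index (or indexes), linkdb, and the rest of segments as needed is an inference by elimination, not something the reply confirms.

```python
from pathlib import Path


def web_ui_layout_report(db_dir):
    """Summarise a Nutch db directory from the web UI's point of view.

    Hypothetical helper: it encodes only the two statements in the reply
    (indexes is ignored when a merged index exists; crawldb and
    segments/*/crawl_parse are not read by the web UI).
    """
    root = Path(db_dir)
    has_merged = (root / "index").is_dir()
    has_parts = (root / "indexes").is_dir()

    # A merged index takes precedence over the per-segment indexes.
    if has_merged:
        index_source = "index"
    elif has_parts:
        index_source = "indexes"
    else:
        index_source = None

    # Stated as unused in the reply.
    unused = ["crawldb", "segments/*/crawl_parse"]
    if has_merged:
        unused.append("indexes")

    return {
        "index_source": index_source,
        # Assumption by elimination: these two are still needed.
        "has_segments": (root / "segments").is_dir(),
        "has_linkdb": (root / "linkdb").is_dir(),
        "unused_by_web_ui": unused,
    }


if __name__ == "__main__":
    import sys
    print(web_ui_layout_report(sys.argv[1] if len(sys.argv) > 1 else "db"))
```

An `index_source` of `None` means neither index nor indexes was found, so there would be nothing to search.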

Also, what is the proper way to merge segments and indexes? Can I simply
move all the segments into one directory and then re-index, or is there a better
way?

You should update the linkdb so that it contains links from all segments. Then you can use the dedup and merge commands to create a new index. Ideally you should also re-index after updating the linkdb, but this is not required.
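
The order of steps above can be sketched as a small dry-run planner that only builds the command lines and does not run anything. Everything version-specific here is an assumption: the subcommand names (`invertlinks`, `index`, `dedup`, `merge`) and their argument order follow Nutch 0.8-era `bin/nutch` usage as I recall it and may differ in your checkout, so compare with the usage text printed by running `bin/nutch` without arguments. The function name `merge_plan` and the output name `index.new` are hypothetical.

```python
import shlex
from pathlib import Path


def merge_plan(db_dir, nutch="bin/nutch", merged_index="index.new", reindex=False):
    """Return the command lines for the linkdb update, dedup and merge steps.

    Assumed command forms (verify against your Nutch version):
      invertlinks <linkdb> -dir <segmentsDir>
      index <indexes> <crawldb> <linkdb> <segment> ...
      dedup <indexes>
      merge <outputIndex> <indexesDir>
    """
    root = Path(db_dir)
    linkdb = str(root / "linkdb")
    indexes = str(root / "indexes")
    segments_dir = root / "segments"

    # Step 1: update the linkdb so it contains links from all segments.
    plan = [[nutch, "invertlinks", linkdb, "-dir", str(segments_dir)]]

    # Optional step: re-index against the updated linkdb (ideal, not required).
    if reindex:
        segments = sorted(str(p) for p in segments_dir.iterdir() if p.is_dir())
        plan.append([nutch, "index", indexes, str(root / "crawldb"), linkdb, *segments])

    # Steps 2 and 3: remove duplicates, then merge into one new index.
    plan.append([nutch, "dedup", indexes])
    plan.append([nutch, "merge", str(root / merged_index), indexes])
    return plan


if __name__ == "__main__":
    for argv in merge_plan("db"):
        print(shlex.join(argv))  # print only; nothing is executed
```

Writing the merged index to a new name rather than over db/index keeps the existing index in place until the new one has been checked.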

Doug


_______________________________________________
Nutch-general mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-general
