Re: [Nutch-general] how to combine two run's result for search

Dennis Kubes Mon, 04 Sep 2006 19:54:57 -0700

You can keep the indexes separate and use the distributed search server, 
one per index or you can use the mergedb and mergesegs commands to merge 
the two runs into a single crawldb and a single segments then re-run the 
invertlinks and index to create a single index file which can then be 
searched.


Dennis

Feng Ji wrote:
> Hi there,
>
> In Nutch 08, I have crawled down from two webDB independently.
>
> For each run, I did invertlinks and index. So each one is searchable.
>
> Now I want to combine them togeter for search. I tried "merge" command to
> merge two indexes, but the search for the result index output dir is 
> dull.
> Do I need put output dir to the same directory as above two crawl/ ?
>
> I wonder what is proper steps to combine two seperate run into one search
> result. Do I need to combine two webdb, merge two segments and do
> invertlinks and do index?
>
> thanks your time,
>
> Michael,
>

-------------------------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Re: [Nutch-general] how to combine two run's result for search

Reply via email to