You can keep the indexes separate and run a distributed search server for each one. Alternatively, you can use the mergedb and mergesegs commands to merge the two runs into a single crawldb and a single set of segments, then re-run invertlinks and index to create a single index that can be searched.
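The merge route might look roughly like the sketch below. The directory names crawl1, crawl2 and merged are hypothetical, and the argument order is written from memory of the 0.8 command-line tools, so check it against the usage message each command prints when run without arguments. By default the script is a dry run that only prints the commands; set NUTCH=bin/nutch to execute them.

```shell
#!/bin/sh
# Sketch: merge two independent Nutch crawls into one searchable index.
# Assumptions: each crawl directory holds crawldb/ and segments/;
# crawl1, crawl2 and merged are hypothetical names.

# Dry run by default: each command is printed, not executed.
# Run with NUTCH=bin/nutch to perform the merge for real.
NUTCH="${NUTCH:-echo bin/nutch}"

merge_crawls() {
  a="$1"; b="$2"; out="$3"

  # 1. Merge the two crawldbs into a single crawldb.
  $NUTCH mergedb "$out/crawldb" "$a/crawldb" "$b/crawldb" || return 1

  # 2. Merge every segment from both runs into a single set of segments.
  $NUTCH mergesegs "$out/segments" "$a"/segments/* "$b"/segments/* || return 1

  # 3. Rebuild the linkdb from the merged segments.
  $NUTCH invertlinks "$out/linkdb" -dir "$out/segments" || return 1

  # 4. Build one index from the merged crawldb, linkdb and segments.
  $NUTCH index "$out/indexes" "$out/crawldb" "$out/linkdb" "$out"/segments/* || return 1
}

merge_crawls crawl1 crawl2 merged
```

The search webapp would then need to point at the merged directory (the searcher.dir property) rather than at either original crawl.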

Dennis

Feng Ji wrote:
> Hi there,
>
> In Nutch 0.8, I have crawled from two webDBs independently.
>
> For each run, I did invertlinks and index, so each one is searchable.
>
> Now I want to combine them together for search. I tried the "merge" command
> to merge the two indexes, but the search for the result index output dir is
> dull. Do I need to put the output dir in the same directory as the above two
> crawl/ directories?
>
> I wonder what the proper steps are to combine two separate runs into one
> search result. Do I need to combine the two webdbs, merge the two segments,
> and then do invertlinks and index?
>
> Thanks for your time,
>
> Michael

_______________________________________________
Nutch-general mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-general
