To answer your first question, change the conf/nutch-default.xml file. The parameter in question is searcher.dir.
I'm not sure how to search two different indexes using the same instance of Tomcat. You could have multiple instances of Tomcat running and map one to http://general.mycompany.com and the other to http://jobs.mycompany.com. That might work. Or you could just index all the job pages with some Field. Something like category:jobs. Then create a QueryFilter plugin that subclasses RawFieldQueryFilter to search for your "category" field. When you do a job search, you just put "category:jobs" in the search string. Howie
So there is no way to set up different databases? I mean if I crawl a series of web pages and put them one place then crawl a series of job webplaces and put them in another folder, how can I let the web user click a button and search either the general web pages or the specific job web sites? To do this I would somehow be able to tell tomcat where to get the folder from within the jsp. Thanks! On 7/24/05, Feng (Michael) Ji <[EMAIL PROTECTED]> wrote: > > I think tomcat doesn't do indexing at all; > > just follow the tutorial under doc in Nutch home; > > after getting your segments, starting tomcat at the > same level will give the tomcat direction where to > find SE indexed data; > > Hope that helps, > > Michael, > > --- blackwater dev <[EMAIL PROTECTED]> wrote: > > > Forgive me if this is a dumb question but how can I > > tell nutch where > > to pull the files for the index? I am just getting > > exceptions now > > when I do a search. How does the code under tomcat > > know where I pull > > the files from the crawl? > > > > > > Thanks! > > > > > __________________________________________________ > Do You Yahoo!? > Tired of spam? Yahoo! Mail has the best spam protection around > http://mail.yahoo.com >
------------------------------------------------------- SF.Net email is sponsored by: Discover Easy Linux Migration Strategies from IBM. Find simple to follow Roadmaps, straightforward articles, informative Webcasts and more! Get everything you need to get up to speed, fast. http://ads.osdn.com/?ad_id=7477&alloc_id=16492&op=click _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
