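To answer your first question, edit the conf/nutch-default.xml
file (or override the value in conf/nutch-site.xml). The parameter
you want is searcher.dir, which tells the search webapp where to
look for the index and segments.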
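
For reference, an entry of this kind looks roughly like the following. The path is only a placeholder I made up for illustration, so substitute the directory that actually holds your crawl output:

```xml
<!-- Sketch only: the value below is a hypothetical path, not a Nutch default. -->
<property>
  <name>searcher.dir</name>
  <value>/path/to/your/crawl</value>
</property>
```

Tomcat will most likely need a restart before the webapp picks up the new value.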

I'm not sure how to search two different indexes using the same
instance of Tomcat. You could have multiple instances of Tomcat
running and map one to http://general.mycompany.com and the
other to http://jobs.mycompany.com. That might work.

Or you could just add a field to all the job pages when you index
them, something like category:jobs. Then create a QueryFilter plugin
that subclasses RawFieldQueryFilter so that queries can match on your
"category" field. When you do a job search, you just add
"category:jobs" to the search string.
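
As a rough sketch of that last step, the search page could append the category clause before handing the query string to Nutch. The class and method names here (ScopedQuery, build) are made up for illustration and are not part of Nutch, and this only helps once the "category" field is indexed and the query filter plugin is in place:

```java
/**
 * Hypothetical helper (not part of Nutch): adds a "category:<name>" clause
 * to the text the user typed, so one index can serve both kinds of search.
 */
public final class ScopedQuery {
    private ScopedQuery() {}

    /** Returns the user's query, restricted to the category when one is given. */
    public static String build(String userQuery, String category) {
        String q = (userQuery == null) ? "" : userQuery.trim();
        // No category selected: search everything, query unchanged.
        if (category == null || category.trim().isEmpty()) {
            return q;
        }
        String clause = "category:" + category.trim();
        return q.isEmpty() ? clause : q + " " + clause;
    }

    public static void main(String[] args) {
        // "Jobs" button pressed: restrict the search to job pages.
        System.out.println(build("java developer", "jobs"));
        // "General" button pressed: no restriction.
        System.out.println(build("java developer", null));
    }
}
```

The JSP would then pass the returned string on as the query in place of the raw user input, depending on which button was clicked.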

Howie


So there is no way to set up different databases?  I mean, if I crawl a
series of web pages and put them in one place, then crawl a series of job
web sites and put them in another folder, how can I let the web user
click a button and search either the general web pages or the specific
job web sites?  To do this I would somehow need to be able to tell Tomcat
where to get the folder from within the JSP.

Thanks!

On 7/24/05, Feng (Michael) Ji <[EMAIL PROTECTED]> wrote:
>
> I don't think Tomcat does any indexing at all.
>
> Just follow the tutorial under doc in the Nutch home directory.
>
> After you have your segments, start Tomcat at the same
> directory level as them; that tells Tomcat where to
> find the search engine's indexed data.
>
> Hope that helps,
>
> Michael,
>
> --- blackwater dev <[EMAIL PROTECTED]> wrote:
>
> > Forgive me if this is a dumb question but how can I
> > tell nutch where
> > to pull the files for the index?  I am just getting
> > exceptions now
> > when I do a search.  How does the code under tomcat
> > know where I pull
> > the files from the crawl?
> >
> >
> > Thanks!
> >
>




_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
