Is this a crawler indexing the pages? If so, i would point it to whatever you 
need. If, for some reason, you cannot, you can modifiy the host/domain in your 
index using pattern char filters or maybe the stored (returned) values using a 
custom update processor.

> Hi,
> 
> I am getting ready to index a recent copy of Wikipedia's pages-articles
> dump.  I have two servers, foo and bar.  On foo.com/mediawiki I have a
> Mediawiki install serving up the pages. On bar.com/solr I have my solr
> install. I have the pages-articles.xml file from Wikipedia and the solr
> instructions  at
> http://wiki.apache.org/solr/DataImportHandler#Example:_Indexing_wikipedia.
>  It looks pretty straightforward but I have a couple of preparatory
> questions.
> 
> If I index the pages-articles.xml on bar.com/solr, they will then be
> pointing to the relative links on solr.com/mediawiki, which don't exist,
> right?  So is there a way to tell solr that the base url for a bunch of
> index records is different than what it thinks they are? Or would it be
> easier simply to put a solr installation on foo.com?
> 
> 
> 
> 
> \
> 
> FredZ

Reply via email to