That answer may serve the OP well, but I can't help but propagate this
link when the idea of parsing XML with regex comes up:

http://stackoverflow.com/questions/1732348/regex-match-open-tags-except-xhtml-self-contained-tags/1732454#1732454

:)

Michael 


On Mon, 2012-05-14 at 17:03 -0400, Jack Krupansky wrote:
> A regex transformer should do the trick:
> 
> http://wiki.apache.org/solr/DataImportHandler#RegexTransformer
> 
> -- Jack Krupansky
> 
> -----Original Message----- 
> From: Ramo Karahasan
> Sent: Monday, May 14, 2012 4:54 PM
> To: solr-user@lucene.apache.org
> Subject: Index an xml filed that is saved in a database
> 
> Hi,
> 
> 
> 
> I have an XML document saved in a column of a database table. Is it possible
> to index just one part of that xml string, e.g. <content>.</content> with
> the DIH handler or is it necessary to extract this information previously?
> 
> 
> 
> Thanks,
> 
> Ramo
> 


Reply via email to