RE: XML Stripping from DIH

2011-02-22 Thread Olson, Ron
...@lbpc.com To: solr-user@lucene.apache.org solr-user@lucene.apache.org Sent: Fri, February 18, 2011 4:05:15 PM Subject: XML Stripping from DIH Hi all- I have some XML in a database that I am trying to index and store; I am interested in the various pieces of text, but none of the tags. I've been

Re: XML Stripping from DIH

2011-02-20 Thread Otis Gospodnetic
-user@lucene.apache.org solr-user@lucene.apache.org Sent: Fri, February 18, 2011 4:05:15 PM Subject: XML Stripping from DIH Hi all- I have some XML in a database that I am trying to index and store; I am interested in the various pieces of text, but none of the tags. I've been trying

XML Stripping from DIH

2011-02-18 Thread Olson, Ron
Hi all- I have some XML in a database that I am trying to index and store; I am interested in the various pieces of text, but none of the tags. I've been trying to figure out a way to strip all the tags out, but haven't found anything within Solr to do so; the XML parser seems to want XPath to