Please post a small sample file that has this problem with CDATA.

On Fri, Dec 11, 2009 at 9:41 AM, Feroze Daud <fero...@zillow.com> wrote:
> CDATA didn’t work either. It still complained about the input doc not being in
> correct format.
>
> -----Original Message-----
> From: Lance Norskog [mailto:goks...@gmail.com]
> Sent: Thursday, December 10, 2009 7:43 PM
> To: solr-user@lucene.apache.org
> Subject: Re: full-text indexing XML files
>
> Or CDATA (much easier to work with).
>
> On Wed, Dec 9, 2009 at 10:37 PM, Shalin Shekhar Mangar
> <shalinman...@gmail.com> wrote:
>> On Thu, Dec 10, 2009 at 5:13 AM, Feroze Daud <fero...@zillow.com> wrote:
>>
>>> Hi!
>>>
>>> I am trying to full text index an XML file. For various reasons, I
>>> cannot use Tika or other technology to parse the XML file. The
>>> requirement is to full-text index the XML file, including Tags and
>>> everything.
>>>
>>> So, I created a input index spec like this:
>>>
>>> <add>
>>> <doc>
>>> <field name="id">1001</field>
>>> <field name="name">NASA Advanced Research Labs</field>
>>> <field name="address">1010 Main Street, Chattanooga, FL 32212</field>
>>> <field name="content"><listing><id>1001</id>< name > NASA Advanced
>>> Research Labs </ name ><address>1010 main street, chattanooga, FL
>>> 32212</address></listing></field>
>>> </doc>
>>> </add>
>>>
>> You need to XML encode the value of the "content" field.
>>
>> --
>> Regards,
>> Shalin Shekhar Mangar.
>>
>
> --
> Lance Norskog
> goks...@gmail.com
>
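
For reference, here is a rough sketch of the two suggestions from the quoted thread (XML-encoding the "content" value, or wrapping it in CDATA). It only builds the <add> document as a string and checks it with Python's standard XML parser; it does not talk to Solr. The helper names (field, cdata_field, add_doc) are made up for illustration, field names are assumed to need no escaping, and the tag spacing of the sample listing is normalized.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape

# Raw XML that should be indexed verbatim, tags included (sample from the thread).
raw_listing = (
    "<listing><id>1001</id><name>NASA Advanced Research Labs</name>"
    "<address>1010 main street, chattanooga, FL 32212</address></listing>"
)

def field(name, value):
    """Return a <field> element with the value XML-encoded (&, <, > escaped)."""
    return '<field name="%s">%s</field>' % (name, escape(value))

def cdata_field(name, value):
    """Return a <field> element whose value is wrapped in a CDATA section."""
    # A literal "]]>" inside the value would end the section early, so split it.
    safe = value.replace("]]>", "]]]]><![CDATA[>")
    return '<field name="%s"><![CDATA[%s]]></field>' % (name, safe)

def add_doc(fields):
    """Wrap already-serialized <field> elements in <add><doc>...</doc></add>."""
    return "<add><doc>%s</doc></add>" % "".join(fields)

escaped_doc = add_doc([field("id", "1001"), field("content", raw_listing)])
cdata_doc = add_doc([field("id", "1001"), cdata_field("content", raw_listing)])

# Both variants are well-formed, and a parser hands back the original markup as text.
for doc in (escaped_doc, cdata_doc):
    content = ET.fromstring(doc).find("doc/field[@name='content']")
    assert content.text == raw_listing
    assert len(content) == 0  # no child elements: the markup is plain text
```

If a document built this way parses locally but is still rejected, the sample file itself is the thing to look at, which is why I am asking for it.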
--
Lance Norskog
goks...@gmail.com