I think all the attribute values together with element text values should be indexed in the "content" part. Also a xml map file could be used to pick up the nodes need to be indexed separately so we do not create too many fields by indexing non-critical nodes separately. Simple xpath could be used for the map source, the field name and index type should be the map target.
Regards, Hui ----- Original Message ----- From: "Robert Koberg" <[EMAIL PROTECTED]> To: "'Lucene Users List'" <[EMAIL PROTECTED]> Sent: Wednesday, September 17, 2003 10:09 AM Subject: RE: Lucene demo ideas? > Hi, > > Here are a couple of ideas for XML demos: > > 1. simply index the content into one 'content' field. Don't worry about > attributes. > > 2. index a linked Dublin core meta data file: > <link rel="meta" href="index.rdf" /> > And add fields for every element after rdf:Description > > Best, > -Rob > > > > > -----Original Message----- > > From: Erik Hatcher [mailto:[EMAIL PROTECTED] > > Sent: Wednesday, September 17, 2003 6:08 AM > > To: Lucene Users List > > > > On Wednesday, September 17, 2003, at 08:43 AM, Killeen, Tom wrote: > > > I would suggest XML as well. > > > > Again, I'd like to hear more about how you'd do this generically. Tell > > me what the field names and values would correspond to when presented > > with an XML file. > > > > Erik > > > > > > --------------------------------------------------------------------- > > To unsubscribe, e-mail: [EMAIL PROTECTED] > > For additional commands, e-mail: [EMAIL PROTECTED] > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
