On Wednesday 17 September 2003 07:07, Erik Hatcher wrote: > On Wednesday, September 17, 2003, at 08:43 AM, Killeen, Tom wrote: > > I would suggest XML as well. > > Again, I'd like to hear more about how you'd do this generically. Tell > me what the field names and values would correspond to when presented > with an XML file.
Perhaps just one generic "content" field, which would contain tokenized content from all XML segments. That could be done easily & efficiently with just sax event handling? Since it's a simple demo, you can't get much simpler than that, but it should still be fairly useful? Attributes could/should be ignored by default; common practice for XML markup seems to be for attributes not to contain any content that would make sense to index. So I'd think just stripping out all tags (and comments, PIs etc) might be reasonable plain simple approach for demo app. -+ Tatu +- --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
