On Wednesday 17 September 2003 07:07, Erik Hatcher wrote:
> On Wednesday, September 17, 2003, at 08:43  AM, Killeen, Tom wrote:
> > I would suggest XML as well.
>
> Again, I'd like to hear more about how you'd do this generically.  Tell
> me what the field names and values would correspond to when presented
> with an XML file.

Perhaps just one generic "content" field, which would contain tokenized
content from all XML segments. That could be done easily & efficiently
with just sax event handling? Since it's a simple demo, you can't get much
simpler than that, but it should still be fairly useful?
Attributes could/should be ignored by default; common practice for XML markup
seems to be for attributes not to contain any content that would make sense to 
index.

So I'd think just stripping out all tags (and comments, PIs etc) might be 
reasonable plain simple approach for demo app.

-+ Tatu +-


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to