>At 9:05 AM +0100 3/21/01, Michael Schulz wrote:
>>is it possible to index XML documents with htdig?
>
>Well, there's nothing stopping you. At the moment, unless you have a specified
>external parser/converter, documents of text/xml will be indexed as plaintext. Not
>the greatest, but certainly not bad.
>
>But this isn't what most people mean when they say "indexing XML." ;-)
>
>Certainly you can easily work up some sort of parser or converter for given types of
>XML documents. But the 3.1 code has no context for restricting searches based on
>context. So if you want to search the <author></author> field, you're pretty much out
>of luck.
I might add that specific searches of the type <author>content</author>
can also be done with XSLT stylesheet templates. In fact, XSLT
is far more powerful than a simple search, since it can also transform the
result of the search or compute something from it. Arguably it takes
searches engines into a new era. Obviously however, this might be tricky
if its operating on a very large number of XML documents.
--
Henry Rzepa. +44 (0)20 7594 5774 (Office) +44 (0870) 132-3747 (eFax)
Dept. Chemistry, Imperial College, London, SW7 2AY, UK.
http://www.ch.ic.ac.uk/rzepa/
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html