Re: [htdig] Indexing XML

Rzepa, Henry Fri, 23 Mar 2001 00:25:04 -0800
>At 9:05 AM +0100 3/21/01, Michael Schulz wrote:
>>is it possible to index XML documents with htdig?
>
>Well, there's nothing stopping you. At the moment, unless you have a specified 
>external parser/converter, documents of text/xml will be indexed as plaintext. Not 
>the greatest, but certainly not bad.
>
>But this isn't what most people mean when they say "indexing XML." ;-)
>
>Certainly you can easily work up some sort of parser or converter for given types of 
>XML documents. But the 3.1 code has no context for restricting searches based on 
>context. So if you want to search the <author></author> field, you're pretty much out 
>of luck.


I might add that specific searches of the type <author>content</author>
can also be done with XSLT stylesheet templates. In fact, XSLT
is far more powerful than a simple search, since it can also transform the
result of the search or compute something from it. Arguably it takes
search engines into a new era. Obviously, however, this might be tricky
if it's operating on a very large number of XML documents.
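
To make the idea of a field-restricted search concrete, here is a minimal
sketch in Python rather than XSLT, using only the standard library's
ElementTree. It is a stand-in for what a stylesheet template matching on
<author> would select, not anything htdig itself does. The document names,
element names, and the search_field helper are all hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample documents; file and element names are illustrative only.
DOCS = {
    "paper1.xml": "<article><author>Rzepa</author><title>Chemical markup</title></article>",
    "paper2.xml": "<article><author>Schulz</author><title>Indexing notes</title></article>",
}

def search_field(docs, tag, term):
    """Return names of documents where some <tag> element's text contains term."""
    term = term.lower()
    hits = []
    for name, xml_text in docs.items():
        root = ET.fromstring(xml_text)
        # iter(tag) walks the root and every descendant with that tag name
        if any(term in (el.text or "").lower() for el in root.iter(tag)):
            hits.append(name)
    return hits

print(search_field(DOCS, "author", "rzepa"))
```

Note that every document is parsed on every query, which is exactly why
this approach might be tricky over a very large collection.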
-- 

Henry Rzepa. +44 (0)20 7594 5774 (Office) +44 (0870) 132-3747 (eFax)
Dept. Chemistry, Imperial College, London, SW7  2AY, UK. 
http://www.ch.ic.ac.uk/rzepa/


_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html