On 28 Sep 2004, at 03:26, Chris Lilley wrote:
> On Saturday, September 25, 2004, 2:55:25 PM, Holger wrote: > > HW> Hi Chris > > HW> <> >>>> Google has been asked to index XML content and W3C staff even >>>> showed >>>> them how simple that was for any +xml media type, and even >>>> volunteered >>>> code (which is trivial). But, little response. > > HW> a bit off topic but > HW> do you think it would be possible to contribute that code to the > open source > HW> distributed web search engine "nutch" ? > HW> http://www.nutch.org > HW> i think that would be really cool. > > IIRC the proof of concept was an XSLT that took any xml source and > removed all the start and end tags ;-) ... which is the default XSLT transform. So in fact, your XSLT is empty. If for some reason you *really* wanted to go overboard, you could do it with a regexp in less than 10 bytes (exercise for the reader :) Bluntly, it is so simple to extract the text from SVG that there really isn't any excuse. By comparison, I fetched the search SDK for another popular vector graphics format and it was about a 600K download iirc. To be fair, you could go a lot further with searching SVG than converting to text. My suggestion would be to hack something that converted SVG to HTML and index the HTML (I did a proof of concept on that as well - http://www.w3.org/2002/05/svg2stuff.html). Then you'd get all the hyperlinks, and more semantics (the root <title> as document <title> for example). Doing something more on this is on my todo, but a long way down the list. Dean ------------------------ Yahoo! Groups Sponsor --------------------~--> $9.95 domain names from Yahoo!. Register anything. http://us.click.yahoo.com/J8kdrA/y20IAA/yQLSAA/1U_rlB/TM --------------------------------------------------------------------~-> ----- To unsubscribe send a message to: [EMAIL PROTECTED] -or- visit http://groups.yahoo.com/group/svg-developers and click "edit my membership" ---- Yahoo! Groups Links <*> To visit your group on the web, go to: http://groups.yahoo.com/group/svg-developers/ <*> To unsubscribe from this group, send an email to: [EMAIL PROTECTED] <*> Your use of Yahoo! Groups is subject to: http://docs.yahoo.com/info/terms/

