Re: [webkit-dev] Writing a new XML parser with no external libraries

Ryosuke Niwa Wed, 29 Jun 2011 08:33:32 -0700

On Wed, Jun 29, 2011 at 6:55 AM, Alex Milowski <[email protected]> wrote:


> On Tue, Jun 28, 2011 at 6:50 PM, Eric Seidel <[email protected]> wrote:
> >
> > I'm in general in favor of this effort (having worked extensively on
> > the existing XML parsers).
> >
> > But I would caution you that xml is a ridiculously tiny fraction of
> > the web.  And it may not be worth the engineering effort to make a
> > better parser.
> >
> > http://www.google.com/search?q=filetype:html = 25,270,000,000
> > http://www.google.com/search?q=filetype:xml = 71,000,000
> >
>
> I can't let this one just pass by! ;)
>
> First, filetype is by extension and not media type [1].  As such, that
> is an incorrect accounting of the amount of XML on the web.  Secondly,
> just using file extensions, you'd have to enumerate and sum all the
> extensions used by all XML media types (e.g. .xhtml, .svg, etc.).
> Third, there is plenty of content on the web that Google does not
> crawl (the "dark web") where there are petabytes of XML waiting for
> browsers to do something with it (e.g. astronomical data cone search
> services).
>

+1.  Also, a lot of .asp, .php, etc... files serve XHTML contents.

- Ryosuke

_______________________________________________
webkit-dev mailing list
[email protected]
http://lists.webkit.org/mailman/listinfo.cgi/webkit-dev

Re: [webkit-dev] Writing a new XML parser with no external libraries

Reply via email to