[whatwg] Client-side includes proposal

Shannon Sun, 17 Aug 2008 23:34:33 -0700

The discussion on seamless iframes reminded me of something I've feltwas missing from HTML - an equivalent client functionality toserver-side includes as provided by PHP, Coldfusion and SSI. Inserver-side includes the document generated from parts appears as asingle entity rather than nested frames. In other words the source codeseen by the UA is indistiguishable from a non-frames HTML page in every way.

iframes are good for some things but they can be really messy whenyou're trying to build a single seamless page with shared styles andscripts from multiple files. It makes code reuse a real pain withoutrelying on a server-side dynamic language. The seamless iframes proposaldoesn't really address this well because you'll have more than one HTMLand BODY element causing strange behaviour or complex exceptions withseamless CSS.

The other issue with iframes is that for many page snippets the conceptof a title, meta tags and other headers don't make sense or simplyrepeat what was in the main document. More often than not the <head>section is meaningless yet must still be included for the frame to be"well-formed" or indexed by spiders.


The proposal would work like this:

--- Master Document ---
<html>
   <head>
      <title>Include Example</title>
      <meta name="includes" content="allow">
      <include src="global_head.ihtml">
   </head>
   <body>
         <include src="header.ihtml">
         <include src="http://www.pagelets.com/foo.ihtml";>
         <include src="footer.ihtml">
   </body>
</html>

--- Header.html ---
<div id="header">
   <h1>Header</h1>
</div>

With this proposal seamless CSS would work perfectly because childselectors won't see an intervening <body> element between sections.

Includes should allow any html segments except the initial <doctype> and<head> (for reasons explained below) and should allow start and end tagsto be split across includes. Only tags themselves may not contain aninclude (eg, <body <include src="body_attributes.ihtml">>). Manyserver-side includes allow this but it breaks the syntax of HTML/XML.

Includes must respect their own HTTP headers but inherit all otherproperties, styles and scripts from the surrounding page. If an includeis not set to expire immediately the browser should reuse it frommemory, otherwise it should retreive it once for each include. Eachbehaviour has its own merits depending on the application.

The standard would recommend (but not require) includes to use an .ihtmlextension. This will make it easier for authors, UAs and logging systemsto distinguish partial and complete pages (ie, not count includestowards page views in a stats package).

UAs or UA extensions like the Mozilla-based "Web Developer" should allowthe user to view the actual source and the "final" source (with allincludes substituted).

HTTP 1.1 pipelining should remove any performance concerns that includeswould have over traditional SSI since the retrieval process onlyrequires the sending of a few more bytes of request and responseheaders. In some ways it is actually better because UAs and proxies cancache the static includes and only fetch the dynamic parts.

The only real issue with this proposal is security for untrusted contentlike myspace profiles. Traditional sanitisers would be unfamiliar with<include> and may allow it through, providing a backdoor for maliciouscode. For this reason it is necessary that includes be opt-in. Thesimplest mechanism is to use a meta tag in the head of the master document:


<meta name="includes" content="allow">

I would consider any content system that allowed untrusted users towrite their own head tags to be incurable insecure; however thisrequirement should ensure that the majority do not suddenly experience awave of new exploits in HTML5 browsers.


Shannon

[whatwg] Client-side includes proposal

Reply via email to