[pubsubhubbub] r411 committed - Fork 0.3 spec into 0.4

codesite-noreply Fri, 28 Jun 2013 01:14:07 -0700

Revision: 411
Author:   [email protected]
Date:     Fri Jun 28 01:11:46 2013
Log:      Fork 0.3 spec into 0.4
http://code.google.com/p/pubsubhubbub/source/detail?r=411


Added:
 /trunk/pubsubhubbub-core-0.4.xml

=======================================
--- /dev/null
+++ /trunk/pubsubhubbub-core-0.4.xml    Fri Jun 28 01:11:46 2013
@@ -0,0 +1,1034 @@
+<?xml version="1.0" encoding="US-ASCII"?>

+<?xml-stylesheet type='text/xsl'href='http://xml.resource.org/authoring/rfc2629.xslt' ?>

+<!DOCTYPE rfc SYSTEM "rfc2629.dtd">
+<!--
+  For editting this, see:
+
+  http://xml.resource.org/public/rfc/html/rfc2629.html
+
+  In a nutshell,
+
+  $ sudo apt-get install xml2rfc
+  $ xml2rfc pubsubhubbub-core-0.1.xml pubsubhubbub-core-0.1.html
+
+-->
+<rfc category="info" docName="pubsubhubbub-core-0.3.xml" ipr="full3978">
+  <?rfc toc="yes" ?>
+
+  <?rfc tocdepth="2" ?>
+
+  <?rfc symrefs="yes" ?>
+
+  <?rfc sortrefs="yes"?>
+
+  <?rfc strict="no" ?>
+
+  <?rfc iprnotified="no" ?>
+
+  <?rfc private="Draft" ?>
+
+  <front>
+    <title>PubSubHubbub Core 0.3 -- Working Draft</title>
+

+ <author fullname="Brad Fitzpatrick" initials="B."surname="Fitzpatrick">

+      <organization>Google, Inc</organization>
+
+      <address>
+        <email>[email protected]</email>
+      </address>
+    </author>
+
+    <author fullname="Brett Slatkin" initials="B." surname="Slatkin">
+      <organization>Google, Inc</organization>
+
+      <address>
+        <email>[email protected]</email>
+      </address>
+    </author>
+
+    <author fullname="Martin Atkins" initials="M." surname="Atkins">
+      <organization>Six Apart Ltd.</organization>
+
+      <address>
+        <email>[email protected]</email>
+      </address>
+    </author>
+
+    <date day="8" month="February" year="2010" />
+
+    <abstract>

+ <t>An open, simple, web-scale pubsub protocol, along with an opensource+ reference implentation targetting Google App Engine. Notably,however,

+      nothing in the protocol is centralized, or Google- or App
+      Engine-specific. Anybody can play.</t>
+

+ <t>As opposed to more developed (and more complex) pubsub specs like<xref+ target="XEP-0060">Jabber Publish-Subscribe</xref> this spec's baseprofile+ (the barrier-to-entry to speak it) is dead simple. The fancy bitsrequired+ for high-volume publishers and subscribers are optional. The baseprofile

+      is HTTP-based, as opposed to XMPP (see more on this below).</t>
+

+ <t>To dramatically simplify this spec in several places where we hadto+ choose between supporting A or B, we took it upon ourselves tosay "only

+      A", rather than making it an implementation decision.</t>
+

+ <t>We offer this spec in hopes that it fills a need or at leastadvances+ the state of the discussion in the pubsub space. Polling sucks. Wethink

+      a decentralized pubsub layer is a fundamental, missing layer in the

+ Internet architecture today and its existence, more than justenabling

+      the obvious lower latency feed readers, would enable many cool
+      applications, most of which we can't even imagine. But we're looking
+      forward to decentralized social networking.</t>
+    </abstract>
+  </front>
+
+  <middle>
+    <section title="Notation and Conventions">

+ <t>The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALLNOT",

+      "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
+      document are to be interpreted as described in <xref
+      target="RFC2119"></xref>. Domain name examples use <xref
+      target="RFC2606"></xref>.</t>
+    </section>
+
+    <section title="Definitions">
+      <t><list style="hanging">

+ <t hangText="Topic:">An <xref target="RFC4287">Atom</xref> or<xref+ target="RSS20">RSS</xref> feed <xreftarget="RFC3986">URL</xref>. The+ unit to which one can subscribe to changes. This spec currentlyonly

+          addresses feed URLs that require no additional authorization
+          headers.</t>
+
+          <t hangText="Hub (&quot;the hub&quot;):">The server (<xref
+          target="RFC3986">URL</xref>) which implements both sides of this

+ protocol. We have implemented this and are running a server at<eref

+          target="http://pubsubhubbub.appspot.com/";>

+ http://pubsubhubbub.appspot.com</eref> that is, at least fornow, open+ to anybody for use, as either a publisher or subscriber. Any hubMAY

+          implement its own policies on who can use it.</t>
+

+ <t hangText="Publisher:">An owner of a topic. Notifies the hubwhen

+          the topic feed has been updated. It just notifies that it <spanx

+ style="emph">has</spanx> been updated, but not how. As in almostall+ pubsub systems, the publisher is unaware of the subscribers, ifany.

+          Other pubsub systems might call the publisher the "source".</t>
+

+ <t hangText="Subscriber:">An entity (person or program) thatwants

+          to be notified of changes on a topic. The subscriber must be
+          directly network-accessible and is identified by its Subscriber
+          Callback URL.</t>
+
+          <t hangText="Subscription:">A unique relation to a topic by a

+ subscriber that indicates it should receive updates for thattopic. A+ subscription's unique key is the tuple (Topic URL, SubscriberCallback+ URL). Subscriptions may (at the hub's decision) have expirationtimes

+          akin to DHCP leases which must be periodically renewed.</t>
+
+          <t hangText="Subscriber Callback URL:">The <xref

+ target="RFC3986">URL</xref> at which a subscriber wishes toreceive

+          notifications.</t>
+

+ <t hangText="Event:">An event that causes updates to multipletopics.

+          For each event that happens (e.g. "Brad posted to the Linux

+ Community."), multiple topics could be affected (e.g. "Bradposted."+ and "Linux community has new post"). Publisher events causetopics to+ be updated and the hub looks up all subscriptions for affectedtopics,

+          sending out notifications to subscribers.</t>
+
+          <t hangText="Notification:">A payload describing how a topic's

+ contents have changed. This difference (or "delta") is computedby the+ hub and sent to all subscribers. The format of the notificationwill+ be an Atom or RSS feed served by the publisher with only thoseentries+ present which are new or have changed. The notification can bethe

+          result of a publisher telling the hub of an update, or the hub

+ proactively polling a topic feed, perhaps for a subscribersubscribing+ to a topic that's not pubsub-aware. Note also that anotification to a+ subscriber can be a payload consisting of updates for multipletopics.+ Hubs MAY choose to send multi-topic notifications as anoptimization+ for heavy subscribers, but subscribers MUST understand them. See<xref

+          target="contentdistribution" /> for format details.</t>
+        </list></t>
+    </section>
+
+    <section title="High-level protocol flow">
+      <t>(This section is non-normative.)</t>
+
+      <t><list style="symbols">

+ <t>Publishers POST a ping to their hub(s) URLs when theirtopic(s)

+          change.</t>
+
+          <t>Subscribers POST to one or more of the advertised hubs for a
+          topic they're interested in. Alternatively, some hubs may offer

+ auto-polling capability, to let {their,any} subscriberssubscribe to

+          topics which don't advertise a hub.</t>
+
+          <t>The hub caches minimal metadata (id, data, entry digest) about

+ each topic's previous state. When the hub re-fetches a topicfeed (on+ its own initiative or as a result of a publisher's ping) andfinds a+ delta, it enqueues a notification to all registeredsubscribers.</t>

+        </list></t>
+    </section>
+
+    <section title="Atom Details">
+      <t>Notification and source formats will be <xref
+      target="RFC4287">Atom</xref> or <xref target="RSS20">RSS</xref>. The

+ Publisher makes the decision as to include full body, truncatedbody, or

+      meta data of most recent event(s). One of:</t>
+
+      <t><list style="symbols">
+          <t>URL + metadata</t>
+
+          <t>URL + metadata + truncated</t>
+
+          <t>URL + metadata + full</t>
+        </list></t>
+

+ <t>The trade-off between including all content in outgoingnotifications

+      or having the thundering herd (by clients who fetch the <spanx

+ style="verb">//atom:feed/entry/link</spanx> in response to anotification)+ is up to the publisher. Entries of the most recent events (forrecipient+ to know whether or not they'd missed any recent items-- like TCPSACK) MAY+ be provided as context. Some examples of Atom feed entriesfollow.</t>

+
+      <figure>
+        <artwork><![CDATA[<?xml version="1.0"?>
+<atom:feed>
+  <!-- Normally here would be source, title, author, id, etc ... -->
+
+  <link rel="hub" href="http://myhub.example.com/endpoint"; />
+  <link rel="self" href="http://publisher.example.com/happycats.xml"; />
+  <updated>2008-08-11T02:15:01Z</updated>
+
+  <!-- Example of a full entry. -->
+  <entry>
+    <title>Heathcliff</title>
+    <link href="http://publisher.example.com/happycat25.xml"; />
+    <id>http://publisher.example.com/happycat25.xml</id>
+    <updated>2008-08-11T02:15:01Z</updated>
+    <content>
+      What a happy cat. Full content goes here.
+    </content>
+  </entry>
+
+  <!-- Example of an entity that isn't full/is truncated. This is implied

+ by the lack of a <content> element and a <summary> element instead.-->

+  <entry >
+    <title>Heathcliff</title>
+    <link href="http://publisher.example.com/happycat25.xml"; />
+    <id>http://publisher.example.com/happycat25.xml</id>
+    <updated>2008-08-11T02:15:01Z</updated>
+    <summary>
+      What a happy cat!
+    </summary>
+  </entry>
+
+  <!-- Meta-data only; implied by the lack of <content> and
+       <summary> elements. -->
+  <entry>
+    <title>Garfield</title>

+ <link rel="alternate"href="http://publisher.example.com/happycat24.xml"; />

+    <id>http://publisher.example.com/happycat25.xml</id>
+    <updated>2008-08-11T02:15:01Z</updated>
+  </entry>
+
+  <!-- Context entry that's meta-data only and not new. -->
+  <entry>
+    <title>Nermal</title>

+ <link rel="alternate"href="http://publisher.example.com/happycat23s.xml"; />

+    <id>http://publisher.example.com/happycat25.xml</id>
+    <updated>2008-07-10T12:28:13Z</updated>
+  </entry>
+
+</atom:feed>
+]]></artwork>
+      </figure>
+
+    </section>
+
+    <section anchor="discovery" title="Discovery">

+ <t>A potential subscriber initiates discovery by retrieving the feedto

+      which it wants to subscribe. A feed that acts as a topic as per this
+      specification MUST publish, as a child of <spanx
+      style="verb">//atom:feed</spanx> or <spanx
+      style="verb">//rss:rss/channel</spanx> , an <spanx
+      style="verb">atom:link</spanx> element whose <spanx
+      style="verb">rel</spanx> attribute has the value <spanx
+      style="verb">hub</spanx> and whose <spanx style="verb">href</spanx>
+      attribute contains the hub's endpoint URL. Feeds MAY contain multiple
+      <spanx style="verb">atom:link[@rel="hub"]</spanx> elements if the
+      publisher wishes to notify multiple hubs. When a potential subscriber

+ encounters one or more such links, that subscriber MAY subscribe tothe

+      feed using one or more hubs URLs as described in <xref
+      target="subscribing"></xref>.</t>
+
+      <t>Example:</t>
+
+      <figure>
+ <artwork><![CDATA[<?xml version="1.0"?>
+<atom:feed>
+  <!-- Normally here would be source, title, author, id, etc ... -->
+  <link rel="hub" href="https://myhub.example.com/endpoint"; />
+  <link rel="self" href="http://publisher.example.com/topic.xml"; />
+  ....
+  <entry>
+     ....
+  </entry>
+  <entry>
+     ....
+  </entry>
+</atom:feed>
+]]></artwork></figure>
+
+    <t>Hubs MUST use the same URL for both the publishing and subscribing

+ interfaces, which is why only a single <spanxstyle="verb">atom:link</spanx>

+    element is required to declare a hub. Publishers SHOULD use <xref
+    target="RFC2616">HTTPS</xref> in their hubs' discovery URLs. However,
+    subscribers that do not support <xref target="RFC2818">HTTPS</xref> MAY
+    try to fallback to <xref target="RFC2616">HTTP</xref>, which MAY work
+    depending on the hub's policy.</t>
+
+    </section>
+
+    <section anchor="subscribing" title="Subscribing and Unsubscribing">
+      <t>Subscribing to a topic URL consists of three parts that may occur
+      immediately in sequence or have a delay.</t>
+      <t><list style="symbols">
+          <t>Requesting a subscription using the hub</t>
+          <t>Confirming the subscription was actually desired</t>
+          <t>Periodically reconfirming the subscription is still active</t>
+        </list></t>
+

+ <t>Unsubscribing works in the same way, except with a singleparameter

+      changed to indicate the desire to unsubscribe.</t>
+
+      <section title="Subscriber Sends Subscription Request">
+        <t>Subscription is initiated by the subscriber making an <xref

+ target="RFC2616">HTTP</xref> POST request to the hub URL. Thisrequest

+        has a Content-Type of <spanx

+ style="verb">application/x-www-form-urlencoded</spanx> (describedin+ Section 17.13.4 of <xref target="W3C.REC-html401-19991224"/>) andthe

+        following parameters in its body:</t>
+
+        <t><list style="hanging">

+ <t hangText="hub.callback">REQUIRED. The subscriber's callbackURL where notifications should be delivered.</t>

+ <t hangText="hub.mode">REQUIRED. The literalstring "subscribe" or

+            "unsubscribe", depending on the goal of the request.</t>
+

+ <t hangText="hub.topic">REQUIRED. The topic URL that thesubscriber

+            wishes to subscribe to.</t>
+

+ <t hangText="hub.verify">REQUIRED. Keyword describingverification

+            modes supported by this subscriber, as described below. This

+ parameter may be repeated to indicate multiple supportedmodes.</t>

+
+            <t hangText="hub.lease_seconds">OPTIONAL. Number of seconds for

+ which the subscriber would like to have the subscriptionactive. If+ not present or an empty value, the subscription will bepermanent

+            (or active until <xref target="autorefresh">automatic
+            refreshing</xref> removes the subscription). Hubs MAY choose to

+ respect this value or not, depending on their own policies.This+ parameter MAY be present for unsubscription requests and MUSTbe

+            ignored by the hub in that case.</t>
+
+            <t hangText="hub.secret">OPTIONAL. A subscriber-provided secret
+            string that will be used to compute an HMAC digest for <xref

+ target="authednotify">authorized content distribution</xref>.If not

+            supplied, the HMAC digest will not be present for content

+ distribution requests. This parameter SHOULD only be specifiedwhen+ the request was made over <xref target="RFC2818">HTTPS</xref>.This

+            parameter MUST be less than 200 bytes in length.</t>
+
+            <t hangText="hub.verify_token">OPTIONAL. A subscriber-provided

+ opaque token that will be echoed back in the verificationrequest to+ assist the subscriber in identifying which subscriptionrequest is+ being verified. If this is not included, no token will beincluded

+            in the verification request.</t>
+
+          </list></t>
+
+        <t>The following keywords are supported for hub.verify:</t>
+
+        <t><list style="hanging">
+            <t hangText="sync">The subscriber supports synchronous

+ verification, where the verification request must occur beforethe

+            subscription request's HTTP response is returned.</t>
+
+            <t hangText="async">The subscriber supports asynchronous

+ verification, where the verification request may occur at alater

+            point after the subscription request has returned.</t>
+          </list></t>
+
+          <t>Where repeated keywords are used, their order indicates the

+ subscriber's order of preference. Subscribers MUST use at leastone of

+          the modes indicated in the list above, but MAY include additional

+ keywords defined by extension specifications. Hubs MUST ignoreverify

+          mode keywords that they do not understand.</t>
+
+          <t>Hubs MUST ignore additional request parameters they do not
+          understand.</t>
+

+ <t>Hubs MUST allow subscribers to re-request subscriptions thatare+ already activate. Each subsequent request to a hub to subscribeor

+          unsubscribe MUST override the previous subscription state for a

+ specific topic URL and callback URL combination once the actionis+ verified. Any failures to confirm the subscription action MUSTleave+ the subscription state unchanged. This is required sosubscribers can

+          renew their subscriptions before the lease seconds period is over
+          without any interruption.</t>
+
+        <section title="Subscription Parameter Details">
+

+ <t>The topic and callback URLs MUST NOT contain an anchorfragment.

+          These URLs MAY use <xref target="RFC2616">HTTP</xref> or <xref
+          target="RFC2818">HTTPS</xref> schemes. These URLs MAY have port

+ numbers specified; however, hubs MAY choose to disallow certainports+ based on their own policies (e.g., security) and return errorsfor+ these requests. The topic URL can otherwise be free-formfollowing+ <xref target="RFC3986">the URI spec</xref>. Hubs MUST alwaysdecode+ non-reserved characters for these URL parameters; see section2.4 on

+          <spanx style="emph">"When to Encode or Decode"</spanx> in <xref
+          target="RFC3986">the URI spec</xref>.</t>
+
+          <t>The callback URL MAY contain arbitrary query string parameters
+          (e.g., <spanx style="verb">?foo=bar&red=fish</spanx>). Hubs MUST
+          preserve the query string during subscription verification by
+          appending new parameters to the end of the list using the <spanx
+          style="verb">&</spanx> (ampersand) character to join. Existing

+ parameters with names that overlap with those used byverification+ requests will not be overwritten; Hubs MUST only appendverification+ parameters to the existing list, if any. For event notification,the+ callback URL will be POSTed to including any query-stringparameters+ in the URL portion of the request, not as POST bodyparameters.</t>

+ <t>Subscribers MAY choose to use <xreftarget="RFC2818">HTTPS</xref>

+          for their callback URLs if they care about the privacy of
+          notifications as they come over the wire from the Hub. The use of
+          mechanisms (such as XML signatures) to verify the integrity of

+ notifications coming from the original publisher is out of thescope

+          of this specification.</t>
+
+        </section>
+
+        <section title="Subscription Response Details">
+

+ <t>The hub MUST respond to a subscription request with an HTTP204 "No+ Content" response to indicate that the request was verified andthat+ the subscription is active. If the subscription has yet to beverified

+          (i.e., the hub is using asynchronous verification), the hub MUST
+          respond with a 202 "Accepted" code.</t>
+
+           <t>If a hub finds any errors in the subscription request, an

+ appropriate HTTP error response code (4xx or 5xx) MUST bereturned. In+ the event of an error, hubs SHOULD return a description of theerror+ in the response body as plain text. Hubs MAY decide to rejectsome+ callback URLs or topic URLs based on their own policies (e.g.,domain

+          authorization, topic URL port numbers).</t>
+
+           <t>In synchronous mode, the verification (<xref

+ target="verifysub"></xref>) MUST be completed before the hubreturns a+ response. In asynchronous mode, the verification MAY be deferreduntil+ a later time. This is useful to enable hubs to defer work; thiscould+ allow them to alleviate servers under heavy load or doverification

+          work in batches.</t>
+
+        </section>
+
+      </section>
+

+ <section anchor="verifysub" title="Hub Verifies Intent of theSubscriber">+ <t>In order to prevent an attacker from creating unwantedsubscriptions+ on behalf of a subscriber (or unsubscribing desired ones), a hubmust+ ensure that the subscriber did indeed send the subscriptionrequest.</t>

+
+        <t>The hub verifies a subscription request by sending an <xref

+ target="RFC2616">HTTP</xref> GET request to the subscriber'scallback+ URL as given in the subscription request. This request has thefollowing+ query string arguments appended (format described in Section17.13.4 of

+        <xref target="W3C.REC-html401-19991224"/>):</t>
+
+        <t><list style="hanging">

+ <t hangText="hub.mode">REQUIRED. The literalstring "subscribe" or+ "unsubscribe", which matches the original request to the hubfrom

+            the subscriber.</t>
+
+            <t hangText="hub.topic">REQUIRED. The topic URL given in the
+            corresponding subscription request.</t>
+

+ <t hangText="hub.challenge">REQUIRED. A hub-generated, randomstring

+            that MUST be echoed by the subscriber to verify the
+            subscription.</t>
+
+            <t hangText="hub.lease_seconds">REQUIRED/OPTIONAL. The

+ hub-determined number of seconds that the subscription willstay

+            active before expiring, measured from the time the verification

+ request was made from the hub to the subscriber. Hubs MUSTsupply

+            this parameter for subscription requests. This parameter MAY be

+ present for unsubscribe requests and MUST be ignored bysubscribers

+            during unsubscription.</t>
+

+ <t hangText="hub.verify_token">OPTIONAL. Thesubscriber-provided+ opaque token from the corresponding subscription request, ifone was

+            provided.</t>
+
+          </list></t>
+
+        <section title="Verification Details">
+

+ <t>The subscriber MUST confirm that the <spanxstyle="verb">hub.topic+ </spanx> and <spanx style="verb">hub.verify_token</spanx>correspond+ to a pending subscription or unsubscription that it wishes tocarry+ out. If so, the subscriber MUST respond with an HTTP success(2xx)

+          code with a response body equal to the <spanx

+ style="verb">hub.challenge</spanx> parameter. If the subscriberdoes+ not agree with the action, the subscriber MUST respond with a404 "Not

+          Found" response.</t>
+

+ <t>For synchronous verification, the hub MUST consider otherserver+ response codes (3xx, 4xx, 5xx) to mean that the verificationrequest+ has immediately failed and no retries should occur. If thesubscriber+ returns an HTTP success (2xx) but the content body does notmatch the+ <spanx style="verb">hub.challenge</spanx> parameter, the hubMUST also

+          consider verification to have failed.</t>
+

+ <t>For asynchronous verification, the hub MUST consider otherserver

+          response codes (3xx, 4xx, and 5xx) to mean that the subscription

+ action was <spanx style="emph">temporarily</spanx> not verified.If+ the subscriber returns an HTTP success (2xx) but the contentbody does+ not match the <spanx style="verb">hub.challenge</spanx>parameter, the+ hub MUST consider this to be a <spanxstyle="emph">temporary</spanx>

+          failure and retry. The hub SHOULD retry verification a reasonable
+          number of times over the course of a longer time period (e.g., 6
+          hours) until a definite acknowledgement (positive or negative) is

+ received. If a definite response still cannot be determinedafter this+ retry period, the subscription action verification MUST beabandoned,

+          leaving the previous subscription state.</t>
+

+ <t>Hubs MAY make the <spanxstyle="verb">hub.lease_seconds</spanx>

+          equal to the period the subscriber passed in their subscription

+ request but MAY change the value depending on the hub'spolicies. To+ sustain a temporary subscription, the subscriber MUST re-requestthe

+          subscription on the hub before <spanx
+          style="verb">hub.lease_seconds</spanx> seconds has elapsed. For
+          permanent subscriptions with no <spanx

+ style="verb">hub.lease_seconds</spanx> value specified, thebehavior

+          is different as described in the section on <xref

+ target="autorefresh">automatic subscriptionrefreshing</xref>.</t>

+
+        </section>
+
+      </section>
+

+ <section anchor="autorefresh" title="Automatic SubscriptionRefreshing">

+
+        <t>Before a subscription expires (i.e., before <spanx

+ style="verb">hub.lease_seconds</spanx> elapses), Hubs MUST recheckwith+ subscribers to see if a continued subscription is desired. Hubs dothis

+        by sending the subscriber a <xref target="verifysub">verification
+        request</xref> with <spanx style="verb">hub.mode</spanx> equal to

+ subscribe. This request MUST match the original verificationrequest

+        sent to the subscriber (but with a new <spanx
+        style="verb">hub.challenge</spanx>).</t>
+

+ <t>The response codes returned by the subscriber MUST beinterpreted the+ same way as during a subscriber-initiated verification flow.However,+ this refresh request MUST behave like an initial subscriptionrequest;

+        this means that if an auto-refresh response from the subscriber

+ constantly returns an error, the hub MUST give up on thesubscription

+        verification action altogether and remove the subscription.</t>
+
+        <t>In the case of permanent subscriptions (with no <spanx
+        style="verb">hub.lease_seconds</spanx> specified in the original
+        request), the <spanx style="verb">hub.lease_seconds</spanx> value

+ supplied by the hub in the verification request to the subscriberSHOULD+ represent how many seconds until the hub expects it will nextinitiate+ automatic subscription refreshing to ensure that the subscriber isstill+ interested in the topic. This behavior provides the best of bothworlds:

+        maximum simplicity of the subscriber through infinitely-long
+        subscriptions, but still garbage collectable subscriptions for hub
+        hygiene.</t>
+
+      </section>
+
+    </section>
+
+    <section anchor="publishing" title="Publishing">

+ <t>A publisher pings the hub with the topic URL(s) which have beenupdated+ and the hub schedules those topics to be fetched and delivered.Because+ it's just a ping to notify the hub of the topic URL (without apayload),

+      no authentication from the publisher is required.</t>
+
+      <section title="New Content Notification">
+        <t>When new content is added to a feed, a notification is sent to
+        the hub by the publisher. The hub MUST accept a POST request to
+        the hub URL containing the notification. This request MUST have a

+ Content-Type of <spanxstyle="verb">application/x-www-form-urlencoded

+        </spanx> (described in Section 17.13.4 of <xref

+ target="W3C.REC-html401-19991224"/>) and the following parametersin

+        its body:</t>
+
+        <t><list style="hanging">

+ <t hangText="hub.mode">REQUIRED. The literalstring "publish".</t>

+
+            <t hangText="hub.url">REQUIRED. The topic URL of the topic that

+ has been updated. This field may be repeated to indicatemultiple

+            topics that have been updated.</t>
+          </list></t>
+

+ <t>The new content notification is a signal to the hub that thereis new+ content available. The hub SHOULD arrange for a content fetchrequest+ (<xref target="contentfetch"></xref>) to be performed in the nearfuture+ to retrieve the new content. If the notification was acceptable,the hub

+        MUST return a 204 No Content response. If the notification is not
+        acceptable for some reason, the hub MUST return an appropriate HTTP

+ error response code (4xx and 5xx). Hubs MUST return a 204 NoContent

+        response even when they do not have any subscribers for all of the
+        specified topic URLs.</t>
+      </section>
+
+      <section anchor="contentfetch" title="Content Fetch">
+        <t>When the hub wishes to retrieve new content for a topic, the hub

+ sends an <xref target="RFC2616">HTTP</xref> GET request to thetopic

+        URL. The hub MUST follow HTTP redirects. The Hub SHOULD use best
+        practices for caching headers in its requests (e.g., <spanx
+        style="verb">If-None-Match</spanx>, <spanx
+        style="verb">If-Modified-Since</spanx>).</t>
+
+        <t>The request SHOULD include the header field <spanx
+        style="verb">User-Agent</spanx> in the form expected by <xref

+ target="RFC2616">HTTP</xref>. The header MAY have one or moreadditional+ suffixes like <spanx style="verb">(example.com; 20subscribers)</spanx>

+        to indicate the number of subscriptions the hub has active for this
+        topic.</t>
+
+        <t>If present, the first suffix MUST indicate the total number of

+ subscriptions the hub has aggregated across all subscriber domainsby

+        means of the <spanx style="verb">X-Hub-On-Behalf-Of</spanx> header

+ (<xref target="contentdistribution"></xref>). Any additionalsuffixes+ indicate a breakdown of subscriber counts across subscriberdomains.

+        This allows content publishers to distinguish the source of their

+ subscriber counts and mitigate subscriber count spam. An exampleheader

+        could be (ignoring line-wrapping):</t>
+
+        <figure>
+          <artwork>
+User-Agent: MyHub (+http://hub.example.com; 26 subscribers)
+    (sub.example.com; 4 subscribers)
+    (other-sub.example.com; 22 subscribers)
+          </artwork>
+        </figure>
+

+ <t>If, after a content fetch, the hub determines that the topicfeed+ content has changed, the hub MUST send information about thechanges to

+        each of the subscribers to the topic (<xref
+        target="contentdistribution"></xref>). Hubs MUST consider new feed

+ entries, updated entries, or changes to the surrounding feeddocument as

+        significant content changes that require content distribution.</t>
+
+      </section>
+
+      <section anchor="contentdistribution" title="Content Distribution">
+
+        <t>A content distribution request is an <xref

+ target="RFC2616">HTTP</xref> POST request from hub to thesubscriber's+ callback URL with the list of new and changed entries. Thisrequest MUST

+        have a <spanx style="verb">Content-Type</spanx> of <spanx

+ style="verb">application/atom+xml</spanx> when the request body isan

+        <xref target="RFC4287">Atom</xref> feed document, or a <spanx
+        style="verb">Content-Type</spanx> of <spanx

+ style="verb">application/rss+xml</spanx> when the request body isan+ <xref target="RSS20">RSS</xref> feed document. The behavior forother+ content types is not yet defined. Hubs MAY transform the contenttype

+        and request body as desired (e.g., for language translation).</t>
+

+ <t>If the document represents a single feed being replicated forthe+ subscriber, then the feed-level elements SHOULD be preserved asidefrom

+        the <spanx style="verb">atom:entry</spanx> or <spanx
+        style="verb">rss:item</spanx> elements. However, the <spanx

+ style="verb">atom:id</spanx> element MUST be reproduced exactly.The

+        other <spanx style="verb">atom:updated</spanx> and <spanx
+        style="verb">atom:title</spanx> elements required by the Atom
+        specification SHOULD be present. Each <spanx

+ style="verb">atom:entry</spanx> or <spanxstyle="verb">rss:item</spanx>+ element in the feed contains the content from an entry in thesingle+ topic that the subscriber has an active subscription for.Essentially,

+        in the single feed case the subscriber will receive a feed document
+        that looks like the original but with old content removed.</t>
+

+ <t>The successful response from the subscriber's callback URL MUSTbe an

+        HTTP success (2xx) code. The hub MUST consider all other subscriber

+ response codes as failures; that means subscribers MUST not useHTTP

+        redirects for moving subscriptions. The response body from the

+ subscriber MUST be ignored by the hub. Hubs SHOULD retrynotifications

+        repeatedly until successful (up to some reasonable maximum over a

+ reasonable time period). Subscribers SHOULD respond tonotifications as+ quickly as possible; their success response code SHOULD onlyindicate

+        receipt of the message, not acknowledgment that it was successfully
+        processed by the subscriber.</t>
+
+        <t>The subscriber's callback response MAY include the header field

+ <spanx style="verb">X-Hub-On-Behalf-Of</spanx> with an integervalue,+ possibly approximate, representing the number of subscribers onbehalf

+        of which this feed notification was delivered. This value SHOULD be

+ aggregated by the hub across all subscribers and used to providethe+ subscriber counts in the <spanx style="verb">User-Agent</spanx>header+ field sent to publishers (<xref target="contentfetch"></xref>).Hubs MAY+ ignore or respect <spanx style="verb">X-Hub-On-Behalf-Of</spanx>values

+        from subscribers depending on their own policies (i.e., to prevent
+        spam).</t>
+
+      </section>
+

+ <section anchor="authednotify" title="Authenticated ContentDistribution">

+
+        <t>If the subscriber supplied a value for <spanx

+ style="verb">hub.secret</spanx> in their subscription request, thehub

+        MUST generate an HMAC signature of the payload and include that

+ signature in the request headers of the content distributionrequest.+ The <spanx style="verb">X-Hub-Signature</spanx> header's valueMUST be

+        in the form <spanx style="verb">sha1=signature</spanx> where <spanx

+ style="verb">signature</spanx> is a 40-byte, hexadecimalrepresentation+ of a <xref target="RFC3174">SHA1 signature</xref>. The signatureMUST be+ computed using the <xref target="RFC2104">HMAC algorithm</xref>with the+ request body as the data and the <spanxstyle="verb">hub.secret</spanx>

+        as the key.</t>
+
+        <t>When subscribers receive a content distribution request with the
+        <spanx style="verb">X-Hub-Signature</spanx> header specified, they

+ SHOULD recompute the SHA1 signature with the shared secret usingthe+ same method as the hub. If the signature does not match,subscribers+ MUST still return a 2xx success response to acknowledge receipt,but+ locally ignore the message as invalid. Using this technique alongwith+ HTTPS for subscription requests enables simple subscribers toreceive

+        authenticated content delivery from hubs without the need for
+        subscribers to run an HTTPS server.</t>
+
+      </section>
+
+      <section anchor="aggregatedistribution"
+               title="Aggregated Content Distribution">
+
+        <t>For Atom feeds only. Pending further review.</t>
+
+        <t>When a subscriber indicates the same callback URL is used for
+        multiple subscriptions, hubs MAY choose to combine content delivery

+ requests into a single payload containing an aggregated set offeeds.

+        This bulk delivery results in fewer requests and more efficient
+        distribution. If the subscriber indicated a <spanx
+        style="verb">hub.secret</spanx> value for these overlapping

+ subscriptions, the secret MUST also be the same for allsubscriptions.

+        This allows the hub to generate a single <spanx

+ style="verb">X-Hub-Signature</spanx> header to sign the entirepayload.

+        Hubs MUST return an error response (4xx, 5xx) for subscription

+ requests with overlapping callback URLs and different secretvalues.

+        </t>
+

+ <t>With an aggregated set of feeds, the hub SHOULD reproduce allof the+ elements from the source feed <spanx style="emph">inside</spanx>the

+        corresponding <spanx style="verb">atom:entry</spanx> in the content

+ distribution request by using an <spanxstyle="verb">atom:source</spanx>+ element. However, the <spanx style="verb">atom:id</spanx> valueMUST be+ reproduced exactly within the source element. If the source entrydoes+ not have an <spanx style="verb">atom:source</spanx> element, thehub

+        MUST create an <spanx style="verb">atom:source</spanx> element
+        containing the <spanx style="verb">atom:id</spanx> element. The hub

+ SHOULD also include the <spanx style="verb">atom:title</spanx>element

+        and an <spanx style="verb">atom:link</spanx> element with <spanx
+        style="verb">rel="self"</spanx> values that are functionally

+ equivalent to the corresponding elements in the original topicfeed.</t>

+
+        <t>Example aggregated feed:</t>
+
+      <figure>
+        <artwork><![CDATA[<?xml version="1.0"?>
+<atom:feed>
+  <title>Aggregated feed</title>
+  <updated>2008-08-11T02:17:44Z</updated>
+  <id>http://myhub.example.com/aggregated?1232427842-39823</id>
+
+  <entry>
+    <source>
+      <id>http://www.example.com/foo</id>
+      <link rel="self" href="http://publisher.example.com/foo.xml"; />
+      <author>
+        <name>Mr. Bar</name>
+      </author>
+    </source>
+    <title>Testing Foo</title>
+    <link href="http://publisher.example.com/foo24.xml"; />
+    <id>http://publisher.example.com/foo24.xml</id>
+    <updated>2008-08-11T02:15:01Z</updated>
+    <content>
+      This is some content from the user named foo.
+    </content>
+  </entry>
+
+  <entry>
+    <source>
+      <id>http://www.example.com/bar</id>
+      <link rel="self" href="http://publisher.example.com/bar.xml"; />
+      <author>
+        <name>Mr. Bar</name>
+      </author>
+    </source>
+    <title>Testing Bar</title>
+    <link href="http://publisher.example.com/bar18.xml"; />
+    <id>http://publisher.example.com/bar18.xml</id>
+    <updated>2008-08-11T02:17:44Z</updated>
+    <content>
+      Some data from the user named bar.
+    </content>
+  </entry>
+
+</atom:feed>
+]]></artwork>
+      </figure>
+
+      </section>
+    </section>
+
+    <section anchor="bestpractices" title="Best Practices">
+      <t>(This section is non-normative.)</t>
+      <section anchor="hubbestpractices" title="For Hubs">
+        <t>The vast majority of feeds on the web have multiple <spanx

+ style="emph">variants</spanx> that contain essentially the samecontent

+        but appear on different URLs. A common set of variants is <spanx
+        style="verb">http://example.com/feed?format=atom</spanx> and <spanx

+ style="verb">http://example.com/feed?format=rss</spanx>. Inpractice,

+        these URLs also fail to set their <spanx
+        style="verb">//atom:feed/link[@rel="self"]</spanx> values properly,

+ meaning it's difficult for subscribers to discover which feed URLthey

+        truly wanted. Making matters worse, feeds often have redirects

+ (temporary or permanent) to new hosting locations, analyticsproviders,

+        or other feed-processors. To solve this, it's important for Hub

+ implementations to determine feed URL equivalences usingheuristics.+ Examples: follow all feed URLs redirects and see if they end up atthe+ same location; use overlaps in feed HTML alternate links; use the<spanx+ style="verb">atom:id</spanx> value across all domains. Thisspecific

+        problem is considered out of scope of this specification but this

+ section is meant as a reminder to implementors that feed URLaliasing is

+        an important issue that hubs should address instead of putting
+        the burden on publishers and subscribers.</t>
+      </section>
+
+      <section anchor="subbestpractices" title="For Subscribers">
+        <t>The <spanx style="verb">hub.verify_token</spanx> parameter in

+ subscription requests enables subscribers to verify the identityand+ intent of the hub making the verification request. Subscribersshould

+        use the token to retrieve internal state to ensure the subscription
+        request outcome is what they intended.</t>
+      </section>
+    </section>
+  </middle>
+
+  <back>
+    <references title="References">
+
+      <reference anchor='RFC2104'>
+        <front>

+ <title abbrev='HMAC'>HMAC: Keyed-Hashing for MessageAuthentication</title>+ <author initials='H.' surname='Krawczyk' fullname='HugoKrawczyk'>

+            <organization>IBM, T.J. Watson Research Center</organization>
+          </author>
+          <author initials='M.' surname='Bellare' fullname='Mihir Bellare'>

+ <organization>University of California at San Diego, Dept ofComputer Science and Engineering</organization>

+          </author>
+          <author initials='R.' surname='Canetti' fullname='Ran Canetti'>
+            <organization>IBM T.J. Watson Research Center</organization>
+          </author>
+        </front>
+        <seriesInfo name='RFC' value='2104' />
+      </reference>
+
+      <reference anchor="RFC2606">
+        <front>
+          <title>Reserved Top Level DNS Names</title>
+
+          <author fullname="D. Eastlake" initials="D.E" surname="Eastlake">
+            <organization />
+          </author>
+
+          <author fullname="A. Panitz" initials="A.P" surname="Panitz">
+            <organization />
+          </author>
+        </front>
+
+        <seriesInfo name="RFC" value="2606" />
+      </reference>
+
+      <reference anchor="RFC2119">
+        <front>
+          <title>Key words for use in RFCs to Indicate Requirement
+          Levels</title>
+

+ <author fullname="Scott Bradner" initials="B.S"surname="Bradner">

+            <organization>Alis Technologies</organization>
+          </author>
+        </front>
+
+        <seriesInfo name="RFC" value="2119" />
+      </reference>
+
+      <reference anchor="RFC2616">
+        <front>
+          <title>Hypertext Transfer Protocol -- HTTP/1.1</title>
+
+          <author fullname="R. Fielding" initials="R.F" surname="Fielding">
+            <organization>UC Irvine</organization>
+          </author>
+
+          <author fullname="J. Gettys" initials="J.G" surname="Gettys">
+            <organization>Compaq/W3C</organization>
+          </author>
+
+          <author fullname="J. Mogul" initials="J.M" surname="Mogul">
+            <organization>Compaq</organization>
+          </author>
+
+          <author fullname="H. Frystyk" initials="H.F" surname="Frystyk">
+            <organization>W3C/MIT</organization>
+          </author>
+
+          <author fullname="L. Masinter" initials="L.M" surname="Masinter">
+            <organization>Xerox</organization>
+          </author>
+
+          <author fullname="P. Leach" initials="P.L" surname="Leach">
+            <organization>Microsoft</organization>
+          </author>
+
+          <author fullname="T. Berners-Lee" initials="T.L"
+                  surname="Berners-Lee">
+            <organization>W3C/MIT</organization>
+          </author>
+        </front>
+
+        <seriesInfo name="RFC" value="2616" />
+      </reference>
+
+      <reference anchor='RFC2818'>
+        <front>
+        <title>HTTP Over TLS</title>
+        <author initials='E.' surname='Rescorla' fullname='E. Rescorla'>
+        <organization /></author>
+        <date year='2000' month='May' />
+        <abstract>

+ <t>This memo describes how to use Transport Layer Security (TLS)to secure Hypertext Transfer Protocol (HTTP) connections over theInternet. This memo provides information for the Internetcommunity.</t></abstract></front>

+
+        <seriesInfo name='RFC' value='2818' />

+ <format type='TXT' octets='15170'target='ftp://ftp.isi.edu/in-notes/rfc2818.txt' />

+      </reference>
+
+      <reference anchor='RFC3174'>
+        <front>
+        <title>US Secure Hash Algorithm 1 (SHA1)</title>
+        <author initials='D.' surname='Eastlake' fullname='D. Eastlake'>
+        <organization /></author>
+        <author initials='P.' surname='Jones' fullname='P. Jones'>
+        <organization /></author>
+        <date year='2001' month='September' />
+        <abstract>

+ <t>The purpose of this document is to make the SHA-1 (Secure HashAlgorithm 1) hash algorithm conveniently available to the Internetcommunity. This memo provides information for the Internetcommunity.</t></abstract></front>

+
+        <seriesInfo name='RFC' value='3174' />
+

+ <format type='TXT' octets='35525'target='ftp://ftp.isi.edu/in-notes/rfc3174.txt' />

+      </reference>
+
+      <reference anchor="RFC3986">
+        <front>
+          <title>Uniform Resource Identifiers (URI): Generic Syntax</title>
+
+          <author fullname="T. Berners-Lee" initials="T.L"
+                  surname="Berners-Lee">
+            <organization />
+          </author>
+        </front>
+
+        <seriesInfo name="RFC" value="3986" />
+      </reference>
+
+      <reference anchor='RFC4287'>
+        <front>
+          <title abbrev='Atom Format'>The Atom Syndication Format</title>
+

+ <author initials='M.' surname='Nottingham' fullname='MarkNottingham' role='editor'>

+            <organization />
+          </author>
+

+ <author initials='R.' surname='Sayre' fullname='Robert Sayre'role='editor'>

+            <organization />
+          </author>
+        </front>
+
+        <seriesInfo name='RFC' value='4287' />

+ <format type='HTML' octets='150786'target='http://xml.resource.org/public/rfc/html/rfc4287.html' />

+      </reference>
+
+      <reference anchor='W3C.REC-html401-19991224'
+                 target='http://www.w3.org/TR/1999/REC-html401-19991224'>
+        <front>
+          <title>HTML 4.01 Specification</title>
+
+          <author initials='D.' surname='Raggett' fullname='David Raggett'>
+            <organization />
+          </author>
+
+          <author initials='A.' surname='Hors' fullname='Arnaud Le Hors'>
+            <organization />
+          </author>
+
+          <author initials='I.' surname='Jacobs' fullname='Ian Jacobs'>
+            <organization />
+          </author>
+          <date month='December' day='24' year='1999' />
+        </front>
+

+ <seriesInfo name='World Wide Web Consortium Recommendation'value='REC-html401-19991224' />+ <format type='HTML'target='http://www.w3.org/TR/1999/REC-html401-19991224' />

+      </reference>
+
+      <reference anchor="XEP-0060">
+        <front>
+          <title>Publish-Subscribe</title>
+          <author initials="P." surname="Millard" fullname="Peter Millard">
+            <organization/>
+          </author>

+ <author initials="P." surname="Saint-Andre" fullname="PeterSaint-Andre">

+            <organization/>
***The diff for this file has been truncated for email.***

--

---You received this message because you are subscribed to the Google Groups "Pubsubhubbub" group.

To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.

[pubsubhubbub] r411 committed - Fork 0.3 spec into 0.4

Reply via email to