Re: Additional 'namespace' attribute on content element?

2007-01-05 Thread Henri Sivonen


On Jan 5, 2007, at 09:47, Martin Duerst wrote:


So I think that Atom is a bit out on it's egde if it says
'if you don't have a +xml Mime Type, you're not XML'.


No, Atom says that if you don't use application/xml or an +xml type,  
you don't get to use an embedded subtree and your format gets the  
kind of treatment that non-XML formats get.


Atom is quite reasonable here. An XML vocabulary that doesn't have a  
MIME type that follows the convention *and* doesn't have a namespace  
is itself a badly-behaved XML vocabulary.


--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/




Re: base within HTML content

2007-01-02 Thread Henri Sivonen


On Jan 2, 2007, at 12:40, A. Pagaltzis wrote:


* Henri Sivonen [EMAIL PROTECTED] [2007-01-02 01:35]:

I hadn't thought that Atom was supposed to use innerHTML
parsing. I'd have said that you prepend
!DOCTYPE htmltitle/titlediv to what travels in the
feed and append /div to it,  parse the resulting string and
grab the first div in the document order.


That will lead to silent data loss if the content is malformed
such that it contains an extraneous `/div`.


Good point.

Prepending !DOCTYPE htmltitle/title and grabbing the contents  
of body would work better.


--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/




Re: base within HTML content

2007-01-01 Thread Henri Sivonen


On Jan 1, 2007, at 22:46, Anne van Kesteren wrote:

I suppose you could raise this on the WHATWG list. Asking what  
happens if you set innerHTML of a div where the setted value has  
both a base and an a for instance.


Interesting. I hadn't thought that Atom was supposed to use innerHTML  
parsing. I'd have said that you prepend !DOCTYPE htmltitle/ 
titlediv to what travels in the feed and append /div to it,  
parse the resulting string and grab the first div in the document order.


--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/




Re: Atom Entry docs

2006-12-14 Thread Henri Sivonen


On Dec 13, 2006, at 17:51, Mark Baker wrote:


But
given that an alternative exists which shouldn't break those servers,
why not use it when there's no apparent downside?


The downside is that implementations that (quite reasonably) assume  
that application/atom+xml == feed are also reasonable when they  
ignore unknown media type parameters.


Given the options of a new type or a new parameter, I am +1 on the  
new type. (Although in general, I don't like the proliferation of  
application/*+xml types, because apps need to do root sniffing for  
application/xml anyway.)


--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/




Re: PaceAutoDiscoveryDraftIsPointless

2006-11-28 Thread Henri Sivonen


On Nov 28, 2006, at 22:11, Edward O'Connor wrote:

WHAT WG seems like a neutral ground, syndication-format wise, so  
perhaps they're best positioned to spec feed autodiscovery in a way  
that makes everybody happy.


+1 for leaving speccing this to the WHATWG.

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/




Re: PaceResurrectAutodiscovery

2006-11-24 Thread Henri Sivonen


On Nov 24, 2006, at 10:28, Thomas Broyer wrote:


My main problem with autodiscovery is this use case: the following
links on an entry page in a blog-like scenario, where comments on
the entry are shown at the bottom of the page:
link rel=alternate type=application/atom+xml
   title=A standalone Atom Entry alternate representation, might
even be updatable using HTTP PUT /

...

1. the first one should not be seen as a feed link;


That ship sailed long ago. For backwards compatibility, UAs will  
treat links like the one quoted above as feed autodiscovery links-- 
not as standalone APP editing links. It is too late to apply elegance  
retroactively.


--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/




Re: PaceResurrectAutodiscovery

2006-11-24 Thread Henri Sivonen


On Nov 23, 2006, at 22:42, Henry Story wrote:

This is very nice, in that it opens up the possibility of placing  
good RDF descriptions of these links at the http://www.iana.org/ 
assignments/relation/,


How could new link relations be described in RDF to such a degree  
that the description would actually be useful for software processing  
but simple enough to actually be implemented? Is it realistic to have  
UAs whacking IANA server effectively performing a DDoS on it?


as well as making the link relation very extensible (people who  
want to try out new link relations, can just use their own,  
unambiguous url).


How are full URIs distinguished from strings that need to be appended  
to http://www.iana.org/assignments/relation/; to obtain the full  
URI? Are UAs supposed to look for a colon as with XForms input  
methods? Where is this specified?


In practice people seem to want to use one-word link relations even  
when they are coming up with their own.


I would recommend you adopt that too. Perhaps you can even adopt  
the iana name space. If we could get them to put the appropriate  
rdf document at that location, people who created/coined new link  
relations could describe these relations as being superproperties  
or subroperties of relations the browser already knows, which would  
allow the browser to partially interpret those.


Well, RDF is not viewed that favorably by the WHATWG. Also, the  
barrier for entry for the IANA registration process is likely too  
high. (It certainly is for MIME types.) As for using the same  
namespace, the HTML5 definitions for the link types don't necessarily  
match the Atom definitions.


Recently, a WHATWG-managed registry for HTML5 rel values has been  
discussed informally. The idea was that conformance checkers could  
consult an online registry instead of only allowing a fixed list of  
values or allowing everything. RDF is an overkill for this. Even XMDP  
isn't the simplest thing that could possibly work. The simplest thing  
that could possibly work is a GETtable text/plain; charset=utf-8  
resource at a well-known URI with one rel value per LF-separated line.


--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/




Re: Forward Compatibility

2006-11-19 Thread Henri Sivonen


On Nov 19, 2006, at 04:33, Tse Shing Chi ((Franklin/Whale)) wrote:


This means that XHTML 2 contents can be used as follows?

content type=xhtml
  div xmlns=http://www.w3.org/2002/06/xhtml2/;
!-- Contents XHTML 2.0 --
  /div
/content

content type=application/xhtml+xml
  div xmlns=http://www.w3.org/2002/06/xhtml2/;
!-- Contents XHTML 2.0 --
  /div
/content

By the way, are they the same?


They are not the same and neither is correct. The correct way would be:

content type=application/xml
  html xmlns=http://www.w3.org/2002/06/xhtml2/;
head
  titleThe title of the entry again/title
/head
body
  !-- Contents XHTML 2.0 --
/body
  /html
/content

But it isn't worthwhile to spend energy on this issue. Browser  
vendors have been ignoring XHTML 2.0 and now even the W3C itself is  
moving aside the group that has been working on the XHTML 2.0 spec.


--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/




Re: Forward Compatibility

2006-11-19 Thread Henri Sivonen


On Nov 19, 2006, at 12:10, Tse Shing Chi ((Franklin/Whale)) wrote:

First of all, the media type should still be application/xhtml 
+xml for XHTML 2.0, although application/xml or even text/html may  
be used.


RFC 3236 makes it pretty clear that it is for XHTML Family documents  
in the Modularization of XHTML sense. XHTML 2.0 documents are not  
XHTML Family documents. They are incompatible documents with a  
confusing name.


I don't think whole XHTML 2.0 document needs to be contained in the  
content or summary element, it is too complicated. Using the div  
element is still a suitable way (div is remained in XHTML 2.0  
without major changes, http://www.w3.org/TR/xhtml2/mod- 
structural.html#edef_structural_div) I think.


It is not the same div. It is a completely different div in another  
namespace. That the local names are identical does not make them the  
same.


The Atom spec provides shortcuts for plain text, HTML and XHTML 1.x.  
That's it. Calling an incompatible language XHTML doesn't help. As  
far as Atom is concerned, XHTML 2.0 documents get the same treatment  
as any other XML document that doesn't use the XHTML 1.x namespace.


(Yes, I am aware of http://lists.w3.org/Archives/Public/www-html/ 
2006Nov/0025 , but that resolution doesn't really make XHTML 2.0 any  
more likely to be implemented.)


For browsers' implementation, in fact, XHTML 1.1 is still not well- 
supported. CSS 2 is another example of poor implementation by  
browsers. I do think that Atom should not be easy to change or be  
updated very frequently. That's why Atom should leave some extents  
of forward compatibility.


If you start from XHTML 1.0 and go forward, you don't end up at XHTML  
2.0 but at XHTML5. Embedding XHTML5 in Atom will work out just fine.


Forward is here:
http://whatwg.org/specs/web-apps/current-work/

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/




Re: Forward Compatibility

2006-11-18 Thread Henri Sivonen


On Nov 18, 2006, at 22:06, Nikola Stojanovic wrote:

What would then be the purpose of 2005 in the atom namespace  
xmlns:atom=http://www.w3.org/2005/Atom;?


It indicates the year when the URI was minted. It is debatable  
whether it is a useful purpose, but that's the way the W3C mints URIs.


--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/




Re: http://www.intertwingly.net/wiki/pie/XhtmlContentDivConformanceTests

2006-06-28 Thread Henri Sivonen


On Jun 28, 2006, at 02:27, James M Snell wrote:


That is, what is
the exact behavior that a library must perform when a code library has
an API like, getContent on the content element.


One sane behaviour is to return an org.w3c.dom.DocumentFragment with  
the deep copies of the children of the namespace div with the  
xml:base and xml:lang context pushed down on each child.


That's a bit awkward, so I guess using a placeholder root element  
with the xml:base and xml:lang context would make sense, provided  
that the API doc says that the root is not part of the logical  
content. This could be emphasized by using a root in a private  
namespace instead of an XHTML div. (Just to be obnoxious enough to  
make sure users of the API take note. :-)


Or, alternatively, the API could construct a full XHTML  
nu.xom.Document or org.w3c.dom.Document and thereby unify the return  
value for type=application/xhtml+xml, type=text/html,  
type=xhtml and type=html. (Assuming, that is that the library  
runs TagSoup and automatically converts HTML to XHTML.) Actually, I  
think this would be the best way.


In any case, returning a String as the value of the content means  
that the library is not fully doing its job when the logical value is  
an XML document fragment.


--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/




Re: http://www.intertwingly.net/wiki/pie/XhtmlContentDivConformanceTests

2006-06-28 Thread Henri Sivonen


On Jun 28, 2006, at 23:53, James M Snell wrote:


or instance, if I have content xml:lang=endiv
xml:lang=fr.../div/content and I drop the div silently, then  
I've

got a problem.


Dropping the div shouldn't mean dropping the language and base URL  
context. You need to communicate those anyway in the case they are  
inherited from higher up in the document tree.


(When the script that generates my feed copies node from a document  
tree to another, it checks the inherited language of the node being  
copied. If it differs from the inherited language of the insertion  
target, the newly inserted copy gets an explicit xml:lang.)


--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/




Re: Datatype for IRIs in RELAX NG

2006-04-28 Thread Henri Sivonen


On Mar 21, 2006, at 11:29, Julian Reschke wrote:

maybe it's time that *some* specification adds new datatypes that  
do *exactly* what RFC3986 and RFC3987 ask for :-)


I have a draft spec at
http://hsivonen.iki.fi/html5-datatypes/

A snapshot of the reference implementation (which currently lacks the  
'language' datatype) is available for testing as part of my  
validation service:

http://hsivonen.iki.fi/validator/

The reference implementation just wraps the IRI library from Jena  
(pulled from CVS; the new version does not ship in a Jena release, yet).


Disclaimer: The datatype library has undergone *very* little testing.

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/




Re: Error in example ?

2006-04-04 Thread Henri Sivonen


On Apr 4, 2006, at 13:36, cedric wrote:


In 3.1.1.3 XHTML of
http://www.atomenabled.org/developers/syndication/atom-format- 
spec.php#atom.documents


There is example using summary but this is deprecated in atom 1.0  
I think.


It is not deprecated.
See: http://www.atomenabled.org/developers/syndication/atom-format- 
spec.php#element.summary


Actually, there is nothing deprecated in Atom 1.0. There are only  
things that are in the spec and things that are not in the spec.  
There is no Atom 0.3. :-) These are not the droids you are looking  
for. :-)


--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/




Datatype for IRIs in RELAX NG (was: Re: Atom syndication schema)

2006-03-19 Thread Henri Sivonen


(Discussion started on atom-syntax, but this is a more general RELAX  
NG issue, so cross-posting to rng-users.)


On Mar 19, 2006, at 09:33, Martin Duerst wrote:


At 18:49 06/03/17, Bjoern Hoehrmann wrote:

* Martin Duerst wrote:
When looking with a microscope, you will find some little
differences, because xs:anyURI was described before the IRI
spec (RFC 3987) was approved. These differences are:

1) xs:aryURI also allows spaces and a few other ASCII characters
that are not allowed in URIs nor in IRIs (but the IRI spec has
an escape hatch for such cases).
2) The IRI spec contains many more details than the xs:anyURI
description, in particular also some requirements re.
normalization. However, some of the requirements in this
area of the IRI spec may be lowered or removed in the future
because we have received feedback from implementers that
there are difficulties to implement these.

I agree with Martin that it would be incorrect to use xsd:anyURI  
here.


Sorry, but I never said that it would be incorrect to use
xsd:anyURI. I personally think that it should be okay to
use xsd:anyURI. The differences are microscopic, and they should
become even smaller, or hopefully go away completely, over time.


I need datatypes for IRIs in general (relative, absolute or just  
fragment identifiers) and for absolute IRIs (possibly with a fragment  
id) in a RELAX NG schema.


Is it really the best practice to use xsd:anyURI and sweep the  
discrepancies under the rug in the hope that future definitions of  
xsd:anyURI change the meaning of the schema later? Can xsd:anyURI be  
augmented with a regexp pattern to restrict spaces and a few other  
ASCII characters in such a way that the resulting datatype  
restriction matches the definition of IRI? Has anyone implemented a  
strictly correct IRI datatype in a Java datatype library (for Jing  
and MSV)?


--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/




Re: If you want Fat Pings just use Atom!

2005-08-23 Thread Henri Sivonen


On Aug 23, 2005, at 07:27, A. Pagaltzis wrote:


I still dislike the idea of a “virtual closing tag” – it ain’t
1995 anymore, after all.


You seem to be thinking that an XML parser needs to consider the whole 
document before reporting data to the app. This is not the case.


If you have a document that lacks the end tag, the fatal error is hit 
when the data stream ends. All the subtrees rooted at children of the 
document element will be just fine. Try it with any streaming SAX 
parser.


In this case, the data stream ends only if either party decides to 
close the connection. The parser will never know how long the document 
would have been. From the parsers point of view it looks like a broken 
stream while reading a finite doc. There's nothing in the XML spec 
requiring the app to throw away the data structures it has already 
built when the parser reports the error.


--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/



Re: spec bug: can we fix for draft-11?

2005-08-09 Thread Henri Sivonen


On Aug 4, 2005, at 21:55, Joe Gregorio wrote:


Some sections of this specification are illustrated with fragments of
a non-normative RELAX NG Compact schema [RELAX-NG]. However, the text
of this specification provides the definition of conformance. A
complete schema appears in Appendix B.

This is quoted directly from Section 1.3.

This whitespace issue is a good illustration of why the schema isn't
normative ;)


This thread makes me wonder if taking the schema seriously enough to 
actually encourage validation against the schema causes more harm than 
good. Perhaps I should remove the Atom schema from the presets of 
http://hsivonen.iki.fi/validator/ .



I would vote for leaving the text as is and having the validator give
errors on whitespace.


ITYM, the Feed Validator and not a RELAX NG validator.

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/



Re: Notes on the latest draft.

2005-07-22 Thread Henri Sivonen


On Jul 20, 2005, at 08:08, James Cerra wrote:


HTML has many entities predefined.  If you use HTML content, are those
entities allowed (after being escaped, of course)?  That would make it 
really

really hard to normalize to text or XML without doctype processing.


No. You are supposed to use a tag soup parser for HTML. Tag soup 
parsers don't process the DTD. They have a hard-coded table of 
character entities.



Also, how does Atom interact with HTML 5,


You put the stuff in there and in time clients will adapt?


I still say that version or doctype attributes should be allowed
to solve disambiguities and allow compatibility with future versions 
of those

specs.


I am strongly against catering for doctype sniffing on the XHTML side.


Public/System identifiers can still be used to identify type without
validating it


No, they cannot. See http://hsivonen.iki.fi/doctype/#xml

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/



Re: RNG validators capable of fully using the Atom schema?

2005-07-13 Thread Henri Sivonen


On Jul 12, 2005, at 19:16, Norman Walsh wrote:


I use Kohsuke's MSV. (msv.dev.java.net)


Cool. Thanks.

I had allowed relaxng.org tell me that MSV Supports XML syntax only.

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/



RNG validators capable of fully using the Atom schema?

2005-07-10 Thread Henri Sivonen


I'm wondering which validator was used for testing the Atom 
RNG+Schematron schema. Which validators support Compact Syntax with 
embedded Schematron? I am particularly interested in Java solutions.


(Specifically, I would like to fully support the Atom schema in 
http://hsivonen.iki.fi/validator/ )


--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/



Re: atom:type, xsl:output

2005-05-28 Thread Henri Sivonen


On May 28, 2005, at 05:28, James Cerra wrote:


Henri Sivonen,


Yes, but MSIE^H^H^H^Hsome xml processors (cough  cough) still
inappropriately
use comments for that purpose.


I am not familiar with that. What purpose exactly? Why should Atom
support it?


MSIE conditional comments.  See other person's reply.


I thought they were for text/html tag soup only. Anyway, I think Atom 
should not try to enable such bogosity in XML.


And yes: I know that Atom is not FTP or HTTP.  I'm looking at the Atom 
entry
uploader's POV rather than the downloader's perspective.  The Atom API 
is a way

to simplify managing web sites:


Is it really? Straight bytes over HTTP PUT seems a lot simpler to me 
than Base64 in an XML envelope over HTTP PUT.



Vanilla XML processors don't act on PIs. They report them to the
application--in this case the Atom processor.


So Atom Processors MUST be between any XML and PI processor stages?  
That
sucks!  It means just to display Atom, I can't use a generic XSLT 
processor

like Saxon or MSIE.


What PIs do they act on besides the style sheet PI, which must appear 
in the prolog (so conforming tools should not act on it if it appears 
within atom:content)?



Depending on an entity reference and not being able to accept the
straight replacement text is just wrong.


I agree.  I'm just bringing up possible incompatibilities for debate!


I don't think that's an incompatibility that deserves catering for.

That's a non-issue. You don't just throw away the DOCTYPE but parse 
the

original document with it and reserialize as a DOCTYPEless fragment.
You don't lose well-formedness or content. You only lose the shallow
attribute data typing provided by the DTD.


Well, that data could be valuable.


IDness is. For practical purposes, the other stuff is not. There are 
other solutions for IDness.


With DOCTYPE information associated with embedded content, it becomes 
possible to

transform entries into valid XML or HTML documents.


What do you need valid documents for if the validity is just an 
property internal to the document? That is, the document is valid 
according to its own rules as opposed to conforming to the rules 
required by some downstream software.



Some apps change their behavior with well-formed and valid modes.


That seems like a bad idea. Can you give examples?


DOCTYPE switching may be evil, but it is currently a necessary evil.


It is not a necessary evil for XML. I am strongly against any Atom 
feature motivated by or designed for enabling and/or encouraging 
doctype sniffing in XML.


Please see http://hsivonen.iki.fi/doctype/#xml

I see this as low risk (it doesn't break anything and could even be an 
OPTIONAL feature)


Optional features always have a cost. Also, they tend to become de 
facto mandatory for everyone or de facto useless for everyone.



Atom's main purpose is to facilitate software to software
communication. When interop or implementation ease and readability
conflict, readability should yield.


That's why I proposed they should be externally referenced (or 
base64ed), so
the Atom processor doesn't touch them!  This is a hint to authors on 
how to

avoid a common error.


Base64 XML in XML was rejected earlier in the working group process.


There is nothing intrinsically bad about DOCTYPE sections


Yes, there is. On the Web you cannot trust that the recipient is using
an XML processor that processes the DOCTYPE beyond checking the
internal subset for well-formedness. An optional feature that does not
degrade gracefully when not supported is bad. DOCTYPE is such a
feature. In the cases where the DOCTYPE can be gracefully ignored, the
DOCTYPE is pointless.


No, any well formed XML processor MUST expand internal entities and add
replaced attributes as well.  This is NOT OPTIONAL.


So is expat non-conforming?

Atom is not claiming that you could embed the literal source of any 
XML

doc. You can embed the stuff that canonicalization would preserve.


Yet you can do that with PNG, PDF, or other content via base 64 
encodings.  It

is silly to allow that for them and not XML.


Looks like straight HTTP without an Atom envelope in between suits your 
needs better.


--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/



Re: atom, xslt processors (Re: atom:type, xsl:output)

2005-05-28 Thread Henri Sivonen


On May 28, 2005, at 04:54, James Cerra wrote:


Aristotle, Henri Sivonen,


Yes, but any Atom document MUST be processable from just an XML 
processor, an

?xml-stylesheet? processor,


A conforming xml-stylesheet processor will only act on the PI if it 
appears in the prolog. For this reason, it would be possible to allow 
xml-stylesheet PIs in atom:content and spec that they are part of the 
content document, so that an XML document with the PI in its prolog 
could be constructed from the content.


(2) RSS and Atom documents are now processed with ?xml-stylesheet? 
PIs, and
there is no way to formally specify when those instructions should be 
executed

(except for specific instances that don't matter here).


Perhaps it follows that you should not use xml-stylesheet if you don't 
want it to be acted upon.


So any ?xml-stylesheet? nodes, and PI nodes in general, must be 
assumed to be

linked to the document as a whole;


Only if the PI appears in the prolog.


That's one reason the following is not true:


Vanilla XML processors don't act on PIs. They report them to the
application--in this case the Atom processor.


Instead, they could be reported to the PI processor before the atom 
processor.


Why do you think it is appropriate to have a random PI processor in 
between the XML processor and the Atom processor?


--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/



Re: Review of Atom 0.8 Spec against W3C QA Specification Guidelines

2005-05-25 Thread Henri Sivonen


On May 24, 2005, at 20:54, Karl Dubost wrote:


A feed validator doesn't have to use a schema, but it could use it for
a non-normative check (you may have problems - or even kicking in
more lengthy checking code only when the schema is violated). This is
up to the validator authors.


Validation is something very precise. It can be validated against a 
DTD, or against a Schema or another grammar language, etc. At least 
the Feed validator could become a Feed checker which develops a 
heuristic to check if the requirements of the specification are 
verified. :))) up to the validator authors :)


The Feed Validator has always been a Feed Checker technically. I 
complained about the naming terminology when the Feed Validator came 
out. I don't expect the name to change now.


Even with validators that perform a precise check using a schema, it is 
important to know what kind of check the validator performs. In the 
case of Atom, the schema that ships with the specification is a RELAX 
NG Compact Syntax schema with embedded Schematron. When used with a 
RELAX NG validator that does not support embedded Schematron (eg. 
http://hsivonen.iki.fi/validator/), the validator will precisely ignore 
the embedded Schematron.


A hand-crafted checker can potentially be more useful than a validator 
that is limited to what is checkable using a given schema language. 
Case point: DTDs and attribute data types.


--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/



Re: Review of Atom 0.8 Spec against W3C QA Specification Guidelines

2005-05-25 Thread Henri Sivonen


On May 24, 2005, at 23:44, Paul Hoffman wrote:

So why are we discussing where the Atom format document doesn't match 
up to W3C test specification guidelines?


Atom has been advertised as thoroughly specified before it is even 
known what exactly the upcoming RFC will specify. Assessing the level 
of thoroughness using an outside metric makes sense to me. Whether the 
WG wants to act on the assessment is another matter.


--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/



Re: atom:type, xsl:output

2005-05-25 Thread Henri Sivonen


On May 25, 2005, at 06:46, James Cerra wrote:

fter being processed by a XML processor, any entites should be 
dereferenced to

their text values and placed into the document tree.  So this:


atom:content type=application/xml
lt;?xml version=1.0?
lt;tag //atom:content


would become the text string:


atom:content type=application/xml
xml version=1.0?
tag //atom:content



No. It becomes a tree:
Element atom:content having attribute type=application/xml
   |
   \ Text: xml version=1.0? tag /

(Newlines omitted.)

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/



Re: atom:type, xsl:output

2005-05-25 Thread Henri Sivonen


On May 25, 2005, at 06:43, James Cerra wrote:


If the intended Atom content contains essential comments,


There should be no such thing as essential comments. The XML spec does 
not require XML processors to report comments to the app. Hence, 
comments are inappropriate for transferring essential data.



processing instructions,


These could be supported in embedded content if the Atom spec said PIs 
in atom:content belong in content and should not be acted upon by the 
Atom processor.



entities


Entities can be flattened.


DOCTYPE declaration(s)


If DOCTYPE is essential for the receiving app, you've got bigger 
problems than Atom. Hardwiring IDness based on namespaces is more 
practical than relying on DTD-based data typing.


--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/



Re: some small comments on 08

2005-05-24 Thread Henri Sivonen


On May 23, 2005, at 12:31, Julian Reschke wrote:

For the record: I am +1 on 
http://www.intertwingly.net/wiki/pie/PaceOptionalXhtmlDiv.


+1 from me too.

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/



Re: Change Control (was: extensions -- Atom NS and unprefixed attributes)

2005-05-18 Thread Henri Sivonen
On May 17, 2005, at 17:47, Antone Roundy wrote:
What possible advantage would there be to allowing just anyone to add 
elements to Atom's namespace, or any other namespace, for that matter? 
 I can't think of any.
If you add to an existing namespace, you don't need to add confusing 
namespace declarations.

In my opinion, alteration of a namespace by anyone other than the 
entity that created it, or someone authorized by its creator, would 
completely violate the nature of namespaces.
Sure. But it happens for practical reasons.
--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: PaceOtherVocabularies

2005-05-17 Thread Henri Sivonen
On May 17, 2005, at 01:35, Robert Sayre wrote:
Markup from other vocabularies (foreign markup) can be used in an 
Atom document,
but MUST be namespace-qualified and in a namespace other than Atom's.
Surely attributes on extension elements don't need to be ns-qualified?
--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: Atom 1.0?

2005-05-11 Thread Henri Sivonen

Marketing: Atom
I'm looking forward to an article by Mark Pilgrim about the 
incompatible versions of Atom deceitfully marketed as one thing. :-)

(Which is why I said +1 to Atom 1.0.)
--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: Atom 1.0?

2005-05-10 Thread Henri Sivonen
On May 10, 2005, at 05:29, Tim Bray wrote:
Atom 1.0
+1 for Atom 1.0 in order to distinguish from 0.3.
--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: Last Call: 'The Atom Syndication Format' to Proposed Standard

2005-05-08 Thread Henri Sivonen
On May 8, 2005, at 06:30, Walter Underwood wrote:
White space is not particularly meaningful in some of these languages,
so we cannot expect them to suddenly pay attention to that just so
they can use Atom.
Why not? We expect them not no insert other random characters there. 
What do the same producers do with XHTML? Opera 7.53 and Safari 1.3 
render a space between the second and third Kanji in
http://hsivonen.iki.fi/test/cjk-whitespace.xhtml

There will be plenty of content from other formats
with this linguistically meaningless white space.
Why not just get rid of it in the producer end like you have to get rid 
of form feeds?

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: the atom:copyright element

2005-05-08 Thread Henri Sivonen
On May 8, 2005, at 19:45, Eric Scheid wrote:
On 9/5/05 12:18 AM, Graham [EMAIL PROTECTED] wrote:
What other rights can be associated with content? Habeas Corpus? The
right to bear arms?
trademarks?
If your lawyer tells you that you need to recite some legalese about 
trademarks, wouldn't (s)he tell you to put the legalese where people 
will see it with the trademarks--ie. in the content?

moral rights (not just attribution)?
Can you give an example of a jurisdiction where it is useful to declare 
moral rights in a way other than indicating the author? Do you have a 
jurisdiction in mind where the moral rights can be vested in an entity 
other than the author who is a natural person?

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: Atom feed refresh rates

2005-05-05 Thread Henri Sivonen
On May 5, 2005, at 16:24, Walter Underwood wrote:
--On May 5, 2005 8:07:15 AM -0500 Mark Pilgrim [EMAIL PROTECTED] 
wrote:
Not to be flippant, but we have one that's widely available.  It's
called the Expires header.
You need the information outside of HTTP. To quote from the RSS spec
for ttl:
  This makes it possible for RSS sources to be managed by a 
file-sharing
  network such as Gnutella.

Caching information is about knowing when your client cache is stale,
regardless of how you got the feed.
Virtually everyone with IP connectivity can do HTTP, and HTTP has the 
Expires header. If this feature is important to you, why would you 
switch to a transfer protocol that doesn't have the feature? (I am not 
claiming anything about the actual Gnutella features.) To me, the what 
if the feed is not over HTTP argumentation seems theoretical 
over-generalization.

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: Autodiscovery

2005-05-04 Thread Henri Sivonen
On May 4, 2005, at 02:56, David Nesting wrote:
Plus, feed is kind of application-specific.  What about related?
It's a spec for discovering *feeds*. It is proper to have an 
app-specific rel value to avoid feed-specific apps downloading non-feed 
related documents.

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: Last Call: 'The Atom Syndication Format' to Proposed Standard

2005-05-04 Thread Henri Sivonen
On Apr 29, 2005, at 12:17, Martin Duerst wrote:
Making this more precise is definitely desirable. But there is also
an i18n issue: This works fine for languages that use spaces between
words. It doesn't work for languages that don't have spaces between
words (Chinese, Japanese, Thai,...). If Text elements are only used
for short things such as names or titles, that's not a big issue,
the text in question can just be put on a single line. However,
when the texts in question are long, it's a serious issue, and
should be fixed.
You seem to be assuming that the length of a line is restricted in 
XML source. Why? As far as I can tell, it should be permissible to 
produce Atom documents that contain no LF or CR characters.

Can't languages without spaces use long source lines and apply soft 
wrapping in a source view if necessary? Why is this a wire format 
problem?

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: xml:base and html rendering

2005-04-21 Thread Henri Sivonen
On Apr 21, 2005, at 15:26, Robert Sayre wrote:
No. xml:base doesn't mean anything in (x)html, ever, so there is no 
need for a structural measure.
In practice, anything that end up in the xml: namespace is part of the 
infrastructure and generic underpinnings won't turn them off based on 
higher-level specs.

Are you suggesting Mozilla is wrong in supporting xml:base in a generic 
way (including in XHTML)?

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: Standalone schema?

2005-04-20 Thread Henri Sivonen
On Apr 20, 2005, at 01:29, Robert Sayre wrote:
Henri Sivonen wrote:
Is the Relax NG schema from the appendix hosted as a standalone HTTP 
resource somewhere on atompub.org ?
Now it is.
http://atompub.org/2005/04/18/atom.rnc
Thanks.
Could you also set the MIME type to application/vnd.relax-ng.rnc 
instead of text/plain?

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: PaceXmlContentWrapper

2005-04-20 Thread Henri Sivonen
On Apr 20, 2005, at 05:21, Graham wrote:
iii) Require that xhtml and *xml content elements have only a single 
child node. That is, all xml must be wrapped in an enclosing element 
(eg contenttext bbold/b more text/content would be invalid).
-1. The Atom content element is the wrapper.
If your rendering mechanism requires a single root but not 
atom:content, shouldn't it be your responsibility to add it?

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: PaceXmlContentWrapper

2005-04-20 Thread Henri Sivonen
On Apr 20, 2005, at 11:53, Bill de hÓra wrote:
Henri Sivonen wrote:
On Apr 20, 2005, at 05:21, Graham wrote:
iii) Require that xhtml and *xml content elements have only a single 
child node. That is, all xml must be wrapped in an enclosing element 
(eg contenttext bbold/b more text/content would be invalid).
-1. The Atom content element is the wrapper.
If your rendering mechanism requires a single root but not 
atom:content, shouldn't it be your responsibility to add it?
Not if the content formats have single roots by design.
Yes, */xml and */*+xml obviously require a single root. However, 
type='xhtml' is a fragment--not a full XHTML doc.

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: Standalone schema?

2005-04-20 Thread Henri Sivonen
On Apr 20, 2005, at 17:52, Robert Sayre wrote:
Henri Sivonen wrote:
Could you also set the MIME type to application/vnd.relax-ng.rnc 
instead of text/plain?
Hmm. That's not a registered type, is it?
It doesn't appear in the registry. I thought that type had consensus on 
the OASIS Relax NG mailing list in last July, but apparently the 
registration has not been carried out. I guess that leaves 
application/octet-stream, then. :-/

(It seems to me the MIME type system is dysfunctional and 
extension/foo, where foo is the commonly used filename extension would 
go a long way for new types.)

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Standalone schema?

2005-04-19 Thread Henri Sivonen
Is the Relax NG schema from the appendix hosted as a standalone HTTP 
resource somewhere on atompub.org ?

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: HTML/XHTML type issues, was: FW: XML Directorate Reviewer Comments

2005-04-17 Thread Henri Sivonen
On Apr 14, 2005, at 22:07, Robert Sayre wrote:
Bill de hÓra wrote:
If there is not a tidy way out of this, or we don't just agree to
scope it as a point solution for a particular flavour of XHTML, I
suggest we revisit the xhtml:div approach altogether.
Bill, I don't think the problem is exclusive to the outer div. I can't 
find any version of XHTML that allows xh:p/ where xh is bound to the 
XHTML namespace URI.
DTDs and namespaces are inherently incompatible. I think the 
restrictions the official DTDs place on namespace declarations should 
be ignored when embedding XHTML in Atom. (Note that the XHTML 
Modularization DTDs have stuff that can be parametrized.)

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: HTML/XHTML type issues, was: FW: XML Directorate Reviewer Comments

2005-04-14 Thread Henri Sivonen
On Apr 14, 2005, at 10:07, Thomas Broyer wrote:
[1] http://www.w3.org/TR/xhtml1-schema/
For the record, James Clark has made Relax NG schemas for some flavors 
of XHTML 1.x. Surely James Clark is at least as good an authority as 
the W3C. :-)

http://www.thaiopensource.com/relaxng/xhtml/
--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: HTML/XHTML type issues, was: FW: XML Directorate Reviewer Comments

2005-04-13 Thread Henri Sivonen
On Apr 13, 2005, at 01:02, Robert Sayre wrote:
How about this text for XHTML:
 If the value of type is xhtml, the content of the Text construct
 MUST be a single XHTML div element [XHTML transitional reference], and
 SHOULD be suitable for handling as XHTML.
Instead of saying XHTML it would be clearer to say XHTML 1.x or 
defining it in terms of the XHTML 1.x namespace URI.

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: HTML/XHTML type issues, was: FW: XML Directorate Reviewer Comments

2005-04-13 Thread Henri Sivonen
On Apr 13, 2005, at 03:18, Asbjørn Ulsberg wrote:
namely to use a Strict DOCTYPE.
type='xhtml' takes a fragment and Atom is DTDless. Better to stay away 
from the word DOCTYPE.

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: HTML/XHTML type issues, was: FW: XML Directorate Reviewer Comments

2005-04-13 Thread Henri Sivonen

On Apr 13, 2005, at 15:37, A. Pagaltzis wrote:
* Henri Sivonen [EMAIL PROTECTED] [2005-04-13 08:25]:
On Apr 13, 2005, at 01:02, Robert Sayre wrote:
[XHTML transitional reference]
Instead of saying XHTML it would be clearer to say XHTML
1.x or defining it in terms of the XHTML 1.x namespace URI.
He already did, no?
Not as clearly as possible. If 1.x is meant, it should be clearly 
pointed out. If 1.0 (not Ruby nor the What WG stuff) is meant, it 
should be clearly pointed out.

I prefer 1.x over 1.0 or 2.0.
--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: HTML/XHTML type issues, was: FW: XML Directorate Reviewer Comments

2005-04-13 Thread Henri Sivonen
On Apr 14, 2005, at 00:10, Graham wrote:
On 13 Apr 2005, at 8:31 pm, David Powell wrote:
I agree that the Atom RFC shouldn't be a moving target.  How about if 
we:

  Make an IANA registry of Atom Text Types, requiring some level of
  RFC to create new ones.
  Put text, html, and xhtml in the registry, and specify that xhtml
  means XHTML1.0
  Describe what processors should do if they encounter an Atom Text
  Type that they don't understand.
-1
-1 from me as well. Overkill.
I think it will be safe to leave any further formats to a new rev of 
the spec, since whatever new thing needs to be added would have to 
have reached similar popularity to either plain text or HTML.
That pretty much rules out XHTML 2.0, doesn't it? Good ideas from the 
XHTML 2.0 namespace are being backported to the XHTML 1.x namespace, so 
it is unlikely for XHTML 2.0 to reach large-scale popularity.

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: HTML/XHTML type issues, was: FW: XML Directorate Reviewer Comments

2005-04-12 Thread Henri Sivonen
On Apr 11, 2005, at 22:23, Norman Walsh wrote:
But I hope not. I don't really want to have to rev the Atom format
spec when XHTML 2.0 comes out. With care, I want to just put XHTML 2.0
stuff in my xhtml:div elements and let the down-stream appliation work
it out.
It will be nice to have the div in the XHTML 1.x namespace and the 
contents in the XHTML 2.0 namespace. Namespace div indeed. :-)

But XHTML 2.0 is a different language form XHTML 1.x. Why do you think 
XHTML 2.0 fragments should be allowed as type='xhtml'? Just because 
XHTML 2.0 has XHTML in the name? If it is not about the name, why not 
DocBook NG fragments? One might argue that DocBook NG has a better 
chance of getting browser support than XHTML 2.0. ;-)

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: copyright

2005-03-25 Thread Henri Sivonen
On Mar 25, 2005, at 14:41, Henry Story wrote:
Clearly it would be very helpful if there were a machine readable way 
to set copyright
policy on entries. Any thoughts on that?
Legalese is complicated enough that it cannot be fully expressed in 
machine readable form unless the machine is AI-complete. CC has created 
an RDF vocabulary for describing their license elements. However, when 
you start describing eg. the GFDL with CC's vocabulary, you are on 
shaky ground, because the license details may not be exactly equivalent 
to the CC terms.

It seems to me that it is more straight-forward to identify a known 
license than to try to describe a license. Tantek Çelik has publicized 
and mozilla.org (among others) has adopted a 'license' rel value for 
HTML link. This approach avoids the issue of machine-readable 
licenses and software interpreting legalese, because a human programmer 
needs to supply the software with known license URIs. Obviously, this 
doesn't work nicely with obscure licenses but assumes that the license 
URIs are well-known. The solution is also less crufty than hiding RDF 
in discardable comments. (Hiding copyright metadata in discardable 
comments is a bad idea, because in some jurisdictions there may soon be 
legislation that makes it illegal to discard copyright metadata. The 
XML spec says software may discard comments.)

However, you cannot make everyone who provides useful feeds to describe 
their policy, so someone running a feed aggregation service still needs 
all lawyer who can convincingly argue against the permission culture.

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: copyright, Creative Commons, and DRM in Atom/RSS

2005-03-25 Thread Henri Sivonen
On Mar 25, 2005, at 18:38, Bob Wyman wrote:
http://bobwyman.pubsub.com/main/2005/03/lazyweb_query_a.html
	The basic message is that we should not be writing anything that
implies that Creative Commons should be used for Digital Rights 
Management.
+1
BTW, I think the anti-DRM clause of CC is really intended as an 
anti-anti-circumvention clause and should be rephrased as such. See 
http://lists.debian.org/debian-legal/2005/03/msg00423.html

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: Attributes on the xhtml:div wrapper

2005-03-17 Thread Henri Sivonen
On Mar 17, 2005, at 00:57, David Powell wrote:
  c) disallow XHTML attributes on the xhtml:div wrapper, but
 allow xml:lang.
If you allow declaring the language, why do you disallow declaring the 
dominant writing direction (dir)? Shouldn't they be allowed or 
disallowed together?

d) Get rid of the wrapper div.
--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: On PaceXhtmlNamespaceDiv

2005-02-17 Thread Henri Sivonen
On Feb 17, 2005, at 15:09, Bill de hÓra wrote:
1. About Rob's example from MT; the point is twofold a) that what the 
tool provides ootb
Wasn't that what type='HTML' was for?
--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: On PaceXhtmlNamespaceDiv

2005-02-16 Thread Henri Sivonen
On Feb 16, 2005, at 10:53, Robert Sayre wrote:
This is how I generate my Atom 0.3 feed, using the popular Movable 
Type program:

content type=text/html
 mode=escaped
 xml:lang=en
 xml:base=$MTBlogURL encode_xml=1$
  $MTEntryBody encode_xml=1$
/content
How's that relevant wrt. type='XHTML'? If you doubt the robustness of 
concatenation with XHTML, why wouldn't you keep using entity-encoded 
HTML?

(FWIW, I don't believe the pace removes the telescoping divs issue. I 
predict the div as part of the envelope is unintuitive to the 
ViewSourceClan, who will assume everything in the Atom namespace is 
part of the envelope and everything in the XHTML namespace is part of 
the payload.)

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: On PaceXhtmlNamespaceDiv

2005-02-16 Thread Henri Sivonen
On Feb 17, 2005, at 00:18, Robert Sayre wrote:
 However, this is not going to reflect what's in MySQL.
Does it need to? Could anybody except for you check?
Should anybody (including you) care? Is it part of conformance?
Yes. No. Yes. Yes.
Why does the feed have to reflect what is in MySQL? What if you had 
Markdown content (or similar non-(X)HTML) in MySQL?

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Re: PaceXhtmlNamespaceDiv

2005-02-10 Thread Henri Sivonen
On Feb 10, 2005, at 18:02, Sam Ruby wrote:
We have seen on this very mailing list people who have an above 
average understanding of XML trip over this particular area numerous 
times.
Those trip-ups have not been as much about div vs. no div but about 
XMLNS which we can't and should not attempt to change. I should also 
note that typed examples on the list and output from debugged 
serializers are different things.*

* Aka. the tools will save us argument. Despite the tools will save 
us argument being unpopular, I think it is unwise for an average 
developer to approach XMLNS without proper tools.

--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: type=HTML

2005-02-09 Thread Henri Sivonen
On Feb 9, 2005, at 20:06, Julian Reschke wrote:
I'll agree that it's easier to produce for those who don't understand 
XML. But shouldn't we recommend XHTML for those who *do* understand 
XML/XHTML (and actually read the spec)?
Don't you expect those people to use XHTML without being told so just 
out of an aesthetic dislike of entity-encoded HTML?

--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: type=HTML

2005-02-09 Thread Henri Sivonen
On Feb 10, 2005, at 01:00, Graham wrote:
Does type=XHTML have *any* strengths compared to type=HTML (expect 
being easier to produce if you don't have HTML in the first place?).
 * Mixing MathML for future Gecko-based aggregators
 * Mixing SVG for aggregators that use a future version of 
Gecko/WebCore/Presto
 * Less repulsive for people who really don't like entity-encoded HTML
 * Easier to produce with some tools (even if you have HTML in the 
first place)

--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: PaceXhtmlNamespaceDiv

2005-02-08 Thread Henri Sivonen
On Feb 7, 2005, at 23:21, Sam Ruby wrote:
  1) Graham (who uses proper XML tools) will have to do more work.
Which tools? How much more? One loop more?
(FWIW, I do not consider Apple's Core Foundation XML parser a proper 
XML tool.)

  2) Tim (who uses string concatenation) will have to do more work.
When I did a string concatenation implementation, using colonified 
names was not a big deal.

  5) For some combinations of clients and servers, entries produced
 via an HTTP POST will end up with multiple divs.
I can anticipate that happening either way.
--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: PaceXhtmlNamespaceDiv

2005-02-08 Thread Henri Sivonen
On Feb 8, 2005, at 07:03, Henri Sivonen wrote:
On Feb 8, 2005, at 01:55, Sam Ruby wrote:
Can I get one of you three to mock up what Tim's feed should look 
like?
a:feed xmlns:a='http://purl.org/atom/ns#draft-ietf-atompub-format-04' 
version='draft-ietf-atompub-format-04 : do not deploy' 
xmlns='http://www.w3.org/1999/xhtml' xml:lang='en-us'
...
 a:entry
...
  a:content type='XHTML'I was in the drugstore picking up a 
prescription and wandered into the computer section, where I found 
myself impulse-buying a 
href='http://dvforge.com/themousebt.shtml'The Mouse BT/a from some 
outfit
...
OR (even less impact on string concatenators):
feed xmlns='http://purl.org/atom/ns#draft-ietf-atompub-format-04' 
version='draft-ietf-atompub-format-04 : do not deploy' 
xml:lang='en-us'
...
 entry
...
  a:content type='XHTML' 
xmlns:a='http://purl.org/atom/ns#draft-ietf-atompub-format-04' 
xmlns='http://www.w3.org/1999/xhtml'I was in the drugstore picking up 
a prescription and wandered into the computer section, where I found 
myself impulse-buying a href='http://dvforge.com/themousebt.shtml'The 
Mouse BT/a from some outfit
...

--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: PaceXhtmlNamespaceDiv

2005-02-08 Thread Henri Sivonen
On Feb 7, 2005, at 22:58, Sam Ruby wrote:
If you are opposed to this pace, then what div element?
If the pace does not get through, it is still permissible to put a div 
in there as part of the content. In fact, either way it is permissible 
to add meaningless extra divs, since a div can nest in a div.

--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: PaceLangSpecific

2005-02-08 Thread Henri Sivonen
On Feb 8, 2005, at 10:43, David Powell wrote:
text/plain can have a Content-Language header though - is that 
compatible with
xml:lang?
For practical purposes, yes. In theory, the semantics are different.
xml:lang specifies the language of the content but Content-Language 
describes the language of the intended audience.

--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: PaceXhtmlNamespaceDiv

2005-02-08 Thread Henri Sivonen
On Feb 8, 2005, at 15:59, Julian Reschke wrote:
When a Text construct or atom:content's type is XHTML, require it 
to have a single xhtml:div as a child, and require that div to declare 
the XHTML namespace.

Am I looking at the wrong pace? 
(http://www.intertwingly.net/wiki/pie/PaceXhtmlNamespaceDiv)
The abstract no longer matches the proposal. This seems to cause 
confusion.

--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


PaceXhtmlNamespaceDiv

2005-02-07 Thread Henri Sivonen
On Feb 7, 2005, at 19:50, Paul Hoffman wrote:
Even if you sent in a +1, 0, or -1 previously on a particular Pace, 
send it in again. Hopefully, add a short or long comment on why you 
feel it should or should not be considered part of the Atom core.
-1 on PaceXhtmlNamespaceDiv
The div is an additional container inside the Text construct container. 
The main purpose of the div is to save one for loop in a 
document-tree/pull-based client or, alternatively, to give a false 
sense of correctness in tag soup concatenation-based clients. IMO, 
neither rationale warrants a meaningless element inside a Text 
construct.

(Noting that the Pace was retracted by the original author.)
--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: PaceXhtmlNamespaceDiv

2005-02-07 Thread Henri Sivonen
On Feb 7, 2005, at 21:52, Antone Roundy wrote:
+1, but I wouldn't object to a variant that required either the DIV 
with a namespace declaration OR for the namespace to be declared in 
content or before content.  Examples of what I'd think was 
acceptable:

feed ... xmlns:xhtml=... /
...
contentThis is xhtml:bbold/xhtml:b/content
This is against the Pace. Are you really supporting the pace?
OR
content xmlns:xhtml=... /This is xhtml:bbold/xhtml:b/content
This is against the Pace, too.
OR
contentdiv xmlns=... /This is bbold/b/content
This one is against the pace as well. Also, the 'b' element would be in 
the same namespace as 'content', which seems wrong.

Here's what I think is just plain ugly and shouldn't be allowed:
contentThis is b xmlns=... bold/b/content
IMO, that should be allowed. Atom has no business in restricting the 
syntactic sugar provided by Namespaces in XML.

OR
contentThis is xhtml:b xmlns:xhtml=... bold/xhtml:b/content
IMO, this on should be allowed on the same grounds.
--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: PaceArchiveDocument

2005-02-07 Thread Henri Sivonen
On Feb 7, 2005, at 20:04, Robert Sayre wrote:
Paul Hoffman wrote:
-1. Not core. The current text has a simple way of creating archives, 
and extensions can be used to create more specialized archive 
formats.
-1 to the Pace. Agree in full.
-1 to the Pace. Me three.
--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: PaceXhtmlNamespaceDiv

2005-02-07 Thread Henri Sivonen
For the record, the additional loop that the div would save in a 
DOM-based client is not that hard:

copyLangAndBase(atomTextCostruct, targetDivInTemplate);
for (Node n = atomTextCostruct.getFirstChild(); n != null; n = 
n.getNextSibling()) {
	targetDivInTemplate.appendChild(templateXhtmlDocument.importNode(n, 
true));
}

Is that too much to ask from clients?
--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: PaceXhtmlNamespaceDiv

2005-02-07 Thread Henri Sivonen
On Feb 8, 2005, at 01:55, Sam Ruby wrote:
Can I get one of you three to mock up what Tim's feed should look like?
a:feed xmlns:a='http://purl.org/atom/ns#draft-ietf-atompub-format-04' 
version='draft-ietf-atompub-format-04 : do not deploy' 
xmlns='http://www.w3.org/1999/xhtml' xml:lang='en-us'
...
 a:entry
...
  a:content type='XHTML'I was in the drugstore picking up a 
prescription and wandered into the computer section, where I found 
myself impulse-buying a href='http://dvforge.com/themousebt.shtml'The 
Mouse BT/a from some outfit
...

OR
a:feed xmlns:a='http://purl.org/atom/ns#draft-ietf-atompub-format-04' 
version='draft-ietf-atompub-format-04 : do not deploy' 
xmlns:h='http://www.w3.org/1999/xhtml' xml:lang='en-us'
...
 a:entry
...
  a:content type='XHTML'I was in the drugstore picking up a 
prescription and wandered into the computer section, where I found 
myself impulse-buying h:a 
href='http://dvforge.com/themousebt.shtml'The Mouse BT/h:a from some 
outfit
...

OR
feed xmlns='http://purl.org/atom/ns#draft-ietf-atompub-format-04' 
version='draft-ietf-atompub-format-04 : do not deploy' 
xmlns:h='http://www.w3.org/1999/xhtml' xml:lang='en-us'
...
 entry
...
  content type='XHTML'I was in the drugstore picking up a 
prescription and wandered into the computer section, where I found 
myself impulse-buying h:a 
href='http://dvforge.com/themousebt.shtml'The Mouse BT/h:a from some 
outfit
...

--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: RELAX NG Grammar question

2005-02-03 Thread Henri Sivonen
On Feb 4, 2005, at 01:02, Norman Walsh wrote:
| As for the XHTML thing, I think this is going to happen all the time 
(foreign
| markup in embedded XHTML) and I don't think we should try to get in 
the way.
| However, I just now tried to think of some spec language to express 
this and
| came up empty. -Tim

I think the spec language is fine
   If the value of type is XHTML, the content of the Text construct
   MAY contain child elements.  The content SHOULD be XHTML text and
   markup that could validly appear directly within an xhtml:div
   element.
Depending on what W3C DTD you choose to read, you could put MathML or 
even SVG there, which is fine, IMO, because I don't think it makes 
sense to exclude well-known vocabularies that you could legitimately 
serve embedded in application/xhtml+xml.

--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: Posted PaceTextRules

2005-02-01 Thread Henri Sivonen
On Feb 1, 2005, at 11:08, Martin Duerst wrote:
+1 with the minor nit that lone line breaks should be considered 
spaces--not disregarded. Thus: Software displaying this text SHOULD 
remove white-space at the beginning and end; and treat sequences of 
one or more XML white-space characters as a single space.
Except that that's completely inadequate for some Asian languages
(Chinese, Japanese, Thai,...). Point to a spec developed
by experts if you need to rather than trying to reinvent the wheel.
I used http://www.w3.org/TR/2004/REC-xml-20040204/#AVNormalize and the 
default white-space behavior of browsers as the precedent.

Could you please point to a spec that is developed by experts and does 
not reinvent the wheel? Also, I would appreciate a pointer to a 
document explaining the issues with Chinese, Japanese and Thai. I don't 
think what I wrote above in any way precludes soft line breaks for 
display at places that do not have adjacent XML white-space.

(All this would be so much easier if XML used linefeeds and tabs for 
pretty-printing and those could always be dropped and spaces were 
always meaningful content. Too late to change it though.)

--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: Issues with draft-ietf-atompub-format-04

2005-01-31 Thread Henri Sivonen
On Jan 31, 2005, at 03:11, Asbjørn Ulsberg wrote:
On Sun, 30 Jan 2005 22:06:23 +0200, Henri Sivonen [EMAIL PROTECTED] 
wrote:

So how many European sites besides the EU have the resources to 
provide translations of the *same* content in multiple languages at 
the same time?
The company I work in (the Norwegian Broadcasting Company), for 
instance. And most definately the BBC. And many other broadcasters, 
including RN (Radio Netherlands) which broadcasts and publishes 
content in many different languages. Don't any Finish broadcasters 
publish or broadcast content in both Finish and Sami for instance?
There are TV programs in Swedish, but they are not translations of 
Finnish content. Both the Finnish and Swedish branches have news, but 
they have separate editorial processes. I expect the Sami radio to have 
separate editorial staff as well so that their content is not a mere 
translation of the content any Finnish-language channel.

How many of those can't provide multiple feed links and really want to
stuff everything in a single feed?
All of the above, I'd presume. If it can be stuffed into the same 
feed, everyone would of course rather do that than create separate 
documents for each language.
Why? The feed takes longer to download that way and you pay more for 
bandwidth.

Aren't language alternatives within a feed over-engineering.
No, not at all. It makes it possible to do for those who want, but 
don't have any implications on other producers.
All features involve a cost at the consumer side.
--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: Posted PaceTextRules

2005-01-31 Thread Henri Sivonen
On Jan 31, 2005, at 03:00, Graham wrote:
Software displaying this text SHOULD remove white-space at the 
beginning and end; collapse white-space between words; and disregard 
line break, tab and other formatting characters. in 3.1.1 (TEXT).
+1 with the minor nit that lone line breaks should be considered 
spaces--not disregarded. Thus: Software displaying this text SHOULD 
remove white-space at the beginning and end; and treat sequences of one 
or more XML white-space characters as a single space.

--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: Comments on draft 05, was: I-D ACTION:draft-ietf-atompub-form at-05.txt

2005-01-31 Thread Henri Sivonen
On Jan 31, 2005, at 01:32, Sam Ruby wrote:
Julian Reschke wrote:
atomTitle type=XHTML xmlns:xhtml=http://www.w3.org/1999/xhtml;
  Less: xhtml:em lt; /xhtml:em
/atomTitle
(hope I got these right).
This is not only right, but also a good example of why many people 
would prefer to have another element so that they don't have to deal 
with prefixes:
Their serializer should deal with them. If they are concatenating 
source strings, using XHTML is risky.

  atomTitle type=XHTML
div xmlns=http://www.w3.org/1999/xhtml;Less: em lt; 
/em/div
  /atomTitle

The above is similar to your example, but not _identical_ to your 
example, given the current spec.
You could also have
atom:title type=XHTML xmlns=http://www.w3.org/1999/xhtml;
  Less: em lt; /em
/atom:title
(assuming the atom prefix to be bound higher up)
--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: Comments on draft 05, was: I-D ACTION:draft-ietf-atompub-form at-05.txt

2005-01-31 Thread Henri Sivonen
On Jan 31, 2005, at 16:05, Julian Reschke wrote:
Can't we just get rid of the defaulting? That would make the spec 
simpler with little additional verbosity in the instance documents.
If there is no default, someone will still omit the attribute and 
consumers will try to cope. I think it is better if they cope in the 
same way normatively.

--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: Issues with draft-ietf-atompub-format-04

2005-01-30 Thread Henri Sivonen

On Jan 30, 2005, at 19:06, Anne van Kesteren wrote:
In Europe there are lots of different languages. It does not make 
sense to provide a feed based on language negotiation since feed 
aggregators do not support that.
So how many European sites besides the EU have the resources to provide 
translations of the *same* content in multiple languages at the same 
time? How many of those can't provide multiple feed links and really 
want to stuff everything in a single feed?

Aren't language alternatives within a feed over-engineering.
--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: PaceXhtmlNamespaceDiv posted

2005-01-30 Thread Henri Sivonen
On Jan 30, 2005, at 21:06, Julian Reschke wrote:
OK, I'll try to rephrase: changing the protocol format because one 
implementor says that this makes it easier to implement IMHO is a bad 
idea. Of course things look differently if this issue affects more 
platforms/parsers/toolkits.
FWIW, one could let WebCore see the Atom Text construct element without 
harm. So if a single container is needed, the Text construct itself is 
the obvious container.

--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: PaceXhtmlNamespaceDiv posted

2005-01-29 Thread Henri Sivonen
On Jan 29, 2005, at 00:39, Sam Ruby wrote:
Henri Sivonen wrote:
On Jan 28, 2005, at 20:21, Sam Ruby wrote:
I also don't like the restriction on where namespace declarations 
must
be placed, but overall, I believe that the pace is a good idea.
I, for one, use gnu.xml.pipeline.NSFilter for ensuring the namespace 
correctness in my RSS feed.  If the current spec language stands, I 
will be able to trivially add Atom output with type='XHTML' by doing 
a DOM to DOM copy (without div cruft) and letting the serialization 
phase sort out the namespace declarations for me. If this pace was 
accepted, I'd have to break the namespace abstraction and fiddle with 
the namespace declaration details to meet additional requirements 
that are unnecessary as per Namespaces in XML.
Are you saying that you can do a DOM to DOM copy to place a series of 
elements inside the following:

  atom:feed/atom:entry/atom:content
But you would find it extraordinarily difficult to place the exact 
same series of elements inside the following:

  atom:feed/atom:entry/atom:content/xhtml:div
If so, I would find such an assertion to be hard to accept.
No, of course those two are equally easy. The latter is just crufty.
My point was that having the namespace declarations appear in a 
particular choice of syntactic sugar would hinder the use of an 
automatic namespace declaration manager.

If element names are prefixed, string concatenation is not an option 
anyway.
Which is why it would be harmful to imply that a string concatenation 
implementation was ok.

If element names are not prefixed (as is the case with the overwheming 
majority of existing HTML), adding a div is exactly what makes things 
safe for simple string concatenators.
I don't see how it makes string concatenation easier that concatenating 
the contents of the Text construct otherwise. But since it doesn't work 
in all cases, it certainly should not be specced for. Speccing for tag 
soup concatenation would just encourage implementors to do tag soup 
concatenation. Wasn't the whole reason for the existence of 
type='XHTML' to provide something that is more sound from the XML point 
of view than entity-encoded HTML?

And no, a div does not give any useful hint to the ViewSourceClan, 
because it is an anything goes element.

I am sorry I brought up the subject of Text constructs. How about 
withdrawing both PaceTypeTextRedundant and PaceXhtmlNamespaceDiv and 
sticking to what draft-ietf-atompub-format-04 said?

--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: Difference of TEXT and XHTML?

2005-01-27 Thread Henri Sivonen
On Jan 27, 2005, at 09:41, Tim Bray wrote:
OK, you've advanced this argument several times now.  If you want to 
change the Atom format to remove type=TEXT, write a Pace (it'll be 
short  easy) and see if you can build consensus.
http://www.intertwingly.net/wiki/pie/PaceTypeTextRedundant
I have to warn you that the process by which we got to consensus on 
the current setup was long, arduous, and involved a huge volume of 
email, and you may get -1's based on people just not wanting to 
revisit this space.
Sure, the current version is better than the mode/type stuff.
--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: Difference of TEXT and XHTML?

2005-01-27 Thread Henri Sivonen
On Jan 27, 2005, at 17:50, Tim Bray wrote:
On Jan 27, 2005, at 4:46 AM, Eric Scheid wrote:
however, the spec says:
The content SHOULD be XHTML text and markup that could
validly appear directly within an xhtml:div element.
which could lead others to make the same mistake I must have made.
The easiest way to solve this, I think, is with examples.
Avoiding the word markup would also help avoiding any associations 
with unparsed source.

--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: Mandatory method of specifying XHTML namespace?

2005-01-27 Thread Henri Sivonen
On Jan 27, 2005, at 19:30, Antone Roundy wrote:
Given how common it is even for us, when posting examples of content 
type=XHTML without declaring the XHTML namespace, might it be a 
good idea to specify a mandatory method of declaring the XHTML 
namespace to ensure that implementors don't forget?
+1 That should be obvious.
If the value of type is XHTML, the content of the Text construct 
MUST be a single xhtml:div element
-1 gratuitous element cruft. The text construct element itself serves 
as a container.

which MUST declare the XHTML namespace, either as the default 
namespace or with a namespace prefix
-1 Atom should not micro-manage how and on which elements namespace 
declarations appear.

Feed in the wild that declares the namespaces on root:
http://macsanomat.fi/atom
--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: PaceXhtmlNamespaceDiv posted

2005-01-27 Thread Henri Sivonen
On Jan 27, 2005, at 22:30, Antone Roundy wrote:
I'm not in favor of mandating restrictions, because there are probably 
legitimate uses for anything we might try to protect people against.
The namespace div places restrictions on where namespace declarations 
appear and, therefore, limits the legitimate use of serializers that 
take care of namespace declarations.

-1 for the pace, still.
--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: Mandatory method of specifying XHTML namespace?

2005-01-27 Thread Henri Sivonen
On Jan 28, 2005, at 01:38, Eric Scheid wrote:
On 28/1/05 7:39 AM, Henri Sivonen [EMAIL PROTECTED] wrote:
If the value of type is XHTML, the content of the Text construct
MUST be a single xhtml:div element
-1 gratuitous element cruft. The text construct element itself serves
as a container.
but atom:title != xhtml:title
also, are there such things as xhtml:copyright, xhtml:info, or
xhtml:summary?
No, but that is not the point. The *Atom* Text constructs work as 
containers. The div is just cruft.

IMO,
a:copyright type='XHTML'Copyright 2005 John Doe, h:emall rights 
reserved/h:em/a:copyright
(assuming 'a' to be bound to the Atom namespace and 'h' to the XHTML 
namespace) is less crufty than
a:copyright type='XHTML'div 
xmlns='http://www.w3.org/1999/xhtml'Copyright 2005 John Doe, emall 
rights reserved/em/div/a:copyright
.

Even if you wanted to declare the XHTML namespace on the spot, you 
could put the declaration on the Atom Text construct. Any feed reader 
using namespace-aware XML tools properly will have no trouble dealing 
with the less crufty versions.

--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: Issues with draft-ietf-atompub-format-04

2005-01-27 Thread Henri Sivonen
On Jan 27, 2005, at 22:39, Robert Sayre wrote:
Anne van Kesteren wrote:
So I can not include MathML in the TITLE of my weblog? I do not see 
why this restriction is necessary.
Nope. Can any aggregator display it?
I expect Gecko-based aggregators to support MathML eventually. After 
all, once you support XHTML content in a Gecko-based aggregator in a 
non-bogotic way (XML DOM to XML DOM copy with platupus filtering), you 
get MathML for free.

--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Difference of TEXT and XHTML?

2005-01-26 Thread Henri Sivonen
Quoting from draft-ietf-atompub-format-04:
3.1.1  type Attribute
...
   If the value is TEXT, the content of the Text construct MUST NOT
   contain child elements.  Such text is intended to be presented to
   humans in a readable fashion.  Thus, software MAY display it using
   normal text rendering techniques such as proportional fonts,
   white-space collapsing, and justification.
...
   If the value of type is XHTML, the content of the Text construct
   MAY contain child elements.  The content SHOULD be XHTML text and
   markup that could validly appear directly within an xhtml:div
   element.  Receiving software which displays the content MAY use the
   markup to aid in displaying it.
Considering that TEXT MAY be white-space collapsed (why not SHOULD?) 
and XHTML allows anything that could go in div including text, isn't 
TEXT redundant?

What's the difference between:
atom:title type='TEXT'I   do not like 
![CDATA[]]marqueegt;/atom:title
and
atom:title type='XHTML'I   do not like 
![CDATA[]]marqueegt;/atom:title
?

Shouldn't both render as I do not like marquee?
FWIW, with the exception of content, I think allowing only %inline 
XHTML elements would make more sense than allowing %flow.

--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: Difference of TEXT and XHTML?

2005-01-26 Thread Henri Sivonen
On Jan 26, 2005, at 23:18, Tim Bray wrote:
On Jan 26, 2005, at 12:44 PM, Henri Sivonen wrote:
What's the difference between:
atom:title type='TEXT'I   do not like 
![CDATA[]]marqueegt;/atom:title
and
atom:title type='XHTML'I   do not like 
![CDATA[]]marqueegt;/atom:title
?

Shouldn't both render as I do not like marquee?
Yeah, but if you wanted to have 'not' in bold:
title type='TEXT'I do not like lt;marquee/title (can't do it)
title type='HTML'I do lt;bnotlt;/b like amp;lt;marquee/title
title type='XHTML'I do bnot/b like lt;marquee/title
But if you can always substitute type='TEXT' with type='XHTML' but not 
the other way round, what's the point of having type='TEXT' in the 
spec?

--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/


Re: Difference of TEXT and XHTML?

2005-01-26 Thread Henri Sivonen
On Jan 27, 2005, at 00:45, Robert Sayre wrote:
But guess what, TEXT is *never* coming out of the spec, because it 
will eventually become impossible to write something that looks like 
markup if we don't have it.
How so? What does type='TEXT' make possible to write that type='XHTML' 
with a single text node does not make possible to write?

--
Henri Sivonen
[EMAIL PROTECTED]
http://iki.fi/hsivonen/