Re: A simpler Web service response format

Henri Sivonen Tue, 26 Dec 2006 12:41:01 -0800


On Dec 19, 2006, at 11:55, Bjoern Hoehrmann wrote:

You might want to read my notes on the subject. They do not proposeanyparticular format, but list requirements and problems to be solvedby aformat of this kind, see <http://esw.w3.org/topic/MarkupValidator/M12N>.

"The current validator supports multiple input sources, file upload,textarea, and retrieval of remote resources. An observation isnaturally bound to the input retrieved through these sources (ortheir metadata) and should thus be identified in the observationinstance."

I don't see why the source needs to be identified. Surely the clientinvoking the checker knows what it sent as the input.

(Associating the URI of the entity with a source line and column issubtly different from merely echoing things about the input that theprovider of the input already knows.)

"The descriptor should be extensible to allow for different locationaddressing schemes"

Then consumers of the format would need to support differentaddressing schemes.

"A related question is how the results would be presented in theXHTML interface, it could be a hierarchy like"

"Well-formedness errors:"
"DTD-Validitiy errors:"
"Link Check"

Since off-the-shelf libraries don't usually categorize errors likethat, introducing such categorization as an afterthought could wellgo into the territory of diminishing returns, because the cost ofintroducing categorization would be great compared to the benefit.

For example, the SAX2 ErrorHandler interface doesn't guarantee that areport of an "error" carries any data beyond stating that an erroroccurred. In practice, an English-language message is available. Mostoften also an approximate source location is available. Extractingany more data than that generally requires hacking into the off-the-shelf libraries and subverting the usual reporting mechanism.

"bla bla ... branding ... outreach ... community ... positivestatements ... terminology ..."

:-)

In the context of that document, the need for a common format camefrom

the desire to enable multiple independent tools to combine the results
at low and high levels, for example, to combine multiple "microformat"
checkers with a general-purpose XHTML Validator.

If I were to integrate a microformat checker with my validationservice, I'd prefer to integrate them in-process. That is thecheckers would need to consume SAX2 ContentHandler events and reportto a SAX2 ErrorHandler. Of course, such an arrangement would requirethe checkers to be written in Java.

The primary use case for the Web service format that I am consideringis allowing e.g. a blogging system to send a document off to a Webservice for checking so that the blogging system doesn't need tocontain an in-process conformance checker.

Also note that the ISO 19757-3 (Schematron) specification defines areporting format.

The way I have seen Schematron used (which is also how I use itmyself) makes whether an error check is implemented as a failingassertion or as a succeeding report an implementation detail whichshouldn't be exposed to end users or even software observers outsidethe Schematron engine. I think I'll patch my copy of Jing/oNVDL atsome point to hide whether a message was generated by a failedassertion or a report.

Moreover, of late, I have started to consider the Schematron of theHTML5 conformance checker a mere rapid prototype of a hand-craftedmore CPU-efficient and more memory-efficient exclusion andreferential integrity checker.


Thank you for the pointers.

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/

Re: A simpler Web service response format

Reply via email to