That was my understanding all along !

I have submitted a final(?) patch and sample results to Jira-242. Can you fix the tests, two of them fail, commit and merge back to the main branch.

Imtiaz

----- Original Message ----- From: "Ethan Jewett" <[email protected]>
To: <[email protected]>
Sent: Monday, July 26, 2010 12:00 AM
Subject: Re: Metadata handling (was "Release planning")


Responding to myself because there was indeed something I was
"fundamentally misunderstanding":

On Sun, Jul 25, 2010 at 8:09 PM, Ethan Jewett <[email protected]> wrote:

I think if the user sends non-entity-encoded strings to the API for
the metadata, then we need to return the same string. We're returning
UTF-8 XML here, not HTML, so my thought was that no one would be
expected entity-encoded output. If we return JSON or other formats in
the future (and I do plan to do that), then we'll have to encode
however necessary for that format.

Maybe I'm fundamentally misunderstanding how this output is supposed
to be encoded. Encoding is definitely not my area of expertise.

If I'm not misunderstanding, then yes, I think we need to decode the
entity-encoding so that we see " < and > in the output. I'm not sure
if it is currently getting encoded because of something we do or
because of Scala's XML handling.

My fundamental misunderstanding was around how you deal with the
characters <, >, ", ', and & in XML:
http://en.wikipedia.org/wiki/List_of_XML_and_HTML_character_entity_references#Predefined_entities_in_XML

They indeed must be encoded (though they are a much smaller subset of
entities that must be encoded than in HTML), so I suppose that we
should return them in encoded form. Do you agree?

Ethan

Reply via email to