Sorry if these have all been discussed before. I just read the File
API for the first time and two random questions popped into my head.
1) If I'm using readAsText with a particular encoding, and the data in
the file is not actually in that encoding, such that bytes in the
file cannot be mapped to valid code points, what happens? Is that
implementation specific, or is it specified? I can imagine at least three
different behaviors.
This should be specified better, and currently isn't. I'm inclined to
return the file in the encoding it is actually in rather than force an
encoding (in other words, ignore the encoding parameter if it is
determined that code points can't be mapped to valid code points in that
encoding; also note that we say to "Replace bytes or sequences of bytes
that are not valid according to the charset with a single U+FFFD
character [Unicode <http://dev.w3.org/2006/webapi/FileAPI/#Unicode>]").
Right now, the spec isn't specific about this scenario ("... if the user
agent cannot decode blob using encoding, then let charset be null"
before the algorithmic steps, which essentially forces UTF-8).
Can we list your three behaviors here, just so we get them on record?
Which behavior do you think is ideal? More importantly, is
substituting U+FFFD and "defaulting" to UTF-8 good enough for your
scenario above?
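For the record, the replacement rule the spec references can be illustrated with TextDecoder from the Encoding Standard (a newer API than this thread, but it follows the same "invalid bytes become U+FFFD" behavior that readAsText is expected to exhibit; the byte values here are just an example):

```javascript
// Bytes for "hi", then 0xFF (never valid in UTF-8), then "!".
const bytes = new Uint8Array([0x68, 0x69, 0xff, 0x21]);

// Decoding as UTF-8 does not throw; the invalid byte is replaced
// with a single U+FFFD REPLACEMENT CHARACTER, per the rule quoted above.
const text = new TextDecoder('utf-8').decode(bytes);
console.log(text); // "hi\uFFFD!"
```

Whether a UA should instead fall back to the file's actual encoding, as suggested above, is exactly the open question.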
2) If I'm using readAsText to read a file in a multibyte encoding
(UTF-8, Shift-JIS, etc.), is it implementation dependent whether it can
return partial characters when returning partial results during
reading? In other words, let's say the next character in the file is a
3-byte code point, but the reader has only read 2 of those 3 bytes so
far. Is it implementation dependent whether the result includes those 2
bytes before the 3rd byte is read?
Yes, partial results are currently implementation dependent; the spec
only says they SHOULD be returned. There was reluctance to impose a MUST
condition on partial file reads. I'm open to revisiting this decision
if the justification is a really good one.
-- A*