The point of that formulation was to make clear that you can parse a .desktop fils into individual lines, keys and value-blobs without concern for the actual encoding used. Back in the days .desktop files could use different encodings for different lines of the file, there was no single encoding for the file as a whole.
Maybe it should have said "8-bit octets" instead of "8-bit characters". Now that everyone is supposed to use UTF8 there isn't much difference either way. The only difference would be that the current spec is clear on what to do if a file were to had a truncated (invalid) UTF8 sequence followed by a LF. I don't know if old-style mixed encoding .desktop files are likely to trigger that pattern. Cheers, Waldo Intel Corporation - Platform Software Engineering, UMG - Hillsboro, Oregon -----Original Message----- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Vincent Untz Sent: Friday, June 29, 2007 9:38 AM To: xdg@lists.freedesktop.org Subject: Small change for the desktop entry spec Hi, Someone reported that in the current version of the spec, we say: "Desktop entry files are encoded as lines of 8-bit characters separated by LF characters." But UTF-8 characters can use several bytes for one character. We can either just say "lines separated by LF characters", "lines of text separated by LF characters" or "lines of UTF-8 characters separated by LF characters". Any opinion on what's best here? Vincent -- Les gens heureux ne sont pas pressés. _______________________________________________ xdg mailing list xdg@lists.freedesktop.org http://lists.freedesktop.org/mailman/listinfo/xdg _______________________________________________ xdg mailing list xdg@lists.freedesktop.org http://lists.freedesktop.org/mailman/listinfo/xdg