Yasuhito FUTATSUKI wrote on Sun, 04 Oct 2020 21:56 +0900: > (Sorry for reply too late.)
No worries. > On 2020/09/26 19:12, Daniel Shahaf wrote: > > 1 % svn propset svn:ignore "予定表.txt" ./ > > 2 property 'svn:ignore' set on '.' > > 3 % svn propset foo:ignore "予定表.txt" ./ > > 4 property 'foo:ignore' set on '.' > > 5 % LC_ALL=ja_JP.eucjp svn pl -v > > 6 Properties on '.': > > 7 foo:ignore > > 8 予定表.txt > > 9 svn:ignore > > 10 ͽɽ.txt > > > > 11 % LC_ALL=C svn pg --strict svn:ignore > > 12 {U+4E88}{U+5B9A}{U+8868}.txt > > > > 13 % svn propset svn:ignore "{U+4E88}.txt" ./ > > 14 property 'svn:ignore' set on '.' > > 15 % sqlite3 .svn/wc.db .dump | me > > 16 (svn:ignore 29 {U+4E88}{U+5B9A}{U+8868}.txt ) > > 17 % svn pg --strict svn:ignore > > 18 {U+4E88}{U+5B9A}{U+8868}.txt > > . > > So, I think there are a number of different issues/gotchas here: > > > > - It's not possible to get the raw value of an svn:* property in > > a working copy if the value is not representable in the local encoding. > > I belive that if we want to get property values precisely, we should > use xml output, although --no-newline is enough in most case except > this case. Hmm, that's an interesting one. On the one hand, «propget --xml» does resolve the ambiguity issue of the ad-hoc escaping; on the other hand: - We shouldn't require CLI users to use an XML parser in order to retrieve values of binary blobs. - The XML document declares itself to be in UTF-8. Does that mean XML parsers are allowed to treat the dumped property values as UTF-8 and, for example, convert the byte sequence (that comprises the value) to another byte sequence, that's equivalent when treated as UTF-8 but not equivalent when treated as binary blobs? (For example, convert the UTF-8 to composed or decomposed normal form.) Thanks for the (snipped) remainder; I agree. Cheers, Daniel