#28: Encoding problems when converting svn -> darcs
--------------------------------------------+-------------------------------
Reporter: Luca <[EMAIL PROTECTED]> | Owner: lele
Type: defect | Status: new
Priority: critical | Milestone: VersionOne
Component: tailor | Version: 0.9
Keywords: svn svndump non ascii error í |
--------------------------------------------+-------------------------------
I'm having some problems with the encoding of the character '''í''' ('i'
with acute accent, í as HTML entity). When converting a svn
repository to darcs I get this message:
{{{
00:37:07 [I] Changeset "43"
00:37:07 [I] Log message: - Nuevo nivel de logging CRITICAL (L_CRI) para
concordar con python.
- Mínimo cambio en el formato de logging.
- Cambio de sección de configuración de DB_DataObject a DBO.
00:37:07 [I] 110 pending changesets in state file
00:37:07 [C] Upstream change application failed
Configuration error: 'ascii' codec can't encode character u'\xed' in
position 216: ordinal not in range(128): it seems that current encoding
"UTF-8" cannot properly represent at least one of the characters in the
upstream changelog. You need to use a wider character set, using
"encoding" option.
}}}
My locale is UTF-8, but I even used the encoding option with no results.
The weir thing is other non-ascii characters seems to work fine (á, é,
ó, ú). When I use the svndump as the source, I've got no errors, but
'í' characters are not encoded properly:
{{{
Fri Feb 4 12:19:47 ART 2005 luca
* - Nuevo nivel de logging CRITICAL (L_CRI) para concordar con python.
- MÃ\adnimo cambio en el formato de logging.
- Cambio de sección de configuración de DB_DataObject a DBO.
}}}
As you can see, '''ó''' in '''configuración''' is just fine, but
'''í''' in '''Mínimo''' is encoded as '''MÃ''\ad''nimo''', which is
wrong.
It's easy to reproduce the problem:
{{{
cd /tmp
svnadmin create testrepo
svn co file:///tmp/testrepo testwc
touch testwc/test
svn add testwc/test
svn ci -m 'í' testwc
}}}
Now you can tailor this repository to convert it to darcs with '''svn'''
as repo and you'll get the error, or 'svnadmin dump'it and use
'''svndump''' as repo to get the wrong encoding.
Versions:
* Subversion: 1.2.3 (r15833)
* Darcs: 1.0.4
* Tailor: 0.9.19
--
Ticket URL: <http://progetti.arstecnica.it/tailor/ticket/28>
Tailor <http://www.darcs.net/DarcsWiki/Tailor>
An inter-VCs changeset exchanger_______________________________________________
Tailor mailing list
[email protected]
http://lists.zooko.com/mailman/listinfo/tailor