Re: [fossil-users] Mix of UTF-8 and CP1251 (Russian cyrillic) in project

2010-06-26 Thread Michal Suchanek
On 25 June 2010 21:37, Owen Shepherd owen.sheph...@e43.eu wrote: On 25 June 2010 19:36, Michal Suchanek hramr...@centrum.cz wrote: On 25 June 2010 20:18, Owen Shepherd owen.sheph...@e43.eu wrote: One of the reasons that I'm a fan of SCSU is that, with even a relatively simple encoder, it

Re: [fossil-users] Mix of UTF-8 and CP1251 (Russian cyrillic) in project

2010-06-26 Thread Owen Shepherd
On 26 June 2010 13:47, Michal Suchanek hramr...@centrum.cz wrote: On 25 June 2010 21:37, Owen Shepherd owen.sheph...@e43.eu wrote: On 25 June 2010 19:36, Michal Suchanek hramr...@centrum.cz wrote: On 25 June 2010 20:18, Owen Shepherd owen.sheph...@e43.eu wrote: One of the reasons that I'm a

Re: [fossil-users] Mix of UTF-8 and CP1251 (Russian cyrillic) in project

2010-06-26 Thread Richard Hipp
On Sat, Jun 26, 2010 at 12:05 PM, Owen Shepherd owen.sheph...@e43.euwrote: SCSU is not that useful for storage compression since fossil already uses zlib and it has no other advantages I am aware of. Deflate compression is only applied to commits. Deflate has significant overhead, and is

Re: [fossil-users] Mix of UTF-8 and CP1251 (Russian cyrillic) in project

2010-06-26 Thread Michal Suchanek
On 26 June 2010 18:05, Owen Shepherd owen.sheph...@e43.eu wrote: On 26 June 2010 13:47, Michal Suchanek hramr...@centrum.cz wrote: On 25 June 2010 21:37, Owen Shepherd owen.sheph...@e43.eu wrote: On 25 June 2010 19:36, Michal Suchanek hramr...@centrum.cz wrote: On 25 June 2010 20:18, Owen

Re: [fossil-users] Mix of UTF-8 and CP1251 (Russian cyrillic) in project

2010-06-26 Thread Owen Shepherd
On 26 June 2010 20:59, Michal Suchanek hramr...@centrum.cz wrote: Indeed, the loss is at the end in case of web pages, parts which are missing in the middle are result of inserting different streams so SCSU would not suffer more breakage than other encodings. Still there is no apparent benefit

Re: [fossil-users] Mix of UTF-8 and CP1251 (Russian cyrillic) in project

2010-06-25 Thread Ruslan Popov
Sergey, now I use emacs and its mule-utf-8-unix encoding for commit buffer. On Fri, Jun 25, 2010 at 2:15 PM, Sergey Sfeli sergey.sf...@gmail.comwrote: Ruslan Popov wrote: I've tried to use Fossil on russian version of Windows 7. I made commit with russian text in comment, when I run the

Re: [fossil-users] Mix of UTF-8 and CP1251 (Russian cyrillic) in project

2010-06-25 Thread Michal Suchanek
On 25 June 2010 12:15, Sergey Sfeli sergey.sf...@gmail.com wrote: Ruslan Popov wrote: I've tried to use Fossil on russian version of Windows 7. I made commit with russian text in comment, when I run the UI and look at timeline, I saw that russian text looks like squares. Why don't just use

Re: [fossil-users] Mix of UTF-8 and CP1251 (Russian cyrillic) in project

2010-06-25 Thread Michael Richter
On 25 June 2010 21:34, Michal Suchanek hramr...@centrum.cz wrote: Perhaps fossil should have a system encoding which it would get from the environment (locales, windows codepage) and mark all commit messages with it. I vote that this is an extraordinarily bad idea. Fossil is a *distributed*

Re: [fossil-users] Mix of UTF-8 and CP1251 (Russian cyrillic) in project

2010-06-25 Thread Owen Shepherd
On 25 June 2010 11:15, Sergey Sfeli sergey.sf...@gmail.com wrote: Ruslan Popov wrote: I've tried to use Fossil on russian version of Windows 7. I made commit with russian text in comment, when I run the UI and look at timeline, I saw that russian text looks like squares. Why don't just

Re: [fossil-users] Mix of UTF-8 and CP1251 (Russian cyrillic) in project

2010-06-25 Thread Owen Shepherd
The trouble is that UTF-8 is a poor standard. It bloats many texts, is quite expensive to parse, and has only one redeeming feature: It never creates embedded nulls. I suppose that it shares its encoding with ASCII is a feature too, but only a minor one. Personally, I think that most systems

Re: [fossil-users] Mix of UTF-8 and CP1251 (Russian cyrillic) in project

2010-06-25 Thread Owen Shepherd
One of the reasons that I'm a fan of SCSU is that, with even a relatively simple encoder, it produces output which is comparable in efficiency to that of most legacy encodings. On 25 June 2010 18:53, Michal Suchanek hramr...@centrum.cz wrote: On 25 June 2010 18:09, Owen Shepherd

Re: [fossil-users] Mix of UTF-8 and CP1251 (Russian cyrillic) in project

2010-06-25 Thread Andreas Kupries
As an FYI I googled SCSU http://en.wikipedia.org/wiki/Standard_Compression_Scheme_for_Unicode Owen Shepherd wrote: One of the reasons that I'm a fan of SCSU is that, with even a relatively simple encoder, it produces output which is comparable in efficiency to that of most legacy encodings.

Re: [fossil-users] Mix of UTF-8 and CP1251 (Russian cyrillic) in project

2010-06-25 Thread Michal Suchanek
On 25 June 2010 20:18, Owen Shepherd owen.sheph...@e43.eu wrote: One of the reasons that I'm a fan of SCSU is that, with even a relatively simple encoder, it produces output which is comparable in efficiency to that of most legacy encodings. SCSU is a horrendous encoding because it uses

Re: [fossil-users] Mix of UTF-8 and CP1251 (Russian cyrillic) in project

2010-06-25 Thread Owen Shepherd
On 25 June 2010 19:36, Michal Suchanek hramr...@centrum.cz wrote: On 25 June 2010 20:18, Owen Shepherd owen.sheph...@e43.eu wrote: One of the reasons that I'm a fan of SCSU is that, with even a relatively simple encoder, it produces output which is comparable in efficiency to that of most