If you wish to choose weblate I am sure we can work with what it does

On Mon, Aug 15, 2022 at 12:12 AM Alexandre Gacon <alexandre.ga...@gmail.com>
wrote:

> Hi Jody,
>
> I had a look at the git history for the UTF8 file: the file was created by
> you by renaming the CN ISO file (
> https://osgeo-org.atlassian.net/browse/GEOS-10282). If I understand the
> JIRA issue correctly, this was related to the encoding issue in the
> translation files (the same old story) and the idea was perhaps to migrate
> all the translations to UTF-8 encoding.
>
> Weblate UI manages to display the characters in all options:
> - ISO-8859 characters in ISO-8859 encoded file
> - U encoded characters (\u00e9 for example) in ISO-8859 encoded file
> - UTF-8 file
>
> Weblate can read and write either ISO-8859 files or UTF-8 files but inside
> for a given component you can have only one of the two options. For
> ISO-8859 files, weblate can read either ISO-8859 characters (é) and U
> encoded characters (\u00e9) but when you change a translation, it will
> always be written in U encoded characters.
>
> Regards
> Alexandre
>
> Le ven. 12 août 2022 à 21:46, Jody Garnett <jody.garn...@gmail.com> a
> écrit :
>
>> You may have to search back in the geoserver-devel history or bug tracker
>> for details. I assume we were provided utf8 translations and wished to
>> preserve them?
>>
>> When using the UI in weblate can folks see the native characters
>> regardless of encoding used?
>>
>> What I remember is that we needed UTF8 to support some
>> chinese characters; but the java properties file format is ISO-8859-1 (like
>> the actual format on disk).
>> Wicket had some naming convention where you could append utf8 to the
>> filename - but no other tools understand this.
>>
>> If I understand you correctly:
>> - Weblate wants to work with UTF8 if it can?
>> - Java properties file want ISO-8859-1
>>
>> Is there any way to hint to weblate what encoding to use? Looks like yes
>> https://docs.weblate.org/no/latest/formats.html#java-properties
>> I cannot tell from that page if you can choose the encoding of individual
>> property files (ie do this manually)?
>>
>> Following links weblate uses
>> http://docs.translatehouse.org/projects/translate-toolkit/en/latest/formats/properties.html
>>
>> Which uses:
>> -
>> http://docs.translatehouse.org/projects/translate-toolkit/en/latest/commands/prop2po.html
>>
>> Reading through those docs there is an option "--personality=mozilla"
>> which would preserve UTF8 characters rather than force them into Latin1
>> encoding.
>> --
>> Jody
>>
>>
>> On Fri, 12 Aug 2022 at 04:25, Alexandre Gacon <alexandre.ga...@gmail.com>
>> wrote:
>>
>>> I try to keep only the UTF-8 file as the source for Chinese, along with
>>> the other languages in ISO-8859-1: the display of the file content in
>>> weblate is totally broken. I have to define another fake component
>>> configured to support UTF-8 encoding to be able to manage it correctly.
>>>
>>> What is the reason to keep the two encoding?
>>>
>>> Alexandre
>>>
>>> Le ven. 12 août 2022 à 08:26, Alexandre Gacon <alexandre.ga...@gmail.com>
>>> a écrit :
>>>
>>>> Hi Jody,
>>>>
>>>> You are guessing well: it is all about the Chinese language : for
>>>> weblate it is the same language defined twice so it cannot cope with it.
>>>>
>>>> I have to check how weblate can work with having one of the languages
>>>> in UTF-8 whereas the other ones are in ISO-8859-1 : I fear that it will
>>>> mean to have two components side by side.
>>>>
>>>>  For your last point, it seems to work well for ages: Romanian is
>>>> mixing both characters encoding whereas Japanese and Korean are totally
>>>> with unicode characters inside a ISO-8859-1 encoded file.
>>>>
>>>> Alexandre
>>>>
>>>> Le jeu. 11 août 2022 à 20:36, Jody Garnett <jody.garn...@gmail.com> a
>>>> écrit :
>>>>
>>>>> Weblate is a good idea; we held back perviously because of lack of
>>>>> hosting. However if OSGeo is able to host :)
>>>>>
>>>>> I do not understand about two items for the same language: can you
>>>>> provide links in the github repo? Do you mean two properties files; or two
>>>>> entries in the same property file. Or two entries in different property
>>>>> files?
>>>>>
>>>>> I am guessing you mean two property files for the chinese language;
>>>>> where we followed some wicket convention for having property files in
>>>>> different encodings. I think we should just use the utf8 encoding? Which 
>>>>> is
>>>>> not a java standard but wicket supports it.
>>>>>
>>>>> -
>>>>> https://github.com/geoserver/geoserver/blob/main/src/web/wms/src/main/resources/GeoServerApplication.properties
>>>>> -
>>>>> https://github.com/geoserver/geoserver/blob/main/src/web/wms/src/main/resources/GeoServerApplication_zh.properties
>>>>> -
>>>>> https://github.com/geoserver/geoserver/blob/main/src/web/wms/src/main/resources/GeoServerApplication_zh_CN.utf8.properties
>>>>>
>>>>> My understanding is we should keep the utf8 properties if weblate is
>>>>> willing to understand utf8 encoding??
>>>>>
>>>>> About replacing text with unicode; not sure how that works with java
>>>>> properties being specified in a in ISO-8859-1 as part of the java api.
>>>>> --
>>>>> Jody Garnett
>>>>>
>>>>>
>>>>> On Wed, 10 Aug 2022 at 23:11, Alexandre Gacon <
>>>>> alexandre.ga...@gmail.com> wrote:
>>>>>
>>>>>> Hi all,
>>>>>>
>>>>>> I am studying if we could migrate the translation tooling from
>>>>>> Transifex to Weblate. I have started this because with the current
>>>>>> setup Transifex is changing a lot of translations when I upload updates 
>>>>>> of
>>>>>> the translation source, making it difficult to do the synchronization
>>>>>> between GitHub and Transifex.
>>>>>>
>>>>>> Weblate is a copyleft libre software and OSGeo is hosting its own
>>>>>> instance, already used by several OSGeo projects (postgis, pgrouting and
>>>>>> grass gis at least).
>>>>>>
>>>>>> Thanks to Regina Obe, I have set up a GeoServer project on the OSGeo
>>>>>> instance to study how weblate works and if there is something which
>>>>>> can prevent us from using it.
>>>>>>
>>>>>> I have already two points to share with you to get some feedback:
>>>>>>
>>>>>>    - First, when you configure a component into weblate, you cannot
>>>>>>    have two items for the same language, even if they are in a different
>>>>>>    encoding. As a consequence, I cannot directly integrate most of the 
>>>>>> core
>>>>>>    components since they contain 2 files for the Chinese language: is it
>>>>>>    something which can be changed? Which one is used by GeoServer?
>>>>>>    - Second, when you change the translation of a text in weblate,
>>>>>>    it automatically replaces special characters by their equivalent in
>>>>>>    unicode, even if the character exists in the ISO-8859-1 encoding. For
>>>>>>    example:
>>>>>>
>>>>>> org.geoserver.security.GeoServerAuthenticationKeyFilter.name=Clé
>>>>>> d'authentification
>>>>>> is replaced by
>>>>>> org.geoserver.security.GeoServerAuthenticationKeyFilter.name=Cl\u00E9
>>>>>> d'authentification
>>>>>>
>>>>>> (my own change in the translation was to add a space at the end of
>>>>>> the string, to match the original layout of the source string)
>>>>>>
>>>>>> From a technical point of view, it does not break anything but it
>>>>>> would make it more difficult to work on a translation without using
>>>>>> weblate.
>>>>>>
>>>>>> Do you see any problems around these two points? Anything else to
>>>>>> check?
>>>>>>
>>>>>>
>>>>>> --
>>>>>> Alexandre Gacon
>>>>>> _______________________________________________
>>>>>> Geoserver-devel mailing list
>>>>>> Geoserver-devel@lists.sourceforge.net
>>>>>> https://lists.sourceforge.net/lists/listinfo/geoserver-devel
>>>>>>
>>>>>
>>>>
>>>> --
>>>> Alexandre Gacon
>>>>
>>>
>>>
>>> --
>>> Alexandre Gacon
>>>
>>
>
> --
> Alexandre Gacon
>
-- 
--
Jody Garnett
_______________________________________________
Geoserver-devel mailing list
Geoserver-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/geoserver-devel

Reply via email to