Hello!
On 16/02/2016 22:56, Andrea Pescetti wrote:
I have tested the new "unduplicate simple meanings" on the US thesaurus
and it found duplicated meanings in 752 synonyms:
(moving the conversation to BCC for l10n and QA; interested people can
follow-up on dev)
May I know what is the definition of a "duplicate" meaning for your
tool? It looks interesting. I may want to test it on the Italian
dictionary too.
Andrea, it means for example:
apple|3:
one
two
one
It means that it would remove the "one" once becoming:
apple|2:
one
two
Not sure if I should convert the thesaurus to UTF-8 and then remove the
duplicates... what do you suggest?
I converted the Italian dictionary to UTF-8 long ago without any
reported issues. I fail to see how/this is related to a de-duplication
of some kind.
Converting the dictionary to UTF-8 wouldn't remove the duplicates. I
just mentioned it because my tool uses UTF-8 and warns that opening
non-UTF-8 files may lead to damaged characters :-[
To make sure (100%) that no data is lost, I would first need to convert
it to UTF-8.
:-P
Andrea, on Friday I am planning an official release for PTG (Windows and
Linux) but you can download a Windows only version from my Dropbox:
https://dl.dropboxusercontent.com/u/30674540/ProofingToolGUI_V0092.zip
It has all the files in the ZIP including the source, images, etc. and
the two executables for Windows (x64 and x86).
Thanks!
Kind regards,
>Marco A.G.Pinto
------------------------
--