Hello!

On 16/02/2016 22:56, Andrea Pescetti wrote:
I have tested the new "unduplicate simple meanings" on the US thesaurus
and it found duplicated meanings in 752 synonyms:

(moving the conversation to BCC for l10n and QA; interested people can follow-up on dev)

May I know what is the definition of a "duplicate" meaning for your tool? It looks interesting. I may want to test it on the Italian dictionary too.


Andrea, it means for example:
apple|3:
one
two
one

It means that it would remove the "one" once becoming:
apple|2:
one
two



Not sure if I should convert the thesaurus to UTF-8 and then remove the
duplicates... what do you suggest?

I converted the Italian dictionary to UTF-8 long ago without any reported issues. I fail to see how/this is related to a de-duplication of some kind.


Converting the dictionary to UTF-8 wouldn't remove the duplicates. I just mentioned it because my tool uses UTF-8 and warns that opening non-UTF-8 files may lead to damaged characters :-[

To make sure (100%) that no data is lost, I would first need to convert it to UTF-8.

:-P

Andrea, on Friday I am planning an official release for PTG (Windows and Linux) but you can download a Windows only version from my Dropbox:
https://dl.dropboxusercontent.com/u/30674540/ProofingToolGUI_V0092.zip

It has all the files in the ZIP including the source, images, etc. and the two executables for Windows (x64 and x86).

Thanks!

Kind regards,
     >Marco A.G.Pinto
       ------------------------

--

Reply via email to