English Dictionaries gives an error while attempting to download
Hello! As usual, I have updated the English Dictionaries, and it gives an error while attempting to download. Could someone take a look at the issue?: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice Thank you! Kind regards, >Marco A.G.Pinto -- -- signature.asc Description: OpenPGP digital signature
"are flagged" vs "appear as a typo"
Hello! Sorry for this philosophical question. Can I replace in the GB dictionary FAQ the word "appears as a typo" by "are/is flagged"? FAQ: http://marcoagpinto.cidadevirtual.pt/faq.html I believe that both mean the same, but I need some advice before changing all occurrences. Thanks! Kind regards from the English Dictionaries maintainer, >Marco A.G.Pinto -- -- signature.asc Description: OpenPGP digital signature
Fwd: Proofing Tool GUI 3.0 - beta - build 104 - 2017-03-21 - Released
Hello! Just to inform everyone that I have released an update for Proofing Tool GUI: http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html The main changes are: - GUI improvements and fixes; - It is now possible to have custom "AFF Aid" files with 16x16 PNGs flags; - Support for FLAG NUM and LONG (no recursivity yet); - Modern menus look for Windows; - Pop-up menu now has an extra option "Clone" (very useful); - Major speed gain in the .AFF optimising code (gl_ES). When I have more time I need to add an "undo/redo" feature (Shantanu suggested it) and optimise the .AFF decoding words as I came up with a brilliant idea a day or two ago (just like I had for the gl_ES). Thanks! Kind regards, >Marco A.G.Pinto -- -- -- signature.asc Description: OpenPGP digital signature
Updated the English dictionaries - 2017-03-01
Hello! I have updated the English dictionaries: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice Dates: - en_AU (Australian) (updated: 2017-01-22) - en_CA (Canadian) (updated: 2017-01-22) - en_GB (British) (updated: 2017-03-01)* - en_US (American) (updated: 2017-01-22) - en_ZA (South African) (updated: 2012-07-10) * British has 638 new words (2017-02-01) + 326 new words (2017-03-01). However, as usual, it gives an error while downloading: "file not found" or whatever... could an admin look at it? Thanks! Kind regards, >Marco A.G.Pinto -- -- signature.asc Description: OpenPGP digital signature
Proofing Tool GUI 3.0 - beta - build 100 - 2017-01-01 Released
Hello! Just to announce that I have updated PTG for Windows and Linux. It can be downloaded from its official site: http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html The latest build has the GUI improved a lot and also accepts items multiselection and to choose the window size: 1024x600 or 1280x600. In the next update I will try to implement GTK3 support (Linux) and allow undo/redo like Shantanu suggested. Also, notice the extra tabs which are still not working but they mean that in a future version it will be possible to edit/create rules for LanguageTool and have language specific operations (text processing). Does anyone know where I can get an hyphenator so that I see how it works and can code its tab in PTG? This tab has been blank for years. PS->Notice that I added some persons in Bcc in this e-mail. Thanks! Kind regards from the English Dictionaries maintainer, >Marco A.G.Pinto -- -- signature.asc Description: OpenPGP digital signature
English Dictionaries update - 2017-01-01
Hello! I have updated the English dictionaries: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice But, while trying to download, it says "file not found". Could someone take a look at it? Thanks! Kind regards, >Marco A.G.Pinto -- -- signature.asc Description: OpenPGP digital signature
AOO 4.2.0 - English Dictionaries + pt_PT Autocorrect
Hello! I haven't been following the discussion carefully thus this e-mail. Could someone tell me when I should reopen the cases: 1) Update English Dictionaries *(now to 1-APR-2016)*; 2) Update the pt_PT autocorrect. Please notice that I opened those two cases a couple of months ago or so when it was mentioned about a 4.2.0 release. Regarding 2) I have added near 90 new words since the previous upload to Bugzilla which means that it will have around 250 new words in total since the year 2010. Thanks! Kind regards, >Marco A.G.Pinto --- -- signature.asc Description: OpenPGP digital signature
English Dictionaries updated - 2016-03-30
Hello! I have updated the English Dictionaries: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice You can either download it from the link above or wait a few days before AOO triggers it. Changes: Updated the Dictionaries: - Australian (2016-03-14)* - British (Marco A.G.Pinto)** - South African (Dwayne Bailey) (2012-07-10) * Australian had all UTF-8 corrupt words fixed by Marco. ** British has 819 new words and attempted to make compatible with the command line using the UTF-8 apostrophe. Thanks! Kind regards, >Marco A.G.Pinto -- signature.asc Description: OpenPGP digital signature
Proofing Tool GUI 3.0 (beta) - build 96 - 2016-03-23
Hello! I have just released a bugfix of PTG. The .AFF file would get garbage at the end: What is strange is that I only noticed it yesterday. It seems that PureBasic for a couple of versions or so required a special "flag" in the command I was using to populate the EditorGadget after loading the .AFF file. Anyway, one just deletes the garbage and resaves. You can download PTG build 96 from: http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html Thanks! Kind regards, >Marco A.G.Pinto -- signature.asc Description: OpenPGP digital signature
Australian dictionary - Fixes - 2016-03-14
Hello! Tom Anderson opened a case in Bugzilla about corrupted accents in words in UTF-8 due to a bad conversion by the old maintainers. I have tried to fix all words myself by auto-correct + proofreading using en_GB which is the dictionary with more words. Please check: https://bz.apache.org/ooo/show_bug.cgi?id=126863 and tell me if you find any incorrect words in the .ZIP . Please also advice me what I should add in the README file according to this fix. If all goes well, this dictionary will be updated in the next release. Thanks! Kind regards, >Marco A.G.Pinto -- signature.asc Description: OpenPGP digital signature
Proofing Tool GUI 3.0 (beta) - build 95 - 2016-03-02
Hello! I have just released build 95, downloadable from: http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html I fixed a crash in Windows (caused by PureBasic 5.41 - PB 5.42 has been released and fixed it), and also improved the Autocorrect code, also adding colours to it: So far I have added 200+ new words to the pt_PT autocorrect. Before the official release of AOO 4.2 I will upload the most recent file. Thanks! Kind regards, >Marco A.G.Pinto --- --
English Dictionaries updated - 1-MAR-2016
Hello! I have updated the English Dictionaries which can be downloaded from: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice Or you can wait a few days until AOO triggers it. Changes: *MAGP 2016-03-01** ** **Updated the Dictionaries:** **- British (Marco A.G.Pinto)*** ** * British has 935 new words.** **Converted the US Thesaurus into UTF-8 and removed 752 duplicated meaning*s Thanks! Kind regards, >Marco A.G.Pinto --- --
Proofing Tool GUI 3.0 - beta - build 94 - 2016-02-19 Released
Hello! *With Bcc to: l10n and QA.* Just to inform everyone that, as planned, I have released an update for my Hunspell tool: Proofing Tool GUI. It can be downloaded from: http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html I had a previous build ready to ship but, while testing it in Ubuntu, there were some issues using CURS UP and CURS DOWN to scroll the thesaurus, which I seem to have fixed in build 94. Please give it a try. Thanks! Kind regards, >Marco A.G.Pinto --- --
Thesaurus US - 752 synonyms with duplicate meanings
Hello! Today I have finished coding a bit more of my software Proofing Tool GUI. I have tested the new "unduplicate simple meanings" on the US thesaurus and it found duplicated meanings in 752 synonyms: Not sure if I should convert the thesaurus to UTF-8 and then remove the duplicates... what do you suggest? I am exhausted and tomorrow I will try to resume the coding and attempt to release an official build of PTG for Windows and Linux on Friday. The good news is that I have fixed the pause issue in Linux by converting the remaining of the code into dynamic arrays (the thesaurus part) instead of loading all synonyms into the ListIconGadget. I have been postponing this for a couple of years or so but I have finally done it (it took a long time because I would have to make changes all over the code). Just to share the news! Thanks for your time! Kind regards, >Marco A.G.Pinto --- --
English Dictionaries updated - 2016-02-01 + Autocorrect pt_PT updated 2016-01-29
Hello! I have updated the English Dictionaries: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice Changes: - American (Kevin Atkinson) - Canadian (Kevin Atkinson) - British (Marco A.G.Pinto)* * British has 1622 new words. As usual I always update a couple of days or so before the official date because I have it packed for several days already and it is risky to leave things to be done in the last moment. Also notice that I have updated the Autocorrect for pt_PT adding around 168 new words. This is an ongoing task since I have grammar books from where I am taking the words and it will take months to add the many dozens or even hundreds of words. The .DAT file became 5 kB compared to the old 15 kB from 2010 because I used commercial software to compress. Andreas, I have opened the following cases on Bugzilla: Bugzilla - English Dictionaries: https://bz.apache.org/ooo/show_bug.cgi?id=126814 Bugzilla - Autocorrect pt_PT https://bz.apache.org/ooo/show_bug.cgi?id=126815 Thanks for your time! Kind regards from your friend, >Marco A.G.Pinto --- --
Remove comments in English Dictionaries?
Hello! I am writing to ask if it is okay for me to remove (hide) the comments in the English Dictionaries extension site. This happens because I have just added the text to the description, below the visit the "Proofing Tool GUI project for help blah blah": "Or use the mailing lists." The thing is the comments are always the same which is annoying and they fill the page with garbage. Anyway, if someone asks how to install, just give them the direct URL with the help steps: http://marcoagpinto.cidadevirtual.pt/en_installing.html What is your opinion? Should I hide the comments? Thanks! Kind regards, >Marco A.G.Pinto --
Proofing Tool GUI 3.0 - build 89 - 20160109 - Autocorrect working
Hello, I have just released a new build of PTG, which now allows to edit the "Autocorrect words": Above is the en_GB autocorrect file. You can download PTG from the official site: http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html Special instructions for working with autocorrect files: http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html#build89 The GUI makes it a lot easier to edit because it avoids using the command line. I have checked the en_GB and pt_PT XMLs and they are from 2010, so I was wondering if someone could work on the English ones? I hereby offer myself to update the pt_PT since I have several grammar books with dozens/hundreds of examples of incorrect/correct words. How shall I do to make the pt_PT be updated both in AOO and LO? Thanks for your time! Kind regards, >Marco A.G.Pinto --
Re: Cannot download 4.1.2
Tenta: https://www.openoffice.org/pt/ On 05/01/2016 21:05, MARIA LIDRONETA AMBRÓSIO PEREIRA BITENCOURT wrote: desculpe-me, mas não sei inglês. conheço apenas algumas palavras. Hello Marcus= Alo Marcos; esse é o motivo de não conseguir baixar corretamente Enviado do Email do Windows De: Marcus Enviado: 5 de janeiro de 2016 16:56 Para: l10n@openoffice.apache.org Assunto: Re: Cannot download 4.1.2 Am 01/05/2016 06:59 PM, schrieb Dick Groskamp: Op 5-1-2016 om 15:26 schreef Frank Micheelsen: Hello Marcus Thank for your answer. I try to download from the site, where the 4.1.2 version is introduced. The site contained mainly green and yellow colors and it apparently does not exist anymore. It can not be opened anymore. Normal download procedure runs ok, but it is only the 4.1.1 programme english version that is being installed om my MacBook Air. The danish Language programme is also downloaded, but as I do not have the 4.1.2 it cannot be installed. I hope this information can help, as I am a very satisfied user of OpenOffice and I have made donation although my creditcard was copied in the US, which is the only place I have used it. Best regards Frank Den 2. jan. 2016 kl. 18.14 skrev Marcus: Am 01/02/2016 11:51 AM, schrieb Frank Micheelsen: .. Why not? Have OS X 10.11.2 Captain Ps.: Please give me an answer. sure, we can try to give you an answer. But we need more details: - What's the URL you want to download from? - What's happening when downloading? - Do you get any error message(s)? - If so, please give us the exact wording. Thanks Marcus - To unsubscribe, e-mail: l10n-unsubscr...@openoffice.apache.org For additional commands, e-mail: l10n-h...@openoffice.apache.org Frank, I saw there is a problem with the official Danish downloadpage. It is giving you a 404-error. have you used "da" or "dk" in the URL? "da" is the right one. Marcus - To unsubscribe, e-mail: l10n-unsubscr...@openoffice.apache.org For additional commands, e-mail: l10n-h...@openoffice.apache.org --
English Dictionaries updated - 1-JAN-2016
Hello! I have updated the English Dictionaries. The official release date was 1-JAN-2016 but everything has been ready for a week already. You can wait a few days until AOO triggers it, or download it now from the official site: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice Also, I have updated all pages of the Proofing Tool GUI project: http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html This includes a new installation guide with new information and screenshots. The changes in this version are: - Updated en_GB (721 new words). Added 721 new words, including lots of Star Wars proper names. Even the Oxford Dictionaries have words such as "Jedi" and "lightsabre", so it was about time I added most of them. I wish everyone a Happy New Year! Kind regards from your friend and brother, >Marco A.G.Pinto --- --
English Dictionaries updated - 2015-12-01
Hello! Just to inform everyone that I have updated the English Dictionaries. Manual download link: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice Noticed that I skipped the previous version because a new AOO was being released. So, the news is: MAGP 2015-12-01 Updated the Dictionaries: - British (Marco A.G.Pinto) * 367 new words MAGP 2015-11-01 Updated the Dictionaries: - British (Marco A.G.Pinto) * 446 new words Enjoy! Kind regards, >Marco A.G.Pinto --- --
Re: English Dictionaries - Remove a paragraph from the description
Done! On 10/11/2015 08:28, Andrea Pescetti wrote: Marco A.G.Pinto wrote: Can I remove the paragraph in the official extensions site: This is a locally hosted copy of the English dictionaries with fixed dash handling and new ligature and phonetic suggestion support extension. - See more at: ... I created the extension that is now maintained by you because, due to the new build constraints, we needed to have the extension available on the extensions.openoffice.org site and the one we were using was hosted externally. So it was indeed born as a simple copy of another extension. In the last months/years instead you improved it a lot, so the paragraph is no longer correct. I suggest that you still mention the original extension in internal files (README, Changelog) but feel free to remove the paragraph that I had inserted when creating that extension. Regards, Andrea. --
English Dictionaries - Remove a paragraph from the description
Hello! Can I remove the paragraph in the official extensions site: *"* *This is a locally hosted copy of the English dictionaries with fixed dash handling and new ligature and phonetic suggestion support extension. - See more at: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice#sthash.9xwP5Me6.dpuf* *This is a locally hosted copy of the English dictionaries with fixed dash handling and new ligature and phonetic suggestion support extension."* This is a locally hosted copy of the English dictionaries with fixed dash handling and new ligature and phonetic suggestion support extension. - See more at: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice#sthash.9xwP5Me6.dpuf The link it points to is from 2010 and it is almost 2016 so the old text is useless and outdated and we should place the minimum amount of text possible to keep it simple to understandable to all users. Andrea, what is your opinion? Thanks! Kind regards from your friend, >Marco A.G.Pinto --
English dictionaries updated - 2015-10-01
Hello! I have updated the English dictionaries adding 462 new words to en_GB. Andreas, since AOO 4.1.2 is delayed, could you add this one? Thanks! Kind regards, >Marco A.G.Pinto --- --
Updating English dictionaries today?
Hello, Andrea, I was wondering if, since there are several release blockers around, if I could still be in time to update the English dictionaries to be used in AOO 4.1.2? I have everything ready for an update and I could do it today (although the official release date is always the 1st of every month). I have only added 320 new words to en_GB this month but Kevin Atkinson updated US+CA which is good. I also have now a LibreOffice account where I release the dictionaries there too. Thanks! Kind regards from your friend, Marco A.G.Pinto --
Re: Dictionary updates for OpenOffice 4.1.2
I have reopened the case since the English dictionaries (at least the en_GB) have thousands of new words since the last AOO release a year ago. I have gone beyond the 14'000 new words in en_GB :-P Thanks! Kind regards, Marco A.G.Pinto --- On 08/08/2015 19:29, Andrea Pescetti wrote: If you want a dictionary to be updated in OpenOffice 4.1.2, please open an issue like https://bz.apache.org/ooo/show_bug.cgi?id=125965 (note: for non-English builds, you will use the Native-Lang component) AND notify this list, or just notify this list. Dictionary updates suggested within the next week (by 16 August) will be included in OpenOffice 4.1.2. It will still be possible to suggest updates later, but I can't guarantee that we will have time to include them. Please remember to ALWAYS notify this list: Bugzilla has a lot of noise and if an issue is not filed properly we may miss it. Regards, Andrea. --
English Dictionaries updated - 1-AUG-2015
Hello! I have just confirmed that the latest update is fully downloadable from AOO's Extension page: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice You can either directly download it or wait until AOO triggers it. The news is that it is now compatible with LibreOffice as well. I have created a LO account and I am just waiting for the administrator to give me access rights. en_GB: - I have added a total of 541 new words; - I have added many dozens of English cities and names. I examined full Wikipedia pages with British cities. en_US/CA: Kevin Atkinson has been busy and only in August he will have some free time to update. Kind regards, Marco A.G.Pinto -- --
English Dictionaries - Issue with Extension site
Hello! I was doing the monthly update and after uploading the OXT and filling-up the fields, I attempted to download it in order to check if it was working. It opened the usual SourceForge page but I got a warning on Firefox that the file wasn't found. So, I tried to reupload it and republish it and then unpublish the first one. I got a warning: This extension is currently not published, please contact the system administrator at aooad...@users.sourceforge.net to have it published. Now I can only access the extension page when I am logged in, otherwise I get an Access Denied: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice I have already e-mailed the SourceForge guy but I was wondering if someone with administrator rights could make it work? Thanks! Kind regards, Marco A.G.Pinto -- --
AOO 5.0 - Portuguese dictionary pre- and -post reform
Hello! I was on Freenode IRC in the LibreOffice channels suggesting them to implement an ODT property for the Portuguese dictionaries in order to accept pre- and post- reform, so that later, when it becomes official, AOO would have to implement it as well. They told me that it depends on the BCP 47 language tags (not supported by AOO) and there was a special site for it: http://langtag.net Then, they found it was already there (Updated Mon 8 Jun 2015): http://r12a.github.io/apps/subtags/ - search for pt-ao1990 - returns valid subtag/variant If we expand show notes - it shows that it was updated on June 8th, 2015 (the channel operator said: assume updated = newly created in this case). The post-reform is based on the language agreement of 1990. Could someone implement this into AOO 5.0? Then, instead of two OXTs with the pt_PT dictionaries we could probably have just one with both variants, just like we have for the English dictionaries. Thanks! Kind regards, Marco A.G.Pinto --- --
Proofing Tool GUI 3.0 - Build 76 - It now saves .idx (Thesaurus) files
Hello! Today I have released build 76. I have added .idx support, which means that while saving a Thesaurus (.dat) it also creates an .idx . I know I have been trying to do it for several months or even a year, but the php script people told me about was very hard to understand. I also lacked free time because of my two jobs and PhD. Today I received an e-mail of a person asking me about .idx support and I replied that in August (the university is closed so I will only have one job then) I would probably have free time. But, I decided to give it a try today and in around 10-15 minutes it was done although I only tested it with the pt_PT Thesaurus... Alexandro Colorado has said several times that I was a lazy ass and he was right :-( Please give it a try: http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html Thanks! PS-To the guys complaining that every month an update tip appears on the top right of AOO, that happens because I update the English dictionaries monthly (so, every month a new release comes out and everyone gets a notification in AOO). Kind regards from your friend, Marco A.G.Pinto -- --
LanguageTool - place link on AOO site
Hello! Just wondering if you place a link to the grammar checker, LanguageTool, somewhere in the area of the AOO downloads: https://languagetool.org I am involved on the Portuguese rules for it. LanguageTool is an important add-on for AOO since it supports over 20 languages and it suggests grammar corrections, which is something missing in AOO but which MS Office has by default. Thanks! Kind regards, Marco A.G.Pinto -- --
English Dictionaries updated - 1-JUL-2015
Hello! I have released a new version of the English dictionaries for OpenOffice: - en_GB (British) (updated: 2015-07-01)* * en_GB has 634 new words. You canwait a few days until AOO triggers it or download it from the Extension site: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice Also, the Mozilla en_GB: https://addons.mozilla.org/en-US/firefox/addon/british-english-dictionary-2 And the Proofing Tool GUI project site: http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html (please notice the new link at the top: Get Involved) PS-I know it is still not officially the 1st of July but I have the update ready/packed for a couple of days or so. Thanks! Kind regards from your friend, Marco A.G.Pinto -- --
Proofing Tool GUI 3.0 - beta - build 76 - WIP
Hello! If you guys recall, I am improving en_GB with my own Hunspell tool. I have decided to dedicate some time in my vacations to add support for the three flags: 1) chr 2) num 3) long So far I have added the part that grabs the .AFF and optimises it (removes all spaces and creates an array with the position of each code). With numbers it takes 18 seconds on my T4300 laptop with the *gl_ES.aff* after I optimised the code to the limit... before optimising it, it would take near one minute. Since numbers have a variable size, that is what makes it slower, since I have to check when the number ends (last digit). I found dictionaries in my hdd with *2) num* and *3) long* which I had access to some time ago thanks to the LanguageTool team. I don't understand how recursivity works: *gl_ES.aff* SFX 520 er éren/666,134,135 er . is:infinitivo P6 + enclítico *nl.aff* SFX Yb 0 jes-/CaCbCp [^m] ts:NN2r These two examples shown above are from Mozilla dictionaries. Could one of you send me a small dictionary that I could test with Proofing Tool GUI and explain how I should make it work? Thanks! Kind regards, Marco A.G.Pinto --- --
English Dictionaries updated - 1-JUN-2015
Hello! I have released a new version of the English dictionaries for OpenOffice: - en_CA (Canadian) (updated: 2015-05-18) - en_GB (British) (updated: 2015-06-01)* - en_US (American) (updated: 2015-05-18) * en_GB has 564 new words. You canwait a few days until AOO triggers it or download it from the Extension site: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice Also, the Mozilla en_GB: https://addons.mozilla.org/en-US/firefox/addon/british-english-dictionary-2 And the Proofing Tool GUI project site: http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html In the second half of June I will try to enhance Proofing Tool GUI (the Hunspell tool I use for editing the dictionaries) in order for it to accept also: 1) FLAG long 2) FLAG num At first I thought it would be very hard but, during sleep, I came up with an easy idea (just waiting for my vacation). Still, it won't support recursivity since it is very complex... but I am always coming up with new ideas... and maybe I will figure out something then. Also, I only added 564 new words this time because my PhD supervisor has been over me saying More studying and less hobbies :-( Thanks! Kind regards from your friend, Marco A.G.Pinto -- --
English Dictionaries updated - 1-APR-2015
Hello! Just to inform everyone that I have updated the dictionaries for OpenOffice and Mozilla. Only the British one has been changed and it now has 738 new words since the previous version. In this version I focused a lot in the names of places and persons, since it is one of the biggest weaknesses in the dictionaries. The AOO may take a few days to trigger, but you can download it manually from its official site. Direct sites where it can be found: - AOO http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice - Mozilla (Firefox/Thunderbird/SeaMonkey) https://addons.mozilla.org/en-US/firefox/addon/british-english-dictionary-2 - Official project site (Proofing Tool GUI) http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html - Official project site (GitHub) https://github.com/marcoagpinto/aoo-mozilla-en-dict Please notice that I added a paragraph with some extra text at the bottom of GitHub to explain to people how to check if a word is already there. Could someone tell me if the text is okay or if should add/remove something? Thanks! Kind regards from your friend, Marco A.G.Pinto -- --
English Dictionaries updated - 1-MAR-2015
Hello! Just to inform everyone that I have updated the English Dictionaries: - en_CA (Canadian) (updated: 2015-02-15) - en_GB (British) (updated: 2015-03-01)* - en_US (American) (updated: 2015-02-15) * en_GB has 735 new words. This is the monthly update and it may take a few days to trigger in AOO, but you can download directly from the official repository: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice PS-It has already been packed (ready) for a couple of days, so no need to wait until the 1st of March! Thanks! Kind regards, Marco A.G.Pinto -- --
Dictionaries US/CA
Hello! Yesterday (2015-02-15) Kevin Atkinson released an update for US and CA. I have been working on en_GB but so far I have only added around 300 new words this month, because I have been terribly busy. I will try to add as many as possible before the official .OXT release on 2015-03-01. :-P The news: the next update will have newer US+CA+the usual monthly GB update. We are heading in the right direction and, in a year or two, we will have the best open-source dictionaries... at least for GB since it has over 10 000 new words since I embraced the project... I want people to write Masters/PhD thesis with AOO in a year from now... the trick is to use also the LanguageTool add-on for grammar checking (I am creating the Portuguese rules for it). PS-I can't remember if I already mentioned this, but I bought an Oxford Gold Account in the beginning of the year to have access to all examples. I hope people will realize that this is a serious effort to develop the best en_GB around, soon better than commercial software. :-) ;-) :-P :-D 8-) Kind regards, Marco A.G.Pinto -- --
Fwd: [Apache OpenOffice Extensions] Bud commented on English dictionaries for Apache OpenOffice
Hello! It seems to be true. I tried to download it too and it says that the file cannot be found. I have just re-uploaded it, but it still doesn't work. Could someone have a look at it? PS-I have to leave to my job soon and can't stay here for long. Thanks! Kind regards, Marco A.G.Pinto -- Forwarded Message Subject: [Apache OpenOffice Extensions] Bud commented on English dictionaries for Apache OpenOffice Date: Sat, 31 Jan 2015 00:15:30 -0800 From: Apache OpenOffice Extensions - Beta aooad...@users.sourceforge.net Reply-To: aooad...@users.sourceforge.net To: marcoagpinto marcoagpi...@mail.telepac.pt Dear Marco A.G.Pinto, The following comment has been posted by Bud on your extension English dictionaries for Apache OpenOffice: once i know how to do something no matter how quirky it is it seems 'incredibly simple to me as this site is for you but so far I have never had good luck just clicking on a link and downloading. I have no problem anywhere else getting files for my windows, ios or android device. I click the link here and it loops back around. there is no file to download this time. your reply is completely worthless. View the comment http://extensions.openoffice.org/en/comment/3830#comment-3830 Delete the comment http://extensions.openoffice.org/en/comment/delete/3830/GQQ9HaPYgDUFHksq Apache OpenOffice Extensions Team --
Re: Fwd: [Apache OpenOffice Extensions] Bud commented on English dictionaries for Apache OpenOffice
Strange... now it is working... Kind regards, Marco A.G.Pinto - On 31/01/2015 11:15, Marco A.G.Pinto wrote: Hello! It seems to be true. I tried to download it too and it says that the file cannot be found. I have just re-uploaded it, but it still doesn't work. Could someone have a look at it? PS-I have to leave to my job soon and can't stay here for long. Thanks! Kind regards, Marco A.G.Pinto -- Forwarded Message Subject: [Apache OpenOffice Extensions] Bud commented on English dictionaries for Apache OpenOffice Date: Sat, 31 Jan 2015 00:15:30 -0800 From: Apache OpenOffice Extensions - Beta aooad...@users.sourceforge.net Reply-To: aooad...@users.sourceforge.net To: marcoagpinto marcoagpi...@mail.telepac.pt Dear Marco A.G.Pinto, The following comment has been posted by Bud on your extension English dictionaries for Apache OpenOffice: once i know how to do something no matter how quirky it is it seems 'incredibly simple to me as this site is for you but so far I have never had good luck just clicking on a link and downloading. I have no problem anywhere else getting files for my windows, ios or android device. I click the link here and it loops back around. there is no file to download this time. your reply is completely worthless. View the comment http://extensions.openoffice.org/en/comment/3830#comment-3830 Delete the comment http://extensions.openoffice.org/en/comment/delete/3830/GQQ9HaPYgDUFHksq Apache OpenOffice Extensions Team -- --
English Dictionaries updated - 2015-02-01
Hello! I have just updated the English Dictionaries: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice In some countries it is already the 1st of February, so I have decided to release it with that date (since all was ready already). Changes: - en_CA (Canadian) (updated: 2015-01-28) - en_GB (British) (updated: 2015-02-01) * - en_US (American) (updated: 2015-01-28) * en_GB has 1559 new words. en_GB now has over 10'000 words since I embraced the project around a year ago. I was able to add to it 1559 words in January because I was home on vacation from one of my jobs. Thanks! Kind regards, Marco A.G.Pinto -- --
Question regarding .AFF in en_GB dictionary
Hello! I was wondering if I should write in the en_GB .AFF the source of the words since I embraced the project a year ago: *# Affix file for British English MySpell dictionary** **# Also suitable as basis for Commonwealth and European English.** **# Built from scratch for MySpell. Released under LGPL.** **#** # Source of wordlist (Marco Pinto): # 1) Oxford Dictionaries; # 2) Collins Dictionary; # 3) Macmillan Dictionary; # 4) Wiktionary (used with caution); # 5) Wikipedia (used with caution); # 6) Physical dictionaries. # **# David Bartlett, Andrew Brown,** **# Kevin Atkinson, Marco A.G.Pinto.** **# R 2.21, 2015-02-XX* Thanks for your time! Kind regards, Marco A.G.Pinto -- --
English Dictionaries updated - 1-JAN-2015
Hello! Just to inform everyone that I have updated the repository. The updated languages are: - en_CA (Canadian) (updated: 2014-11-17) - en_GB (British) (updated: 2015-01-01) - 814 new words - en_US (American) (updated: 2014-11-17) I have also added a link to Proofing Tool GUI's project, which has lots of information and also a link explaining how to install dictionaries: http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html Thanks for your time! Kind regards, Marco A.G.Pinto -- --
English Dictionaries auto-update
Hello! The other day I was on IRC talking with JZA. I believe he asked if we had the English dictionaries auto-updating. Is there a way of making it automatic? What happens is that, every time one clicks in the update link, it jumps to the extension page. Thanks! Kind regards, Marco A.G.Pinto -- --
Hunspell unmunching question
Hello! Around a week ago, Peter from England sent me an e-mail suggesting new words to be added to en_GB. One of them was unsubscribe. Here is what appears in Proofing Tool GUI: The strange thing is that I tried the variants in Mozilla and OpenOffice and none of them was marked as a typo. I started meditating about it and wondered if in Hunspell the prefixes would attach themselves to all suffixes. Today I made a test, please see the archive: https://dl.dropboxusercontent.com/u/30674540/hunspell_issue_marcoagpinto_20141204.zip It contains the extracted wordlists both in PTG and Unmunch and also the .DIC + .AFF I created for the tests. In my PTG 3.0 build 67 I get: *subscribe** **resubscribe** **subscribing** **oversubscribe** **subscribes** **subscribed** **unsubscribe** **000** **subscribe** **unsubscribe** **resubscribe** **subscribing** **oversubscribe** **subscribes** **subscribed** ** *In Unmunch for Linux I got: *subscribe** **subscribing** **subscribed** **subscribes** **resubscribing** **oversubscribing** **unsubscribing** **resubscribed** **oversubscribed** **unsubscribed** **resubscribes** **oversubscribes** **unsubscribes** **resubscribe** **oversubscribe** **unsubscribe** **000** **subscribe** **subscribing** **subscribed** **subscribes** **resubscribing** **oversubscribing** **unsubscribing** **resubscribed** **oversubscribed** **unsubscribed** **resubscribes** **oversubscribes** **unsubscribes** **resubscribe** **oversubscribe** **unsubscribe** ** *I placed a 000 to divide the same word with an exchanged order of the code U to make sure it would produce the same results, no matter its position. What this means is that I probably need to change the code of my tool, maybe create three arrays: 1st - to store the words with suffixes 2nd - to store the codes of the prefixes 3rd - to store 1st plus all its combinations with the prefixes (it would apply prefixes to 1st and store them in 3rd ) Then, I would display the prefixes at the bottom in PTG not following the order of the codes? What this also means is that there are hundreds of combinations not appearing in the wordlist which I always publish in .txt in the GitHub of the project but that are processed by Hunspell in Mozilla (Firefox, Thunderbird and SeaMonkey) and Apache OpenOffice. Thanks for your time! Kind regards, Marco A.G.Pinto -- --
English Dictionaries updated - 2014-12-01
Hello! Just to inform everyone that I have updated the English dictionaries: en_GB - 824 new words. I haven't updated US and CA because Kevin Atkinson couldn't release them today as he had planned. Thanks! Kind regards, Marco A.G.Pinto --- --
English Dictionaries updated - 1-NOV-2014
Hello! Just to inform everyone that I have updated the dictionaries: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice The only change was in en_GB which now has 641 new words. As usual, on 1-DEC-2014 there will be another update (monthly update). I am in contact with Kevin Atkinson (US and CA). Anyone knows how to contact the other two countries' dictionary guys? I would like to update all instead of just the usual GB, US and CA. Thanks! Kind regards, Marco A.G.Pinto -- --
English Dictionaries updated - 1-OCT-2014
Hello! I have just updated the English dictionaries: - en_GB (824 new words) This means that the en_GB now has over 6000 new words since I grabbed it last year. It can be downloaded when AOO triggers it, or from the official page: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice Thanks! Kind regards, Marco A.G.Pinto -- --
Release notes for English dictionaries
Hello! Next week I am going to update the English dictionaries with 700+ new words in the en_GB. I wanted to add release notes to the XML like JZA suggested weeks ago, but I have read the help documents and didn't understand how to create a .txt with them. release-notes src xlink:href=http://extensions.openoffice.org/testarea/desktop/publisher/release-notes_de.txt; lang=de / *src xlink:href=http://extensions.openoffice.org/testarea/desktop/publisher/release-notes_en.txt; lang=en /* /release-notes How do I create such a file? Thanks! Kind regards, Marco A.G.Pinto --- --
Re: Problems downloading English Dictionaries
Hello! I user or two reported in the extension page that they can't download either. Here is what I get every time I press the download button: https://dl.dropboxusercontent.com/u/30674540/aoo-download-english-dics-screenshot.png Not sure if it has to do with Windows 8.1 and Firefox 32. I tried to download from my VM running Ubuntu 14.04 and Firefox 31 and it works okay. Thanks for your time! Kind regards, Marco A.G.Pinto -- On 04/09/2014 08:16, Andrea Pescetti wrote: On 03/09/2014 Marco A.G.Pinto wrote: Today the update finally appeared on OpenOffice 4.1.1. I went to the page to try to download it, and it doesn't download. I can't reproduce this. I simply started OpenOffice, I was notified about the updated extension, and it downloaded and installed normally. Maybe you found a malfunctioning mirror? Regards, Andrea. --
Problems downloading English Dictionaries
Hello! Today the update finally appeared on OpenOffice 4.1.1. I went to the page to try to download it, and it doesn't download. Could someone take a look at it? Thanks! Kind regards, Marco A.G.Pinto - --
.IDX file for Thesaurus
Hello! If you all well remember, my Proofing Tool GUI (PTG) software allows also to edit the Thesaurus by editing .DAT files. But I have peeked into a a few extensions and they also have an .IDX file for the Thesaurus. Does anyone know how the .IDX is generated so that I code it into PTG? Thanks! Kind regards, Marco A.G.Pinto -- --
Re: .IDX file for Thesaurus
Please follow-up to the l10n list? Dear Andrea and team, I sent it to both lists because it is a coding question relating proofing, so I am still not sure which ML is the best. I was looking at the URL and it does have some small source code. Unfortunately I have some problems understanding the code since I don't master the language. Could someone please translate it for me in human language (the algorithm) so that I can code it in PureBasic? I know that I am annoying with so many questions all the time, but I want to build the ultimate proofing tool so that it can be used by everyone and that it serves all needs. Thanks! Kind regards from your brother and friend, Marco A.G.Pinto -- On 01/09/2014 17:26, Andrea Pescetti wrote: Marco A.G.Pinto wrote: But I have peeked into a a few extensions and they also have an .IDX file for the Thesaurus. Does anyone know how the .IDX is generated so that I code it into PTG? With th_gen_idx.pl ; you can find it in, e.g., (randomly found with a search engine) http://wordnet-gaeilge.googlecode.com/svn/trunk/th_gen_idx.pl I'm not sure that repository has a copy of the license, but my copy has the license you find under this message. A note: when writing to more than one list, it's useful to know which one is the preferred one for follow-up (so we can avoid sending too many messages). So for example, in the first line say Please follow-up to the l10n list and we'll know we should post answers only there. Regards, Andrea. $ cat LICENSE_th_gen_idx.txt /* * Copyright 2003 Kevin B. Hendricks, Stratford, Ontario, Canada * And Contributors. All rights reserved. * * Redistribution and use in source and binary forms, with or without * modification, are permitted provided that the following conditions * are met: * * 1. Redistributions of source code must retain the above copyright *notice, this list of conditions and the following disclaimer. * * 2. Redistributions in binary form must reproduce the above copyright *notice, this list of conditions and the following disclaimer in the *documentation and/or other materials provided with the distribution. * * 3. All modifications to the source code must be clearly marked as *such. Binary redistributions based on modified source code *must be clearly marked as modified versions in the documentation *and/or other materials provided with the distribution. * * THIS SOFTWARE IS PROVIDED BY KEVIN B. HENDRICKS AND CONTRIBUTORS * ``AS IS'' AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT * LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS * FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL * KEVIN B. HENDRICKS OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, * INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, * BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; * LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) * HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT * LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY * OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF * SUCH DAMAGE. * */ - To unsubscribe, e-mail: dev-unsubscr...@openoffice.apache.org For additional commands, e-mail: dev-h...@openoffice.apache.org --
English dictionaries updated - 2014-09-01
Hello! Just to inform everyone that I have updated the English dictionaries: - en_US (Kevin) - en_CA (Kevin) - en_GB (Marco) en_GB 2.16 has around 2000 new words since 2.14 (included in the previous extension). You can download it from: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice The GitHub where the wordlists are kept is: https://github.com/marcoagpinto/aoo-mozilla-en-dict My Proofing Tool GUI site project: http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html Thanks for your time! Kind regards, Marco A.G.Pinto -- --
Proofing Tool GUI 3.0 - beta - build 62 - 18-JUL-2014
Hello! I have released build 62 of PTG after months without a release: http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html This version has a better manual and better decoding of complex .AFF rules and even an option to export/count the wordlist of a dictionary. In the past I have been using unmunch for Linux to extract the wordlist of en_GB but recently I found out that it creates 1000s of duplicates. From now on I will use PTG V3+ to extract wordlists but please notice that the en_GB 2.14 wordlist (.txt) is still the one from unmunch. Only next month I will update the dictionary and the wordlist. So far I have added over 800 words since the beginning of the month because I am on vacation until this Friday and I had extra time to add words and fixes. Thanks for your time! Kind regards from your friend, Marco A.G.Pinto --- --
Available Extension Updates doesn't show the most recent English dictionaries
Hello! If you well remember, I have uploaded V2.14 on Monday. I installed an old version in my AOO just to check if it would trigger the most recent one. But, in the updates notification, it always says: 2014-06-01 instead of 2014-07-01. Could someone give a look at it? Thanks! Kind regards, Marco A.G.Pinto --- --
English Dictionaries updated
Hello! Just to inform everyone that I have uploaded the latest English Dictionaries: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice Updated: - en_CA (Canadian) (updated: 2014-05-13) (Kevin) - en_GB (British) (updated: 2014-06-01) (Marco) - en_US (American) (updated: 2014-05-13) (Kevin) Thanks! Kind regards, Marco A.G.Pinto --- --
English dictionaries - beta - 20140601
Hello, I have created a beta .OXT of the English dictionaries: https://dl.dropboxusercontent.com/u/30674540/ApacheOpenOffice41-DicTest-20140601.oxt I have updated: - en_CA (Canadian) (updated: 2014-05-13) - en_GB (British) (updated: 2014-06-01) - The date is 1-JUN because it is feature freeze - en_US (American) (updated: 2014-05-13) If no one finds any issues, on Monday I will release an official version. Thanks! Kind regards, Marco A.G.Pinto --
Re: Proofing Tool GUI 2.3 released - 14.Feb.2014
Thanks, my dear friend Andrea! :-P :-P ;-) :-) On 14/02/2014 13:35, Andrea Pescetti wrote: On 14/02/2014 Marco A.G.Pinto wrote: Also, I want to implement the Hyphenation. Does anyone know how it works, so that I can implement it? Maybe there is documentation somewhere? The tool is called Hyphen and is part of Hunspell. So I think you should start from http://sourceforge.net/projects/hunspell/files/Hyphen/ The syntax of auxiliary files is very, very obscure! Most of the times the hyphenation patterns are not created, but they are converted from the ones available for TeX. The Italian package http://extensions.openoffice.org/project/dict-it should contain README files with links to the TeX sources. Regards, Andrea. --
Extension URL has changed since yesterday?
Hello! I have found out why the page gives forbidden: previous URL: http://extensions.services.openoffice.org/en/project/english-dictionaries-apache-openoffice URL now: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice The "services" vanished in the new URL. Is there a way of fixing it, or will I need to upload a new OXT? Or can Andrea change it personally? The issue is that the link in the Extension Manager will point to the previous one. Thanks! Kind regards, Marco A.G.Pinto - --
Re: Bundled dictionaries and forced extension update for en-US dictionary ???
ature and phonetic suggestion support extension. - See more at: http://extensions.services.openoffice.org/en/project/english-dictionaries-apache-openoffice#sthash.Ny8W0L9C.dpuf English dictionaries for Apache OpenOffice 4.x and later, in a form suitable for inclusion as individual extension. These are the dictionary files bundled with the English version of Apache OpenOffice. It includes: - en_AU (Australian) - en_CA (Canadian) - en_GB (British) (updated on 2014-01-01) - en_US (American) - en_ZA (South African) This is a locally hosted copy of the a href="" class="moz-txt-link-rfc2396E" href="http://extensions.openoffice.org/en/node/3785">"http://extensions.openoffice.org/en/node/3785"English dictionaries with fixed dash handling and new ligature and phonetic suggestion support/a extension. " In this release, only the en_GB has been updated to V2.08. V2.09 or V2.10 will appear shortly after AOO 4.1 is released. I try to release a new version every month or so, with hundreds of new words. The site of my project, so that you know why I have embraced the project is: http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html I spoke yesterday on IRC with JZA (Alexandro Colorado) about some things that need to be improved. I wanted also to update the en_US and add a New Zealand dictionary too, but I need some time to write a big e-mail contacting the right people and planning some things. If you need any more information, just ask :-P Thanks! Kind regards, Marco A.G.Pinto --- On 04/02/2014 12:30, Jrgen Schmidt wrote: Hi, I noticed that since last week I got always an extensions update for the bundled English dictionary when I work/test with a new profile. We bundle 2 dictionaries which seems to be English. I am not sure what the difference is or which one should be used. I am wondering why it is different and why nl and ru doesn't bundle the same as en*, de or it. [ language=en.* || language=de || language=it ] 3c3f841f5d40403bcc4d6bb685a38160 http://sourceforge.net/projects/aoo-extensions/files/17102/1/dict-en.oxt/download "dict-en.oxt" # English (USA, en_US) dictionary [ language==nl || language==ru ] e2eab80772ab1aa09716954219351a80 http://sourceforge.net/projects/aoo-extensions/files/1470/1/en_US.oxt/download "en_US.oxt" But more confusing do I find the fact that the notified update download a dictionary with the name "dict-en-marcoagpinto-20140107.oxt". What is this for an dictionary and is it correct that it uses the same extension identifier than "dict-en.oxt". Why is the name part of the file name? If this is the intended new one I would suggest to update the existing "dict-en.oxt" asap that we bundle this new one. Or if not use a different extension identifier. We should avoid that we bundle an old dictionary with 4.1 and all new users get directly an extension update. Any further information from people who knows more about the details? Juergen PS: follow up on the dev mailing list please - To unsubscribe, e-mail: l10n-unsubscr...@openoffice.apache.org For additional commands, e-mail: l10n-h...@openoffice.apache.org --
Developing an en_US Dictionary - 2014-01-23
Hello! I am the person working on the en_GB (English British) dictionary. Rob contacted me the other day to ask if I also maintained the en_US dictionary (English American). Unfortunately, that dictionary is obfuscated and the solution would be to use the one from Mozilla, just like I did for en_GB, and it would be a huge task since I have already found over 1000 unique words missing/typos/etc. (for en_GB). And I am not sure if the en_US has an explanation .AFF file describing all codes. Official site of en_GB: http://en-gb.pyxidium.co.uk Just look there for the .AFF description. What I would suggest is that someone grabs the en_US from Mozilla and pastes the contents of my .RTF in Thunderbird or OpenOffice to see what words aren't there (they will appear as typos). I have developed a tool for the task (Proofing Tool GUI): http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html My current list of added/removed/fixed words (in Rich Text Format) (Remember it is for en_GB): https://dl.dropboxusercontent.com/u/30674540/en_UK_speller_for_Mozilla%2BAOO%2BLO_2013%2B.rtf Please bear in mind that some words are slightly different in the US such as words that have one L instead of two. If someone takes on the project I will merge the dictionary into the big OXT after talking with Andrea for the details. Thanks! Kind regards, Marco A.G.Pinto -- --
English Dictionaries 2014-01-01
Hello! I pressed the "PUBLISH" button in the "English Dictionaries" but the new version, 2014-01-01, doesn't appear in replacement of the old one. I saw written somewhere in the site: " This extension is currently not published, please contact the system administrator at aooad...@users.sourceforge.net to have it published. This extension is currently not published, please contact the system administrator at aooad...@users.sourceforge.net to have it published." What shall I do? I am sending an e-mail anyway. Thanks! Kind regards, Marco A.G.Pinto - --
Dictionary - en_GB - release it?
Hello! One week has gone by and only one person replied to the testing e-mail, with a "+1". Does this mean I can press the "Publish" button? Also, every month I update the dictionary with around 200-300 new unique words (we are only in half of January and the planned February release already has near 300 new unique words, which means that this month I worked harder). Should I update Apache's dictionary every month or only at a certain pace? I update Mozilla's version every month. Thanks! Kind regards, Marco A.G.Pinto --- --
Dictionary en_GB - 2014-01-01 - Test version
Hello! I am following Andrea's steps to try to release an updated version of the English (British) speller with over 800 unique words added by me (plus several dozens of derived words). This OXT includes all English dictionaries as before, with only the en_GB updated. I was told to place an unpublished version of it and send the link to the DEV ML in order for people to test it for one week and give feedback. If all is fine, it will be published. Please notice that I had some difficulty in getting the direct link, as I pressed the publish button for it to appear and then unpublished it again... not sure if the procedure was 100% the correct one. Here are the download links: Direct link: http://extensions.openoffice.org/en/download/17669 Test ODT #1 (simple): https://dl.dropboxusercontent.com/u/30674540/test_words_speller_20140108.odt Test ODT #2 (large): https://dl.dropboxusercontent.com/u/30674540/test_words_speller_20140107.odt For the reasons why I have decided to embrace this project, please read: http://marcoagpinto.cidadevirtual.pt/en_GB_README.html Happy testing! Kind regards from, Marco A.G.Pinto --- --
Speller - en_GB V2.08 - 7-JAN-2014 - Fixes suggested by Andrea
Hello Andrea, Alexandro Colorado and all, Here are the changes: 1) changelog.txt Added that it was based on the non-obfuscated Mozilla version. 2) README_en_GB.txt Added the text suggested by Andrea, which appeared in the original .txt shipped with his OXT. 3) Created a test ODT to check with the 2013-07-31 and with 2014-01-01 https://dl.dropboxusercontent.com/u/30674540/test_words_speller_20140107.odt 4) Created the OXT with the requested changes https://dl.dropboxusercontent.com/u/30674540/dict-en-marcoagpinto-20140107.oxt PS- I noticed that removing and installing the previous EN OXT and this one, sometimes the typos are not detected. Just if, even if I removed all extensions, it is still using my version. I don't know why this is happening. I think I followed all of Andrea's steps. What shall I do now? Thanks! Kind regards, Marco A.G.Pinto --
Re: en_GB V2.08 - 1-JAN-2014
Hello Andrea and L10N, On 04/01/2014 15:28, Andrea Pescetti wrote: The file README_en_GB.txt is not OK: we need to keep its existing content. Just add your notes above or below the existing content, but do not remove attribution and references (so, to be safe, do not remove anything, just add!). So the current release is not OK, but the fix is easy. Andrea, I grabbed the en_GB dictionary from the Mozilla repository, since the AOO one is obfuscated. So, my "blueprint" is Mozilla's. The text I am using in the README is exactly the same, and I just replaced the: Please let David Bartlett d...@openoffice.org know of any errors that you find. The current release is R 1.18, 11/04/05 With the text you see after the "---" (my own text) I noticed that my OXT became around 500 kB smaller than the old one. Maybe before it was not used maximum ZIP compression? Yes, extracted contents are identical. But with DeflateX the compressed size of the largest file, th_en_US_v2.dat, is 250K smaller in your OXT file, compared to my DeflateN version. Now, this works for OpenOffice but it would be good to see if the ODF standard allows DeflateX as a compression method... Please read http://mail-archives.apache.org/mod_mbox/openoffice-users/201306.mbox/%3ccap-ksoj7ima4hjozqehmn0fb4ryqbqewzso3fnk-i6k62g_...@mail.gmail.com%3E (everything, including quoted messages) and let's try to understand it. I see that the only compression method allowed by the standard is "deflated" (since "stored" is not really a compression). Zipinfo returns "deflated" for both my and your OXT file, so I believe it's OK to use DeflateX, even if this remains a bit unclear. Andrea, while talking to JZA (Alexandro Colorado) last week or so, he told me to test the extension myself since only a few files were replaced. I tested it in my AOO 4.0.1 and all worked okay. All words I added were no longer recognized as typos. Also, may I be given committer rights so that in future updates I can do it myself without bothering everyone? It's not "committer rights", it's ownership of the extension. As I already wrote, please register for the Extensions website http://extensions.openoffice.org/ , send us (or me) your username and I will have the extension reassigned, so that you will be able to make releases. Again, please always test your extension thoroughly and try to align it with the OpenOffice releases to minimize the confusion to users. But we can aim at making an update available in January, to test the new setup. Andrea, I created an account but I am having passwords issues: first I set the username to marcoagpinto, then, in the e-mail I received it said: "Marco A.G.Pinto" and when I clicked in the link I received it doesn't allow me to change the password, even if I did try to reset it. I don't know what I am doing wrong. Thanks for your help! Kind regards, Marco A.G.Pinto --- --
Proofing Tool GUI 2.2 released - 1-JAN-2014
Hello I have released V2.2 of Proofing Tool GUI with the date of tomorrow (1.JAN.2014). Mainly, I have fixed a few bugs decoding words and added the "Check for Updates" feature. I have updated the wiki. The tool's official page is: http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html Notice that I now have Ubuntu x64 in an SD Card on my Netbook and Ubuntu x86 on a VM, which means I can compile the code myself for both platforms. Kind regards, Marco A.G.Pinto --- --
en_GB speller - V2.07 - released - 27-NOV-2013
Hello my dear ones, I have released an update for my forked en_GB speller. Can Andrea pack an extension with it for AOO and LO? File: https://dl.dropboxusercontent.com/u/30674540/Dictionary_en_GB_marcoagpinto_20131127.zip Thanks! Kind regards, Marco A.G.Pinto --- LIST OF IMPROVEMENTS: V2.07 - 27-NOV-2013 547) newsfeed 548) telerecording 549) televiewer 550) televiewing 551) radiogenic (+ally) 552) sRGB 553) RGB 554) tech (Oxford - informal) 555) infotech 556) bandolero (+plural) 557) honeytrap 558) gangboard 559) gangsta (Oxford - black slang) 560) bottlescrew 561) shrimper (+plural) 562) longline 563) prawner (merged into prawn) 564) wanna (Oxford - informal - want to; want a) 565) Wankie 566) wanky (Oxford - British vulgar slang) 567) kbyte (+plural) 568) maxwell (if in lower case it is a physics measure) 569) bushelful (+plural) 570) kilometric (merged into kilometre) 571) fibreless (merged into fibre) 572) midline 573) inbox 574) linkbait 575) pseudorandom 576) pseudorandomly (merged into pseudorandom) 577) pseudoscience 578) pseudoscientific 579) pseudoscientist 580) parapsychological 581) parapsychologically (merged into parapsychological) 582) parapsychic 583) psychometrist (merged into psychometry) 584) psychometrician 585) demonology (plural + demonological + demonologist) 586) vampirism 587) quickstepping (merged into quickstep) 588) quickstepped (merged into quickstep) 589) Solomonic (merged into Solomon) 590) millenary (+plural) 591) Floral 592) Amazonia 593) timeout 594) undelete (+ed +ing) 595) screenable (merged into screen) 596) screenful (merged into screen) 597) preselection 598) preselective 599) techie (+plural - Oxford: informal) 600) techy (Oxford: informal) 601) chargrill (+ed) 602) technicals (added plural) 603) knockout 604) speedball 605) decametre (+ic) 606) decigram 607) decigramme 608) magnifico (Oxford: informal) 609) magnificoes (Oxford: informal) 610) postpositive (+vely) 611) intentionalism 612) grottoed (merged into grotto) 613) preadolescent 614) preambular 615) summariness (merged into summary) 616) summational 617) summative 618) Zeuxis (Oxford: Greek painter) 619) zettabyte 620) rollerball 621) autoroute 622) backstreets (plural) 623) downfold 624) masterdom 625) masterhood 626) masterless (merged into master) 627) zinced (merged into zinc) 628) cobaltic (merged into cobalt) 629) cobaltous 630) combust 631) combustor 632) multithreaded 633) multithreading 634) netizen (+plural - Oxford: informal) 635) Photoshop (Oxford: +s +ing +ed) 636) rangefinder 637) microcapsule 638) colonoscope 639) colonoscopy (+plural) 640) hypnotherapist (+plural) 641) abductee (+plural) 642) ET (+plural) 643) Harzan (name of person) 644) Denise (name of person) 645) celeb (+plural - Oxford: informal) 646) CEO (added +s and 's) 647) CNN (added 's) 648) scats (merged plural into scat) 649) synchronicity 650) Sananda (name of person) 651) micropyle 652) spermatozoid (+plural) --
Possible bug in en_GB .AFF file?
Hello! I am getting ready to release tonight the en_GB speller forked by me. I am confused because the code O appears both as SFX and PFX: *PFX O Y 1** **PFX O 0 non- . * *SFX O Y 12** **SFX O 0 l a ** **SFX O e al [^bcgv]e ** **SFX O e ial [bcgv]e ** **SFX O 0 ial [bcrx] ** **SFX O um al um ** **SFX O 0 al [^u]m ** **SFX O y al ty ** **SFX O y ial [^t]y ** **SFX O 0 ual [px]t ** **SFX O 0 tal [iu]t ** **SFX O 0 al [^ipux]t ** **SFX O 0 al [^aebcrtxmy]* Is this a bug in the .AFF? Which one should be used? Thanks! Kind regards, Marco A.G.Pinto
Speller en_GB - V2.06 - 1.Nov.2013
Hello! I know it is not 1-NOV-2013 yet in Portugal, but it is in other parts of the world. I think V2.06 of the speller is ready for an official release. It has a total of 500+ words with many fixes/removals/improvements over the original (outdated) versions. I was wondering if Mozilla could make it the official British speller, since the old ones haven't been maintained properly or the developers have given up on them: https://addons.mozilla.org/en-US/firefox/addon/british-english-dictionary-2/ Also, I talked with Andrea from Apache and it seems he can pack it and make it the official one for OpenOffice: https://dl.dropboxusercontent.com/u/30674540/Dictionary_en_GB_marcoagpinto_20131101.zip Andrea and friends, could you also make it available to LibreOffice? You have my word that as long as I am alive I will continue improving the dictionary since I want to help the community. Thanks! Kind regards, Marco A.G.Pinto --- LIST OF CHANGES: V2.00 - 25-AUG-2013 --- Added the words: 1) Thunderbird 2) Firefox 3) LibreOffice 4) Ubuntu 5) Amiga (from Commodore) 6) mIRC 7) Facebook 8) ChanServ 9) NickServ 10) IRC 11) PhD (+ plural) 12) Ph.D. 13) screenshot (+ plural) 14) PowerPC 15) internship (+ plural) 16) D-day 17) Wikipedia 18) committer (+ plural) 19) anymore 20) GitHub 21) Blu-ray (+ plural) 22) airplane (+ plural) 23) unsubscribe V2.01 - 31-AUG-2013 --- Added the words: 24) cyberterrorist (+plural) 25) cyberterrorism 26) DVD-RW 27) DVD-RAM 28) DVD+RW 29) cyberattack (+plural) 30) encyclopedia (+plural) 31) spyware 32) adware 33) malware 34) Trojan (First letter capitalized - Oxford Dictionary) 35) Freeware 36) rootkit (+plural) 37) keylogger (+plural) 38) keylogging 39) subprogram (+plural) 40) webcam (+plural) 41) antivirus 42) Coca-Cola 43) fuckable 44) fuckability 45) NSA 46) GnuPG 47) melancholiac 48) scammer (+plural) 49) XML 50) movie (+plural) (Oxford Dictionary says it is chiefly North American) 51) YouTube 52) cyberwar (+plural) V2.02 - 4-SEP-2013 --- Added the words: 53) aargh 54) ABC (+plural) 55) admin 56) ADSL 57) JPEG 58) megapixel (+plural) 59) PDF 60) HDD 61) FPU 62) boing 63) google 64) Google (capitalized-company) 65) IDE 66) fallback 67) re-establish 68) MUFON 69) transborder (+plural) 70) leveraging 71) geek (+plural) 72) bestseller (+plural) 73) playlist (+plural +ing) 74) PowerPoint 75) DALnet 76) meetup (Oxford Dictionary says it is chiefly North American) 77) traineeship (Fix - it was only inserted in plural) 78) phishing 79) phisher (+plural) 80) botnet (+plural) 81) guestbook 82) widescreen (fix - it was inserted wide-screen) 83) shortcut (+plural) 84) retrocession 85) hardcover (Oxford Dictionary says it is chiefly North American) 86) Bryon (name of person) 87) Aleister (name of person) 88) teleport 89) teleportation 90) etheric 91) telepath 92) trustable 93) contactee (+plural) 94) chakra (+plural) 95) Adamski (name of person) 96) diapers (Added plural) 97) Pleiades (place in the universe) 98) PGP 99) lineman 100) linemen 101) videography 102) videographer 103) HDMI 104) mentoring 105) mentored (merged to mentor+ed) 106) anytime 107) skywatch (+er +ers +er's) 108) statewide 109) caseload 110) experiencer (+plural) 111) experienceable 112) blog (+plural) 113) blogging 114) blogged 115) blogger 116) crimeware 117) multisession 118) cyberstalking 119) cyberstalker (+plural) 120) PCMCIA 121) failover 122) placeholder (+plural) 123) showstopper (+plural) 124) showstopping 125) API 126) SeaMonkey 127) markup 128) watchdogging 129) watchdogged 130) cockapoo 131) Elohim (name) 132) undine (+plural) 133) Plutonian 134) technostructure 135) uncommercial 136) uncompetitively 137) uncompetitiveness 138) uncomplicatedly 139) uncomplicatedness 140) graphitic 141) blacklead 142) ufological 143) UEFA V2.03 - 13-SEP-2013 --- 144) Merged uncompetitiveness and uncompetitively into uncompetitive. 145) e-book (+plural) 146) e-business 147) e-card (+plural) 148) e-crime (+plural) 149) e-criminal (plural) 150) infomediary 151) infomercial (Oxford Dictionary says it is chiefly North American) 152) infonaut 153) SQL 154) cybercafe 155) cyberbullying 156) cyberbully 157) microcontroller 158) microcircuitry 159) PDFs (plural) 160) AGP 161) BIOS (computer) 162) bio (+plural) - (Oxford Dictionary-informal-a biography) 163) biofuel 164) biodata 165) biodefence 166) biodiesel 167) bioconversion 168) biocomputer 169) biocomputing 170) bioclimatic 171) biochip 172) dpi 173) BitTorrent 174) spellcheck (+ er) 175) GPS 176) GIF 177) MPEG 178) NGO 179) NTSC 180) psychotomimetic 181) OLED 182) biocircuit 183) subwoofer 184) stereophonically 185) stereophony 186) subclinical 187) electro-oculography 188) electro-oculographic 189) electrochemistry 190) electrochemist 191) triplexed (added +ed to triplex) 192) sextodecimo (+plural) 193) thermography 194) thermographic 195) thermogram (+plural) 196
en_GB speller for AOO/LO/Mozilla
Hello! I have been working hard on the forked en_GB speller and it should now include 500+ new words (including derived ones). I have already released six versions of it for Mozilla (Firefox, Thunderbird, SeaMonkey): https://addons.mozilla.org/en-US/firefox/addon/british-english-dictionary-2 I have been working on V2.06, to which I have added dozens of new words and fixes/improvements. It will be released in November since the Mozilla guys at the IRC channel said I am always releasing new versions and that I should try to release only one per month. After I have it ready, in November, I will also release it for AOO+LO since the files are the same. I just need to change the README file to include AOO and LO. In Mozilla I only used the .DIC and the .AFF and for AOO/LO we just need to include the Hyphenation file, which I didn't change, and the Thesaurus we can leave the same. Andrea, if I well recall, you said it is you who packs the extensions? Also, in November, how do I make LibreOffice also use it? I don't know who to contact there or if there is some kind of common repository for dictionaries. Thanks my dear friends! Kind regards, Marco A.G.Pinto --
Fixed the decoding of words in my tool
Hello! Thank you all for the advices. I have been able to improve the source code of my linguistic tool and it now decodes multiple conditions. This means I will proceed adding en_GB words to the speller and now with a better preview of each code. I will also release a new version of my tool soon, after I enhance some things, such as the ability to check for updates on-line. I am also going to buy a new pen drive and try to install there Ubuntu x64 so that I don't have to annoy Andrew Ferguson every time I want to compile a 64-bit version. Thank you all! Kind regards, Marco A.G.Pinto --- - To unsubscribe, e-mail: l10n-unsubscr...@openoffice.apache.org For additional commands, e-mail: l10n-h...@openoffice.apache.org
Re: Introduction (Regarding the .AFF file again)
Hello! Sorry for bringing this old subject, but I have been facing some dificulties improving the en_GB dictionary (at the moment for Mozilla). I have posted in the Mozilla ML: Hello! It seems the en_GB .aff file has some bugs. I have been testing it in the field. for example, in the suffix "ing": "modeling" suggests "modelling" (with two "l"s) The "G" code is a mess in the original .aff file. As you can see, it uses nonsense rules like: [^aeio][aeiou]b Could someone suggest a fix?: SFX G Y 24 SFX G e ing [^eioy]e SFX G 0 ing [eoy]e SFX G ie ying ie SFX G 0 bing [^aeio][aeiou]b SFX G 0 king [^aeio][aeiou]c SFX G 0 ding [^aeio][aeiou]d SFX G 0 fing [^aeio][aeiou]f SFX G 0 ging [^aeio][aeiou]g SFX G 0 king [^aeio][aeiou]k SFX G 0 ling [^aeio][eiou]l SFX G 0 ing [aeio][eiou]l SFX G 0 ling [^aeo]al SFX G 0 ing [aeo]al SFX G 0 ming [^aeio][aeiou]m SFX G 0 ning [^aeio][aeiou]n SFX G 0 ping [^aeio][aeiou]p SFX G 0 ring [^aeio][aeiou]r SFX G 0 sing [^aeio][aeiou]s SFX G 0 ting [^aeio][aeiou]t SFX G 0 ving [^aeio][aeiou]v SFX G 0 zing [^aeio][aeiou]z SFX G 0 ing [aeio][aeiou][bcdfgkmnprstvz] SFX G 0 ing [^aeiou][bcdfgklmnprstvz] SFX G 0 ing [^ebcdfgklmnprstvz] Someone replied to this rule: [^aeio][aeiou]b I'm not really familiar with the details of the .aff file, but I assume this is a regular-_expression_-like sequence, meaning that the letter before the "b" must be one of "aeiou", and the letter before THAT must be something OTHER than "aeio" (because this is a negated set, [^aeio]). So the two character sets in this rule correspond to the TWO letters preceding the "b"; they're not conflicting requirements for the SAME letter. Can Ricardo or someone help? Thanks! Kind regards, Marco A.G.Pinto --- On 13/08/2013 11:11, Ricardo Palomares Martnez wrote: El 13/08/13 10:30, Marco A.G.Pinto escribi: Hello! Thanks for your suggestion Andrea, Please, don't duplicate threads. People has a life and may need more than an hour to reply you. If in doubt whether your message has reached the list, either configure to subscription to get your own posts (and then don't use Gmail, as it eats them), or check the list archive: http://mail-archives.apache.org/mod_mbox/openoffice-l10n/201308.mbox/browser but I haven't been able to find in the archive any file with the 80 examples. It is right there, inside the /tests folder of the .tar.gz Andrea pointed you to. Each test seems to be made of several files which share the same name but end with different extensions (it is the first time I see them, too): - aff: a sample affixes file for the test - dic: a sample dic file for the test - sug: a file with the list of expected suggestions for Hunspell to offer - test: a Unix shell file to launch the test - wrong: a list of badly written words so Hunspell, using the test dictionary, offers you suggestions (that should match the list in the .sug file). Depending on the test, this file could not exist - good: a list of correctly written words so Hunspell, using the test dictionary and related affixes file, shouldn't complain for incorrect words. Keep in mind that the tests are intended to test Hunspell itself, so if anyone makes changes to the Hunspell source code and wants to be sure his/her changes don't break Hunspell, he/she can run all tests and check that the modified version still passes all the tests. Put it in another way: if you're creating a tool able to parse dic and aff files and derivate all the words, it should pass all these tests. So, I have edited a word from the en_GB dictionary of Thunderbird and I am posting here the PFX/SFX codes in the hope that someone can explain how they work. Please see the image here which has basic coding and shows wrong results: http://i.imgur.com/JEIxOmv.png Here is the .AFF code for the word: *abdicate/DNGSn* Please notice that this word only has suffixes. Please give an example with prefixes too, if it doesn't give much trouble. (...) I've learned by example while editing the dictionaries, so I may be wrong. Also, the
Re: Proofing Tool GUI V2.1 - 20130923 - Released
Hello Rob, I don't know why the dictionaries are not the same, but I would like to unify them. This way I would work in a "master" dictionary for both Mozilla and AOO/LO. I was just wondering if someone could upload somewhere a list of all the words from the AOO en_GB dictionary? JZA and I tried to unmunch the "obfuscated" .dic and .aff but it resulted in a corrupt filelist. What a challenge it was, I even installed Debian in my VM, but I messed up the installation and then used Ubuntu. Thanks! Kind regards, Marco A.G.Pinto --- On 27/09/2013 20:19, Rob Weir wrote: I have been using it to edit the Dictionary of Mozilla (en_GB) [2]. I had the crazy idea of unifying AOO and Mozilla en_GB dictionary so that I could update both at the same time, but, first I must merge part of AOO's into Mozilla's. That would be a good idea. Is there any good reason why the dictionaries are not the same today? -Rob --
Re: Introduction (Regarding the .AFF file again)
Thanks, Ricardo. :-) Kind regards, Marco A.G.Pinto --- On 27/09/2013 20:53, Ricardo Palomares Martnez wrote: The answer is right, or at least I think the same. I don't really know so much English grammar (or maybe I don't have enough vocabulary on the top of my head) to be able to review that suffix. If you do, you could try to find sample words for each line and see if it has sense. For instance, the word "come" applies for the very first line: SFX G e ing [^eioy]e ("come" ends with "e" and the letter before it is not an "e", "i", "o" or "y") and it translates to remove the ending "e" (- "com") and add "ing" (- "coming"), which is correct. As the "ing" is used to build the gerund verb form, you should try to find verbs complying each condition. You may try at: http://www.wordreference.com/ using an "*" followed by the intended ending. For instance, I didn't know any verb ending in "oe" (the second line), but I wrote "*oe" and I found the verb "hoe". HTH --
Re: en_GB dictionary for AOO
Hello Stuart, I visited the site but the dictionary there is outdated, this because the .OXT one from AOO has been updated in 2013 (or 2010). There is a difference of about 10 000 words from the V1.20 in his site, from the new one. Isn't there someone who could uncompress the current dictionary and send it to me? Maybe the tool used to compress also allows uncompress. Who are the persons in charge of it? Thanks! Kind regards, Marco A.G.Pinto On 02/09/2013 13:13, Stuart Swales wrote: Marco, A fully commented list of the affixes used in the en-GB dictionary is available from David Bartlett's site: http://en-gb.pyxidium.co.uk/dictionary/OOo.php Stuart Swale (en-GB UI Help) On 31/08/2013 14:19, Marco A.G.Pinto wrote: Andrea, I already have that .OXT file. But the dictionary uses "compression". In other words, instead of a "normal" code after each word, it uses numbers that represent a set of codes. Can't someone convert it back to a plain file? Otherwise I won't be able to edit it. Thanks! Kind regards, Marco A.G.Pinto On 31/08/2013 14:07, Andrea Pescetti wrote: Marco A.G.Pinto wrote: So, I am offering myself to improve AOO's dictionary as well. The big issue with it is that it is compressed, so my tool doesn't work normally with it. Could someone place download links, for uncompressed versions of it?: - en_GB.aff - en_GB.dic The dict-en.oxt file you download from http://extensions.openoffice.org/en/projectrelease/english-dictionaries-apache-openoffice-20130731 must simply be unzipped. You will find the en_GB.* files within the archive. Regards, Andrea. -- --
en_GB dictionary for AOO
Hello I have been working on the en_GB dictionary for Thunderbird, SeaMonkey and Firefox [1]. I have decided to paste all the words I have added so far (V2.00 + WIP V2.01) in AOO and a lot of them aren't recognized. Even very simple basic words such as "antivirus" aren't recognized. So, I am offering myself to improve AOO's dictionary as well. The big issue with it is that it is compressed, so my tool doesn't work normally with it. Could someone place download links, for uncompressed versions of it?: - en_GB.aff - en_GB.dic [1] https://addons.mozilla.org/en-US/firefox/addon/british-english-dictionary-2 Thanks! Kind regards, Marco A.G.Pinto --
Preview of Proofing Tool GUI V2.1 (beta - build 31)
Hello! I have been working hard on my tool. Last night I was up to around 4am trying to fix a few bugs in the Prefix/Suffix procedures. I am sharing in my Dropbox the latest source + binaries. Please notice that there is still a pause in the Linux version after opening a large dictionary/thesaurus. Also, the Linux version was compiled using PureBasic 5.20 beta 10 and the Windows versions with PureBasic 5.11. There are still some improvements and optimisations to be done before releasing and official V2.1 . Maybe it will take a few more weeks or so. For example, in the derived words I want to make possible to jump to the line of the .AFF file if we click in a word. This is still an improvement I have in mind. Here are the links: - SOURCE CODE: https://dl.dropboxusercontent.com/u/30674540/ProofingToolGUI_V0031.zip - BINARY (WINDOWS AND LINUX) https://dl.dropboxusercontent.com/u/30674540/ProofingToolGUI_V21beta-build31_WinLin.zip If you find any bugs or have suggestions, please tell me. Thanks! Kind regards from, Marco A.G.Pinto --- --
Re: Introduction
Andrea and Ricardo My dear friends I have finally been able to code the suffix code into my tool, thanks to the trouble Ricardo had of explaining the rules to me. However, I still haven't coded the prefix code because I would have to think backwards. Is there a word or two which you could tell me from the English dictionary that uses prefixes? That way I would have a way to test while decoding it. Ricardo, just one little question: in my last mail I had the rules described such as: SFX D Y 25 You told me that if instead of a "Y" we would have a "N", it would only use that one. Does that mean I should first look for a "N" and if found it would only use it, and ignore all the "Y"s? I am not sure if the en_GB has any "N" as I haven't looked for it yet. Thanks! Kind regards, Marco A.G.Pinto --- On 14/08/2013 18:07, Andrea Pescetti wrote: On 13/08/2013 Ricardo Palomares Martnez wrote: It is right there, inside the /tests folder of the .tar.gz Andrea pointed you to. Each test seems to be made of several files I confirm those are the files I mentioned. I've learned by example while editing the dictionaries, so I may be wrong. Also, there are some keywords in the aff test files that I don't recognize. Same for me, and thank you Ricardo for the very comprehensive explanations. I'm a Hunspell user rather than a developer, so I learnt what I need for dictionary maintenance and I look at more advanced stuff as needed. Good luck with your software. To be honest, I was thinking of writting a Java program for the aff/dic part, but I'm lazy and I thought the first thing to do would be to port Hunspell to Java, which is a huge task for me In Marco's case, he only needs to expand words, so rather than writing a full Hunspell replacement he needs to replicate the functionality of the "unmunch" tool (a tool that takes the stem, say "love", and computes "loved", "loves", "loving" and other derivatives) that can be found in the same source package. Regards, Andrea. --
Examples for derived words (again)
Hello! Thanks for your suggestion Andrea, but I haven't been able to find in the archive any file with the 80 examples. So, I have edited a word from the en_GB dictionary of Thunderbird and I am posting here the PFX/SFX codes in the hope that someone can explain how they work. Please see the image here which has basic coding and shows wrong results: http://i.imgur.com/JEIxOmv.png Here is the .AFF code for the word: abdicate/DNGSn Please notice that this word only has suffixes. Please give an example with prefixes too, if it doesn't give much trouble. D: SFX D Y 25 SFX D 0 d [^e]e SFX D e d ee SFX D 0 bed [^aeio][aeiou]b SFX D 0 ked [^aeio][aeiou]c SFX D 0 ded [^aeio][aeiou]d SFX D 0 fed [^aeio][aeiou]f SFX D 0 ged [^aeio][aeiou]g SFX D 0 ked [^aeio][aeiou]k SFX D 0 led [^aeio][eiou]l SFX D 0 ed [aeio][eiou]l SFX D 0 led [^aeo]al SFX D 0 ed [aeo]al SFX D 0 med [^aeio][aeiou]m SFX D 0 ned [^aeio][aeiou]n SFX D 0 ped [^aeio][aeiou]p SFX D 0 red [^aeio][aeiou]r SFX D 0 sed [^aeio][aeiou]s SFX D 0 ted [^aeio][aeiou]t SFX D 0 ved [^aeio][aeiou]v SFX D 0 zed [^aeio][aeiou]z SFX D y ied [^aeiou]y SFX D 0 ed [aeiou]y SFX D 0 ed [aeio][aeiou][bcdfgkmnprstvz] SFX D 0 ed [^aeiou][bcdfgklmnprstvz] SFX D 0 ed [^ebcdfgklmnprstvyz] N: SFX N Y 26 SFX N b ption b SFX N d sion d SFX N be ption be SFX N e tion ce SFX N de sion de SFX N ke cation ke SFX N e ption ume SFX N e mation [^u]me SFX N e ion [^o]se SFX N e ition ose SFX N e ation [iou]te SFX N e ion [^iou]te SFX N e ation [^bcdkmst]e SFX N el ulsion el SFX N 0 lation [aiou]l SFX N 0 ation [^aeiou]l SFX N 0 mation [aeiou]m SFX N 0 ation [^aeiou]m SFX N er ration er SFX N 0 ation [^e]r SFX N 0 ion [sx] SFX N t ssion mit SFX N 0 ion [^m]it SFX N 0 ation [^i]t SFX N y ication y SFX N 0 ation [^bdelmrstxy] G: SFX G Y 24 SFX G e ing [^eioy]e SFX G 0 ing [eoy]e SFX G ie ying ie SFX G 0 bing [^aeio][aeiou]b SFX G 0 king [^aeio][aeiou]c SFX G 0 ding [^aeio][aeiou]d SFX G 0 fing [^aeio][aeiou]f SFX G 0 ging [^aeio][aeiou]g SFX G 0 king [^aeio][aeiou]k SFX G 0 ling [^aeio][eiou]l SFX G 0 ing [aeio][eiou]l SFX G 0 ling [^aeo]al SFX G 0 ing [aeo]al SFX G 0 ming [^aeio][aeiou]m SFX G 0 ning [^aeio][aeiou]n SFX G 0 ping [^aeio][aeiou]p SFX G 0 ring [^aeio][aeiou]r SFX G 0 sing [^aeio][aeiou]s SFX G 0 ting [^aeio][aeiou]t SFX G 0 ving [^aeio][aeiou]v SFX G 0 zing [^aeio][aeiou]z SFX G 0 ing [aeio][aeiou][bcdfgkmnprstvz] SFX G 0 ing [^aeiou][bcdfgklmnprstvz] SFX G 0 ing [^ebcdfgklmnprstvz] S: SFX S Y 9 SFX S y ies [^aeiou]y SFX S 0 s [aeiou]y SFX S 0 es [sxz] SFX S 0 es [cs]h SFX S 0 s [^cs]h SFX S 0 s [ae]u SFX S 0 x [ae]u SFX S 0 s [^ae]u SFX S 0 s [^hsuxyz] n: SFX n Y 28 SFX n 0 tion a SFX n e tion ce SFX n ke cation ke SFX n e ation [iou]te SFX n e ion [^iou]te SFX n e ation [^ckt]e SFX n el ulsion el SFX n 0 lation [aiou]l SFX n 0 ation [^aeiou]l SFX n er ration er SFX n 0 ation [^e]r SFX n y ation py SFX n y ication [^p]y SFX n 0 ation [^aelry] SFX n 0 tions a SFX n e tions ce SFX n ke cations ke SFX n e ations [iou]te SFX n e ions [^iou]te SFX n e ations [^ckt]e SFX n el ulsions el SFX n 0 lations [aiou]l SFX n 0 ations [^aeiou]l SFX n er rations er SFX n 0 ations [^e]r SFX n y ations py SFX n y ications [^p]y SFX n 0 ations [^aelry] Thanks! Kind regards, Marco A.G.Pinto --- --
Re: Introduction
Hello! Thanks for your suggestion Andrea, but I haven't been able to find in the archive any file with the 80 examples. So, I have edited a word from the en_GB dictionary of Thunderbird and I am posting here the PFX/SFX codes in the hope that someone can explain how they work. Please see the image here which has basic coding and shows wrong results: http://i.imgur.com/JEIxOmv.png Here is the .AFF code for the word: abdicate/DNGSn Please notice that this word only has suffixes. Please give an example with prefixes too, if it doesn't give much trouble. D: SFX D Y 25 SFX D 0 d [^e]e SFX D e d ee SFX D 0 bed [^aeio][aeiou]b SFX D 0 ked [^aeio][aeiou]c SFX D 0 ded [^aeio][aeiou]d SFX D 0 fed [^aeio][aeiou]f SFX D 0 ged [^aeio][aeiou]g SFX D 0 ked [^aeio][aeiou]k SFX D 0 led [^aeio][eiou]l SFX D 0 ed [aeio][eiou]l SFX D 0 led [^aeo]al SFX D 0 ed [aeo]al SFX D 0 med [^aeio][aeiou]m SFX D 0 ned [^aeio][aeiou]n SFX D 0 ped [^aeio][aeiou]p SFX D 0 red [^aeio][aeiou]r SFX D 0 sed [^aeio][aeiou]s SFX D 0 ted [^aeio][aeiou]t SFX D 0 ved [^aeio][aeiou]v SFX D 0 zed [^aeio][aeiou]z SFX D y ied [^aeiou]y SFX D 0 ed [aeiou]y SFX D 0 ed [aeio][aeiou][bcdfgkmnprstvz] SFX D 0 ed [^aeiou][bcdfgklmnprstvz] SFX D 0 ed [^ebcdfgklmnprstvyz] N: SFX N Y 26 SFX N b ption b SFX N d sion d SFX N be ption be SFX N e tion ce SFX N de sion de SFX N ke cation ke SFX N e ption ume SFX N e mation [^u]me SFX N e ion [^o]se SFX N e ition ose SFX N e ation [iou]te SFX N e ion [^iou]te SFX N e ation [^bcdkmst]e SFX N el ulsion el SFX N 0 lation [aiou]l SFX N 0 ation [^aeiou]l SFX N 0 mation [aeiou]m SFX N 0 ation [^aeiou]m SFX N er ration er SFX N 0 ation [^e]r SFX N 0 ion [sx] SFX N t ssion mit SFX N 0 ion [^m]it SFX N 0 ation [^i]t SFX N y ication y SFX N 0 ation [^bdelmrstxy] G: SFX G Y 24 SFX G e ing [^eioy]e SFX G 0 ing [eoy]e SFX G ie ying ie SFX G 0 bing [^aeio][aeiou]b SFX G 0 king [^aeio][aeiou]c SFX G 0 ding [^aeio][aeiou]d SFX G 0 fing [^aeio][aeiou]f SFX G 0 ging [^aeio][aeiou]g SFX G 0 king [^aeio][aeiou]k SFX G 0 ling [^aeio][eiou]l SFX G 0 ing [aeio][eiou]l SFX G 0 ling [^aeo]al SFX G 0 ing [aeo]al SFX G 0 ming [^aeio][aeiou]m SFX G 0 ning [^aeio][aeiou]n SFX G 0 ping [^aeio][aeiou]p SFX G 0 ring [^aeio][aeiou]r SFX G 0 sing [^aeio][aeiou]s SFX G 0 ting [^aeio][aeiou]t SFX G 0 ving [^aeio][aeiou]v SFX G 0 zing [^aeio][aeiou]z SFX G 0 ing [aeio][aeiou][bcdfgkmnprstvz] SFX G 0 ing [^aeiou][bcdfgklmnprstvz] SFX G 0 ing [^ebcdfgklmnprstvz] S: SFX S Y 9 SFX S y ies [^aeiou]y SFX S 0 s [aeiou]y SFX S 0 es [sxz] SFX S 0 es [cs]h SFX S 0 s [^cs]h SFX S 0 s [ae]u SFX S 0 x [ae]u SFX S 0 s [^ae]u SFX S 0 s [^hsuxyz] n: SFX n Y 28 SFX n 0 tion a SFX n e tion ce SFX n ke cation ke SFX n e ation [iou]te SFX n e ion [^iou]te SFX n e ation [^ckt]e SFX n el ulsion el SFX n 0 lation [aiou]l SFX n 0 ation [^aeiou]l SFX n er ration er SFX n 0 ation [^e]r SFX n y ation py SFX n y ication [^p]y SFX n 0 ation [^aelry] SFX n 0 tions a SFX n e tions ce SFX n ke cations ke SFX n e ations [iou]te SFX n e ions [^iou]te SFX n e ations [^ckt]e SFX n el ulsions el SFX n 0 lations [aiou]l SFX n 0 ations [^aeiou]l SFX n er rations er SFX n 0 ations [^e]r SFX n y ations py SFX n y ications [^p]y SFX n 0 ations [^aelry] Thanks! Kind regards, Marco A.G.Pinto --- On 11/08/2013 22:28, Andrea Pescetti wrote: On 11/08/2013 Marco A.G.Pinto wrote: Hunspell source distribution contains more than 80 examples for option usage." Can someone send me the text with the 80 examples? It's the "tests" folder in the sources .tar.gz file: http://downloads.sourceforge.net/hunspell/hunspell-1.3.2.tar.gz Regards, Andrea. - To unsubscribe, e-mail: l10n-unsubscr...@openof
Re: Introduction
Thanks, Andrea. I have stared the message and tomorrow I will dig into it. Kind regards from your friend, Marco A.G.Pinto --- On 11/08/2013 22:28, Andrea Pescetti wrote: On 11/08/2013 Marco A.G.Pinto wrote: Hunspell source distribution contains more than 80 examples for option usage." Can someone send me the text with the 80 examples? It's the "tests" folder in the sources .tar.gz file: http://downloads.sourceforge.net/hunspell/hunspell-1.3.2.tar.gz Regards, Andrea. --
Introduction
Hello! I am Marco Pinto from Portugal. Since the 1990s that I have dedicated my free time to several projects, both translating and coding and also beta testing. A few years ago I decided to take part of the OpenOffice.org project, so I unofficially sent e-mails to the persons in charge of the pt_PT suggesting words for the dictionary (speller) and thesaurus and also reported typos I found. Recently the Dic+The+Hyph of pt_PT is owned by Minho University in Portugal and I am in contact via e-mail with them. I wanted to do more, so I started developing my tool, Proofing Tool GUI [1] [2], and I sent an e-mail to the Minho University guys saying that I wanted to help improve the Thesaurus and I have already added several words using my tool. Nothing official though... As I improve my tool (I am currently working on the derived words showing like Andrea Pescetti and other persons suggested) I also wanted to get more involved in projects such as Firefox and Thunderbird. In Mozilla channels (IRC) I offered myself to improve the speller of en_GB since it is full of typos and missing words, and they told me to fork it. I tried to contact Lucas twice, who was the person making the dictionary compatible with latest releases of Thunderbird but I never got a reply. Also, regarding my tool, I need help concerning the decoding of words. I went to Hunspell on Source Forge and downloaded the three PDFs. The third one, hunspell4.pdf, says on page 2: " GENERAL OPTIONS Hunspell source distribution contains more than 80 examples for option usage." Can someone send me the text with the 80 examples? Thanks! Kind regards, Marco A.G.Pinto --- [1] (V2.0) http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html [2] (V2.1 being coded) http://i.imgur.com/yrUnvax.jpg --
Proofing Tool GUI V2.0 - Released
Hello! I have released V2.0 of my tool. I have edited the Dictionary Wiki page inserting the information and link. The date is 1-JUL-2013 but, since the tool has been ready already, I released it one day before (although in some parts of the world it is already Monday). The site for my tool is: http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html I believe it now works with all Linuxes, even though there is a speed issue in Linux while opening the spellers and thesaurus. In Windows it works very well though. Thanks for your time! Kind regards, Marco A.G.Pinto --- --
Proofing Tool GUI V1.0 - Released
Hello! I have created and uploaded a Web page containing the manual, source and downloads for Proofing Tool GUI V1.0. The date there is 2-JUN-2013 but I have everything ready and, in some parts of the world, it is already that date. Site: http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html This tool will allow to edit the Thesaurus of OpenOffice and LibreOffice. Later I want it to edit also the Dictionary and Hyphenation to make it compatible also with Firefox and Thunderbird, since their EN-UK dictionary is bad and no one volunteered to make a better version. If you like the tool, maybe you could place it in the official AOO page of tools. Thanks! Kind regards, Marco A.G.Pinto --- --